amazon-transcribe-output-word-document vs AutoSub
Metric | amazon-transcribe-output-word-document | AutoSub
---|---|---
Mentions | 2 | 2
Stars | 44 | 556
Growth | - | -
Activity | 1.8 | 4.1
Last commit | about 2 years ago | 4 months ago
Language | Python | Python
License | - | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
amazon-transcribe-output-word-document
- Transcript to word docs
- AWS - NLP newsletter - 2021. Aug.
  Amazon Transcribe Call Analytics is a new machine learning (ML) powered conversation insights API that enables developers to improve customer experience and agent productivity. This API can analyze call recordings to generate turn-by-turn call transcripts and actionable insights for understanding customer-agent interactions, identifying trending issues, and tracking performance metrics. Launch content: AWS News Blog, What's New Post, Webpage, Documentation, GitHub Demo, LinkedIn.
AutoSub
- Does there exist a free as in freedom solution to apply speech-to-text recognition to a video to get subtitles?
  There's an interesting project here https://github.com/abhirooptalasila/AutoSub which uses FFmpeg & DeepSpeech. YMMV around the accuracy.
- Need to create subtitles
  Check autosub
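The posts above describe going from recognized speech to subtitles. As a rough illustration of the last step only, here is a minimal sketch that renders timed text segments as SubRip (.srt) output. It is not AutoSub's actual code: the `(start, end, text)` segment tuples and both function names are hypothetical, and in a real pipeline the segments would come from a speech recognizer run on audio extracted from the video.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Render (start_seconds, end_seconds, text) tuples as SRT text.

    The segment format is an assumption made for this sketch.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)


if __name__ == "__main__":
    # Hypothetical recognizer output, for illustration only.
    demo = [(0.0, 1.25, "hello there"), (1.5, 3.0, "welcome back")]
    print(to_srt(demo))
```

Keeping timestamp formatting separate from rendering makes it easy to swap in a different subtitle format later without touching the recognition step.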
What are some alternatives?
aws-lambda-docker-serverless-inference - Serve scikit-learn, XGBoost, TensorFlow, and PyTorch models with AWS Lambda container images support.
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
kalliope - Kalliope is a framework that will help you to create your own personal assistant.
ffsubsync - Automagically synchronize subtitles with video.
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
DeepSpeech-Italian-Model - Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
amazon-transcribe-post-call-analytics
netflix-to-srt - Rip, extract and convert subtitles to .srt closed captions from .xml/dfxp/ttml and .vtt/WebVTT (e.g. Netflix, YouTube)
SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.
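ffmpeg - A Python video-processing package based on FFmpeg. Because of the pandemic, work has been busy and I have not been in the mood to look further into video, so this project has been shelved. The source code is very simple; feel free to study and extend it yourself.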
cleanvid - cleanvid is a little script to mute profanity in video files
cheetah - On-device streaming speech-to-text engine powered by deep learning