Top 14 automatic-speech-recognition Open-Source Projects
-
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
-
STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
-
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
-
Auto-Subtitled-Video-Generator
Input a YouTube video link or upload a video file and get a video with subtitles.
-
go-subgen
Automatically generate subtitles for your media using whisper.cpp via webhooks with support for Radarr & Sonarr
-
werpy
🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
wenet-e2e/wenet
Project mention: Rest in Peas: The Unrecognized Death of Speech Recognition (2010) | news.ycombinator.com | 2023-05-04
What has happened since then? I know Common Voice has come and gone https://en.wikipedia.org/wiki/Common_Voice https://github.com/coqui-ai/STT
And I've seen some neural approaches too
No idea where to look for comparisons though.
Project mention: How I converted a podcast into a knowledge base using Orama search and OpenAI whisper and Astro | dev.to | 2023-05-23
Source: https://colab.research.google.com/github/ArthurFDLR/whisper-youtube/blob/main/whisper_youtube.ipynb
Project mention: Self hosted call transcribing and searchable text solution | /r/selfhosted | 2023-06-09
you might be able to use go-subgen (it's my project) if you're willing/able to trigger it via http posts. There might also be a whisper frontend that can do filesystem watching that could work better, but I don't know of any off the top of my head.
automatic-speech-recognition related posts
- Hungarian YouTube subtitler v.1.1 (free Colab)
- Hungarian and English subtitle demo on a Telex video (whisper-ctranslate2)
- Numen - FOSS voice control for handsfree computing
- I've built a few tools on top of GPT-3.5 (text generation, q&a with embeddings). AMA about resources and AI dev stacks for building with OpenAI's APIs
- I've built an Auto Subtitled Video Generator using Streamlit and OpenAI Whisper, hosted on HuggingFace spaces. All you have to do is input a YouTube video link and get a video with subtitles (alongside with .txt, .vtt, .srt files).
- [Project] I've built an Auto Subtitled Video Generator using Streamlit and OpenAI Whisper, hosted on HuggingFace spaces.
- Introducing Whisper
Index
What are some of the best open-source automatic-speech-recognition projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | wenet | 3,691 |
2 | awesome-speech-recognition-speech-synthesis-papers | 2,870 |
3 | STT | 2,131 |
4 | whisper-asr-webservice | 1,617 |
5 | cheetah | 552 |
6 | leopard | 406 |
7 | whisper-youtube | 319 |
8 | soxan | 218 |
9 | FAST-RIR | 137 |
10 | Auto-Subtitled-Video-Generator | 65 |
11 | go-subgen | 52 |
12 | ovos-stt-plugin-vosk | 14 |
13 | werpy | 9 |
14 | joureka-app | 5 |