awesome-speech-recognition-speech-synthesis-papers
timit
Metric | awesome-speech-recognition-speech-synthesis-papers | timit
---|---|---
Mentions | - | 1
Stars | 2,870 | 273
Growth | - | -
Activity | 3.5 | 0.0
Last commit | 6 months ago | about 2 years ago
License | MIT License | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-speech-recognition-speech-synthesis-papers
We haven't tracked posts mentioning awesome-speech-recognition-speech-synthesis-papers yet.
Tracking mentions began in Dec 2020.
timit
Hey folks, I need a bunch of phoneme data
I used the DARPA TIMIT dataset for my undergraduate capstone! I think this is the link, but it’s been a while. Otherwise you could search for that name. https://github.com/philipperemy/timit
What are some alternatives?
pytorch-seq2seq - Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
text-to-speech-ubuntu - 🙊 Setup "selectable" text to speech / TTS on Ubuntu Linux 24.04, 22.04, 22.10, 23.04, 23.10. Ideal for speed reading, programming, editing and writing.
CodeSearchNet - Datasets, tools, and benchmarks for representation learning of code.
MockingBird - 🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
lingvo - Lingvo
web-whisper - OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
Best_AI_paper_2020 - A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
Awesome-Efficient-LLM - A curated list for Efficient Large Language Models
nnsvs-english-support - The Original Support for English NNSVS Dataset Creation
deepspeech-playbook - A crash course for training speech recognition models using DeepSpeech.
Awesome-Video-Diffusion - A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
OpenSeq2Seq - Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP