Get non-trivial tests (and trivial, too!) suggested right inside your IDE, so you can code smart, create more value, and stay confident when you push. Learn more →
Top 9 Python Speech Data Projects
-
SpeechRecognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Project mention: Unpopular Opinion: a lot of Obsidian community make Obsidian sound like something cringey/productivity guru-y | reddit.com/r/ObsidianMD | 2023-05-14This is the library: https://github.com/Uberi/speech_recognition
-
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Project mention: Anyone know of a tool to align (existing) subtitles to audio along sentence boundaries? | reddit.com/r/LanguageTechnology | 2023-02-03You could try aeneas. Syncabook apparently uses the afaligner library, which says that it was inspired by aeneas but uses FastDTW to find an approximation to the optimal warping path. This might make it slightly less accurate than aeneas.
-
CodiumAI
TestGPT | Generating meaningful tests for busy devs. Get non-trivial tests (and trivial, too!) suggested right inside your IDE, so you can code smart, create more value, and stay confident when you push.
-
Watson Developer Cloud Python SDK
:snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud
-
speechpy
:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
-
-
praatIO
A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).
-
-
ONLYOFFICE
ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises
-
ProMo
Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.
-
pysle
Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.
Python Speech Data related posts
- Unpopular Opinion: a lot of Obsidian community make Obsidian sound like something cringey/productivity guru-y
- Nvim-VoiceRec : Add Speech-To-Text To Neovim! (useful for gpt)
- Speech-to-text software
- Anyone know of a tool to align (existing) subtitles to audio along sentence boundaries?
- Voice commands in Doom Eternal possible?
- Need help with speech recognition
- Wiki for the podcast
-
A note from our sponsor - CodiumAI
codium.ai | 31 May 2023
Index
What are some of the best open-source Speech Data projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | SpeechRecognition | 7,207 |
2 | aeneas | 2,190 |
3 | Watson Developer Cloud Python SDK | 1,428 |
4 | speechpy | 877 |
5 | Prosodylab-Aligner | 302 |
6 | praatIO | 248 |
7 | pyAcoustics | 75 |
8 | ProMo | 72 |
9 | pysle | 41 |