Python Speech Data

Open-source Python projects categorized as Speech Data

Top 9 Python Speech Data Projects

  • SpeechRecognition

    Speech recognition module for Python, supporting several engines and APIs, online and offline.

    Project mention: Unpopular Opinion: a lot of Obsidian community make Obsidian sound like something cringey/productivity guru-y | reddit.com/r/ObsidianMD | 2023-05-14

    This is the library: https://github.com/Uberi/speech_recognition

  • aeneas

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

    Project mention: Anyone know of a tool to align (existing) subtitles to audio along sentence boundaries? | reddit.com/r/LanguageTechnology | 2023-02-03

    You could try aeneas. Syncabook apparently uses the afaligner library, which says that it was inspired by aeneas but uses FastDTW to find an approximation to the optimal warping path. This might make it slightly less accurate than aeneas.

  • CodiumAI

    TestGPT | Generating meaningful tests for busy devs. Get non-trivial tests (and trivial, too!) suggested right inside your IDE, so you can code smart, create more value, and stay confident when you push.

  • Watson Developer Cloud Python SDK

    :snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud

  • speechpy

    :speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

  • Prosodylab-Aligner

    Python interface for forced audio alignment using HTK and SoX

  • praatIO

    A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).

  • pyAcoustics

    A collection of python scripts for extracting and analyzing acoustics from audio files.

  • ONLYOFFICE

    ONLYOFFICE Docs — document collaboration in your environment. Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises

  • ProMo

    Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.

  • pysle

    Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-05-14.

Python Speech Data related posts

Index

What are some of the best open-source Speech Data projects in Python? This list will help you:

Project Stars
1 SpeechRecognition 7,207
2 aeneas 2,190
3 Watson Developer Cloud Python SDK 1,428
4 speechpy 877
5 Prosodylab-Aligner 302
6 praatIO 248
7 pyAcoustics 75
8 ProMo 72
9 pysle 41
Write Clean Python Code. Always.
Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
www.sonarsource.com