How to convert Speech-to-Text with Python?

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

wav2letter

3 6,331 4.5 C++

Facebook AI Research's Automatic Speech Recognition Toolkit

Flashlight is a fast, flexible machine learning library written entirely in C++ from the Facebook AI Research Speech team and the creators of Torch and Deep Speech. Flashlight's ASR application (formerly the wav2letter project) provides training and inference capabilities for end-to-end speech recognition systems. This engine is really performant but you will need to compile all the C++ libraries before using it with Python.

examples

142 7,742 6.2 Jupyter Notebook

TensorFlow examples (by tensorflow)

DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project