Speech Recognition module in Python

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

pip

108 9,264 9.8 Python

The Python package installer

% pip install --global-option='build_ext' --global-option='-I/opt/homebrew/include/portaudio.h' --global-option='-L/opt/homebrew/Cellar/portaudio/19.7.0/include/portaudio.h' pyaudio WARNING: Implying --no-binary=:all: due to the presence of --build-option / --global-option / --install-option. Consider using --config-settings for more flexibility. DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 Collecting pyaudio Using cached PyAudio-0.2.12.tar.gz (42 kB) Installing build dependencies ... done Getting requirements to build wheel ... done Preparing metadata (pyproject.toml) ... done Building wheels for collected packages: pyaudio WARNING: Ignoring --global-option when building pyaudio using PEP 517 Building wheel for pyaudio (pyproject.toml) ... error error: subprocess-exited-with-error

LiveWhisper

2 283 0.0 Python

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

I've run into this EXACT SAME problem, and ended up creating my own SpeechRecognition alternative, using sounddevice (which unlike pyaudio IS compatible with my Linux Mint's audio drivers), and OpenAI's Whisper model.. Cause that was my only option, other than risking messing up my audio drivers.. heh

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
whisper

343 59,916 6.8 Python

Robust Speech Recognition via Large-Scale Weak Supervision

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project