Speech Recognition module in Python

This page summarizes the projects mentioned and recommended in the original post on /r/Python

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • pip

    The Python package installer

  • % pip install --global-option='build_ext' --global-option='-I/opt/homebrew/include/portaudio.h' --global-option='-L/opt/homebrew/Cellar/portaudio/19.7.0/include/portaudio.h' pyaudio WARNING: Implying --no-binary=:all: due to the presence of --build-option / --global-option / --install-option. Consider using --config-settings for more flexibility. DEPRECATION: --no-binary currently disables reading from the cache of locally built wheels. In the future --no-binary will not influence the wheel cache. pip 23.1 will enforce this behaviour change. A possible replacement is to use the --no-cache-dir option. You can use the flag --use-feature=no-binary-enable-wheel-cache to test the upcoming behaviour. Discussion can be found at https://github.com/pypa/pip/issues/11453 Collecting pyaudio Using cached PyAudio-0.2.12.tar.gz (42 kB) Installing build dependencies ... done Getting requirements to build wheel ... done Preparing metadata (pyproject.toml) ... done Building wheels for collected packages: pyaudio WARNING: Ignoring --global-option when building pyaudio using PEP 517 Building wheel for pyaudio (pyproject.toml) ... error error: subprocess-exited-with-error

  • LiveWhisper

    A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

  • I've run into this EXACT SAME problem, and ended up creating my own SpeechRecognition alternative, using sounddevice (which unlike pyaudio IS compatible with my Linux Mint's audio drivers), and OpenAI's Whisper model.. Cause that was my only option, other than risking messing up my audio drivers.. heh

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts