Python Whisper

Open-source Python projects categorized as Whisper

Top 23 Python Whisper Projects

  • PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

  • Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    PaddlePaddle/PaddleSpeech

  • buzz

    Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

  • Project mention: Buzz: Transcribe and translate audio offline on your personal computer | news.ycombinator.com | 2024-03-21
  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

  • Project mention: Easy video transcription and subtitling with Whisper, FFmpeg, and Python | news.ycombinator.com | 2024-04-06

    It uses this, which does support diarization: https://github.com/m-bain/whisperX

  • faster-whisper

    Faster Whisper transcription with CTranslate2

  • Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

    For our real-time STT needs, we'll employ a fantastic library called faster-whisper.

  • distil-whisper

    Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

  • Project mention: FLaNK Stack 05 Feb 2024 | dev.to | 2024-02-05
  • FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。

  • Project mention: FunASR: Fundamental End-to-End Speech Recognition Toolkit | news.ycombinator.com | 2024-01-13
  • chatgpt-telegram-bot

    🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python (by n3d1117)

  • Project mention: Are you selfhosting a ChatGPT alternative? | /r/selfhosted | 2023-06-09
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • inference

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

  • Project mention: GreptimeAI + Xinference - Efficient Deployment and Monitoring of Your LLM Applications | dev.to | 2024-01-24

    Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications. It provides a RESTful API compatible with OpenAI API, Python SDK, CLI, and WebUI. Furthermore, it integrates third-party developer tools like LangChain, LlamaIndex, and Dify, facilitating model integration and development.

  • whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

  • Project mention: Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old | news.ycombinator.com | 2024-02-28

    Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]

    [0] https://github.com/linto-ai/whisper-timestamped

  • yt-whisper

    Using OpenAI's Whisper to automatically generate YouTube subtitles

  • auto-subtitle

    Automatically generate and overlay subtitles for any video.

  • WhisperLive

    A nearly-live implementation of OpenAI's Whisper.

  • Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29

    Everything runs locally, we use:

    - WhisperLive for the transcription - https://github.com/collabora/WhisperLive

  • subsai

    🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️

  • Project mention: Porting CP/M to the Brother SuperPowerNote Z80 laptop thing [video] | news.ycombinator.com | 2023-12-13

    Adding Whisper subtitles was really easy and they're dramatically better than the automatic Google ones (I did it via https://github.com/abdeladim-s/subsai, which was really easy to use). So there is now a reasonably good transcript available in the video comments.

  • whisper.api

    This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

  • Project mention: Do you know any quality FastAPI starter projects? | /r/flask | 2023-10-10
  • truss

    The simplest way to serve AI/ML models in production (by basetenlabs)

  • whisper-standalone-win

    Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

  • Project mention: Question : is this a movie only tracker? | /r/Karagarga | 2023-07-03

    On the other hand, if you need subtitles for a movie that doesn't have some. There are some automated solutions like Whisper that can do a very decent job in most cases : https://github.com/Purfview/whisper-standalone-win

  • whisper-playground

    Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

  • whisper-ctranslate2

    Whisper command line client compatible with original OpenAI client based on CTranslate2.

  • Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12

    Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.

    https://github.com/Softcatala/whisper-ctranslate2

  • Whisper-WebUI

    A Web UI for easy subtitle using whisper model.

  • AI-Waifu-Vtuber

    AI Vtuber for Streaming on Youtube/Twitch

  • whisper_mic

    Project that allows one to use a microphone with OpenAI whisper.

  • agentchain

    Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

  • pyannote-whisper

  • Project mention: Summarization of long transcriptions | /r/LocalLLaMA | 2023-07-18

    These will be 3-5 hour recordings of 4-5 people. I plan to use https://github.com/yinruiqing/pyannote-whisper to generate the transcript from the recording.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Whisper related posts

Index

What are some of the best open-source Whisper projects in Python? This list will help you:

Project Stars
1 PaddleSpeech 10,120
2 buzz 9,869
3 whisperX 8,965
4 faster-whisper 8,723
5 distil-whisper 3,125
6 FunASR 3,299
7 chatgpt-telegram-bot 2,686
8 inference 2,512
9 whisper-timestamped 1,501
10 yt-whisper 1,313
11 auto-subtitle 1,173
12 WhisperLive 1,180
13 subsai 1,051
14 whisper.api 838
15 truss 833
16 whisper-standalone-win 757
17 whisper-playground 759
18 whisper-ctranslate2 743
19 Whisper-WebUI 670
20 AI-Waifu-Vtuber 626
21 whisper_mic 615
22 agentchain 563
23 pyannote-whisper 414

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com