Python Whisper

Open-source Python projects categorized as Whisper

Top 23 Python Whisper Projects

  • PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

    Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    PaddlePaddle/PaddleSpeech

  • buzz

    Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

    Project mention: MacWhisper: Transcribe audio files on your Mac | news.ycombinator.com | 2023-08-23

  • whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

    Project mention: FLaNK 15 Jan 2024 | dev.to | 2024-01-15
  • faster-whisper

    Faster Whisper transcription with CTranslate2

    Project mention: Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX | news.ycombinator.com | 2023-12-13

    Could someone explain how this is accomplished, and whether there is any quality disparity compared to the original Whisper?

    Repos like https://github.com/SYSTRAN/faster-whisper make immediate sense in terms of why they are faster than the original, but this one, not so much, especially considering it's much faster still.

  • distil-whisper

    Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

    Project mention: FLaNK Stack 05 Feb 2024 | dev.to | 2024-02-05
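
The "within 1% word error rate" figure in the distil-whisper description refers to WER, which is conventionally the word-level edit distance between a reference transcript and a model transcript, divided by the number of reference words. The following is a minimal sketch of that metric, not the evaluation code used by the project; the function name `word_error_rate` is an assumption for illustration, and real evaluations usually normalize casing and punctuation first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

For example, `word_error_rate("the cat sat on the mat", "the cat sat on a mat")` counts one substitution over six reference words.
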
  • FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

    Project mention: FunASR: Fundamental End-to-End Speech Recognition Toolkit | news.ycombinator.com | 2024-01-13
  • inference

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

    Project mention: GreptimeAI + Xinference - Efficient Deployment and Monitoring of Your LLM Applications | dev.to | 2024-01-24

    Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications. It provides a RESTful API compatible with OpenAI API, Python SDK, CLI, and WebUI. Furthermore, it integrates third-party developer tools like LangChain, LlamaIndex, and Dify, facilitating model integration and development.

  • whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Project mention: AI-assisted removal of filler words from video recordings | dev.to | 2023-11-01

    whisper-timestamped, which is a layer on top of the Whisper set of models enabling us to get accurate word timestamps and include filler words in transcription output. This transcriber downloads the selected Whisper model to the machine running the demo and no third-party API keys are required.
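
Word-level timestamps are what make filler-word removal possible: once each word has a start and end time, the spans covered by fillers can be cut and the rest kept. The sketch below illustrates that idea only. It assumes each word is a dict with `text`, `start`, and `end` keys (an assumed shape, so check the actual transcriber output), and the `FILLERS` set and `keep_intervals` name are hypothetical.

```python
FILLERS = {"um", "uh", "er", "hmm"}  # illustrative list, not exhaustive


def keep_intervals(words, duration, fillers=FILLERS):
    """Return (start, end) spans of the recording that contain no filler word.

    words    -- list of dicts with "text", "start", "end" (assumed shape),
                sorted by start time
    duration -- total length of the recording in seconds
    """
    kept = []
    cursor = 0.0  # start of the span currently being kept
    for word in words:
        token = word["text"].strip(" .,!?").lower()
        if token in fillers:
            # Close the kept span just before the filler, if it is non-empty.
            if word["start"] > cursor:
                kept.append((cursor, word["start"]))
            # Resume keeping audio after the filler ends.
            cursor = max(cursor, word["end"])
    if cursor < duration:
        kept.append((cursor, duration))
    return kept
```

The returned spans could then be handed to a video or audio editing tool to stitch the kept sections together.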

  • yt-whisper

    Using OpenAI's Whisper to automatically generate YouTube subtitles

  • auto-subtitle

    Automatically generate and overlay subtitles for any video.

    Project mention: Built this app to generate subtitles, summaries, and chapters for videos, all self-hostable with a single Docker image | /r/selfhosted | 2023-03-28

    Have a look at this repo; it generates subtitles with Whisper locally.

  • subsai

    🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️

    Project mention: Porting CP/M to the Brother SuperPowerNote Z80 laptop thing [video] | news.ycombinator.com | 2023-12-13

    Adding Whisper subtitles was really easy and they're dramatically better than the automatic Google ones (I did it via https://github.com/abdeladim-s/subsai, which was really easy to use). So there is now a reasonably good transcript available in the video comments.
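
The subtitle tools listed above (yt-whisper, auto-subtitle, subsai) all turn timestamped transcript segments into a subtitle file. The following is a rough sketch of that last step for the SubRip (SRT) format, not code taken from any of these projects. It assumes each segment is a dict with `start`, `end` (seconds), and `text` keys; the helper names `srt_timestamp` and `to_srt` are hypothetical.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments) -> str:
    """Render segments (dicts with "start", "end", "text") as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}"
        blocks.append(f"{index}\n{time_range}\n{seg['text'].strip()}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

Writing the returned string to a `.srt` file next to the video is usually enough for a player to pick it up.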

  • WhisperLive

    A nearly-live implementation of OpenAI's Whisper.

    Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29

    Everything runs locally, we use:

    - WhisperLive for the transcription - https://github.com/collabora/WhisperLive

  • whisper.api

    This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

    Project mention: Do you know any quality FastAPI starter projects? | /r/flask | 2023-10-10
  • truss

    The simplest way to serve AI/ML models in production (by basetenlabs)

  • whisper-playground

    Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

  • whisper-ctranslate2

    Whisper command line client compatible with original OpenAI client based on CTranslate2.

    Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12

    Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.

    https://github.com/Softcatala/whisper-ctranslate2

  • whisper-standalone-win

    Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

    Project mention: Question : is this a movie only tracker? | /r/Karagarga | 2023-07-03

    On the other hand, if you need subtitles for a movie that doesn't have any, there are some automated solutions like Whisper that can do a very decent job in most cases: https://github.com/Purfview/whisper-standalone-win

  • AI-Waifu-Vtuber

    AI Vtuber for Streaming on Youtube/Twitch

    Project mention: AI VTUBER | /r/VirtualYoutubers | 2023-04-04
  • agentchain

    Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

    Project mention: Chain together LLMs for reasoning and orchestrate multiple large models for accomplishing complex tasks like phoning someone using a GPT-4 model | /r/Python | 2023-03-15
  • pyannote-whisper

    Project mention: Summarization of long transcriptions | /r/LocalLLaMA | 2023-07-18

    These will be 3-5 hour recordings of 4-5 people. I plan to use https://github.com/yinruiqing/pyannote-whisper to generate the transcript from the recording.

  • ChatFred

    Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.

    Project mention: What is your favorite mac app that you just discover in first half of 2023? | /r/macapps | 2023-07-01
  • Speech-Translate

    A real-time speech transcription and translation application using OpenAI's Whisper and free translation APIs. Interface made using Tkinter. Code written fully in Python.

  • LiveWhisper

    A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-05.

Index

What are some of the best open-source Whisper projects in Python? This list will help you:

#   Project                 Stars
1   PaddleSpeech            9,736
2   buzz                    8,897
3   whisperX                7,965
4   faster-whisper          7,402
5   distil-whisper          2,865
6   FunASR                  2,508
7   inference               1,773
8   whisper-timestamped     1,330
9   yt-whisper              1,270
10  auto-subtitle           1,034
11  subsai                    926
12  WhisperLive               832
13  whisper.api               820
14  truss                     788
15  whisper-playground        738
16  whisper-ctranslate2       668
17  whisper-standalone-win    588
18  AI-Waifu-Vtuber           581
19  agentchain                555
20  pyannote-whisper          372
21  ChatFred                  358
22  Speech-Translate          328
23  LiveWhisper               268