WhisperLive
LiveWhisper
WhisperLive | LiveWhisper | |
---|---|---|
4 | 2 | |
1,253 | 293 | |
17.0% | - | |
9.4 | 0.0 | |
8 days ago | 5 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
WhisperLive
-
Show HN: WhisperFusion β Ultra-low latency conversations with an AI chatbot
Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
-
WhisperSpeech β An Open Source text-to-speech system built by inverting Whisper
Check out WhisperLive: https://github.com/collabora/WhisperLive
If you're grappling with the slow march from cool tech demos to real-world language model apps, you might wanna check out WhisperLive. It's this rad open-source project thatβs all about leveraging Whisper models for slick live transcription. Think real-time, on-the-fly translated captions for those global meetups. It's a neat example of practical, user-focused tech in action. Dive into the details on their GitHub page
-
Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX
https://github.com/collabora/WhisperLive
The is another one that uses huggingface's implementation, but I haven't tried it since my spec doesn't support flash-att2
-
Triple Threat: The Power of Transcription, Summary, and Translation
Curious to see how this works? Check out our demo page - https://col.la/transcription to generate your own transcription, summary, and translation, or use our browser extension - https://github.com/collabora/WhisperLive to get live transcriptions.
LiveWhisper
-
Speech Recognition module in Python
I've run into this EXACT SAME problem, and ended up creating my own SpeechRecognition alternative, using sounddevice (which unlike pyaudio IS compatible with my Linux Mint's audio drivers), and OpenAI's Whisper model.. Cause that was my only option, other than risking messing up my audio drivers.. heh
-
How to install and deploy OpenAI Whisper with Python
If anyone's interested, I took a wack at making Whisper transcribe semi-live, to the terminal: https://github.com/Nikorasu/LiveWhisper
What are some alternatives?
cog-whisper-diarization - Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
whisper-openai-gradio-implementation - Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation
whisper-writer - π¬π A small dictation app using OpenAI's Whisper speech recognition model.
whisper-standalone-win - Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
obs-zoom-and-follow - Dynamic zoom and mouse tracking script for OBS Studio
web-whisper - OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
gpt_chatbot - This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows
SwiftWhisper - π€ The easiest way to transcribe audio in Swift
whisper_streaming - Whisper realtime streaming for long speech-to-text transcription and translation
FlorenceBot - A fully interactive domain-specific chatbot implemented using Prolog and PySwip.
gpt-voice-conversation-chatbot - Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
whisper-subtitles-webui - A gradio interface for making transcribed and translated subtitles for videos