WhisperLive vs whisper-writer

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

WhisperLive		whisper-writer
	Project
4	Mentions	2
1,253	Stars	188
17.0%	Growth	-
9.4	Activity	6.6
8 days ago	Latest Commit	9 days ago
Python	Language	Python
MIT License	License	GNU General Public License v3.0 only

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

WhisperLive

Posts with mentions or reviews of WhisperLive. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-29.

Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot
7 projects | news.ycombinator.com | 29 Jan 2024

Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper
9 projects | news.ycombinator.com | 17 Jan 2024

Check out WhisperLive: https://github.com/collabora/WhisperLive
If you're grappling with the slow march from cool tech demos to real-world language model apps, you might wanna check out WhisperLive. It's this rad open-source project that’s all about leveraging Whisper models for slick live transcription. Think real-time, on-the-fly translated captions for those global meetups. It's a neat example of practical, user-focused tech in action. Dive into the details on their GitHub page
Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX
10 projects | news.ycombinator.com | 13 Dec 2023

https://github.com/collabora/WhisperLive
The is another one that uses huggingface's implementation, but I haven't tried it since my spec doesn't support flash-att2
Triple Threat: The Power of Transcription, Summary, and Translation
1 project | news.ycombinator.com | 3 Aug 2023

Curious to see how this works? Check out our demo page - https://col.la/transcription to generate your own transcription, summary, and translation, or use our browser extension - https://github.com/collabora/WhisperLive to get live transcriptions.

whisper-writer

Posts with mentions or reviews of whisper-writer. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-08.

Show HN: WhisperWriter – Speech-to-text using OpenAI's Whisper, coded by ChatGPT
2 projects | news.ycombinator.com | 8 May 2023
Using ChatGPT to generate a GPT project end-to-end
4 projects | news.ycombinator.com | 6 May 2023

I've also made six small apps completely coded by ChatGPT (with GitHub Copilot contributing a bit as well). Here are the two largest:
PlaylistGPT (https://github.com/savbell/playlist-gpt): A fun little web app that allows you to ask questions about your Spotify playlists and receive answers from Python code generated by OpenAI's models. I even added a feature where if the code written by GPT runs into errors, it can send the code and the error back to the model and ask it to fix it. It actually can debug itself quite often! One of the most impressive things for me was how it was able to model the UI after the Spotify app with little more than me asking it to do exactly that.
WhisperWriter (https://github.com/savbell/whisper-writer): A small speech-to-text app that uses OpenAI's Whisper API to auto-transcribe recordings from a user's microphone. It waits for a keyboard shortcut to be pressed, then records from the user's microphone until it detects a pause in their speech, and then types out the Whisper transcription to the active window. It only took me two hours to get a working prototype up and running, with additions such as graphic indicators taking a few more hours to implement.
I created the first for fun and the second to help me overcome a disability that impacts my ability to use a keyboard. I now use WhisperWriter literally every day (I'm even typing part of this comment with it), and I used it to prompt ChatGPT to write the code for a few additional personal projects that improve my quality-of-life in small ways. If people are interested, I may write up more about the prompting and pair programming process, since I definitely learned a lot as I worked through these, including some similar lessons to the article!
Personally, I am super excited about the possibilities these AI technologies open up for people like me, who may be facing small challenges that could be easily solved with a tiny app written in a few hours tailored specifically to their problem. I had been struggling to use my desktop computer because the Windows Dictation tool was very broken for me, but now I feel like I can use it to my full capacity again because I can type with WhisperWriter. Coding now takes a minimal amount of keyboard use thanks to these AI coding assistants -- and I am super grateful for that!

What are some alternatives?

When comparing WhisperLive and whisper-writer you can also consider the following projects:

cog-whisper-diarization - Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

kaldi-active-grammar - Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

obs-zoom-and-follow - Dynamic zoom and mouse tracking script for OBS Studio

AI-Waifu-Vtuber - AI Vtuber for Streaming on Youtube/Twitch

gpt_chatbot - This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows

playlist-gpt - 🎶👩‍💻 A fun little web app that analyzes your Spotify playlists with help from OpenAI's language models.

whisper_streaming - Whisper realtime streaming for long speech-to-text transcription and translation

easy-chat - A ChatGPT UI for young readers, written by ChatGPT

gpt-voice-conversation-chatbot - Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.

whisper-openai-gradio-implementation - Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation

WhisperFusion - WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.

shorthanddictation - Dictation program, which uses the reading speed unit syllables per minute

WhisperLive vs cog-whisper-diarization whisper-writer vs kaldi-active-grammar WhisperLive vs obs-zoom-and-follow whisper-writer vs AI-Waifu-Vtuber WhisperLive vs gpt_chatbot whisper-writer vs playlist-gpt WhisperLive vs whisper_streaming whisper-writer vs easy-chat WhisperLive vs gpt-voice-conversation-chatbot whisper-writer vs whisper-openai-gradio-implementation WhisperLive vs WhisperFusion whisper-writer vs shorthanddictation

Compare WhisperLive vs whisper-writer and see what are their differences.

WhisperLive

whisper-writer

WhisperLive

whisper-writer

What are some alternatives?