WhisperFusion vs WhisperLive

WhisperFusion

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI. (by collabora)

Suggest topics

Source Code

Suggest alternative

Edit details

WhisperLive

A nearly-live implementation of OpenAI's Whisper. (by collabora)

dictation obs openai text-to-speech Translation voice-recognition Whisper

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

WhisperFusion		WhisperLive
	Project
3	Mentions	4
1,390	Stars	1,223
3.0%	Growth	15.0%
8.7	Activity	9.4
about 2 months ago	Latest Commit	6 days ago
Python	Language	Python
-	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

WhisperFusion

Posts with mentions or reviews of WhisperFusion. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-05.

FLaNK Stack 05 Feb 2024
49 projects | dev.to | 5 Feb 2024
Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot
7 projects | news.ycombinator.com | 29 Jan 2024
WhisperFusion: Ultra-low latency conversations with an AI chatbot
2 projects | news.ycombinator.com | 25 Jan 2024

WhisperFusion is fully open-source - https://github.com/collabora/WhisperFusion

WhisperLive

Posts with mentions or reviews of WhisperLive. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-29.

Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot
7 projects | news.ycombinator.com | 29 Jan 2024

Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper
9 projects | news.ycombinator.com | 17 Jan 2024

Check out WhisperLive: https://github.com/collabora/WhisperLive
If you're grappling with the slow march from cool tech demos to real-world language model apps, you might wanna check out WhisperLive. It's this rad open-source project that’s all about leveraging Whisper models for slick live transcription. Think real-time, on-the-fly translated captions for those global meetups. It's a neat example of practical, user-focused tech in action. Dive into the details on their GitHub page
Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX
10 projects | news.ycombinator.com | 13 Dec 2023

https://github.com/collabora/WhisperLive
The is another one that uses huggingface's implementation, but I haven't tried it since my spec doesn't support flash-att2
Triple Threat: The Power of Transcription, Summary, and Translation
1 project | news.ycombinator.com | 3 Aug 2023

Curious to see how this works? Check out our demo page - https://col.la/transcription to generate your own transcription, summary, and translation, or use our browser extension - https://github.com/collabora/WhisperLive to get live transcriptions.

What are some alternatives?

When comparing WhisperFusion and WhisperLive you can also consider the following projects:

WhisperSpeech - An Open Source text-to-speech system built by inverting Whisper.

cog-whisper-diarization - Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

metaflow - :rocket: Build and manage real-life ML, AI, and data science projects with ease!

whisper-writer - 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

FLiPStackWeekly - FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...

obs-zoom-and-follow - Dynamic zoom and mouse tracking script for OBS Studio

stable_diffusion.openvino

gpt_chatbot - This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows

openvino-ai-plugins-gimp - GIMP AI plugins with OpenVINO Backend

whisper_streaming - Whisper realtime streaming for long speech-to-text transcription and translation

onnxruntime - ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

gpt-voice-conversation-chatbot - Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.