WhisperLive
gpt-voice-conversation-chatbot
WhisperLive | gpt-voice-conversation-chatbot | |
---|---|---|
4 | 4 | |
1,253 | 287 | |
17.0% | - | |
9.4 | 5.0 | |
8 days ago | 7 days ago | |
Python | Python | |
MIT License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
WhisperLive
-
Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot
Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
-
WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper
Check out WhisperLive: https://github.com/collabora/WhisperLive
If you're grappling with the slow march from cool tech demos to real-world language model apps, you might wanna check out WhisperLive. It's this rad open-source project that’s all about leveraging Whisper models for slick live transcription. Think real-time, on-the-fly translated captions for those global meetups. It's a neat example of practical, user-focused tech in action. Dive into the details on their GitHub page
-
Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX
https://github.com/collabora/WhisperLive
The is another one that uses huggingface's implementation, but I haven't tried it since my spec doesn't support flash-att2
-
Triple Threat: The Power of Transcription, Summary, and Translation
Curious to see how this works? Check out our demo page - https://col.la/transcription to generate your own transcription, summary, and translation, or use our browser extension - https://github.com/collabora/WhisperLive to get live transcriptions.
gpt-voice-conversation-chatbot
- How to protect against prompt injection in a web app?
-
I used ChatGPT with Elevenlabs and this awesome Python app from Github to talk to Snoop Dogg
All credit goes to Adri6336: https://github.com/Adri6336/gpt-voice-conversation-chatbot
-
Better value: ChatGPT+ or Self Hosting using API?
Afaik there a few tools out there that replicate the ChatGPT experience if you've got an API key. Like this one here https://github.com/Adri6336/gpt-voice-conversation-chatbot that allows you to talk with the bot in a console window. The way I see it, getting an API key opens the door to a wider array of AI uses if you know where to look
-
ChatGPT-like GPT-3 chatbot that allows you to have a spoken conversation, build robo-familiarity, and customize bot
This chatbot was made using GPT-3 with the hope of emulating ChatGPT and adding some features that I wanted for myself. You can find the code at this repo: https://github.com/Adri6336/gpt3-speech-to-text-chatbot
What are some alternatives?
cog-whisper-diarization - Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
chatgpt-demo - Minimal web UI for ChatGPT.
whisper-writer - 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
LLMChat - A Discord chatbot that supports popular LLMs for text generation and ultra-realistic voices for voice chat.
obs-zoom-and-follow - Dynamic zoom and mouse tracking script for OBS Studio
Presto-Change-O - GUI-based program for batch converting audio files from one format to another.
gpt_chatbot - This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows
chatbot_utils - RegEx-based fuzzy command / response handling for conversational chatbots
whisper_streaming - Whisper realtime streaming for long speech-to-text transcription and translation
elevenlabs-unleashed - Provides unlimited ElevenLabs API calls.
WhisperFusion - WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
CatGDP - Meow meow, dis iz a GitPurr repository of CatGDP fur feline whiskerful conversations. Pawsome, right? Hiss-tory in the making! Happy Caturday! 🐾