gpt_chatbot
WhisperLive
gpt_chatbot | WhisperLive | |
---|---|---|
1 | 4 | |
52 | 1,253 | |
- | 17.0% | |
6.8 | 9.4 | |
5 months ago | 11 days ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gpt_chatbot
-
I made a new ChatGPT interface!
I like it! Would be really nice if you could add text-to-speech, ideally both with cheap options (Azure/Google/Amazon) and something like elevenlabs. Like in https://github.com/1nnovat1on/gpt_chatbot and https://github.com/C-Nedelcu/talk-to-chatgpt
WhisperLive
-
Show HN: WhisperFusion β Ultra-low latency conversations with an AI chatbot
Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
-
WhisperSpeech β An Open Source text-to-speech system built by inverting Whisper
Check out WhisperLive: https://github.com/collabora/WhisperLive
If you're grappling with the slow march from cool tech demos to real-world language model apps, you might wanna check out WhisperLive. It's this rad open-source project thatβs all about leveraging Whisper models for slick live transcription. Think real-time, on-the-fly translated captions for those global meetups. It's a neat example of practical, user-focused tech in action. Dive into the details on their GitHub page
-
Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX
https://github.com/collabora/WhisperLive
The is another one that uses huggingface's implementation, but I haven't tried it since my spec doesn't support flash-att2
-
Triple Threat: The Power of Transcription, Summary, and Translation
Curious to see how this works? Check out our demo page - https://col.la/transcription to generate your own transcription, summary, and translation, or use our browser extension - https://github.com/collabora/WhisperLive to get live transcriptions.
What are some alternatives?
langchain-chatbot - Chatbot using LLM chat model and Langchain, LangSmith.
cog-whisper-diarization - Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
elevenlabs-python - The official Python API for ElevenLabs Text to Speech.
whisper-writer - π¬π A small dictation app using OpenAI's Whisper speech recognition model.
elevenlabs-unleashed - Provides unlimited ElevenLabs API calls.
obs-zoom-and-follow - Dynamic zoom and mouse tracking script for OBS Studio
LLMChat - A Discord chatbot that supports popular LLMs for text generation and ultra-realistic voices for voice chat.
whisper_streaming - Whisper realtime streaming for long speech-to-text transcription and translation
BentoChain - A voice-enabled chatbot application built using of π¦οΈπ LangChain, text-to-speech, and speech-to-text models from π€ Hugging Face, and π± BentoML.
gpt-voice-conversation-chatbot - Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
M.I.L.E.S - M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searches global weather, delivers date and time, autonomously chooses and retains long-term memories. Available for macOS and Windows.
WhisperFusion - WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.