gpt_chatbot
talk-to-chatgpt
gpt_chatbot | talk-to-chatgpt | |
---|---|---|
1 | 19 | |
52 | 1,943 | |
- | - | |
6.8 | 6.6 | |
5 months ago | 6 days ago | |
Python | JavaScript | |
- | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gpt_chatbot
-
I made a new ChatGPT interface!
I like it! Would be really nice if you could add text-to-speech, ideally both with cheap options (Azure/Google/Amazon) and something like elevenlabs. Like in https://github.com/1nnovat1on/gpt_chatbot and https://github.com/C-Nedelcu/talk-to-chatgpt
talk-to-chatgpt
-
Now, ChatGPT can see my screen. Should I build this out? [video]
I'm trying to decide if it's worth cleaning up the code and publishing this. Is it something that would be interesting to others? It's still pretty buggy, in part because ChatGPT itself is buggy, particularly these past few days! But also, I have to regularly refresh the chrome tab when mic access or audio playback stops working. It's going to take some effort to make this thing more reliable. I need to better understand how the background page & service workers works with Chrome extensions.
This project started after I tried a bunch of Chrome plugins that let you speak to Chat GPT. The one I liked best was this one: https://github.com/C-Nedelcu/talk-to-chatgpt
I forked the code and refactored it quite a bit in order to improve the voice recognition, improve the quality of text-to-speech (using WellSaid) and then I added the screen capture capabilities. That's when it started feeling truly magical and useful to me.
The biggest issue is that ChatGPT is still too slow, but Sam Altman claimed during devday that improving the speed is the next biggest priority for them.
-
NEW: ChatGPT plugin store can be searched now!
That will be amazing for sure, but there is Talk-to-GPT in the meantime. The speechki plugin can also do limited text to speech.
-
WhisperChat: An Open Source, Voice Based conversational assistant using React/Node
Talk-To-ChatGPT is a popular one, and I believe it has API integration with ElevenLabs, so you can create custom voices. The repository is here and the extension is here. There are many more if you search the Chrome web store.
-
What's the most user friendly voice interface to GPT? Speech for both input and output, paid or providing your own API key is ok.
i like https://github.com/C-Nedelcu/talk-to-chatgpt a lot. you can use elevenlabs api key and use own custom voices.
-
Streamlining GPT Workflow
There's this: https://github.com/C-Nedelcu/talk-to-chatgpt
-
Is there a version of ChatGPT that you can speak to and have it speak back?
https://github.com/C-Nedelcu/talk-to-chatgpt (and the actual resultant extension: https://chrome.google.com/webstore/detail/talk-to-chatgpt/hodadfhfagpiemkeoliaelelfbboamlk)
- SUPER COOL!! Crosstalk with talk-to-chatgpt chrome extension powered AI partner.
-
I made a new ChatGPT interface!
I like it! Would be really nice if you could add text-to-speech, ideally both with cheap options (Azure/Google/Amazon) and something like elevenlabs. Like in https://github.com/1nnovat1on/gpt_chatbot and https://github.com/C-Nedelcu/talk-to-chatgpt
- Integrated voice recognition and text to speech
-
What are some ways that you make ChatGPT verbally conversational or otherwise more *naturally* conversational?
Today I discovered https://github.com/C-Nedelcu/talk-to-chatgpt, which is a great concept. Having a verbal conversation with ChatGPT is much more engaging for me. Do you have any special ways to simulate a more real conversation?
What are some alternatives?
langchain-chatbot - Chatbot using LLM chat model and Langchain, LangSmith.
CopperAI - CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM)
elevenlabs-python - The official Python API for ElevenLabs Text to Speech.
rust-bert - Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
elevenlabs-unleashed - Provides unlimited ElevenLabs API calls.
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
LLMChat - A Discord chatbot that supports popular LLMs for text generation and ultra-realistic voices for voice chat.
askai - Command Line Interface for OpenAi ChatGPT
WhisperLive - A nearly-live implementation of OpenAI's Whisper.
BentoChain - A voice-enabled chatbot application built using of 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and 🍱 BentoML.
M.I.L.E.S - M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searches global weather, delivers date and time, autonomously chooses and retains long-term memories. Available for macOS and Windows.
gpt-voice-conversation-chatbot - Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.