Our great sponsors
-
LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
-
quillman
A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.
-
SurveyJS
Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
-
big-AGI
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
What a coincidence, was just looking something similar for local models and stumbled up on this, his Repo seems full of TTS/STT projects..
https://github.com/KoljaB/LocalAIVoiceChat
https://github.com/modal-labs/quillman
I built something similar using WebKit speech recognition but that's limited to Chromium.
Related posts
- Fine-tuning Local LLMs for "Code Interpreter" use: Seeking Experience and Insights
- OpenInterpreter – Natural language interface to your computer
- The Next Generation of Claude (Claude 3)
- Simulatrex, an open-source Large Language Model based simulation framework
- Ask HN: What are some actual use cases of AI Agents?