Serverless voice chat with Vicuna-13B

text-generation-webui

876 36,293 9.9 Python

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

I've been using textgen and downloading tons of models, and the models are all over the place. Accuracy and short-term memory are the major problems, and people are trying to implement workarounds for them.
Check out textgen: it has voice in/out, graphics in/out, a memory plugin, and an API, all running locally.
https://github.com/oobabooga/text-generation-webui

quillman

7 945 6.7 JavaScript

A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.
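
The description above implies a three-stage loop: speech-to-text, a streamed language-model reply, and text-to-speech. A minimal sketch of that flow follows. Every stage is a hypothetical stub (`transcribe`, `stream_reply`, `split_sentences`, `synthesize`, and `voice_chat_turn` are illustrative names, not quillman's actual code), and the only point it tries to show is that speech can be synthesized sentence by sentence while tokens are still arriving.

```python
from typing import Iterable, Iterator, List

SENTENCE_END = (".", "!", "?")


def transcribe(audio: bytes) -> str:
    """Hypothetical speech-to-text stub; a real app would call a model such as Whisper."""
    # Assumption for the sketch: the "audio" bytes are already UTF-8 text.
    return audio.decode("utf-8")


def stream_reply(prompt: str) -> Iterator[str]:
    """Hypothetical language-model stub that yields the reply one token at a time."""
    reply = f"You said: {prompt}. Nice to meet you!"
    for token in reply.split(" "):
        yield token + " "


def split_sentences(tokens: Iterable[str]) -> Iterator[str]:
    """Group streamed tokens into sentences so synthesis can start early."""
    buffer = ""
    for token in tokens:
        buffer += token
        if buffer.rstrip().endswith(SENTENCE_END):
            yield buffer.strip()
            buffer = ""
    if buffer.strip():
        # Flush whatever is left when the stream ends without punctuation.
        yield buffer.strip()


def synthesize(sentence: str) -> bytes:
    """Hypothetical text-to-speech stub; returns fake audio bytes."""
    return sentence.encode("utf-8")


def voice_chat_turn(audio: bytes) -> List[bytes]:
    """Run one turn: transcribe, stream a reply, synthesize each sentence."""
    text = transcribe(audio)
    return [synthesize(s) for s in split_sentences(stream_reply(text))]


if __name__ == "__main__":
    for clip in voice_chat_turn(b"hello"):
        print(clip)
```

In a real deployment each stub would be replaced by an actual model call, and each synthesized clip would be sent to the client as soon as it is ready rather than collected into a list.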

tortoise-tts

144 11,755 8.2 Jupyter Notebook

A multi-voice TTS system trained with an emphasis on quality

Nice to see Tortoise being used - I still think it's the best TTS system out there now. Generation time is slow, but quality is incredible. I wonder if the code can be optimised to speed up the generation, but I don't think the author is maintaining it any longer.[0]
[0]: https://github.com/neonbjb/tortoise-tts

bark

67 32,517 6.5 Jupyter Notebook

🔊 Text-Prompted Generative Audio Model

Tortoise looks really nice! The output is very "polished" and audiobook-like. It's a contrast to Bark[0] which is far more expressive but unpredictable.
[0]: https://github.com/suno-ai/bark

TTS

231 29,174 9.5 Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Coqui also looks interesting.
https://github.com/coqui-ai/TTS
Support for it was recently added to vocode:
https://github.com/vocodedev/vocode-python/pull/56

vocode-python

9 2,287 9.1 Python

🤖 Build voice-based LLM agents. Modular + open source.


whisper.cpp

187 31,174 9.8 C

Port of OpenAI's Whisper model in C/C++

mimic3

24 972 0.0 Python

A fast local neural text to speech engine for Mycroft

It took quite a bit of digging to find the repo link, https://github.com/MycroftAI/mimic3#readme, and it's licensed under AGPL-3, for those interested in such things.

NOTE: The number of mentions on this list counts mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

CoquiTTS: 🐸💬 - Open Source Text-to-Speech framework.
3 projects | /r/programming | 30 Aug 2021

OpenAI deems its voice cloning tool too risky for general release
1 project | news.ycombinator.com | 31 Mar 2024

What things are happening in ML that we can't hear over the din of LLMs?
3 projects | news.ycombinator.com | 28 Mar 2024

Base TTS (Amazon): The largest text-to-speech model to-date
3 projects | news.ycombinator.com | 14 Feb 2024

Coqui Is Shutting Down
1 project | news.ycombinator.com | 11 Jan 2024

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Python, openai, text-to-speech, speech-to-text, Deep Learning
Post date: 25 Apr 2023
