Serverless voice chat with Vicuna-13B

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • text-generation-webui

    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

  • Been using textgen and downloading tons of models, the models are all over the place. The problems of accuracy and short term memory are major issues that people are trying to implement work arounds.

    Check out textgen, it has voice in/out, graphics in/out, memory plugin, api, all running locally.

    https://github.com/oobabooga/text-generation-webui

  • quillman

    A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • tortoise-tts

    A multi-voice TTS system trained with an emphasis on quality

  • Nice to see Tortoise being used - I still think it's the best TTS system out there now. Generation time is slow, but quality is incredible. I wonder if the code can be optimised to speed up the generation, but I don't think the author is maintaining it any longer.[0]

    [0]https://github.com/neonbjb/tortoise-tts

  • bark

    πŸ”Š Text-Prompted Generative Audio Model

  • Tortoise looks really nice! The output is very "polished" and audiobook-like. It's a contrast to Bark[0] which is far more expressive but unpredictable.

    [0]: https://github.com/suno-ai/bark

  • TTS

    πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

  • Coqui also looks interesting.

    https://github.com/coqui-ai/TTS

    Support for it was recently added to vocode:

    https://github.com/vocodedev/vocode-python/pull/56

  • vocode-python

    πŸ€– Build voice-based LLM agents. Modular + open source.

  • Coqui also looks interesting.

    https://github.com/coqui-ai/TTS

    Support for it was recently added to vocode:

    https://github.com/vocodedev/vocode-python/pull/56

  • whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • mimic3

    A fast local neural text to speech engine for Mycroft

  • It took quite a bit of digging to find the repo link https://github.com/MycroftAI/mimic3#readme and it's AGPL-3 for those interested in such things

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • CoquiTTS: πŸΈπŸ’¬ - Open Source Text-to-Speech framework.

    3 projects | /r/programming | 30 Aug 2021
  • OpenAI deems its voice cloning tool too risky for general release

    1 project | news.ycombinator.com | 31 Mar 2024
  • What things are happening in ML that we can't hear oer the din of LLMs?

    3 projects | news.ycombinator.com | 28 Mar 2024
  • Base TTS (Amazon): The largest text-to-speech model to-date

    3 projects | news.ycombinator.com | 14 Feb 2024
  • Coqui Is Shutting Down

    1 project | news.ycombinator.com | 11 Jan 2024