SaaSHub helps you find the best software and product alternatives Learn more β
Top 23 Voice Open-Source Projects
-
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
-
jovo-framework
π The React for Voice and Chat: Build Apps for Alexa, Messenger, Instagram, the Web, and more
-
voice_datasets
π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
figaro
Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. π΅ (by MattMoony)
-
wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
-
Voice Overlay iOS
π£ An overlay that gets your userβs voice permission and input as text in a customizable UI
-
TTS-Voice-Wizard
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
-
mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
-
Simple-Voice-Recorder
An easy way of recording any discussion or sounds without ads or internet access
-
vonage-node-sdk
Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
-
ESP32-Rhasspy-Satellite
The repo has implementing an esp32 standalone MQTT audio streamer. Is is desinged to work as a satellite for Rhasspy (https://rhasspy.readthedocs.io/en/latest/). It supports multiple devices
-
kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
RVC does live voice changing with a little latency: https://github.com/RVC-Project/Retrieval-based-Voice-Convers...
The product isn't exactly spectacular, but most of the works seems to have bene done. Just needs someone to go over the UI and make it less unstable, really.
Project mention: Ask HN: What are some unpopular technologies you wish people knew more about? | news.ycombinator.com | 2023-12-02Noisetorch. https://github.com/noisetorch/NoiseTorch
Project mention: Having trouble with getting microphone to be recognized | /r/archlinux | 2023-06-25Yes, I think it pipes the default input (source) to the default output. I have a noise cancelling source and if I switch to it the loopback follows it and I hear the de-noised one
Project mention: OpenAI's Whisper is another case study in Colonisation | news.ycombinator.com | 2024-02-06Mozillas Common Voice Project (https://commonvoice.mozilla.org/) is creating an open dataset for many minority languages to make it easier to support them in STT systems. If you speak one of these languages please consider donating a few minutes of your voice.
Project mention: Launch HN: Aqua Voice (YC W24) β Voice-driven text editor | news.ycombinator.com | 2024-03-26What are your opinions on https://www.cursorless.org/ ?
Are you targeting developers?
My understanding was people who are serious about developing via voice use it pretty exclusively.
Like, yeah you need to learn commands, but "are often not worth it" feels like brushing a pretty massive offering under the rug.
Is learning vi / emacs commands not worth it (or shortcuts in another IDE?)
Is there a middle ground?
EDDiscovery
AI Retouch Tool & Segmentation Mask
GitHub Stars Needed!
We're at 499 stars on GitHub, just 13 away from a cool milestone! If you like what you see, I'd appreciate your support. Check it out and drop a star if you find it interesting.
GitHub Repository: https://github.com/wladradchenko/wunjo.wladradchenko.ru
Thanks a bunch for your time and support!
Maybe that? https://github.com/VRCWizard/TTS-Voice-Wizard
Install the Vonage Server SDK for Node.js (@vonage/server-sdk).
Project mention: Ask HN: How do you get started with adding voice commands to a computer system? | news.ycombinator.com | 2023-11-21https://github.com/dictation-toolbox/dragonfly
https://github.com/daanzu/kaldi-active-grammar
Project mention: Godot and using external C# libraries (like Univoice for unity) where there are functional gaps | /r/godot | 2023-06-01Opportunity?: Noted solutions via C# libraries (for unity admitedly...) that was "thinking" could reference. e.g. GitHub - adrenak/univoice: Voice chat/VoIP solution for unity.
Voice related posts
- Send SMS Messages with Cloud Functions For Firebase Gen 2
- Launch HN: Aqua Voice (YC W24) β Voice-driven text editor
- OpenAI's Whisper is another case study in Colonisation
- Cursorless: Voice Coding at the Speed of Thought
- TurnVoice: Transform and translate voices in YouTube videos
- I made a theme song for Vito Loses
- Mozilla Launching a Public Voice Dataset
-
A note from our sponsor - SaaSHub
www.saashub.com | 23 Apr 2024
Index
What are some of the best open-source Voice projects? This list will help you:
Project | Stars | |
---|---|---|
1 | Retrieval-based-Voice-Conversion-WebUI | 18,860 |
2 | NoiseTorch | 8,966 |
3 | annyang | 6,547 |
4 | noise-suppression-for-voice | 4,374 |
5 | common-voice | 3,247 |
6 | jovo-framework | 1,670 |
7 | voice_datasets | 1,525 |
8 | cursorless | 1,066 |
9 | EDDiscovery | 745 |
10 | figaro | 739 |
11 | wunjo.wladradchenko.ru | 678 |
12 | Twilio-csharp | 653 |
13 | Voice Overlay iOS | 538 |
14 | TTS-Voice-Wizard | 510 |
15 | mimic-recording-studio | 486 |
16 | tgcalls | 476 |
17 | Simple-Voice-Recorder | 428 |
18 | vonage-node-sdk | 370 |
19 | ESP32-Rhasspy-Satellite | 347 |
20 | voice-gender | 331 |
21 | kaldi-active-grammar | 329 |
22 | Caster | 328 |
23 | univoice | 325 |
Sponsored