Top 23 Voice Open-Source Projects

Retrieval-based-Voice-Conversion-WebUI

56 18,860 9.6 Python

Voice data <= 10 mins can also be used to train a good VC model!

Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

RVC does live voice changing with a little latency: https://github.com/RVC-Project/Retrieval-based-Voice-Convers...
The product isn't exactly spectacular, but most of the work seems to have been done. It just needs someone to go over the UI and make it less unstable, really.

NoiseTorch

106 8,966 5.9 Go

Real-time microphone noise suppression on Linux.

Project mention: Ask HN: What are some unpopular technologies you wish people knew more about? | news.ycombinator.com | 2023-12-02

Noisetorch. https://github.com/noisetorch/NoiseTorch

annyang

2 6,547 0.0 JavaScript

💬 Speech recognition for your site

noise-suppression-for-voice

108 4,374 0.0 C

Noise suppression plugin based on Xiph's RNNoise

Project mention: Having trouble with getting microphone to be recognized | /r/archlinux | 2023-06-25

Yes, I think it pipes the default input (source) to the default output. I have a noise-cancelling source, and if I switch to it, the loopback follows it and I hear the de-noised one.

common-voice

66 3,247 10.0 TypeScript

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

Project mention: OpenAI's Whisper is another case study in Colonisation | news.ycombinator.com | 2024-02-06

Mozilla's Common Voice project (https://commonvoice.mozilla.org/) is creating an open dataset for many minority languages to make it easier to support them in STT systems. If you speak one of these languages, please consider donating a few minutes of your voice.

jovo-framework

5 1,670 7.8 TypeScript

🔈 The React for Voice and Chat: Build Apps for Alexa, Messenger, Instagram, the Web, and more

voice_datasets

3 1,525 3.5

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

cursorless

22 1,066 9.6 TypeScript

Don't let the cursor slow you down

Project mention: Launch HN: Aqua Voice (YC W24) – Voice-driven text editor | news.ycombinator.com | 2024-03-26

What are your opinions on https://www.cursorless.org/ ?
Are you targeting developers?
My understanding was people who are serious about developing via voice use it pretty exclusively.
Like, yeah you need to learn commands, but "are often not worth it" feels like brushing a pretty massive offering under the rug.
Is learning vi / emacs commands not worth it (or shortcuts in another IDE?)
Is there a middle ground?

EDDiscovery

119 745 9.5 C#

Captain's log and 3D star map for Elite Dangerous

Project mention: What are your must-have plugins/resources for ED? | /r/EliteDangerous | 2023-11-29

EDDiscovery

figaro

5 739 0.0 Python

Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵 (by MattMoony)

wunjo.wladradchenko.ru

6 678 9.5 Python

Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.

Project mention: Check out Wunjo AI – open-source AI Toolkit | news.ycombinator.com | 2024-01-16

AI Retouch Tool & Segmentation Mask
GitHub Stars Needed!
We're at 499 stars on GitHub, just 13 away from a cool milestone! If you like what you see, I'd appreciate your support. Check it out and drop a star if you find it interesting.
GitHub Repository: https://github.com/wladradchenko/wunjo.wladradchenko.ru
Thanks a bunch for your time and support!

Twilio-csharp

2 653 8.2 C#

Twilio C#/.NET Helper Library for .NET6+.

Voice Overlay iOS

0 538 0.0 Swift

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

TTS-Voice-Wizard

8 510 8.9 C#

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

Project mention: VRchat chatbox to speech | /r/VRchat | 2023-05-29

Maybe that? https://github.com/VRCWizard/TTS-Voice-Wizard

mimic-recording-studio

4 486 0.0 JavaScript

Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2.

tgcalls

2 476 0.0 Python

Voice chats, private incoming and outgoing calls in Telegram for Developers

Simple-Voice-Recorder

3 428 8.1 Kotlin

An easy way of recording any discussion or sounds without ads or internet access

vonage-node-sdk

2 370 8.5 TypeScript

Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.

Project mention: Send SMS Messages with Cloud Functions For Firebase Gen 2 | dev.to | 2024-04-11

Install the Vonage Server SDK for Node.js (@vonage/server-sdk).

ESP32-Rhasspy-Satellite

1 347 4.1 C++

This repo implements a standalone ESP32 MQTT audio streamer. It is designed to work as a satellite for Rhasspy (https://rhasspy.readthedocs.io/en/latest/). It supports multiple devices.

voice-gender

2 331 0.0 R

Gender recognition by voice and speech analysis

kaldi-active-grammar

10 329 0.0 Python

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Project mention: Ask HN: How do you get started with adding voice commands to a computer system? | news.ycombinator.com | 2023-11-21

https://github.com/dictation-toolbox/dragonfly
https://github.com/daanzu/kaldi-active-grammar

Caster

7 328 2.9 Python

Dragonfly-Based Voice Programming and Accessibility Toolkit

univoice

2 325 3.0 C#

Voice chat/VoIP solution for Unity.

Project mention: Godot and using external C# libraries (like Univoice for unity) where there are functional gaps | /r/godot | 2023-06-01

Opportunity?: I noted solutions via C# libraries (for Unity, admittedly...) that I was thinking I could reference, e.g. GitHub - adrenak/univoice: Voice chat/VoIP solution for unity.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Voice related posts

Send SMS Messages with Cloud Functions For Firebase Gen 2
2 projects | dev.to | 11 Apr 2024
Launch HN: Aqua Voice (YC W24) – Voice-driven text editor
3 projects | news.ycombinator.com | 26 Mar 2024
OpenAI's Whisper is another case study in Colonisation
1 project | news.ycombinator.com | 6 Feb 2024
Cursorless: Voice Coding at the Speed of Thought
1 project | news.ycombinator.com | 31 Jan 2024
TurnVoice: Transform and translate voices in YouTube videos
1 project | news.ycombinator.com | 10 Dec 2023
I made a theme song for Vito Loses
1 project | /r/biggestproblem | 7 Dec 2023
Mozilla Launching a Public Voice Dataset
1 project | news.ycombinator.com | 7 Dec 2023

Index

What are some of the best open-source Voice projects? This list will help you:

#	Project	Stars
1	Retrieval-based-Voice-Conversion-WebUI	18,860
2	NoiseTorch	8,966
3	annyang	6,547
4	noise-suppression-for-voice	4,374
5	common-voice	3,247
6	jovo-framework	1,670
7	voice_datasets	1,525
8	cursorless	1,066
9	EDDiscovery	745
10	figaro	739
11	wunjo.wladradchenko.ru	678
12	Twilio-csharp	653
13	Voice Overlay iOS	538
14	TTS-Voice-Wizard	510
15	mimic-recording-studio	486
16	tgcalls	476
17	Simple-Voice-Recorder	428
18	vonage-node-sdk	370
19	ESP32-Rhasspy-Satellite	347
20	voice-gender	331
21	kaldi-active-grammar	329
22	Caster	328
23	univoice	325