Voice

Top 23 Voice Open-Source Projects

  • Retrieval-based-Voice-Conversion-WebUI

    Voice data <= 10 mins can also be used to train a good VC model!

  • Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

    RVC does live voice changing with a little latency: https://github.com/RVC-Project/Retrieval-based-Voice-Convers...

    The product isn't exactly spectacular, but most of the work seems to have been done. Just needs someone to go over the UI and make it less unstable, really.

  • NoiseTorch

    Real-time microphone noise suppression on Linux.

  • Project mention: Ask HN: What are some unpopular technologies you wish people knew more about? | news.ycombinator.com | 2023-12-02

    Noisetorch. https://github.com/noisetorch/NoiseTorch

  • annyang

    💬 Speech recognition for your site

  • noise-suppression-for-voice

    Noise suppression plugin based on Xiph's RNNoise

  • Project mention: Having trouble with getting microphone to be recognized | /r/archlinux | 2023-06-25

    Yes, I think it pipes the default input (source) to the default output. I have a noise cancelling source and if I switch to it the loopback follows it and I hear the de-noised one

  • common-voice

    Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

  • Project mention: OpenAI's Whisper is another case study in Colonisation | news.ycombinator.com | 2024-02-06

    Mozilla's Common Voice project (https://commonvoice.mozilla.org/) is creating an open dataset for many minority languages to make it easier to support them in STT systems. If you speak one of these languages, please consider donating a few minutes of your voice.

  • jovo-framework

    🔈 The React for Voice and Chat: Build Apps for Alexa, Messenger, Instagram, the Web, and more

  • voice_datasets

    🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

  • cursorless

    Don't let the cursor slow you down

  • Project mention: Launch HN: Aqua Voice (YC W24) – Voice-driven text editor | news.ycombinator.com | 2024-03-26

    What are your opinions on https://www.cursorless.org/ ?

    Are you targeting developers?

    My understanding was people who are serious about developing via voice use it pretty exclusively.

    Like, yeah you need to learn commands, but "are often not worth it" feels like brushing a pretty massive offering under the rug.

    Is learning vi / emacs commands not worth it (or shortcuts in another IDE?)

    Is there a middle ground?

  • EDDiscovery

    Captain's log and 3D star map for Elite Dangerous

  • Project mention: What are your must-have plugins/resources for ED? | /r/EliteDangerous | 2023-11-29

    EDDiscovery

  • figaro

    Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵 (by MattMoony)

  • wunjo.wladradchenko.ru

    Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.

  • Project mention: Check out Wunjo AI – open-source AI Toolkit | news.ycombinator.com | 2024-01-16

    AI Retouch Tool & Segmentation Mask

    GitHub Stars Needed!

    We're at 499 stars on GitHub, just 13 away from a cool milestone! If you like what you see, I'd appreciate your support. Check it out and drop a star if you find it interesting.

    GitHub Repository: https://github.com/wladradchenko/wunjo.wladradchenko.ru

    Thanks a bunch for your time and support!

  • Twilio-csharp

    Twilio C#/.NET Helper Library for .NET6+.

  • Voice Overlay iOS

    🗣 An overlay that gets your user's voice permission and input as text in a customizable UI

  • TTS-Voice-Wizard

    Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

  • Project mention: VRchat chatbox to speech | /r/VRchat | 2023-05-29

    Maybe that? https://github.com/VRCWizard/TTS-Voice-Wizard

  • mimic-recording-studio

    Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2

  • tgcalls

    Voice chats, private incoming and outgoing calls in Telegram for Developers

  • Simple-Voice-Recorder

    An easy way of recording any discussion or sounds without ads or internet access

  • vonage-node-sdk

    Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.

  • Project mention: Send SMS Messages with Cloud Functions For Firebase Gen 2 | dev.to | 2024-04-11

    Install the Vonage Server SDK for Node.js (@vonage/server-sdk).

  • ESP32-Rhasspy-Satellite

    This repo implements an ESP32 standalone MQTT audio streamer. It is designed to work as a satellite for Rhasspy (https://rhasspy.readthedocs.io/en/latest/). It supports multiple devices

  • voice-gender

    Gender recognition by voice and speech analysis

  • kaldi-active-grammar

    Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

  • Project mention: Ask HN: How do you get started with adding voice commands to a computer system? | news.ycombinator.com | 2023-11-21

    https://github.com/dictation-toolbox/dragonfly

    https://github.com/daanzu/kaldi-active-grammar

  • Caster

    Dragonfly-Based Voice Programming and Accessibility Toolkit

  • univoice

    Voice chat/VoIP solution for unity.

  • Project mention: Godot and using external C# libraries (like Univoice for unity) where there are functional gaps | /r/godot | 2023-06-01

    Opportunity?: Noted solutions via C# libraries (for Unity, admittedly...) that I was "thinking" I could reference, e.g. GitHub - adrenak/univoice: Voice chat/VoIP solution for unity.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source Voice projects? This list will help you:

Rank  Project                                  Stars
1     Retrieval-based-Voice-Conversion-WebUI   18,860
2     NoiseTorch                               8,966
3     annyang                                  6,547
4     noise-suppression-for-voice              4,374
5     common-voice                             3,247
6     jovo-framework                           1,670
7     voice_datasets                           1,525
8     cursorless                               1,066
9     EDDiscovery                              745
10    figaro                                   739
11    wunjo.wladradchenko.ru                   678
12    Twilio-csharp                            653
13    Voice Overlay iOS                        538
14    TTS-Voice-Wizard                         510
15    mimic-recording-studio                   486
16    tgcalls                                  476
17    Simple-Voice-Recorder                    428
18    vonage-node-sdk                          370
19    ESP32-Rhasspy-Satellite                  347
20    voice-gender                             331
21    kaldi-active-grammar                     329
22    Caster                                   328
23    univoice                                 325
