SaaSHub helps you find the best software and product alternatives Learn more →
Kaldi Speech Recognition Toolkit Alternatives
Similar projects and alternatives to Kaldi Speech Recognition Toolkit
-
vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
-
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
-
-
speech-and-text-unity-ios-android
Speed to text in Unity iOS use Native Speech Recognition
-
rhasspy
Offline private voice assistant for many human languages
-
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
bert-for-inference
A small repo showing how to easily use BERT (or other transformers) for inference
-
kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
-
-
-
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
-
tailscale
The easiest, most secure way to use WireGuard and 2FA.
-
-
-
PeerTube
ActivityPub-federated video streaming platform using P2P directly in your web browser
-
-
innernet
A private network system that uses WireGuard under the hood.
-
AnySoftKeyboard
Android (f/w 2.1+) on screen keyboard for multiple languages.
-
void-packages
The Void source packages collection
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Kaldi Speech Recognition Toolkit reviews and mentions
- Unsupervised (Semi-Supervised) ASR/STT training recipes
-
[D] What's stopping you from working on speech and voice?
- https://github.com/kaldi-asr/kaldi
-
C++ for machine learning
Additionally, C++ may be used for extremely high levels of optimization even for cloud-based ML. Dlib and Kaldi are C++ libraries used as dependencies in Python codebases for computer vision and audio processing, for example. So if your application requires you to customize any functions similar to those libraries, then you'll need C++ knowhow.
-
xbp-src to only cross compile 32-bit
Hello. I'm trying to package the openfst library (here)[https://github.com/void-linux/void-packages/pull/39015] but a developer says 32-bit must be cross compiled from 64-bit. I see xbps-src has a nocross option, but I don't see a way to only cross compile. What do you think I should do? I have currently limited the archs to 64-bit ones. Here's my issue with the developer's response: https://github.com/kaldi-asr/kaldi/issues/4808 Thank you.
-
Is there a way to integrate a raspberry pi with a keyboard to do speech to text?
State-of-the-art ASR, like what you get on smartphones, has unfortunately high resource requirements. Some recent smartphone models are able to run ASR on-device, but more typically, ASR is done by sending audio to a web service. Check out the (currently experimental) Web SpeechRecognition API in a Chrome browser. Here is a demo of the API in action. For something open source, check out Kaldi ASR.
- How to get high-quality, low-cost Speech-to-Text transcription?
-
SOTA speech to text framework?
I'm not how sure it stacks with recent state of art, but Kaldi toolkit (https://github.com/kaldi-asr/kaldi) used to be popular for building all kinds of practical integrations and experiments for speech recognition.
-
5 Best Open Source Libraries and APIs for Speaker Diarization
Kaldi ASR is a well-known open source Speech Recognition platform. To use its Speaker Diarization library, you’ll need to either download their PLDA backend or pre-trained X-Vectors, or train your own models.
-
Nerd-dictation, hackable speech to text on Linux
Vosk-api isn't an SST engine itself, it is built using the Kaldi speech recognition toolkit (https://github.com/kaldi-asr/kaldi) and nicely implements and packages an API for Kaldi chain/LF-MMI models.
-
Help picking a good speech recognition library
https://kaldi-asr.org/ (best out of the box accuracy but it is a complicated toolkit and not beginner friendly)
-
A note from our sponsor - SaaSHub
www.saashub.com | 29 Mar 2024
Stats
kaldi-asr/kaldi is an open source project licensed under GNU General Public License v3.0 or later which is an OSI approved license.
The primary programming language of Kaldi Speech Recognition Toolkit is Shell.
Popular Comparisons
- Kaldi Speech Recognition Toolkit VS vosk-api
- Kaldi Speech Recognition Toolkit VS DeepSpeech
- Kaldi Speech Recognition Toolkit VS pyannote-audio
- Kaldi Speech Recognition Toolkit VS speech-and-text-unity-ios-android
- Kaldi Speech Recognition Toolkit VS espnet
- Kaldi Speech Recognition Toolkit VS rhasspy
- Kaldi Speech Recognition Toolkit VS bert-for-inference
- Kaldi Speech Recognition Toolkit VS TTS
- Kaldi Speech Recognition Toolkit VS kaldi-gstreamer-server
- Kaldi Speech Recognition Toolkit VS OpenAL