Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more β
PaddleSpeech Alternatives
Similar projects and alternatives to PaddleSpeech
-
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
-
-
demucs
Discontinued Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.
-
-
DeepSpeech
Discontinued DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
-
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
-
InfluxDB
InfluxDB β Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) (by mozilla)
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
common-voice-android
Repository of "CV Project" app. It's an unofficial app for Mozilla Common Voice, which permits you to contribute to this project via your device.
-
-
-
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
-
STT
πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
-
whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
-
-
-
-
-
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
PaddleSpeech discussion
PaddleSpeech reviews and mentions
-
Open Source Libraries
PaddlePaddle/PaddleSpeech
- I made Lisa-nee TTS (Imai Lisa)
- project
-
is there addon that recognize speech (from video) into text?
I couldn't find any add-ons that did what you needed. I'm sorry. Maybe you could try using PaddleSpeech to see if it works for you, but it is not a Firefox add-on, it's a CLI tool.
-
Mozilla Common Voice Adds 16 New Languages and 4,600 New Hours of Speech
Ah, damn. Didn't realise.
It also looks like Baidu are now developing their Deep Speech as open source? https://github.com/PaddlePaddle/DeepSpeech
- Server-Side Audio Transcription Software
-
A note from our sponsor - Stream
getstream.io | 12 Jul 2025
Stats
PaddlePaddle/PaddleSpeech is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of PaddleSpeech is Python.