Whisper Alternatives
Similar projects and alternatives to whisper
-
Home Assistant
:house_with_garden: Open source home automation that puts local control and privacy first.
-
Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model (by Const-me)
-
buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
-
pytube
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
-
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
whisper discussion
whisper reviews and mentions
- Smartphone buyers meh on AI, care much more about battery life
-
Ask HN: Real-time speech-to-speech translation
Has anyone had any luck with an offline, free, open-source real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)?
* https://github.com/ictnlp/StreamSpeech
* https://github.com/k2-fsa/sherpa-onnx
* https://github.com/openai/whisper
I'm looking for a simple app that can listen for English, translate into Korean (and other languages), then perform speech synthesis on the translation. Basically, a Babelfish that doesn't stick in the ear. Although real-time would be great, a 3- to 5-second delay is manageable.
RTranslator is awkward (couldn't get it to perform speech-to-speech using a single phone). 3PO sprouts errors like dandelions and requires an online connection.
Any suggestions?
-
Cross-compile a distributed Electron App
My last post about LiveCaptioning mentions WhisperScript, a macOS-only Electron app. On the thread there are (rightly so) 50ish complaints about it not being available for Windows or Linux.
- OpenAI Whisper large-v3-turbo model release
- OpenAI released Whisper large-v3-turbo model
- New OpenAI Whisper model: "turbo"
-
Built in Days, Acquired for $20K: The NuloApp Story
In order to programmatically get the correct clips to extract, AI tools like Meta's llama-3-70b LLM and OpenAI's Whisper were also used. Whisper allowed for fast speech-to-text transcription, which could then be passed on to Llama in order to find segments worth extracting.
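The transcribe-then-select flow described in this excerpt could be sketched roughly as follows. This is not NuloApp's code: it is a minimal illustration that assumes the `openai-whisper` package (and ffmpeg) is installed, and the helper names `transcribe`, `format_segments`, and `build_prompt`, along with the prompt wording, are hypothetical. The LLM call itself is left out, since the excerpt does not say how Llama was invoked.

```python
def transcribe(path: str, model_name: str = "turbo") -> list[dict]:
    """Run Whisper on an audio file and return its timestamped segments."""
    # Imported lazily so the pure helpers below work without Whisper installed.
    import whisper  # assumption: `pip install openai-whisper`, ffmpeg on PATH

    model = whisper.load_model(model_name)
    result = model.transcribe(path)
    # Each segment is a dict that includes "start", "end" (seconds) and "text".
    return result["segments"]


def format_segments(segments: list[dict]) -> str:
    """Render segments as one '[start-end] text' line each."""
    lines = []
    for seg in segments:
        text = seg["text"].strip()
        lines.append(f"[{seg['start']:.1f}-{seg['end']:.1f}] {text}")
    return "\n".join(lines)


def build_prompt(segments: list[dict], max_clips: int = 3) -> str:
    """Build a (hypothetical) prompt asking an LLM to pick clip-worthy ranges."""
    return (
        f"Below is a timestamped transcript. Pick up to {max_clips} "
        "time ranges worth extracting as short clips and list them "
        "as 'start-end' in seconds.\n\n" + format_segments(segments)
    )


if __name__ == "__main__":
    # Hand-written sample segments in Whisper's segment shape (not real output).
    sample = [
        {"start": 0.0, "end": 2.5, "text": " Hello there."},
        {"start": 2.5, "end": 5.0, "text": " Second line."},
    ]
    print(build_prompt(sample))
```

In real use, `build_prompt(transcribe("talk.mp3"))` would produce the text to send to whichever LLM is chosen; `talk.mp3` is a placeholder file name.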
- Whisper-WebUI
- OTranscribe: A free and open tool for transcribing audio interviews
Stats
openai/whisper is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of whisper is Python.