The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Whisper-asr-webservice Alternatives
Similar projects and alternatives to whisper-asr-webservice
-
bazarr
Bazarr is a companion application to Sonarr and Radarr. It manages and downloads subtitles based on your requirements. You define your preferences by TV show or movie and Bazarr takes care of everything for you.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
generate-subtitles
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
-
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
-
whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
-
Whisper_Notebook
A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
whisper-asr-webservice reviews and mentions
- How I converted a podcast into a knowledge base using Orama search and OpenAI whisper and Astro
-
Bazarr AI subs
Check https://github.com/openai/whisper & https://github.com/ahmetoner/whisper-asr-webservice
-
Bulk download subtitles
I see that bazarr had already been mentioned. If there are no subtitles available, you can also generate the subtitles by connecting bazarr to the AI model whisper which you can self host locally. I run everything in containers, tried it a few times and it works quite well for me! It does however use some computational resources to generate the subtitles, how long processing takes depends on the chosen model accuracy.
-
Writeout.ai – Transcribe and translate any audio files. Free and open source
You (essentially) need GPU but here you go:
https://github.com/ahmetoner/whisper-asr-webservice
For your requirements the medium.en model (max) should be satisfactory.
-
Whispers AI Modular Future
What utilities related to Whisper do you wish existed? What have you had to build yourself?
On the end user application side, I wish there was something that let me pick a podcast of my choosing, get it fully transcribed, and get an embeddings search plus answer q&a on top of that podcast or set of chosen podcasts. I've seen ones for specific podcasts, but I'd like one where I can choose the podcast. (Probably won't build it)
Also on the end user side, I wish there was an Otter alternative (still paid $30/mo, but unlimited minutes per month) that had longer transcription limits. (Started building this, not much interest from users though)
Things I've seen on the dev tool side:
Gladia (API call version of Whisper)
Whisper.cpp
Whisper webservice (https://github.com/ahmetoner/whisper-asr-webservice) - via this thread
Live microphone demo (not real time, it still does it in chunks) https://github.com/mallorbc/whisper_mic
Streamlit UI https://github.com/hayabhay/whisper-ui
Whisper playground https://github.com/saharmor/whisper-playground
Real time whisper https://github.com/shirayu/whispering
Whisper as a service https://github.com/schibsted/WAAS
Improved timestamps and speaker identification https://github.com/m-bain/whisperX
MacWhisper https://goodsnooze.gumroad.com/l/macwhisper
Crossplatform desktop Whisper that supports semi-realtime https://github.com/chidiwilliams/buzz
-
I made a free transcription service powered by Whisper AI
I think there's been talk to do speaker diarization with whisper-asr-webservice[0] which is also written in python and should be able to make use of goodies such as pyannote-audio, py-webrtcvad, etc.
Whisper is great but at the point we get to kludging various things together it starts to make more sense to use something like Nvidia NeMo[1] which was built with all of this in mind and more
[0] - https://github.com/ahmetoner/whisper-asr-webservice
[1] - https://github.com/NVIDIA/NeMo
- whisper-asr-webservice-client - A self-hosted OpenAI Whisper API client
-
Show HN: A self-hosted OpenAI Whisper API client
(read the docs in the repo)
In terms of me not storing your data for this (I don't) I guess you'll just have to trust me?
[0] - https://github.com/ahmetoner/whisper-asr-webservice
-
[P] OpenAI Whisper ASR Webservice API released
For more details: https://github.com/ahmetoner/whisper-asr-webservice
-
A note from our sponsor - WorkOS
workos.com | 27 Apr 2024
Stats
ahmetoner/whisper-asr-webservice is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of whisper-asr-webservice is Python.
Popular Comparisons
- whisper-asr-webservice VS whisper.cpp
- whisper-asr-webservice VS whisper
- whisper-asr-webservice VS generate-subtitles
- whisper-asr-webservice VS whisperX
- whisper-asr-webservice VS whisper-asr-webservice-client
- whisper-asr-webservice VS gitbar-2023
- whisper-asr-webservice VS openai-whisper-cpu
- whisper-asr-webservice VS FlexGen
- whisper-asr-webservice VS pyannote-audio
- whisper-asr-webservice VS WAAS
Sponsored