vid2cleantxt vs distil-whisper

vid2cleantxt

Python API & command-line tool to easily transcribe speech-based video files into clean text (by pszemraj)

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. (by huggingface)

Audio speech-recognition Whisper

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

vid2cleantxt		distil-whisper
	Project
1	Mentions	9
156	Stars	3,199
-	Growth	6.2%
0.0	Activity	8.9
over 1 year ago	Latest Commit	4 days ago
Jupyter Notebook	Language	Python
Apache License 2.0	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

vid2cleantxt

Posts with mentions or reviews of vid2cleantxt. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-01-20.

Downloader for video.ethz.ch videos
2 projects | /r/ethz | 20 Jan 2022

distil-whisper

Posts with mentions or reviews of distil-whisper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-05.

FLaNK Stack 05 Feb 2024
49 projects | dev.to | 5 Feb 2024
Distil-Whisper: a distilled variant of Whisper that is 6x faster
1 project | /r/AudioAI | 17 Nov 2023

Training code will be released in the Distil-Whisper repository this week, enabling anyone in the community to distill a Whisper model in their choice of language!
FLaNK Stack Weekly 06 Nov 2023
21 projects | dev.to | 6 Nov 2023
AI — weekly megathread!
3 projects | /r/artificial | 5 Nov 2023

Hugging Face released Distil-Whisper, a distilled version of Whisper that is 6 times faster, 49% smaller, and performs within 1% word error rate (WER) on out-of-distribution evaluation sets [Details].
Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller
1 project | /r/hackernews | 3 Nov 2023

14 projects | news.ycombinator.com | 31 Oct 2023
Distil-Whisper is up to 6x faster than Whisper while performing within 1% Word-Error-Rate on out-of-distribution eval sets
1 project | /r/speechtech | 2 Nov 2023
Distilling Whisper on 20,000 hours of open-sourced audio data
1 project | /r/AudioAI | 2 Nov 2023

- GitHub page: https://github.com/huggingface/distil-whisper/tree/main
Talk-Llama
8 projects | news.ycombinator.com | 2 Nov 2023

Is https://github.com/huggingface/distil-whisper on its way to whisper.cpp?

What are some alternatives?

When comparing vid2cleantxt and distil-whisper you can also consider the following projects:

SpecVQGAN - Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

WhisperInput - Offline voice input panel & keyboard with punctuation for Android.

PipeWire-Guide - PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

pyvideotrans - Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

web-whisper - OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.

faster-whisper - Faster Whisper transcription with CTranslate2

WOLOF-ASR-Wav2Vec2 - Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.

json-masker - High-performance JSON masker library in Java with no runtime dependencies

silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

willow - Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

web-speech-synthesis-and-recognition - Speech to Text and Text to Speech on a web browser

vid2cleantxt vs SpecVQGAN distil-whisper vs WhisperInput vid2cleantxt vs PipeWire-Guide distil-whisper vs pyvideotrans vid2cleantxt vs web-whisper distil-whisper vs faster-whisper vid2cleantxt vs WOLOF-ASR-Wav2Vec2 distil-whisper vs json-masker vid2cleantxt vs silero-models distil-whisper vs willow vid2cleantxt vs web-speech-synthesis-and-recognition distil-whisper vs web-whisper

Compare vid2cleantxt vs distil-whisper and see what are their differences.

vid2cleantxt

distil-whisper

vid2cleantxt

distil-whisper

What are some alternatives?