Silero-vad Alternatives

Similar projects and alternatives to silero-vad

whisper

343 59,916 6.8 Python silero-vad VS whisper

Robust Speech Recognition via Large-Scale Weak Supervision
aider

61 9,167 9.9 Python silero-vad VS aider

aider is AI pair programming in your terminal
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
openai-python

60 19,670 9.5 Python silero-vad VS openai-python

The official Python library for the OpenAI API
faster-whisper

22 8,578 8.3 Python silero-vad VS faster-whisper

Faster Whisper transcription with CTranslate2
subsync

13 1,196 0.0 C++ silero-vad VS subsync

Subtitle Speech Synchronizer
cheetah

5 552 8.3 Python silero-vad VS cheetah

On-device streaming speech-to-text engine powered by deep learning (by Picovoice)
whisper-auto-transcribe

8 192 6.1 Python silero-vad VS whisper-auto-transcribe

Auto transcribe tool based on whisper
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
kaldi-active-grammar

10 329 0.0 Python silero-vad VS kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
GassistPi

1 1,013 0.0 Python silero-vad VS GassistPi

Google Assistant for Single Board Computers
mr-robot

1 4 0.0 Python silero-vad VS mr-robot

A multi-utility discord bot. Playback hilarious voice tracks on-demand, wiki for anything, turn on/off IoT enabled devices, and more!
hollow-knight-voice-commands

1 1 3.9 Python silero-vad VS hollow-knight-voice-commands

A fun little python tool to play Hollow Knight with only voice commands
subgen

4 354 9.8 Python silero-vad VS subgen

Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, Emby, Tautulli, or Bazarr
mycroft-precise

3 792 0.0 Python silero-vad VS mycroft-precise

A lightweight, simple-to-use, RNN wake word listener
yarn

3 1,130 7.1 Python silero-vad VS yarn

YaRN: Efficient Context Window Extension of Large Language Models (by jquesnelle)
subsai

3 1,040 8.1 Python silero-vad VS subsai

🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
Caster

7 327 2.9 Python silero-vad VS Caster

Dragonfly-Based Voice Programming and Accessibility Toolkit
spokestack-python

7 132 3.3 Python silero-vad VS spokestack-python

Discontinued Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better silero-vad alternative or higher similarity.

Suggest an alternative to silero-vad

silero-vad reviews and mentions

Posts with mentions or reviews of silero-vad. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-06.

New models and developer products announced at OpenAI DevDay
8 projects | news.ycombinator.com | 6 Nov 2023

>How do you detect speech starting and stopping?
https://github.com/snakers4/silero-vad
[Discussion] Video Translation Task
2 projects | /r/MachineLearning | 13 Jul 2023

you could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad
Using Whisper to transcribe the entire Forensic Files series
5 projects | /r/DataHoarder | 4 Jun 2023

I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:
Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
5 projects | /r/LearnJapanese | 22 Sep 2022

By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file.
[P] A more detailed post about Silero VAD on The Gradient
1 project | /r/MachineLearning | 19 Feb 2022

The VAD is always available on Github
Silero VAD: pre-trained enterprise-grade voice activity detector
1 project | news.ycombinator.com | 30 Dec 2021
[P] Silero VAD: One voice detector to rule them all
2 projects | /r/MachineLearning | 18 Dec 2021

I also pinned some interesting comments here regarding mobile and IOT usage here - https://github.com/snakers4/silero-vad/issues/37
One voice detector to rule them all
1 project | news.ycombinator.com | 7 Dec 2021
A note from our sponsor - InfluxDB
www.influxdata.com | 19 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Stats

Basic silero-vad repo stats

Mentions

Stars

2,780

Activity

6.5

Last Commit

1 day ago

snakers4/silero-vad is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of silero-vad is Python.

silero-vad

Silero-vad Alternatives

Similar projects and alternatives to silero-vad

silero-vad reviews and mentions

Stats

Popular Comparisons