pyctcdecode

A fast and lightweight python-based CTC beam search decoder for speech recognition. (by kensho-technologies)

Pyctcdecode Alternatives

Similar projects and alternatives to pyctcdecode

  • DeepSpeech

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

  • tevr-asr-tool

    State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better pyctcdecode alternative or higher similarity.

pyctcdecode reviews and mentions

Posts with mentions or reviews of pyctcdecode. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-08-10.
  • Show HN: State-of-the-Art German Speech Recognition in 284 lines of C++
    5 projects | news.ycombinator.com | 10 Aug 2022
    I wrote "284 lines of C++" to indicate that this is compact enough for people to actually read and understand the source code. Also, compiling my implementation is super easy and straightforward ... something which can't be said for Kaldi, Vosk, or DeepSpeech.

    If you try to read the CTC beam search decoder from Mozilla's DeepSpeech [1], that alone is about 2000 LOC in multiple files.

    If you try to read the pyctcdecode source that is used by HuggingFace [2], that's 1000+ LOC of Python.

    But this implementation is all the client-side, i.e. the entire "native_client" folder hierarchy in DeepSpeech [3], narrowed down to a mere 284 lines.

    [1] https://github.com/mozilla/DeepSpeech/tree/master/native_cli...

    [2] https://github.com/kensho-technologies/pyctcdecode

    [3] https://github.com/mozilla/DeepSpeech/tree/master/native_cli...

  • kensho-technologies/pyctcdecode
    1 project | /r/speechtech | 25 Jun 2021

Stats

Basic pyctcdecode repo stats
2
405
2.1
10 months ago

kensho-technologies/pyctcdecode is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of pyctcdecode is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com