free-spoken-digit-dataset VS lingvo

Compare free-spoken-digit-dataset and lingvo to see how they differ.

                 free-spoken-digit-dataset   lingvo
Mentions         1                           1
Stars            596                         2,780
Growth           -                           0.2%
Activity         0.0                         8.7
Last commit      over 1 year ago             15 days ago
Language         Python                      Python
License          -                           Apache License 2.0
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

free-spoken-digit-dataset

Posts with mentions or reviews of free-spoken-digit-dataset. We have used some of these posts to build our list of alternatives and similar projects.

lingvo

Posts with mentions or reviews of lingvo. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-04-08.
  • Voice assistant that can be taught how to swear (Part 1)
    5 projects | dev.to | 8 Apr 2022
    To calculate the Word Error Rate I took a Python script from the tensorflow/lingvo project and rewrote it in JS. In essence, it is just a simple solution to the edit distance problem, plus an error count for each of the three types: deletion, insertion, and replacement. In the end, I did not use the most intelligent method of comparing texts, and yet it was sufficient to later add parameters to queries to Speech-to-Text.
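
The Word Error Rate calculation described in the post above can be sketched in Python as follows. This is a minimal illustration, not the lingvo script or the author's JS port: the names `WerResult` and `word_error_rate` are hypothetical, words are split on whitespace, and the comparison is case-sensitive. It fills a standard edit-distance table while tracking substitutions (the post calls them replacements), deletions, and insertions separately.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WerResult:
    """Hypothetical container for the per-type error counts."""
    substitutions: int
    deletions: int
    insertions: int
    ref_words: int

    @property
    def wer(self) -> float:
        errors = self.substitutions + self.deletions + self.insertions
        if self.ref_words == 0:
            # Assumption: an empty reference gives 0.0 only if nothing was inserted.
            return 0.0 if errors == 0 else float("inf")
        return errors / self.ref_words


def word_error_rate(reference: str, hypothesis: str) -> WerResult:
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = (total, subs, dels, ins) for aligning ref[:i] with hyp[:j].
    dp = [[(0, 0, 0, 0)] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(1, len(ref) + 1):
        dp[i][0] = (i, 0, i, 0)  # drop every reference word
    for j in range(1, len(hyp) + 1):
        dp[0][j] = (j, 0, 0, j)  # insert every hypothesis word
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            t, s, d, n = dp[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diagonal = (t, s, d, n)  # words match, no cost
            else:
                diagonal = (t + 1, s + 1, d, n)  # substitution
            t, s, d, n = dp[i - 1][j]
            deletion = (t + 1, s, d + 1, n)
            t, s, d, n = dp[i][j - 1]
            insertion = (t + 1, s, d, n + 1)
            # Tuples compare by total edits first, so min picks a cheapest path.
            dp[i][j] = min(diagonal, deletion, insertion)
    _, subs, dels, ins = dp[len(ref)][len(hyp)]
    return WerResult(subs, dels, ins, len(ref))
```

For the reference "the cat sat on the mat" and the hypothesis "the cat sit on mat", this sketch counts one substitution and one deletion, giving a WER of 2/6.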

What are some alternatives?

When comparing free-spoken-digit-dataset and lingvo you can also consider the following projects:

NSynth-MIDI-Renderer - Sample based concatenative synthesizer for the NSynth dataset. Render any MIDI (.mid) sequence with the notes of NSynth.

TTS-Voice-Wizard - Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

ESC-50 - ESC-50: Dataset for Environmental Sound Classification

seq2seq - A general-purpose encoder-decoder framework for Tensorflow

spokestack-python - Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

allosaurus - Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

dnn_from_scratch - A high-level deep learning library for Convolutional Neural Networks, GANs and more, made from scratch (numpy/cupy implementation).

awesome-speech-recognition-speech-synthesis-papers - Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Mava - 🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

deepspeech-playbook - A crash course for training speech recognition models using DeepSpeech.

pocketsphinx-python - Python interface to CMU Sphinxbase and Pocketsphinx libraries

spinorama - A library to display and compare spinorama (speakers measurements) graphs.