A PyTorch-based Speech Toolkit (by speechbrain)

Speechbrain Alternatives

Similar projects and alternatives to speechbrain

  • GitHub repo AugLy

    A data augmentations library for audio, image, text, and video.

  • GitHub repo imgaug

    Image augmentation for machine learning experiments.

  • GitHub repo best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

  • GitHub repo Resemblyzer

    A Python package to analyze and compare voices with deep learning

  • GitHub repo pyannote-audio

    Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

NOTE: The number of mentions on this list counts the posts in which a project is mentioned together with speechbrain. A higher number therefore suggests a better speechbrain alternative or a more similar project.


Posts where speechbrain has been mentioned. We have used some of these posts to build our list of alternatives and similar projects; the most recent one was on 2021-06-19.
  • [N] Facebook AI Open Sources AugLy: A New Python Library For Data Augmentation To Develop Robust Machine Learning Models
    We have a bunch of recipes with examples of using SpecAugment / speed perturbation / room impulse response corruption (see e.g. https://github.com/speechbrain/speechbrain/blob/develop/recipes/LibriSpeech/ASR/seq2seq/train.py )
  • Issue with dataset download
    reddit.com/r/datasets | 2021-05-07
    Can't find automated scripts to download datasets. (I tried running a script, but couldn't understand the SSL error.)
  • Are there any speech recognition modules so I can write one and do not have to rely on google and the likes?
  • [D] state of art for Speaker Diarization?
    It may be worth checking SpeechBrain out as well which was recently released. Has some pre-trained models, it might give you a reasonable baseline to start with but haven't used it personally.
  • SpeechBrain Toolkit – a plurality of pretrained models and useful audio tools
    news.ycombinator.com | 2021-03-22
  • A PyTorch-based speech toolkit
    news.ycombinator.com | 2021-03-16
  • [N] SpeechBrain Public Release
    SpeechBrain is an open-source toolkit designed to make research and development of speech and audio technologies faster. It is flexible, modular, easy to use, and well documented.
  • [R] SpeechBrain is out. A PyTorch Speech Toolkit.
    SpeechBrain currently supports speech recognition, speaker recognition, verification and diarization, spoken language understanding, speech enhancement, speech separation and multi-microphone signal processing. For all these tasks we have competitive or state-of-the-art performance (see https://github.com/speechbrain/speechbrain).
    That sounds wacky! The beam search (e.g. https://github.com/speechbrain/speechbrain/blob/5782510f81606ae99c02cfd48d1b40ef493d8f3c/speechbrain/decoders/seq2seq.py#L253) can be set to return multiple hypotheses, so you could maybe do that, compute an alignment between two hypotheses using our edit distance utils, and find substitutions (gum/gun), or something like that?
  • SpeechBrain: A PyTorch Powered Speech Toolkit
    news.ycombinator.com | 2021-03-15
  • [Q] About speaker diarization
  • speechbrain/speechbrain finally on github
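
One of the posts above suggests aligning two beam-search hypotheses with an edit distance and reading off the substitutions (gum/gun). A minimal sketch of that idea in plain Python follows. It does not use SpeechBrain's own edit distance utilities or its beam search; the function names `align` and `substitutions` are hypothetical, and the two example hypotheses are made up for illustration.

```python
def align(ref, hyp):
    """Levenshtein-align two token lists.

    Returns a list of (op, ref_token, hyp_token) tuples, where op is
    "=" (match), "S" (substitution), "D" (deletion) or "I" (insertion).
    Hypothetical helper, not part of SpeechBrain.
    """
    n, m = len(ref), len(hyp)
    # dist[i][j] = edit distance between ref[:i] and hyp[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i
    for j in range(1, m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j - 1] + cost,  # match or substitution
                dist[i - 1][j] + 1,         # deletion
                dist[i][j - 1] + 1,         # insertion
            )

    # Walk back from the bottom-right corner to recover one optimal alignment.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            if dist[i][j] == dist[i - 1][j - 1] + cost:
                ops.append(("=" if cost == 0 else "S", ref[i - 1], hyp[j - 1]))
                i, j = i - 1, j - 1
                continue
        if i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            ops.append(("D", ref[i - 1], None))
            i -= 1
        else:
            ops.append(("I", None, hyp[j - 1]))
            j -= 1
    ops.reverse()
    return ops


def substitutions(hyp_a, hyp_b):
    """Return the word pairs that differ by substitution between two hypotheses."""
    return [(a, b) for op, a, b in align(hyp_a.split(), hyp_b.split()) if op == "S"]


if __name__ == "__main__":
    # Two made-up hypotheses, standing in for the top-2 outputs of a beam search.
    print(substitutions("i chew gum every day", "i chew gun every day"))
```

In practice the two strings would come from a decoder configured to return several hypotheses; how to obtain them from SpeechBrain is left to the linked source.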


Basic speechbrain repo stats
4 days ago

speechbrain/speechbrain is an open source project licensed under the Apache License 2.0, which is an OSI-approved license.