Spoken-Keyword-Spotting
SincNet
Our great sponsors
Spoken-Keyword-Spotting | SincNet | |
---|---|---|
1 | 3 | |
80 | 1,097 | |
- | - | |
0.0 | 0.0 | |
over 1 year ago | almost 3 years ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Spoken-Keyword-Spotting
-
How to train large deep learning models as a startup
The search term you're looking for is "Keyword Spotting" - and that's what's implemented locally for ~embedded devices that sit and wait for something relevant to come along so that they know when to start sending data up to the mothership (or even turn on additional higher-power cores locally).
Here's an example repo that might be interesting (from initial impressions, though there are many more out there) : https://github.com/vineeths96/Spoken-Keyword-Spotting
SincNet
- Does this SincNet (neural architecture) contain a discriminator?
-
TypeError: layer_norm(): argument 'input' (position 1) must be Tensor, not SincNet.
the sincnet class is taken from here: https://github.com/mravanelli/SincNet/blob/master/dnn_models.py
-
[R][P] Announcing audax, a audio ML/DL framework in Jax
Code for https://arxiv.org/abs/1808.00158 found: https://github.com/mravanelli/SincNet
What are some alternatives?
pocketsphinx - A small speech recognizer
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
spokestack-python - Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
speechbrain - A PyTorch-based Speech Toolkit
svm-pytorch - Linear SVM with PyTorch
cnnimageretrieval-pytorch - CNN Image Retrieval in PyTorch: Training and evaluating CNNs for Image Retrieval in PyTorch
determined - Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
Image-Forgery-Detection-CNN - Image forgery detection using convolutional neural networks. Group 10's final project for TU Delft's course CS4180 Deep Learning 2019.
ruptures - ruptures: change point detection in Python
UHV-OTS-Speech - A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
stereo-image-generation - This repository contains code to generate stereo (Side by side) image from a single image.
whisper-timestamped - Multilingual Automatic Speech Recognition with word-level timestamps and confidence