wenet vs basic-pitch

 | wenet | basic-pitch |
---|---|---|
Mentions | 5 | 8 |
Stars | 3,699 | 2,941 |
Growth | 1.6% | 2.4% |
Activity | 9.6 | 8.4 |
Last commit | 2 days ago | 3 days ago |
Language | Python | Python |
License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wenet
- Open Source Libraries
  wenet-e2e/wenet
- Deploying speech recognition models at scale
  Try wenet
- Ask HN: Are there any good open source Text-to-Speech tools?
  For STT, take a look at Wenet: https://github.com/wenet-e2e/wenet
  They provide support for running on a Raspberry Pi, and it runs in real time. I have tried the desktop version, and the quality is good enough when the audio is clean.
- Project Alice – an open source virtual assistant that can run offline
- Wenet results on Gigaspeech - on par with best results (Espnet). Pretrained model is available.
basic-pitch
- Open Source Libraries
  spotify/basic-pitch: Audio to MIDI converter
- Mac users: is it best to just rent a linux server?
  I did just get it to work by setting an alias for Python pointing to 3.11, but ran into this issue: https://github.com/spotify/basic-pitch/issues/63
- Transcribing music from audio?
  There's https://github.com/spotify/basic-pitch, a free converter, but it requires CLI usage.
- Recommended python library for converting audio file into midi?
  Might be worthwhile checking out Spotify’s basic pitch library.
- How to make a sub bass follow a bit from a mudpie?
  Quick add-on: definitely take a look at https://github.com/spotify/basic-pitch. It's a much, much better offline pitch detection algorithm that you could use to get the MIDI for your mudpie. I'd probably bounce a low-passed copy at around 150-200 Hz, making sure it's completely mono, and then feed that WAV/AIFF file to the pitch detector.
- Spotify Research Open-Sources ‘Basic Pitch’: A Machine Learning Tool For Converting Audio Into MIDI
- Spotify Introduces an Open-Source Tool to Fix a Big Problem for Modern Musicians - It's FOSS News
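The sub-bass tip in the mentions above (mix to mono, low-pass at roughly 150-200 Hz, then run pitch detection) can be sketched in plain Python. This is only an illustration under assumptions: the one-pole filter, the function names, and the 150 Hz cutoff are choices made here, not something the poster or basic-pitch prescribes, and a steeper filter or a DAW's own low-pass would serve the same purpose.

```python
import math


def to_mono(left, right):
    """Average two equal-length channels into a single mono channel."""
    return [(l + r) / 2.0 for l, r in zip(left, right)]


def one_pole_lowpass(samples, cutoff_hz, sample_rate):
    """Apply a first-order (one-pole, RC-style) low-pass filter.

    Assumption: a gentle 6 dB/octave roll-off is enough to illustrate
    the idea; it is not the filter the original poster used.
    """
    dt = 1.0 / sample_rate
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)  # smoothing factor between 0 and 1
    filtered = []
    prev = 0.0
    for x in samples:
        # Move the output a fraction alpha toward the new input sample.
        prev = prev + alpha * (x - prev)
        filtered.append(prev)
    return filtered


def rms(samples):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def sine(freq_hz, sample_rate, n_samples):
    """Generate n_samples of a unit-amplitude sine wave (test signal)."""
    step = 2.0 * math.pi * freq_hz / sample_rate
    return [math.sin(step * i) for i in range(n_samples)]


# Hypothetical stereo input: a 60 Hz bass note plus 5 kHz content.
sample_rate = 44100
n = 4410  # 0.1 seconds
left = [b + h for b, h in zip(sine(60, sample_rate, n), sine(5000, sample_rate, n))]
right = list(left)

mono = to_mono(left, right)
bass_only = one_pole_lowpass(mono, cutoff_hz=150, sample_rate=sample_rate)
```

The filtered samples would then be written to a WAV or AIFF file and handed to the pitch detector; that step is left out here because it depends on the audio library in use.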
What are some alternatives?
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
ai-music - A vanilla Transformer Decoder music generation model trained on Final Fantasy OST MIDI songs
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
THIRTY-DOLLAR-HAIRCUT-GENERATOR - 30 dollar haircut website MIDI converter - Using MIDIs, QUICKLY generate a chart for the "DON'T YOU LECTURE ME WITH YOUR THIRTY DOLLAR HAIRCUT" website. The site's by GDcolon, if you need to search it up.
FasterTransformer - Transformer related optimization, including BERT, GPT
gitpod - The developer platform for on-demand cloud development environments to create software faster and more securely.
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
concordia - Crowdsourcing platform for full text transcription and tagging. https://crowd.loc.gov
fstalign - An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
PiDTLN - Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi
functorch - functorch is a prototype of JAX-like composable function transforms for PyTorch. [Moved to: https://github.com/pytorch/functorch]
torchlambda - Lightweight tool to deploy PyTorch models to AWS Lambda