megadetector-gui
SpeechLoop
Metric | megadetector-gui | SpeechLoop
---|---|---
Mentions | 3 | 6
Stars | 41 | 18
Growth | - | -
Activity | 0.0 | 0.0
Last commit | over 1 year ago | over 1 year ago
Language | JavaScript | Python
License | - | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
megadetector-gui
- Ask HN: What not-profit-seeking project are you tinkering with this week?
On (some) weekends I work on Megadetector GUI [0]. Megadetector [1] is an object detection model trained on millions of camera trap images and is widely used by conservationists. The issue is that it's rather technical to set up, so I made a GUI for it.
Currently working on a brand new version (not public just yet) that will use the latest MD version (v5 is way faster), better UI and most importantly GPU support out of the box.
[0] https://github.com/petargyurov/megadetector-gui
[1] https://github.com/microsoft/CameraTraps/blob/main/megadetec...
- Ask HN: Who Wants to Collaborate?
> ML to identify gorillas by their unique nose prints
Really cool stuff. I wonder if face detection is sufficient too? It has been proven to work for brown bears [0].
I have also been doing some open source work [1] to democratise object detection in this space but I haven't had the time to make improvements to the project in a while.
* [0] http://bearresearch.org/
* [1] https://github.com/petargyurov/megadetector-gui
- Kea parrots perform domain-general statistical inference
In case any camera trap folk are here, I'm currently volunteering with the NZ Department of Conservation to build an AI-assisted image sorting tool that speeds up the weeding out of empty images.
https://github.com/petargyurov/megadetector-gui
This is currently used to assist the conservation efforts of various endemic species such as the kakapo, kea and takahe. (The ML model used is not limited to just these species!)
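The "weeding out of empty images" step described above can be sketched roughly as follows. This is only an illustration, not the code used by megadetector-gui: it assumes detector results shaped like MegaDetector's batch output (an `images` list whose entries carry a `file` name and a `detections` list with a `conf` score each), and the function name `split_empty_images` and the `0.2` threshold are hypothetical choices.

```python
from typing import Dict, List, Tuple


def split_empty_images(results: Dict, threshold: float = 0.2) -> Tuple[List[str], List[str]]:
    """Split image files into (empty, non_empty) by their best detection confidence.

    Assumption: `results` resembles MegaDetector batch output, i.e.
    {"images": [{"file": ..., "detections": [{"conf": ...}, ...]}, ...]}.
    The default threshold is an arbitrary illustrative value.
    """
    empty: List[str] = []
    non_empty: List[str] = []
    for image in results.get("images", []):
        # A failed or skipped image may have no detections list at all.
        detections = image.get("detections") or []
        best_conf = max((d["conf"] for d in detections), default=0.0)
        if best_conf >= threshold:
            non_empty.append(image["file"])
        else:
            empty.append(image["file"])
    return empty, non_empty


if __name__ == "__main__":
    # Hypothetical sample data for illustration only.
    sample = {
        "images": [
            {"file": "a.jpg", "detections": [{"category": "1", "conf": 0.91}]},
            {"file": "b.jpg", "detections": []},
        ]
    }
    empty, non_empty = split_empty_images(sample)
    print("empty:", empty)
    print("non-empty:", non_empty)
```

A lower threshold keeps more borderline images for human review; a higher one discards more aggressively, so the value would need tuning per camera-trap dataset.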
SpeechLoop
- Ask HN: Offline, Embeddable Speech Recognition?
- Ask HN: Who Wants to Collaborate?
I created a toolkit to evaluate many different speech recognition engines.
https://github.com/robmsmt/SpeechLoop
Comparing speech systems can take a long time, especially for a dev who doesn't have a background in audio/ML. How do you know which one will work best? Will the shiny new transformer model perform well enough? Most end up throwing their data at one of the big tech companies' existing APIs. Whilst this is convenient, I think it's a travesty that open-source speech systems are not as easy to use. I was hoping to change that by making it easy to evaluate and compare them!
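A common way to compare speech recognition engines of the kind described above is word error rate (WER): the word-level edit distance between a reference transcript and an engine's output, divided by the number of reference words. The sketch below is not SpeechLoop's API; the function names `word_error_rate` and `rank_engines`, the engine names, and the sample transcripts are all hypothetical.

```python
from typing import Dict, List, Tuple


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def rank_engines(reference: str, transcripts: Dict[str, str]) -> List[Tuple[str, float]]:
    """Return (engine, WER) pairs sorted from lowest to highest error."""
    scores = {name: word_error_rate(reference, text) for name, text in transcripts.items()}
    return sorted(scores.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical engine outputs for a single utterance.
    reference = "the cat sat on the mat"
    outputs = {
        "engine_a": "the bat sat on mat",
        "engine_b": "the cat sat on the mat",
    }
    for name, wer in rank_engines(reference, outputs):
        print(f"{name}: WER = {wer:.2f}")
```

In practice WER would be averaged over many utterances from the target domain, since a single sentence says little about how an engine behaves on real data.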
- Introducing Speechloop, answering the question, what is the best ASR for me?
Check out: https://github.com/robmsmt/SpeechLoop looking for feedback on:
- Introducing Speechloop, answering the question, what is the best ASR?
- Introducing SpeechLoop
- I made a Speech Recognition library designed to answer the question, what is the best ASR?
I made this to make it easy[ish]... check: https://github.com/robmsmt/SpeechLoop
What are some alternatives?
futurecoder - 100% free and interactive Python course for beginners
allosaurus - Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
HPI - Human Programming Interface 🧑👽🤖
pocketsphinx-python - Python interface to CMU Sphinxbase and Pocketsphinx libraries
mnm - mnm implements TMTP protocol. Let Internet sites message members directly, instead of unreliable, insecure email. Contributors welcome! (Server)
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
schemats - A postgres & mysql -> typescript interface generator
vox - Vox language compiler. AOT / JIT / Linker. Zero dependencies
CameraTraps - PyTorch Wildlife: a Collaborative Deep Learning Framework for Conservation.
praat - Praat: Doing Phonetics By Computer
readable-thrift - Human-friendly Thrift encoder/decoder
Porcupine - On-device wake word detection powered by deep learning