testing-samples
kaldi-gstreamer-server
Our great sponsors
testing-samples | kaldi-gstreamer-server | |
---|---|---|
1 | 4 | |
9,105 | 1,054 | |
0.3% | - | |
4.9 | 0.0 | |
15 days ago | over 3 years ago | |
Java | Python | |
Apache License 2.0 | BSD 2-clause "Simplified" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
testing-samples
-
Ask HN: What problem are you close to solving and how can we help?
https://github.com/android/testing-samples/blob/main/ui/espr...
https://github.com/android/testing-samples
Alternatively, Squish [3] is a very polished and more elegant commercial testing tool that lets you record test-cases using a GUI tool and convert them into (ideally modularized) methods that verify object properties or compare (masked) screenshots of the GUI:
[3] https://www.froglogic.com/squish/features/
Demo video: https://youtu.be/ElH-3MVHPRw?t=864
They abstract away a lot of the functionality using the Gherkin [4] domain-specific language so that tests are easier to read at a high level (but you can still dig down into the underlying programmatic implementation).
[4] https://cucumber.io/docs/guides/overview/
This is probably too much complexity for your use-case, but may provide some ideas or inspiration for what is possible.
kaldi-gstreamer-server
- Real-time full-duplex speech recognition server, based on Kaldi and GStreamer
- Ask HN: What problem are you close to solving and how can we help?
-
Open Source ASR with user-specific custom vocabularies?
Through my research, the most promising real-time transcription options appear to be Vosk or Kaldi Gstreamer. I’ve set them both up & they appear to work well for general transcription, but I’m not sure how to handle the user-specific custom vocabularies.
-
Speech to text software
It is kind of difficult to find something like this free of charge (and open source) since the ASR service needs to be hosted somewhere. If you are really interested in the topic then you could take a lit into kaldi and its pretrained models (but kaldi is kind of difficult to learn so I don't really recommend it if you want something quick) and then you could also combine that with kaldi-gstreamer in order to set up a server which you can turn on and off whenever you like.
What are some alternatives?
Kaldi Speech Recognition Toolkit - kaldi-asr/kaldi is the official location of the Kaldi project.
espnet - End-to-End Speech Processing Toolkit
mtpng - A parallelized PNG encoder in Rust
vosk-server - WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
ZeroTier - A Smart Ethernet Switch for Earth
tailscale - The easiest, most secure way to use WireGuard and 2FA.
rhasspy - Offline private voice assistant for many human languages
Nebula - A scalable overlay networking tool with a focus on performance, simplicity and security
bert-for-inference - A small repo showing how to easily use BERT (or other transformers) for inference
map-generation
ChessPositionRanking - Software suite for ranking chess positions and accurately estimating the number of legal chess positions