Our great sponsors
-
openWakeWord
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
whisper.el
Speech-to-Text interface for Emacs using OpenAI's whisper model and whisper.cpp as inference engine.
Good improvements for many languages, numbers here
https://github.com/openai/whisper/blob/main/language-breakdo...
https://github.com/dscripka/openWakeWord
Balancing wake reliability vs false wake activation is a tricky balance. OWW is decent but could certainly be better.
It's used with Home Assistant now so I expect the training data and implementation overall to get significantly better fairly soon.
I implemented a dummy real-time (tested on Mac M1) transcription approach with Whisper. You can find the project here: https://github.com/gaborvecsei/whisper-live-transcription
The idea was to provide transcription results as fast as you can, and you can refine it along the way by providing more and more context.
Related posts
- Einsum in 40 Lines of Python
- Show HN: Free GitHub Copilot CLI with your own model or API
- Show HN: Cognita – open-source RAG framework for modular applications
- Show HN: Data Bonsai: a Python package to clean your data with LLMs
- Ask HN: Seeking On-Premises Website Examples for Uptime Comparison Experiment