| RealtimeSTT | FLaNK-Halifax |
---|---|---|
Mentions | 3 | 14 |
Stars | 918 | 1 |
Growth | - | - |
Activity | 8.1 | 7.3 |
Last commit | 2 days ago | 6 months ago |
Language | Python | TypeScript |
License | MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
RealtimeSTT
- Ask HN: Speech to text models, are they usable yet?
I have been using this with a lot of success for a while now: https://github.com/KoljaB/RealtimeSTT/tree/master. It works in real time, without any delays, on an old Nvidia card.
I tried it with German & English without issues. It should also work for French but might need a bit of tweaking. The code is very straightforward, but depending on the context I'd recommend experimenting with the parameters that would suit you.
It's using a model called "Whisper" under the hood.
Have fun :)
- FLaNK Stack Weekly 16 October 2023
- Realtime Library for Python
Demo: Video Code: Github
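As a rough illustration of the parameter experimentation recommended in the Ask HN comment above, here is a minimal sketch that uses RealtimeSTT's `AudioToTextRecorder`. The helper `recorder_settings`, the `LANGUAGE_CODES` table, and the chosen values (model size `tiny`, a 0.4 second silence window) are assumptions for illustration, not settings taken from the comment. Running `main()` requires the `RealtimeSTT` package and a working microphone.

```python
# Sketch only: the values below are illustrative assumptions, not tuned settings.

# Whisper language codes for the languages mentioned in the comment.
LANGUAGE_CODES = {"german": "de", "english": "en", "french": "fr"}


def recorder_settings(language: str, model: str = "tiny") -> dict:
    """Build keyword arguments for AudioToTextRecorder (hypothetical helper)."""
    code = LANGUAGE_CODES[language.lower()]
    return {
        "model": model,                        # Whisper model size to load
        "language": code,                      # fix the language instead of auto-detecting
        "post_speech_silence_duration": 0.4,   # seconds of silence that end a recording
    }


def main() -> None:
    # Imported here so the helper above can be used without the package installed.
    from RealtimeSTT import AudioToTextRecorder

    with AudioToTextRecorder(**recorder_settings("german")) as recorder:
        print("Speak now...")
        # text() blocks until one utterance has been recorded and transcribed.
        print(recorder.text())


if __name__ == "__main__":
    main()
```

To try French, as the comment suggests, one would pass `"french"` to `recorder_settings` and then adjust the model size or silence window as needed.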
FLaNK-Halifax
- FLaNK Stack Weekly for 27 November 2023
- FLaNK Stack Weekly for 20 Nov 2023
- FLaNK Stack Weekly for 13 November 2023
- FLaNK Stack Weekly 06 Nov 2023
- FLaNK Stack Weekly for 30 Oct 2023
- FLaNK Stack Weekly 23 Oct 2023
- FLaNK Stack Weekly 16 October 2023
- FLaNK Stack Weekly 09 Oct 2023
- FLaNK Stack Weekly 2 October 2023
- FLaNK Stack for 25 September 2023
What are some alternatives?
RealtimeTTS - Converts text to speech in realtime
rivet - The open-source visual AI programming environment and TypeScript library
yolov7 - Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
SeaGOAT - local-first semantic code search engine
inference - A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
flink-cdc - Flink CDC is a streaming data integration tool
JsonGenius - Get structured JSON data from any page.
vimGPT - Browse the web with GPT-4V and Vimium
Kouncil - Powerful dashboard for your Kafka. Monitor status, manage groups, topics, send messages and diagnose problems. All in one user friendly web dashboard.
CML_AMP_Intelligent-QA-Chatbot-with-NiFi-Pinecone-and-Llama2 - The prototype deploys an Application in CML using a Llama2 model from Hugging Face to answer questions augmented with knowledge extracted from the website. This prototype introduces Pinecone as a database for storing vectors for semantic search.
llmware - Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
co-tracker - CoTracker is a model for tracking any point (pixel) on a video.