RealtimeSTT
RealtimeTTS
RealtimeSTT | RealtimeTTS | |
---|---|---|
3 | 1 | |
918 | 1,137 | |
- | - | |
8.1 | 8.9 | |
2 days ago | 8 days ago | |
Python | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
RealtimeSTT
-
Ask HN: Speech to text models, are they usable yet?
I have been using this with a lot of success for a while now: https://github.com/KoljaB/RealtimeSTT/tree/master , it works in real time, without any delays on an old Nvidia card.
I tried it with German & English without issues. It should also work for French but might need a bit of tweaking. The code is very straightforward, but depending on the context I'd recommend experimenting with the parameters that would suit you.
It's using a model called "Whisper" under the hood.
Have fun :)
- FLaNK Stack Weekly 16 October 2023
-
Realtime Library for Python
Demo: Video Code: Github
RealtimeTTS
What are some alternatives?
yolov7 - Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
fastkafka - FastKafka is a powerful and easy-to-use Python library for building asynchronous web services that interact with Kafka topics. Built on top of Pydantic, AIOKafka and AsyncAPI, FastKafka simplifies the process of writing producers and consumers for Kafka topics.
inference - A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
JsonGenius - Get structured JSON data from any page.
langstream - LangStream. Event-Driven Developer Platform for Building and Running LLM AI Apps. Powered by Kubernetes and Kafka.
Kouncil - Powerful dashboard for your Kafka. Monitor status, manage groups, topics, send messages and diagnose problems. All in one user friendly web dashboard.
llmware - Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
Wails - Create beautiful applications using Go
DeepSpeech - DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
kafka-ui - Open-Source Web UI for Apache Kafka Management