SpotifyTranscripts
realtime-transcription-playground
SpotifyTranscripts | realtime-transcription-playground | |
---|---|---|
1 | 2 | |
140 | 142 | |
- | - | |
6.3 | 1.0 | |
5 months ago | about 1 year ago | |
JavaScript | JavaScript | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SpotifyTranscripts
-
Show HN: PodText.ai – Search anything said on a podcast, Highlight text to play
This is great, I was working on something similar in the last few days, but since it is hard to cover every podcast, I stopped to think of a way to niche down. I feel your pain with GPU and scalability to transcript podcasts.
I was thinking of adding something like this for the UI https://github.com/johan-akerman/SpotifyTranscripts in case you find it useful.
Good luck! It is a really nice project.
realtime-transcription-playground
-
Best Practices for Streaming Speech Recognition / gRPC
-I was able to find this repo https://github.com/saharmor/realtime-transcription-playground/tree/main which uses web sockets instead, but this seems suboptimal/ not gRPC. Is this a viable approach?
- *Real-time* Transcription Playground for building speech2text apps in minutes (Python, React, GCP)
What are some alternatives?
Saveddit - Search and Filter through your Saved Reddit Posts
LedFx - LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
rapviz - 🔥🎤 See your bars broken down right in the browser. Powered by Spotify, Genius, and Railway.
gecko - Gecko - A Tool for Effective Annotation of Human Conversations
autocropper.io - API to automatically crop and output individual photos from multi-photo scans (deprecated)
react-transcript-editor - A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
threaddit - Threaddit is a full-stack Reddit clone; it's a comprehensive web application inspired by Reddit, built using Flask and its diverse libraries for the backend, combined with PostgreSQL for robust database management. The frontend is developed using React.js and its rich set of libraries, offering a seamless and dynamic user experience.
glaemscribe - Glaemscribe, the tolkienian languages/writings transcription engine.
modal-examples - Examples of programs built using Modal
disable-autogain-control-extension - A chrome extension which disables the automatic microphone gain control in the MediaStream Web API.
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding