audio_alignment
glancer
audio_alignment | glancer | |
---|---|---|
1 | 6 | |
11 | 154 | |
- | - | |
1.7 | 2.6 | |
12 months ago | 9 months ago | |
Python | Haskell | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
audio_alignment
-
YouTubeTranscript.com
This is a great idea; I had done something like this a couple of years ago for some specific set of videos (e.g. https://shreevatsa.net/tex/program/videos/s10/ — compare with https://youtubetranscript.com/?v=_0Cv1G_s4gQ for the same video), but never got around to making it general; glad someone has done it. It takes just a few lines of Javascript, using the Youtube API, to do this i.e. keeping the video and text in sync (just view source on either page to see the JS at the bottom).
Something like this can also help with audio recordings (generating the alignment automatically is called "forced alignment" and there are tools like "aeneas" for this). In case anyone's interested or wants to help (for Sanskrit texts): see https://github.com/shreevatsa/web-align-audio-text deployed at https://shreevatsa.net/ramayana/sarga/ and better version at https://github.com/avinashvarna/audio_alignment deployed at https://avinashvarna.github.io/audio_alignment/
glancer
- Now I Can Just Print That Video
-
YouTubeTranscript.com
I had a huge backlog of tech videos, so I wrote me this (also to play a bit with Haskell, the base idea can be replicated easily in any language though): https://github.com/rberenguel/glancer
-
Data is always something new to learn in the area, some new tool to try, some new insight to discover, — Ruben Berenguel
At work, most of my projects related to high volume data processing, with a focus on keeping costs low and performance high. Out of work, I like working on many different things, like custom productivity tools (like this one to save time watching technical presentations) or creating generative artwork.
- creates a static HTML page with regular shots from the video with the corresponding Youtube CC on the side
-
Show HN: 10x Your YouTube Productivity
I did a similar (in spirit, to save time) thing [1] to be able to skim on technical presentations: it creates a static HTML page with regular shots from the video with the corresponding Youtube CC on the side. It can help decide if a presentation is worth a watch (or just get the gist of one). Uses youtube-dl and ffmpeg under the hood.
[1]: https://github.com/rberenguel/glancer
-
Running external programs with haskell
I have some live (as in in code I use) examples here: https://github.com/rberenguel/glancer/blob/main/src/Process.hs#L36 (using readProcess and callProcess)
What are some alternatives?
gentle - gentle forced aligner
ChatGPT-YouTube-summarizer - This Chrome extension lets you summarize YouTube videos using the ChatGPT.
chatgpt_system_prompt - A collection of GPT system prompts and various prompt injection/leaking knowledge.
YouWhisper - Convert YouTube videos to text using openai/whisper
yt-whisper - Using OpenAI's Whisper to automatically generate YouTube subtitles
web-align-audio-text - Ramayana audio/text alignment website
ambuda - Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)
whisper.cpp - Port of OpenAI's Whisper model in C/C++