| | glancer | gentle |
|---|---|---|
| Mentions | 6 | 12 |
| Stars | 154 | 1,385 |
| Growth | - | 0.9% |
| Activity | 2.6 | 2.3 |
| Last commit | 8 months ago | 9 days ago |
| Language | Haskell | Python |
| License | BSD 3-clause "New" or "Revised" License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
glancer
- Now I Can Just Print That Video
-
YouTubeTranscript.com
I had a huge backlog of tech videos, so I wrote me this (also to play a bit with Haskell, the base idea can be replicated easily in any language though): https://github.com/rberenguel/glancer
-
In data there is always something new to learn, some new tool to try, some new insight to discover (Ruben Berenguel)
At work, most of my projects relate to high-volume data processing, with a focus on keeping costs low and performance high. Outside of work, I like working on many different things, such as custom productivity tools (like this one to save time watching technical presentations) or creating generative artwork.
- creates a static HTML page with regular shots from the video with the corresponding YouTube CC on the side
-
Show HN: 10x Your YouTube Productivity
I did a similar thing [1] (similar in spirit, to save time) to be able to skim technical presentations: it creates a static HTML page with regular shots from the video with the corresponding YouTube CC on the side. It can help decide if a presentation is worth a watch (or just get the gist of one). Uses youtube-dl and ffmpeg under the hood.
[1]: https://github.com/rberenguel/glancer
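The idea described above (regular snapshots paired with the captions spoken around them) can be sketched in a few lines. This is not glancer's actual code, which is written in Haskell; it is a minimal Python illustration. The function names, the `shot_%04d.jpg` naming scheme, and the 30-second interval are assumptions made for the example, and the ffmpeg command is only built, never run. The pairing of snapshot number to time window is approximate.

```python
from html import escape


def ffmpeg_snapshot_command(video_path, out_dir, every_seconds=30):
    """Build (but do not run) an ffmpeg command saving one frame every N seconds."""
    return [
        "ffmpeg", "-i", video_path,
        "-vf", f"fps=1/{every_seconds}",   # fps filter: one output frame per interval
        f"{out_dir}/shot_%04d.jpg",        # hypothetical naming scheme
    ]


def group_captions(captions, every_seconds=30):
    """Bucket (start_seconds, text) captions by the snapshot interval they fall in."""
    buckets = {}
    for start, text in captions:
        index = int(start // every_seconds)
        buckets.setdefault(index, []).append(text)
    return buckets


def render_page(captions, every_seconds=30):
    """Render a table with one row per interval: snapshot on the left, captions on the right."""
    buckets = group_captions(captions, every_seconds)
    rows = []
    for index in sorted(buckets):
        image = f"shot_{index + 1:04d}.jpg"  # ffmpeg numbers output files from 1
        text = escape(" ".join(buckets[index]))
        rows.append(f'<tr><td><img src="{image}"></td><td>{text}</td></tr>')
    return "<table>\n" + "\n".join(rows) + "\n</table>"
```

Captions would come from the downloaded subtitle track; parsing that file is left out here.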
-
Running external programs with haskell
I have some live examples (as in, from code I actually use) here: https://github.com/rberenguel/glancer/blob/main/src/Process.hs#L36 (using readProcess and callProcess)
gentle
-
I'm looking for a way to automate an animation based on an audio file, so that it "flipbooks" a character's mouth just by flipping between a handful of frames based on the audio file's volume.
Gentle actually works on Linux, there's just no pre-built binary: https://github.com/lowerquality/gentle
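For the purely volume-driven variant asked about in the question, no alignment is needed at all. A rough sketch, assuming the audio has already been decoded into a list of float samples (decoding is not shown) and that the mouth images are ordered from closed to widest open. All names here are hypothetical.

```python
import math


def rms_per_frame(samples, sample_rate, fps):
    """RMS volume of the samples that fall inside each animation frame."""
    window = max(1, sample_rate // fps)
    volumes = []
    for start in range(0, len(samples), window):
        chunk = samples[start:start + window]
        volumes.append(math.sqrt(sum(s * s for s in chunk) / len(chunk)))
    return volumes


def mouth_frames(volumes, n_frames):
    """Map each volume to a mouth image index: 0 = closed, n_frames - 1 = widest."""
    peak = max(volumes, default=0.0)
    if peak == 0:
        return [0] * len(volumes)
    # Scale relative to the loudest frame and clamp to the last image.
    return [min(n_frames - 1, int(v / peak * n_frames)) for v in volumes]
```

Scaling against the loudest frame keeps the mapping independent of recording level; a fixed threshold table would work just as well if the recordings are consistent.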
-
Looking for a tool that can synthesis my own voice in text to speech.
I just found Gentle yesterday and it looks like it might be a free tool to do what you’re looking for: https://lowerquality.com/gentle/
-
YouTubeTranscript.com
Thank you!
Yes, exactly. We do forced alignment when you edit your transcript. The new words don't have any timestamps, so we need to align them. For short sections we use interpolation. If we need to align whole sections, we use Gentle[^1].
[^1]: https://github.com/lowerquality/gentle
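The interpolation step for short edits might look something like the following. This is a guess at the general technique, not YouTubeTranscript.com's implementation: words without timestamps are spread evenly between the nearest aligned neighbours. The dict layout (`word`, `start`, `end`) is an assumption for the example.

```python
def interpolate_timestamps(words):
    """Fill in missing start/end times by spreading words evenly between aligned neighbours.

    `words` is a list of dicts with "word" and, when aligned, "start"/"end" in seconds.
    """
    result = [dict(w) for w in words]  # work on copies, leave the input untouched
    i = 0
    while i < len(result):
        if "start" in result[i]:
            i += 1
            continue
        # Find the end of this run of unaligned words: result[i:j].
        j = i
        while j < len(result) and "start" not in result[j]:
            j += 1
        if i == 0 or j == len(result):
            raise ValueError("cannot interpolate without aligned words on both sides")
        left = result[i - 1]["end"]
        right = result[j]["start"]
        step = (right - left) / (j - i)
        for k in range(i, j):
            result[k]["start"] = left + (k - i) * step
            result[k]["end"] = left + (k - i + 1) * step
        i = j
    return result
```

Even spacing ignores word length, which is presumably why longer edits are handed to a forced aligner instead.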
-
[D] Voice recording to phonemes with timestamps? (Colab notebook, or...?)
Gentle also has a web interface, only for English. Uses DNN acoustic models.
-
Could I use modern voice-to-text tools to generate LIP files?
I found this old wiki page about the LIP file format used in Fallout 2: https://falloutmods.fandom.com/wiki/LIP_File_Format. FO2 != SSE, but it seems like a LIP file is primarily: what phonemes to use, and when. That could be pretty easily generated by a modern tool like Gentle (built on Kaldi). The trick would then be to transform Gentle's output to whatever Skyrim expects in a LIP.
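The "what phonemes, and when" part can be pulled out of Gentle's JSON with a small helper. The sketch below assumes the output layout as I understand it: a `words` list whose successfully aligned entries carry `case: "success"`, a `start` time, and a `phones` list of `{"phone": "ay_B", "duration": ...}` items. Check that against real output before relying on it. Writing an actual LIP file is not attempted here.

```python
def phoneme_timeline(gentle_result):
    """Flatten aligned words into (phoneme, start, end) tuples, times in seconds."""
    timeline = []
    for word in gentle_result.get("words", []):
        if word.get("case") != "success":
            continue  # word was not found in the audio, so it has no timing
        t = word["start"]
        for phone in word.get("phones", []):
            # Drop the position suffix (e.g. "ay_B" -> "ay").
            name = phone["phone"].split("_")[0]
            timeline.append((name, t, t + phone["duration"]))
            t += phone["duration"]
    return timeline
```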
-
The HTML Element
This is neat, and immediately made me think of the annotations that show up when you hit the play button on https://lowerquality.com/gentle/ , but it turns out those are made with absolutely-positioned divs and a lot of offline-precalculated px math.
-
Automatic lip-sync test, I love how easy it is to hack stuff like this together using the Python API.
More info: I used gentle to convert the audio (and a text document with what was said) into a list of phonemes and the times at which they were said. I then used a Python script inside Blender to convert this list into an animation by smoothly gliding to whichever shape key has the current phoneme in its name.
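The matching step described above can be sketched without Blender. This is not the poster's script: it assumes shape keys are named with an underscore-separated phoneme token (for example `mouth_ay`) and only produces a list of keyframes; applying them through Blender's Python API is left out. The names and the 24 fps default are hypothetical.

```python
def pick_shape_key(phoneme, shape_key_names):
    """Return the first shape key whose name contains the phoneme as a token, else None."""
    wanted = phoneme.lower()
    for name in shape_key_names:
        if wanted in name.lower().split("_"):
            return name
    return None


def keyframes(timeline, shape_key_names, fps=24):
    """Turn (phoneme, start, end) tuples into (frame, shape_key, value) keyframes.

    At each phoneme's start frame the matching key is set to 1.0 and all others
    to 0.0; the animation tool's own interpolation provides the smooth glide.
    """
    frames = []
    for phoneme, start, _end in timeline:
        active = pick_shape_key(phoneme, shape_key_names)
        frame = round(start * fps)
        for name in shape_key_names:
            frames.append((frame, name, 1.0 if name == active else 0.0))
    return frames
```

Matching on whole tokens rather than plain substrings avoids a short phoneme such as `s` accidentally hitting a key like `mouth_rest`.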
- Can anyone please translate this circular writing? Thanks!
-
Any software that can annotate (grapheme/phonogram) in a word with the matching phoneme?
Gentle by lowerquality didn't help? (https://github.com/lowerquality/gentle) It returns time aligned phoneme sequences for each word, like 'ice' -> 'ai': t0, 's': t1. I suppose it doesn't tell you which exact letters are paired, but it matches individual words with phonemes using a set vocabulary, the CMU one. (http://www.speech.cs.cmu.edu/tools/lextool.html)
-
Has anyone used aeneas or Festival TTS for word-level forced alignment? Struggling to get accurate results. Does Festival need to be installed?
We’ve had good results with Gentle for forced alignment. https://github.com/lowerquality/gentle
What are some alternatives?
audio_alignment - Align various Sanskrit texts and audio
ChatGPT-YouTube-summarizer - This Chrome extension lets you summarize YouTube videos using the ChatGPT.
web-align-audio-text - Ramayana audio/text alignment website
chatgpt_system_prompt - A collection of GPT system prompts and various prompt injection/leaking knowledge.
zeroth - Kaldi-based Korean ASR (한국어 음성인식) open-source project
YouWhisper - Convert YouTube videos to text using openai/whisper
yt-whisper - Using OpenAI's Whisper to automatically generate YouTube subtitles