rosie
alass
Our great sponsors
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
rosie
-
I created a library for parsing text in Kotlin. Better than regular expressions. What do you think?
I think you'd be better served to try to port Rosie Pattern Language (https://gitlab.com/rosie-pattern-language/rosie) to Kotlin than to try to roll your own. There are definite corner cases in RegEx (sounds like you've already hit some) where the asympototic performance is so large that the code is practically unrunnable. Rosie addresses several of those cases.
-
Is the regex crate a bottleneck in your program? If so, can you share the details?
I had to spend a lot of time clicking through links to finally find an example: https://gitlab.com/rosie-pattern-language/rosie/blob/master/rpl/date.rpl
-
Ask HN: What small library or tool do you want that doesn’t exist?
Something could be based on the "Rosie Pattern Language"[0]. There is already a parser for en_US/en_EU dates[1], which should be simple to extend to date ranges.
[0]: https://gitlab.com/rosie-pattern-language/rosie/-/blob/maste...
[1]: https://gitlab.com/rosie-pattern-language/rosie/blob/master/...
-
Parsing Tools
Maybe something like Rosie?
alass
- Automatic Language-Agnostic Subtitle Synchronization
-
Which software is suitable for achieving my goal?
Try alass or ffsubsync.
-
How to condense\shorten the timing of all the lines synchronously if the subtitles end too late?
I use https://github.com/kaegi/alass to retime subtitles automatically. I've never worked with a file that has the issue you're describing but it does claim to fix frame rate issues.
- Behind the curtain: what it feels like to work in AI
-
How can I replicate the features of Animelon with my own media?
You need an english softsubbed source, JP subs (kitsuneko), sync them with https://github.com/kaegi/alass, then set up POTPlayer for multiple subtitle files.
-
How to get perfect subtitles – Guide
Check this project here: https://github.com/kaegi/alass
-
Is it normal for Japanese streaming services to not offer subtitles
If you want subtitles for anime, kitsunekko usually has subtitles (and Netflix also usually has native subtitles and you can download them using subadub). The downside with using subtitles from a different source is you need to re-time them but tools like alass make this basically painless in most cases. For movies, Netflix or the DVD version of movies usually have subtitles.
- alass: Automatic Language-Agnostic Subtitle Synchronization
-
Ask HN: What small library or tool do you want that doesn’t exist?
I've used https://github.com/kaegi/alass and sometimes it works, sometimes it doesn't. It's not an easy problem to address automatically...
Video players wouldn't want that in their codebase, maybe if it was available as a C library.
-
AI could end foreign-language subtitles
With tools like alass[1] (using it to synchronise against the original language subtitles) it is about as close to solved you can get.
All of the attempts I've seen of using audio information to synchronise subtitles have been awful (some languages subtitle everything, even screams and incoherent shouts -- such as Japanese -- while others only subtitle dialogue and often rework dialogue for the purposes of making the subtitles short enough to be readable easily).
What are some alternatives?
instaparse
ffsubsync - Automagically synchronize subtitles with video.
langs
subaligner - Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
elder_launcher - A Launcher focused on simplicity and legibility.
bazarr - Bazarr is a companion application to Sonarr and Radarr. It manages and downloads subtitles based on your requirements. You define your preferences by TV show or movie and Bazarr takes care of everything for you.
snapdrop - A Progressive Web App for local file sharing
Sushi - Automatic subtitle shifter based on audio
once - Collect and deduplicate stories (RSS, Hacker News, Lobsters or Reddit) and look at them once.
aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
KotlinSpirit - Lightweight library for parsing text in Kotlin, inspired by C++ boost spirit library.
hora - 🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀 .