gesetze-im-internet
mastodon-scraping


Metric | gesetze-im-internet | mastodon-scraping
---|---|---
Mentions | 1 | 1
Stars | 2 | 3
Growth | - | -
Activity | 0.0 | 0.0
Last commit | 6 days ago | 3 days ago
Language | Ruby |
License | - | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
gesetze-im-internet
-
Git scraping: track changes over time by scraping to a Git repository
https://github.com/jandinter/gesetze-im-internet
Parsing the legal acts with the tools you mention looks very interesting! Currently, I simply collect the published XML files whose structure is optimized for laying out the text and not so much for representing a structure of sections and subsections.
mastodon-scraping
-
Git scraping: track changes over time by scraping to a Git repository
Thanks for linking to the topic, that was interesting.
As a heads-up to anyone trying this stunt, please be mindful that git-diff is ultimately a line-oriented action (yeah, yeah, "git stores snapshots").
For example, https://github.com/pmc-ss/mastodon-scraping/commit/2a15ce1b2... is all :fu: because git basically sees that the "first line" changed.
However, had the author normalized instances.json with something like "jq -S", one would end up with a more reasonable 1736 textual changes, which GitHub would almost certainly have rendered.
diff -u \
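The normalization idea in the comment above can be sketched in Python. This is not the commenter's command: it swaps in the standard `json` module for `jq -S`, on the assumption that sorting keys and pretty-printing gives roughly the same effect. The function names `normalize_json` and `changed_lines` and the sample data are hypothetical, chosen only for illustration.

```python
import difflib
import json


def normalize_json(text: str) -> str:
    """Re-serialize JSON with sorted keys and one value per line."""
    data = json.loads(text)
    # sort_keys gives a stable key order; indent=2 spreads the document
    # over many lines so a line-based diff can isolate each change.
    return json.dumps(data, sort_keys=True, indent=2, ensure_ascii=False) + "\n"


def changed_lines(old: str, new: str) -> list[str]:
    """Return only the added/removed lines between two normalized documents."""
    diff = difflib.unified_diff(
        normalize_json(old).splitlines(),
        normalize_json(new).splitlines(),
        lineterm="",
        n=0,  # no context lines, only the changes themselves
    )
    # Drop the "--- " / "+++ " file headers and the "@@" hunk markers.
    return [
        line
        for line in diff
        if line.startswith(("+", "-")) and not line.startswith(("--- ", "+++ "))
    ]


if __name__ == "__main__":
    # Hypothetical sample data: both documents are single-line JSON, and the
    # second one also has its keys in a different order.
    old = '[{"name": "a.example", "users": 10}, {"name": "b.example", "users": 5}]'
    new = '[{"users": 11, "name": "a.example"}, {"name": "b.example", "users": 5}]'
    for line in changed_lines(old, new):
        print(line)
```

Run against the raw single-line files, a line-based diff would report the whole document as changed. After normalization, only the line holding the modified value shows up, and the key reordering produces no diff at all. In a scraper, one would presumably write the normalized text to disk before committing.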
What are some alternatives?
metrobus-timetrack-history - Tracking Metrobus location data
github-actions - Information and tips regarding GitHub Actions
bbcrss - Scrapes the headlines from BBC News indexes every five minutes
mcbroken-archive - :inbox_tray: Archive for data from mcbroken.com.
hun_law_rs - Tool for parsing Hungarian laws (Rust version)
Geo-IP-Database - Automatically updated tree-formatted database from MaxMind database
bchydro-outages - Track BCHydro Outages via Git history
hun_law_py - Tools for parsing Hungarian legal documents
torvenyek - Git repo of Hungarian laws

