github-explorer
VaRA-Tool-Suite | github-explorer | |
---|---|---|
1 | 13 | |
13 | 133 | |
- | 3.0% | |
8.1 | 4.3 | |
6 days ago | 4 months ago | |
Python | HTML | |
BSD 2-clause "Simplified" License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
VaRA-Tool-Suite
-
Backdoor in upstream xz/liblzma leading to SSH server compromise
I tried to understand the significance of this (parent maybe implied that they reused a completely fictitious identity generated by some test code), and I think this is benign.
That project just includes some metadata about a bunch of sample projects, and it links directly to a mirror of the xz project itself:
https://github.com/se-sic/VaRA-Tool-Suite/blob/982bf9b9cbf64...
I assume it downloads the project, examines the git history, and the test then ensures that the correct author name and email addresses are recognized.
(that said, I haven't checked the rest of the project, so I don't know if the code from xz is then subsequently built, and or if this other project could use that in an unsafe manner)
github-explorer
-
Backdoor in upstream xz/liblzma leading to SSH server compromise
clickhouse has pretty good github_events dataset on their playground that folks can use to do some research - some info on the dataset https://ghe.clickhouse.tech/
Example of what user JiaT75 did so far:
https://play.clickhouse.com/play?user=play#U0VMRUNUICogRlJPT...
pull requests mentioning xz, 5.6 without downgrade, cve being mentioned in the last 60 days:
https://play.clickhouse.com/play?user=play#U0VMRUNUIGNyZWF0Z...
- Everything You Always Wanted to Know About GitHub (But Were Afraid to Ask)
-
Stargazers intersections for most popular GitHub projects in Venn diagrams
It shouldn’t be hard to implement: https://ghe.clickhouse.tech/#how-to-download-the-data
- GitHub Profile Achievements
-
Getting 10TB of GitHub Logs and Extracting Details of All Users and Repositories
The article leaves a bitter taste of unnecessary complexity. Data engineering should not be hard.
For example, you can load the GitHub Archive to ClickHouse, and it will be accessible with interactive real-time queries: https://ghe.clickhouse.tech/
See also https://til.simonwillison.net/clickhouse/github-explorer
-
Hundreds of millions of stars turned into a map of GitHub projects
I recommend checking https://ghe.clickhouse.tech/
It explains the full pipeline - how to download, collect, and analyze this sort of data.
- Everything you always wanted to know about GitHub (but were afraid to ask)
-
Cached Chrome Top Million Websites
Yes, it's continuously updated.
The source code is here: https://github.com/ClickHouse/github-explorer
This shell scripts updates it: https://github.com/ClickHouse/github-explorer/blob/main/upda...
What are some alternatives?
stencil-golang - Template repository for Golang applications
map-of-github - Inspirational Mapping
crux-top-lists - Downloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in Google BigQuery.
map-of-reddit - Interactive map of reddit
github-profile-trophy - 🏆 Add dynamically generated GitHub Stat Trophies on your readme
demo - A new issue is created in this repo every minute
hn-search - Hacker News Search
Comcast - Simulating shitty network connections so you can build better systems.
Anime-Girls-Holding-Programming-Books - Anime Girls Holding Programming Books
Anime-Girls-Holding-Programming-
ClickHouse - ClickHouse® is a real-time analytics DBMS
rust1 - rust1