Top 5 Python wayback-machine Projects
-
ArchiveBox
Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
wayback-machine-scraper
A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
-
Discord-Recon
A Discord bot that automates bug bounty recon, scans, and information gathering through a Discord server.
-
hn_bot
A bot that scans Hacker News' top articles for links to sites that require a subscription, then finds and posts links to archived versions of those articles.
Project mention: Ask HN: What Underrated Open Source Project Deserves More Recognition? | news.ycombinator.com | 2024-03-07
Two projects I greatly appreciate, allowing me to easily archive my bandcamp and GOG purchases (after the initial setup anyways):
https://github.com/easlice/bandcamp-downloader
https://github.com/Kalanyr/gogrepoc
And I recently learned about archivebox, which I think is going to be a fast favorite and finally let me clear out my mess of tabs/bookmarks: https://github.com/ArchiveBox/ArchiveBox
I ended up using the waybackpy Python module to retrieve archived URLs, and it worked well. I think the feature you want for this is "snapshots", but I didn't test this myself.
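For readers who want to see roughly what retrieving archived URLs involves, here is a minimal standard-library sketch. It does not use waybackpy and is not that library's API. It assumes the Wayback Machine's public CDX server endpoint with JSON output, where the first row is a header naming the columns. The helper names `build_cdx_query` and `snapshot_urls` are hypothetical, and the sample row is made-up placeholder data rather than a real capture.

```python
from urllib.parse import urlencode

# Assumed endpoint: the Wayback Machine's public CDX server.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(url, limit=5):
    """Build a CDX query URL that asks for up to `limit` captures as JSON."""
    params = {"url": url, "output": "json", "limit": limit}
    return CDX_ENDPOINT + "?" + urlencode(params)


def snapshot_urls(rows):
    """Turn CDX JSON rows (header row first) into Wayback snapshot URLs."""
    if not rows:
        return []
    header, *records = rows
    ts = header.index("timestamp")   # 14-digit capture time
    orig = header.index("original")  # URL as originally captured
    return [f"https://web.archive.org/web/{r[ts]}/{r[orig]}" for r in records]


# Made-up rows in the shape of a CDX JSON response, for illustration only.
sample_rows = [
    ["urlkey", "timestamp", "original"],
    ["com,example)/", "20200101000000", "http://example.com/"],
]
sample_snapshots = snapshot_urls(sample_rows)
```

In practice the query URL would be fetched (for example with `urllib.request.urlopen`) and the parsed JSON passed to `snapshot_urls`. The sketch leaves the network call out so that it runs offline.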
Project mention: wayback-machine-scraper: NEW Data - star count:380.0 | /r/algoprojects | 2023-12-10
Python wayback-machine related posts
- Ask HN: How can I back up an old vBulletin forum without admin access?
- wayback-machine-scraper: NEW Data - star count:380.0
- Best practices for archiving websites
- download all captures of a page in archive.org
- If we lose the Internet Archive, we're screwed
- End-of-Availability notice for legacy DSM, Surveillance Station, SRM, and more
- A self-hosted archiving service integrated with Internet Archive, archive.today, IPFS and beyond.
Index
What are some of the best open-source wayback-machine projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | ArchiveBox | 19,737 |
2 | waybackpy | 405 |
3 | wayback-machine-scraper | 405 |
4 | Discord-Recon | 68 |
5 | hn_bot | 21 |