wayback-machine-downloader
waybackpack
Our great sponsors
wayback-machine-downloader | waybackpack | |
---|---|---|
48 | 6 | |
5,045 | 2,767 | |
- | - | |
0.0 | 7.0 | |
3 months ago | 3 months ago | |
Ruby | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wayback-machine-downloader
-
Ask HN: Cool Useful GitHub Repos?
I just found this https://github.com/hartator/wayback-machine-downloader
anyone have anything similarly interesting/cool/niche-useful ?
-
ArchiveTeam is saving Blogger from Google deletion
Send ArchiveTeam the link on IRC or here and we can save it to archive.org, then later you can use wayback-machine-downloader to grab it from archive.org.
https://github.com/hartator/wayback-machine-downloader
-
My TikTok was Hacked & Deleted and I GOT IT BACK!
This is where it gets tricky, you need to download the code from the wayback machine and he was able to do that by following these steps: https://github.com/hartator/wayback-machine-downloader
-
Is there a way to quick download twitter images on the wayback machine?
Not sure if it will work for twitter, but I have used wayback-machine-downloader to batch download stuff.
-
Forgot to backup my WordPress files before I swapped webhosting provider, am I screwed?
Adding to archive.org, there is a github repo to fetch website data. You can give a try too. Here is the repo link
- Can I please get help downloading and saving a website for offline use?
-
Hey guys, looks like we have a potential hacker on our hands. All of our company's files were deleted from our FTP. :( Is there any way we can get a cache of our website and restore everything? Any help or advice would be greatly appreciated. Thanks in advance!
Edit: Good news! I found a solution that saved me. I was able to download the full website (including images, JS, and CSS files) using this tool: https://github.com/hartator/wayback-machine-downloader
-
Hey guys, so a potential hacker managed to delete all of our company's files from our FTP. Yikes! Is there a way to retrieve a cache of our website and restore it? Any advice or tips would be greatly appreciated. Thanks in advance!
Edit: Thank you to everyone who suggested the Wayback Machine Downloader! It saved the day and allowed me to download the full website, including images, JS, and CSS files.
-
Have a lengthy flight: how to seamlessly mirror couple websites
I've used https://github.com/hartator/wayback-machine-downloader but it sometimes messes up CSS badly
- what Do YOU Recommend?
waybackpack
-
People who've received a black bar on Hacker News
Thank you! But the script but the only thing that really deserves credit is Jeremy Singer-Vine's https://github.com/jsvine/waybackpack library. Pretty much made this a very straightforward task
- Upgrading from Debian Jessie to Bullseye after nearly 30 years
- Setting up a Deadsy Wiki site! (HELP WANTED)
-
Need help copying a website from the "wayback machine" iternet archive
https://github.com/hartator/wayback-machine-downloader https://github.com/jsvine/waybackpack or google "wayback machine downloader" for other options
-
Wayback Machine Downloader – Download an Entire Website from the Wayback Machine
Which paid services are you referring to? It is likely that these services aren't distributing the projects they are based on, if so, then they are in compliance with the licenses of the open source projects, which don't require attribution unless you distribute them.
This project started in 2015 btw. Another similar project called waybackpack started in 2016. There are probably more projects. IMO wayback-machine-downloader is the better project though.
https://github.com/jsvine/waybackpack
The Wayback CDX Server API these projects are based on is quite simple to use btw, just some JSON responses to decode.
https://archive.org/help/wayback_api.php
- Is there a way to scrape video links off a youtube channel and see if any of the links are archived on web.archive.org? without pasting links one by one
What are some alternatives?
savepagenow - A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
wayback-machine-spn-scripts - Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now
warrick - Recover lost websites from the Web Infrastructure
wayback - IA's public Wayback Machine (moved from SourceForge)
neocities - Neocities.org - the web site. The entire thing. Yep, we're completely open source.
Hexo - A fast, simple & powerful blog framework, powered by Node.js.
docker-http-https-echo - Docker image that echoes request data as JSON; listens on HTTP/S, useful for debugging.
gba-remote-play - 📡 Stream Raspberry Pi games to a GBA via Link Cable.
hacker-news-undocumented - Some of the hidden norms about Hacker News not otherwise covered in the Guidelines and the FAQ.