wayback-machine-downloader
wayback
Metric | wayback-machine-downloader | wayback
---|---|---
Mentions | 48 | 22
Stars | 5,053 | 1,643
Growth | - | 1.7%
Activity | 0.0 | 6.4
Last commit | 3 months ago | 5 days ago
Language | Ruby | Go
License | GNU General Public License v3.0 or later | GNU General Public License v3.0 only
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wayback-machine-downloader
- Ask HN: Cool Useful GitHub Repos?
I just found this https://github.com/hartator/wayback-machine-downloader
anyone have anything similarly interesting/cool/niche-useful ?
- ArchiveTeam is saving Blogger from Google deletion
Send ArchiveTeam the link on IRC or here and we can save it to archive.org, then later you can use wayback-machine-downloader to grab it from archive.org.
https://github.com/hartator/wayback-machine-downloader
- My TikTok was Hacked & Deleted and I GOT IT BACK!
This is where it gets tricky: you need to download the code from the Wayback Machine, and he was able to do that by following these steps: https://github.com/hartator/wayback-machine-downloader
- Is there a way to quick download twitter images on the wayback machine?
Not sure if it will work for twitter, but I have used wayback-machine-downloader to batch download stuff.
- Forgot to backup my WordPress files before I swapped webhosting provider, am I screwed?
In addition to archive.org, there is a GitHub repo for fetching website data. You can give it a try too. Here is the repo link
- Can I please get help downloading and saving a website for offline use?
-
Hey guys, looks like we have a potential hacker on our hands. All of our company's files were deleted from our FTP. :( Is there any way we can get a cache of our website and restore everything? Any help or advice would be greatly appreciated. Thanks in advance!
Edit: Good news! I found a solution that saved me. I was able to download the full website (including images, JS, and CSS files) using this tool: https://github.com/hartator/wayback-machine-downloader
-
Hey guys, so a potential hacker managed to delete all of our company's files from our FTP. Yikes! Is there a way to retrieve a cache of our website and restore it? Any advice or tips would be greatly appreciated. Thanks in advance!
Edit: Thank you to everyone who suggested the Wayback Machine Downloader! It saved the day and allowed me to download the full website, including images, JS, and CSS files.
- Have a lengthy flight: how to seamlessly mirror couple websites
I've used https://github.com/hartator/wayback-machine-downloader but it sometimes messes up CSS badly
- what Do YOU Recommend?
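Several of the posts above describe batch-downloading archived copies of a site from the Wayback Machine. As a rough sketch of what such a batch download involves, the Python snippet below builds a query against the Wayback Machine's public CDX index to list captures of a site, and builds the URL of a raw capture using the `id_` modifier. This is an illustration only, not wayback-machine-downloader's actual code; the function names `build_cdx_query` and `snapshot_url` are hypothetical, and the chosen query parameters are one reasonable set among several.

```python
from urllib.parse import urlencode

# Public CDX index endpoint of the Wayback Machine.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(site: str) -> str:
    """Return a CDX query URL listing successful captures under `site`.

    Hypothetical helper: the parameter choices below are assumptions
    made for this sketch.
    """
    params = {
        "url": f"{site}/*",           # every path under the site
        "output": "json",             # JSON rows instead of plain text
        "fl": "timestamp,original",   # only the fields needed to fetch a capture
        "filter": "statuscode:200",   # skip redirects and errors
        "collapse": "digest",         # drop adjacent captures with identical content
    }
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def snapshot_url(timestamp: str, original: str) -> str:
    """Return the URL of a capture as originally archived.

    The `id_` suffix after the timestamp asks for the stored content
    without the Wayback Machine's toolbar or link rewriting.
    """
    return f"https://web.archive.org/web/{timestamp}id_/{original}"


if __name__ == "__main__":
    print(build_cdx_query("example.com"))
    print(snapshot_url("20200101000000", "http://example.com/"))
```

Fetching the first URL returns the list of captures; each `timestamp` and `original` pair can then be passed to `snapshot_url` and downloaded in turn. The sketch deliberately stops short of making network requests, so rate limiting, retries, and writing files to disk are left out.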
wayback
- If we lose the Internet Archive, we’re screwed
I wish there was an alternative to the Internet Archive with collaborative curation. You share files and people who tag and sort them into albums can download them. And if it was federated it could be just as extensive as the Internet Archive by searching files on many instances at the same time. Sadly the closest things are ArchiveBox and wayback, which won't replace the Internet Archive.
- End-of-Availability notice for legacy DSM, Surveillance Station, SRM, and more
There's also wayback for remote and local archiving https://github.com/wabarc/wayback
- Archiving Web Pages Using XMPP
Web archiving just got more convenient! Wayback now supports XMPP. This means you can receive archived web pages via XMPP messages, making it even easier to access historical versions of web pages. Give it a try and let us know what you think!
- Wayback: Self-hosted archiving service integrated with Internet Archive
- A self-hosted archiving service integrated with Internet Archive, archive.today, IPFS and beyond.
- GitHub - wabarc/wayback: A self-hosted archiving service integrated with Internet Archive, archive.today, IPFS and beyond.
What are some alternatives?
savepagenow - A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
telego - Telegram Bot API library for Go
warrick - Recover lost websites from the Web Infrastructure
go-twitch-irc - go irc client for twitch.tv
neocities - Neocities.org - the web site. The entire thing. Yep, we're completely open source.
olivia - 💁‍♀️ Your new best friend powered by an artificial neural network
Hexo - A fast, simple & powerful blog framework, powered by Node.js.
go-joe - A general-purpose bot library inspired by Hubot but written in Go. :robot:
wayback-machine-spn-scripts - Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now
slackscot - Slack bot core/framework written in Go with support for reactions to message updates/deletes
gba-remote-play - 📡 Stream Raspberry Pi games to a GBA via Link Cable.
larry - Larry 🐦 is a bot generator that post content from different providers to one or multiple publishers