Metric | wayback-machine-spn-scripts | wayback-machine-downloader
---|---|---
Mentions | 8 | 48
Stars | 92 | 5,053
Growth | - | -
Activity | 1.6 | 0.0
Last commit | 7 days ago | 3 months ago
Language | Shell | Ruby
License | MIT License | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wayback-machine-spn-scripts
- Preserving Parliamentary Proposed Bills to Wayback Machine
  I created this script, which scrapes a list of all currently proposed bill URLs, all PDFs of those bills, and the pages that list them. It then runs this script by /u/overcast07, which goes through each of those URLs and backs them up.
- Wayback machine - Schedule automatic backups - Part 2
  I discovered that The Internet Archive's Wayback Machine has the "Save Page Now" tool, which allows you to manually back up a page. Through further research, and after asking here on Reddit, I discovered this script: https://github.com/overcast07/wayback-machine-spn-scripts
- Best way to feed Wayback Machine a list of URLs?
  I use this https://github.com/overcast07/wayback-machine-spn-scripts
- Most of the time I try to save a Reddit thread on Internet Archive Wayback Machine, it fails to save. Can this be fixed?
  Try using spn.sh if you can. In my experience, it's been more reliable than using the Wayback Machine's front-end.
- [Request] Userscript that clicks button on webpage after 'X' minutes and 'Y' seconds once.
  I know I could use the excellent spn.sh in a while loop instead.
- Wayback Machine Downloader - Download an Entire Website from the Wayback Machine
- I wrote a Bash script that interfaces with Wayback Machine Save Page Now (automatic error handling, can submit selective/recursive outlinks)
- Shell script for Wayback Machine Save Page Now (has auto error handling, selective/recursive outlinks)
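
Several of the posts above ask how to feed the Wayback Machine a list of URLs. The following is a minimal Python sketch of that idea, not how spn.sh itself works: it assumes the public `https://web.archive.org/save/<url>` Save Page Now endpoint, and the helper names (`read_url_list`, `save_request_url`, `submit_all`) and the 10-second delay are hypothetical choices made for illustration.

```python
import time
import urllib.request
from urllib.parse import quote

# Public Save Page Now endpoint; the target URL is appended to it.
SAVE_ENDPOINT = "https://web.archive.org/save/"


def read_url_list(text):
    """Return the unique, non-empty lines of text, in their original order."""
    seen = set()
    urls = []
    for line in text.splitlines():
        url = line.strip()
        if not url or url in seen:
            continue
        seen.add(url)
        urls.append(url)
    return urls


def save_request_url(url):
    """Build the Save Page Now request URL for one target URL."""
    # Keep common URL punctuation intact; escape spaces and other characters.
    return SAVE_ENDPOINT + quote(url, safe=":/?&=%")


def submit_all(urls, delay_seconds=10.0):
    """Request a capture of each URL in turn, pausing between requests.

    Returns a dict mapping each URL to an HTTP status code, or to the
    repr of the error if the request failed. The delay is an assumed,
    conservative value, not a documented limit.
    """
    results = {}
    for url in urls:
        try:
            with urllib.request.urlopen(save_request_url(url), timeout=120) as resp:
                results[url] = resp.status
        except OSError as exc:  # URLError and HTTPError are OSError subclasses
            results[url] = repr(exc)
        time.sleep(delay_seconds)
    return results
```

With a hypothetical `urls.txt` holding one URL per line, `submit_all(read_url_list(open("urls.txt").read()))` would request a capture of each. A dedicated script such as spn.sh adds the error handling and outlink options described in the posts above, which this sketch does not attempt.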
wayback-machine-downloader
- Ask HN: Cool Useful GitHub Repos?
  I just found this https://github.com/hartator/wayback-machine-downloader
  Anyone have anything similarly interesting/cool/niche-useful?
- ArchiveTeam is saving Blogger from Google deletion
  Send ArchiveTeam the link on IRC or here and we can save it to archive.org, then later you can use wayback-machine-downloader to grab it from archive.org.
  https://github.com/hartator/wayback-machine-downloader
- My TikTok was Hacked & Deleted and I GOT IT BACK!
  This is where it gets tricky: you need to download the code from the Wayback Machine, and he was able to do that by following these steps: https://github.com/hartator/wayback-machine-downloader
- Is there a way to quick download twitter images on the wayback machine?
  Not sure if it will work for Twitter, but I have used wayback-machine-downloader to batch download stuff.
- Forgot to backup my WordPress files before I swapped webhosting provider, am I screwed?
  In addition to archive.org, there is a GitHub repo to fetch website data. You can give it a try too. Here is the repo link
- Can I please get help downloading and saving a website for offline use?
- Hey guys, looks like we have a potential hacker on our hands. All of our company's files were deleted from our FTP. :( Is there any way we can get a cache of our website and restore everything? Any help or advice would be greatly appreciated. Thanks in advance!
  Edit: Good news! I found a solution that saved me. I was able to download the full website (including images, JS, and CSS files) using this tool: https://github.com/hartator/wayback-machine-downloader
- Hey guys, so a potential hacker managed to delete all of our company's files from our FTP. Yikes! Is there a way to retrieve a cache of our website and restore it? Any advice or tips would be greatly appreciated. Thanks in advance!
  Edit: Thank you to everyone who suggested the Wayback Machine Downloader! It saved the day and allowed me to download the full website, including images, JS, and CSS files.
- Have a lengthy flight: how to seamlessly mirror couple websites
  I've used https://github.com/hartator/wayback-machine-downloader, but it sometimes messes up CSS badly.
- what Do YOU Recommend?
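
Many of the posts above involve pulling a whole site back out of the Wayback Machine. As a rough illustration of what that takes, here is a minimal Python sketch that lists a site's captures through the Wayback CDX API and turns them into raw snapshot URLs. It is not a description of how wayback-machine-downloader is implemented; the function names are hypothetical, and the query parameters shown (`fl`, `filter`, `collapse`) are one plausible selection.

```python
from urllib.parse import urlencode

# Wayback CDX API endpoint, which lists captures for a URL or URL prefix.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def cdx_query_url(site):
    """Build a CDX query for every successfully captured page under site."""
    params = {
        "url": site + "/*",            # prefix match on everything under the site
        "output": "json",              # JSON rows; the first row names the fields
        "fl": "timestamp,original",    # only the fields needed below
        "filter": "statuscode:200",    # skip redirects and errors
        "collapse": "digest",          # drop adjacent captures with identical content
    }
    return CDX_ENDPOINT + "?" + urlencode(params)


def snapshot_urls(rows):
    """Turn parsed CDX JSON rows into URLs of the unmodified archived files.

    rows is the decoded JSON response: a header row followed by one row
    per capture. The "id_" modifier asks for the original bytes, without
    the Wayback Machine's rewritten links and toolbar.
    """
    if not rows:
        return []
    header, *captures = rows
    ts = header.index("timestamp")
    orig = header.index("original")
    return [
        f"https://web.archive.org/web/{row[ts]}id_/{row[orig]}"
        for row in captures
    ]
```

Fetching `cdx_query_url("example.com")`, decoding the JSON, and passing it to `snapshot_urls` yields one download URL per capture. Choosing which capture of each file to keep, and rebuilding the directory layout, is the part a dedicated tool handles.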
What are some alternatives?
reveddit - Review removed content on reddit. Uses the Pushshift API, built on code from removeddit.
savepagenow - A simple Python wrapper and command-line interface for archive.org's "Save Page Now" capturing service
warrick - Recover lost websites from the Web Infrastructure
waybackpack - Download the entire Wayback Machine archive for a given URL.
neocities - Neocities.org - the web site. The entire thing. Yep, we're completely open source.
wayback - IA's public Wayback Machine (moved from SourceForge)
Hexo - A fast, simple & powerful blog framework, powered by Node.js.
gba-remote-play - Stream Raspberry Pi games to a GBA via Link Cable.
ArchiveBox - Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
go-readability - Go package that cleans a HTML page for better readability.