wayback-machine-downloader
go-readability
Our great sponsors
wayback-machine-downloader | go-readability | |
---|---|---|
48 | 2 | |
5,034 | 131 | |
- | - | |
0.0 | 0.0 | |
2 months ago | almost 2 years ago | |
Ruby | HTML | |
GNU General Public License v3.0 or later | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wayback-machine-downloader
-
Ask HN: Cool Useful GitHub Repos?
I just found this https://github.com/hartator/wayback-machine-downloader
anyone have anything similarly interesting/cool/niche-useful ?
-
ArchiveTeam is saving Blogger from Google deletion
Send ArchiveTeam the link on IRC or here and we can save it to archive.org, then later you can use wayback-machine-downloader to grab it from archive.org.
-
My TikTok was Hacked & Deleted and I GOT IT BACK!
This is where it gets tricky, you need to download the code from the wayback machine and he was able to do that by following these steps: https://github.com/hartator/wayback-machine-downloader
-
Is there a way to quick download twitter images on the wayback machine?
Not sure if it will work for twitter, but I have used wayback-machine-downloader to batch download stuff.
-
Forgot to backup my WordPress files before I swapped webhosting provider, am I screwed?
Adding to archive.org, there is a github repo to fetch website data. You can give a try too. Here is the repo link
- Can I please get help downloading and saving a website for offline use?
-
Hey guys, looks like we have a potential hacker on our hands. All of our company's files were deleted from our FTP. :( Is there any way we can get a cache of our website and restore everything? Any help or advice would be greatly appreciated. Thanks in advance!
Edit: Good news! I found a solution that saved me. I was able to download the full website (including images, JS, and CSS files) using this tool: https://github.com/hartator/wayback-machine-downloader
-
Hey guys, so a potential hacker managed to delete all of our company's files from our FTP. Yikes! Is there a way to retrieve a cache of our website and restore it? Any advice or tips would be greatly appreciated. Thanks in advance!
Edit: Thank you to everyone who suggested the Wayback Machine Downloader! It saved the day and allowed me to download the full website, including images, JS, and CSS files.
-
Have a lengthy flight: how to seamlessly mirror couple websites
I've used https://github.com/hartator/wayback-machine-downloader but it sometimes messes up CSS badly
- what Do YOU Recommend?
go-readability
-
Which library/project do you wish was ported to golang?
https://github.com/go-shiori/go-readability https://github.com/mauidude/go-readability
-
Blog with Markdown and Git, and degrade gracefully through time
In terms of extracting the actual blog content from pages, there is a go library that implements the readability algorithm:
https://github.com/mauidude/go-readability
This is the kind of thing pocket/instapaper do to extract the main content from a page in a format that's easier to read (and also probably to programmatically modify)
What are some alternatives?
savepagenow - A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
docs - This is a repo of the RetroArch official document page.
warrick - Recover lost websites from the Web Infrastructure
website - The Caddy website
neocities - Neocities.org - the web site. The entire thing. Yep, we're completely open source.
temporalite-archived - An experimental distribution of Temporal that runs as a single process
Hexo - A fast, simple & powerful blog framework, powered by Node.js.
blissue - A blog based on github issues
wayback-machine-spn-scripts - Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now
simonwillisonblog-backup - Backups of the database for simonwillison.net
gba-remote-play - 📡 Stream Raspberry Pi games to a GBA via Link Cable.
bdv32 - This is my website