wayback-machine-downloader

Download an entire website from the Wayback Machine. (by hartator)

Wayback-machine-downloader Alternatives

Similar projects and alternatives to wayback-machine-downloader

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. A higher number therefore indicates a better wayback-machine-downloader alternative or a more similar project.

Reviews and mentions

Posts with mentions or reviews of wayback-machine-downloader. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-08-18.
  • Need help copying a website from the "wayback machine" internet archive
    https://github.com/hartator/wayback-machine-downloader https://github.com/jsvine/waybackpack or google "wayback machine downloader" for other options
  • Wayback Machine Downloader
  • Hacker News top posts: Jul 12, 2021
    Wayback Machine Downloader (17 comments)
  • hartator/wayback-machine-downloader
  • Download Site from Wayback Machine
  • Wayback Machine Downloader – Download an Entire Website from the Wayback Machine
    news.ycombinator.com | 2021-07-11
  • Ask HN: Does anyone have an archive of quirky.com?
    news.ycombinator.com | 2021-07-11
    Some of it is on archive.org:

    https://web.archive.org/web/20140607001646/https://www.quirk...

    You can use this tool to download the files for the site:

    https://github.com/hartator/wayback-machine-downloader/

  • Is there a way to scrape video links off a youtube channel and see if any of the links are archived on web.archive.org? without pasting links one by one
  • Downloading a website FROM Wayback Machine
    I think wayback_machine_downloader can download the raw files from the crawl but I haven't used it in a while.
  • Predicting the price of Heating Oil using PyCaret
    dev.to | 2021-07-07
    I started this project back in Dec 2020 in order to build up my dataset. The lambda function has been running for about 6 months, and I have a decent amount of data to work with. In order to expand my dataset, I was able to pull more data using The Wayback Machine on web.archive.org. The Wayback Machine stores a snapshot of many pages on the internet. It doesn't have every site, but it did have some snapshots from cheaptestoil.com. To get that data, I used https://github.com/hartator/wayback-machine-downloader to download the archive data. The archive only had 7 snapshots, between the dates of Aug 2020 and Oct 2021.
  • Retrieving old internet content
    reddit.com/r/OSINT | 2021-05-06
    My go-to solution is the wayback machine. Sometimes I use it in combination with a downloader (https://github.com/hartator/wayback-machine-downloader) so that I can search the source files for, e.g., tumblr/youtube links that are still valid.
  • So I found an expired domain ranking snippets in my niche
    reddit.com/r/juststart | 2021-03-18
    One thing you may find useful is to download all the old articles from archive.org. Not so you can copy them verbatim but so you can easily see what they were about and have all the slugs for when you do your 301’s. I did this recently using this (which is free) https://github.com/hartator/wayback-machine-downloader it’s a Ruby script but good instructions on how to make it work. Very handy.
  • Blog with Markdown and Git, and degrade gracefully through time
    news.ycombinator.com | 2021-02-08
    https://github.com/hartator/wayback-machine-downloader

    (I discuss it in my https://www.gwern.net/Search tutorial and use it every once in a while, e.g. to make my mirror of 'Climb Mount Improbable' https://www.gwern.net/docs/genetics/selection/www.mountimpro... or 'Hard Truths From Soft cats' https://www.gwern.net/images/hardtruthsfromsoftcats.tumblr.c... )

  • What's an efficient way to download a crawl of a page from Wayback Machine?
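
One of the posts above describes downloading a site and then searching the source files for tumblr/youtube links. A minimal sketch of that search step, assuming the site has already been downloaded to a local directory: the `websites/example.com` path, the `find_links` name, and the regular expression are illustrative assumptions, not part of wayback-machine-downloader. It does not check whether the links are still valid.

```python
import re
from pathlib import Path

# Hypothetical pattern: http(s) URLs whose host is (a subdomain of)
# tumblr.com, youtube.com, or youtu.be.
LINK_RE = re.compile(
    r"https?://(?:[\w-]+\.)*(?:tumblr\.com|youtube\.com|youtu\.be)[^\s\"'<>)]*",
    re.IGNORECASE,
)

def find_links(root):
    """Return the sorted, de-duplicated matching URLs found in files under root."""
    found = set()
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        # Archived pages may use mixed encodings; skip undecodable bytes.
        text = path.read_text(encoding="utf-8", errors="ignore")
        found.update(LINK_RE.findall(text))
    return sorted(found)

if __name__ == "__main__":
    # Assumed location of the downloaded files; adjust to your own setup.
    for url in find_links("websites/example.com"):
        print(url)
```

Collecting the matches in a set keeps each link once even when it appears in many archived pages.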

Stats

Basic wayback-machine-downloader repo stats
Mentions: 18
Stars: 3,514
Activity: 4.6
Last commit: about 1 month ago

hartator/wayback-machine-downloader is an open source project licensed under the GNU General Public License v3.0 or later, which is an OSI-approved license.
