wayback-machine-scraper VS ArchiveBox

Compare wayback-machine-scraper vs ArchiveBox and see how they differ.

wayback-machine-scraper

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. (by sangaline)
               wayback-machine-scraper   ArchiveBox
Mentions       6                         248
Stars          405                       19,790
Growth         -                         3.4%
Activity       0.0                       9.8
Last commit    2 months ago              3 days ago
Language       Python                    Python
License        ISC License               MIT
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
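
The page does not publish the exact formulas behind Growth and Activity, so the following is only a minimal sketch of how metrics matching these descriptions could be computed. It rests on several assumptions: growth is a plain percentage change between two monthly star counts, recent commits are favoured by an exponential decay with a 30-day half-life, and the activity rating is ten times the share of tracked projects that score lower. The function names, the half-life, and the sample numbers are all hypothetical.

```python
from datetime import date


def month_over_month_growth(stars_last_month: int, stars_now: int) -> float:
    """Percentage change in stars between two monthly snapshots (assumed formula)."""
    if stars_last_month <= 0:
        raise ValueError("a positive baseline is needed to compute growth")
    return (stars_now - stars_last_month) / stars_last_month * 100.0


def weighted_commit_score(commit_dates, today: date, half_life_days: float = 30.0) -> float:
    """Sum of commit weights; a commit's weight halves every half_life_days.

    The exponential decay and the 30-day half-life are assumptions chosen
    only to illustrate "recent commits have higher weight than older ones".
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_rating(score: float, all_scores) -> float:
    """Map a score to a 0-10 scale by rank among all tracked projects.

    A rating of 9.0 means 90% of tracked projects score strictly lower,
    i.e. the project is in the top 10%.
    """
    if not all_scores:
        raise ValueError("all_scores must not be empty")
    lower = sum(1 for s in all_scores if s < score)
    return 10.0 * lower / len(all_scores)


if __name__ == "__main__":
    # Hypothetical numbers, not taken from either project.
    today = date(2024, 3, 10)
    commits = [date(2024, 3, 10), date(2024, 2, 9)]
    print(month_over_month_growth(1000, 1034))
    print(weighted_commit_score(commits, today))
    print(activity_rating(9, list(range(10))))
```

Ranking against all tracked projects, rather than reporting the raw weighted score, is what makes the number relative: a project's rating can change when other projects become more or less active, even if its own commit history stays the same.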

wayback-machine-scraper

Posts with mentions or reviews of wayback-machine-scraper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-05-20.

ArchiveBox

Posts with mentions or reviews of ArchiveBox. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-07.

What are some alternatives?

When comparing wayback-machine-scraper and ArchiveBox you can also consider the following projects:

waybackpy - Wayback Machine API interface & a command-line tool

Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.

cancel-culture - Tools for fighting abuse on Twitter

paimon-moe - Your best Genshin Impact companion! Help you plan what to farm with ascension calculator and database. Also track your progress with todo and wish counter.

autoscraper - A Smart, Automatic, Fast and Lightweight Web Scraper for Python

SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file

WordPress - WordPress, Git-ified. This repository is just a mirror of the WordPress subversion repository. Please do not send pull requests. Submit pull requests to https://github.com/WordPress/wordpress-develop and patches to https://core.trac.wordpress.org/ instead.

ArchivesSpace - The ArchivesSpace archives management tool

grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.

knowledge - Everything I know

logseq - A local-first, non-linear, outliner notebook for organizing and sharing your personal knowledge base. Use it to organize your todo list, to write your journals, or to record your unique life.