wayback-machine-scraper VS ArchiveBox

Compare wayback-machine-scraper vs ArchiveBox and see what are their differences.

wayback-machine-scraper

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. (by sangaline)

ArchiveBox

🗃 The open source self-hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more... [Moved to: https://github.com/ArchiveBox/ArchiveBox] (by pirate)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
wayback-machine-scraper ArchiveBox
6 2
405 8,085
- -
0.0 9.7
2 months ago over 3 years ago
Python Python
ISC License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

wayback-machine-scraper

Posts with mentions or reviews of wayback-machine-scraper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-05-20.

ArchiveBox

Posts with mentions or reviews of ArchiveBox. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-04-12.

What are some alternatives?

When comparing wayback-machine-scraper and ArchiveBox you can also consider the following projects:

waybackpy - Wayback Machine API interface & a command-line tool

Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.

cancel-culture - Tools for fighting abuse on Twitter

youtube-dl-webui - Another webui for youtube-dl powered by Flask.

autoscraper - A Smart, Automatic, Fast and Lightweight Web Scraper for Python

archivy - Archivy is a self-hostable knowledge repository that allows you to learn and retain information in your own personal and extensible wiki.

ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

pinboard-notes-backup - Back up the notes you’ve saved to Pinboard

WordPress - WordPress, Git-ified. This repository is just a mirror of the WordPress subversion repository. Please do not send pull requests. Submit pull requests to https://github.com/WordPress/wordpress-develop and patches to https://core.trac.wordpress.org/ instead.

promnesia - Another piece of your extended mind

grasp - A reliable org-capture browser extension for Chrome/Firefox

wallabag.el - Emacs wallabag client - A Read It Later/Web Archiving Solution in Emacs.