Anybody here has access to Browsertrix Cloud?

This page summarizes the projects mentioned and recommended in the original post on /r/Archiveteam

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • browsertrix

    Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

  • Browsertrix Cloud is browser-based automated crawling suite whereas WebRecorder's other tools are currently more manual. Unfortunately we're not quite set up with a public instance of it yet and while you are more than welcome to download it from our GitHub repo and get it running locally using the local deployment docs, we can't offer access to it hosted somewhere — Check back in a 6 months maybe or sign up for the email list on that google form haha! :)

  • wayback-machine-downloader

    Download an entire website from the Wayback Machine.

  • I don't think that's a service Internet Archive offers? They do have a program called Waybackfill that allows orgs to back-fill their website's broken links with archived content which is pretty cool. Other than that however, I don't think there's an official way of actually downloading WARC files from Internet Archive. You may have luck with a tool like Wayback Machine Downloader but I've never used it and can't speak for it.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts