Anybody actually hoard something they weren't able to find later on the internet?

This page summarizes the projects mentioned and recommended in the original post on /r/DataHoarder

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • dmca

    Repository with text of DMCA takedown notices as received. GitHub does not endorse or adopt any assertion contained in the following notices. Users identified in the notices are presumed innocent until proven guilty. Additional information about our DMCA policy can be found at

  • Scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • scrapy-splash

    Scrapy+Splash for JavaScript integration

  • To add to u/nemec, here are the docs for scrapy splash which I’ve used several times (and just requires you to spin up their docker container to get started): https://github.com/scrapy-plugins/scrapy-splash

  • btcrecover

    An open source Bitcoin wallet password and seed recovery tool designed for the case where you already know most of your password/seed, but need assistance in trying different possible combinations.

  • The guide for the patterns is a bit long but very useful. Here are some examples of a patterns that I used:

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts