-
fetchurls
A bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.
-
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
I already run a YaCy node, and it's sometimes decent at crawling. The list of URLs it collects could then be fed into ArchiveBox.
I've been playing around with ArchiveBox. It offers several different options for storing things.
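As a rough illustration of the crawl-then-archive idea mentioned above, here is a minimal Python sketch that tidies a crawler's URL list before it is handed to an archiver. Everything in it is an assumption for illustration: the function name `clean_url_list`, the skipped extensions, and the one-URL-per-line input format are hypothetical and are not taken from YaCy, fetchurls, or ArchiveBox.

```python
from urllib.parse import urldefrag, urlsplit

# Hypothetical filter: asset types we assume are not worth archiving on their own.
SKIP_EXTENSIONS = (".css", ".js", ".png", ".jpg", ".gif", ".ico")


def clean_url_list(lines):
    """Return unique http(s) URLs from an iterable of text lines, in first-seen order."""
    seen = set()
    cleaned = []
    for line in lines:
        url = line.strip()
        if not url or url.startswith("#"):
            continue  # skip blank lines and comment lines
        url, _fragment = urldefrag(url)  # drop "#section" anchors
        parts = urlsplit(url)
        if parts.scheme not in ("http", "https"):
            continue  # ignore ftp:, mailto:, and similar
        if parts.path.lower().endswith(SKIP_EXTENSIONS):
            continue  # ignore static assets
        if url not in seen:
            seen.add(url)
            cleaned.append(url)
    return cleaned


if __name__ == "__main__":
    import sys

    # Read one URL per line from stdin, print the cleaned list to stdout.
    for url in clean_url_list(sys.stdin):
        print(url)
```

Saved under a hypothetical name such as `clean_urls.py`, the script could sit between the crawler's exported list and the archiver, for example `python clean_urls.py < crawl.txt | archivebox add`. ArchiveBox documents reading URLs from standard input, but check the usage notes for your installed version before relying on that.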
NOTE:
The number of mentions on this list counts mentions in common posts plus user-suggested alternatives.
Hence, a higher number indicates a more popular project.