-
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
SingleFileZ
Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file
Here's a github gist with some useful wget commands. And here's a github repo with a bunch of tools for web archiving. And then, here's another github repo on data hoarding in general. Hope this helps!!
There's also archivebox. It's sort of like a self hosted wayback machine: https://github.com/ArchiveBox/ArchiveBox