-
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
eternity
Discontinued. Bypass Reddit's 1000-item listing limits by externally storing your Reddit items (saved, created, upvoted, downvoted, hidden) in your own database
-
reddit_export_userdata
Export user data from your Reddit accounts. Submissions, comments, and saved and upvoted content are supported.
Neat, yeah, that sort of thing. But it seems to focus heavily on completeness of archiving rather than being light on space. It's encouraging that they might at least have, e.g., the ability to add an adblocker to Puppeteer: https://github.com/ArchiveBox/ArchiveBox/issues/51