I'm glad you like the ideas. As for the browser extension, I think it could be implemented, but doing so might lower users' anonymity because of browser fingerprinting. The backend would use either Selenium or torrequests to scrape the sites and save them in a database. The main problem I have at the moment is how to prevent the scraper from downloading fucked up shit. I have two options: don't download images or videos (only save the HTML + CSS from sites), or only scrape sites from a list of trusted onions (e.g. Facebook, The Hidden Wiki, etc.).
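To make the two options concrete, here is a rough Python sketch of both checks. It only uses the standard library and does no networking, so it is independent of whether Selenium or torrequests ends up doing the fetching. The names `TRUSTED_ONIONS`, `SAVED_TYPES`, `is_trusted` and `should_save` are made up for illustration, and the hostname in the allowlist is a placeholder, not a real onion address.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist of trusted hosts (placeholder value, not a real onion).
TRUSTED_ONIONS = {"exampletrustedsite.onion"}

# Only these MIME types get stored; images, videos and everything else are skipped.
SAVED_TYPES = {"text/html", "text/css"}


def is_trusted(url, allowlist=TRUSTED_ONIONS):
    """Return True if the URL's host is an allowlisted onion or a subdomain of one."""
    host = (urlsplit(url).hostname or "").lower()
    return any(host == trusted or host.endswith("." + trusted) for trusted in allowlist)


def should_save(content_type):
    """Return True if a Content-Type header value names a type we want to keep."""
    # Drop parameters such as "; charset=utf-8" before comparing.
    mime = content_type.split(";", 1)[0].strip().lower()
    return mime in SAVED_TYPES


if __name__ == "__main__":
    print(is_trusted("http://exampletrustedsite.onion/index.html"))
    print(should_save("image/png"))
```

The scraper would call `is_trusted` before requesting a URL and `should_save` on the response's `Content-Type` header before writing anything to the database. The two checks are independent, so either option can be used alone or both together.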
You are right about the content; some sort of filtering (or avoiding it completely) is required. For downloading a complete site (to make it browsable) there is the classic tool https://github.com/xroche/httrack/tree/master. I don't know if it supports a SOCKS5 proxy, but it seems doable.