How do I read the file? First I tried to extract it, and that worked, but what I got is a text file that I can't read. I saw a few people saying it was just a JSON file, so I tried a JSON reader, but it says the JSON data is invalid. Then I tried this program, but nothing happens: no new file is created or anything. Here's a screenshot. Maybe I'm doing something wrong, but I don't know, because the script doesn't have any instructions on how to use it!
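One possible cause, and this is only a guess since the thread doesn't confirm the file's format: dumps like this are often newline-delimited JSON (one JSON object per line), which a regular JSON reader rejects because the file as a whole isn't a single valid JSON document. A minimal sketch for reading such a file line by line with Python's standard library is below; the function name `iter_ndjson` and the file name `extracted_dump.txt` are made up for illustration.

```python
import json
from typing import Any, Iterator


def iter_ndjson(path: str) -> Iterator[Any]:
    """Yield one parsed value per non-empty line of a newline-delimited JSON file.

    Assumes (not confirmed by the thread) that each line is its own JSON object.
    """
    with open(path, "r", encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue  # ignore blank lines
            try:
                yield json.loads(line)
            except json.JSONDecodeError as error:
                # Report and skip lines that are not valid JSON on their own.
                print(f"skipping line {line_number}: {error}")


if __name__ == "__main__":
    # "extracted_dump.txt" is a placeholder; use the path of your extracted file.
    for index, record in enumerate(iter_ndjson("extracted_dump.txt")):
        print(record)
        if index >= 4:  # only show the first five records
            break
```

If every line gets reported as skipped, the file is probably not in this format and something else is going on, for example the extraction didn't fully decompress it.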
If you only want to back up stuff from specific subs, you could use RedditScrape. It queries the official PushShift API to get posts and then downloads them via gallery-dl. I have been running it for almost 14 hours now and have downloaded 187k media files (227 GiB) from a few subs that interest me. I might be getting rate limited by now, though I've been using a VPN, so I could just switch location if really necessary.
Well done, now you should make it sane. No need to reinvent the wheel here. Just rewrite reddit-html-archiver to use the raw JSON from redarcs rather than the PushShift API.