Archive Your Reddit Data Before It's Too Late

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • Reddit

  • Reddit actually used to be open source until mid-2017: https://github.com/reddit-archive/reddit

  • Shreddit

    Remove your comment history on Reddit as deleting an account does not do so.

  • Personally, I wiped all of my personal Reddit accounts last year. I now use anonymous accounts which I also wipe after awhile. I use Shreddit [^0] to wipe the account before deleting it. I do wish that I was able to backup some valuable conversations, but I honestly wouldn't find much worth in the backups without the additional context. So perhaps something to explore -- also storing the context of a particular comment with a configurable depth.

    [^0]: https://github.com/x89/Shreddit

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • PowerDeleteSuite

    Power Delete Suite for Reddit

  • I've been using this: https://github.com/j0be/PowerDeleteSuite/

    It runs client-side in the browser and has filter & export functionality if you want to retain anything.

  • my-best-of-reddit

    Get your upvoted posts and comments from Reddit delivered over Telegram

  • Shameless plug: I've written a utility which delivers you your favourited posts and comments. While it doesn't support backfilling at the moment, you're welcome to take a look: https://github.com/rounakdatta/my-best-of-reddit.

  • asdf

    Extendable version manager with support for Ruby, Node.js, Elixir, Erlang & more

  • You may consider using asdf or some other Python version manager to get a newer (or older) Python

    https://github.com/asdf-vm/asdf

  • qpixel

    Q&A-based community knowledge-sharing software

  • People keep suggesting Lemmy but I think decentralized social media is preferred by people like us who are on this site regularly. But in real world only centralized social media seems to work. So we might as well adopt a model that is good in the long run maybe something like Wikipedia?

    Someone already did this [1] as stackoverlflow alternative

    [0]: https://codidact.com/

  • sydent

    Sydent: Reference Matrix Identity Server

  • Yes.

    What makes this approach really stand out is that, even if there are 5 different aggregators, it doesn't matter, because they're all aggregating the same data. And even if someone decides to use one of the existing platforms (e.g. discourse), posts/comments from this aggregator to the platform still show up. So this is actually a way to alleviate the "yet another standard" issue (though I acknowledge it won't be as good as a single common instance, due to different formats).

    Matrix is a start. But when I go to https://matrix.org/, I see a giant header and "Try Now". When I go to https://matrix.org/docs/projects/try-matrix-now/, I see a bunch of clients. When I go to https://app.element.io/#/welcome and click "Explore Rooms", I see random forums including some russian forums.

    I think that you go to the site hub, you should immediately see a feed of curated popular posts and a "Create Account" form, similar to https://reddit.com/r/all, along with a search bar and list of filters. And when you create an account, you choose some suggestions and get presented with various communities, also like Reddit. Most people are barely even going to try your site, if you want them to join and put real effort into contributing you need to present good content as fast as possible.

    Another issue with Matrix is that a lot of content just isn't on Matrix. As well as Reddit and other existing communities. For example, Rust has a subreddit, discourse, Matrix forum, and Zulip, and probably a discord somewhere too. There should really be a single platform where I can see posts from all of this, because I'm definitely not going to be checking each one individually.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • reddit-shreddit

    Program to delete ENTIRE Reddit user post and comments history, AND daily job to keep user history limited to X days.

  • I wrote this comment a couple years ago, but unless something significant has changed, this still stands.

    Be careful. Shreddit does not get rid of all reddit comments, only the ones on your profile. Reddit stores a list of the last 1000 comments and posts you've made to your profile and that list is what Shreddit and 99% of reddit 'deletion' tools use. To truly remove all your comments and posts from reddit, look into reddit-shreddit [0]. Despite the similar name, it is completely different and leverages all your posts and comments from a reddit data request to delete all of your reddit posts and comments. It is infact the only method I have found, except for perhaps a GDPR request, that is truly a complete deletion.

    However, this only covers the deletion from reddit itself. There have been periodic reddit archivals ever since it was created. Up to 2015 they were done yearly, ever since they have been done monthly. Since they have always been up for download, all of your reddit data will still be out there, however you can remove your data from the main source of these archives called pushshift. I haven't done it personally but I've seen it said on reddit, you can email the dev of pushshift or dm him on reddit and he will remove your information from his archive.

    [0] https://github.com/nixfu/reddit-shreddit

  • reddit-user-to-sqlite

    Pull Reddit user data into a SQLite database

  • Done: https://github.com/xavdid/reddit-user-to-sqlite/issues/15

    I see someone else has already filed one about saving the context.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts