Organizing 6 TB of random junk

This page summarizes the projects mentioned and recommended in the original post on /r/DataHoarder

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Purpose built for real-time analytics at any scale.
InfluxDB Platform is powered by columnar analytics, optimized for cost-efficient storage, and built with open data standards.
www.influxdata.com
featured
  • stash

    An organizer for your porn, written in Go. Documentation: https://docs.stashapp.cc

    Even if it's not adult content, you might want Stash. The sorting, filtering, preview generation & scene markers make it great for organizing things for projects IMHO.

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • czkawka

    Multi functional app to find duplicates, empty folders, similar images etc.

    Great suggestion. I've been using this for a while on windows to find a list of all of a certain file type. I would also add that finding duplicate files with czkawka is a great storage saver and probably helps your drives index speed a little depending on how many duplicates there are.

  • rclip

    AI-Powered Command-Line Photo Search Tool

    For images rclip can give you search that's better than Google Photos with entirely unannotated data and pure natural English queries.

  • llama_index

    LlamaIndex is a data framework for your LLM applications

    Projects like LlamaIndex exist, but OpenAI embeddings are way to pricy for any large collection of data, so maybe switching to BERT may help.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Best way to find duplicate media, consolidate and keep the best quality?

    4 projects | /r/DataHoarder | 14 Mar 2023
  • Quick drivepool question

    2 projects | /r/DataHoarder | 25 Sep 2022
  • What do you think of my filing system? Any suggestions for improvements?

    2 projects | /r/DataHoarder | 24 Sep 2022
  • How are you handling pictures?

    2 projects | /r/homelab | 7 Aug 2022
  • Nested data backups from across the years..can anyone help me?

    2 projects | /r/DataHoarder | 26 Jun 2022