Organizing 6 TB of random junk

This page summarizes the projects mentioned and recommended in the original post on /r/DataHoarder

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • stash

    An organizer for your porn, written in Go. Documentation: https://docs.stashapp.cc

  • Even if it's not adult content, you might want Stash. The sorting, filtering, preview generation & scene markers make it great for organizing things for projects IMHO.

  • czkawka

    Multi functional app to find duplicates, empty folders, similar images etc.

  • Great suggestion. I've been using this for a while on windows to find a list of all of a certain file type. I would also add that finding duplicate files with czkawka is a great storage saver and probably helps your drives index speed a little depending on how many duplicates there are.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • rclip

    AI-Powered Command-Line Photo Search Tool

  • For images rclip can give you search that's better than Google Photos with entirely unannotated data and pure natural English queries.

  • llama_index

    LlamaIndex is a data framework for your LLM applications

  • Projects like LlamaIndex exist, but OpenAI embeddings are way to pricy for any large collection of data, so maybe switching to BERT may help.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Best way to find duplicate media, consolidate and keep the best quality?

    4 projects | /r/DataHoarder | 14 Mar 2023
  • Quick drivepool question

    2 projects | /r/DataHoarder | 25 Sep 2022
  • What do you think of my filing system? Any suggestions for improvements?

    2 projects | /r/DataHoarder | 24 Sep 2022
  • How are you handling pictures?

    2 projects | /r/homelab | 7 Aug 2022
  • Nested data backups from across the years..can anyone help me?

    2 projects | /r/DataHoarder | 26 Jun 2022