Archiving the Gaza conflict

This page summarizes the projects mentioned and recommended in the original post on /r/DataHoarder

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • ArchiveBox

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

  • I'd recommend ArchiveBox, it takes care of extracting videos and media files using youtube-dl, and it also saves to Archive.org for redundancy.

  • videoduplicatefinder

    Video Duplicate Finder - Crossplatform

  • To dedup video, I found after years of search one software that worked good enough to be useful: Video Duplicate Finder by 0x90d. It's open source and very easy to use with a GUI or in command-line. It will build a database of screenshots at different timepoints in each video and compare them. It works extremely well, it can find duplicates of different size, video quality (bitrate, resolution) and even different durations. It's the fastest video deduplicator and also the most reliable I have ever used, others are gadgets compared to this one. Rarely, some videos are not properly matched so you do need to check manually if you want to retain a maximum of videos, but otherwise if you don't mind losing a few ones you can just select all duplicates and remove them.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • archivenow

    A Tool To Push Web Resources Into Web Archives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Best way to feed Wayback Machine a list of URLs?

    3 projects | /r/Archiveteam | 14 Dec 2021
  • Vice website is shutting down

    1 project | news.ycombinator.com | 23 Feb 2024
  • ArchiveBox – open-source self-hosted web archiving

    2 projects | news.ycombinator.com | 13 Jan 2024
  • Best practices for archiving websites

    2 projects | /r/datacurator | 6 Dec 2023
  • BetaWiki – An open encyclopedia of software history

    1 project | news.ycombinator.com | 20 Jun 2023