Coincidentally, I was looking through the list of apps that do this the other day; I think I looked at all the other ones they list as competitors.
My particular use case involved images that might be very near duplicates (screenshots of the same web page). Some of the tools cover that, though it feels like a slightly separate task from finding exact bit-for-bit duplicates, so not all of them do.
One interesting one I found that wasn't listed in the Readme was:
https://github.com/kornelski/dupe-krill
It has some notes about using BTreehashes to progressively compare files. I'm not sure how much difference that makes in practice, but it sounded elegant.
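
For what it's worth, here is a rough Python sketch of how I understand the general idea of progressive comparison: files are ordered by size first, and content is hashed one block at a time only when two files still look identical. This is not dupe-krill's actual code (that project is written in Rust); the names `LazyFile`, `compare`, `find_duplicates` and the `CHUNK` size are all made up for illustration, and it assumes that matching SHA-256 block digests mean matching content.

```python
import hashlib
import os
from functools import cmp_to_key

CHUNK = 64 * 1024  # hypothetical block size, not taken from dupe-krill


class LazyFile:
    """A file whose content is hashed one block at a time, only on demand."""

    def __init__(self, path):
        self.path = path
        self.size = os.path.getsize(path)
        self.hashes = []  # digests of the blocks read so far

    def block_hash(self, i):
        """Return the digest of block i, or None if i is past the end."""
        while len(self.hashes) <= i:
            offset = len(self.hashes) * CHUNK
            if offset >= self.size:
                return None
            with open(self.path, "rb") as f:
                f.seek(offset)
                self.hashes.append(hashlib.sha256(f.read(CHUNK)).digest())
        return self.hashes[i]


def compare(a, b):
    """Order by size, then block by block, stopping at the first difference."""
    if a.size != b.size:
        return -1 if a.size < b.size else 1
    i = 0
    while True:
        ha, hb = a.block_hash(i), b.block_hash(i)
        if ha is None and hb is None:
            return 0  # every block matched
        if ha != hb:
            return -1 if ha < hb else 1
        i += 1


def find_duplicates(paths):
    """Return lists of paths whose contents compare equal."""
    files = sorted((LazyFile(p) for p in paths), key=cmp_to_key(compare))
    groups = []
    for f in files:
        # After sorting, equal files are adjacent, so compare with the last group.
        if groups and compare(groups[-1][-1], f) == 0:
            groups[-1].append(f)
        else:
            groups.append([f])
    return [[f.path for f in g] for g in groups if len(g) > 1]
```

With this layout, a file with a unique size is never opened at all, and two large files that differ in their first block cost one block read each. Whether that beats hashing everything up front probably depends on how many same-sized files you have.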