cbird
Command-line program for managing a media collection, with focus on Content-Based Image Retrieval (Computer Vision) methods for finding duplicates.
czkawka (https://github.com/qarmin/czkawka) claims to be able to compare videos, but honestly I wouldn't put much stock in that. Still, it could be worth a shot.
I've had luck before with stripping the audio from each video file in my collection and comparing that instead, because the audio varies less from one encode to another than the video does. I converted all the audio to WAV, computed a fuzzy hash for each file, and then compared every hash against the full list to see where there was overlap.
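A rough sketch of that audio-comparison idea in Python, for anyone who wants to try it. This is not the script described above: the fuzzy hash tool isn't named, so a simple energy-trend fingerprint (one bit per half-second window) stands in for it. The function names, the 8 kHz mono format, the window size, and the 0.9 threshold are all assumptions for illustration, and `extract_audio` assumes `ffmpeg` is installed and on your PATH.

```python
import itertools
import struct
import subprocess
import wave
from pathlib import Path


def extract_audio(video: Path, wav: Path) -> None:
    """Strip the audio track to mono 16-bit 8 kHz WAV (assumes ffmpeg is on PATH)."""
    subprocess.run(
        ["ffmpeg", "-y", "-i", str(video), "-vn", "-ac", "1", "-ar", "8000",
         "-c:a", "pcm_s16le", str(wav)],
        check=True, capture_output=True,
    )


def fingerprint(wav: Path, window: int = 4000) -> list:
    """Stand-in fuzzy hash: one bit per window, 1 if energy rose vs. the previous window."""
    with wave.open(str(wav), "rb") as f:
        if f.getsampwidth() != 2 or f.getnchannels() != 1:
            raise ValueError("expected mono 16-bit WAV")
        raw = f.readframes(f.getnframes())
    samples = struct.unpack(f"<{len(raw) // 2}h", raw)
    energies = [
        sum(s * s for s in samples[i:i + window])
        for i in range(0, len(samples) - window + 1, window)
    ]
    return [int(later > earlier) for earlier, later in zip(energies, energies[1:])]


def similarity(a: list, b: list) -> float:
    """Fraction of matching bits over the shorter fingerprint (0.0 if either is empty)."""
    n = min(len(a), len(b))
    if n == 0:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / n


def find_overlaps(wavs: list, threshold: float = 0.9) -> list:
    """Compare every pair of WAV files and return (file_a, file_b, score) for likely matches."""
    prints = {path: fingerprint(path) for path in wavs}
    matches = []
    for a, b in itertools.combinations(wavs, 2):
        score = similarity(prints[a], prints[b])
        if score >= threshold:
            matches.append((a, b, score))
    return matches
```

You would call `extract_audio` once per video and then pass the resulting WAV paths to `find_overlaps`. Note that this sketch only lines fingerprints up from the start of each file, so a copy with a trimmed intro or different padding would likely be missed; a real fuzzy hash or an offset search would handle that better.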
I use Video Duplicate Finder in a Docker container on my media server. It shows you at a glance which version of a file has the longest duration and the highest frame rate, bitrate, resolution, file size, etc.
cbird has this. When you find duplicates, you can compare them side by side too.