Python image-duplicate-detection

Open-source Python projects categorized as image-duplicate-detection

Python image-duplicate-detection Projects

  • fastdup

    fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.

  • Project mention: Visualize your dataset using DINOv2 embedding | news.ycombinator.com | 2023-05-02

    Visualizing your dataset (especially large ones) in a low-dimensional embedding space can tell you a lot about the patterns and clusters in your dataset.

    We recently release a notebook showing how you can visualize your dataset using DINOv2 models by running it on your CPU.

    Yes! No GPUs needed.

    We used it to find clusters of similar images, duplicates, and outliers in a subset of the LAION dataset

    Try it on your own dataset:

    Colab notebook - https://colab.research.google.com/github/visual-layer/fastdup/blob/main/examples/dinov2_notebook.ipynb

    GitHub repo - https://github.com/visual-layer/fastdup

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python image-duplicate-detection related posts

  • Visualize your dataset using DINOv2 embedding

    1 project | news.ycombinator.com | 2 May 2023
  • Visualize your dataset using DINOv2 embedding

    2 projects | /r/computervision | 1 May 2023
  • [R][P] How to extract feature vectors of large datasets using DINOv2 on CPU

    1 project | /r/MachineLearning | 26 Apr 2023
  • Find image duplicates and outliers – A free, scalable, efficient tool

    1 project | /r/computervision | 21 Mar 2023
  • Find image duplicates and outliers – A free, scalable, efficient tool

    1 project | news.ycombinator.com | 21 Mar 2023
  • [R] We found nearly half a billion duplicated images on LAION-2B-en.

    2 projects | /r/MachineLearning | 6 Mar 2023
  • Dedup-ing LAION (60M duplicates) and ImageNet (1.2M duplicates) with fastdup

    1 project | news.ycombinator.com | 7 Mar 2023
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 1 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

Project Stars
1 fastdup 1,403

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com