Top 7 Python similarity Projects
-
fastdup
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
-
python-string-similarity
A library implementing different string similarity and distance measures using Python.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Duplicate-Image-Finder
difPy - Python package for finding duplicate or similar images within folders
-
unisim
UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
-
pysimilar
A python library for computing the similarity between two strings (text) based on cosine similarity
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Visualizing your dataset (especially large ones) in a low-dimensional embedding space can tell you a lot about the patterns and clusters in your dataset.
We recently release a notebook showing how you can visualize your dataset using DINOv2 models by running it on your CPU.
Yes! No GPUs needed.
We used it to find clusters of similar images, duplicates, and outliers in a subset of the LAION dataset
Try it on your own dataset:
Colab notebook - https://colab.research.google.com/github/visual-layer/fastdup/blob/main/examples/dinov2_notebook.ipynb
GitHub repo - https://github.com/visual-layer/fastdup
Project mention: Google UniSim for efficient similarity computation | news.ycombinator.com | 2023-11-30
Week 4: 🪞Image Deduplication
Python similarity related posts
Index
What are some of the best open-source similarity projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | fastdup | 1,403 |
2 | python-string-similarity | 946 |
3 | Duplicate-Image-Finder | 390 |
4 | aurora | 74 |
5 | unisim | 63 |
6 | pysimilar | 19 |
7 | image-deduplication-plugin | 8 |
Sponsored