LSH
Neural-Scam-Artist
Our great sponsors
LSH | Neural-Scam-Artist | |
---|---|---|
1 | 2 | |
273 | 22 | |
- | - | |
2.8 | 0.0 | |
11 months ago | over 2 years ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
LSH
-
D Efficient Way To Cluster Millions Of Face
I'm looking into this, after reading the wikipedia entry of this it sound promissing! I already found a python lib https://github.com/mattilyra/LSH for this, I will get back to you once I tested this!
Neural-Scam-Artist
What are some alternatives?
datasketch - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
image-ndd-lsh - Near-duplicate image detection using Locality Sensitive Hashing
bertviz - BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Cython - The most widely used Python to C compiler
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
dedup - Find duplicate text files.
Transformers4Rec - Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
intertext - Detect and visualize text reuse
Extracting-Training-Data-from-Large-Langauge-Models - A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020
tasksource - Datasets collection and standardization preprocessings for NLP extreme multitask learning