Clover
similarity
Clover | similarity | |
---|---|---|
1 | 7 | |
11 | 998 | |
- | 0.3% | |
1.7 | 5.9 | |
about 1 year ago | 10 days ago | |
Python | Python | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Clover
-
R&D: Clover, Tree Structure-based Efficient DNA Clustering for DNA-Based Storage
Deoxyribonucleic acid (DNA)-based data storage is a promising new storage technology which has the advantage of high storage capacity and long storage time compared with traditional storage media. However, the synthesis and sequencing process of DNA can randomly generate many types of errors, which makes it more difficult to cluster DNA sequences to recover DNA information. Currently, the available DNA clustering algorithms are targeted at DNA sequences in the biological domain, which not only cannot adapt to the characteristics of sequences in DNA storage, but also tend to be unacceptably time-consuming for billions of DNA sequences in DNA storage. In this paper, we propose an efficient DNA clustering method termed Clover for DNA storage with linear computational complexity and low memory. Clover avoids the computation of the Levenshtein distance by using a tree structure for interval-specific retrieval. We argue through theoretical proofs that Clover has standard linear computational complexity, low space complexity, etc. Experiments show that our method can cluster 10 million DNA sequences into 50 000 classes in 10 s and meet an accuracy rate of over 99%. Furthermore, we have successfully completed an unprecedented clustering of 10 billion DNA data on a single home computer and the time consumption still satisfies the linear relationship. Clover is freely available at https://github.com/Guanjinqu/Clover.
similarity
-
New free tool that uses fine-tuned BERT model to surface answers from research papers
Tensorflow Ranking and Tensorflow similarity (maybe relevant/irrelevant contrastive learning?) look like they could be useful.
-
Non-Machine Learning Image Matching with a Vector DB
There is the metric learning problem to learn a hash for similarity https://github.com/tensorflow/similarity
That said, I don't see many good models available for download on tfhub or huggingface optimized for it, but you can always programmatically modify your images (if you truly mean identical to humans) - change white balance, crop, rotate, select adjacent frames from videos, etc. and optimize a network that is small enough for you to be satisfied and see if that works, as a possible alternative.
-
Face Detection for 520 People
Metric learning has great implementations inside Tensorflow Similarity library: https://github.com/tensorflow/similarity Although the documentation is quite bad, but the jupyter notebooks are great.
-
[P] TensorFlow Similarity 0.16 is out
Just a quick note that TensorFlow Similarity 0.16 is out -- this release beside adding the XMB loss is mostly focus on refactoring and optimizing the core components to ensure everything works smoothly and accurately. Details are in the changelog as usual and a simple pip install -U tensorflow_similarity should just work.
- Self-supervised learning added to TensorFlow Similarity
-
[P] TensorFlow Similarity now self-supervised training
Very happy to announce that as part of the 0.15 release, TensorFlow Similarity now support self-supervised learning using STOA algorithms. To help you get started we included in the release a detailed getting started notebook that you can run in Colab. This notebook shows you how to use SimSiam self-supervised pre-training to almost double the accuracy compared to a model trained from scratch on CIFAR 10.
-
TensorFlow Introduces ‘TensorFlow Similarity’, An Easy And Fast Python Package To Train Similarity Models Using TensorFlow
Github: https://github.com/tensorflow/similarity
What are some alternatives?
uis-rnn - This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
pytorch-metric-learning - The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
dedupe - :id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
pgANN - Fast Approximate Nearest Neighbor (ANN) searches with a PostgreSQL database.
awesome-community-detection - A curated list of community detection research papers with implementations.
quaterion - Blazing fast framework for fine-tuning similarity learning models
Unsupervised-Classification - SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
ContraD - Code for the paper "Training GANs with Stronger Augmentations via Contrastive Discriminator" (ICLR 2021)
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
sparse_dot_topn - Python package to accelerate the sparse matrix multiplication and top-n similarity selection
Keras - Deep Learning for humans
finetuner - :dart: Task-oriented embedding tuning for BERT, CLIP, etc.