umap
giotto-tda
Metric | umap | giotto-tda |
---|---|---|
Mentions | 10 | 2 |
Stars | 6,946 | 805 |
Growth | - | 1.9% |
Activity | 8.3 | 0.0 |
Latest commit | 3 days ago | about 1 month ago |
Language | Python | Python |
License | BSD 3-clause "New" or "Revised" License | GNU General Public License v3.0 or later |
Stars: the number of stars that a project has on GitHub. Growth: month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
umap
-
[OC] Clustering Images with OpenAI CLIP, T-SNE, UMAP & Plotly
UMAP GitHub repository: https://github.com/lmcinnes/umap
-
UMAP clustering in Ruby
Uniform Manifold Approximation and Projection (UMAP) is, along with t-SNE, a well-known dimensionality reduction method.
-
Introducing the Semantic Graph
A number of excellent topic modeling libraries exist in Python today. BERTopic and Top2Vec are two of the most popular. Both use sentence-transformers to encode data into vectors, UMAP for dimensionality reduction and HDBSCAN to cluster nodes.
-
Using the 80:20 rule, what top 20% of your tools, statistical tests, activities, etc. do you use to generate 80% of your results?
As with anything, it depends on the problem. But T-SNE and UMAP are often good.
-
[D] In UMAP and PyNNDescent, the conversion of Cosine and Correlation measures to distance metric seems problematic
UMAP distances.py: umap/distances.py at master · lmcinnes/umap (github.com)
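The thread title above concerns how cosine and correlation similarities are converted into distances. As a minimal sketch of one well-known caveat (an assumption about the kind of issue at stake, not a summary of the thread or of UMAP's actual code): the common conversion `1 - cosine_similarity` does not satisfy the triangle inequality, whereas the angle between the vectors does. The function names below are hypothetical, and only the Python standard library is used.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def cosine_distance(u, v):
    """The common conversion 1 - cos(theta); not a true metric."""
    return 1.0 - cosine_similarity(u, v)

def angular_distance(u, v):
    """Angle in radians between u and v; satisfies the triangle inequality."""
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    sim = max(-1.0, min(1.0, cosine_similarity(u, v)))
    return math.acos(sim)

# Three illustrative points: b lies halfway (in angle) between a and c.
a, b, c = (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)

# Under 1 - cos(theta), the detour a -> b -> c is shorter than a -> c.
direct = cosine_distance(a, c)
detour = cosine_distance(a, b) + cosine_distance(b, c)
violates_triangle = direct > detour

# The angular version respects the triangle inequality for the same points.
angular_ok = (
    angular_distance(a, c)
    <= angular_distance(a, b) + angular_distance(b, c) + 1e-12
)
```

Whether this matters depends on the use: methods that only rank neighbors are unaffected by a monotone transform of the similarity, while index structures that rely on the triangle inequality may be.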
-
I built an Image Search Engine using OpenAI CLIP and Images from Wikimedia
For this project I used Flask and OpenAI CLIP. For the vector search I used approximate nearest neighbors provided by spotify/annoy. I used Flask-SQLAlchemy with GeoAlchemy2 to query GPS coordinates. The embedding was done using UMAP.
-
We Analyzed 425,909 Favicons
Side note: instead of t-SNE, consider UMAP. It provides better results (and it's much faster): https://github.com/lmcinnes/umap
-
Finding correlating features in a large dataset.
Sounds like a job for UMAP (https://github.com/lmcinnes/umap)?
-
The most perplexing bug I've ever seen
I am a fairly experienced Python developer/researcher (about 10 years), and have found a bug that breaks all of my intuitions. I am messing with the [UMAP](https://github.com/lmcinnes/umap) repository and trying to add the option to disable some additional features. I've stripped everything from it but have a [quick test that will run my UMAP version and compare the outputs with what the original gave](https://github.com/Andrew-Draganov/probabilistic_dim_reduction/blob/master/umap/nndescent_umap_test.py). Managing my random seeds, same inputs, all that.
-
Question about numpy method I found in github project
I'm currently reading through a project on GitHub, https://github.com/lmcinnes/umap, and in `umap/umap_.py` at line 2287, they have this:
giotto-tda
What are some alternatives?
minisom - MiniSom is a minimalistic implementation of the Self Organizing Maps
findpeaks - The detection of peaks and valleys in a 1d-vector or 2d-array (image)
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
sktime - A unified framework for machine learning with time series
Traccar - Traccar GPS Tracking System
yellowbrick - Visual analysis and diagnostic tools to facilitate machine learning model selection.
vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
TARDIS - TARDIS: Topological Algorithms for Robust DIscovery of Singularities
Openstreetmap - The Rails application that powers OpenStreetMap
CLIP - CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Orion - Robust web visualization tool for OwnTracks location data
ivis - Dimensionality reduction in very large datasets using Siamese Networks