umap
minisom
Our great sponsors
umap | minisom | |
---|---|---|
10 | 3 | |
6,946 | 1,387 | |
- | - | |
8.3 | 8.4 | |
3 days ago | 3 days ago | |
Python | Python | |
BSD 3-clause "New" or "Revised" License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
umap
-
[OC] Clustering Images with OpenAI CLIP, T-SNE, UMAP & Plotly
UMAP GitHub repository: https://github.com/lmcinnes/umap
-
UMAP clustering in Ruby
Uniform Manifold Approximation and Projection (UMAP) is a well-known dimensionality reduction method along with t-SNE.
-
Introducing the Semantic Graph
A number of excellent topic modeling libraries exist in Python today. BERTopic and Top2Vec are two of the most popular. Both use sentence-transformers to encode data into vectors, UMAP for dimensionality reduction and HDBSCAN to cluster nodes.
-
Using the 80:20 rule, what top 20% of your tools, statistical tests, activities, etc. do you use to generate 80% of your results?
As with anything, it depends on the problem. But T-SNE and UMAP are often good.
-
[D] In UMAP and PyNNDescent, the conversion of Cosine and Correlation measures to distance metric seems problematic
UMAP distances.py: umap/distances.py at master ยท lmcinnes/umap (github.com)
-
I built an Image Search Engine using OpenAI CLIP and Images from Wikimedia
I used for this project Flask and OpenAI CLIP. For the vector search I used approximate nearest neighbors provided by spotify/annoy. I used Flask-SQLAlchemy with GeoAlchemy2 to query GPS coordinates. The embedding was done using UMAP.
-
We Analyzed 425,909 Favicons
side note: instead of t-SNE consider UMAP - provides better results (and it's much faster) https://github.com/lmcinnes/umap
-
Finding correlating features in a large dataset.
Sounds like a job for UMAP https://github.com/lmcinnes/umap ?
-
The most perplexing bug I've ever seen
I am a fairly experienced python developer/researcher (about 10 years), and have found a bug that breaks all of my intuitions. I am messing with the [UMAP](https://github.com/lmcinnes/umap) repository and trying to add the option to disable some additional features. I've stripped everything from it but have a [quick test that will run my UMAP version and compare the outputs with what the original gave](https://github.com/Andrew-Draganov/probabilistic_dim_reduction/blob/master/umap/nndescent_umap_test.py). Managing my random seeds, same inputs, all that.
-
Question about numpy method I found in github project
I'm currently reading through a project on github, https://github.com/lmcinnes/umap, and in `umap/umap_.py` at line 2287, they have this:
minisom
- How to use MiniSOM (Self Organizing Maps) Library
- [P][D] Self Organizing Maps
-
[OC] Animation of a Self Organizing Map
I made this animation because I could not find a single decent demonstration of a SOM map on the internet, especially considering how popular it is becoming. I used the python library Pyvista for 3D plotting and creating the animation, and I used the minisom library for running the SOM.
What are some alternatives?
giotto-tda - A high-performance topological machine learning toolbox in Python
somoclu - Massively parallel self-organizing maps: accelerate training on multicore CPUs, GPUs, and clusters
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
sparse-som - Efficient Self-Organizing Map for Sparse Data
Traccar - Traccar GPS Tracking System
susi - SuSi: Python package for unsupervised, supervised and semi-supervised self-organizing maps (SOM)
vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second ๐
DBCV - Python implementation of Density-Based Clustering Validation
Openstreetmap - The Rails application that powers OpenStreetMap
som-tsp - Solving the Traveling Salesman Problem using Self-Organizing Maps
CLIP - CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
n2d - A deep clustering algorithm. Code to reproduce results for our paper N2D: (Not Too) Deep Clustering via Clustering the Local Manifold of an Autoencoded Embedding.