extpp
umap
 | extpp | umap
---|---|---
Mentions | 1 | 10
Stars | 20 | 6,946
Growth | - | -
Activity | 10.0 | 8.3
Last commit | over 1 year ago | 5 days ago
Language | C++ | Python
License | BSD 2-clause "Simplified" License | BSD 3-clause "New" or "Revised" License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
extpp
-
UMAP clustering in Ruby
There are two ways to write Ruby extensions in C++. One is Rice and the other is extpp. In this case, I used Rice because I wanted to use numo.hpp to link Numo::NArray and C++.
umap
-
[OC] Clustering Images with OpenAI CLIP, T-SNE, UMAP & Plotly
UMAP GitHub repository: https://github.com/lmcinnes/umap
-
UMAP clustering in Ruby
Uniform Manifold Approximation and Projection (UMAP) is a well-known dimensionality reduction method along with t-SNE.
-
Introducing the Semantic Graph
A number of excellent topic modeling libraries exist in Python today. BERTopic and Top2Vec are two of the most popular. Both use sentence-transformers to encode data into vectors, UMAP for dimensionality reduction and HDBSCAN to cluster nodes.
-
Using the 80:20 rule, what top 20% of your tools, statistical tests, activities, etc. do you use to generate 80% of your results?
As with anything, it depends on the problem. But T-SNE and UMAP are often good.
-
[D] In UMAP and PyNNDescent, the conversion of Cosine and Correlation measures to distance metric seems problematic
UMAP distances.py: umap/distances.py at master · lmcinnes/umap (github.com)
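The thread's own argument is not reproduced on this page. As background only, one well-known reason similarity-derived measures need care is that `1 - cosine similarity` does not satisfy the triangle inequality, so it is not a true metric. The sketch below is plain Python and does not use UMAP's or PyNNDescent's code; the function names are illustrative, not taken from either library.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def cosine_distance(u, v):
    """The common '1 - similarity' conversion (illustrative helper)."""
    return 1.0 - cosine_similarity(u, v)

def angular_distance(u, v):
    """Normalized angle between the vectors, which is a proper metric on directions."""
    # Clamp to [-1, 1] so floating-point rounding cannot break acos.
    cos = max(-1.0, min(1.0, cosine_similarity(u, v)))
    return math.acos(cos) / math.pi

# Three directions in the plane: 0, 45 and 90 degrees.
a = (1.0, 0.0)
b = (1.0, 1.0)
c = (0.0, 1.0)

# Going a -> c directly costs more than the detour through b,
# which a true metric would never allow.
direct = cosine_distance(a, c)
via_b = cosine_distance(a, b) + cosine_distance(b, c)
triangle_violated = direct > via_b

# The angular version of the same comparison respects the inequality
# (here it holds with equality, up to rounding).
ang_direct = angular_distance(a, c)
ang_via_b = angular_distance(a, b) + angular_distance(b, c)
angular_ok = ang_direct <= ang_via_b + 1e-12

print(triangle_violated, angular_ok)
```

Whether this matters in practice depends on whether the nearest-neighbor search relies on the triangle inequality; the example only shows that the property fails for the `1 - similarity` form.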
-
I built an Image Search Engine using OpenAI CLIP and Images from Wikimedia
For this project I used Flask and OpenAI CLIP. For the vector search I used approximate nearest neighbors provided by spotify/annoy. I used Flask-SQLAlchemy with GeoAlchemy2 to query GPS coordinates. The embedding was done using UMAP.
-
We Analyzed 425,909 Favicons
side note: instead of t-SNE consider UMAP - provides better results (and it's much faster) https://github.com/lmcinnes/umap
-
Finding correlating features in a large dataset.
Sounds like a job for UMAP https://github.com/lmcinnes/umap ?
-
The most perplexing bug I've ever seen
I am a fairly experienced Python developer/researcher (about 10 years), and have found a bug that breaks all of my intuitions. I am messing with the [UMAP](https://github.com/lmcinnes/umap) repository and trying to add the option to disable some additional features. I've stripped everything from it but have a [quick test that will run my UMAP version and compare the outputs with what the original gave](https://github.com/Andrew-Draganov/probabilistic_dim_reduction/blob/master/umap/nndescent_umap_test.py). Managing my random seeds, same inputs, all that.
-
Question about numpy method I found in github project
I'm currently reading through a project on github, https://github.com/lmcinnes/umap, and in `umap/umap_.py` at line 2287, they have this:
What are some alternatives?
uwot - An R package implementing the UMAP dimensionality reduction method.
minisom - MiniSom is a minimalistic implementation of the Self Organizing Maps
numo-narray - Ruby/Numo::NArray - New NArray class library
giotto-tda - A high-performance topological machine learning toolbox in Python
umappp - UMAP C++ implementation
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
ruby-umappp - Uniform Manifold Approximation and Projection for Ruby
Traccar - Traccar GPS Tracking System
vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
Openstreetmap - The Rails application that powers OpenStreetMap
CLIP - CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Orion - Robust web visualization tool for OwnTracks location data