pyRDF2Vec
magnitude
Metric | pyRDF2Vec | magnitude
---|---|---
Mentions | 1 | 5
Stars | 154 | 1,517
Growth | 6.5% | 0.7%
Activity | 7.3 | 0.0
Last commit | 20 days ago | over 1 year ago
Language | Python | Python
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pyRDF2Vec
-
[P] pyRDF2Vec 0.2.0 is out!
This release is packed with many new features and optimizations under the hood. A full overview of what's new can be found in our CHANGELOG (https://github.com/IBCNServices/pyRDF2Vec/releases/tag/0.2.0). Some of the major updates:
magnitude
-
Text Classification Library for a Quick Baseline
(3) FastText now supports multiple languages [2].
[1] https://github.com/plasticityai/magnitude#pre-converted-magn...
-
Pgvector – vector similarity search for Postgres
Check out Magnitude, we built it to solve that problem: https://github.com/plasticityai/magnitude
It's still loaded from a file, but heavily uses memory-mapping and caching to be speedy and not overload your RAM immediately. And in production scenarios, multiple worker processes can share that memory due to the memory mapping.
Disclaimer: I'm the author.
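To make the memory-mapping idea in the comment above concrete, here is a minimal sketch using only the Python standard library. It is not Magnitude's implementation or file format: the flat float32 layout, the `DIM` constant, and the helper names `write_vectors` and `read_vector` are all assumptions made for illustration.

```python
import mmap
import os
import struct
import tempfile

DIM = 4  # hypothetical embedding dimension
ROW = struct.Struct(f"<{DIM}f")  # one vector = DIM little-endian float32 values


def write_vectors(path, rows):
    """Write fixed-size vectors back to back into a flat binary file."""
    with open(path, "wb") as f:
        for row in rows:
            f.write(ROW.pack(*row))


def read_vector(mm, index):
    """Decode one vector straight from the mapping, touching only its bytes."""
    return ROW.unpack_from(mm, index * ROW.size)


# Toy data; the values are exactly representable as float32.
rows = [(0.0, 0.5, 1.0, 1.5), (2.0, 2.5, 3.0, 3.5), (4.0, 4.5, 5.0, 5.5)]
path = os.path.join(tempfile.mkdtemp(), "vectors.bin")
write_vectors(path, rows)

with open(path, "rb") as f:
    # ACCESS_READ maps the file read-only; the OS pages data in on demand.
    mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    second = read_vector(mm, 1)
    mm.close()
```

Because the mapping is read-only and backed by the file, the operating system can typically serve several worker processes from the same cached pages, which is the sharing effect the comment describes.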
Our startup made a package powered by SQLite for this very purpose: https://github.com/plasticityai/magnitude
Might be worth checking out :)
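As a rough illustration of what "powered by SQLite" can mean for a vector store, the sketch below keeps each vector as a binary blob keyed by its word. The table name, column names, and the `put`/`get` helpers are hypothetical and are not taken from Magnitude's actual schema or API.

```python
import sqlite3
import struct

DIM = 3  # hypothetical embedding dimension
PACK = struct.Struct(f"<{DIM}f")  # float32 values packed into a BLOB

# An in-memory database keeps the sketch self-contained; a real store
# would point at a file on disk instead.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE vectors (word TEXT PRIMARY KEY, vec BLOB NOT NULL)")


def put(word, vec):
    """Insert or overwrite the vector stored for a word."""
    conn.execute(
        "INSERT OR REPLACE INTO vectors (word, vec) VALUES (?, ?)",
        (word, PACK.pack(*vec)),
    )


def get(word):
    """Return the vector for a word, or None if the word is unknown."""
    row = conn.execute(
        "SELECT vec FROM vectors WHERE word = ?", (word,)
    ).fetchone()
    return None if row is None else PACK.unpack(row[0])


put("cat", (0.5, 0.25, -1.0))
put("dog", (0.75, 0.5, -0.5))
```

With this layout, `get("cat")` reads a single indexed row rather than loading the whole vocabulary, so startup cost stays low even for large models.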
-
Build an Embeddings index from a data source
General language models from pymagnitude
-
Tutorial series on txtai
Backed by the pymagnitude library. Pre-trained word vectors can be installed from the referenced link.
What are some alternatives?
faiss - A library for efficient similarity search and clustering of dense vectors.
flashtext - Extract Keywords from sentence or Replace keywords in sentences.
milvus - An open-source vector database for scalable similarity search and AI applications.
txtai - 💡 Build AI-powered semantic search applications
sentence-transformers - Multilingual Sentence & Image Embeddings with BERT
mmocr - OpenMMLab Text Detection, Recognition and Understanding Toolbox
fastlangid - fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified Chinese (zh-hans) and Traditional Chinese (zh-hant)
finalfusion-rust - finalfusion embeddings in Rust
gensim - Topic Modelling for Humans
smlar - PostgreSQL extension for an effective similarity search || mirror of git://sigaev.ru/smlar.git || see https://www.pgcon.org/2012/schedule/track/Hacking/443.en.html
pretty-print-confusion-matrix - Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlib