ml-engineering
qdrant
ml-engineering | qdrant | |
---|---|---|
9 | 142 | |
9,928 | 18,219 | |
- | 4.8% | |
9.7 | 9.9 | |
9 days ago | 1 day ago | |
Python | Rust | |
Creative Commons Attribution Share Alike 4.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ml-engineering
- Accelerators
-
Gemma: New Open Models
There is a lot of work to make the actual infrastructure and lower level management of lots and lots of GPUs/TPUs open as well - my team focuses on making the infrastructure bit at least a bit more approachable on GKE and Kubernetes.
https://github.com/GoogleCloudPlatform/ai-on-gke/tree/main
and
https://github.com/google/xpk (a bit more focused on HPC, but includes AI)
and
https://github.com/stas00/ml-engineering (not associated with GKE, but describes training with SLURM)
The actual training is still a bit of a small pool of very experienced people, but it's getting better. And every day serving models gets that much faster - you can often simply draft on Triton and TensorRT-LLM or vLLM and see significant wins month to month.
- FLaNK Stack 29 Jan 2024
-
ML Engineering Online Book
OK, the pdf is ready now: https://github.com/stas00/ml-engineering#pdf-version
-
Self train a super tiny model recommendations
this might be interesting: https://github.com/stas00/ml-engineering/blob/master/transformers/make-tiny-models.md
- The AI Battlefield Engineering – What You Need to Know
- Machine Learning Engineering Guides and Tools
qdrant
-
Hindi-Language AI Chatbot for Enterprises Using Qdrant, MLFlow, and LangChain
Great. Now that we have the embeddings, we need to store them in a vector database. We will be using Qdrant for this purpose. Qdrant is an open-source vector database that allows you to store and query high-dimensional vectors. The easiest way to get started with the Qdrant database is using the docker.
-
Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant
I took Qdrant for this project. The reason was that Qdrant stands for high-performance vector search, the best choice against use cases like finding similar function calls based on semantic similarity. Qdrant is not only powerful but also scalable to support a variety of advanced search features that are greatly useful to nuanced caching mechanisms like ours.
-
Ask HN: Has Anyone Trained a personal LLM using their personal notes?
I'm currently looking to implement locally, using QDrant [1] for instance.
I'm just playing around, but it makes sense to have a runnable example for our users at work too :) [2].
[1]. https://qdrant.tech/
-
Show HN: A fast HNSW implementation in Rust
Also compare with qdrant's Rust implementation; they tout their performance. https://github.com/qdrant/qdrant/tree/master/lib/segment/src...
-
pgvecto.rs alternatives - qdrant and Weaviate
3 projects | 13 Mar 2024
-
Open-source Rust-based RAG
There are much better known examples, such as https://qdrant.tech/ and https://github.com/lancedb/lancedb
-
Qdrant 1.8.0 - Major Performance Enhancements
For more information, see our release notes. Qdrant is an open source project. We welcome your contributions; raise issues, or contribute via pull requests!
-
Perform Image-Driven Reverse Image Search on E-Commerce Sites with ImageBind and Qdrant
Initialize the Qdrant Client with in-memory storage. The collection name will be “imagebind_data” and we will be using cosine distance.
-
7 Vector Databases Every Developer Should Know!
Qdrant is an open-source vector search engine optimized for performance and flexibility. It supports both exact and approximate nearest neighbor search, providing a balance between accuracy and speed for various AI and ML applications.
- Ask HN: Who is hiring? (February 2024)
What are some alternatives?
slurm-mail - Slurm-Mail is a drop in replacement for Slurm's e-mails to give users much more information about their jobs compared to the standard Slurm e-mails.
Milvus - A cloud-native vector database, storage for next generation AI applications
peft - 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
faiss - A library for efficient similarity search and clustering of dense vectors.
pinferencia - Python + Inference - Model Deployment library in Python. Simplest model inference server ever.
pgvector - Open-source vector similarity search for Postgres
haystack - :mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Elasticsearch - Free and Open, Distributed, RESTful Search Engine
AtomGPT - 中英文预训练大模型,目标与ChatGPT的水平一致
towhee - Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.