Top 22 Python Vector Projects
-
quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
-
Your issue is that you're using the default (old) binding to GDAL, based on Fiona [0].
You need to use pyogrio [1], its vectorized counterpart, instead. Make sure you pass `engine="pyogrio"` when calling `to_file` [2]. Fiona loops in Python, while pyogrio runs entirely in compiled code, so pyogrio is usually about 10-15x faster than Fiona. Soon, in pyogrio version 0.8, it will be another ~2-4x faster than pyogrio is now [3].
[0]: https://github.com/Toblerity/Fiona
[1]: https://github.com/geopandas/pyogrio
[2]: https://geopandas.org/en/stable/docs/reference/api/geopandas...
[3]: https://github.com/geopandas/pyogrio/pull/346
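One way to check the speed claim above on your own data is to time the same write with each engine. The helper below is only a sketch: `time_to_file` is a hypothetical name, and it assumes `gdf` is a GeoDataFrame (or any object with the same `to_file(path, engine=...)` method) and that both Fiona and pyogrio are installed.

```python
import time
from pathlib import Path


def time_to_file(gdf, path, engines=("fiona", "pyogrio")):
    """Time `gdf.to_file(...)` once per engine; return seconds per engine.

    Assumption: `gdf` behaves like a geopandas.GeoDataFrame, i.e. it has a
    `to_file(path, engine=...)` method. Each engine writes to its own file
    so the runs do not overwrite each other.
    """
    path = Path(path)
    timings = {}
    for engine in engines:
        # e.g. "out.gpkg" becomes "out_pyogrio.gpkg"
        target = path.with_name(f"{path.stem}_{engine}{path.suffix}")
        start = time.perf_counter()
        gdf.to_file(str(target), engine=engine)
        timings[engine] = time.perf_counter() - start
    return timings
```

Calling `time_to_file(gdf, "out.gpkg")` returns a dict such as `{"fiona": ..., "pyogrio": ...}`. A single run is noisy, so repeat it a few times before drawing conclusions.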
-
udfs
Public Fused UDFs. Build any scale workflows with the Fused Python SDK and Workbench webapp, and integrate them into your stack with the Fused Hosted API. (by fusedio)
Project mention: A New Look: Solar Plant Prospection with Automated Viewshed Analysis | news.ycombinator.com | 2024-06-18
Yes, I made one using an open Python lib - https://github.com/fusedio/udfs/tree/main/community/fhk/pybd...
-
ChatData
ChatData brings RAG to real applications with FREE ✨ knowledge bases. Now enjoy your chat with 6 million Wikipedia pages and 2 million arXiv papers.
-
Personally I feel they are overkill.
For example, I'm using DuckDB as a vector store for similarity search and RAG. It works really well.
https://github.com/patricktrainer/duckdb-embedding-search
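To make "similarity search" concrete, here is a brute-force sketch in pure Python. It is a stand-in for the ranking such a setup performs, not DuckDB code and not the linked repository's implementation. The function names, the `(doc_id, vector)` row layout, and the toy 3-dimensional embeddings are all made up for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query, rows, k=2):
    """Return the ids of the k rows whose embeddings best match `query`.

    `rows` is a list of (doc_id, embedding) pairs, a hypothetical stand-in
    for a table with an id column and an embedding column.
    """
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in rows]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc_id for _, doc_id in scored[:k]]


# Toy data: real embeddings come from a model and have hundreds of dimensions.
ROWS = [
    ("cats", [1.0, 0.0, 0.0]),
    ("dogs", [0.9, 0.1, 0.0]),
    ("tax", [0.0, 0.0, 1.0]),
]
```

A full scan like this costs one similarity computation per stored row. That is often acceptable for small collections, which is the commenter's point about dedicated vector databases being overkill.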
-
CuVec
Unifying Python/C++/CUDA memory: Python buffered array ↔ `std::vector` ↔ CUDA managed memory
-
edgedressing
edgedressing leverages a Windows "feature" in order to force a target's Edge browser to open. This browser is then directed to a URL of choice.
For anyone asking what this has to do with NCSI, have a look at the README of the parent repo: https://github.com/stryngs/edgedressing/tree/main
It turns out that (modulo some details) it's possible to trick a Windows computer connected to a network you control into opening a browser that points to a URL of your choosing. That's because NCSI (the Network Connectivity Status Indicator) sends its probes over plain HTTP rather than HTTPS, so the usual injection attacks can be carried out without the user taking any action.
-
loadthemall
QGIS plugin that recursively loads vector and raster layers stored in a directory structure, based on several filters
-
Piazza-Updater
Piazza-Updater automates updates to a Weaviate database with real-time vectorial data. By continuously searching the internet and integrating with Verba repositories, it enhances retrieval-augmented generation (RAG) capabilities, keeping your applications informed and responsive.
Apache Polaris + Iceberg Quickstart; How to extract tables from pdfs; Microsoft 1bit LLM BitNet; Verifying Kafka Transactions Entry 2; FLUSS: Streaming Storage; Fluss -> Flow for Flink Real Time Analytics; TableFlow - iceberg / kafka; Snowflake Cortex AI + Slack; Door dash flink, kafka, snowflake; Prompt Stack -- all in one; SpaCY Layout for PDF; Responsible AI Pathways; Megaparse documents python; Time Series LLM; Generate Synthetic Data in Snowflake; LLMs and GenAI - When to use them; Flink Observability with Prometheus; New SQL GUI; TDD for GenAI; Open Source Agent Framework for Production; Cedit command line editor; ServiceNow AgentLab; Snowflake Lessons Learned in Replication; Privastead; Backup Icloud with nodejs on linux; Backup Google with nodejs on linux; HuggingFace macos chat source code; Ollama working with structured output; dspy ai how to; Piazza updater; Building a financial report with langgraph ColPali Notebook with QWEN 2 VL
-
YassQueenDB
Graph database library that allows you to store, analyze, and search through your data in a graph format. By using the Universal Sentence Encoder, it provides an efficient and semantic approach to handle text data.
-
vektor
a mini vector database implementation that intends to be educational and interpretable (by bxptr)
Python Vector discussion
Python Vector related posts
-
NCSI Probing Needs to Die
-
My take on CVE-2024-30078
-
Show HN: ChatData β an open-source ChatGPT-like chatbot
-
ChatData brings RAG to LLM apps with FREE knowledge base with files
-
Custom domains for Airtable
-
AI and the End of Programming
-
Textual Adds a Command Palette
-
A note from our sponsor - CodeRabbit
coderabbit.ai | 22 Mar 2025
Index
What are some of the best open-source Vector projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | quivr | 37,551 |
| 2 | TaskingAI | 5,076 |
| 3 | Fiona | 1,182 |
| 4 | pymilvus | 1,130 |
| 5 | vector-python-sdk | 568 |
| 6 | earthpy | 516 |
| 7 | snowChat | 512 |
| 8 | svg.py | 300 |
| 9 | klongpy | 289 |
| 10 | udfs | 226 |
| 11 | ChatData | 168 |
| 12 | Vector | 133 |
| 13 | issuer-icons | 129 |
| 14 | duckdb-embedding-search | 124 |
| 15 | CuVec | 80 |
| 16 | edgedressing | 50 |
| 17 | loadthemall | 36 |
| 18 | Piazza-Updater | 30 |
| 19 | catbench | 17 |
| 20 | YassQueenDB | 15 |
| 21 | vektor | 12 |
| 22 | matmath | 5 |