Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more ā
Top 23 Python vector-database Projects
-
-
Scout Monitoring
Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
-
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
-
txtai
š” All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
-
superduperdb
š® SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
-
lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
-
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
-
NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
-
-
-
langchain-chatbot
AI Chatbot for analyzing/extracting information from data in conversational format.
-
-
ChatData
ChatData š š brings RAG to real applications with FREEāØ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 million arxiv papers.
-
-
DocumentGPT
DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search using vector databases. This tool provides a seamless interface for interacting with your research document, exploring search results, and engaging in a conversation with an AI chatbot.
-
NeoGPT
Chat effortlessly, execute commands, and interpret code with Llama3, Phi3, and more - your local AI assistant. Enjoy seamless interaction while ensuring ultimate privacy
-
-
-
markdown-file-query
Semantic QA with a markdown database: Query any markdown file using vector embedding, Pinecone vector database and GPT (langchain). A weaker version of privateGPT
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Show HN: FileKitty ā Combine and label text files for LLM prompt contexts | news.ycombinator.com | 2024-05-01
There are much better known examples, such as https://qdrant.tech/ and https://github.com/lancedb/lancedb
In this blog post, Iāll be comparing 3 distinct AI-first code search tools I recently came across: Cody (developed by late-stage startup, Sourcegraph), SeaGOAT (an open-source project that was trending on HN last week), and Bloop (an early-stage YC startup). Iāll be evaluating them along the dimensions of user-friendliness as well as their accuracy.
To create a PineCone account, sign up via this link: https://www.pinecone.io/
Project mention: Show HN: Neum AI ā Open-source large-scale RAG framework | news.ycombinator.com | 2023-11-21Interesting to see that the semantic chunking in the tools library is a wrapper around GPT-4. Asks GPT for the python code and executes it: https://github.com/NeumTry/NeumAI/blob/main/neumai-tools/neu...
Project mention: Show HN: Chromem-go ā Embeddable vector database for Go | news.ycombinator.com | 2024-04-05Qdrant lib project https://github.com/tyrchen/qdrant-lib, Qdrant SDK has also support for local mode, which means embeddable https://github.com/qdrant/qdrant-client
Project mention: Show HN: LLMFlows ā LangChain alternative for explicit and transparent apps | news.ycombinator.com | 2023-07-29
Qdrantās benchmark results are strongly in favor of accuracy and efficiency. We recommend that you consider them before deciding that an LLM is enough. Take a look at our open-source benchmark reports and try out the tests yourself.
Project mention: Show HN: ChatData ā an open-source ChatGPT-like chatbot | news.ycombinator.com | 2023-11-28Hey there, wonderful Hacker News community! We're excited to share something special with you - ChatData. This isn't just another chat-with-documents app; it's a game-changer that melds MyScale and LangChain, empowering you to query millions of files effortlessly.
ChatData redefines the conversation between you and knowledge. Explore the MyScale free knowledge base or delve into your uploaded documents for tailored insights and answers.
Retriever Type: Fueled by the Retrieval Augmented Generation (RAG) framework, ChatData introduces the Self-querying retriever and VectorSQL. Build intricate queries effortlessly using LangChain, covering everything from timestamps to arrays of strings.
Session Management: Elevate your chat experience with intuitive session management. Customize your session ID, tweak prompts, and guide ChatData through your queries with ease. It's like having a personal conversation with your knowledge!
Build Your Own Knowledge Base: Beyond MyScale's external knowledge base, ChatData invites you to upload your files using the Unstructured API. Your privacy matters - only processed texts are stored. It's your knowledge, your way!
Whether you're a researcher, a student, or just someone hungry for knowledge, ChatData simplifies your journey through vast data. Unleash the true potential of information retrieval and explore a world of knowledge with a friendly touch.
We genuinely can't wait to hear your thoughts and feedback. Let's embark on this exciting journey of knowledge discovery together with ChatData (https://github.com/myscale/ChatData)!
Was really excited to get everything working! Check it out at: https://github.com/aju22/DocumentGPT
Get the source code (and leave a little ā while you're there): https://github.com/AstraBert/everything-ai Get a quick-start with the documentation: https://astrabert.github.io/everything-ai/
One of the most interesting projects I came across this month was NeoGPT. It's a GPT based application that is being built to converse with documents and videos. While still in its infancy, the project has outlined a cool roadmap and has a very active base of contributors continuously expanding on its functionality. The project appeals to my desire to learn how to work with AI and neural networks. It is also at a development stage that it is not outside of the reach of my comprehension. Icing on the cake being it's Py based, which is my sharpest tool at the moment. I see it as a decent project to stay tapped into and grow my skills as the application develops.
Project mention: [D] ChatGPT4 doesnāt cut it for my work. Need a more accurate tool. | /r/MachineLearning | 2023-12-06We have a research-focused framework for these kinds of tasks here: https://github.com/biocypher/biochatter. Requests and contributions welcome.
Python vector-database discussion
Python vector-database related posts
-
Build a simple RAG chatbot with LangChain...
-
RAG is Dead. Long Live RAG!
-
7 Vector Databases Every Developer ShouldĀ Know!
-
Qdrant, the Vector Search Database, raised $28M in a Series A round
-
Using Vector Embeddings to Overengineer 404 pages
-
Pinecone: Build Knowledgeable AI
-
Vector Databases: A Technical Primer [pdf]
-
A note from our sponsor - InfluxDB
www.influxdata.com | 15 Jun 2024
Index
What are some of the best open-source vector-database projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | llama_index | 32,542 |
2 | deeplake | 7,826 |
3 | txtai | 7,265 |
4 | superduperdb | 4,462 |
5 | lancedb | 3,312 |
6 | autollm | 936 |
7 | SeaGOAT | 929 |
8 | canopy | 915 |
9 | NeumAI | 795 |
10 | qdrant-client | 647 |
11 | llmflows | 629 |
12 | vectordb | 489 |
13 | langchain-chatbot | 384 |
14 | vector-db-benchmark | 237 |
15 | ChatData | 137 |
16 | relevanceai | 105 |
17 | DocumentGPT | 105 |
18 | everything-ai | 103 |
19 | citrus | 93 |
20 | NeoGPT | 66 |
21 | vector-db-benchmark | 58 |
22 | biochatter | 47 |
23 | markdown-file-query | 25 |