SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python Database Projects
-
quivr
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
git clone https://github.com/sqlmapproject/sqlmap.git
-
pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
View the Project on GitHub
-
Essentially this solution is to let the AI *formulate the search* expression and not do the search itself (similar to the concept of generating a SQL statement instead of executing it https://github.com/vanna-ai/vanna).
-
DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
-
gel
Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more. (by geldata)
[2] https://github.com/geldata/gel/issues/4228#issuecomment-1208...
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Project mention: SQLiteStudio: Create, edit, browse SQLite databases | news.ycombinator.com | 2024-11-24
One day I set out to resolve this, though I can not find in my notes' files what the solution was (for MySQL). It might have been to simply use MyCLI instead of the standard MySQL CLI. Have you tried it?
https://github.com/dbcli/mycli
-
Project mention: XAN: A Modern CSV-Centric Data Manipulation Toolkit for the Terminal | news.ycombinator.com | 2025-03-27
I used to use q for this sort of thing. Not sure if there are better choices now as it have been a few years.
https://harelba.github.io/q/
-
-
I felt the same - have to relearn/lookup everything every time I went back to a project or wanted to do some operations that are simple to describe in SQL but I couldn't wrap my mind around e.g. using multi-indexed dataframes & aggregations properly. These days, I always jump to Polars instead of Pandas - much more intuitive and consistent API. Tons of props to Pandas for all that they did (and continue to do) in the data space, but their API did not evolve very well IMO.
I've also been wanting to play with Ibis[1] recently, but Polars has been sufficient for me.
[1] https://ibis-project.org/
-
Project mention: Build fully portable AI applications on top of Snowflake with SuperDuperDB | dev.to | 2024-06-26
Customize how AI and databases work together. Scale your AI projects to handle more data and users. Move AI projects between different environments easily. Extend the system with new AI features and database functionality. Check it out: Blog: https://blog.superduperdb.com/version-02 Github: https://github.com/SuperDuperDB/superduperdb (leave us a star ⭐️🥳)
-
FinanceDatabase
This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.
-
dataset
Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.
-
-
-
tencent-ml-images
Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet
-
-
pgadmin4
pgAdmin is the most popular and feature rich Open Source administration and development platform for PostgreSQL, the most advanced Open Source database in the world.
Project mention: From CSV to SQL: Importing CSV Real World Data into PostgreSQL. | dev.to | 2025-05-03Depending on your tools and environment, there are several options for importing data, from using a graphical tool like pgAdmin to a direct SQL query. For my case, I was using a locally hosted PostgreSQL, and found the command line to be convenient.
-
PyPika
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
Project mention: FastAPI, Pydantic, Psycopg3: the holy trinity for Python web APIs | dev.to | 2024-10-24PyPika: I don't know much about this one.
-
baserow
The official repository is hosted on https://gitlab.com/bramw/baserow. Baserow is an open source no-code database tool and Airtable alternative.
Project mention: Baserow 1.32: Dashboards, enhanced Airtable import, SSO for Application Builder, and more - Open Source Airtable Alternative | dev.to | 2025-03-19Try out Baserow 1.32: https://baserow.io
-
-
GarminDB
Download and parse data from Garmin Connect or a Garmin watch, FitBit CSV, and MS Health CSV files into and analyze data in Sqlite serverless databases with Jupyter notebooks.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Database discussion
Python Database related posts
-
Bidirectional Enhancement of Graph and Agent
-
Chat2Graph: Graph Native Agentic System
-
From CSV to SQL: Importing CSV Real World Data into PostgreSQL.
-
Show HN: Morphik – Open-source MCP server for technical document search
-
Ask HN: What RAG evaluations do you care about?
-
Show HN: I built an open-source NotebookLM alternative using Morphik
-
XAN: A Modern CSV-Centric Data Manipulation Toolkit for the Terminal
-
A note from our sponsor - SaaSHub
www.saashub.com | 21 May 2025
Index
What are some of the best open-source Database projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | quivr | 37,875 |
2 | SQLMap | 34,136 |
3 | pandas-ai | 20,204 |
4 | vanna | 17,655 |
5 | DB-GPT | 16,512 |
6 | gel | 13,655 |
7 | pgcli | 12,438 |
8 | mycli | 11,636 |
9 | q | 10,282 |
10 | TinyDB | 7,114 |
11 | ibis | 5,759 |
12 | superduper | 5,058 |
13 | FinanceDatabase | 4,920 |
14 | dataset | 4,798 |
15 | sqlite-web | 3,698 |
16 | dataherald | 3,495 |
17 | tencent-ml-images | 3,067 |
18 | arctic | 3,065 |
19 | pgadmin4 | 2,927 |
20 | PyPika | 2,673 |
21 | baserow | 2,672 |
22 | evadb | 2,670 |
23 | GarminDB | 2,430 |