InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 23 SQL Open-Source Projects
-
devops-exercises
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
Project mention: Struggling to Learn DevOps? Here’s How to Master It in Just 30 Days | dev.to | 2025-04-25DevOps Exercises Interview prep + practical questions
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
Apache Iceberg defines a table format that separates how data is stored from how data is queried. Any engine that implements the Iceberg integration — Spark, Flink, Trino, DuckDB, Snowflake, RisingWave — can read and/or write Iceberg data directly.
-
Project mention: Show HN: Hacker News historic upvote and score data | news.ycombinator.com | 2025-06-03
-
tidb
TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.
For tooling developers, the libpg_query project extracts PostgreSQL's actual parser code, providing 100% compatibility with PostgreSQL's SQL parsing behavior. For MySQL, the most widely used parser is from TiDB, but it has compatibility differences since it was built specifically for TiDB's needs rather than pure MySQL compatibility.
-
cockroach
CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
-
Directus
The flexible backend for all your projects 🐰 Turn your DB into a headless CMS, admin panels, or apps with a custom UI, instant APIs, auth & more.
Project mention: Directus.js (JavaScript SDK) – The official client for Directus APIs | dev.to | 2025-06-11If you're working with Directus, the open-source headless CMS and data platform, then Directus.js is your new best friend.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Have been building drawDB[1] for a while now. It's a database schema visualizer. Currently working on adding support for oracle databases. Wrote a parser[2] to allow importing from oracle sql. Have been struggling with motivation though, the pieces are pretty much there but I've been procrastinating on putting it all together. This has been my main side project for almost 2 years now.. I miss the feeling of novelty.. but can't come up with something worth building..idk
[1] https://github.com/drawdb-io/drawdb
[2] https://github.com/1ilit/oracle-sql-parser
-
Thanks for creating this issue, it is worth investigating!
I see you also created similar issues in Polars: https://github.com/pola-rs/polars/issues/17932 and DuckDB: https://github.com/duckdb/duckdb/issues/17066
ClickHouse has a built-in memory tracker, so even if there is not enough memory, it will stop the query and send an exception to the client, instead of crashing. It also allows fair sharing of memory between different workloads.
You need to provide more info on the issue for reproduction, e.g., how to fill the tables. 16 GB of memory should be enough even for a CROSS JOIN between a 10 billion-row and a 100-row table, because it is processed in a streaming fashion without accumulating a large amount of data in memory. The same should be true for a merge join.
However, there are places when a large buffer might be needed. For example, if you insert data into a table backed by S3 storage, it requires a buffer that can be in the order of 500 MB.
There is a possibility that your machine has 16 GB of memory, but most of it is consumed by Chrome, Slack, or Safari, and not much is left for ClickHouse server.
-
Sequelize
Feature-rich ORM for modern Node.js and TypeScript, it supports PostgreSQL (with JSON and JSONB support), MySQL, MariaDB, SQLite, MS SQL Server, Snowflake, Oracle DB (v6), DB2 and DB2 for IBM i.
Object-Relational Mapping frameworks like Hibernate (Java), SQLAlchemy (Python), and Sequelize (Node.js) typically use parameterized queries by default and abstract direct SQL interaction. These frameworks help eliminate common developer errors that might otherwise introduce vulnerabilities.
-
Project mention: SurrealDB 2.2: Benchmarking, graph path algorithms and foreign key constraints | dev.to | 2025-03-17
To make this better, we've created a language testing suite similar to the ECMAscript conformance testing suite test262.
-
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Project mention: Data-engineer-handbook: everything to learn about data engineering | news.ycombinator.com | 2024-12-03This thing points to some sort of github metrics dashboard.
The actual handbook is at: https://github.com/DataExpert-io/data-engineer-handbook
-
drizzle-orm
Headless TypeScript ORM with a head. Runs on Node, Bun and Deno. Lives on the Edge and yes, it's a JavaScript ORM too 😅
Database ORM: Drizzle ORM
-
This is especially helpful because Supabase uses PostegREST under the hood, which does not support OpenAPI 3.x. Note that some Postgres specific features like enums are not converted to JSON Schema enums as Sequelize doesn't expose that information.
-
Project mention: My personal favorite MCP server which has became part of my life | dev.to | 2025-05-27
GitHub: github.com/apache/flink
-
TDengine
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
Project mention: Why SSDLC needs static analysis: a case study of 190 bugs in TDengine | dev.to | 2025-05-12We'll continue examining the TDengine project, which we've covered in three small notes on code refactoring:
-
Project mention: SQL powered operating system instrumentation, monitoring, and analytics | news.ycombinator.com | 2024-09-10
-
pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
View the Project on GitHub
-
shardingsphere
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
Project mention: Apache ShardingSphere: Effortless Database Scaling and Beyond! | dev.to | 2025-02-21View the Project on GitHub
-
-
Knex
A query builder for PostgreSQL, MySQL, CockroachDB, SQL Server, SQLite3 and Oracle, designed to be flexible, portable, and fun to use.
We can reuse this logic with the help of query builders like Knex and its modify method.
-
beekeeper-studio
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
-
TimescaleDB
A time-series database for high-performance real-time analytics packaged as a Postgres extension
timescaledb A solution by Timescale. Provides a lot more functions to handle time series than pg_timeseries. Low latency makes it adequate for real-time analytics. Supports incremental views through continuous aggregates. Has some overlap with pg_mooncake, but can't write to Iceberg or Delta Lake, using them directly as the storage layer. Supports tiered storage
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
SQL discussion
SQL related posts
-
The Missing Manual for Signals: State Management for Python Developers
-
Malloy: An open-source language for analyzing, transforming, and modeling data
-
Why Pandas feels clunky when coming from R (2024)
-
Test Postgres in Python Like SQLite
-
Show HN: Arkflow will soon support Python processors
-
Declarative database development and database portability tool
-
How to Run CRON Jobs in Postgres Without Extra Infrastructure
-
A note from our sponsor - InfluxDB
www.influxdata.com | 14 Jun 2025
Index
What are some of the best open-source SQL projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | devops-exercises | 76,540 |
2 | dbeaver | 43,948 |
3 | Apache Spark | 41,304 |
4 | ClickHouse | 41,164 |
5 | tidb | 38,570 |
6 | cockroach | 30,980 |
7 | Directus | 30,844 |
8 | drawdb | 30,441 |
9 | DuckDB | 30,100 |
10 | Sequelize | 30,025 |
11 | surrealdb | 29,384 |
12 | data-engineer-handbook | 29,151 |
13 | drizzle-orm | 28,678 |
14 | postgrest | 25,387 |
15 | Apache Flink | 24,942 |
16 | TDengine | 23,983 |
17 | OSQuery | 22,521 |
18 | pandas-ai | 20,518 |
19 | shardingsphere | 20,278 |
20 | MyBatis | 20,109 |
21 | Knex | 19,862 |
22 | beekeeper-studio | 19,427 |
23 | TimescaleDB | 19,267 |