SQL

Top 23 SQL Open-Source Projects

  1. devops-exercises

    Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

    Project mention: 10 GitHub Repositories That Will Actually Teach You DevOps in 2026 | dev.to | 2026-05-05

    github.com/bregman-arie/devops-exercises. 82k stars. Maintained by Arie Bregman, ex-Red Hat.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. dbeaver

    Free universal database tool and SQL client

    Project mention: CLI to manage your SQL database schemas and migrations | news.ycombinator.com | 2025-11-01
  4. ClickHouse

    ClickHouse® is a real-time analytics database management system

    Project mention: ClickHouse Pricing Teardown 2026 | dev.to | 2026-05-27

    Website: clickhouse.com

  5. Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Project mention: MLOps Lifecycle: Stages, Workflow, and Best Practices | dev.to | 2026-06-02

    Feature transformations should be deterministic: The same input should produce the same output when the same feature definition and configuration are applied. This is what allows training, backtesting, and live inference to remain aligned. Tools such as Pandas, Spark, or feature platforms such as Feast can be used to implement that logic.

  6. data-engineer-handbook

    This is a repo with links to everything you'd ever want to learn about data engineering

  7. tidb

    TiDB is built for agentic workloads that grow unpredictably, with ACID guarantees and native support for transactions, analytics, and vector search. No data silos. No noisy neighbors. No infrastructure ceiling.

    Project mention: Go vet can't go: How PVS-Studio analyzes Go projects | dev.to | 2026-02-11

    A similar issue was also found in Tidb:

  8. DuckDB

    DuckDB is an analytical in-process SQL database management system

    Project mention: Readsb ADS-B Aircraft Local State Archive | dev.to | 2026-05-27

    wget https://github.com/duckdb/duckdb/releases/latest/download/duckdb_cli-linux-arm64.zip unzip duckdb_cli-linux-arm64.zip sudo mv duckdb /usr/local/bin/

  9. drawdb

    Free, simple, and intuitive online database diagram editor and SQL generator.

    Project mention: Ask HN: What Are You Working On? (September 2025) | news.ycombinator.com | 2025-09-29
  10. Directus

    The flexible backend for all your projects 🐰 Turn your DB into a headless CMS, admin panels, or apps with a custom UI, instant APIs, auth & more.

    Project mention: Which No-Code Database in 2026: Tested & Compared | dev.to | 2026-05-07

    How does Directus 2026.0 compare to NocoDB 2026.1 for teams needing custom API extensions?

  11. drizzle-orm

    ORM

    Project mention: Flat Chat Threads Suck for Reading Books. So I Built a Local-First AI Tree Companion. | dev.to | 2026-06-08

    Built on the Pi SDK for tree-structured agent conversations, Hono for a lightweight server (Electron-friendly), and SQLite with Drizzle ORM for metadata.

  12. surrealdb

    A scalable, distributed, collaborative, document-graph database, for the realtime web

    Project mention: SurrealMX: In-memory storage with time travel and persistent storage | dev.to | 2026-03-26

    SurrealMX is SurrealDB's in-memory storage engine that was built from the ground up a bit over a year ago. It was merged into the core database during the 3.0 alpha period last November with no fanfare at all in order to try it out for the first time as SurrealDB's in-memory storage engine. Since SurrealDB uses in-memory storage by default, that means that SurrealMX has been every 3.x user's default storage option since then.

  13. cockroach

    CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

    Project mention: Ask HN: What is the most modular sync engine? | news.ycombinator.com | 2025-12-25
  14. Sequelize

    Feature-rich ORM for modern Node.js and TypeScript, it supports PostgreSQL (with JSON and JSONB support), MySQL, MariaDB, SQLite, MS SQL Server, Snowflake, Oracle DB, DB2 and DB2 for IBM i.

    Project mention: JavaScript Awesome Package | dev.to | 2026-02-03

    Sequelize - promise-based Node.js ORM for Postgres, MySQL, MariaDB, SQLite and Microsoft SQL Server

  15. postgrest

    REST API for any Postgres database

    Project mention: GeoJSON | news.ycombinator.com | 2026-05-08
  16. TDengine

    High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios

  17. pandas-ai

    Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

    Project mention: 📰 All Data and AI Weekly #231-02March2026 | dev.to | 2026-03-02

    Pandas-AI: Talk to your dataframes in natural language.

  18. OSQuery

    SQL powered operating system instrumentation, monitoring, and analytics.

    Project mention: Show HN: Network Monitor – a GUI to spot anomalous connections on your Linux | news.ycombinator.com | 2025-11-24

    Cool project, I wish we had more GUIs for these OS functions. How was your experience with GTK4 and Rust?

    And it's a bit sad that in the year of our lord 2025, the best way to get such fundamental information is by using regexes to parse a table[1], generated by a 6000-line C program[2], which is verified by (I hope I'm wrong!) a tiny test suite[3]. OSQuery[4] is also pretty cool, but it builds upon this fragile stack.

    That's something I miss from Windows, at least PowerShell has built-in commands that give you structured output.

    [1] https://github.com/grigio/network-monitor/blob/9dc470553bfdd...

    [2] https://github.com/iproute2/iproute2/blob/main/misc/ss.c

    [3] https://github.com/iproute2/iproute2/blob/main/testsuite/tes...

    [4] https://osquery.io/

  19. dolt

    Dolt – Git for Data

    Project mention: Multi-Agent Development Workflows with Claude Code | dev.to | 2026-05-19

    I'm using Beads for this. It stores data locally via Dolt, gives agents programmatic access, and handles dependencies between tasks. The key commands:

  20. beekeeper-studio

    Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.

    Project mention: Best Database Clients in 2026: Top SQL GUI Tools Compared | dev.to | 2026-06-04

    Beekeeper Studio is a modern SQL editor and database manager with an open-source community edition. It supports many databases, including MySQL, PostgreSQL, SQLite, SQL Server, ClickHouse, DuckDB, MariaDB, Oracle, Redis, Redshift, Trino, and more.

  21. TimescaleDB

    A time-series database for high-performance real-time analytics packaged as a Postgres extension

    Project mention: GeoJSON | news.ycombinator.com | 2026-05-08
  22. chartdb

    Database diagrams editor that allows you to visualize and design your DB with a single query.

    Project mention: Show HN: Clawe – open-source Trello for agent teams | news.ycombinator.com | 2026-02-10

    the limits of a no-code website editor https://chartdb.io/ that's hilarious

  23. marimo

    A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

    Project mention: Pluto.jl 1.0 release – reactive notebook for Julia | news.ycombinator.com | 2026-06-03

    Pluto is great. I use it all the time. If you like the reactivity/reproducibility but are wedded to Python, you might want to check out Marimo, which is also great. [https://marimo.io/]

    It too puts the output of a cell above the code so if you're unable to adapt to things that are different it's also probably not for you.

    FWIW, Observable's Notebooks (Javascript) work the same way: output above the code that produces it. [https://observablehq.com/]

    I too did not like having the output above the code but got over it pretty quickly. For plots, it's arguably better: usually, I want to see the plot before I see the 15 line invocation of some plot command. The thing that bugs me the most about Pluto now is that it really wants you to only have a single evaluating statement per cell. You have to wrap stuff in "block......end" if you want to e.g. define more than one variable in a cell.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

SQL discussion

Log in or Post with

SQL related posts

Index

What are some of the best open-source SQL projects? This list will help you:

# Project Stars
1 devops-exercises 82,703
2 dbeaver 50,531
3 ClickHouse 47,973
4 Apache Spark 43,440
5 data-engineer-handbook 41,634
6 tidb 40,138
7 DuckDB 38,728
8 drawdb 37,345
9 Directus 35,995
10 drizzle-orm 34,694
11 surrealdb 32,379
12 cockroach 32,204
13 Sequelize 30,350
14 postgrest 27,226
15 Apache Flink 26,068
16 TDengine 24,897
17 pandas-ai 23,583
18 OSQuery 23,308
19 dolt 23,410
20 beekeeper-studio 23,000
21 TimescaleDB 22,824
22 chartdb 22,353
23 marimo 21,378

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com