Top 23 SQL Open-Source Projects

  • devops-exercises

    Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

    Project mention: DevOps Exercises | news.ycombinator.com | 2023-08-07
  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Project mention: Integrate Pyspark Structured Streaming with confluent-kafka | dev.to | 2023-08-12

    Apache Spark - https://spark.apache.org/

  • SonarLint

    Clean code begins in your IDE with SonarLint. Up your coding game and discover issues early. SonarLint is a free plugin that helps you find & fix bugs and security issues from the moment you start writing code. Install from your favorite IDE marketplace today.

  • tidb

    TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://tidbcloud.com/free-trial

    Project mention: TiDB: Open-source, cloud-native, distributed, MySQL compatible database | news.ycombinator.com | 2023-08-30
  • dbeaver

    Free universal database tool and SQL client

    Project mention: 👻Top 8 Free, Open Source SQL Clients🔥 | dev.to | 2023-09-04

    DBeaver is a veteran SQL client. In addition to basic visualization and management capabilities, it has a SQL editor, data and schema migration capabilities, monitor database connections, and more. It supports a full range of databases (both SQL and NoSQL). DBeaver is also hooked up with GPT-3, which converts your natural language to SQL.

  • ClickHouse

    ClickHouse® is a free analytics DBMS for big data

    Project mention: ClickHouse Keeper: A ZooKeeper alternative written in C++ | news.ycombinator.com | 2023-09-27

    We hope not and try to keep in wire compatible for clients to interact (recently added dynamic reconfig, etc.)

    It is definitely opinionated and influenced by our work...but not designed solely for it.

    But, also, we continue to improve. Most notably in the work on Multi-group Raft - https://github.com/ClickHouse/ClickHouse/issues/54172

  • Sequelize

    Feature-rich ORM for modern Node.js and TypeScript, it supports PostgreSQL (with JSON and JSONB support), MySQL, MariaDB, SQLite, MS SQL Server, Snowflake, Oracle DB (v6), DB2 and DB2 for IBM i.

    Project mention: Contributing to Tech Communities: How Open-Source can land you a job and get you out of the Skill Paradox 💼 | dev.to | 2023-09-26

    I made a small change to the new documentation for Sequelize! I was just scrolling through the documentation and found this mistake that could lead others to weird debugging sessions, so as soon as I found it, I submitted a PR for them! You can check out the contribution here!

  • cockroach

    CockroachDB - the open source, cloud-native distributed SQL database.

    Project mention: Is it bad to create a publicly accessible RDS database for my serverless web app? | /r/aws | 2023-08-11

    For example, when you create a serverless postgres database with a platform like CockroachDB or Neon, you effectively get a connection string with a strong password. Anyone can connect to your database from anywhere so long as they have the right connection string. There are no security settings in these services to change this behavior.

  • Mergify

    Updating dependencies is time-consuming.. Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.

  • Directus

    The Modern Data Stack 🐰 — Directus is an instant REST+GraphQL API and intuitive no-code data collaboration app for any SQL database.

    Project mention: Building a User Feedback Widget with Vue.js and Directus | dev.to | 2023-09-21

    One of our DevRel initiatives at Directus is constantly improving our documentation. As a small team with finite time and resources, we rely a lot on user feedback to help guide our writing efforts. But we were missing the most important bit there – your feedback.

  • surrealdb

    A scalable, distributed, collaborative, document-graph database, for the realtime web

    Project mention: How to Design a SurrealDB schema and create a basic client for TypeScript | dev.to | 2023-09-17

    In the midst of a dynamic landscape of exciting new projects, one name shines bright — SurrealDB.

  • TDengine

    TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.

    Project mention: TDengine: NEW Data - star count:21596.0 | /r/algoprojects | 2023-08-06
  • postgrest

    REST API for any Postgres database

    Project mention: Production REST API Infrastructure | /r/webdev | 2023-06-18
  • OSQuery

    SQL powered operating system instrumentation, monitoring, and analytics.

    Project mention: Alternative to Endpoint Protector? | /r/selfhosted | 2023-05-06

    From a self hosted standpoint OSQuery or Wazuh are your best bets for monitoring USB devices. Windows makes blocking really challenging and I’m not aware of any “free” solutions that attempt it.

  • MyBatis

    MyBatis SQL mapper framework for Java

    Project mention: MyBatis VS jOOQ - a user suggested alternative | libhunt.com/r/mybatis-3 | 2022-11-25
  • shardingsphere

    Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

    Project mention: Managing Data Residency - the demo | dev.to | 2023-05-25

    Opposite to what the documentation tells, the full prefix is jdbc:shardingsphere:absolutepath. I've opened a PR to fix the documentation.

  • Knex

    A query builder for PostgreSQL, MySQL, CockroachDB, SQL Server, SQLite3 and Oracle, designed to be flexible, portable, and fun to use.

    Project mention: Plankaban Raspberry Pi 4 Setup Help | /r/selfhosted | 2023-06-08

    # related: https://github.com/knex/knex/issues/2354

  • Dapper

    Dapper - a simple object mapper for .Net

    Project mention: Interceptors (new C# metaprogramming feature) to fuel DapperAOT development | news.ycombinator.com | 2023-06-21
  • cube.js

    📊 Cube — The Semantic Layer for Building Data Applications

    Project mention: GigaOm Sonar Report Reviews Semantic Layer and Metric Store Vendors | news.ycombinator.com | 2023-09-08

    https://github.com/cube-js/cube comes out very well at the end as a promising open source system, getting rather close to the bullseye. Would love to know more & hear people's experience with it.

  • dolt

    Dolt – Git for Data

    Project mention: SQLedge: Replicate Postgres to SQLite on the Edge | news.ycombinator.com | 2023-08-09

    #. SQLite WAL mode

    From https://www.sqlite.org/isolation.html https://news.ycombinator.com/item?id=32247085 :

    > [sqlite] WAL mode permits simultaneous readers and writers. It can do this because changes do not overwrite the original database file, but rather go into the separate write-ahead log file. That means that readers can continue to read the old, original, unaltered content from the original database file at the same time that the writer is appending to the write-ahead log

    #. superfly/litefs: aFUSE-based file system for replicating SQLite https://github.com/superfly/litefs

    #. sqldiff: https://www.sqlite.org/sqldiff.html https://news.ycombinator.com/item?id=31265005

    #. dolthub/dolt: https://github.com/dolthub/dolt

    > Dolt can be set up as a replica of your existing MySQL or MariaDB database using standard MySQL binlog replication. Every write becomes a Dolt commit. This is a great way to get the version control benefits of Dolt and keep an existing MySQL or MariaDB database.

    #. pganalyze/libpg_query: https://github.com/pganalyze/libpg_query :

    > C library for accessing the PostgreSQL parser outside of the server environment

    #. Ibis + Substrait [ + DuckDB ]

    > ibis strives to provide a consistent interface for interacting with a multitude of different analytical execution engines, most of which (but not all) speak some dialect of SQL.

    > Today, Ibis accomplishes this with a lot of help from `sqlalchemy` and `sqlglot` to handle differences in dialect, or we interact directly with available Python bindings (for instance with the pandas, datafusion, and polars backends).

    > [...] `Substrait` is a new cross-language serialization format for communicating (among other things) query plans. It's still in its early days, but there is already nascent support for Substrait in Apache Arrow, DuckDB, and Velox.

    #. benbjohnson/postlite: https://github.com/benbjohnson/postlite

    > postlite is a network proxy to allow access to remote SQLite databases over the Postgres wire protocol. This allows GUI tools to be used on remote SQLite databases which can make administration easier.

    > The proxy works by translating Postgres frontend wire messages into SQLite transactions and converting results back into Postgres response wire messages. Many Postgres clients also inspect the pg_catalog to determine system information so Postlite mirrors this catalog by using an attached in-memory database with virtual tables. The proxy also performs minor rewriting on these system queries to convert them to usable SQLite syntax.

    > Note: This software is in alpha. Please report bugs. Postlite doesn't alter your database unless you issue INSERT, UPDATE, DELETE commands so it's probably safe. If anything, the Postlite process may die but it shouldn't affect your database.

    #. > "Hosting SQLite Databases on GitHub Pages" (2021) re: sql.js-httpvfs, DuckDB https://news.ycombinator.com/item?id=28021766

    #. awesome-db-tools https://github.com/mgramin/awesome-db-tools

  • TimescaleDB

    An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.

    Project mention: How to setup Postgres master-master cluster. | /r/sysadmin | 2023-09-05

    Offboard it to Postgres specialists like https://www.timescale.com/

  • Presto

    The official home of the Presto distributed SQL query engine for big data

    Project mention: Ask HN: What are some SQL transpilers? | news.ycombinator.com | 2023-07-14
  • rqlite

    The lightweight, distributed relational database built on SQLite

    Project mention: I'm All-In on Server-Side SQLite | news.ycombinator.com | 2023-09-22
  • go-sql-driver/mysql

    Go MySQL Driver is a MySQL driver for Go's (golang) database/sql package (by go-sql-driver)

    Project mention: Tools besides Go for a newbie | /r/golang | 2023-03-26

    IDE: use whatever make you productive. I personally use vscode. VCS: git, as golang communities use github heavily as base for many libraries. AFAIK Linter: use staticcheck for linting as it looks like mostly used linting tool in go, supported by many also. In Vscode it will be recommended once you install go plugin. Libraries/Framework: actually the standard libraries already included many things you need, decent enough for your day-to-day development cycles(e.g. `net/http`). But here are things for extra: - Struct fields validator: validator - Http server lib: chi router , httprouter , fasthttp (for non standard http implementations, but fast) - Web Framework: echo , gin , fiber , beego , etc - Http client lib: most already covered by stdlib(net/http), so you rarely need extra lib for this, but if you really need some are: resty - CLI: cobra - Config: godotenv , viper - DB Drivers: sqlx , postgre , sqlite , mysql - nosql: redis , mongodb , elasticsearch - ORM: gorm , entgo , sqlc(codegen) - JS Transpiler: gopherjs - GUI: fyne - grpc: grpc - logging: zerolog - test: testify , gomock , dockertest - and many others you can find here

  • InfluxDB

    Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-09-27.

SQL related posts


What are some of the best open-source SQL projects? This list will help you:

Project Stars
1 devops-exercises 58,178
2 Apache Spark 36,785
3 tidb 34,864
4 dbeaver 33,897
5 ClickHouse 30,963
6 Sequelize 28,426
7 cockroach 27,854
8 Directus 23,214
9 surrealdb 22,480
10 Apache Flink 22,011
11 TDengine 21,751
12 postgrest 21,117
13 OSQuery 20,651
14 MyBatis 18,928
15 shardingsphere 18,805
16 Knex 18,034
17 Dapper 16,506
18 cube.js 16,235
19 dolt 15,532
20 TimescaleDB 15,516
21 Presto 15,083
22 rqlite 14,044
23 go-sql-driver/mysql 13,686
Collect and Analyze Billions of Data Points in Real Time
Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.