Java SQL

Open-source Java projects categorized as SQL

Top 23 Java SQL Projects

  • dbeaver

    Free universal database tool and SQL client

    Project mention: 👻Top 8 Free, Open Source SQL Clients🔥 | 2023-09-04

    DBeaver is a veteran SQL client. In addition to basic visualization and management capabilities, it has a SQL editor, data and schema migration capabilities, database connection monitoring, and more. It supports a full range of databases (both SQL and NoSQL). DBeaver also integrates with GPT-3, which converts natural language to SQL.

  • shardingsphere

    Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

    Project mention: Managing Data Residency - the demo | 2023-05-25

    Contrary to what the documentation says, the full prefix is jdbc:shardingsphere:absolutepath. I've opened a PR to fix the documentation.

  • MyBatis

    MyBatis SQL mapper framework for Java

    Project mention: MyBatis makes it easier to use a relational database with OO applications | 2023-10-05

  • Presto

    The official home of the Presto distributed SQL query engine for big data

    Project mention: Multi-Database Support in DuckDB | 2024-01-28

    We have some of this functionality in Presto, but it takes a fair bit of work to implement it for all the different backends.

  • QuestDB

    An open source time-series database for fast ingest and SQL queries

    Project mention: Normalizing Grafana charts with window functions | 2024-01-10

    If you're interested in that functionality or have any other feedback, please drop by our open source repository or community Slack and let us know.

  • doris

    Apache Doris is an easy-to-use, high performance and unified analytics database.

    Project mention: Five Apache projects you probably didn't know about | 2023-12-21

    Apache Doris is a real-time data warehouse.

  • Trino

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL.

    Project mention: Game analytic power: how we process more than 1 billion events per day | 2023-11-24

    We decided not to waste time reinventing the wheel and simply installed Trino on our servers. It’s a full-featured SQL query engine that works on your data. Now our analysts can use it to work with data from AppMetr and execute queries at different levels of complexity.

  • ali-dbhub


    Project mention: FLaNK Stack Weekly for 20 June 2023 | 2023-06-20

    ChatGPT Mac/Windows App for SQL

  • Flyway

    Flyway by Redgate • Database Migrations Made Easy.

    Project mention: PostgreSQL Is Enough | 2024-02-06

    There is a bit of tooling needed but is already around. For Java for example I had very good experience with a combination of flyway [1] for migrations, testcontainers [2] for making integration tests as easy as unit tests and querydsl [3] for a query and mapping layer.




  • beam

    Apache Beam is a unified programming model for Batch and Streaming data processing.

    Project mention: Ask HN: Does (or why does) anyone use MapReduce anymore? | 2024-01-24

    The "streaming systems" book answers your question and more: It gives you a history of how batch processing started with MapReduce, and how attempts at scaling by moving towards streaming systems gave us all the subsequent frameworks (Spark, Beam, etc.).

    As for the framework called MapReduce, it isn't used much, but its descendant very much is. Nowadays people often use "map reduce" as a shorthand for whatever batch processing system they're building on top of.

  • starrocks

    StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. Winner of InfoWorld’s 2023 BOSSIE Award for best open source software.

    Project mention: StarRocks – sub-second MPP OLAP database for full analytics scenarios | 2024-01-23

  • jOOQ

    jOOQ is the best way to write SQL in Java

    Project mention: ORMs are nice but they are the wrong abstraction | 2024-02-01

  • ksql

    The database purpose-built for stream processing applications.

    Project mention: Supercharge your Kafka Clusters with Consumer Best Practices | /r/programming | 2023-03-12

  • Apache Hive

    Apache Hive

    Project mention: Apache Iceberg as storage for on-premise data store (cluster) | /r/dataengineering | 2023-03-16

    Trino or Hive for SQL querying. Get Trino/Hive to talk to Nessie.

  • JSqlParser

    JSqlParser parses an SQL statement and translates it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor pattern.

  • OrientDB

    OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries.

    Project mention: Graph Databases vs Relational Databases: What and why? | 2023-03-29

    First, you need to choose a specific graph database platform to work with, such as Neo4j, OrientDB, JanusGraph, ArangoDB or Amazon Neptune. Once you have selected a platform, you can then start working with graph data using the platform's query language.

  • Apache Ignite

    Apache Ignite (by apache)

  • esProc

    esProc SPL is a scripting language for data processing, with a rich, well-designed function library and powerful syntax. It can be executed from a Java program through a JDBC interface or can compute independently.

    Project mention: How come there are tens of thousands of tables in a database | 2024-02-17

    With the open and integrable computing capability of esProc SPL, it will be more convenient to design application frameworks. Computing can be placed in the most suitable position, without the need to deploy extra databases to obtain computing power. The database can focus on what it does best, and complex and flexible computing can be left to SPL, maximizing resource utilization. Finally, esProc SPL is open source.

  • liquibase

    Main Liquibase Source

    Project mention: I am looking for a troubled/bad open source codebase | /r/ExperiencedDevs | 2023-07-12

    While I respect the work, Liquibase's code base is quite messy...

  • Apache Calcite

    Apache Calcite

    Project mention: Data diffs: Algorithms for explaining what changed in a dataset (2022) | 2023-07-26

    > Make diff work on more than just SQLite.

    Another way of doing this that I've been wanting to do for a while is to implement the DIFF operator in Apache Calcite[0]. Using Calcite, DIFF could be implemented as rewrite rules to generate the appropriate SQL to be directly executed against the database or the DIFF operator can be implemented outside of the database (which the original paper shows is more efficient).


  • spotless

    Keep your code spotless

    Project mention: FLiPN-FLaNK Stack for March 6, 2023 | 2023-03-06

  • H2

    H2 is an embeddable RDBMS written in Java.

    Project mention: H2 Database – CVE getting flagged by automated scans | 2023-07-18

    The URL should point to a particular comment, but HN removes fragments.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-17.
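
The JSqlParser entry in the list above describes parsing SQL into a hierarchy of Java classes that can be walked with the Visitor pattern. As a rough illustration of that idea only, here is a minimal sketch using hypothetical node types (`Select`, `Table`, `Column`) and a hypothetical `TableNameCollector`. These are stand-ins invented for this example and are not JSqlParser's actual classes or visitor interfaces.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SqlVisitorSketch {
    // Hypothetical AST node types; a real parser's hierarchy is far richer.
    interface Node {
        void accept(Visitor visitor);
    }

    // One visit method per node type (classic double dispatch).
    interface Visitor {
        void visit(Select select);
        void visit(Table table);
        void visit(Column column);
    }

    static final class Column implements Node {
        final String name;
        Column(String name) { this.name = name; }
        public void accept(Visitor visitor) { visitor.visit(this); }
    }

    static final class Table implements Node {
        final String name;
        Table(String name) { this.name = name; }
        public void accept(Visitor visitor) { visitor.visit(this); }
    }

    static final class Select implements Node {
        final List<Column> columns;
        final Table from;
        Select(List<Column> columns, Table from) {
            this.columns = columns;
            this.from = from;
        }
        public void accept(Visitor visitor) { visitor.visit(this); }
    }

    // Walks the tree and records every table name it encounters.
    static final class TableNameCollector implements Visitor {
        final List<String> tables = new ArrayList<>();

        public void visit(Select select) {
            for (Column column : select.columns) {
                column.accept(this);
            }
            select.from.accept(this);
        }

        public void visit(Table table) { tables.add(table.name); }

        public void visit(Column column) { /* columns carry no table names here */ }
    }

    static List<String> tableNames(Node root) {
        TableNameCollector collector = new TableNameCollector();
        root.accept(collector);
        return collector.tables;
    }

    public static void main(String[] args) {
        // Hand-built tree standing in for "SELECT id, name FROM users".
        Select query = new Select(
                Arrays.asList(new Column("id"), new Column("name")),
                new Table("users"));
        System.out.println(tableNames(query)); // prints [users]
    }
}
```

The tree is built by hand here because the sketch has no parser; with a real SQL parser, the parsing step would produce the tree and a visitor like this one would only traverse it.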


What are some of the best open-source SQL projects in Java? This list will help you:

Rank  Project         Stars
1     dbeaver         36,074
2     Apache Flink    22,861
3     shardingsphere  19,244
4     MyBatis         19,238
5     Presto          15,436
6     QuestDB         13,234
7     doris           10,775
8     Trino           9,229
9     ali-dbhub       8,551
10    Flyway          7,675
11    beam            7,385
12    starrocks       7,334
13    jOOQ            5,823
14    ksql            5,754
15    Apache Hive     5,249
16    JSqlParser      4,841
17    OrientDB        4,670
18    Apache Ignite   4,641
19    esProc          4,380
20    liquibase       4,275
21    Apache Calcite  4,257
22    spotless        4,037
23    H2              3,989