Top 23 Java SQL Projects

dbeaver

27 37,391 9.9 Java

Free universal database tool and SQL client

Project mention: DBeaver – open-source Database client | news.ycombinator.com | 2024-03-10

Yes but not in the community version:
https://github.com/dbeaver/dbeaver/wiki/Schema-compare

Apache Flink

9 23,158 9.9 Java

Apache Flink

Project mention: First 15 Open Source Advent projects | dev.to | 2023-12-15

7. Apache Flink | Github | tutorial

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
shardingsphere

23 19,425 10.0 Java

Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

Project mention: Managing Data Residency - the demo | dev.to | 2023-05-25

Opposite to what the documentation tells, the full prefix is jdbc:shardingsphere:absolutepath. I've opened a PR to fix the documentation.

MyBatis

4 19,404 9.3 Java

MyBatis SQL mapper framework for Java

Project mention: MyBatis makes it easier to use a relational database with OO applications | news.ycombinator.com | 2023-10-05

Presto

14 15,591 9.9 Java

The official home of the Presto distributed SQL query engine for big data

Project mention: Multi-Database Support in DuckDB | news.ycombinator.com | 2024-01-28

We have some of this functionality in Presto (https://github.com/prestodb/presto), but it takes fair bit of work to implement it for all the different backends.

QuestDB

311 13,448 9.7 Java

An open source time-series database for fast ingest and SQL queries

Project mention: How to Forecast Air Temperatures with AI + IoT Sensor Data | dev.to | 2024-03-24

If your data lacks uniform time intervals between consecutive entries, QuestDB offers a solution by allowing you to sample your data. After that, MindsDB facilitates creating, training, and deploying your time-series models.

doris

42 11,314 10.0 Java

Apache Doris is an easy-to-use, high performance and unified analytics database.

Project mention: Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis | dev.to | 2024-03-27

As an open-source real-time data warehouse, Apache Doris provides semi-structured data processing capabilities, and the newly-released version 2.1.0 makes a stride in this direction. Before V2.1, Apache Doris stores semi-structured data as JSON files. However, during query execution, the real-time parsing of JSON data leads to high CPU and I/O consumption in addition to high query latency, especially when the dataset is huge and complicated. Moreover, the lack of a pre-defined schema means there is no handle for query optimization.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Trino

44 9,552 10.0 Java

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Project mention: Trino: Fast distributed SQL query engine for big data analytics | news.ycombinator.com | 2024-03-19

ali-dbhub

2 8,454 9.5 Java

已迁移新仓库，此版本将不再维护

Project mention: FLaNK Stack Weekly for 20 June 2023 | dev.to | 2023-06-20

ChatGPT Mac/Windows App for SQL https://github.com/alibaba/Chat2DB

starrocks

12 7,764 10.0 Java

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.

Project mention: A MySQL compatible database engine written in pure Go | news.ycombinator.com | 2024-04-09

tidb has been around for a while, it is distributed, written in Go and Rust, and MySQL compatible. https://github.com/pingcap/tidb
Somewhat relatedly, StarRocks is also MySQL compatible, written in Java and C++, but it's tackling OLAP use-cases. https://github.com/StarRocks/starrocks

Flyway

81 7,763 7.2 Java

Flyway by Redgate • Database Migrations Made Easy.

Project mention: Let's write a simple microservice in Clojure | dev.to | 2024-04-26

The session logs show that the application loads configurations and establishes a connection with a PostgreSQL database. This involves initializing a HikariCP connection pool and Flyway for database migrations. The logs confirm that the database schema validation and migration checks were successful. The startup of the Jetty HTTP server follows, and the server becomes operational and ready to accept requests on the specified port.

beam

30 7,508 10.0 Java

Apache Beam is a unified programming model for Batch and Streaming data processing.

Project mention: Ask HN: Does (or why does) anyone use MapReduce anymore? | news.ycombinator.com | 2024-01-24

The "streaming systems" book answers your question and more: https://www.oreilly.com/library/view/streaming-systems/97814.... It gives you a history of how batch processing started with MapReduce, and how attempts at scaling by moving towards streaming systems gave us all the subsequent frameworks (Spark, Beam, etc.).
As for the framework called MapReduce, it isn't used much, but its descendant https://beam.apache.org very much is. Nowadays people often use "map reduce" as a shorthand for whatever batch processing system they're building on top of.

jOOQ

94 5,882 9.8 Java

jOOQ is the best way to write SQL in Java

Project mention: Serious flaws in SQL – Edgar F. Codd (1990) | news.ycombinator.com | 2024-04-25

> 2. ORMs do not hide SQL nastiness.
This is certainly true!
I mean: ORMs are now well known to "make the easy queries slightly more easy, while making intermediate queries really hard and complex queries impossible".
I think the are of ORMs is over. It simply did not deliver.
If a book on SQL is --say-- 100 pages, a book on Hibernate is 400 pages. So much to learn just to make the easy queries slightly easier to type? Just not worth it.
I prefer jooq any day over ORMs. And dont get me started over what tools like Hasuna have to offer.
There are also some languages (forgot the names) that are SQL-done-right. Select in the back, more type safe, more logic, more in the same steps as the query gets executed. These need to be adopted by PG and MySQL and we're good to go. (IMHO)
https://www.jooq.org/
https://hasura.io/

ksql

4 5,817 10.0 Java

The database purpose-built for stream processing applications.
Apache Hive

14 5,326 9.6 Java

Apache Hive
JSqlParser

4 4,956 9.2 Java

JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern
OrientDB

3 4,691 9.8 Java

OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries.
Apache Ignite

3 4,678 9.5 Java

Apache Ignite (by apache)
esProc

55 4,425 9.6 Java

esProc SPL is a scripting language for data processing, with well-designed rich library functions and powerful syntax, which can be executed in a Java program through JDBC interface and computing independently.

Project mention: How Slow Is Database IO？ | news.ycombinator.com | 2024-04-25

liquibase

54 4,394 9.9 Java

Main Liquibase Source

Project mention: I am looking for a troubled/bad open source codebase | /r/ExperiencedDevs | 2023-07-12

While I respect the work, Liquibase's code base is quite messy... https://github.com/liquibase/liquibase

Apache Calcite

28 4,363 9.0 Java

Apache Calcite

Project mention: Data diffs: Algorithms for explaining what changed in a dataset (2022) | news.ycombinator.com | 2023-07-26

> Make diff work on more than just SQLite.
Another way of doing this that I've been wanting to do for a while is to implement the DIFF operator in Apache Calcite[0]. Using Calcite, DIFF could be implemented as rewrite rules to generate the appropriate SQL to be directly executed against the database or the DIFF operator can be implemented outside of the database (which the original paper shows is more efficient).
[0] https://calcite.apache.org/

spotless

10 4,161 9.7 Java

Keep your code spotless
H2

11 4,048 9.1 Java

H2 is an embeddable RDBMS written in Java.

Project mention: H2 Database – CVE getting flagged by automated scans | news.ycombinator.com | 2023-07-18

The URL should point to a particular comment, but HN removes fragments: https://github.com/h2database/h2database/issues/3686#issueco...

SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java SQL related posts

How Slow Is Database IO？
1 project | news.ycombinator.com | 25 Apr 2024
Computing Engine on Web
1 project | news.ycombinator.com | 22 Apr 2024
What is the difference between BI and AI？
1 project | news.ycombinator.com | 18 Apr 2024
Multi Purpose Traversal
1 project | news.ycombinator.com | 14 Apr 2024
Why ETL Becomes ELT or Even Let？
1 project | news.ycombinator.com | 11 Apr 2024
A tool for developing quantitative strategy model
1 project | news.ycombinator.com | 6 Apr 2024
A major culprit in the slow running and collapse of a database
1 project | news.ycombinator.com | 28 Mar 2024
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source SQL projects in Java? This list will help you:

	Project	Stars
1	dbeaver	37,391
2	Apache Flink	23,158
3	shardingsphere	19,425
4	MyBatis	19,404
5	Presto	15,591
6	QuestDB	13,448
7	doris	11,314
8	Trino	9,552
9	ali-dbhub	8,454
10	starrocks	7,764
11	Flyway	7,763
12	beam	7,508
13	jOOQ	5,882
14	ksql	5,817
15	Apache Hive	5,326
16	JSqlParser	4,956
17	OrientDB	4,691
18	Apache Ignite	4,678
19	esProc	4,425
20	liquibase	4,394
21	Apache Calcite	4,363
22	spotless	4,161
23	H2	4,048