Java lakehouse

Open-source Java projects categorized as lakehouse

Top 6 Java lakehouse Projects

  1. Presto

    The official home of the Presto distributed SQL query engine for big data

    Project mention: Twitter's 600-Tweet Daily Limit Crisis: Soaring GCP Costs and the Open Source Fix Elon Musk Ignored | dev.to | 2025-04-10

    Presto: Presto is an open-source distributed SQL query engine that enables querying data from various sources. It provides fast and interactive analytics capabilities, supporting a wide range of data formats and integration with different storage systems.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. doris

    Apache Doris is an easy-to-use, high performance and unified analytics database.

    Project mention: Apache Doris: open-source data warehouse for real time data analytics | news.ycombinator.com | 2024-10-26
  4. starrocks

    The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

  5. LakeSoul

    LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

  6. gravitino

    World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

    Project mention: What is Data Agent and how to build it in 15 Minutes | news.ycombinator.com | 2024-08-16
  7. amoro

    Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

    Project mention: Show HN: Apache Amoro is a Lakehouse management system built on iceberg | news.ycombinator.com | 2024-08-05

    Apache Amoro released its first Apache version, 0.7.0-incubating, on August 1st. Users of Iceberg and Paimon are welcomed to try it out and provide feedback: https://amoro.apache.org/

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java lakehouse discussion

Log in or Post with

Java lakehouse related posts

Index

What are some of the best open-source lakehouse projects in Java? This list will help you:

# Project Stars
1 Presto 16,312
2 doris 13,529
3 starrocks 9,886
4 LakeSoul 2,706
5 gravitino 1,451
6 amoro 950

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that Java is
the 8th most popular programming language
based on number of references?