Java Apache

Open-source Java projects categorized as Apache

Top 23 Java Apache Projects

  1. Apache ZooKeeper

    Apache ZooKeeper

    Project mention: Mastering Apache Kafka: Powering Modern Data Pipelines | dev.to | 2025-01-16

    ZooKeeper is a distributed coordination service used in older versions of Kafka to manage cluster metadata, leader election, and configuration. It ensures consistency and synchronization across Kafka brokers.

  3. seatunnel

    SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

    Project mention: Practical experience in deploying K8s clusters in Apache SeaTunnel separated cluster mode | dev.to | 2025-04-22

    Official Apache SeaTunnel website: https://seatunnel.apache.org/

  4. iceberg

    Apache Iceberg

    Project mention: Every Database Will Support Iceberg — Here's Why | dev.to | 2025-04-22

    If you follow me on LinkedIn or Medium, you’ve probably noticed I’ve been talking a lot about Apache Iceberg. And as the founder of RisingWave — a stream processing and management system — I get this question a lot:

  5. Apache Storm

    Apache Storm (by apache)

  6. Apache Hive

    Apache Hive

    Project mention: Hive: An Open-Source Data Warehouse Built on Apache Hadoop | news.ycombinator.com | 2024-08-13
  7. groovy

    Apache Groovy: A powerful multi-faceted programming language for the JVM platform

    Project mention: Tuning OutOfMemoryError: Metaspace Size Problems | dev.to | 2025-04-24

    Dynamic class loading, typically when using either Java Reflection or Groovy Scripting;

  8. Leetcode

    Solutions to LeetCode problems; updated daily. Subscribe to my YouTube channel for more. (by fishercoder1534)

  10. Apache Log4j 2

    Apache Log4j is a versatile, feature-rich, efficient logging API and backend for Java.

    Project mention: Most Effective Approaches for Debugging Applications | dev.to | 2025-04-27

    Structured logging transforms debugging by providing a detailed, searchable record of an application’s state, including variable values, stack traces, and user actions. According to Gartner, organizations with robust logging systems resolve production issues 40% faster. Doug Crawford, President and Founder of Best Trade Schools, highlights their value: “Implementing a structured logging system… makes isolating the problem straightforward.” Tools like Sentry for real-time error tracking, Log4j for Java applications, or ELK Stack for log aggregation enable developers to pinpoint issues quickly, reducing the need for manual reproduction. For example, Sentry’s breadcrumb feature captures user actions leading to an error, offering a clear debugging trail.

  11. Apache Nutch

    Apache Nutch is an extensible and scalable web crawler

    Project mention: 11 best open-source web crawlers and scrapers in 2024 | dev.to | 2024-10-29

    Language: Java | GitHub: 2.9K+ stars | link

  12. Apache Parquet

    Apache Parquet Java

    Project mention: How to Pitch Your Boss to Adopt Apache Iceberg? | dev.to | 2025-04-11

    Iceberg decouples storage from compute. That means your data isn’t trapped inside one proprietary system. Instead, it lives in open file formats (like Apache Parquet) and is managed by an open, vendor-neutral metadata layer (Apache Iceberg).

  13. Flume

    Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

  14. Apache ActiveMQ

    Apache ActiveMQ Classic

  15. Apache Geode

    Apache Geode

    Project mention: No SNAPSHOTs | dev.to | 2024-07-30

    Even ASF does not use Maven to build some of its projects anymore: Beam, Groovy, Lucene, Geode, POI, and Solr are not built with Maven. Those are not the most popular ASF projects, I know, but still, it is something.

  16. bookkeeper

    Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads

  17. Apache OpenNLP

    Apache OpenNLP

  18. hop

    Hop Orchestration Platform

    Project mention: Show HN: Flowcode – Turing-complete visual programming platform | news.ycombinator.com | 2025-04-29
  19. mina-sshd

    Apache MINA sshd is a comprehensive Java library for client- and server-side SSH.

    Project mention: mina-sshd VS sshj - a user suggested alternative | libhunt.com/r/mina-sshd | 2025-03-01
  20. Apache ActiveMQ Artemis

    Mirror of Apache ActiveMQ Artemis

  21. ranger

    Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond (by apache)

  22. Apache Wicket

    Apache Wicket - Component-based Java web framework

    Project mention: Show HN: Latudio – a language acquisition app with a listening-oriented approach | news.ycombinator.com | 2024-12-04
  23. Apache Orc

    Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

  24. seatunnel-web

    SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

    Project mention: November Report on Apache SeaTunnel Community Development | dev.to | 2024-12-03

    [Bug] [Seatunnel-web]No configuration setting found for key 'where_c…ondition' @arshadmohammad

  25. Apache Ant

    Apache Ant is a Java-based build tool. (by apache)

    Project mention: Getting Started with DevOps | dev.to | 2025-04-15

    Ant,

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Java Apache related posts

  • Practical experience in deploying K8s clusters in Apache SeaTunnel separated cluster mode

    1 project | dev.to | 22 Apr 2025
  • How to Pitch Your Boss to Adopt Apache Iceberg?

    4 projects | dev.to | 11 Apr 2025
  • You’re Invited: Apache SeaTunnel Biweekly Community Meeting on April 8, 2025

    1 project | dev.to | 7 Apr 2025
  • Processing data with “Data Prep Kit” (part 2)

    2 projects | dev.to | 7 Apr 2025
  • Max severity RCE flaw discovered in widely used Apache Parquet

    5 projects | news.ycombinator.com | 6 Apr 2025
  • Crawling web sites using “Data Prep Kit”

    2 projects | dev.to | 4 Apr 2025
  • 🔬Public docker images Trivy scans as duckdb datas on Kaggle

    1 project | dev.to | 31 Mar 2025

Index

What are some of the best open-source Apache projects in Java? This list will help you:

#   Project                   Stars
1   Apache ZooKeeper         12,465
2   seatunnel                 8,476
3   iceberg                   7,344
4   Apache Storm              6,620
5   Apache Hive               5,700
6   groovy                    5,304
7   Leetcode                  3,906
8   Apache Log4j 2            3,475
9   Apache Nutch              3,011
10  Apache Parquet            2,804
11  Flume                     2,551
12  Apache ActiveMQ           2,351
13  Apache Geode              2,299
14  bookkeeper                1,930
15  Apache OpenNLP            1,507
16  hop                       1,132
17  mina-sshd                   969
18  Apache ActiveMQ Artemis     966
19  ranger                      950
20  Apache Wicket               759
21  Apache Orc                  725
22  seatunnel-web               678
23  Apache Ant                  438

Did you know that Java is the 8th most popular programming language based on number of references?