Java data-integration

Open-source Java projects categorized as data-integration

Top 7 Java data-integration Projects

data-integration
  1. seatunnel

    SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.

    Project mention: Where Does Data Flow? A Complete Guide to Apache SeaTunnel Sink Connectors (2024 Edition) | dev.to | 2025-08-28
  2. Sevalla

    Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!

    Sevalla logo
  3. hudi

    Upserts, Deletes And Incremental Processing on Big Data.

    Project mention: Twitter's 600-Tweet Daily Limit Crisis: Soaring GCP Costs and the Open Source Fix Elon Musk Ignored | dev.to | 2025-04-10

    Apache Hudi: Apache Hudi is a distributed data lake storage system that offers near real-time data ingestion and efficient data management for big data workloads. It provides features like record-level updates and deletes, incremental data processing, and data lifecycle management.

  4. bitsail

    BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.

  5. hop

    Hop Orchestration Platform

    Project mention: Show HN: Flowcode – Turing-complete visual programming platform | news.ycombinator.com | 2025-04-29
  6. seatunnel-web

    SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

    Project mention: November Report on Apache SeaTunnel Community Development | dev.to | 2024-12-03

    [Bug] [Seatunnel-web]No configuration setting found for key 'where_c…ondition' @arshadmohammad

  7. SpringBoot3BatchStarter

    Spring Batch 5 skeleton for Spring Boot 3. Includes DB to CSV and CSV to DB samples for quick customization. This repository demonstrates multi-database setup, efficient batch processing, and GitHub Actions integration for CI/CD pipelines.

    Project mention: Zero Config Spring Batch: Just Write Business Logic | dev.to | 2025-01-04

    Technical Blog (English): https://blog.kinto-technologies.com/posts/2024-12-25_copy_paste_spring_batch5_boot3

  8. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java data-integration discussion

Log in or Post with

Java data-integration related posts

  • Where Does Data Flow? A Complete Guide to Apache SeaTunnel Sink Connectors (2024 Edition)

    1 project | dev.to | 28 Aug 2025
  • SeaTunnel Community Rocked July: New Features, Major Optimizations, All-Star Contributors

    2 projects | dev.to | 14 Aug 2025
  • Apache SeaTunnel New Positioning! Moving Toward a Unified Tool for Multimodal Data Integration

    2 projects | dev.to | 14 Aug 2025
  • From Logs to Alerts: Mastering SeaTunnel's Event Listener Capabilities

    1 project | dev.to | 30 Jul 2025
  • Apache SeaTunnel Hive Deep Integration Guide: Principles, Configuration, & Practice

    1 project | dev.to | 9 Jul 2025
  • A Deep Dive Into SeaTunnel's Thread Sharing Mechanism and Task Execution Model Optimization

    1 project | dev.to | 24 Jun 2025
  • How to Build Real-Time Data Pipelines with SQL Server CDC and Apache SeaTunnel

    1 project | dev.to | 17 Jun 2025
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 1 Sep 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source data-integration projects in Java? This list will help you:

# Project Stars
1 seatunnel 8,749
2 flink-cdc 6,205
3 hudi 5,924
4 bitsail 1,666
5 hop 1,216
6 seatunnel-web 719
7 SpringBoot3BatchStarter 8

Sponsored
Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com

Did you know that Java is
the 8th most popular programming language
based on number of references?