InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 7 Java data-integration Projects
-
Project mention: Where Does Data Flow? A Complete Guide to Apache SeaTunnel Sink Connectors (2024 Edition) | dev.to | 2025-08-28
-
Sevalla
Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
-
Project mention: flink-cdc VS cocoindex - a user suggested alternative | libhunt.com/r/flink-cdc | 2025-04-01
-
Project mention: Twitter's 600-Tweet Daily Limit Crisis: Soaring GCP Costs and the Open Source Fix Elon Musk Ignored | dev.to | 2025-04-10
Apache Hudi: Apache Hudi is a distributed data lake storage system that offers near real-time data ingestion and efficient data management for big data workloads. It provides features like record-level updates and deletes, incremental data processing, and data lifecycle management.
-
bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
-
Project mention: Show HN: Flowcode – Turing-complete visual programming platform | news.ycombinator.com | 2025-04-29
-
seatunnel-web
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
[Bug] [Seatunnel-web]No configuration setting found for key 'where_c…ondition' @arshadmohammad
-
SpringBoot3BatchStarter
Spring Batch 5 skeleton for Spring Boot 3. Includes DB to CSV and CSV to DB samples for quick customization. This repository demonstrates multi-database setup, efficient batch processing, and GitHub Actions integration for CI/CD pipelines.
Technical Blog (English): https://blog.kinto-technologies.com/posts/2024-12-25_copy_paste_spring_batch5_boot3
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
Java data-integration discussion
Java data-integration related posts
-
Where Does Data Flow? A Complete Guide to Apache SeaTunnel Sink Connectors (2024 Edition)
-
SeaTunnel Community Rocked July: New Features, Major Optimizations, All-Star Contributors
-
Apache SeaTunnel New Positioning! Moving Toward a Unified Tool for Multimodal Data Integration
-
From Logs to Alerts: Mastering SeaTunnel's Event Listener Capabilities
-
Apache SeaTunnel Hive Deep Integration Guide: Principles, Configuration, & Practice
-
A Deep Dive Into SeaTunnel's Thread Sharing Mechanism and Task Execution Model Optimization
-
How to Build Real-Time Data Pipelines with SQL Server CDC and Apache SeaTunnel
-
A note from our sponsor - InfluxDB
www.influxdata.com | 1 Sep 2025
Index
What are some of the best open-source data-integration projects in Java? This list will help you:
# | Project | Stars |
---|---|---|
1 | seatunnel | 8,749 |
2 | flink-cdc | 6,205 |
3 | hudi | 5,924 |
4 | bitsail | 1,666 |
5 | hop | 1,216 |
6 | seatunnel-web | 719 |
7 | SpringBoot3BatchStarter | 8 |