Java etl-framework

Open-source Java projects categorized as etl-framework

Top 3 Java etl-framework Projects

  1. Logstash

    Logstash - transport and process your logs, events, or other data

  2. seatunnel-web

    SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

    Project mention: November Report on Apache SeaTunnel Community Development | dev.to | 2024-12-03

    [Bug] [Seatunnel-web] No configuration setting found for key 'where_c…ondition' @arshadmohammad

  3. langchain-beam

    Integrates LLMs as PTransform in Apache Beam pipelines using LangChain

    Project mention: Generating text embeddings in ETL pipeline | news.ycombinator.com | 2025-01-13

    Hello, I've been working on the langchain-beam library. It's a LangChain and Apache Beam integration that lets you use LangChain components, such as the LLM interface, in Apache Beam ETL pipelines, leverage LLM capabilities for data processing and transformations, and create RAG-based ETL pipelines.

    Recently I've added a feature to integrate embedding models into Beam pipelines and generate vector embeddings for text within the pipeline, so that embedding generation can be part of the data pipeline instead of a separate service.

    I’d love to hear your thoughts. Repo - https://github.com/Ganeshsivakumar/langchain-beam

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Java etl-framework related posts

  • Generating text embeddings in ETL pipeline

    1 project | news.ycombinator.com | 13 Jan 2025
  • SymmetricDS: Open-Source, cross platform database replication software

    3 projects | news.ycombinator.com | 6 Aug 2023
  • Questions Regarding design DW

    1 project | /r/dataengineering | 24 Jun 2023
  • SeaTunnel Zeta engine, the first choice for massive data synchronization, is officially released!

    1 project | dev.to | 5 Jan 2023
  • Major Release! SeaTunnel 2.3.0-beta supports the self-innovate SeaTunnel Engine and more connectors!

    1 project | dev.to | 3 Nov 2022
  • SeaTunnel Will Support CDC As A Feature Soon!

    1 project | /r/u_SeaTunnel | 3 Nov 2022
  • SeaTunnel Will Support CDC As A Feature Soon!

    1 project | dev.to | 3 Nov 2022

Index

What are some of the best open-source etl-framework projects in Java? This list will help you:

#  Project         Stars
1  Logstash        14,616
2  seatunnel-web   719
3  langchain-beam  21

Did you know that Java is the 8th most popular programming language based on number of references?