Java etl-framework

Open-source Java projects categorized as etl-framework

Top 3 Java etl-framework Projects

etl-framework
  1. Logstash

    Logstash - transport and process your logs, events, or other data

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. seatunnel-web

    SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

    Project mention: November Report on Apache SeaTunnel Community Development | dev.to | 2024-12-03

    [Bug] [Seatunnel-web]No configuration setting found for key 'where_c…ondition' @arshadmohammad

  4. langchain-beam

    Integrates LLMs as PTransform in Apache Beam pipelines using LangChain

    Project mention: Generating text embeddings in ETL pipeline | news.ycombinator.com | 2025-01-13

    Hello, I've been working on langchain-beam library. Its a langchain and apache beam integration to use langchain's components like LLM interface in apache beam ETL pipeline and leverage LLM's capabilities for data processing, transformations and provide a way to create RAG based ETL pipelines.

    recently I've added a feature to integrate embedding models into beam pipeline and generate vector embeddings for text in pipeline using the models so that embedding generation activity can be a part of the data pipeline instead of separate service.

    I’d love to hear your thoughts. Repo - https://github.com/Ganeshsivakumar/langchain-beam

    Example usage, to create embeddings in pipeline :

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Java etl-framework discussion

Log in or Post with

Java etl-framework related posts

  • Generating text embeddings in ETL pipeline

    1 project | news.ycombinator.com | 13 Jan 2025
  • SymmetricDS: Open-Source, cross platform database replication software

    3 projects | news.ycombinator.com | 6 Aug 2023
  • Questions Regarding design DW

    1 project | /r/dataengineering | 24 Jun 2023
  • SeaTunnel Zeta engine, the first choice for massive data synchronization, is officially released!

    1 project | dev.to | 5 Jan 2023
  • Major Release! SeaTunnel 2.3.0-beta supports the self-innovate SeaTunnel Engine and more connectors!

    1 project | dev.to | 3 Nov 2022
  • SeaTunnel Will Support CDC As A Feature Soon!

    1 project | /r/u_SeaTunnel | 3 Nov 2022
  • SeaTunnel Will Support CDC As A Feature Soon!

    1 project | dev.to | 3 Nov 2022
  • A note from our sponsor - Nutrient
    www.nutrient.io | 16 Feb 2025
    Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries. Learn more →

Index

What are some of the best open-source etl-framework projects in Java? This list will help you:

# Project Stars
1 Logstash 14,340
2 seatunnel-web 618
3 langchain-beam 14

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai