Top 6 Java ETL Projects
-
airbyte
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
Project mention: What are your thoughts on projects using the Elastic License? | reddit.com/r/opensource | 2023-01-26
Doing a quick GitHub search reveals quite a few projects using the ELv2 license, including Airbyte and InvoiceNinja. Elastic (the company) aside, what are your thoughts on the Elastic License v2? Does your employer allow projects with an ELv2 license? Do you consider it open source? I understand that it's not OSI approved, but wondering where people stand when it comes to commercial open source software.
-
zingg
Project mention: Ask HN: What is the most impactful thing you've ever built? | news.ycombinator.com | 2022-11-18
As part of my data consulting, I struggled with identity resolution and started working on scalable no-code identity resolution - https://github.com/zinggAI/zingg/ . It has pushed my limits as a software engineer and product builder, and I had to do a lot of learning to build it. It's cool to see people use Zingg in their workflows and save months of working on custom solutions. A big highlight has been North Carolina Open Campaign Data https://crossroads-cx.medium.com/building-open-access-to-nc-...
-
Smooks
Extensible data integration Java framework for building XML and non-XML fragment-based applications
-
kafka-connect-file-pulse
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
-
neo4j-jdbc
Project mention: How can an opensource GPLv3/GPLv2 database (such as Neo4j or Virtuoso) be distributed alongside a proprietary software? | dev.to | 2022-11-29
Scenario 1) The application uses Neo4j GPLv3 database alongside Neo4j's own exclusive query language called Cypher. The program will have some relevant part of its functionality written in CypherQL even though it connects to the database using an Apache 2.0 licensed driver.
-
Java ETL related posts
- Best ‘E’TL tools for extracting data from on-prem SQL databases
- Maven unable to resolve a dependency given in pom.xml. I've instead tried manually downloading installing the jar, but now maven cannot find the package.
- Download json and csv file from github repository with apache kafka
- Streaming data into Kafka S01/E04 — Loading Log files using Grok Expression
Index
What are some of the best open-source ETL projects in Java? This list will help you:
# | Project | Stars |
---|---|---|
1 | airbyte | 9,359 |
2 | zingg | 691 |
3 | Smooks | 356 |
4 | kafka-connect-file-pulse | 237 |
5 | neo4j-jdbc | 110 |
6 | dcc-import | 1 |