Top 6 Java ETL Projects
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.Project mention: What are your thoughts on projects using the Elastic License? | reddit.com/r/opensource | 2023-01-26
Doing a quick GitHub search reveals quite a few projects using the ELv2 license, including Airbyte and InvoiceNinja. Elastic (the company) aside, what are your thoughts on the Elastic License v2? Does your employer allow projects with an ELv2 license? Do you consider it open source? I understand that it's not OSI approved, but wondering where people stand when it comes to commercial open source software.
Scalable identity resolution, entity resolution, data mastering and deduplication using MLProject mention: Ask HN: What is the most impactful thing you've ever built? | news.ycombinator.com | 2022-11-18
As part of my data consulting, I struggled with identity resolution and started working on scalable no code identity resolution - https://github.com/zinggAI/zingg/ . It has pushed my limits as a software engineer and product builder, and I had to do a lot of learning to build it. Its cool to see people use Zingg in their workflows and save months of working on custom solutions. Big highlight has been North Carolina Open Campaign Data https://crossroads-cx.medium.com/building-open-access-to-nc-...
Write Clean Java Code. Always.. Sonar helps you commit clean code every time. With over 600 unique rules to find Java bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
Extensible data integration Java framework for building XML and non-XML fragment-based applications
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
JDBC driver for Neo4jProject mention: How can an opensource GPLv3/GPLv2 database (such as Neo4j or Virtuoso) be distributed alongside a proprietary software? | dev.to | 2022-11-29
Scenario 1) The application uses Neo4j GPLv3 database alongside Neo4j's own exclusive query language called Cypher. The program will have some relevant part of its functionality written in CypherQL even though it connects to the database using an Apache 2.0 licensed driver.
Reference data importers
Java ETL related posts
Best ‘E’TL tools for extracting data from on-prem SQL databases
2 projects | reddit.com/r/snowflake | 28 Mar 2022
Maven unable to resolve a dependency given in pom.xml. I've instead tried manually downloading installing the jar, but now maven cannot find the package.
1 project | reddit.com/r/learnjava | 30 Aug 2021
Download json and csv file from github repository with apache kafka
1 project | reddit.com/r/apachekafka | 29 Jul 2021
Streaming data into Kafka S01/E04 — Loading Log files using Grok Expression
5 projects | dev.to | 5 Jan 2021
What are some of the best open-source ETL projects in Java? This list will help you: