Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev.
Top 13 Java Data Projects
-
kestra
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Kestra's communication is asynchronous and based on a queuing mechanism. It leverages the Micronaut framework and offers two runners: one that uses a database (JDBC) for both the message queue and resource storage, and another that uses Kafka as the message queue and Elasticsearch as the resource storage. The platform is fully extensible and plugin-based, providing a rich set of plugins for various workflow tasks, triggers, and data storage options. For those interested, the GitHub repository is available here: https://github.com/kestra-io/kestra
-
data-transfer-project
The Data Transfer Project makes it easy for people to transfer their data between online service providers. We are establishing a common framework, including data models and protocols, to enable direct transfer of data both into and out of participating online service providers.
I would argue that it is exactly in line with Apple's brand identity.
Pretty much everybody agrees that you need to back up your cloud storage as well as your local computer, and Apple even backs up your i-devices to the cloud, and yet there is no automated way of backing up your iCloud storage.
About a decade ago, Google initiated the Data Transfer Framework[1] that allows you to transfer data from one cloud provider to another, directly from provider to provider instead of downloading it first. It sadly appears to not have gotten enough traction to be of any use.
-
proteus
Project mention: Am I safe by sticking with Java and XML for years ahead? | /r/androiddev | 2023-06-04
I guess it wouldn't be a first https://github.com/flipkart-incubator/proteus , but
-
nessie
Project mention: Why is Hive Metastore everywhere? (Especially Iceberg) | /r/dataengineering | 2023-06-30
Try Nessie https://github.com/projectnessie/nessie - it recently got trino support as well ..
-
jimmer
-
riot
Hi there - take a look at RIOT which might be helpful... https://github.com/redis-developer/riot
-
rapiddweller-benerator-ce
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
-
ModelRunner
-
Db4o-gpl
New Db4o GPL source code for Java 7+ and .NET Standard 2.0 (Android, Xamarin...), the best database project to help you learn how to make databases
-
nextcloud-tables
Nextcloud Tables (version 1.0.7): Companion app for Nextcloud Tables
-
SheetsIO
Small configurable Java app that pulls data from a Google Spreadsheet (using the v4 API) and writes it to files and a local web server.
-
Data-Structures-and-Algorithms
Solutions to Arrays, Strings, Lists, Sorting, Stacks, Trees and General DS problems using Java. (by anishkumar127)
-
SparkDB
Java Data related posts
- Why is Hive Metastore everywhere? (Especially Iceberg)
- Missouri trans 'snitch form' down after people spammed it with the 'Bee Movie' script
- Uploading Data from a CSV file
- Is it safe to update docker/docker-compose?
- What are the main things I need to know to be hired as a Java developer?
- Is learning and mastering Spring & Spring boot worth it in 2023 ?
- Two-way syncs across your data stack and SaaS tools
-
A note from our sponsor - Onboard AI
getonboard.dev | 7 Dec 2023
Index
What are some of the best open-source Data projects in Java? This list will help you:
# | Project | Stars |
---|---|---|
1 | kestra | 5,045 |
2 | data-transfer-project | 3,522 |
3 | proteus | 1,293 |
4 | nessie | 735 |
5 | jimmer | 483 |
6 | riot | 204 |
7 | rapiddweller-benerator-ce | 118 |
8 | ModelRunner | 57 |
9 | Db4o-gpl | 29 |
10 | nextcloud-tables | 22 |
11 | SheetsIO | 19 |
12 | Data-Structures-and-Algorithms | 11 |
13 | SparkDB | 3 |