Top 8 Java Bigdata Projects
-
shardingsphere
Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.
-
odd-platform
The first open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
-
dataCompare
A low-code platform for big data comparison and data profiling.
-
big-data-pipeline-lambda-arch
A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra.
-
hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
-
rapiddweller-benerator-ce
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Contrary to what the documentation says, the full prefix is jdbc:shardingsphere:absolutepath. I've opened a PR to fix the documentation.
Project mention: Getting Started with Flink SQL, Apache Iceberg and DynamoDB Catalog | dev.to | 2023-12-18
Apache Iceberg is one of the three types of lakehouse, the other two are Apache Hudi and Delta Lake.
Project mention: Open Table Formats Such as Apache Iceberg Are Inevitable for Analytical Data | news.ycombinator.com | 2024-01-18
Apache AVRO [1] is one but it has been largely replaced by Parquet [2] which is a hybrid row/columnar format
[1] https://avro.apache.org/
Project mention: OpenDataDiscovery 0.15 with Data Deprecation and Metadata Stale | news.ycombinator.com | 2023-08-04
Java Bigdata related posts
- For those of you with Lakehouse Architectures, how do you handle duplicate records?
- AWS ACID data lakehouse
- hadoopcryptoledger: NEW Data - star count:139.0
Index
What are some of the best open-source Bigdata projects in Java? This list will help you:
# | Project | Stars |
---|---|---|
1 | shardingsphere | 19,425 |
2 | hudi | 5,053 |
3 | Apache Avro | 2,764 |
4 | odd-platform | 1,108 |
5 | dataCompare | 234 |
6 | big-data-pipeline-lambda-arch | 161 |
7 | hadoopcryptoledger | 141 |
8 | rapiddweller-benerator-ce | 128 |