cassandra-java-driver
iceberg
cassandra-java-driver | iceberg | |
---|---|---|
1 | 18 | |
1,331 | 5,540 | |
0.1% | 2.1% | |
7.9 | 9.9 | |
about 10 hours ago | 4 days ago | |
Java | Java | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cassandra-java-driver
iceberg
- Iceberg won the table format war: But not in the way you thought it might
- Lakehouse using AWS Athena on Iceberg Concerns
- apache/iceberg: Apache Iceberg
- What are the main things I need to know to be hired as a Java developer?
- Have you used Athena Iceberg for small(-ish) data?
- Is Data Lakehouse a threat to Snowflake?
-
Snowflake vs databricks cloud/labor cost
This is interesting, imo.
- Setting the Table: Benchmarking Open Table Formats
-
Spark Dynamic Partition Overwrite Mode Replaces Existing Data
If you're using Iceberg as your table format, it had bugs with MERGE INTO with non-nullable columns up until September: https://github.com/apache/iceberg/pull/5679
-
How to migrate delta tables to iceberg?
yeah, this as a capability is a WIP and discussion point in the iceberg community - https://github.com/apache/iceberg/pull/5331
What are some alternatives?
Spring Boot - Spring Boot
kudu - Mirror of Apache Kudu
hudi - Upserts, Deletes And Incremental Processing on Big Data.
Apache Avro - Apache Avro is a data serialization system.
debezium - Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
RocksDB - A library that provides an embeddable, persistent key-value store for fast storage.
delta - An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Dask - Parallel computing with task scheduling
Apache Orc - Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
LakeSoul - LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
hiveberg - Demonstration of a Hive Input Format for Iceberg
Apache Hive - Apache Hive