Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
https://github.com/bitsondatadev/trino-getting-started/tree/main/delta-lake => Trino (Presto "equivalent") + delta lake format + Minio (s3 equivalent)
Spark + dbt => https://github.com/dbt-labs/dbt-spark/blob/main/docker-compose.yml
There is an open PR for delta docker => https://github.com/delta-io/delta/pull/922
Related posts
- [D] Is there other better data format for LLM to generate structured data?
- Delta vs Iceberg: make love not war
- Databricks Strikes $1.3B Deal for Generative AI Startup MosaicML
- Medallion/lakehouse architecture data modelling
- whenNotMatchedBySourceUpdate not existing? Trying to upsert parquet into Delta table