-
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)
Currently our infrastructure consists of custom-made pipelines that load data into S3, and I use Delta tables here and there for their convenience: ACID transactions, time travel, merges, CDC, etc.
Disclaimer: I work for this company. You should check out RudderStack. It's free for up to 5M API calls, and it supports sending data to S3 or Databricks. I'm at the Databricks conference as I'm typing this.
I like the idea of using DuckDB + dbt-duckdb.