-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Side Projects: Golang, Makefile for orchestration, Parquet on self hosted Minio or R2, DuckDB, and go-hsnw, for vectors. If I'm not self hosting, Google Cloud Autopilot Kubernetes is my preferred cloud deployment environment.
I distilled my open data stack into four core tools: airbyte, dbt, metabase and dagster (blog, github, example)
Or in a side project: I combined an extensive amount of tools for web-scraping real-estates, uploading them to S3 with MinIO, Spark, and Delta Lake, adding some Data Science magic with Jupyter Notebooks, ingesting into Data Warehouse Apache Druid, visualizing dashboards with Superset and managing everything with Dagster (blog, github)