Databricks platform for small data, is it worth it?

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering.

  • delta

    An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)

  • Currently, our infrastructure consists of some custom-made pipelines that load data into S3, and I use Delta tables here and there for their convenience: ACID transactions, time travel, merges, CDC, etc.

  • Rudderstack

    Privacy and Security focused Segment-alternative, in Golang and React

  • Disclaimer: I work for this company. You should check out Rudderstack. It’s free for up to 5M API calls, and it supports sending data to S3 or Databricks. I’m at the Databricks conference as I’m typing this.

  • dbt-duckdb

    dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

  • I like the idea of using DuckDB + dbt-duckdb.

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • [D] Is there other better data format for LLM to generate structured data?

    1 project | /r/MachineLearning | 10 Dec 2023
  • Rudderstack Switches to Elastic License

    1 project | news.ycombinator.com | 8 Sep 2023
  • Delta vs Iceberg: make love not war

    1 project | /r/MicrosoftFabric | 30 Jun 2023
  • Databricks Strikes $1.3B Deal for Generative AI Startup MosaicML

    4 projects | news.ycombinator.com | 26 Jun 2023
  • What is the role of data integration in a Customer Data Platform (CDP)?

    1 project | /r/u_asdfjohn31 | 16 Jun 2023