Druid VS materialize

Compare Druid and materialize and see how they differ.

                 Druid                 materialize
Mentions         24                    117
Stars            13,180                5,558
Growth           0.6%                  0.9%
Activity         9.9                   10.0
Last commit      5 days ago            2 days ago
Language         Java                  Rust
License          Apache License 2.0    GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Druid

Posts with mentions or reviews of Druid. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-28.
  • How to choose the right type of database
    15 projects | dev.to | 28 Feb 2024
    Apache Druid: Focused on real-time analytics and interactive queries on large datasets. Druid is well-suited for high-performance applications in user-facing analytics, network monitoring, and business intelligence.
  • Choosing Between a Streaming Database and a Stream Processing Framework in Python
    10 projects | dev.to | 10 Feb 2024
    Online analytical processing (OLAP) databases like Apache Druid, Apache Pinot, and ClickHouse shine at answering user-initiated analytical queries. For example, you might query historical data to find the most-clicked products over the past month. Compared with streaming databases, however, they may not be optimized for incremental computation, which makes it harder to keep results fresh. A query in a streaming database focuses on recent data, which makes it suitable for continuous monitoring. With a streaming database, you can run queries such as finding the top 10 sold products, where the “top 10 product list” might change in real time.
  • Show HN: The simplest tiny analytics tool – storywise
    3 projects | news.ycombinator.com | 18 Jul 2023
    https://github.com/apache/druid

    It's always a question of tradeoffs.

    The awesome-selfhosted project has a nice list of open-source analytics projects. It's really good inspiration to dig into these projects and find out about the technology choices that other open-source tools in the space have made.

  • Analysing Github Stars - Extracting and analyzing data from Github using Apache NiFi®, Apache Kafka® and Apache Druid®
    8 projects | dev.to | 11 Jan 2023
    Spencer Kimball (now CEO at CockroachDB) wrote an interesting article on this topic in 2021 where they created spencerkimball/stargazers based on a Python script. So I started thinking: could I create a data pipeline using NiFi and Kafka (two OSS tools often used with Druid) to get the API data into Druid - and then use SQL to do the analytics? The answer was yes! And I have documented the outcome below. Here’s my analytical pipeline for GitHub stars data using NiFi, Kafka and Druid.
  • Apache Druid® - an enterprise architect's overview
    1 project | dev.to | 15 Dec 2022
    Apache Druid is part of the modern data architecture. It uses a special data format designed for analytical workloads, using extreme parallelisation to get data in and get data out. A shared-nothing, microservices architecture helps you to build highly-available, extreme scale analytics features into your applications.
  • Real Time Data Infra Stack
    15 projects | dev.to | 4 Dec 2022
    Apache Druid
  • When you should use columnar databases and not Postgres, MySQL, or MongoDB
    5 projects | dev.to | 25 Oct 2022
    But then you realize there are other databases out there focused specifically on analytical use cases with lots of data and complex queries. Newcomers like ClickHouse, Pinot, and Druid (all open source) respond to a new class of problem: The need to develop applications using endpoints published on analytical queries that were previously confined only to the data warehouse and BI tools.
  • Druids by Datadog
    6 projects | news.ycombinator.com | 20 Sep 2022
    Datadog's product is a bit too close to Apache Druid to have named their design system so similarly.

    From https://druid.apache.org/ :

    > Druid unlocks new types of queries and workflows for clickstream, APM, supply chain, network telemetry, digital marketing, risk/fraud, and many other types of data. Druid is purpose built for rapid, ad-hoc queries on both real-time and historical data.

  • Mom at 54 is thinking about coding and a complete career shift. Thoughts?
    2 projects | /r/cscareerquestions | 18 Sep 2022
    Maybe rare for someone to be seeking their first coding job at that age. But plenty of us are in our 50s or older and still coding up a storm. And not necessarily ancient tech or anything. My current project exposes analytics data from Apache Druid and Cassandra via Go microservices hosted in K8s.
  • Building an arm64 container for Apache Druid for your Apple Silicon
    4 projects | dev.to | 8 Sep 2022
    Fortunately, it is super easy to build your own leveraging the binary distribution and existing docker.sh.

materialize

Posts with mentions or reviews of materialize. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-17.
  • Ask HN: How Can I Make My Front End React to Database Changes in Real-Time?
    8 projects | news.ycombinator.com | 17 Apr 2024
    [2] https://materialize.com/
  • Choosing Between a Streaming Database and a Stream Processing Framework in Python
    10 projects | dev.to | 10 Feb 2024
    To fully leverage the “data is the new oil” concept, companies require a special database designed to manage vast amounts of data instantly. This need has led to different database forms, including NoSQL databases, vector databases, time-series databases, graph databases, in-memory databases, and in-memory data grids. Recent years have seen the rise of cloud-based streaming databases such as RisingWave, Materialize, DeltaStream, and TimePlus. While they each have distinct commercial and technical approaches, their overarching goal remains consistent: to offer users cloud-based streaming database services.
  • Proton, a fast and lightweight alternative to Apache Flink
    7 projects | news.ycombinator.com | 30 Jan 2024
    > Materialize no longer provide the latest code as an open-source software that you can download and try. It turned from a single binary design to cloud-only micro-service

    Materialize CTO here. Just wanted to clarify that Materialize has always been source available, not OSS. Since our initial release in 2020, we've been licensed under the Business Source License (BSL), like MariaDB and CockroachDB. Under the BSL, each release does eventually transition to Apache 2.0, four years after its initial release.

    Our core codebase is absolutely still publicly available on GitHub [0], and our developer guide for building and running Materialize on your own machine is still public [1].

    It is true that we substantially rearchitected Materialize in 2022 to be more "cloud-native". Our new cloud offering offers horizontal scalability and fault tolerance—our two most requested features in the single-binary days. I wouldn't call the new architecture a microservices design though! There are only 2-3 services, each quite substantial, in the new architecture (loosely: a compute service, an orchestration service, and, soon, a load balancing service).

    We do push folks to sign up for a free trial of our hosted cloud offering [2] these days, rather than trying to start off by running things locally, as we generally want folks' first impression of Materialize to be of the version that we support for production use cases. An all-in-one single machine Docker image does still exist, if you know where to look, but it's very much use-at-your-own-risk, and we don't recommend using it for anything serious, but it's there to support e.g. academic work that wants to evaluate Materialize's capabilities to incrementally maintain recursive SQL queries.

    If folks have questions about Materialize, we've got a lively community Slack [3] where you can connect directly with our product and engineering teams.

    [0]: https://github.com/MaterializeInc/materialize/tree/main

  • What I Talk About When I Talk About Query Optimizer (Part 1): IR Design
    7 projects | news.ycombinator.com | 29 Jan 2024
  • We Built a Streaming SQL Engine
    3 projects | news.ycombinator.com | 21 Oct 2023
    Some recent solutions to this problem include Differential Dataflow and Materialize. It would be neat if postgres adopted something similar for live-updating materialized views.

    https://github.com/timelydataflow/differential-dataflow

    https://materialize.com/

  • Ask HN: Who is hiring? (October 2023)
    9 projects | news.ycombinator.com | 2 Oct 2023
    Materialize | Full-Time | NYC Office or Remote | https://materialize.com

    Materialize is an Operational Data Warehouse: A cloud data warehouse with streaming internals, built for work that needs action on what’s happening right now. Keep the familiar SQL, keep the proven architecture of cloud warehouses but swap the decades-old batch computation model for an efficient incremental engine to get complex queries that are always up-to-date.

    Materialize is the operational data warehouse built from the ground up to meet the needs of modern data products: Fresh, Correct, Scalable — all in a familiar SQL UI.

    Senior/Staff Product Manager - https://grnh.se/69754ebf4us

    Senior Frontend Engineer - https://grnh.se/7010bdb64us

    ===

    Investors include Redpoint, Lightspeed and Kleiner Perkins.

  • Ask HN: Who is hiring? (June 2023)
    14 projects | news.ycombinator.com | 1 Jun 2023
    Materialize | EM (Compute), Senior PM | New York, New York | https://materialize.com/

    You shouldn't have to throw away the database to build with fast-changing data. Keep the familiar SQL, keep the proven architecture of cloud warehouses, but swap the decades-old batch computation model for an efficient incremental engine to get complex queries that are always up-to-date.

    That is Materialize, the only true SQL streaming database built from the ground up to meet the needs of modern data products: Fresh, Correct, Scalable — all in a familiar SQL UI.

    Engineering Manager, Compute - https://grnh.se/4e14099f4us

    Senior Product Manager - https://grnh.se/587c36804us

    VP of Marketing - https://grnh.se/9caac4b04us

  • What are your favorite tools or components in the Kafka ecosystem?
    10 projects | /r/apachekafka | 31 May 2023
  • Ask HN: Who is hiring? (May 2023)
    13 projects | news.ycombinator.com | 1 May 2023
  • Dozer: A scalable Real-Time Data APIs backend written in Rust
    6 projects | /r/rust | 10 Apr 2023
    How does it compare to https://materialize.com/ ?
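
Several excerpts above contrast OLAP-style queries over stored history with queries that are kept up to date incrementally. The toy sketch below illustrates that distinction for the "top sold products" example. It is not how Druid or Materialize work internally; the names `top_k_batch` and `IncrementalTopK` and the sample events are hypothetical and exist only for illustration.

```python
from collections import Counter


def top_k_batch(events, k):
    """Batch style: rescan every stored (product, quantity) event per query."""
    totals = Counter()
    for product, qty in events:
        totals[product] += qty
    # Highest total first; ties broken by product name for a stable order.
    return sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


class IncrementalTopK:
    """Incremental style: keep running totals and update them per event.

    Past events are never rescanned. For simplicity, result() still sorts
    the current totals; a real engine would also maintain the ranking
    incrementally.
    """

    def __init__(self, k):
        self.k = k
        self.totals = Counter()

    def apply(self, product, qty):
        # A negative qty acts as a retraction (for example, a returned item).
        self.totals[product] += qty

    def result(self):
        return sorted(self.totals.items(), key=lambda kv: (-kv[1], kv[0]))[: self.k]


if __name__ == "__main__":
    # Hypothetical sales events: (product, quantity sold).
    events = [("a", 3), ("b", 5), ("a", 4), ("c", 1)]

    view = IncrementalTopK(k=2)
    for product, qty in events:
        view.apply(product, qty)

    print(top_k_batch(events, 2))  # full rescan of the history
    print(view.result())           # read from maintained state

    view.apply("c", 10)            # one new event changes the ranking
    print(view.result())
```

Both approaches return the same answer for the same history. The difference is where the work happens: the batch function pays on every query, while the incremental view pays a small cost on every event and answers from maintained state.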

What are some alternatives?

When comparing Druid and materialize you can also consider the following projects:

iced - A cross-platform GUI library for Rust, inspired by Elm

ClickHouse - ClickHouse® is a free analytics DBMS for big data

cube.js - 📊 Cube — The Semantic Layer for Building Data Applications

risingwave - Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.

Apache Cassandra - Mirror of Apache Cassandra

openpilot - openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.

Apache HBase - Apache HBase

rust-kafka-101 - Getting started with Rust and Kafka

egui - egui: an easy-to-use immediate mode GUI in Rust that runs on both web and native

dbt-expectations - Port(ish) of Great Expectations to dbt test macros

Scylla - NoSQL data store using the seastar framework, compatible with Apache Cassandra

scryer-prolog - A modern Prolog implementation written mostly in Rust.