Real Time Data Infra Stack

This page summarizes the projects mentioned and recommended in the original post on dev.to.

  • elasticsearch-mapper-attachments

    Discontinued Mapper Attachments Type plugin for Elasticsearch

  • In general, there are two different approaches. One is to choose a common NoSQL database, such as MongoDB, Elasticsearch, or Apache Cassandra. All of these NoSQL databases scale well and can support complex queries. They are also very mature, so the learning curve is low for both use and operation.

  • core

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web. (by vuejs)

  • Vue.js

  • superset

    Apache Superset is a Data Visualization and Data Exploration Platform

  • Apache Superset

  • streamlit

    Streamlit — A faster way to build and share data apps.

  • Streamlit

  • redpanda

    Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

  • Redpanda

  • Redash

    Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

  • redash

  • Memgraph

    Open-source graph database, built for real-time streaming data, compatible with Neo4j.

  • Memgraph: Graph database built for real-time streaming and compatible with Neo4j

  • materialize

    The data warehouse for operational workloads. (by MaterializeInc)

  • Even though these technical stacks are listed by category, some categories overlap. For example, although Materialize is classified as a stream processor, it also makes sense to treat it as a serving layer because it is essentially a streaming database. The same is true for ksqlDB.

  • ApacheKafka

    A curated resources list for awesome Apache Kafka

  • The most common option is Kafka, but there are also various other open source projects and managed services. Note that the order of the following list does not reflect a recommendation.

    Apache Flink, which is often mentioned, is one of these options, and there are many others.

  • faust

    Python Stream Processing. A Faust fork (by faust-streaming)

  • Faust: Python framework

  • Druid

    Apache Druid: a high performance real-time analytics database.

  • Apache Druid

  • debezium

    Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

  • To capture data changes from these databases and deliver them to the stream platform, we often use Debezium.

  • ClickHouse

    ClickHouse® is a free analytics DBMS for big data

  • ClickHouse

  • beam

    Apache Beam is a unified programming model for Batch and Streaming data processing.

  • Apache Beam: Streaming framework that can run on several runners, such as Apache Flink and GCP Dataflow

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
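
The Debezium entry above mentions capturing database changes and delivering them to the stream platform. As a rough illustration only, the sketch below builds the kind of JSON payload that is typically registered with a Kafka Connect worker to start a Debezium PostgreSQL connector. It assumes Debezium 2.x property names (for example `topic.prefix`); the connector name, host, credentials, and table names are hypothetical placeholders, and the helper functions are not part of Debezium itself.

```python
import json


def build_debezium_connector(name, host, dbname, topic_prefix, tables):
    """Build a connector registration payload (a sketch, not an official helper).

    The property names follow the Debezium 2.x PostgreSQL connector;
    every value passed in here is a hypothetical placeholder.
    """
    return {
        "name": name,
        "config": {
            "connector.class": "io.debezium.connector.postgresql.PostgresConnector",
            "plugin.name": "pgoutput",          # logical decoding plugin built into PostgreSQL 10+
            "database.hostname": host,
            "database.port": "5432",
            "database.user": "debezium",        # assumed user name
            "database.password": "change-me",   # placeholder, never hard-code real secrets
            "database.dbname": dbname,
            "topic.prefix": topic_prefix,       # prefix of every change topic
            "table.include.list": ",".join(tables),
        },
    }


def topic_for(topic_prefix, qualified_table):
    """Change events for schema.table land in the topic <prefix>.<schema>.<table>."""
    return f"{topic_prefix}.{qualified_table}"


payload = build_debezium_connector(
    name="orders-cdc",                # hypothetical connector name
    host="postgres.internal",         # hypothetical host
    dbname="shop",
    topic_prefix="shop",
    tables=["public.orders", "public.customers"],
)

# This JSON would be sent to the Kafka Connect REST API (POST /connectors).
print(json.dumps(payload, indent=2))
print(topic_for("shop", "public.orders"))
```

Downstream consumers, such as a stream processor or a real-time analytics database from the list above, would then subscribe to topics like the one returned by `topic_for`.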
