Query Real Time Data in Kafka Using SQL

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

risingwave

27 6,331 10.0 Rust

SQL stream processing, analytics, and management. PostgreSQL simplicity, unrivaled performance, and seamless elasticity. 🚀 10x more productive. 🚀 10x more cost-efficient.

In the demo tutorial, we'll leverage the following GitHub repository where we assume that all necessary things are set up using Docker compose.

Apache Spark

101 38,414 10.0 Scala

Apache Spark - A unified analytics engine for large-scale data processing

Additionally, one of the challenges of working with Kafka is how to efficiently analyze and extract insights from the large volumes of data stored in Kafka topics. Traditional batch processing approaches, such as Hadoop MapReduce or Apache Spark, can be slow and expensive, and may not be suitable for real-time analytics. To address this challenge, you can use SQL queries with Kafka to analyze and extract insights from the data in real time.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
redpanda

70 8,868 10.0 C++

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

RisingWave is an open-source distributed SQL database for stream processing. RisingWave accepts data from sources like Apache Kafka, Apache Pulsar, Amazon Kinesis, Redpanda, and databases via native Change data capture connections to MySQL and PostgreSQL sources. It uses the concept of materialized view that involves caching the outcome of your query operations and it is quite efficient for long-running stream processing queries.

Apache Pulsar

30 13,772 9.8 Java

Apache Pulsar - distributed pub-sub messaging system

RisingWave is an open-source distributed SQL database for stream processing. RisingWave accepts data from sources like Apache Kafka, Apache Pulsar, Amazon Kinesis, Redpanda, and databases via native Change data capture connections to MySQL and PostgreSQL sources. It uses the concept of materialized view that involves caching the outcome of your query operations and it is quite efficient for long-running stream processing queries.

materialize

117 5,585 10.0 Rust

The data warehouse for operational workloads. (by MaterializeInc)

Most streaming database technologies use SQL for these reasons: RisingWave, Materialize, KsqlDB, Apache Flink, and so on offering SQL interfaces. This post explains how to choose the right streaming database.

ApacheKafka

104 28 0.0

A curated re-sources list for awesome Apache Kafka

Apache Kafka is a distributed streaming platform that allows you to store and process real-time data streams. It is commonly used in modern data architectures to capture and analyze user interactions with web and mobile applications, as well as IoT device data, logs, and system metrics. It is often used for real-time data processing, data pipelines, and event-driven applications. However, querying data stored in Kafka can be challenging, especially for users who are more comfortable with SQL than with Kafka's native APIs. This is where the streaming SQL engine and database can be helpful. It is actually possible to run SQL directly on streaming data.

flink-statefun

18 495 3.2 Java

Apache Flink Stateful Functions

Most streaming database technologies use SQL for these reasons: RisingWave, Materialize, KsqlDB, Apache Flink, and so on offering SQL interfaces. This post explains how to choose the right streaming database.

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

We Built a Streaming SQL Engine

3 projects | news.ycombinator.com | 21 Oct 2023
What makes a time series oriented database (ex: QuestDB) more efficient for OLAP on time series than an OLAP "only" oriented database (ex: DuckDB) technically?

1 project | /r/dataengineering | 23 Jan 2023
How to handle partial updates and bulk updates in the source systems

1 project | /r/dataengineering | 5 Jan 2023
Headless BI with streaming data

2 projects | dev.to | 22 Sep 2022
Realtime query on Kafka?

1 project | /r/apachekafka | 12 Sep 2022

Query Real Time Data in Kafka Using SQL

This page summarizes the projects mentioned and recommended in the original post on dev.to
SQL Streaming Kafka Rust Database
Post date: 23 Mar 2023

risingwave

Apache Spark

InfluxDB

redpanda

Apache Pulsar

materialize

ApacheKafka

flink-statefun

SaaSHub

Related posts

We Built a Streaming SQL Engine

What makes a time series oriented database (ex: QuestDB) more efficient for OLAP on time series than an OLAP "only" oriented database (ex: DuckDB) technically?

How to handle partial updates and bulk updates in the source systems

Headless BI with streaming data

Realtime query on Kafka?

Query Real Time Data in Kafka Using SQL

This page summarizes the projects mentioned and recommended in the original post on dev.to SQL Streaming Kafka Rust Database Post date: 23 Mar 2023

Related posts

We Built a Streaming SQL Engine

What makes a time series oriented database (ex: QuestDB) more efficient for OLAP on time series than an OLAP "only" oriented database (ex: DuckDB) technically?

How to handle partial updates and bulk updates in the source systems

Headless BI with streaming data

Realtime query on Kafka?

This page summarizes the projects mentioned and recommended in the original post on dev.to
SQL Streaming Kafka Rust Database
Post date: 23 Mar 2023