What are your favorite tools or components in the Kafka ecosystem?

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

kafka-ml

1 139 7.9 Python

Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow and PyTorch!)

For example, CLIs, UIs, monitoring tools / integrations, cluster administration, stream processing libraries (Flink, Kafka Streams, smaller / newer libs), etc? Anything in the ML / AI space (e.g. a quick Google search came up with https://github.com/ertis-research/kafka-ml).
console

4 3,591 9.8 Go

Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing consumer groups, and exploring real-time data with time-travel debugging. (by redpanda-data)
InfluxDB

www.influxdata.com
sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
materialize

116 5,549 10.0 Rust

The data warehouse for operational workloads. (by MaterializeInc)
bytewax

18 1,127 9.8 Python

Python Stream Processing
river

17 4,754 9.2 Python

🌊 Online machine learning in Python

River - https://github.com/online-ml/river (Online machine learning, best used with Bytewax for Kafka integration)
python-fake-data-producer-for-apache-kafka

32 73 4.5 Python

The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and push it to an Apache Kafka topic.

Fake data utility - https://github.com/aiven/python-fake-data-producer-for-apache-kafka
debezium

80 9,843 9.9 Java

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Debezium: https://debezium.io/ (connector for cdc)
WorkOS

workos.com
sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
datagen

7 133 6.1 TypeScript

Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.

For fake data, shameless plug for https://github.com/MaterializeInc/datagen/tree/main
conduktor-poc-kafka-protocol

1 58 8.2 Java

POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.

They also provide an open-source Kafka proxy which can be used to enhance Kafka with 'interceptors'.
kloadgen

1 200 6.5 Java

KLoadGen is kafka load generator plugin for jmeter designed to work with AVRO, JSON and PROTOL-BUFFERS schema Registries. (by sngular)

Sngular kloadgen for fake/synthetic data and load testing is great if you already use java/jmeter

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project