What are your favorite tools or components in the Kafka ecosystem?

This page summarizes the projects mentioned and recommended in the original post on /r/apachekafka.

  • kafka-ml

    Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow and PyTorch!)

    For example, CLIs, UIs, monitoring tools / integrations, cluster administration, stream processing libraries (Flink, Kafka Streams, smaller / newer libs), etc? Anything in the ML / AI space (e.g. a quick Google search came up with https://github.com/ertis-research/kafka-ml).

  • console

    Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing consumer groups, and exploring real-time data with time-travel debugging. (by redpanda-data)

  • materialize

    The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data. (by MaterializeInc)

  • bytewax

    Python Stream Processing

  • river

    🌊 Online machine learning in Python

    River - https://github.com/online-ml/river (Online machine learning, best used with Bytewax for Kafka integration)

  • python-fake-data-producer-for-apache-kafka

    The Python fake data producer for Apache Kafka® is a complete demo app that lets you quickly produce fake streaming datasets in JSON and push them to an Apache Kafka topic.

    Fake data utility - https://github.com/aiven/python-fake-data-producer-for-apache-kafka

  • debezium

    Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

    Debezium: https://debezium.io/ (connector for CDC)

  • datagen

    Generate authentic-looking mock data based on a SQL, JSON, or Avro schema and produce it to Kafka in JSON or Avro format.

    For fake data, shameless plug for https://github.com/MaterializeInc/datagen/tree/main

  • conduktor-poc-kafka-protocol

    POC to demonstrate how to alter incoming/outgoing records in Kafka. It's a toy, don't use it in production.

    Conduktor also provides an open-source Kafka proxy which can be used to enhance Kafka with 'interceptors'.

  • kloadgen

    KLoadGen is a Kafka load generator plugin for JMeter designed to work with Avro, JSON, and Protocol Buffers schema registries. (by sngular)

    Sngular's KLoadGen is great for fake/synthetic data and load testing if you already use Java/JMeter.
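
Several of the projects listed here exist to generate fake or synthetic records for a Kafka topic. As a rough illustration of the idea only, here is a minimal Python sketch that builds JSON events and encodes them as key/value byte pairs, the shape a Kafka producer typically sends. It uses only the standard library and is not taken from any of the listed tools; the `fake_order` and `fake_messages` names and all event fields are hypothetical, and actually publishing the pairs would require a Kafka client library, which is left out.

```python
import json
import random
from datetime import datetime, timezone


def fake_order(rng: random.Random) -> dict:
    """Build one made-up order event (all field names are hypothetical)."""
    return {
        "order_id": rng.randint(1, 1_000_000),
        "customer": rng.choice(["alice", "bob", "carol"]),
        "amount": round(rng.uniform(1.0, 500.0), 2),
        "ts": datetime.now(timezone.utc).isoformat(),
    }


def fake_messages(count: int, seed: int = 0):
    """Yield (key, value) byte pairs shaped like Kafka messages."""
    rng = random.Random(seed)  # seeded so the ids, names, and amounts repeat
    for _ in range(count):
        event = fake_order(rng)
        key = str(event["order_id"]).encode("utf-8")
        value = json.dumps(event).encode("utf-8")
        yield key, value


if __name__ == "__main__":
    # Print a few messages instead of sending them to a broker.
    for key, value in fake_messages(3):
        print(key, value)
```

Keying each message by `order_id` is just one possible choice; any field that should decide partitioning could serve as the key.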
