conduktor-poc-kafka-protocol
datagen
conduktor-poc-kafka-protocol | datagen | |
---|---|---|
1 | 7 | |
59 | 135 | |
- | 3.0% | |
7.7 | 6.1 | |
13 days ago | 2 months ago | |
Java | TypeScript | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
conduktor-poc-kafka-protocol
-
What are your favorite tools or components in the Kafka ecosystem?
They also provide an open-source Kafka proxy which can be used to enhance Kafka with 'interceptors'.
datagen
-
What are your favorite tools or components in the Kafka ecosystem?
For fake data, shameless plug for https://github.com/MaterializeInc/datagen/tree/main
- What are some good publicly available real-time data sources?
-
Simulating Streaming Data for Fraud Detection with Datagen CLI
Building and testing a real-time fraud detection application requires a continuous stream of realistic data. But generating that data can be a challenge. That's why we recently created the Datagen CLI, a simple tool that helps you create believable fake data using the FakerJS API.
-
How train my SQL skills with real world data engineering problems ?
Generate fake data with a normalized schema of your choosing with this tool from Materialize, then denormalize it and build a warehouse model.
- FLiPN-FLaNK Stack Weekly for 20 March 2023
- Datagen CLI: Stream Fake Relational Data
What are some alternatives?
kafka-ml - Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow and PyTorch!)
ChatGLM-6B - ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
river - 🌊 Online machine learning in Python
CloudDemo2023 - 2023 Demos
bytewax - Python Stream Processing
halp - A CLI tool to get help with CLI tools 🐙
materialize - The data warehouse for operational workloads.
awesome-public-real-time-datasets - A list of publicly available datasets with real-time data maintained by the team at bytewax.io
console - Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing consumer groups, and exploring real-time data with time-travel debugging.
RedfinScraper - Scrapes Redfin data.
python-fake-data-producer-for-apache-kafka - The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and push it to an Apache Kafka topic.
cf-url-shortener - URL Shortener Cloudflare function that uses Upstash Redis and Kafka along with https://materialize.com