-
kafka-connect-file-pulse
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
This is the fourth and last article of the "Streaming data into Kafka" series. In the first three articles, we saw how easy it is to use Kafka Connect to load records from CSV, XML and JSON files into Apache Kafka without writing a single line of code. To do this, we used the Kafka Connect FilePulse connector, which comes packed with a lot of nice features to parse and transform data.
-
InfluxDB
InfluxDB is a high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
-
kafkacat
Discontinued. Generic command-line non-JVM Apache Kafka producer and consumer [Moved to: https://github.com/edenhill/kcat]
Note: In the example above, we used kafkacat to consume the topics. The option -o-1 makes it consume only the latest message.
-
Connect FilePulse brings the power of Grok expressions directly to Kafka Connect with the GrokFilter, which under the hood uses the Joni library, the Java port of the Oniguruma regexp library. It also provides a lot of predefined and reusable grok patterns. See the complete list of patterns.
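To make the idea of a grok expression concrete, here is a minimal sketch of how a reference like %{LOGLEVEL:loglevel} can be expanded into a named regex group. It is not how the GrokFilter is implemented (that relies on Joni in Java); it uses Python's re module purely for illustration. The three pattern definitions, the function names grok_to_regex and parse, and the sample log line are simplified assumptions, not the patterns shipped with Connect FilePulse.

```python
import re

# Hypothetical, heavily simplified pattern dictionary (an assumption for this
# sketch); real grok libraries ship many more patterns with stricter definitions.
GROK_PATTERNS = {
    "TIMESTAMP_ISO8601": r"\d{4}-\d{2}-\d{2}[T ]\d{2}:\d{2}:\d{2}(?:[.,]\d+)?",
    "LOGLEVEL": r"(?:TRACE|DEBUG|INFO|WARN|ERROR|FATAL)",
    "GREEDYDATA": r".*",
}

# Matches %{PATTERN} and %{PATTERN:field} references inside a grok expression.
GROK_REF = re.compile(r"%\{(\w+)(?::(\w+))?\}")


def grok_to_regex(expression):
    """Expand every grok reference into a plain (optionally named) regex group."""

    def expand(match):
        pattern_name, field = match.group(1), match.group(2)
        body = GROK_PATTERNS[pattern_name]
        # A reference with a field name becomes a named capture group.
        return f"(?P<{field}>{body})" if field else f"(?:{body})"

    return re.compile(GROK_REF.sub(expand, expression))


def parse(expression, line):
    """Return the captured fields as a dict, or None if the line does not match."""
    match = grok_to_regex(expression).fullmatch(line)
    return match.groupdict() if match else None


EXPRESSION = "%{TIMESTAMP_ISO8601:logdate} %{LOGLEVEL:loglevel} %{GREEDYDATA:message}"
record = parse(EXPRESSION, "2020-08-06 17:23:01,123 INFO Starting connector")
print(record)
```

The appeal of grok is visible even in this toy version: the expression reads as a list of named fields, while the regex details live once in a reusable dictionary.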
-
Good news! We have externalized the work done on the GrokFilter into a dedicated SMT called Kafka Connect Grok Transformation.
Related posts
-
Kafka Connect Filepulse 2.13.0 is now available! This version includes support for SFTP and Alibaba OSS. It also contains many bug fixes and improvements. 🚀
-
Download json and csv file from github repository with apache kafka
-
Kuvasz-streamer: open-source CDC for Postgres for low latency replication
-
Databrew Blink: Open-Source Database CDC Tool