kafkat
KafkaT-ool (by airbnb)
secor
Secor is a service implementing Kafka log persistence (by pinterest)
kafkat | secor | |
---|---|---|
1 | 3 | |
503 | 1,835 | |
0.4% | 0.2% | |
0.0 | 0.0 | |
almost 5 years ago | 5 days ago | |
Ruby | Java | |
Apache License 2.0 | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
kafkat
Posts with mentions or reviews of kafkat.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-01-01.
-
ELT, Data Pipeline
Few tips before I close this blog post, always have enough memory in Kafka server as it is very memory intensive and tends to shutdown gracefully every time it hits heap size limit without indicating anything in the logs. For monitoring your cluster use Kafka Manager, it does the job very well and also have KafkaT on server which saves you from running cumbersome builtin Kafka commands.
secor
Posts with mentions or reviews of secor.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-11-07.
-
Kafka to GCS Persistence Tools
secor: Seems a bit on the older side
-
Storing kakfa messages to dynamoDb
Since you are trying to archive this data, a simpler solution is to use something like secor to archive the data in kafka to S3. It’s much cheaper than dynamodb too.
-
ELT, Data Pipeline
Once we had our producer working for Kafka , it was time for a consumer to start pulling data and push it to GCS. With some research over at Github we found Secor from Pinterest to be a viable option for our use. Though it being a great piece of software, it wasn't mapping ideally to our design, for that purpose we had to submit few Pull requests to make the necessary changes to the secor project for our use and the greater good of the open source community. From updating the docs (PR268, PR271, PR277) on how to set it up to adding flexible upload directory structure with hourly support (PR275) and support for partitioned parser with no offset folder (PR279), also added flexible delimited file reader, writer option (PR291) for better control over file structure. Below diagram is our current ELT pipeline running in production.
What are some alternatives?
When comparing kafkat and secor you can also consider the following projects:
Scio - A Scala API for Apache Beam and Google Cloud Dataflow.
fluent-plugin-kafka - Kafka input and output plugin for Fluentd
Apache Kafka - Mirror of Apache Kafka
Thingsboard - Open-source IoT Platform - Device management, data collection, processing and visualization.
graylog - Free and open log management
gcs-connector-for-apache-kafka - Aiven's GCS Sink Connector for Apache Kafka®