bitsail
scotty-window-processor
bitsail | scotty-window-processor | |
---|---|---|
1 | 1 | |
1,584 | 75 | |
1.1% | - | |
6.6 | 4.6 | |
5 months ago | 9 months ago | |
Java | Java | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
bitsail
scotty-window-processor
-
Implementing a “Lookback” Window Using Apache Flink’s KeyedProcessFunction
This concept is similar to a sliding window with a small step size, but with a more memory-efficient implementation. By using “slice sharing” instead of duplicating events into every overlapping window, the memory footprint is reduced. Scotty window processor is an open-source implementation of memory-efficient window aggregations with connectors for popular stream processors like Flink. This is a promising avenue for approximating a “lookback” window when aggregations like count, sum or histogram are required.
What are some alternatives?
incubator-seatunnel - SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time). [Moved to: https://github.com/apache/seatunnel]
LakeSoul - LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
kestra - Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
jdsp - A Java Library for Digital Signal Processing
flink-kubernetes-operator - Apache Flink Kubernetes Operator
SREWorks - Cloud Native DataOps & AIOps Platform | 云原生数智运维平台
Apache Flink - Apache Flink
Zeppelin - Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
seatunnel - SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
flink-remote-shuffle - Remote Shuffle Service for Flink