Schemer
Apache Flink
| Schemer | Apache Flink | |
|---|---|---|
| - | 30 | |
| 116 | 26,068 | |
| 0.0% | 0.4% | |
| 0.0 | 9.9 | |
| over 6 years ago | 2 days ago | |
| Scala | Java | |
| Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Schemer
We haven't tracked posts mentioning Schemer yet.
Tracking mentions began in Dec 2020.
Apache Flink
-
Performance Test: Flink 1.19 vs. Spark 4.0 vs. Kafka Streams 3.8 Windowed Aggregation Throughput
Solution & Implementation: Migrated windowed aggregation from Kafka Streams 3.7 to Flink 1.19, reusing existing Kafka sources/sinks. Optimized Flink config: set parallelism to 16 (matching vCPU), enabled RocksDB state backend with 100ms watermark out-of-order tolerance, configured 5s checkpoints. Trained team on Flink watermarking and state management via Apache Flink GitHub repo documentation.
-
Why Apache IoTDB Is Written in Java: A Decade of Engineering Trade-offs
When IoTDB was initiated in 2011, almost all influential distributed systems and databases were built in Java or on the JVM—such as Hadoop, HBase, Spark (Scala on JVM), Cassandra, Kafka, and Flink. To integrate deeply with the big data ecosystem, choosing Java was a natural decision.
-
Apache Flink VS sail - a user suggested alternative
2 projects | 18 Mar 2026
-
AWS EKS Deployment: Real-Time Data Streaming Platform - 50K Events/Sec for $1,250/Month
Apache Flink: flink.apache.org
-
Gravitino - the unified metadata lake
In the meantime, other query engine support is on the roadmap, including Apache Spark, Apache Flink, and others.
-
Timestone: A Lightweight Java Library for Testing Time-Based Logic
Flink has org.apache.flink.util.clock.Clock abstract class for retrieving either absolute or relative time.
-
Towards Sub-100ms Latency Stream Processing with an S3-Based Architecture
Many stream processing systems today still rely on local disks and RocksDB to manage state. This model has been around for a while and works fine in simple, single-tenant setups. Apache Flink, for example, uses RocksDB as its default state backend - state is kept on local disks, and periodic checkpoints are written to external storage for recovery.
-
Introducing RisingWave's Hosted Iceberg Catalog-No External Setup Needed
Because the hosted catalog is a standard JDBC catalog, tools like Spark, Trino, and Flink can still access your tables. For example:
-
When plans change at 500 feet: Complex event processing of ADS-B aviation data with Apache Flink
I wrote a python based aircraft monitor which polls the adsb.fi feed for aircraft transponder messages, and publishes each location update as a new event into an Apache Kafka topic. I used Apache Flink — and more specially Flink SQL, to transform and analyse my flight data. The TL;DR summary is I can write SQL for my real-time data processing queries — and get the scalability, fault tolerance, and low latency managed by the Flink runtime.
-
My personal favorite MCP server which has became part of my life
GitHub: github.com/apache/flink
What are some alternatives?
GridScale - Scala library for accessing various file, batch systems, job schedulers and grid middlewares.
Deeplearning4j - Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
Sparkta - Real Time Analytics and Data Pipelines based on Spark Streaming
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
Gearpump - Lightweight real-time big data streaming engine over Akka
Deep Java Library (DJL) - An Engine-Agnostic Deep Learning Framework in Java