Apache Flink Cheatsheet

What is Apache Flink?

Apache Flink is a distributed stream-processing engine built for stateful computations over unbounded and bounded data streams. Unlike batch systems that wait for all data before processing, Flink processes each event as it arrives - with millisecond latency - while maintaining persistent state across millions of keys and guaranteeing exactly-once correctness even after failures.

Its unified API handles both streaming (continuous, never-ending) and batch (finite, historical) workloads with the same SQL or DataStream code. State is first-class: Flink checkpoints the entire pipeline to durable storage periodically, so any failure is fully recoverable with no data loss or duplication.

When Flink is the right fit

Real-time analytics - dashboards and metrics that update as events arrive, not on a cron delay
Event-driven pipelines - react to fraud signals, alerts, or threshold breaches the moment they happen
CDC & data sync - replicate database changes to a warehouse or search index with sub-second lag
Stream enrichment - join a high-volume event stream against a slowly-changing reference table at scale
Sessionisation - group user activity into sessions with dynamic, activity-driven boundaries
Materialized views - keep a precomputed result set continuously fresh without full recomputation
Large stateful jobs - per-key state across billions of keys backed by RocksDB on disk
Exactly-once ETL - end-to-end correctness guarantees into Kafka, Iceberg, or JDBC targets

Less ideal for: pure ad-hoc SQL queries (use Trino/Athena), simple periodic batch reports with no streaming requirement (use Spark), or sub-millisecond latency (use in-memory DBs).

Often replaces: Apache Spark Streaming / Structured Streaming, Apache Storm, Apache Samza, or hand-rolled Kafka consumer apps with bespoke state management.

Project resources

Codegithub.com/apache/flink
Docsnightlies.apache.org/flink-docs-stable
Issuesissues.apache.org/jira/FLINK
Communityflink.apache.org/community
Slackapache-flink.slack.com
Mailing listuser@flink.apache.org
Stack Overflowtag: apache-flink

Releases & stats

2.1.0 2.0.0 1.20.x 1.19.x 1.18.x 1.17.x Full history →

2.1.0

Latest (Apr 2026)

~24k

GitHub stars

~1,800

Contributors

2014

ASF top-level since