Apache Flink

Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.

Flink includes several APIs for building applications on the Flink engine: the DataSet API for static data, embedded in Java, Scala, and Python; the DataStream API for unbounded streams, embedded in Java and Scala; and the Table API, with a SQL-like expression language, embedded in Java and Scala. Flink also bundles libraries for domain-specific use cases: a Machine Learning library and Gelly, a graph processing API and library. Flink integrates with other well-known open source systems for data input and output as well as for deployment. Flink was previously known as Stratosphere.

Apache Storm

Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unboun ...

Heron

Heron is a realtime, distributed, fault-tolerant stream processing engine from Twitter.

Gearpump

Apache Gearpump is a real-time big data streaming engine. The name Gearpump is a reference to the engineering term “gear pump” ...