Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pre-Hadoop Summit...

Stephan Ewen

Flink committer

co-founder / CTO @ data Artisans

@StephanEwen

Apache

Looking back one year

April 16, 2014

Stratosphere 0.4

Stratosphere Optimizer

Pact API (Java)

Stratosphere Runtime

DataSet API (Scala)

Local Remote

Batch processing on a pipelining engine, with iterations …

Looking at now…

Historic data

Kafka, RabbitMQ, ...

HDFS, JDBC, ...

ETL, Graphs,

Machine Learning

Relational, …

Low latency,

windowing,

aggregations, ...

Event logs

Real-time data

streams

What is Apache Flink?

(master)

What is Apache Flink?

Flink Optimizer

DataSet (Java/Scala) DataStream (Java/Scala)

Stream Builder Hadoop

Local Remote Yarn Tez Embedded

Flink Dataflow Runtime

RabbitMQ

HCatalog

Batch / Steaming APIs

case class Word (word: String, frequency: Int)

val lines: DataStream[String] = env.fromSocketStream(...)

lines.flatMap {line => line.split(" ")

.map(word => Word(word,1))}

.window(Count.of(1000)).every(Count.of(100))

.groupBy("word").sum("frequency")

.print()

val lines: DataSet[String] = env.readTextFile(...)

lines.flatMap {line => line.split(" ")

.map(word => Word(word,1))}

.groupBy("word").sum("frequency")

.print()

DataSet API (batch):

DataStream API (streaming):

Technology inside Flink

case class Path (from: Long, to:Long)val tc = edges.iterate(10) {

paths: DataSet[Path] =>val next = paths

.join(edges)

.where("to")

.equalTo("from") {(path, edge) =>

Path(path.from, edge.to)}.union(paths).distinct()

Cost-based

optimizer

Type extraction

scheduling

Recovery

metadata

Pre-flight (Client)

MasterWorkers

DataSourc

eorders.tbl

Filter

MapDataSourc

elineitem.tbl

JoinHybrid Hash

HTprobe

hash-part [0] hash-part [0]

GroupRed

forward

Program

Dataflow

Memory

manager

Out-of-core

Batch &

Streaming

State &

Checkpoints

deploy

operators

intermediate

results

Flink by Feature / Use Case

Data Streaming Analysis

Life of data streams

Create: create streams from event sources (machines, databases, logs, sensors, …)

Collect: collect and make streams available for consumption (e.g., Apache Kafka)

Process: process streams, possibly generating derived streams (e.g., Apache Flink)

Stream Analysis in Flink

13More at: http://flink.apache.org/news/2015/02/09/streaming-example.html

Defining windows in Flink

Trigger policy• When to trigger the computation on current window

Eviction policy• When data points should leave the window

• Defines window width/size

E.g., count-based policy• evict when #elements > n

• start a new window every n-th element

Built-in: Count, Time, Delta policies

Checkpointing / Recovery

Flink acknowledges batches of records

• Less overhead in failure-free case

• Currently tied to fault tolerant data sources (e.g., Kafka)

Flink operators can keep state

• State is checkpointed

• Checkpointing and record acks go together

Exactly one semantics for state

Checkpointing / Recovery

Chandy-Lamport Algorithm for consistent asynchronous distributed snapshots

Pushes checkpoint barriersthrough the data flow

Operator checkpointstarting

Checkpoint done

Data Stream

barrier

Before barrier =part of the snapshot

After barrier =Not in snapshot

Checkpoint done

checkpoint in progress

(backup till next snapshot)

Heavy ETL Pipelines

Heavy Data Pipelines

Complex ETL programs

Apology: Graph had to be blurred for

online slides, due to confidentiality

Memory Management

public class WC {public String word;public int count;

Pool of Memory Pages

Sorting,

hashing,

caching

Shuffling,

broadcasts

User code

objects

Flink contains its own memory management stack. Memory is

allocated, de-allocated, and used strictly using an internal buffer pool

implementation. To do that, Flink contains its own type extraction and

serialization components.

More at: https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=53741525

Smooth out-of-core performance

20More at: http://flink.apache.org/news/2015/03/13/peeking-into-Apache-Flinks-Engine-Room.html

Single-core join of 1KB Java objects beyond memory (4 GB)

Blue bars are in-memory, orange bars (partially) out-of-core

Benefits of managed memory

More reliable and stable performance (less GC effects, easy to go to disk)

Table API

val customers = envreadCsvFile(…).as('id, 'mktSegment).filter( 'mktSegment === "AUTOMOBILE" )

val orders = env.readCsvFile(…).filter( o => dateFormat.parse(o.orderDate).before(date) ).as('orderId, 'custId, 'orderDate, 'shipPrio)

val items = orders.join(customers).where('custId === 'id).join(lineitems).where('orderId === 'id).select('orderId,'orderDate,'shipPrio,

'extdPrice * (Literal(1.0f) - 'discount) as 'revenue)

val result = items.groupBy('orderId, 'orderDate, 'shipPrio).select('orderId, 'revenue.sum, 'orderDate, 'shipPrio)

Iterations in Data Flows

Machine Learning Algorithms

Iterate by looping

for/while loop in client submits one job per

iteration step

Data reuse by caching in memory and/or disk

Step Step Step Step Step

Client

Iterate in the Dataflow

partial

solution partial

solution X

datasets

Y initial

solution

iteration

result

Replace

Step function

Large-Scale Machine Learning

Factorizing a matrix with28 billion ratings forrecommendations

(Scale of Netflixor Spotify)

More at: http://data-artisans.com/computing-recommendations-with-flink.html

State in Iterations

Graphs and Machine Learning

Iterate natively with deltas

partial

solution

datasets

Y initial

solution

iteration

result

workset A B workset

Merge deltas

Replace

initial

workset

Effect of delta iterations…

5000000

10000000

15000000

20000000

25000000

30000000

35000000

40000000

45000000

1 6 11 16 21 26 31 36 41 46 51 56 61

iteration

… very fast graph analysis

… and mix and matchETL-style and graph analysisin one program

Performance competitivewith dedicated graph

analysis systems

More at: http://data-artisans.com/data-analysis-with-flink.html

Closing

Flink Roadmap for 2015

Out-of-core state in Streaming

Monitoring and scaling for streaming

Streaming Machine Learning with SAMOA

More additions to the libraries

• Batch Machine Learning

• Graph library additions (more algorithms)

SQL on top of expression language

Master failover32

Flink community

Aug-10 Feb-11 Sep-11 Apr-12 Oct-12 May-13 Nov-13 Jun-14 Dec-14 Jul-15

#unique contributor ids by git commits

flink.apache.org

@ApacheFlink

Backup

Cornerpoints of Flink Design

Robust Algorithms on

Managed Memory

Pipelined Execution

of Batch Programs

Better shuffle performance

No OutOfMemory Errors

Scales to very large JVMs

Efficient an robust processing

Flexible Data

Streaming Engine

Low Latency Steam Proc.

Highly flexible windows

Native Iterations

Very fast Graph Processing

Stateful Iterations for ML

High-level APIs,

beyond key/value pairs

Java/Scala/Python (upcoming)

Relational-style optimizer

Graphs / Machine Learning

Streaming ML (coming)

Scales to very large groups

Active Library Development

Program optimization

A simple program

val orders = … val lineitems = …

val filteredOrders = orders.filter(o => dataFormat.parse(l.shipDate).after(date)).filter(o => o.shipPrio > 2)

val lineitemsOfOrders = filteredOrders.join(lineitems).where(“orderId”).equalTo(“orderId”).apply((o,l) => new SelectedItem(o.orderDate, l.extdPrice))

val priceSums = lineitemsOfOrders.groupBy(“orderDate”).sum(“l.extdPrice”);

Two execution plans

DataSourceorders.tbl

Filter

Map DataSourcelineitem.tbl

JoinHybrid Hash

buildHT probe

broadcast forward

Combine

GroupRed

DataSourceorders.tbl

Filter

Map DataSourcelineitem.tbl

JoinHybrid Hash

buildHT probe

hash-part [0] hash-part [0]

hash-part [0,1]

GroupRed

forwardBest plan

depends on

relative sizes

of input files

Examples of optimization

Task chaining

• Coalesce map/filter/etc tasks

Join optimizations

• Broadcast/partition, build/probe side, hash or sort-merge

Interesting properties

• Re-use partitioning and sorting for later operations

Automatic caching

• E.g., for iterations

Visualization

Visualization tools

Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pre-Hadoop Summit...

Software

PragueJS meetups 30th anniversary

Toronto Meetups re-Invent Deck

Airheads Meetups- High density WLAN

Javier Lopez_Mihail Vieru - Flink in Zalando's World of Microservices - Flink Forward

Apache Flink的过去、现在和未来²尼- Apache... · Flink 1.9 的架构变化 Runtime Distributed Streaming Dataflow Query Processor DAG & StreamOperator Local Single JVM Cloud

Flink and Apache Spark Fernanda de Camargo Magano Dylan ... · Flink and Apache Spark Fernanda de Camargo Magano Dylan Guedes. About Flink ... Introduction to Apache Flink Book. Use

Flink Forward SF 2017: Eron Wright - Introducing Flink Tensorflow

Apache Flink: The Latest and GreatestApache Flink: The Latest and Greatest 2 Original creators of Apache Flink® Providers of the dA Platform, a supported Flink distribution The Latest

ArtsECO Teacher MeetUps · 2019-01-30 · arts.uwm.edu/arts-eco ArtsECO Teacher MeetUps Teacher MeetUps are FREE professional development and networking opportunities for Milwaukee

@mattcasters Kettle Past - Present - Future Project Hop ...blog.jortilles.com/wp-content/uploads/2019/11/kcm19-mattcasters... · Apache Flink Google Cloud DataFlow Local runners:

A Hybrid Systolic-Dataflow Architecture for Inductive ...jianw/hpca2020.pdfHybrid Systolic-Dataflow Hybrid Systolic-Dataflow Hybrid Systolic-Dataflow Fig. 3: Proposed Architecture

Startup Stage #10 - Meetups - International Cupons

Apache Flink Big Data Stream Processing · PDF fileApache Flink Big Data Stream Processing Tilmann Rabl ... Apache Flink! The case for Flink as a stream processor • Ideal basis for

Gradoop: Scalable Graph Analytics with Apache Flink @ Flink Forward 2015

TechCrunch Meetups 2016

Flink meetup

Tech. 2017 predictions presentation for meetups

StreamBox-HBM: Stream Analytics on High Bandwidth Hybrid ...pekhimenko/Papers/StreamBox-ASPLOS_19.pdf · as Flink [12], Spark Streaming [71], and Google Cloud Dataflow [5]. These

Visual Debugging of Dataflow Systems1181210/FULLTEXT01.pdf · distributed applications such as Spark, TensorFlow, Flink with a variety of input and output sources, e.g. Kafka, HDFS

Flink Forward SF 2017: Ted Dunning - Non-Flink Machine Learning on Flink