
High-throughput, low-latency, and exactly-once stream processing with Apache Flink

Sep 15

Zehao Wu

Streaming infrastructure

The popularity of stream data platforms is skyrocketing

A good stream processor needs two main properties:

High throughput across a wide spectrum of latencies

Strong consistency guarantees, even in the presence of stateful computations

What makes a good stream system?

Exactly-once guarantees

Low latency

High throughput

Powerful computation model

Low overhead of the fault tolerance mechanism in the absence of failures

Flow control

Record acknowledgements

Apache Storm

Each operator sends an acknowledgement back to the previous (upstream) operator for every record it processes

The source keeps a backup of all the tuples it has emitted

Once acknowledgements have been received from all downstream operators, the backup of a tuple is discarded

If not all acknowledgements arrive in time, the tuple is replayed from the backup (sketched below)
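
A minimal sketch of this bookkeeping using Storm's spout ack/fail hooks (org.apache.storm packages, Storm 2.x open() signature assumed); ReplayableSpout and fetchNextRecord() are hypothetical names used only for illustration:

```java
import java.util.HashMap;
import java.util.Map;
import java.util.UUID;

import org.apache.storm.spout.SpoutOutputCollector;
import org.apache.storm.task.TopologyContext;
import org.apache.storm.topology.OutputFieldsDeclarer;
import org.apache.storm.topology.base.BaseRichSpout;
import org.apache.storm.tuple.Fields;
import org.apache.storm.tuple.Values;

// Hypothetical spout: keep a backup of every emitted tuple until it is
// fully acked; replay it from the backup if the topology reports a failure.
public class ReplayableSpout extends BaseRichSpout {
    private SpoutOutputCollector collector;
    private final Map<String, String> pending = new HashMap<>(); // msgId -> backed-up tuple

    @Override
    public void open(Map<String, Object> conf, TopologyContext ctx, SpoutOutputCollector collector) {
        this.collector = collector;
    }

    @Override
    public void nextTuple() {
        String record = fetchNextRecord();              // hypothetical upstream read
        if (record == null) return;
        String msgId = UUID.randomUUID().toString();
        pending.put(msgId, record);                     // source keeps a backup
        collector.emit(new Values(record), msgId);      // msgId enables ack tracking
    }

    @Override
    public void ack(Object msgId) {
        pending.remove(msgId);                          // acked everywhere downstream: discard backup
    }

    @Override
    public void fail(Object msgId) {
        String record = pending.get(msgId);             // not all acks arrived in time: replay
        if (record != null) collector.emit(new Values(record), msgId);
    }

    @Override
    public void declareOutputFields(OutputFieldsDeclarer declarer) {
        declarer.declare(new Fields("record"));
    }

    private String fetchNextRecord() { return null; }   // placeholder for a real source
}
```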

Micro batches

Apache Storm Trident, Apache Spark Streaming

The stream is broken down into a series of small, atomic batch jobs called micro-batches

Each micro-batch either succeeds or fails as a whole

On a failure, the latest micro-batch can simply be recomputed (see the sketch below)
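
A minimal Spark Streaming (DStream) sketch of the micro-batch model; the localhost:9999 socket source and one-second interval are placeholders, not details from the presentation:

```java
import org.apache.spark.SparkConf;
import org.apache.spark.streaming.Durations;
import org.apache.spark.streaming.api.java.JavaDStream;
import org.apache.spark.streaming.api.java.JavaStreamingContext;

// The one-second batch interval chops the stream into micro-batches;
// each one runs as a small Spark job that succeeds or fails as a unit.
public class MicroBatchCount {
    public static void main(String[] args) throws InterruptedException {
        SparkConf conf = new SparkConf().setAppName("MicroBatchCount").setMaster("local[2]");
        JavaStreamingContext ssc = new JavaStreamingContext(conf, Durations.seconds(1));

        JavaDStream<String> lines = ssc.socketTextStream("localhost", 9999); // placeholder source
        JavaDStream<Long> counts = lines.count();   // one count per micro-batch
        counts.print();

        ssc.start();
        ssc.awaitTermination();
    }
}
```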

Transactional updates

Google Cloud Dataflow

Details will be covered in the next presentation

Distributed Snapshots

Apache Flink

Based on the Chandy-Lamport algorithm: regular processing keeps going while checkpoints happen in the background

Periodically draw a consistent snapshot of the distributed state

Store that snapshot in durable storage

On failure, restore from durable storage, rewind the stream source, and hit the play button again (see the sketch below)
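
A hedged sketch of switching on Flink's distributed snapshots with enableCheckpointing; the interval, socket source, and job name are placeholders:

```java
import org.apache.flink.streaming.api.CheckpointingMode;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

// Barrier-based snapshots run in the background while records keep flowing;
// each completed checkpoint is written to durable storage.
public class CheckpointedJob {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        // draw a consistent snapshot every 10 seconds with exactly-once semantics
        env.enableCheckpointing(10_000, CheckpointingMode.EXACTLY_ONCE);

        env.socketTextStream("localhost", 9999)   // placeholder; a replayable source (e.g. Kafka) is needed for true exactly-once
           .print();

        env.execute("checkpointed-job");
    }
}
```

On failure, Flink restores operator state from the latest completed checkpoint and replays the source from the corresponding position, which is the "rewind and hit play" step above.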

Lacking

High availability

Event time and watermarks

Improved monitoring of running jobs

Pros

Fast

Reliable

Expressive

Easy to use

Scalable

Hadoop-compatible

Cons

Leans too heavily on traditional database designs

Forces a declarative style, which makes it harder to program

The community is not as active as Spark's

Recommended