GRASS : Trimming Stragglers in Approximation Analytics

GRASS: Trimming Stragglers in Approximation Analytics

Ganesh Ananthanarayanan, Michael Hung, Xiaoqi Ren, Ion Stoica, Adam Wierman, Minlan Yu

Next Generation of Analytics

• Timely results, even if approximate– Data deluge makes this necessary

Optimal Scheduler

Approximation Dimensions

Error: Minimize time to get desired accuracy “#cars sold to the nearest thousand”

Deadline: Maximize accuracy within deadline“Pick the best ad to display within 2s”

*w.r.t. state-of-the-art schedulers (production workloads from Facebook and Bing)

Improve accuracy by 48% Speedup by 40%

• Prioritize tasks– Subset of tasks to complete– #tasks » #slots (multi-waved jobs)

(NP-Hard but many known heuristics…)

• Straggler tasks– Slowest task can be 8x slower than median task– Speculation: Spawn a duplicate, earliest wins• Google[OSDI’04], FB[OSDI’08], Microsoft[OSDI’10]

Scheduling Challenge

Challenge: dynamically prioritize between speculative & unscheduled tasksto meet deadline/error bound

Speculative copies consume extra resources

Opportunity Cost

Slot 1

Slot 2

Slot 3 T1

50 109

Is speculation worth the

payoff?

Roadmap

1. Two natural scheduling designs

2. GRASS: Combining the two designs

3. Evaluation of GRASS

Greedy Scheduling (GS)

Greedily improve accuracy, i.e., earliest finishing task

T2 T3time

Slot 1

Slot 2

T4 T5 T6

Task ID T1 T2 T3 T4 T5 T6 T7 T8 T9Time

remaining5 --- --- --- --- --- --- --- ---

New copy 2 --- 1 1 1 1 1 1 3

T1Deadline = 6

(at time =1 )

Accuracy = 7/9Straggler

Resource Aware Scheduling (RAS)

Speculate only if it saves time and resources

Slot 1

Slot 2

Task ID T1 T2 T3 T4 T5 T6 T7 T8 T9Time

remaining5 --- --- --- --- --- --- --- ---

New copy 2 --- 1 1 1 1 1 1 3

T1Deadline = 6

(at time =1 )

Accuracy = 8/9

One copy for 5s (vs.) Two copies for 2s

Straggler

GS vs. RAS

T2 T3time

Slot 1

Slot 2

T4 T5 T6

T7Deadline = 6

Accuracy = 7/9

Slot 1

Slot 2

T8Deadline = 6

Accuracy = 8/9

Deadline = 3

Accuracy = 3/9T1

Accuracy = 2/9

10 3 6

Neither GS nor RAS is uniformly better

Intuition: Use RAS early in the job (be “conservative”),

switch to GS towards the end (be “aggressive”)

Theoretical Scheduling Model

• Multi-waved scheduling of tasks– Constant wave-width– Agnostic to fairness policies– Heavy-tailed (Pareto) distribution of task durations

• Speculation: GS, RAS, Switching, Optimal

Theorem:Using RAS when >2 waves of tasks remain, and

GS when ≤2 waves of tasks remainis “near-optimal”

How to estimate two remaining waves?

• Wave boundaries are not strict– Non-uniform task durations

• Wave-width is not constant

Start with RAS and switch to GS close to the deadline/error-bound

GSRASRAS GSRAS

Learning the switching point

• GS-only and RAS-only job samples– “Exploration vs. Exploitation”– Multi-armed bandit solution, ɛ = 0.1

RAS[4s]+GS[2s]RAS[5s]+GS[1s]

RAS[6s]

Switch

Deadline5

GRASS (= GS + RAS) Scheduler

• Opportunity Cost in speculation for stragglers– GS Greedy Scheduling– RAS Resource Aware Scheduling

• Switch RASGS close to deadline/error-bound– Learn switching point empirically from job samples

• Provably near-optimal in theoretical model

Implementation

• Hadoop 0.20.2 and Spark 0.7.3– Modified Fair Scheduler– Job bins with GS-only and RAS-only samples

• Task Estimators– Remaining time is extrapolated from data-to-process• progress reports at 5% intervals

– New copy’s time is sampled from completed tasks

How well does GRASS perform?

• Workload from Facebook and Bing traces– Hadoop and Dryad production jobs– Added deadlines and error bounds

• Baselines: LATE & Mantri

• 200 node EC2 deployment (m2.2xlarge instances)

Accuracy of deadline-bound jobs improve by 47%

Gains hold across deadlines (lenient and stringent )

GRASS is 22% better than statically picking GS or RAS

… and is near-optimal

Error-bound Jobs

• Overall speedup of 38% (optimal is 40%)– Gains hold across all error bounds

• Exact jobs (0% error-bound) speed up by 34%

Unified Straggler Mitigation

Conclusion

• Next gen. of analytics: Approximate but timely results • Challenge: Dynamic and unpredictable stragglers

• GRASS – Conservative speculation early in the job; aggressive towards its end

• Evaluation with Hadoop & Spark– Accuracy of deadline-bound jobs improve by 47%– Error-bound jobs speed up by 38%

GRASS : Trimming Stragglers in Approximation Analytics

Documents

GRASS : Trimming Stragglers in Approximation …...GRASS : Trimming Stragglers in Approximation Analytics Ganesh Ananthanarayanan, Michael Hung, Xiaoqi Ren, Ion Stoica, Adam Wierman,

Pixel threshold trimming

SHEARS BONSAI SHEARS TRIMMING SCISSORS Y-5 TRIMMING SCISSORS 180mm TRIMMING SCISSORS CHROME-PLATED FINISH TRIMMING SCISSORS BONSAI SHEARS BONSAI SHEARS LONG BLADE

Trimming the Hedges

Hedge and edge trimming shears Manual.pdf · Trimming shear blade Use the trimming blade for trimming and shaping hedges, shrubs and bushes. Cutting shrubs • Before cutting, check

Tree Trimming Austin

Hoof Trimming Clinic for Dairy Veterinarians - in1touchabvma.in1touch.org/uploaded/web/Hoof Trimming Clinic for Dairy... · Hoof Trimming Clinic for Dairy Veterinarians . ... 10:00

Another Non-segregated Blue Stragglers Population in a Globular Cluster: the Case of NGC2419 Another Non-segregated Blue Stragglers Population in a Globular

GRASS: Trimming Stragglers in Approximation Analyticsusers.cms.caltech.edu/~adamw/papers/straggler.pdfSpark [15] (for interactive jobs). We evaluate GRASS using production workloads

Innova Manual Trimming - Marel · PDF fileInnova Manual Trimming Trimming for manual lines ... Database used SQL Server 2005 Express, ... Marel M2000-A73 input scales

Beyond Blue Stragglers - Kepler & K2 Science Center · 2018. 1. 23. · Beyond Blue Stragglers: K2 Observations Reveal Post-Mass-Transfer Binaries Hidden on the M67 Main-Sequence

Collage Inference: Tolerating Stragglers in Distributed ... · stragglers in distributed image classification. We propose en-hanced single shot object detection models, Collage-CNN

Approximation Algorithms. 2301681Approximation Algorithms2 Outlines Why approximation algorithm? Approximation ratio Approximation vertex cover

Barefoot Trimming Trimming the Heelsthehorseshoof.com/32_TrimmingHeels.pdfBarefoot Trimming Trimming the Heels by James Welz F or guidance on how to trim the heels of our domestic

ESSEX STRAGGLERS ORIENTEERING SOCIETY … stragglers orienteering society annual general meeting 2014 ... 858 675----12 16 6,407 ... 215 38 400 76 11

On data skewness, stragglers, and MapReduce …On data skewness, stragglers, and MapReduce progress indicators Emilio Coppa Irene Finocchi Computer Science Department, Sapienza University

5785451 BodyCruzer NA S1 - BraunThe BodycruZer is not intended for removing facial hair or scalp hair. Trimming Trimming on the skin/contour trimming For trimming precise lines and

Cord & Trimming - Piping

TREEPOWER programsfrom acrossthenation · portanceoftree-trimming,care andplacementtoensuresafety. APPA’sTREEPOWERprogram ... Utah annuallysendsoutflyersontree trimming,“PlantingtheRightTree”

The blue stragglers formed via mass transfer in old open clusters