Benchmarking (RICON 2014)


DESCRIPTION

Knowing how to set up good benchmarks is invaluable for understanding the performance of a system. Writing correct and useful benchmarks is hard, and verifying the results is difficult and error-prone. When done right, benchmarks guide teams to improve the performance of their systems. When done wrong, hours of effort can yield a worse-performing application, upset customers, or worse. In this talk, we will discuss what you need to know to write better benchmarks for distributed systems. We will look at examples of bad benchmarks and learn which biases can invalidate the measurements, in the hope of applying our new-found skills correctly and avoiding such pitfalls in the future.


Benchmarking: You're Doing It Wrong

Aysylu Greenberg @aysylu22

To Write Good Benchmarks…

Need to be Full Stack

   

your process vs. Goal
your process vs. Best Practices

Benchmark = How Fast?

Today  

• How Not to Write Benchmarks
• Benchmark Setup & Results:
  - You're wrong about machines
  - You're wrong about stats
  - You're wrong about what matters
• Becoming Less Wrong
• Having Fun with Riak

HOW NOT TO WRITE BENCHMARKS

Website Serving Images

• Access 1 image 1000 times
• Latency measured for each access
• Start measuring immediately
• 3 runs
• Find mean
• Dev environment
(code sketch below)

[Diagram: Web Request → Server → S3 Cache]
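As a minimal sketch, here is roughly what the setup above looks like in code, assuming a Python client and a hypothetical dev-server URL (neither appears in the talk); the sections that follow explain why nearly every choice in it is a problem:

```python
import statistics
import time

import requests  # assumed HTTP client; any would do

URL = "http://dev-server.local/images/cat.jpg"  # hypothetical single image on a dev box

def naive_run(n=1000):
    latencies = []
    for _ in range(n):                       # same image every time: caches soak up the work
        start = time.perf_counter()          # measuring from the very first request, no warmup
        requests.get(URL)
        latencies.append(time.perf_counter() - start)
    return latencies

# 3 runs, report the mean of everything: the setup the talk goes on to critique
runs = [naive_run() for _ in range(3)]
print("mean latency:", statistics.mean(x for run in runs for x in run))
```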

WHAT'S WRONG WITH THIS BENCHMARK?

YOU'RE WRONG ABOUT THE MACHINE

Wrong About the Machine

• Cache, cache, cache, cache!

It's Caches All The Way Down


[Charts: Caches in Benchmarks, Prof. Saman Amarasinghe, MIT 2009]


Wrong About the Machine

• Cache, cache, cache, cache!
• Warmup & timing
• Periodic interference
• Test != Prod
• Power mode changes
(sketch below)
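A hedged sketch of how the example benchmark could address a few of these points: discard warmup iterations, spread requests over many images so caches are exercised the way production would exercise them, and aim the run at a production-like environment. The warmup count, image count, and staging URL are illustrative assumptions, not from the talk.

```python
import random
import time

import requests

BASE = "http://staging.example.com/images/"       # production-like environment, not a dev laptop
IMAGE_IDS = [f"{i}.jpg" for i in range(10_000)]   # many objects, so one cached image can't dominate

def measured_run(n=1000, warmup=200):
    latencies = []
    for i in range(warmup + n):
        url = BASE + random.choice(IMAGE_IDS)
        start = time.perf_counter()               # monotonic, high-resolution clock
        requests.get(url)
        elapsed = time.perf_counter() - start
        if i >= warmup:                           # drop warmup: connection pools, JITs, caches settle first
            latencies.append(elapsed)
    return latencies
```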

YOU'RE WRONG ABOUT THE STATS

Wrong About Stats

• Too few samples

[Chart: Convergence of Median on Samples (latency vs. time), comparing stable samples and stable median with decaying samples and decaying median]
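A small sketch of the idea behind the chart: instead of fixing the sample count up front, keep collecting until a running median stops moving. The window size and 1% tolerance are assumptions, not values from the talk.

```python
import statistics

def median_converged(samples, window=100, tolerance=0.01):
    """True if the median of all samples differs from the median computed
    without the most recent `window` samples by less than `tolerance` (relative)."""
    if len(samples) < 2 * window:
        return False                     # not enough data to judge convergence yet
    current = statistics.median(samples)
    previous = statistics.median(samples[:-window])
    return abs(current - previous) <= tolerance * previous

# Usage: append latencies in the measurement loop and stop once
# median_converged(latencies) has held for a few consecutive checks.
```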


[Chart: Multimodal Distribution (# occurrences vs. latency), with modes near 5 ms and 10 ms and the 50th and 99th percentiles marked]

Wrong About Stats

• Too few samples
• Gaussian (not)
• Multimodal distribution
• Outliers
(sketch below)
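A sketch of summarizing the same samples without assuming a Gaussian: report percentiles and the maximum rather than a single mean, which a multimodal distribution or a handful of outliers can render meaningless. The particular percentiles are conventional choices, not prescribed by the talk.

```python
import statistics

def summarize(latencies):
    qs = statistics.quantiles(latencies, n=100)   # 99 cut points: qs[49] ~ p50 ... qs[98] ~ p99
    return {
        "p50": qs[49],
        "p95": qs[94],
        "p99": qs[98],
        "max": max(latencies),                    # the outliers a mean would average away
    }
```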

YOU'RE WRONG ABOUT WHAT MATTERS

Wrong About What Matters

• Premature optimization

“Programmers waste enormous amounts of time thinking about … the speed of noncritical parts of their programs ... Forget about small efficiencies … 97% of the time: premature optimization is the root of all evil. Yet we should not pass up our opportunities in that critical 3%.”

-- Donald Knuth


Wrong About What Matters

• Premature optimization
• Unrepresentative workloads
• Memory pressure
• Load balancing
• Reproducibility of measurements
(workload sketch below)
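For the "unrepresentative workloads" point, a sketch of the difference between the uniform key pattern many benchmarks use and a skewed, production-like one. The heavy-tailed distribution and its parameter are illustrative assumptions; replaying real access logs is better still.

```python
import random

KEYS = [f"key-{i}" for i in range(100_000)]

def uniform_key():
    # What many benchmarks do: every key is equally likely.
    return random.choice(KEYS)

def skewed_key(alpha=1.1):
    # Closer to many production access patterns: a few hot keys get most of the traffic.
    # paretovariate returns a heavy-tailed rank >= 1; clamp it into the key space.
    rank = min(int(random.paretovariate(alpha)), len(KEYS))
    return KEYS[rank - 1]
```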

BECOMING LESS WRONG

User Actions Matter

X > Y for workload Z with trade-offs A, B, and C

- http://www.toomuchcode.org/

Profiling
Code instrumentation
Aggregate over logs
Traces
(instrumentation sketch below)
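A minimal code-instrumentation sketch along the lines this slide lists: time each operation, log it, and aggregate over the logs offline. The logger name and log format are assumptions.

```python
import functools
import logging
import time

log = logging.getLogger("latency")

def timed(op_name):
    """Decorator that logs the wall-clock latency of every call for later aggregation."""
    def wrap(fn):
        @functools.wraps(fn)
        def inner(*args, **kwargs):
            start = time.perf_counter()
            try:
                return fn(*args, **kwargs)
            finally:
                log.info("%s %.6f", op_name, time.perf_counter() - start)
        return inner
    return wrap

# Usage: put @timed("image_fetch") above a handler, then aggregate the "latency" log stream.
```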

Microbenchmarking: Blessing & Curse

+ Quick & cheap
+ Answers narrow questions well
- Often misleading results
- Not representative of the program


[Chart: Choose Your N Wisely, Prof. Saman Amarasinghe, MIT 2009]


Microbenchmarking: Blessing & Curse

• Choose your N wisely
• Measure side effects
• Beware of clock resolution
• Dead code elimination
• Constant work per iteration
(sketch below)
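A sketch of a measurement loop that respects the last three bullets: batch enough iterations that clock resolution is negligible, keep the per-iteration work constant, and consume the result so the work cannot be optimized away (Python itself won't dead-code-eliminate it, but compiled and JIT-compiled runtimes will). The target batch duration is an assumption.

```python
import time

def bench(fn, target_seconds=0.2):
    """Return an estimated per-call time for fn(), which is assumed to return a number.
    Calls are batched so each timed batch is long relative to the clock's resolution."""
    n = 1
    while True:
        sink = 0
        start = time.perf_counter()
        for _ in range(n):
            sink += fn()          # consume the result: guards against dead-code elimination
        elapsed = time.perf_counter() - start
        if elapsed >= target_seconds:
            return elapsed / n    # constant work per iteration, so this average is meaningful
        n *= 2                    # batch was too short to time reliably; double N and retry
```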

Non-Constant Work Per Iteration
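The code that originally appeared on this slide isn't preserved in this extract; as a hypothetical illustration of the pitfall the title names, here is a loop whose per-iteration cost grows as it runs, so dividing total time by the iteration count tells you very little:

```python
# Non-constant work per iteration: each pass re-sums a list that keeps growing,
# so iteration 1000 does roughly 1000x the work of iteration 1.
data = []
for i in range(1000):
    data.append(i)
    total = sum(data)        # O(i) work hiding inside what looks like an O(1) loop body

# Constant work per iteration: keep a running total instead of re-summing.
total = 0
for i in range(1000):
    total += i
```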

Follow-up Material

• How NOT to Measure Latency by Gil Tene – http://www.infoq.com/presentations/latency-pitfalls

• Taming the Long Latency Tail on highscalability.com – http://highscalability.com/blog/2012/3/12/google-taming-the-long-latency-tail-when-more-machines-equal.html

• Performance Analysis Methodology by Brendan Gregg – http://www.brendangregg.com/methodology.html

• Silverman's Mode Detection Method by Matt Adereth – http://adereth.github.io/blog/2014/10/12/silvermans-mode-detection-method-explained/

HAVING FUN WITH RIAK

Setup

• SSD 30 GB
• M3 large
• Riak version 1.4.2-0-g61ac9d8
• Ubuntu 12.04.5 LTS
• 4 byte keys, 10 KB values

[Chart: Get Latency, latency (usec) vs. number of keys, with an L3 marker]
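A hedged sketch of how numbers like the chart above could be gathered. The talk's actual harness isn't shown in this extract; this version goes through Riak's HTTP interface for simplicity, and the host, port, bucket name, and key counts are assumptions.

```python
import time

import requests

RIAK = "http://127.0.0.1:8098"            # Riak HTTP interface; host and port are assumptions
VALUE = b"x" * 10 * 1024                  # 10 KB values, matching the setup slide

def load_and_measure(num_keys, samples=1000):
    keys = [str(i).zfill(4) for i in range(num_keys)]   # 4-byte keys, matching the setup slide
    for k in keys:                                      # load phase
        requests.put(f"{RIAK}/buckets/bench/keys/{k}", data=VALUE,
                     headers={"Content-Type": "application/octet-stream"})
    latencies = []
    for i in range(samples):                            # measurement phase
        k = keys[i % num_keys]
        start = time.perf_counter()
        requests.get(f"{RIAK}/buckets/bench/keys/{k}")
        latencies.append(time.perf_counter() - start)
    return latencies

# Sweep the key count to reproduce a latency-vs-number-of-keys curve:
# results = {n: load_and_measure(n) for n in (1_000, 10_000, 100_000)}
```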

Takeaway #1: Cache

Takeaway #2: Outliers

Takeaway #3: Workload

Benchmarking: You're Doing It Wrong

Aysylu Greenberg @aysylu22
