
Cloud Control with Distributed Rate Limiting

Barath Raghavan, Kashi Vishwanath, Sriram Ramabhadran, Kenneth Yocum, and Alex C. Snoeren

University of California, San Diego

Centralized network services

• Hosting with a single physical presence
– However, clients are across the Internet

Running on a cloud

• Resources and clients are across the world

• Services combine these distributed resources

[Figure: distributed resources combined behind a 1 Gbps aggregate limit]

Key challenge

We want to control distributed resources as if they were centralized

Ideal: Emulate a single limiter

• Make distributed feel centralized
– Packets should experience the same limiter behavior

[Figure: flows from sources (S) to destinations (D) pass through separate limiters, each annotated with 0 ms, emulating a single shared limiter]

Distributed Rate Limiting (DRL)

Achieve functionally equivalent behavior to a central limiter

Three algorithms:

1. Global Token Bucket: packet-level (general)
2. Global Random Drop: packet-level (general)
3. Flow Proportional Share: flow-level (TCP-specific)

Distributed Rate Limiting tradeoffs

Accuracy (how close the delivered rate is to K Mbps; flow rate fairness)
+ Responsiveness (how quickly demand shifts are accommodated)
vs.
Communication efficiency (how much and how often rate limiters must communicate)

DRL Architecture

[Figure: Limiters 1–4 exchange local demand estimates via gossip. At each limiter, an estimate-interval timer triggers estimating local demand, gossiping it, and computing global demand to set the local allocation; each packet arrival triggers enforcement of the limit.]
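A minimal sketch of the per-limiter loop in this architecture, written in Python. The class, method, and variable names are illustrative (not the system's actual interfaces), and the demand-proportional allocation at the end is only a placeholder for the GTB/GRD/FPS policies described on the following slides.

    import threading
    from collections import defaultdict

    class DRLLimiter:
        """Sketch of one limiter node in the DRL architecture."""

        def __init__(self, node_id, global_limit_bps, estimate_interval_s=0.5):
            self.node_id = node_id
            self.global_limit_bps = global_limit_bps      # aggregate limit K
            self.estimate_interval_s = estimate_interval_s
            self.bytes_this_interval = 0                  # local arrivals since last estimate
            self.demands = defaultdict(float)             # node_id -> demand (bits/sec)
            self.local_rate_bps = global_limit_bps        # current local allocation
            self.lock = threading.Lock()

        def on_packet(self, size_bytes):
            """Packet arrival: count local demand; enforcement is a local token
            bucket refilled at self.local_rate_bps (sketched on the next slide)."""
            with self.lock:
                self.bytes_this_interval += size_bytes

        def gossip_in(self, peer_id, peer_demand_bps):
            """Receive a peer's demand estimate (gossip transport not shown)."""
            self.demands[peer_id] = peer_demand_bps

        def on_estimate_timer(self):
            """Estimate-interval timer: estimate local demand, fold in gossiped
            demands, and set the local allocation from the global demand."""
            with self.lock:
                local = 8 * self.bytes_this_interval / self.estimate_interval_s
                self.bytes_this_interval = 0
            self.demands[self.node_id] = local
            global_demand = sum(self.demands.values())
            if global_demand > 0:
                # Placeholder policy: allocate in proportion to reported demand.
                self.local_rate_bps = self.global_limit_bps * local / global_demand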

Token Buckets

[Figure: a token bucket with fill rate K Mbps; each arriving packet consumes tokens.]

Building a Global Token Bucket

[Figure: Limiter 1 and Limiter 2 exchange demand info (bytes/sec) so that each local bucket reflects global packet arrivals.]
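A minimal token-bucket sketch to make the enforcement step concrete; the fill rate corresponds to the K Mbps limit above, and the class and parameter names are illustrative.

    import time

    class TokenBucket:
        """Tokens accrue at fill_rate_bps; a forwarded packet spends tokens
        equal to its size in bits, up to a burst of bucket_depth_bits."""

        def __init__(self, fill_rate_bps, bucket_depth_bits):
            self.fill_rate_bps = fill_rate_bps          # e.g. K Mbps in bits/sec
            self.bucket_depth_bits = bucket_depth_bits  # bucket depth (burst size)
            self.tokens = bucket_depth_bits
            self.last_refill = time.monotonic()

        def allow(self, packet_bytes):
            """Return True to forward the packet, False to drop (or queue) it."""
            now = time.monotonic()
            elapsed = now - self.last_refill
            self.tokens = min(self.bucket_depth_bits,
                              self.tokens + elapsed * self.fill_rate_bps)
            self.last_refill = now
            needed = 8 * packet_bytes
            if self.tokens >= needed:
                self.tokens -= needed
                return True
            return False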

Baseline experiment

[Setup: Limiter 1 serves 3 TCP flows (S → D); Limiter 2 serves 7 TCP flows (S → D). Reference: a single token bucket serving all 10 TCP flows.]

Global Token Bucket (GTB)

[Plot: flow rates under a single token bucket vs. the global token bucket (3 + 7 TCP flows across the two limiters vs. 10 TCP flows centrally), 50 ms estimate interval.]

Problem: GTB requires near-instantaneous arrival info

Global Random Drop (GRD)

Limiters send their local arrival rates and collect global rate info from one another.

Case 1: global arrival rate (4 Mbps) is below the limit (5 Mbps), so forward the packet

Global Random Drop (GRD)

Case 2: global arrival rate (6 Mbps) is above the limit (5 Mbps), so drop each packet with probability

    Excess / Global arrival rate = 1 Mbps / 6 Mbps = 1/6

The same drop probability is used at all limiters.
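A sketch of the GRD forwarding decision described above: when the estimated global arrival rate exceeds the global limit, every limiter drops with the same probability, excess divided by the global arrival rate. The function and parameter names are illustrative.

    import random

    def grd_forward(global_limit_bps, global_arrival_bps):
        """Return True to forward the packet, False to drop it."""
        if global_arrival_bps <= global_limit_bps:
            return True                                # Case 1: below the global limit
        excess = global_arrival_bps - global_limit_bps
        drop_prob = excess / global_arrival_bps        # Case 2: same probability everywhere
        return random.random() >= drop_prob

    # Slide example: 5 Mbps limit, 6 Mbps global arrival rate
    # -> drop probability = 1/6 at every limiter.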

GRD in baseline experiment (50 ms estimate interval)

[Plot: single token bucket vs. global random drop for the 3-flow / 7-flow baseline.]

GRD delivers flow behavior similar to a central limiter

GRD with flow join (50 ms estimate interval)

[Plot: Flow 1 joins at limiter 1, Flow 2 joins at limiter 2, Flow 3 joins at limiter 3.]

Flow Proportional Share (FPS)

[Setup: Limiter 1 serves 3 TCP flows (S → D); Limiter 2 serves 7 TCP flows (S → D).]

Flow Proportional Share (FPS)

Goal: provide inter-flow fairness for TCP flows

[Figure: Limiter 1 reports "3 flows" and Limiter 2 reports "7 flows"; each enforces its allocation with a local token bucket.]

Estimating TCP demand

[Setup: Limiter 1 serves 1 TCP flow (S → D); Limiter 2 serves 3 TCP flows plus 1 more TCP flow (S → D).]

Estimating TCP demand

Local token rate (limit) = 10 Mbps

Flow A = 5 Mbps

Flow B = 5 Mbps

Flow count = 2 flows

Estimating TCP demand

Key insight: Use a TCP flow's rate to infer demand

Estimating skewed TCP demand

Local token rate (limit) = 10 Mbps
Flow A = 2 Mbps (bottlenecked elsewhere)
Flow B = 8 Mbps

Flow count ≠ demand

Flow weight = Local Limit / Largest Flow's Rate = 10 / 8 = 1.25 flows

Flow Proportional Share (FPS)

Global limit = 10 Mbps
Limiter 1 = 1.25 flows, Limiter 2 = 2.50 flows

Set local token rate = Global limit x local flow count / Total flow count
                     = 10 Mbps x 1.25 / (1.25 + 2.50)
                     = 3.33 Mbps
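A sketch of the FPS computation on the preceding slides: a limiter's flow weight is its local limit divided by its largest flow's rate (so bottlenecked flows count as fractional flows), and the local token rate is the global limit scaled by the limiter's share of the total weight. The function names are illustrative.

    def fps_weight(local_limit_mbps, flow_rates_mbps):
        """Estimate a limiter's demand in 'flows' from its observed flow rates."""
        if not flow_rates_mbps:
            return 0.0
        # An unbottlenecked flow grows toward its fair share, so the largest
        # flow's rate indicates how many such flows the local limit supports.
        return local_limit_mbps / max(flow_rates_mbps)

    def fps_local_rate(global_limit_mbps, local_weight, all_weights):
        """Local token rate = global limit x local flow count / total flow count."""
        total = sum(all_weights)
        return global_limit_mbps * local_weight / total if total else global_limit_mbps

    # Slide example: flows of 2 and 8 Mbps under a 10 Mbps local limit
    w1 = fps_weight(10.0, [2.0, 8.0])             # 10 / 8 = 1.25 flows
    w2 = 2.50                                     # limiter 2's weight from the slide
    print(fps_local_rate(10.0, w1, [w1, w2]))     # 10 x 1.25 / 3.75 = 3.33 Mbps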

Under-utilized limiters

[Figure: a limiter whose TCP flows cannot use its full local allocation leaves wasted rate.]

Set the local limit equal to actual usage (the limiter returns to full utilization)

Flow Proportional Share (FPS) (500 ms estimate interval)

Additional issues

• What if a limiter has no flows and one arrives?

• What about bottlenecked traffic?
• What about varied-RTT flows?
• What about short-lived vs. long-lived flows?

• Experimental evaluation in the paper
– Evaluated on a testbed and over PlanetLab

Cloud control on PlanetLab

• Apache Web servers on 10 PlanetLab nodes
• 5 Mbps aggregate limit
• Shift load over time from 10 nodes to 4 nodes


Static rate limiting

[Plot: demands at 10 Apache servers on PlanetLab; when demand shifts to just 4 nodes, static limiting leaves wasted capacity. FPS (top) vs. static limiting (bottom).]

Conclusions

• Protocol-agnostic limiting (at extra cost)
– Requires shorter estimate intervals
• Fine-grained packet arrival info not required
– For TCP, flow-level granularity is sufficient
• Many avenues left to explore
– Inter-service limits, other resources (e.g. CPU)

Questions!
