36
Automated and Scalable QoS Control - For Network Convergence Wonho Kim (Princeton Univ.) Puneet Sharma, Jeongkeun Lee, Sujata Banerjee, Jean Tourrilhes, Sung-Ju Lee, and Praveen Yalagandula (HP Labs) 1

Automated and Scalable QoS Control

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Automated and Scalable QoS Control

Automated and Scalable QoS Control- For Network Convergence

Wonho Kim (Princeton Univ.) Puneet Sharma, Jeongkeun Lee, Sujata Banerjee, Jean Tourrilhes, Sung-Ju Lee, and Praveen Yalagandula (HP Labs)

1

Page 2: Automated and Scalable QoS Control

Motivation

• Why do we care about QoS control?

− Network convergence

− Multi-tenancy networks

Automated QoS control is needed• Automated QoS control is needed

2 29 April 2010

Page 3: Automated and Scalable QoS Control

Network convergence

Map-

ReduceStorage

Fibre ChannelEthernet

33 29 April 2010

HPC …

Different protocols, adapters, switches, and configuration

Infiniband …

Page 4: Automated and Scalable QoS Control

Network convergence

Map-

ReduceStorage

44 29 April 2010

• Cost reduction (single network infrastructure)

• Unified management (single management infrastructure)

HPC …Converged Enhanced Ethernet (CEE)

Data Center Ethernet (DCE)

Data Center Bridging (DCB)

Fibre Channel over Ethernet (FCoE)

Fibre Channel over CEE (FCoCEE)

Fewer switches, ports, adapters, cables

Reduced power, equipment, cooling cost

Simpler topology

I/O consolidation

Unified resource management

Page 5: Automated and Scalable QoS Control

Multi-tenancy networksCustomer A

Customer C

• Serve multiple customers with a single fabric

• Better utilization of network infrastructure5 29 April 2010

Customer B

Page 6: Automated and Scalable QoS Control

Map-

ReduceStorage

Performance isolation

HPC …

66 29 April 2010

Page 7: Automated and Scalable QoS Control

Map-

ReduceStorage

Performance isolation

HPC …

77 29 April 2010

Page 8: Automated and Scalable QoS Control

Performance isolation

Map-

ReduceStorage

88 29 April 2010

HPC …

Page 9: Automated and Scalable QoS Control

Performance isolation

Map-

ReduceStorage

Customer A Customer C

Virtualized Servers

Variable Workloads

99 29 April 2010

• Need virtual network slices

• Need fine-grained performance isolation

HPC …

Customer B

Bugs, malicious attack

Page 10: Automated and Scalable QoS Control

Goal

• Enables performance isolation with QoS control

Slice 1

Slice n

10 29 April 2010

Physical network fabric

QoS configuration

High-level

Specifications

Page 11: Automated and Scalable QoS Control

Good news

• Most commodity switches have QoS knobs

− rate limiter

− priority queues

− schedulers

• Single network domain

− datacenters, enterprise networks, …

− free from Layer-8 issues (billing, collaborations, …)

− fine-grained control becomes feasible

11 29 April 2010

Page 12: Automated and Scalable QoS Control

Challenges

• Coarse-grained QoS knobs

− designed for distributed management

− class-based

− no e2e performance

• Manual configuration

− no standards for classifiers

− error-prone

− static (not adaptive)

12 29 April 2010

Storage

Traffic

CEO/CTO

Managers

Interns

VoIP

Fibre Channel

Email

Web

Page 13: Automated and Scalable QoS Control

Our Solution: OpenFlow QoS Controller

Physical network fabric

Virtual Slice 1

Virtual Slice n

13 29 April 2010

QoS configuration

High-level

Specifications

(rate limiter, priority queues, …)

Manual & Static setting

QoS Controller• Automated configuration

• Fine-grained flow management

• Adaptive to dynamic workloads

Problem

Page 14: Automated and Scalable QoS Control

Overview of OpenFlow QoS controller

OpenFlow APIs

FlowSrc Dest

prio prio prio prio

rate limit

1414 29 April 2010

QoS Controller

Topology, Nodes,

Resource, Flows, …

Rate limiters

Queue mappings

End-to-End

performance models

Adaptive

Aggregator

Network-wide

Optimization

Shortest Span First…

Page 15: Automated and Scalable QoS Control

Adaptive aggregation

Slice specs Flow specs

flow 1

flow 2

flow 3

Storage

Video

Peak rate: 400 MbpsDelay bound: 10 msAggregate: False

Peak rate: 100 Mbps

Customer DB src IP: X.X.X.X, port: 9551Type: IP (UDP), …

15 29 April 2010

flow 3

flow 4

Video

Backup

Peak rate: 100 MbpsDelay bound: 100 msAggregate: False

Aggregate: True

Employee DB

VoD

System backup

Log Archive

flow 5

flow 6

flow 7

flow 8

Page 16: Automated and Scalable QoS Control

Available QoS Knobs (Priority queue)

highest

Output port

short delay

16 29 April 2010

lowestlong delay

Page 17: Automated and Scalable QoS Control

Available QoS Knobs (Rate limiter)

highest

Output port

Peak rate

17 29 April 2010

lowest

Page 18: Automated and Scalable QoS Control

OpenFlow QoS APIs

Rate limiter

Flow 1

Flow 2

Flow 3

Flow 4

H/W rate limiter 1

H/W rate limiter 2

1818 29 April 2010

Priorityqueue mapping

Switch 1 Switch 2 Switch 3

• Extension of OpenFlow specification

• Expose QoS capability in switches

Page 19: Automated and Scalable QoS Control

OpenFlow QoS APIs

• With OpenFlow flow control

− fine-grained control of flows

− automated flow management

• With OpenFlow QoS APIs

− uniform control of QoS knobs

− configure QoS for individual (or aggregate) flows

19 29 April 2010

Page 20: Automated and Scalable QoS Control

Admission Control

• Input

− new flow arrival event

− performance requirements (peak rate, e2e delay)

− database for the current network state

− end-to-end performance model− end-to-end performance model

• Output

− admission control result (accept/reject)

− priority queue assignment, rate limiter settings

− path selection

20 29 April 2010

Page 21: Automated and Scalable QoS Control

Admission Control

• Two conditions should be satisfied

− satisfy f ’s performance requirement

− not violate existing flows in the networks

21 29 April 2010

Page 22: Automated and Scalable QoS Control

Difficulties in queue assignment

highest

(8/10)(10/10)f2

(8/10)(10/10)f1

22 29 April 2010

lowest

fSwitch A Switch B Switch C

We should consider interactions between• flows in a switch

• flows in multiple switches

Page 23: Automated and Scalable QoS Control

Admission control heuristic

• Goal

− increase the ratio of admitted flows

− lower the complexities in queue allocation

Shortest Span First (SSF)• Shortest Span First (SSF)

• Basic ideas

− estimate affordable options for a flow

− try first switches more likely to reject flow

23 29 April 2010

Page 24: Automated and Scalable QoS Control

Highest level & Lowest level

high

Shorter delay bound

More likely to violate other flows

f

Highest level for f

• Highest level: not violate existing flows

• Lowest level: not violate the new flow

• Span: available options for f24 29 April 2010

lowSwitch

Longer delay bound

More likely to violate flow fLowest level for f

Page 25: Automated and Scalable QoS Control

Shortest Span First (SSF)

high

src destf

• Step 1: compute highest & lowest levels independently

25 29 April 2010

lowSwitch A Switch B Switch C

Page 26: Automated and Scalable QoS Control

Shortest Span First (SSF)

high

src destf

• Step 2: sort switches in order of the span

26 29 April 2010

lowSwitch A Switch B Switch C

Page 27: Automated and Scalable QoS Control

Shortest Span First (SSF)

high

src destf

• Step 2: sort switches in order of the span

27 29 April 2010

lowSwitch C Switch A Switch B

Page 28: Automated and Scalable QoS Control

Shortest Span First (SSF)

high

src destf

• Step 3: try highest level at each hop

− try first a switch more likely to reject flow

28 29 April 2010

lowSwitch C Switch A Switch B

Page 29: Automated and Scalable QoS Control

Implementataion

• QoS APIs implemented on

− hardware switch (HP ProCurve 5406zl)

− software switch (Open vSwitch)

• QoS Controller implemented on top of NOX

− open-source OpenFlow controller

− http://noxrepo.org

• QoS Controller web interface

29 29 April 2010

Page 30: Automated and Scalable QoS Control

Prototype

Switch A

Host A Host BQoS

Controller

HTTP

OpenFlow

OpenFlow

OpenFlow

Output 14

Priority 7

30 29 April 2010

Host C Host D

Switch B Switch COutput 21

Priority 7

Peak rate 400 MbpsDelay bound 10 ms 400

Page 31: Automated and Scalable QoS Control

Evaluation

• Traffic generation

− generate 3 guaranteed flows from emulated services (UDP)

− generate cross traffic (UDP, TCP)

• Disable/Enable QoS controller

• Measured throughput and packet loss in testbeds

31 29 April 2010

Page 32: Automated and Scalable QoS Control

Throughput with UDP cross traffic

Generate cross traffic

Flow name Route (queue assignment)

32 29 April 2010

Enable QoS Controller

Flow name Route (queue assignment)

Customer DB H3 – S3(8) – S1(8) – H1

Employee DB H4 – S3(8) – S1(8) – H2

VoD H3 – S3(7) – S1(7) – H1

System Backup H4 – S3(1) – S1(1) – H2

QoS controller protects guaranteed flows in congestion

Page 33: Automated and Scalable QoS Control

Packet loss with TCP cross traffic

10 TCP

30 TCP

QoS control is needed even when most traffic in network is TCP

33 29 April 2010

1 TCP

Enable QoS Controller

Page 34: Automated and Scalable QoS Control

Future works

• Evaluations

− effectiveness of admission control heuristics(ratio of admitted flows)

− compare with offline optimal assignment

− simulations on a variety of datacenter networks (e.g., − simulations on a variety of datacenter networks (e.g., Hierarchical, FatTree, …)

• Deployment

− extend deployment to large networks

− test with mixture of services

34 29 April 2010

Page 35: Automated and Scalable QoS Control

Conclusion

• Single integrated network fabric is desirable

• We need fine-grained automated QoS control

• Contributions

− Design & Implement OpenFlow QoS APIs

− QoS controller: automated QoS control for network slicing

35 29 April 2010

Page 36: Automated and Scalable QoS Control

Thank you

36 29 April 2010