13
IQ switch scheduling Pag. 1 Input Queued routers/switches QoS Issues in Telecommunication Networks - 1 Andrea Bianco – TNG group - Politecnico di Torino Andrea Bianco Telecommunication Network Group [email protected] http://www.telematica.polito.it/ The Internet is a mesh of routers QoS Issues in Telecommunication Networks - 2 Andrea Bianco – TNG group - Politecnico di Torino core router access router enterprise router The Internet is a mesh of routers Access router: high number of ports at low speed (kbps/Mbps) several access protocols (modem, ADSL, cable) Enterprise router: medium number of ports at high speed (Mbps) QoS Issues in Telecommunication Networks - 3 Andrea Bianco – TNG group - Politecnico di Torino medium number of ports at high speed (Mbps) several services (IP classification, filtering) Core router: moderate number of ports at very high speed (Mbps/Gbps) very high throughput Faster and faster Need for high performance routers – to accommodate the bandwidth demands for new users and new services – to support QoS QoS Issues in Telecommunication Networks - 4 Andrea Bianco – TNG group - Politecnico di Torino to reduce costs Moore’s law (electronic packet processing power) is too slow with respect to the increase in link speed The bottleneck is memory speed Single packet processing The time to process one packet is becoming shorter and shorter Worst case: 40-Byte packets (ACKs) • 3.2 µs at 100 Mbps QoS Issues in Telecommunication Networks - 5 Andrea Bianco – TNG group - Politecnico di Torino 320 ns at 1 Gps 32 ns at 10 Gps 3.2 ns at 100 Gbps 320 ps at 1Tbps LC LC CP IP IP CP OP OP Hardware architecture QoS Issues in Telecommunication Networks - 6 Andrea Bianco – TNG group - Politecnico di Torino S F LC LC LC S F IP IP IP OP OP OP physical structure logical structure

IQ switch scheduling - Telecommunication Networks Group · PDF fileInput Queued routers/switches ... TNG group - Politecnico di Torino QoS Issues in Telecommunication Networks - 1

  • Upload
    vuphuc

  • View
    215

  • Download
    1

Embed Size (px)

Citation preview

IQ switch scheduling

Pag. 1

Input Queued routers/switches

QoS Issues in Telecommunication Networks - 1Andrea Bianco – TNG group - Politecnico di Torino

Andrea BiancoTelecommunication Network Group

[email protected]://www.telematica.polito.it/

The Internet is a mesh of routers

QoS Issues in Telecommunication Networks - 2Andrea Bianco – TNG group - Politecnico di Torino

core router

access router

enterprise router

The Internet is a mesh of routers• Access router:

– high number of ports at low speed (kbps/Mbps)– several access protocols (modem, ADSL, cable)

• Enterprise router:medium number of ports at high speed (Mbps)

QoS Issues in Telecommunication Networks - 3Andrea Bianco – TNG group - Politecnico di Torino

– medium number of ports at high speed (Mbps)– several services (IP classification, filtering)

• Core router:– moderate number of ports at very high speed

(Mbps/Gbps)– very high throughput

Faster and faster• Need for high performance routers

– to accommodate the bandwidth demands for new users and new services

– to support QoS

QoS Issues in Telecommunication Networks - 4Andrea Bianco – TNG group - Politecnico di Torino

– to reduce costs• Moore’s law (electronic packet processing

power) is too slow with respect to the increase in link speed

• The bottleneck is memory speed

Single packet processing

• The time to process one packet is becoming shorter and shorter

• Worst case: 40-Byte packets (ACKs)• 3.2 µs at 100 Mbps

QoS Issues in Telecommunication Networks - 5Andrea Bianco – TNG group - Politecnico di Torino

• 320 ns at 1 Gps• 32 ns at 10 Gps• 3.2 ns at 100 Gbps• 320 ps at 1Tbps

LC

LC

CP

IP

IP

CP

OP

OP

Hardware architecture

QoS Issues in Telecommunication Networks - 6Andrea Bianco – TNG group - Politecnico di Torino

S FLC

LC

LC

S F IP

IP

IP

OP

OP

OP

physical structure logical structure

IQ switch scheduling

Pag. 2

Hardware architecture: main elements

• Line cards– support input/output transmissions– store packets– adapt packets to the internal format of the switching fabric– support data link protocols

QoS Issues in Telecommunication Networks - 7Andrea Bianco – TNG group - Politecnico di Torino

support data link protocols– classify packets– schedule packets– support security

• Switching fabric– transfers packets from input ports to output ports

Hardware architecture: main elements

• Control processor/network processor– runs routing protocols– computes routing tables– manages the overall system

QoS Issues in Telecommunication Networks - 8Andrea Bianco – TNG group - Politecnico di Torino

• Forwarding engines– compute the packet destination (lookup)– inspect packet headers– rewrite packet headers

control

processor

forwarding

engine

forwarding

engine

Interconnections among main elements - I

QoS Issues in Telecommunication Networks - 9Andrea Bianco – TNG group - Politecnico di Torino

switching fabric

line card line card1 N

Interconnections among main elements - II

control

processor

QoS Issues in Telecommunication Networks - 10Andrea Bianco – TNG group - Politecnico di Torino

switching fabric

line card &forwarding engine

1

line card &forwarding engine

N

Cell switch (fabric) ORM1

ORM

1

ISM

ISM

packets cells cells packets

Cell-based synchronous routers

QoS Issues in Telecommunication Networks - 11Andrea Bianco – TNG group - Politecnico di Torino

ORMN

N

ISM

• ISM: Input-Segmentation Module

• ORM: Output-Reassembly Module

• Packet: variable-size data unit

• Cell: fixed-size data unit

Switching fabric• Our assumptions:

– Bufferless• to reduce internal hardware complexity

– Non-blocking

QoS Issues in Telecommunication Networks - 12Andrea Bianco – TNG group - Politecnico di Torino

• it is always possible to transfer in parallel from input to output ports any non-conflicting set of cells

IQ switch scheduling

Pag. 3

Switching fabric• Examples:

– Buses– Shared memory – Crossbar – Multi-stage

1234

1 2 3 4

inpu

ts

QoS Issues in Telecommunication Networks - 13Andrea Bianco – TNG group - Politecnico di Torino

• rearrangeable Clos network • Benes network • Batcher-Banyan network (self-routing)

– Buffered cross-bars (not considered)• Switching constraints

– at most one cell for each input and for each output can be transferred

1 2 3 4outputs

Generic switching architecture

Output 1

switching fabricInput 1 Sin Sout

QoS Issues in Telecommunication Networks - 14Andrea Bianco – TNG group - Politecnico di Torino

Input N Output NSinSout

input queues output queues

Speedup• The speedup (increase in speed with respect

to line speed) determines switch performance:– Sin = reading speed from input queues– Sout = writing speed to output queues

QoS Issues in Telecommunication Networks - 15Andrea Bianco – TNG group - Politecnico di Torino

• The speedup is also a technological constraint

• Maximum speedup factor:– S = max(Sin, Sout)

Switches with queues at outputs• OQ (Output Queued)• The switching fabric is able

to transfer to any output all cells received in one time slot

• 100% throughput

1 1

λ λ =Nλ

Nλi

QoS Issues in Telecommunication Networks - 16Andrea Bianco – TNG group - Politecnico di Torino

g p• Optimal average delay• Speedup N with respect to

line speed is required in switching fabric speed and in output port memory access

NxNN N

λi λo=Nλi

Switches with queues at inputs• IQ (Input Queued) • Advantages:

– Switching fabrics and memories less costly

• No speedup required in

1 1

QoS Issues in Telecommunication Networks - 17Andrea Bianco – TNG group - Politecnico di Torino

p p qthe switching fabric

• Memory access speed equal to line speed

• Speedup=1

– Only viable solution for very high speed devices

NxNN N

1 1

IQ switches• Two problems:

– Define scheduling algorithms to avoid output contention

• Select in each time slot data to be transmitted from inputs to outputs

QoS Issues in Telecommunication Networks - 18Andrea Bianco – TNG group - Politecnico di Torino

NxNN N

to outputs • Constraint: in each time slot

– At most 1 data from each input (no speeedup in reading)

– At most 1 data to each output (no speedup in writing because no memory is available)

IQ switch scheduling

Pag. 4

IQ switches• Two problems:

– Memory architecture:• If using FIFO queues, HoL

(Head of the Line) blocking• If choosing cyan packet from

queue N the red packet in

1 1

QoS Issues in Telecommunication Networks - 19Andrea Bianco – TNG group - Politecnico di Torino

queue N, the red packet in queue 1 is blocked by the cyan packet which is at the head of queue 1 even if red output port N is free

• For uniform unicast traffic throughput limited to 58.6%

NxNN N

Memory architecture

• To avoid HoL blocking phenomenum, a more complex memory architecture is needed

QoS Issues in Telecommunication Networks - 20Andrea Bianco – TNG group - Politecnico di Torino

• Two possible solutions:– p-window queueing– VOQ (Virtual Output Queueing)

p=3

1 1

Memory architecture• p-window queueing:

– p is the window size– The first p cells of each

queue are considered for scheduling

QoS Issues in Telecommunication Networks - 21Andrea Bianco – TNG group - Politecnico di Torino

NxNN N

– Higher complexity• Scheduler deals with pN

cells• Non FIFO queues

– HoL blocking reduced, completely eliminated only for P→∞

1 1

1

N

Memory architecture• VOQ (virtual output

queueing)– At each input data are

stored in separate queues according to

QoS Issues in Telecommunication Networks - 22Andrea Bianco – TNG group - Politecnico di Torino

NxNN N

1

N

data destination (N queues for each input)

– N2 queues in total– Eliminates HoL blocking

and it permits to achieve 100% throughput with a proper scheduling algorithm

Scheduling problem modelling• Problem to be solved

by the scheduling can be represented using a bipartite graph

• An edge i→j exist if there is at least a data at input i willing to reach output j

• Edges may be weighted

QoS Issues in Telecommunication Networks - 23Andrea Bianco – TNG group - Politecnico di Torino

1

2

3

N

1

2

3

N

• Weight– Binary– Queue length– HoL cell age or waiting

time– Other metrics

Scheduling problem modeling• The scheduling algorithm tries to determine, in each

time slot, a matching over the bipartite graphs.• Select at most N edges with constraints

– At most one edge for each inputAt most one edge for each output

QoS Issues in Telecommunication Networks - 24Andrea Bianco – TNG group - Politecnico di Torino

1

2

3

N

1

2

3

N

1

2

3

N

1

2

3

N

Graph G Matching M

– At most one edge for each output

IQ switch scheduling

Pag. 5

• Another possible representation is based on a (Request Matrix RM), which stores the information related to data transfer request

I 0: no data from input to output

Scheduling problem modeling

QoS Issues in Telecommunication Networks - 25Andrea Bianco – TNG group - Politecnico di Torino

• Matching it is a permutation matrix i.e., a matrix such that the row and column sum is at most equal to 1

INPUT

OUTPUT

1: at least one data from inputto output (may be weighted)

Scheduling in IQ switches• Request Matrix

3 5 0 02 0 0 4

• Permutation matrix

0 1 0 00 0 0 1

QoS Issues in Telecommunication Networks - 26Andrea Bianco – TNG group - Politecnico di Torino

4 5 0 00 0 8 2

1 0 0 00 0 1 0

Traffic description• Aij(n) = 1 if a packet arrives at time n at input i,

with destination reachable through output j• λij = E[Aij(n)]• An arrival process is admissible if:

QoS Issues in Telecommunication Networks - 27Andrea Bianco – TNG group - Politecnico di Torino

p– ∑i λij < 1– ∑j λij < 1

• no input and no output are overloaded on average• OQ switches exhibit finite delays only for admissible traffic

• Traffic matrix: Λ = [λij ]

Traffic scenarios• Uniform traffic

– Bernoulli i.i.d. arrivals– usual testbed in the

literature• “easy to schedule” ⎥

⎥⎥⎥

⎢⎢⎢⎢

1111111111111111

QoS Issues in Telecommunication Networks - 28Andrea Bianco – TNG group - Politecnico di Torino

• Diagonal traffic– Bernoulli i.i.d arrivals– critical to schedule, since

only two matchings are good

⎥⎥⎥⎥

⎢⎢⎢⎢

2001120001200012

⎦⎣ 1111

Traffic scenarios• LogDiagonal traffic

– Bernoulli i.i.d. arrivals– more critical than

uniform, less than di l t ffi

⎥⎥⎥⎥

⎢⎢⎢⎢

−=Λ

8124481224811248

12 N

ρ

QoS Issues in Telecommunication Networks - 29Andrea Bianco – TNG group - Politecnico di Torino

diagonal traffic ⎦⎣

Scheduling policies: objective• Let us consider a NxN IQ• Denote by i the input port index and by j the output port

index• Goal: assuming infinite buffer size, transfer any admissible

traffic pattern with no losses• Solutions are known

QoS Issues in Telecommunication Networks - 30Andrea Bianco – TNG group - Politecnico di Torino

– If traffic pattern is known in advance• TDM of Birkhoff von Neumann algorithm

– For admissible unknown traffic patter• Maximum Weight Matching• Maximum Size Matching

• Several heuristics are proposed for unknown traffic pattern– iSLIP, iLQF, iOCF, 2DRR (WFA), MUCS, RPA, and many others

IQ switch scheduling

Pag. 6

Scheduling uniform known traffic

• A number of algorithms give 100% throughput when traffic is uniform – For example:

• TDM and a few variantsiSLIP ( l t )

QoS Issues in Telecommunication Networks - 31Andrea Bianco – TNG group - Politecnico di Torino

• iSLIP (see later)

Example of a TDM schedule for a 4x4 switch

Birkhoff - von Neumann theorem• Any doubly stochastic matrix Λ can be

expressed as convex combination of permutation matrices πn:

Λ ∑

QoS Issues in Telecommunication Networks - 32Andrea Bianco – TNG group - Politecnico di Torino

Λ = ∑n an πn

• with– an≥0– ∑n an =1

Scheduling non-uniform known traffic

• Thanks to the Birkhoff - von Neumann theorem

• If the traffic is known and admissible, 100% thro ghp t can be achie ed b a TDM

QoS Issues in Telecommunication Networks - 33Andrea Bianco – TNG group - Politecnico di Torino

throughput can be achieved by a TDM scheme using:– for a fraction of time a1 matching M1 (π1)– for a fraction of time a2 matching M2 (π2)– for a fraction of time ak matching Mk (π3)

MSM: Maximum Size Matching• MSM maximizes the number of data

transferred in a single time slot, i.e. select the maximum number of edges– Instantaneous throughput maximization.

• Asymptotic computational complexity is

QoS Issues in Telecommunication Networks - 34Andrea Bianco – TNG group - Politecnico di Torino

• Asymptotic computational complexity is O(N2.5)

• Non optimal algorithm– Some admissible traffic pattern cannot be

scheduled, i.e. it does not always achieve 100% throughput.

MSM: exampleO1 O2 O3

I1 λ 0 0I2 0 λ 0

I3 λ λ 0 λ=0.5

I1

I2

I3

O1

O2

O3

QoS Issues in Telecommunication Networks - 35Andrea Bianco – TNG group - Politecnico di Torino

I1

I2

I3

O1

O2

O3

I1

I2

I3

O1

O2

O3

I1

I2

I3

O1

O2

O3

Three MSM are possible in overload, i.e. when all queues are full

When the first matching is chosen, the maximum throughput cannot be achieved

MWM: Maximum Weight Matching• A weight is associated with each edge• The MWM, among all possible N! matchings,

selects the one with the highest weight (sum of edge metrics)

QoS Issues in Telecommunication Networks - 36Andrea Bianco – TNG group - Politecnico di Torino

I1

I2

I3

O1

O2

O3

I1

I2

I3

O1

O2

O3

I1

I2

I3

O1

O2

O3

1

20

1

1

MWMMSM

IQ switch scheduling

Pag. 7

MWM: Maximum Weight Matching• MWM does not maximize instantaneous throughput (worse

than MSM)• It was demonstrated that a MWM algorithm

– in IQ switches with VOQ architecture– under admissible traffic

with infinite queue size

QoS Issues in Telecommunication Networks - 37Andrea Bianco – TNG group - Politecnico di Torino

– with infinite queue size– when using as weight either the queue length or the age of the HoL

data

achieves100% throughput • Asymptotic computational complexity O(N3)• With finite queue size, it behaves similarly to MSM• Problems with delays and possible starvation

MSM algorithm• Start from the bipartite graph• Add a source and a sink node, respectively

connected to all inputs and all outputs• Execute a flow maximization algorithm

QoS Issues in Telecommunication Networks - 38Andrea Bianco – TNG group - Politecnico di Torino

g– Flow augmentation path

I1

I2

I3

O1

O2

O3

source sink

MSM algorithm• Operates on the residual capacity graph• Select at random a path connecting source

to sink

Fl 1

QoS Issues in Telecommunication Networks - 39Andrea Bianco – TNG group - Politecnico di Torino

• Residue graph:

source sink Flow 1

source sink

MSM algorithm

source sink Flow 2

QoS Issues in Telecommunication Networks - 40Andrea Bianco – TNG group - Politecnico di Torino

• Residue graph:

source sink

MSM algorithm

R id h

source sink Flow 3

QoS Issues in Telecommunication Networks - 41Andrea Bianco – TNG group - Politecnico di Torino

• Residue graph:

• Algorithm terminates when there are no more admissible paths among source and sink

source sink

MSM algorithm• Final solution obtained by summing the flows

obtained ineach step

source sink

QoS Issues in Telecommunication Networks - 42Andrea Bianco – TNG group - Politecnico di Torino

• Algorithms never backtracks to modify a choice

• Problems: – Successive iterations cannot be made parallell– Not suitable to hardware implementation

IQ switch scheduling

Pag. 8

Uniform traffic• MWM and MSM behave almost identically

100Uniform Traffic

MWM MSM

QoS Issues in Telecommunication Networks - 43Andrea Bianco – TNG group - Politecnico di Torino

1

10

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0

Mea

n de

lay

Normalized Load

Diagonal traffic• MSM yields much longer delays than MWM at medium/high loads

100

1000Diagonal Traffic

MWM MSM

QoS Issues in Telecommunication Networks - 44Andrea Bianco – TNG group - Politecnico di Torino

1

10

100

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0

Mea

n de

lay

Normalized Load

Practical solutions• Need to define heuristics

– with reasonable complexity– that can be implemented in hardware

• Any scheduling algorithm defines three aspects:A method to compute the weights to be associated with

QoS Issues in Telecommunication Networks - 45Andrea Bianco – TNG group - Politecnico di Torino

– A method to compute the weights to be associated with each edge (metric):

• Approximate MSM– Binary (queue occupancy)

• Approximate MWM– Queue length (it is an indication of the fact that the queue, which

is associated with an input/output pair, is suffering) – Age of HoL data– Interface load– Ad hoc metrics to select critical edges/nodes

Practical solutions– ...– A heuristic algorithm to determine a matching– A contention resolution algorithm among edges

with the same metric:ro nd robin (initial choice state dependent)

QoS Issues in Telecommunication Networks - 46Andrea Bianco – TNG group - Politecnico di Torino

• round-robin (initial choice state dependent)• sequential search (initial choice non state dependent)• random

i-SLIP• Iterative algorithm• It defines a heuristic algorithm which

determines, with a proper number of iterations, a maximal size matching (i.e., a matching that

b f h d d i h h d

QoS Issues in Telecommunication Networks - 47Andrea Bianco – TNG group - Politecnico di Torino

cannot be further extended with other edges selection)

• Metric is the queue occupancy• To solve contentions, it exploits an arbiter for

each input and for each output

i-SLIP• In each iteration, three phases can be

identified:– Request: each unmatched input sends a request

to every output for which it has a cell– Grant: each unmatched output that has received

QoS Issues in Telecommunication Networks - 48Andrea Bianco – TNG group - Politecnico di Torino

requests, sends a grant to one of the requesting inputs.

• Contentions solved by a round robin mechanism.– Accept: if an unmatched input receives grants, it

selects an output and becomes matched to it • Contentions solved by a round robin mechanism

IQ switch scheduling

Pag. 9

i-SLIP: counters• Each input (output) has a pointer associated with

to solve contentions• The output pointer is incremented, modulo N, by

one unit beyond the index of the input to which the grant was issued

QoS Issues in Telecommunication Networks - 49Andrea Bianco – TNG group - Politecnico di Torino

the grant was issued • The input pointer is incremented, modulo N, by

one unit beyond the index of the output from which an accept was received

i-SLIP: properties• For uniform overload the bipartite graph is

a full mesh the higher the number of alternatives, the easier is to determine a good matching. iSLlP degenerates to a TDM scheme ES:

QoS Issues in Telecommunication Networks - 50Andrea Bianco – TNG group - Politecnico di Torino

scheme ES:

I2

I1

I3

I4

O2

O1

O3

O4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

• All outputs grant input 1

• Input 1 accepts output 1

i-SLIP: properties

I2

I1

I3

I

O2

O1

O3

O

• IT 1

For one iteration unsatisfactory

QoS Issues in Telecommunication Networks - 51Andrea Bianco – TNG group - Politecnico di Torino

I4 O4

I4 O4

I2

I1

I3

O2

O1

O3

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

• IT 2

I2

I1

I3

I4

O2

O1

O3

O4

1,2,3,4 1,2,3,4

i-SLIP: properties• At the end:

I2

I1

O2

O1

1 2 3 4 1 2 3 4

• IT N

I2

I1

O2

O11,2,3,41,2,3,4

QoS Issues in Telecommunication Networks - 52Andrea Bianco – TNG group - Politecnico di Torino

• A maximum (implies maximal) matching is obtained after N iterations

I4 O4

I2I3

O2

O3

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

I2

I3

I4

O2

O3

O4

i-SLIP: propertiesIn the following steps, pointers are staggered one iteration is enough to obtain a maximum matching

I2

I1

I

O2

O1

O

1,2,3,4

1,2,3,4

1 2 3 4

1,2,3,4

1,2,3,4

1 2 3 4

• Step 2

I2

I1

I

O2

O1

O

QoS Issues in Telecommunication Networks - 53Andrea Bianco – TNG group - Politecnico di Torino

I3I4

O3

O4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

I3I4

O3

O4

I2

I1

I3I4

O2

O1

O3

O4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

• Step 3

I2

I1

I3I4

O2

O1

O3

O4

1,2,3,4

1,2,3,4

1,2,3,4

1,2,3,4

i-SLIP: properties• Each iteration has a computational complexity of O(N2),

but it can be easily made parallel• Worst case in one iteration: 1 edge is selected• When executing N iterations, the matching is maximal

(depends on the choice made but cannot be extended) O( 3)

QoS Issues in Telecommunication Networks - 54Andrea Bianco – TNG group - Politecnico di Torino

however, the computational complexity is O(N3)• Experimental results show that log2N iterations are in

general enough to obtain good performance• Performance drops if pointers are badly synchronized• iSLIP was implemented on a single chip in the Cisco

12000 router

IQ switch scheduling

Pag. 10

iSLIP: extensions• Use the same heuristic algorithm (3-phase) but with

different metrics– Queue length iLQF– HoL cell age iOCF

• Input send requests containing the weight • Contentions are solved using the weight first only

QoS Issues in Telecommunication Networks - 55Andrea Bianco – TNG group - Politecnico di Torino

• Contentions are solved using the weight first, only for equal weights the choice is random

• Does not exploit pointer synchronization to obtain good performance, rather the edge weight

• OCF has better delay properties (never starves data), but the increase in complexity is significant and makes the algorithm practically unfeasible

2DRR: Two Dimensional Round Robin

• Operates on the request matrix• Extension of the WFA (Wave Front Arbiter), very

easily implementable in hardware• Definitions

– Generalized diagonal is a set of N elements of a matrix NxN such that two elements do not belong to the same

QoS Issues in Telecommunication Networks - 56Andrea Bianco – TNG group - Politecnico di Torino

NxN such that two elements do not belong to the same row or column

• A set of N diagonal is said to be covering if each element of the matrix belongs to one and only one diagonal

• In each time slot, the algorithm goes through N iterations

2DRR: Two Dimensional Round Robin

• At the beginning, all links (input-output connections) are enabled

• At each iteration, a given generalized diagonal is chosen– Only enabled links may be selected if the are covered by

th l t b l i t th h di l

QoS Issues in Telecommunication Networks - 57Andrea Bianco – TNG group - Politecnico di Torino

the elements belonging to the chosen diagonal • If a link from input i to output j is selected, all

requests issued by i or sent to j are disabled for the current time slot (cannot be chosen in the matching)

• In N iterations, all N generalized diagonal are considered and the request matrix is fully covered

2DRR: example

1 0 1 10 1 0 11 1 0 10 0 1 0 0100

101110101101

IT 1

0100101110101101

IT 2

RM=

QoS Issues in Telecommunication Networks - 58Andrea Bianco – TNG group - Politecnico di Torino

0 0 1 0

0100100000100001

0100

0100101110101101

IT 3

0100IT 4

0100101110101101

2DRR: Two Dimensional Round Robin

• At each time slot, a different covering set of generalized diagonal is chosen, to improve fairness– Indeed, edges covered to the first diagonal chosen are

more likely selected– Round robin over different sets of covering diagonal and

QoS Issues in Telecommunication Networks - 59Andrea Bianco – TNG group - Politecnico di Torino

g ground robin on each element in the set

• Emulates a MSM• Not easy to extend to other metrics• Asymptotic computational complexity O(N2)

MUCS: Matrics Unit Cell Scheduler• Information on queue status are concentrated

on a request matrix A (NxN)• Starting from A, the weight matrix W is

computed

QoS Issues in Telecommunication Networks - 60Andrea Bianco – TNG group - Politecnico di Torino

• The matching is computed on the basis of W• Elements aij of A are non-negative integers

that indicate the priority to be assigned to the edge connecting input i to output j

IQ switch scheduling

Pag. 11

MUCS: Matrics Unit Cell Scheduler

• 3 possible values can be associated with each aij:– QoS priority: aij is inversely proportional to the

minimum Maximum Cell Transfer Delay in queue storing packets from i to j

QoS Issues in Telecommunication Networks - 61Andrea Bianco – TNG group - Politecnico di Torino

storing packets from i to j– Binary priority: aij=1 if there is at least a packet in

the queue storing packets from i to j– Queue length priority: aij is the occupancy of the

queueu storing packets from i to j• Weights wij are computed starting from aij

MUCS: Matrics Unit Cell Scheduler• At step n, the reduced request matrix An is defined,

whose size Mn is decreasing (Mn+1< Mn)• At each iteration, the following steps are executed:

– Compute the weight matrix from matrix An:

QoS Issues in Telecommunication Networks - 62Andrea Bianco – TNG group - Politecnico di Torino

⎪⎩

⎪⎨

⎧ ≠+

otherwise

aifwca

wra

w ijij

ij

ij

ij

ij

0

0

∑=

=nM

jijij awr

1∑=

=nM

iijij awc

1

MUCS: Matrics Unit Cell Scheduler– Select the edge i→j associated with the largest value of wij

Once edges are selected, the reduced matrix An+1 is computed, taking into account all elements in A belonging to rows and columns not selected in previous iterations

– the algorithm ends when all elements in A were id d

QoS Issues in Telecommunication Networks - 63Andrea Bianco – TNG group - Politecnico di Torino

considered• It is a greedy algorithm• Asymptotic computational complexity is O(N3)• Good performance, but implemented using analog

components

RPA: Reservation with Preemption and Acknowledgment

• It is a heuristic algorithm based on two phases:– Reservation (possibly preemptive) – Acknowledgment

• Inputs keep an urgency metric Xij associated with each queue

• Inputs operate on a reservation vector RES which stores reservations made by inputs in each time slot

QoS Issues in Telecommunication Networks - 64Andrea Bianco – TNG group - Politecnico di Torino

reservations made by inputs in each time slot• Reservation vector RES has N elements; each element

refers to a different output port and contains– Urgency– Input port index

• Initially, at each round: Urgj = 0, ∀ j

Urg1,In1 Urg2,In2 Urg3,In3 UrgN,InN

Out 1 Out 2 Out 3 Out N

RES

RPA: reservation phase• Input ports access vector RES sequentially, according to a

pre-defined order• The first input makes a greedy selection, choosing the output

corresponding to the maximum urgency • The input following in the pre-defined order chooses the

output that maximizes the difference among queue urgency and RES vector urgencies

QoS Issues in Telecommunication Networks - 65Andrea Bianco – TNG group - Politecnico di Torino

and RES vector urgencies– Input i computes Wj = Xij – Urgj for all j– finds j* such that Wj* = max{ Wj }– if Wj* > 0,

⇒ reserve output j* and set Urgj*=Xij*, possibly overwriting the previous reservation

– It may pre-empt an already existing request if this choice maximizes the global urgency (may not select its maximum urgency)

• The reservation phase ends when all ports made their reservation

RPA: acknowledgment phase• Reservation vector RES is again processed by all

inputs, in the same order followed during the reservation phase

• Each input checks if its reservation was cancelled :– If NOT: input gets access to the reserved output

QoS Issues in Telecommunication Networks - 66Andrea Bianco – TNG group - Politecnico di Torino

– If NOT: input gets access to the reserved output– If YES: the input cannot gain access to the previously

reserved output; however, it may select another output among available (not reserved) outputs. Note that, since the number of inputs is normally equal to the number of outputs, if some reservations were cancelled, some outputs are surely available

IQ switch scheduling

Pag. 12

RPA features• Good performance

– Sum of the weights selected is at least half of the weights transferred by MWM

• Different urgency metrics may be chosen, making the algorithm very flexible

S i i i l i l l b f d

QoS Issues in Telecommunication Networks - 67Andrea Bianco – TNG group - Politecnico di Torino

– Strict priority among multiple classes may be enforced– Standard urgency is queue length

• No iterations are required• Asymptotic computational complexity O(N2)• Cannot be made parallel, since a sequential access

is fundamental

Uniform traffic• Comparison between MWM, iSLIP, iLQF, RPA

1000Uniform Traffic

MWMiSLIP iLQF RPA

QoS Issues in Telecommunication Networks - 68Andrea Bianco – TNG group - Politecnico di Torino

1

10

100

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0

Mea

n de

lay

Normalized Load

LogDiagonal traffic• iSLIP saturates close to 84% throughput

10000

100000LogDiagonal Traffic

MWMiSLIP iLQFRPA

QoS Issues in Telecommunication Networks - 69Andrea Bianco – TNG group - Politecnico di Torino

1

10

100

1000

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0

Mea

n de

lay

Normalized Load

Diagonal traffic• RPA achieves 98% throughput, iLQF 87%, iSLIP 83%

10000

100000Diagonal Traffic

MWMiSLIP iLQF RPA

QoS Issues in Telecommunication Networks - 70Andrea Bianco – TNG group - Politecnico di Torino

1

10

100

1000

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0

Mea

n de

lay

Normalized Load

Issues in IQ switches• Signalling:

– Signalling bandwidth required to transfer weights from inputs to the controller may be significant with respect to the available bandwidth in the switching fabric

– The more complex the adopted metric, the larger the

QoS Issues in Telecommunication Networks - 71Andrea Bianco – TNG group - Politecnico di Torino

signalling bandwidth required– Differential signalling may be adopted

• Multiple classes:– Given K classes, first the VOQ architecture must be

extended, by using KN queues at each input– Scheduling algorithms must be extended to support

priorities

Issues in IQ switches• QoS (fair queueing):

– Scheduling for QoS (need to serve the most urgent packet) has a difficult interaction with the scheduling to transfer data from inputs to outputs

– Need to balance performance and fairness

QoS Issues in Telecommunication Networks - 72Andrea Bianco – TNG group - Politecnico di Torino

– No ideal optimal solution known• Frame scheduling:

– Operate on a frame of length F slot, and compute a schedule on the frame and not on a slot by slot basis

– Scheduling algorithm executes only at frame boundaries– Relatively easy to provide QoS guarantees for each input-

output pair– Delay increases at low loads

IQ switch scheduling

Pag. 13

Issues in IQ switches• Variable packet length support

– May introduce packet scheduling instead of cell scheduling

• Packets transferred as trains of cells– An edge is selected when the first cell of a packet arrives

QoS Issues in Telecommunication Networks - 73Andrea Bianco – TNG group - Politecnico di Torino

– An edge is selected when the first cell of a packet arrives and is kept in all the following matchings until the last cell of the packet is transferred

– It avoids reassembly machines at outputs– Same throughput guarantees– Packet delay may be larger or shorter

• Depends on packet length distribution

Issues in IQ switches• Multicast:

– 2N possible different multicast flows– May be treated as unicast through input port replication

(often named multicopy) • At each input a number of copies equal to the packet fanout are

created, for the proper outputs, and inserted in the proper VOQ– Speedup required

QoS Issues in Telecommunication Networks - 74Andrea Bianco – TNG group - Politecnico di Torino

Speedup required– Increases the instantaneous input load– May lead to low throuhgput (unable to sustain a single broadcast

flow)– Scheduling for multicast must be defined to exploit

switching fabric multicast capabilities• Balance fanout splitting and no-fanout splitting• Often a single FIFO for multicast is proposed (HoL blocking, less

critical with respect to unicast)• Critical traffic patterns when few inputs are active

Example of a critical traffic pattern

QoS Issues in Telecommunication Networks - 75Andrea Bianco – TNG group - Politecnico di Torino

Issues in IQ switches– MC-VOQ architecture

• 2N separate queues at each input• Best possible solution (no HoL blocking)

– An optimal scheduling was defined (only theoretically)– Implies re-enqueueing and out-of-sequence

• However, admissible traffic pattern exist that cannot be

QoS Issues in Telecommunication Networks - 76Andrea Bianco – TNG group - Politecnico di Torino

, pscheduled in an IQ switch regardless of the queue architecture and of the scheduling algorithm

• Scalability problem– Number of queues– Scheduling algorithm

– Manage a finite number of queues• CIOQ switches (a moderate speedup helps a lot)

ReferencesKarol M., Hluchyj M., Morgan S., ``Input versus output queueing on a space division switch'', IEEE Transactions on Communications, vol.35, n.12, Dec.1987McKeown N., Anantharam V., Walrand J.,``Achieving 100% throughput in an input-queued switch'',IEEE INFOCOM'96, vol.1, San Francisco, CA, Mar.1996, pp.296-302McKeown N.,``iSLIP: a scheduling algorithm for input-queued switches'', IEEE Transactions on Networking, vol.7, n.2, Apr.1999, pp.188-201Tamir Y., Chi H.-C., ``Symmetric crossbar arbiters for VLSI communication switches'', IEEE Transaction on Parallel and Distributed Systems, vol.4, no.1, Jan.1993, pp.13 –27Anderson T., Owicki S., Saxe J., Thacker C.,``High speed switch scheduling for local area networks'', ACM Transactions on Computer Systems, vol.11, n.4, Nov.1993 LaMaire R.O., Serpanos D.N., ``Two dimensional round-robin schedulers for packet switches with multiple input queues'', IEEE/ACM Transaction on Networking, vol.2, n.5, Oct.1994, p.471-482Duan H., Lockwood J.W., Kang S.M., Will J.D., ``A high performance OC12/OC48 queue design prototype for input buffered ATM switches'', IEEE INFOCOM'97, vol.1, 1997, pp.20-8, Los Alamitos, CAAjmone Marsan M., Bianco A., Leonardi E., Milia L., ``RPA: a flexible scheduling algorithm for input buffered switches'', IEEE Transactions on Communications vol 47 n 12 Dec 1999 pp 1921 1933

QoS Issues in Telecommunication Networks - 77Andrea Bianco – TNG group - Politecnico di Torino

Communications, vol.47, n.12, Dec.1999, pp.1921-1933 Ajmone Marsan M., Bianco A., Giaccone P., Leonardi E., Neri F., ``Packet scheduling in input-queued cell-based switches'', IEEE Trans. on Networking, Oct.2002

• Nong G., Hamdi M., ``On the provision of integrated QoS guarantees of unicast and multicast traffic in input-queued switches'', IEEE GLOBECOM'99, vol.3, 1999

• Hayes J.F., Breault R., Mehmet-Ali M.K., ``Performance analysis of a multicast switch'', IEEE Transactions on Communications, vol.39, n.4, Apr.1991, pp.581-587

• Kim C.K., Lee T.T., ``Call scheduling algorithm in multicast switching systems'', IEEE Transactions on Communications, vol.40, n.3, Mar.1992, pp.625-635

• Prabhakar B., McKeown N., Ahuja R., ``Multicast scheduling for input-queued switches'', IEEE Journal on Selected Areas in Communications, vol.15, n.5, Jun.1997, pp.855-866

• Andrews M., Khanna S., Kumaran K., ``Integrated scheduling of unicast and multicast traffic in an input-queued switch'', IEEE INFOCOM'99, vol.3, New York, NY, 1999, pp.1144-1151

• Liu Z., Righter R., ``Scheduling multicast input-queued switches'', Journal of Scheduling, John Wiley & Sons, May 1999• Ajmone Marsan M., Bianco A., Giaccone P., Leonardi E., Neri F., ``On the throughput of input-queued cell-based switches with multicast traffic'', IEEE

INFOCOM'01, Anchorage Alaska, Apr.2001• Smiljanic A., “Flexible bandwidth allocation in high-capacity packet switches”, IEEE/ACM Transactions on Networking, vol.10, n.2, pp.287-293,

Apr.2002 Chang C.S., Lee D.S., Jou Y.S., ``Load balanced Birkhoff-von Neumann switches'', 2001 IEEE HPSR, 2001, pp.276-280.