46
10/17/2006 1 Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems Santa Fe, NM Towards the Incorporation of Dynamic Adaptation into Operating Systems: Adaptive Disk I/O Patricia J. (Pat) Teller Professor, Computer Science The University of Texas at El Paso [email protected]

Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 1

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Towards the Incorporation of Dynamic

Adaptation into Operating Systems:

Adaptive Disk I/OPatricia J. (Pat) Teller

Professor, Computer ScienceThe University of Texas at El Paso

[email protected]

Page 2: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 2

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Outline• Brief Introduction to Research• Adaptive Disk I/O

– How can experiments on small systems help w.r.t. extreme-scale systems?

– What kinds of adaptations are applicable to disk I/O management?

– How can adaptive disk I/O management help in terms of performance and productivity?

• Future Work

Page 3: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 3

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Goal

Enhanced PerformanceEnhanced Performance

Generalized CustomizedCustomizedResource Management

Fixed Dynamically AdaptableDynamically AdaptableOS/Runtime Services

Introduction

Page 4: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 4

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Vision

Small-scale MPs ExtremeExtreme--scale MPsscale MPsCommodity OS Dynamically Dynamically

Adaptable OSAdaptable OS• Currently developing prototypes for small-scale MPs

running Linux• Looking forward to applying DAiSES research to

compute nodes and I/O nodes of extreme-scale systems

Introduction

Dynamic Adaptivity in Support of Extreme Scale

Page 5: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 5

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Vision

Small-scale MPs ExtremeExtreme--scale MPsscale MPsCommodity OS Dynamically Dynamically

Adaptable OSAdaptable OS• Currently developing prototypes for small-scale MPs

running Linux• Looking forward to applying DAiSES research to

compute nodes and I/O nodes of extreme-scale systems– Performance isolation research is applicable to I/O servers– I/O stream throttling research is applicable to improving the

performance of applications with I/O programmed similarly to MADbench

Introduction

Dynamic Adaptivity in Support of Extreme Scale

Page 6: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 6

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

ChallengesDetermining •• WhatWhat to adapt

– policy– parameter value

•• WhenWhen to adapt – under what circumstances– definition of a heuristic, function, or table-driven decision map

•• HowHow to adapt– selection of policy or parameter value – mechanism to affect the adaptation

•• HowHow to measure effectiveness of adaptation– metric

Introduction

Page 7: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 7

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Disk I/Ocause for concern?adaptation target?

From HEC I/O Workshop , August 2005: HPCS I/O and Storage Issues, David Koester (Mitre) and Henry Newman (Instrumental)

Page 8: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 8

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Disk I/O Performance is dependent on …

• I/O request access times: seeks + rotational latencies- I/O request generation pattern- Number and type of concurrent I/O-generating

processes/threads- Disk I/O scheduler: policy and parameters- Application I/O requirements, e.g., latency,

utilization, fairness- I/O controller: order in which requests are

serviced by disk• Data transfer rate• File system

Page 9: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 9

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Disk I/O Performance what can be adapted?

• I/O request access times: seeks + rotational latencies- I/O request generation pattern- Number and type of concurrent I/O-generating

processes/threads- Disk I/O scheduler: policy and parameters- Application data delivery requirements, e.g., latency,

utilization, fairness- I/O controller: order in which requests are

serviced by disk• Data transfer rate• File system

Page 10: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 10

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Disk I/O Performance what can be adapted?

• I/O request access times: seeks + rotational latencies- I/O request generation pattern- Number and type of concurrent I/O-generating

processes/threads- Disk I/O scheduler: policy and parameters- Application data delivery requirements, e.g., latency,

utilization, fairness- I/O controller: order in which requests are

serviced by disk• Data transfer rate• File system

Page 11: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 11

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Space of Data Delivery Requirements and Disk Schedulers

utilization

late

ncy

fairn

ess

Max

Strict

Schedulers in Linux

SCAN LOOK SSTF FCFS

DeadlineEDF

CFQ SFQ YFQ

DCFQ SCAN-EDF

Page 12: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 12

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Adaptation of Disk Schedulers

• Linux 2.6 includes four schedulers (Anticipatory, Deadline, CFQ, and noop) with boot-time and run-time selection

• Match the scheduler to application or system data delivery requirements– Real-time database→Latency guarantees:

Deadline– Multimedia database →Fairness: CFQ– Throughput/disk utilization: SSF

Page 13: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 13

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Disk SchedulingChallenge

Different disk system configurations

Applications with different I/O needs Different disk

schedulers? No silver bullet

Page 14: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 14

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Disk SchedulingCurrent Solution

Different disk system configurations

Applications with different I/O needs Different disk

schedulers? No silver

bullet

Page 15: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 15

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Linux SolutionShortcomings

• Only one scheduler active at a time• Queue draining time• Does not provide

– fair allocation of disk resource among multiple concurrently executing applications with different data delivery requirements

– I/O performance isolation / insulation

Page 16: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 16

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

A Case for Adaptive Disk Scheduling

• Operating systems and storage systems service concurrently executing applications with different data delivery requirements

• There are various I/O schedulers that satisfy these requirements

• No single scheduler is likely to satisfy all the applications’ data delivery requirements

• Server consolidation and virtualization compound this problem

Page 17: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 17

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

New Solution:Virtual I/O Scheduler (VIOS) Subsystem

Scheduler of Disk Schedulers– A queue is associated with each

application– Each application queue is

associated with an instance of an I/O scheduler that can meet its data delivery requirements

– The VIOS distributes I/O service quanta among the scheduler instances (applications) via a round-robin algorithm

VIOS

I/O Scheduler

ApplicationQueues

1 2 M

Storage System

VIOS Subsystem

Requests from Application(s)

System Model and I/O Schedulers

Device Queue

Scheduler 1 Scheduler M

Disk Scheduling

Page 18: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 18

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Fairness and VIOS• How can fairness be achieved?

• What is fairness in disk scheduling?

• What does VIOS achieve by providing fairness?

Disk Scheduling

Page 19: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 19

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Fairness and VIOS• How can fairness be achieved?

– Using quanta as in round-robin process scheduling

• What is fairness in disk scheduling?– Depends on definition of quantum

• Number of requests dispatched• Number of bytes transferred• Amount of allocated disk time

• What does VIOS achieve by providing fairness?

Disk Scheduling

Page 20: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 20

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Fairness and VIOS• How can fairness be achieved?

– Using quanta as in round-robin process scheduling

• What is fairness in disk scheduling?– Depends on definition of quantum

• Number of requests dispatched• Number of bytes transferred• Amount of allocated disk time

• What does VIOS achieve by providing fairness?

Disk Scheduling

Page 21: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 21

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Disk Service Quantum Number of Requests - Example

• Applications essentially only generate I/O read requests

• App 1 and App2 generate a total of n 4KB requests each time they run

• App 3 generates a total of n 512KB requests

• Service is fair in terms of number of requests but it is not fair in terms of disk time allocated

• Unpredictable execution time – it depends on the I/O characteristics (size of requests and seek characteristics) of the application with which it runs

• No performance insulation

29% execution time increase

Disk Scheduling

Page 22: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 22

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Our Solution:Compensating Round Robin w.r.t. Disk Time

VIOS

I/O Scheduler

Storage System

VIOS Subsystem

Requests from Application(s)

System Model and I/O Schedulers

Device Queue

FCFS 1 FCFS M

ApplicationQueues

1 2 M

Disk Scheduling

Provides disk-time fairness• As requests are dispatched

and serviced, request service time is subtracted from the quantum

• If remaining quantum ≤ 0, then – no more requests are

dispatched from the queue and

– the quantum of the next round is shortchanged by the over-compensation of this round

• Unused quantum in a round is not carried forward

Page 23: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 23

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

VIOS SubsystemFairness given Different Request SizesFairness in terms of disk-time

allocation• Each application consists of

eight threads to ensure queues with enough requests to consume quantum

• One application performs 4KB reads and the other performs the same number of 256KB reads

• Since each queue has the same quantum, irrespective of the request sizes, approximately 50% of disk time is allocated to each application

VIOS Disk Access Time Partitioning among Applications with Different Request Sizes

Page 24: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 24

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

VIOS SubsystemFairness given Different Seek Characteristics

Fairness in terms of disk-time allocation

• Each application consists of three threads to ensure queues with enough requests to consume quantum

• One application performs a 1M sector seek for each read while the other performs a 64M sector seek for each read

• Since each queue has the same quantum, irrespective of the seek characteristics, approximately 50% of disk time allocated to each application

VIOS Disk Access Time Partitioning among Applications with Different Seek Characteristics

Page 25: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 25

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Fairness and VIOS

• What is fairness in disk scheduling?– Round-robin scheduling – Quantum: amount of allocated disk time– CFQ-CRR

• What does VIOS achieve by providing fairness, i.e., what are the benefits of providing fairness and are there penalties as well?

Disk Scheduling

Page 26: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 26

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Performance Isolation given Different Request Sizes• Each instance of Application 1

has a fixed 4KB request size• Each instance of Application 2

generates a different fixed request size in the range of 4KB to 512KB

• The execution times of both applications increase due to concurrent disk usage

• Note that the execution time of Application 1 is not affected by the I/O characteristics (request size) of Application 2

VIOS Execution Times of Different Instances of Applications 1 and 2. Application 1 always has a Fixed Request Size, while the Request Size of Application 2 Varies.

executed alone

executed concurrently

Page 27: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 27

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Predictable Performancegiven Varying Seek Characteristics

• Each instance of Program 1– generates only 4KB requests– has a fixed inter-request seek

distance of 1M sectors • Each instance of Program 2,

– generates only 4KB requests– has a fixed inter-request seek

distance in the range of 1M to 64M sectors

• The execution times of both applications increase due to concurrent disk usage

• Note that the execution time of Program 1 is not affected by the I/O characteristics (seek characteristics) of Program 2

VIOS Execution Times of Different Instances of Programs 1 and 2. Program 1 always has a Fixed Inter-request Seek Distance, while that of Program 2 Varies.

executed alone

executed concurrently

Page 28: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 28

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Implicit Benefits of Providing Disk-Time Fairness

• Different applications can be given different quanta, i.e., disk-time allocations ⇒Performance Differentiation

• Disk-time fairness results in• Predictable disk-time allocation

• Given the number of non-empty queues, the inter-service time interval for any queue is known within a specified bound

• Unpredictability hinders disk performance guarantees • Can support QoS

• Disk performance insulation / isolation• The I/O characteristics of one application cannot impact the

disk time allocated to another application• An application cannot monopolize the disk system • Not provided by contemporary disk schedulers

Page 29: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 29

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Adaptive Disk I/O opportunities - 1

• Performance improvements: but what does performance mean?- Improved utilization / decreased execution time /

throughput- Checkpoints: schedule sync and asynch requests

differently: 80% of I/O usage associated with checkpoints (async)

- Match disk I/O scheduler to applicationdata delivery requirements

- Throttle number of concurrent I/O-generating processes / threads

Page 30: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 30

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Adaptive Disk I/O opportunities - 2

• Performance improvements: but what does performance mean?- Provide fairness, performance isolation,

performance predictability- Virtualized I/O- Service guarantees / contracts- Repeatable performance- Load balance in terms of I/O

Page 31: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 31

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Adaptive Disk I/O opportunities - 2

• Performance improvements: but what does performance mean?- Provide fairness, performance isolation,

performance predictability- Virtualized I/O- Service guarantees / contracts- Repeatable performance- Load balance in terms of I/O

Can increase execution time

Page 32: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 32

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Adaptive Disk I/O opportunities - 2

• Performance improvements: but what does performance mean?- Provide fairness, performance isolation,

performance predictability- Virtualized I/O- Service guarantees / contracts- Repeatable performance- Load balance in terms of I/O

Facilitates performance debugging / tuning => PRODUCTIVITY

Page 33: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 33

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Virtualized I/O Performance vs. Productivity

PRODUCTIVITY: application development time (including performance debugging / tuning)

PERFORMANCE: application execution time

PRODUCTIVITY: application development time (including performance debugging / tuning)

PERFORMANCE: application execution time

WITHOUT

WITH

Page 34: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 34

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Virtualized I/OExecution Time vs. Fairness (QoS)

Application execution times of the threads that finished first and last among 32 concurrent threads; 1000 random 4KB requests/threads

Maximum and average latency of requests with different schedulers; each thread accesses disjoint areas of the disk

152

282

136119 125

104

12

5

67

3033

46

0

50

100

150

200

250

300

350

CFQ-CRR(P)

CFQ-CRR CFQ-Linux Deadline Anticipatory Noop

Exec

utio

n Ti

me

(sec

)

Last threadFirst thread

0

500

1000

1500

2000

2500

3000

3500

4000

4500

5000

CFQ-CRR(P) CFQ-CRR CFQ-Linux Deadline Anticipatory Noop

Late

ncy

(ms)

Average Latency

Maximum Latency

2.3%

>5% above 1 second latency

Shared

Virtualized

Fairest but at cost of

execution time

Smallest Minimum Latency

Page 35: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 35

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

MADbench

• dSdC – calculates signal correlation derivatives; writes to files

• invD – Calculates pixel-pixel data correlation matrix D; reads data

• W – Calculate dense matrix-matrix multiplications; reads data

• Each phases goes through several loops

Page 36: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 36

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

MADbench-invDPerformance vs. Productivity

Fair Scheduler (CFQ) Unfair Scheduler- deterministic

1 thread at a time per loop: 6.5 sec. each: 104 sec.

1 loop

Anticipatory Scheduler

Non-deterministric

Could appear as load imbalance – but best performance

Page 37: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 37

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Performance vs. ProductivitySometimes in Conflict

• Algorithms/schedulers are built to improve the performance of the resource

• Designed in isolation– Disk schedulers

• On extreme scale systems– We need schedulers that provide performance

isolation• Separating workloads• Performance analysis – productivity

– But we need schedulers that provide performance• Unfair I/O scheduler on BG/L I/O node makes performance

analysis of compute tasks that much difficult, e.g., load imbalance

Page 38: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 38

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Software on Extreme Scale

• Predictable systems improve productivity• Opportunistic / unpredictable systems may

improve performance• Adaptation is not just for improved

performance but it must be for improved productivity as well!

• Systems must be virtualized– Fault tolerance– Performance analysis– Quality of service

Page 39: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 39

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Future Related Work - 1• Using VIOS framework, provide both

fairness and latency guarantees• Apply fairness, performance isolation, and

performance predictability to parallel I/O systems

• Apply the framework to storage systems that support parallel access

• For quality of service, weights can be defined. Consider how these weights can be used in large systems with thousands of processors and multiple applications.

Page 40: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 40

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Future Related Work - 2• Further consider adaptation w.r.t. stateful vs.

stateless resources• Identify other ways that resource management

adaptivity can be applied successfully– Virtual memory management– Small/large page allocation– Muti-core scheduling

• Consider how fairness and performance insulation / isolation can be guaranteed in other parts of the system– Lightweight operating systems - CPU performance

Page 41: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 41

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Acknowledgments

• DOE, Office of Science (Grant # DE-FG02-04ER25622)

• IBM Corporation for IBM Shared University Research (SUR) Grants and IBM Faculty Awards

• UTEP and UT System for STAR (Science and Technology Acquisition and Retention) Program Award

• Seetharami Seelam, Ph.D.

Page 42: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 42

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Refereed Publications - 1• Seetharami Seelam and Patricia Teller, “Fairness and Performance

Isolation: an Analysis of Disk Scheduling Algorithms,” submitted to Workshop on High Performance I/O Techniques and Deployment of Very Large Scale I/O Systems (HiperIO’06), in conjunction with the IEEE International Conference on Cluster Computing, September 25-27, 2006.

• Seetharami Seelam, Jayaraman Suresh Babu, and Patricia Teller, “Performance Analysis of Disk Scheduling Algorithms for Asynchronous Requests,” to appear in Proceedings of the 3rd

International Conference on Quantitative Evaluation of SysTems (Qest ’06), Riverside, CA, September 11-14, 2006.

• Patricia Teller and Seetharami, Seelam, “Insights into Providing Dynamic Adaptation of Operating System Policies,” ACM Operating Systems Review, 40:2: 83-89, April 2006.

Page 43: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 43

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Refereed Publications - 2

• Seetharami Seelam, Jayaraman Suresh Babu, and Patricia Teller, “Automatic I/O Scheduler Selection for Latency and Bandwidth Optimization,” Proceedings of the Workshop on Operating System Interference in High Performance Applications – OSIHPA, Saint Louis, Missouri, 17 September 2005.

• Seetharami Seelam, Rodrigo Romero, Patricia Teller, and William Buros, “Enhancements to Linux I/O Scheduling,” Proceedings of the 2005 Linux Symposium, Ottawa, Canada, 20-23 July 2005.

Page 44: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 44

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Theses, Dissertations, Technical Reports

• Ricardo Portillo, Fine-grain Dynamic Adaptation of the Linux 2.6 Virtual Memory Manager: a First Step, Master’s Thesis, University of Texas-El Paso, Computer Science, July 2006.

• Seetharami Seelam, Towards Dynamic I/O Scheduling in Commodity Operating Systems, Ph.D. Dissertation, University of Texas-El Paso, Computer Engineering, May 2006.

• Jayaraman Suresh Babu, Coarse-Grain Dynamic Adaptation for Asynchronous I/O Scheduling: Is it needed?, Master’s Thesis, University of Texas-El Paso, Computer Science, May 2006.

• Seetharami Seelam and Patricia Teller, “Disk Scheduling Using Fair Queuing and Round-Robin: Fairness Analysis,” Technical Report, University of Texas-El Paso, Computer Science, December 2005.

• Rodrigo Romero, Seetharami Seelam, and Patricia Teller, “Workload Dependent Performance Analysis of Process Schedulers: A Case Study,” Technical Report, University of Texas-El Paso, Computer Science, November 2005.

Page 45: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 45

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Publications/Patents in Review or Preparation

• Seetharami Seelam and Patricia Teller, “CFQ-CRR: an I/O Scheduler that Guarantees Fairness, Performance Isolation, and Performance Predictability.”

• Seetharami Seelam, Andre Kerstens, and Patricia Teller, “ I/O Throttling to Improve the Performance of a Cosmology Application”.

• Seetharami Seelam, Sarala Arunagiri, and Patricia Teller, “VIOS2: a Scheduler of I/O Schedulers that Provides Explicit Latency Guarantees.”

• Seetharami Seelam and Patricia Teller, “Disk Scheduling with Performance Objectives.”

• Seetharami Seelam and Patricia Teller, “Disk Scheduling for Predictable Performance Behavior: Completely Fair Queuing and Compensating Round-Robin.”

• Seetharami Seelam and Patricia Teller, “CFQ-CRR (P): an I/O Scheduler for Meeting Performance Objectives.”

Page 46: Incorporation of Dynamic Adaptation into Operating Systems ...estrabd/LACSI2006/workshops/workshop3/… · Dynamic Adaptivity in Support of Extreme Scale 2006 LACSI Symposium Workshop

10/17/2006 46

Dynamic Adaptivity in Support of Extreme Scale

2006 LACSI Symposium Workshop on Performance and Productivity of Extreme-Scale Parallel Systems

Santa Fe, NM

Questions?

http://research.utep.edu/daises

[email protected]