DP-FAIR: A Simple Model for Understanding Optimal Multiprocessor Scheduling


Greg Levin† Shelby Funk‡ Caitlin Sadowski†

Ian Pye† Scott Brandt†

†University of California, Santa Cruz ‡University of Georgia, Athens

Backstory

NQ-Fair by Shelby Funk (rejected from RTSS)

SNS by Greg, Caitlin, Ian, and Scott (rejected from RTSS)

DP-Fair

Multiprocessors

Most high-end computers today have multiple processors

In a busy computational environment, multiple processes compete for processor time

More processors means more scheduling complexity

Real-Time Multiprocessor Scheduling

Real-time tasks have workload deadlines

Hard real-time = “Meet all deadlines!”

Problem: Scheduling periodic, hard real-time tasks on multiprocessor systems. This is difficult.


Taxonomy of Multiprocessor Scheduling Algorithms

Uniprocessor algorithms: EDF, LLF

Partitioned algorithms: Partitioned EDF

Globalized uniprocessor algorithms: Global EDF

Dedicated global algorithms, including the optimal algorithms: pfair, LLREF, EKG

Partitioned Schedulers ≠ Optimal

Example: 2 processors; 3 tasks, each with 2 units of work required every 3 time units

No partition works: any processor assigned two of these tasks would need to complete 4 units of work every 3 time units

Global Schedulers Succeed

Example: 2 processors; 3 tasks, each with 2 units of work required every 3 time units

Task 3 migrates between processors

Problem Model

n tasks running on m processors

A periodic task T = (p, e) requires a workload e be completed within each period of length p

T's utilization u = e / p is the fraction of each period that the task must execute
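To keep the model concrete, here is a minimal sketch of the task model in Python (the Task class and its field names are illustrative, not from the paper):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Task:
    p: float  # period: a new job is released every p time units
    e: float  # workload: execution time required per period

    @property
    def utilization(self) -> float:
        # u = e / p: the fraction of each period the task must execute
        return self.e / self.p
```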

[Figure: a task's timeline. Jobs are released at times 0, p, 2p, 3p, ...; each job must complete its workload e within its period p, i.e. by the next job release, which is its deadline.]

Fluid Rate Curve

[Figure: work completed vs. time between a job release and its deadline. The fluid rate curve rises with slope u = e / p over the period p; the actual work curve has slope 0 (not running) or 1 (running).]

Feasible Work Region

[Figure: work completed vs. time between a job release and its deadline. The feasible work region is bounded above by the CPU-rate line (slope 1) and below by the zero-laxity line; a job's work-completed curve must stay between them to finish workload e by the deadline.]

Assumptions

Processor Identity: all processors are equivalent

Task Independence: tasks are independent

Task Unity: tasks run on one processor at a time

Task Migration: tasks may run on different processors at different times

No Overhead: context switches and migrations are free (in practice, their costs are built into WCET estimates)

The Big Goal (Version 1)

Design an optimal scheduling algorithm for periodic task sets on multiprocessors

A task set is feasible if there exists a schedule that meets all deadlines

A scheduling algorithm is optimal if it can always schedule any feasible task set

Necessary and Sufficient Conditions

Any set of tasks needing at most

1) one processor for each task ( ui ≤ 1 for all i ), and

2) m processors for all tasks ( Σi ui ≤ m )

is feasible

Proof: small scheduling intervals can approximate a task's fluid rate curve

Status: Solved. pfair (1996) was the first optimal algorithm
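These two conditions are straightforward to check. A minimal sketch in Python, reusing the illustrative Task class from the problem-model sketch above:

```python
def feasible(tasks: list["Task"], m: int) -> bool:
    """Necessary and sufficient feasibility test for periodic tasks:
    each task fits on one processor (u_i <= 1), and the whole set
    fits on m processors (sum of u_i <= m)."""
    us = [t.utilization for t in tasks]
    return all(u <= 1 for u in us) and sum(us) <= m
```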

The Big Goal (Version 2)

Design an optimal scheduling algorithm with fewer context switches and migrations

( Finding a feasible schedule with the fewest migrations is NP-Complete )

Status: Solved. BUT the solutions are complex and confusing

Our Contributions: a simple, unifying theory for optimal global multiprocessor scheduling and a simple optimal algorithm

Greedy Algorithms Fail on Multiprocessors

A greedy scheduling algorithm will regularly select the m “best” jobs and run those, e.g. Earliest Deadline First (EDF) or Least Laxity First (LLF)

EDF and LLF are optimal on a single processor; neither is optimal on a multiprocessor

Greedy approaches generally fail on multiprocessors
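To make the greedy choice concrete: at each scheduling point, global EDF simply runs the m pending jobs with the earliest deadlines. A minimal sketch (Python; the Job record is illustrative):

```python
from dataclasses import dataclass

@dataclass
class Job:
    deadline: float   # absolute deadline
    remaining: float  # work left to execute

def global_edf_pick(jobs: list[Job], m: int) -> list[Job]:
    # Greedy choice: the m pending jobs with the earliest deadlines.
    pending = [j for j in jobs if j.remaining > 0]
    return sorted(pending, key=lambda j: j.deadline)[:m]
```

The counterexample below shows why this locally reasonable choice can be globally wrong.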

Why Greedy Algorithms Fail on Multiprocessors

Example: n = 3 tasks, m = 2 processors

At time t = 0, Tasks 1 and 2 are the obvious greedy choice

Even at t = 8, Tasks 1 and 2 are the only “reasonable” greedy choice

Yet if Task 3 isn’t started by t = 8, the resultant idle time eventually causes a deadline miss

How can we “see” this critical event at t = 8?

Proportioned Algorithms Succeed on Multiprocessors

Subdivide Task 3 with a period of 10

Now Task 3 has a zero-laxity event at time t = 8

Proportional Fairness

Insight: scheduling is easier when all jobs have the same deadline

Application: apply all deadlines to all jobs

Assign workloads proportional to utilization

Work completed matches the fluid rate curve at every system deadline
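The deadline-partitioning step is simple to state in code: collect every job deadline and use consecutive deadlines as slice boundaries. A minimal sketch (Python; assumes integer periods, so the hyperperiod is their LCM):

```python
from math import lcm

def slice_boundaries(periods: list[int]) -> list[int]:
    """All system deadlines up to the hyperperiod, in order.
    Each pair of consecutive entries bounds one time slice."""
    hyper = lcm(*periods)
    deadlines = {k * p for p in periods for k in range(1, hyper // p + 1)}
    return [0] + sorted(deadlines)

# Example: periods 3 and 4 give slices bounded by [0, 3, 4, 6, 8, 9, 12]
print(slice_boundaries([3, 4]))
```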

Proportional Fairness Is the Key

All known optimal algorithms enforce proportional fairness at all deadlines:

pfair (1996) - Baruah, Cohen, Plaxton, and Varvel ( proportional fairness at all times )

BF (2003) - Zhu, Mossé, and Melhem

LLREF (2006) - Cho, Ravindran, Jensen

EKG (2006) - Andersson, Tovar

Why do they all use proportional fairness?

Scheduling Multiple Tasks is Complicated

[Figure: work completed vs. time for several tasks whose jobs have different deadlines]

Scheduling Multiple Tasks with the Same Deadline is Easy

[Figure: work completed vs. time for several tasks sharing one deadline]

Actual Feasible Regions

[Figure: each job's feasible region (work completed vs. time), bounded by its own job deadlines]

Restricted Feasible Regions Under Deadline Partitioning

[Figure: the same regions restricted so that work completed matches the fluid rate curve at all system deadlines]

The DP-Fair Scheduling Policy

Partition time into slices based on all system deadlines

Allocate each job a per-slice workload equal to its utilization times the length of the slice

Schedule jobs within each slice in any way that obeys the following three rules (a code sketch follows):

1. Always run a job with zero local laxity

2. Never run a job with no workload remaining in the slice

3. Do not voluntarily allow more idle processor time than ( m − Σi ui ) × ( length of slice )
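A minimal per-slice sketch of these rules (Python; illustrative names, not the paper's pseudocode). Allocation gives each job u_i times the slice length of local work; the decision function then prefers zero-local-laxity jobs, skips finished jobs, and otherwise fills processors, which is one simple way to respect the idle-time bound of Rule 3:

```python
from dataclasses import dataclass

EPS = 1e-9

@dataclass
class LocalJob:
    remaining: float  # local workload left in this slice

def allocate(utilizations: list[float], slice_len: float) -> list[LocalJob]:
    # DP-Fair allocation: each job gets u_i * (length of slice) local work.
    return [LocalJob(remaining=u * slice_len) for u in utilizations]

def pick_jobs(jobs: list[LocalJob], m: int, time_left: float) -> list[LocalJob]:
    """One scheduling decision inside a slice."""
    # Rule 2: never run a job with no local workload remaining.
    runnable = [j for j in jobs if j.remaining > EPS]
    # Rule 1: a job whose remaining work equals the time left in the
    # slice has zero local laxity and must run now.
    urgent = [j for j in runnable if j.remaining >= time_left - EPS]
    slack = [j for j in runnable if j.remaining < time_left - EPS]
    # For a feasible set, DP-Fair guarantees len(urgent) <= m.
    # Rule 3: filling all m processors whenever work exists trivially
    # stays within the allowable idle time.
    return (urgent + slack)[:m]
```

The decision holds until the next event (a job hitting zero local laxity, finishing its local work, or the slice ending), at which point pick_jobs runs again.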

DP-Fair Work Allocation

[Figure: within a time slice, each job's allocated workload grows along a line whose slope equals its utilization]

DP-Fair Scheduling Rule #1

When a job hits zero local laxity, run it to completion

[Figure: work completed vs. time in a slice; a job's curve meets the zero-local-laxity line and then follows it]

DP-Fair Scheduling Rule #2

When a job finishes its local workload, stop running it

[Figure: work completed vs. time in a slice; a job's curve reaches the local-work-complete line and goes flat]

DP-Fair Scheduling Rule #3

Don’t voluntarily allow idle time in excess of this limit

[Figure: idle time vs. time in a slice; the allowable idle-time pool grows with slope m − Σi ui, while idle time can accrue with slope up to m]

DP-Fair Guarantees Optimality

We say that a scheduling algorithm is DP-Fair if it follows these three rules

Theorem: any DP-Fair scheduling algorithm for periodic tasks is optimal

Proof of Theorem


DP-Fair Implications

( Partition time into slices ) + ( Assign proportional workloads )

Optimal scheduling is almost trivial

Minimally restrictive rules allow great latitude for algorithm design and adaptability

What is the simplest possible algorithm?

All time slices are equivalent (up to linear scaling), so normalize the time slice length to 1; jobs then have workload = utilization

Schedule by “stacking and slicing”: line up the job utilization blocks and slice at integer boundaries (see the code sketch after the example below)

The DP-Wrap Algorithm

Example: 3 processors, 7 tasks

[Figure, built up over several slides: the seven tasks' utilization blocks are lined up end to end, then sliced at the integer boundaries 1 and 2; each unit-length segment becomes one processor's schedule for the slice, and a task cut by a boundary runs at the end of one processor's slice and the start of the next's]
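A minimal sketch of the stacking-and-slicing step (Python; illustrative, assuming a slice normalized to length 1). It returns, for each processor, the intervals of the slice each task occupies; a task cut at an integer boundary gets the tail of one processor's slice and the head of the next's, and since each utilization is at most 1 those two intervals never overlap in time:

```python
def dp_wrap(utilizations: list[float]) -> list[list[tuple[int, float, float]]]:
    """Wrap task utilization blocks onto unit-length processor slices.
    Returns, per processor, a list of (task index, start, end) intervals
    within the normalized slice [0, 1]."""
    EPS = 1e-9
    processors: list[list[tuple[int, float, float]]] = [[]]
    pos = 0.0  # current offset within the current processor's slice
    for i, u in enumerate(utilizations):
        while u > EPS:
            if pos >= 1.0 - EPS:       # boundary reached: wrap to next processor
                processors.append([])
                pos = 0.0
            piece = min(u, 1.0 - pos)  # run until the block or the slice ends
            processors[-1].append((i, pos, pos + piece))
            u -= piece
            pos += piece
    return processors

# Example: 7 tasks with total utilization 3 fill 3 processors exactly
print(dp_wrap([0.5, 0.4, 0.6, 0.4, 0.3, 0.5, 0.3]))
```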

DP-Wrap Performance

n − 1 context switches and m − 1 migrations per time slice

This is about 1/3 as many as LLREF, similar to BF, and somewhat more than EKG

DP-Wrap has very low ( O(n) ) computational complexity

Extending DP-Fair: Sporadic Arrivals, Arbitrary Deadlines

Sporadic arrivals: jobs from a task arrive at least p time units apart

Arbitrary deadlines: a job from T = (p, e, D) has a deadline D time units after arrival; D may be smaller or larger than p

DP-Fair and DP-Wrap are easily modified to handle these cases

DP-Fair Easily Extended

Extending optimal algorithms to sporadic tasks and arbitrary deadlines has previously been difficult:

pfair: 10 years

LLREF: 3 years

DP-Fair & DP-Wrap: 3 weeks

Ongoing Work

Extend to uniform multiprocessors (CPUs running at different speeds)

Extend to a random, unrelated job model

Heuristic improvements to DP-Wrap: work-conserving scheduling, slice merging

Summary

DP-Fair: simple rules that make optimal real-time multiprocessor scheduling easy!!

DP-Wrap: the simplest known optimal multiprocessor scheduling algorithm

Extensible: DP-Fair and DP-Wrap easily handle sporadic tasks with arbitrary deadlines

We plan to extend DP-Fair and DP-Wrap to more general problem domains

Thanks for Listening! Questions?

=== EXTRA SLIDES ===

1. Sporadic Tasks, Arbitrary Deadlines

Sporadic Arrivals, Arbitrary Deadlines

We extend our task definition to Ti = (pi, ei, Di), where each job of Ti has a deadline Di time units after its arrival; Di may be smaller or larger than pi. These are referred to as arbitrary deadlines.

Sporadic arrivals means that pi is now the minimum inter-arrival time for Ti's jobs, so a new job arrives at least pi time units after the previous arrival.

Scheduling is on-line, i.e. we don't know ahead of time when new jobs will arrive

Task Density

Our expression for utilization must be updated to the density of a task: δi = ei / min{ pi , Di }

δi ≤ 1 for all i and Σi δi ≤ m are still sufficient conditions for the feasibility of a task set, but they are no longer necessary.

Example: T1 = (2, 1, 1) , T2 = (2, 1, 2) on m = 1 is feasible, although Σi δi = 1.5.
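A minimal sketch of the density computation and this sufficient-only test (Python; the names are illustrative):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class SporadicTask:
    p: float  # minimum inter-arrival time
    e: float  # workload per job
    d: float  # relative deadline (may be smaller or larger than p)

    @property
    def density(self) -> float:
        # delta_i = e_i / min{p_i, D_i}
        return self.e / min(self.p, self.d)

def density_test(tasks: list[SporadicTask], m: int) -> bool:
    # Sufficient but not necessary: delta_i <= 1 for all i, total <= m.
    ds = [t.density for t in tasks]
    return all(d <= 1 for d in ds) and sum(ds) <= m

# The example above: feasible on m = 1, yet the test rejects it
# because the total density is 1.5.
print(density_test([SporadicTask(2, 1, 1), SporadicTask(2, 1, 2)], m=1))  # False
```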

Optimality

Our updated DP-Fair rules for sporadic tasks with arbitrary deadlines will allow an algorithm to schedule any task set with δi ≤ 1 for all i and Σi δi ≤ m; in fact, we require that these conditions hold

Since feasible task sets exist where these conditions do not hold, DP-Fair algorithms are not optimal in this problem domain

This is an acceptable limitation, since it has recently been proven that no optimal on-line scheduler can exist for sporadic tasks

Updated DP-Fair Scheduling

The rules for scheduling within time slices are nearly the same with sporadic tasks and arbitrary deadlines

The third rule is now: "Do not voluntarily allow more than ( m − Σi δi ) × ( t − t0 ) + Σj Fj(t) units of idle time to occur during the time slice before time t", where t0 is the start of the slice

The Fj(t) term represents unused capacity between tasks' deadlines and next arrivals

Updated DP-Fair Allocation

The rules for allocating workloads in time slices become more complicated than simple proportionality

A task is active between its arrival and its deadline. When a task becomes active, it is allocated a workload proportional to its density and the length of its active segment in the slice

If a job arriving during a slice has a deadline within the slice, the remainder of the slice is partitioned into two new subslices, with a subslice boundary at the deadline, and remaining work for all tasks is allocated proportionally to the subslice lengths (see the sketch below)
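A minimal sketch of this subslice rule (Python; the function names are hypothetical):

```python
def split_slice(slice_start: float, slice_end: float,
                arrival: float, deadline: float) -> list[tuple[float, float]]:
    # A job arrives mid-slice with a deadline inside the slice: cut the
    # remainder of the slice into two subslices at that deadline.
    assert slice_start <= arrival < deadline <= slice_end
    return [(arrival, deadline), (deadline, slice_end)]

def reallocate(remaining_work: float,
               subslices: list[tuple[float, float]]) -> list[float]:
    # Spread a task's remaining local work over the subslices in
    # proportion to their lengths.
    lengths = [end - start for start, end in subslices]
    total = sum(lengths)
    return [remaining_work * length / total for length in lengths]

# Example: a job arrives at t = 2 in the slice [0, 10] with deadline t = 6;
# a task with 4 units of remaining local work gets 2 units per subslice.
print(reallocate(4.0, split_slice(0, 10, arrival=2, deadline=6)))  # [2.0, 2.0]
```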

Handling a Sporadic Arrival

[Figure: work completed vs. time; a job arriving in the middle of a slice is allocated a workload upon arrival, proportional to its density and the remainder of the slice]

Managing Idle Time

[Figure: idle time vs. time within a slice; the available idle time grows by the utilization of an unarrived sporadic job until that job finally arrives, adding the additional idle time from the late arrival to the original idle-time pool; the pool's baseline slope is m − Σi ui, and idle time can accrue with slope up to m]

Theorem 3

Any DP-Fair scheduling algorithm for sporadic tasks with arbitrary deadlines will successfully schedule any task set where δi ≤ 1 for all i and Σi δi ≤ m

Modifying DP-Wrap

The DP-Wrap algorithm is easily modified to accommodate the DP-Fair scheduling rules for sporadic tasks with arbitrary deadlines

If a job arrives during a time slice, wrap it onto the end of the stack of blocks

If the arriving job has a deadline within the current slice, partition the remainder of the slice into two subslices, as per our previous rule, and apply the usual scheduling rules to both of these subslices
