Multitasking and scheduling
Guillaume Salagnac
INSA Lyon – IST Semester
Fall 2019
Previously on IST-OPS: kernel vs userland
[Figure: applications 1 and 2 each run in their own virtual machine (VM1, VM2) on top of the OS kernel and the hardware]
Each program executes on an isolated virtual machine:
• the processor is just for me: “virtual CPU”
• the memory is just for me: “virtual memory”
2/39
Separation of mechanism and policy
Principle: design for orthogonality
Operating system designers try not to confuse:
• mechanisms (and their implementation) on one hand, and
• policies (and their specifications) on the other hand

[Figure: example of a mechanism vs the corresponding policy]
3/39
Outline
1. Introduction: the concept of a process
2. Achieving multitasking through context switching
3. Scheduling: problem statement
4. Scheduling: classical algorithms
5. Evaluating a scheduling policy
4/39
Definitions: Multitasking vs Multiprocessing
Multiprocessing, multi-core computing
simultaneous execution of programs by separate processors
VS
Multiprogramming AKA multitasking
ability to run several programs “at the same time” on one machine
→ typically: number of CPUs ≪ number of programs
5/39
Pseudo-parallelism via execution interleaving
[Figure: three virtual CPUs (one per application A, B, C), implemented by interleaving A B C A B C ... on a single physical CPU]
Policy = 1 VCPU / application
Mechanism = CPU time-sharing
Note: interleaving is fine as long as the user doesn’t notice
6/39
Degree of multiprogramming
Definition: degree of multiprogramming
Number of processes currently loaded in the system
source: Tanenbaum. Modern Operating Systems (4th ed, 2014). page 87
7/39
Why do we want multiprogramming?
Empirical observation
When executing, a program alternates between doing some calculations (CPU burst) and waiting for data (I/O burst)
[Figure: each virtual CPU alternates CPU bursts and I/O waits, leaving the CPU idle between bursts; interleaving A’s and B’s bursts on one physical CPU fills those idle periods]
Same performance, better resource utilization
8/39
Multiprogramming: remarks
When/why does a program have to wait for data?
• because of access latency: disk ≈ 5 ms, network ≈ 100 ms...
• interactive programs have to wait for user input
• programs may also have to synchronize with each other
Bad approach: busy waiting AKA polling
• difficult to program correctly
• precious CPU time is wasted doing nothing

Solution: passive waiting AKA blocking
• easier to use: just call a blocking function
• better CPU utilization
• latency hiding: overlap computations and I/O
→ we need a mechanism to share the CPU
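The contrast between the two approaches can be sketched in Python, with a thread standing in for an I/O device (a toy illustration; a real kernel implements blocking with interrupts and the scheduler, not threads). The commented-out loop is the busy-waiting version; `threading.Event.wait()` is the blocking version:

```python
import threading
import time

event = threading.Event()
result = []

def producer():
    time.sleep(0.05)      # simulate I/O latency
    result.append(42)     # the awaited data becomes available
    event.set()           # wake up the blocked consumer

# Bad: busy waiting -- the CPU spins, re-checking the flag:
# while not event.is_set():
#     pass                # precious CPU time wasted doing nothing

threading.Thread(target=producer).start()
event.wait()              # Good: passive waiting -- the caller blocks
print(result[0])          # -> 42
```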
9/39
Illustration of a context switch between two processes
[Figure: P1 is running; an interrupt or syscall traps into the kernel, which copies the CPU registers to TCB1, deals with the syscall/interrupt, chooses P2, loads the CPU registers from TCB2, and returns (RETI); P2 is now running and P1 is dormant]
10/39
Context switch: remarks
• dispatcher = implements the context switch
• executed very often → must be quick (dispatch latency)
• scheduler = chooses which program to execute next
• possible that P2 = P1, e.g. if P1 calls gettimeofday()...
• possible that P2 ≠ P1, e.g. read() → blocking call

Associated kernel data structures:
• Process Control Block = PCB
• represents a running program: process number (PID), executable filename, permissions...
• contains one TCB
• Thread Control Block = TCB
• represents a virtual processor AKA execution context
• contains a copy of the CPU state: registers, PC, SR...
vocabulary: (in today’s lecture) “process” = “thread”
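As a rough sketch, the PCB/TCB relationship could be modeled with Python dataclasses (field names are illustrative, not taken from any real kernel):

```python
from dataclasses import dataclass, field

@dataclass
class TCB:
    """Thread Control Block: one virtual processor / execution context."""
    registers: dict = field(default_factory=dict)  # saved CPU state
    pc: int = 0                                    # saved program counter
    state: str = "new"    # new / ready / running / blocked / terminated

@dataclass
class PCB:
    """Process Control Block: one running program."""
    pid: int              # process number
    executable: str       # executable filename
    tcb: TCB = field(default_factory=TCB)          # contains one TCB

p = PCB(pid=1, executable="/bin/ls")
p.tcb.state = "ready"
```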
11/39
What program to execute next?
[Figure: three processes P1, P2, P3, each an alternating sequence of CPU bursts (A1–A7, B1–B7, C1–C10) and I/O waits]
Question: On a single CPU, how should we execute this workload?
13/39
Naive scheduling
First idea: always execute A, B, then C, and repeat
[Figure: resulting schedule — A1–A4 and B1–B4 dominate the CPU, with C’s short bursts C1–C4 served in between, very late]
→ quite inefficient, especially for C
Second idea: execute C as often as possible
[Figure: resulting schedule — C’s bursts C1–C8 run whenever C is ready, interleaved with A’s and B’s bursts]
→ a lot better for C, while almost the same for A and B
14/39
Not all processes want the CPU all the time
[Figure: schedule with two instants t1 and t2 marked]
→ at time t1:
• A has the CPU
• B is ready to execute
• C is waiting for an input/output request to complete
→ at time t2:
• B has the CPU
• A and C are ready to execute
15/39
Process state diagram (1)
[Figure: process state diagram with states New, Ready, Running, Blocked, Terminated]
Possible states for a process:
• New = PCB/TCB currently being created by the kernel
• Running = active, i.e. currently executing on the processor
• Ready = activable, i.e. waiting to be executed
• Blocked = sleeping, i.e. waiting for some event to complete
• Terminated = PCB/TCB being cleaned up by the kernel
16/39
Process state diagram (2)
[Figure: the same state diagram, with transitions numbered 0–5]
Transitions:
0 PCB/TCB initialization is done
1 the dispatcher loads the thread on the CPU
2 an IRQ or syscall interrupts execution
3 the program makes a blocking syscall
• e.g. input-output read(), delay sleep(), etc.
4 the awaited event completes
• e.g. data becomes available, delay expires, etc.
5 execution comes to an end (either voluntarily or abruptly)
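The numbered transitions above can be written down as a small table and enforced programmatically (a sketch; state names and numbering follow the slide):

```python
# Allowed transitions, keyed by (from_state, to_state) -> slide number
TRANSITIONS = {
    ("new", "ready"): 0,           # PCB/TCB initialization is done
    ("ready", "running"): 1,       # dispatcher loads the thread on the CPU
    ("running", "ready"): 2,       # an IRQ or syscall interrupts execution
    ("running", "blocked"): 3,     # the program makes a blocking syscall
    ("blocked", "ready"): 4,       # the awaited event completes
    ("running", "terminated"): 5,  # execution comes to an end
}

def step(state, new_state):
    """Move to new_state, refusing transitions absent from the diagram."""
    if (state, new_state) not in TRANSITIONS:
        raise ValueError(f"illegal transition {state} -> {new_state}")
    return new_state

# Walk one legal lifecycle: created, dispatched, blocked on I/O, woken
# up, dispatched again, and finally terminated.
s = "new"
for nxt in ["ready", "running", "blocked", "ready", "running", "terminated"]:
    s = step(s, nxt)
```

Note that there is no (blocked, running) entry: a blocked process must go back through the Ready Queue before the dispatcher can pick it again.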
17/39
Scheduling: problem statement
Purpose of the CPU scheduler
• given K threads which are ready to execute
• assuming that we know relevant “features” about them
• given N ≥ 1 available processors
decide which threads to execute on each processor

Remark: when is the scheduler activated?
• upon each transition running → blocked (3), e.g. sleep()
• when a process terminates (5)
• upon each transition blocked → ready (4)
• upon each transition running → ready (2)
• e.g. upon receiving an IRQ from the system timer
18/39
Two types of scheduling
Cooperative scheduler: activated only upon (3) and (5)
• applications explicitly yield control of the CPU
• blocking system calls
• → needs a dedicated yield() syscall
• efficient, but requires trusting the applications

Preemptive scheduler: activated upon (3), (5) and also (0), (2), (4)
• enables the kernel to stay in control of the machine
• system timer sends periodic IRQs to trigger preemption
• less efficient, but allows for executing untrusted applications
19/39
States implemented as queues
[Figure: process creation feeds the Ready Queue; dispatch moves a process to the CPU and preemption moves it back; a disk or network request moves it to the Disk or Net queue until the request completes; sleep() moves it to the Sleeping queue until the delay expires]
20/39
Process queues: remarks
Thread Control Blocks are chained together, forming queues
• Ready Queue AKA Run Queue
Purpose of the scheduler: choose a TCB in the Ready Queue
Blocked processes: transferred to another queue
• one Device Queue for each input/output device
• one queue for sleeping processes
→ ... one queue for each reason to be Blocked
21/39
Scheduling in project management
Off-line scheduling: projects, workshop, factory, etc.
Input: a set of “tasks” (or “jobs”) with duration and dependencies( + a list of available “resources” )
Output: a start date for each task( + assignment of resources to tasks )
23/39
Off-line scheduling vs long-running processes
Omniscient point of view
[Figure: the same three-process workload (P1, P2, P3 with all their CPU bursts and I/O waits), seen with full knowledge of the future]
VS
Point of view of the scheduler at t=0
The Ready Queue contains P1, P2 and P3. The CPU is idle.
→ how can we decide what to do?
24/39
FCFS Scheduling: First Come First Served
also known as FIFO (First In First Out)

FCFS scheduling: principle
run jobs in the same order they arrived in the queue
In our example:
[Figure: FCFS schedule of the example workload — identical to the first “naive” schedule: C’s short bursts are served late]
Remarks:
• inspired by real-life situations
• intuitive fairness; no risk of starvation
• non-preemptive scheduling
• “small” tasks (e.g. C) may be penalized
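FCFS is straightforward to simulate. A minimal sketch, using three hypothetical jobs that all arrive at t=0 with durations 6, 8 and 3 ticks:

```python
def fcfs(jobs):
    """jobs: list of (name, duration), already in arrival order.
    Returns {name: waiting_time} for a single CPU."""
    t, waits = 0, {}
    for name, duration in jobs:
        waits[name] = t          # time spent in the Ready Queue so far
        t += duration            # run the burst to completion
    return waits

waits = fcfs([("T1", 6), ("T2", 8), ("T3", 3)])
print(waits)                     # the short job T3 waits 14 ticks
```

The short job pays for every longer job ahead of it, which is exactly the “small tasks may be penalized” remark above.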
25/39
The problem of resource starvation
Definition: starvation AKA livelock
Indefinite postponement of a task which is otherwise ready

Causes for starvation:
• non-preemptive scheduler + 1 infinite task
→ working hypothesis: no tasks with infinite duration
• preemptive scheduler + pathological scenario
• bad luck, or malicious behaviour (e.g. DoS attack)

Risk of starvation VS proven impossibility of starvation
AKA bounded waiting, finite bypass...
26/39
Remember: execution is a sequence of bursts
Working hypothesis
For each process in the ready queue, the kernel has a way of knowing the duration of its next CPU burst

Remark: the “tasks” managed by a CPU scheduler are these CPU bursts, and not processes or threads

Point of view of the scheduler at t=0
the Ready Queue contains A1, B1, and C1.
→ which process should we execute next?
27/39
Different types of bottlenecks
In our example:
• A and B are “mostly doing calculations” → bottleneck = CPU
• C is “mostly doing input/output” → bottleneck = I/O device

Definitions
• A program is said to be «compute-bound» if a faster processor would reduce its execution time
• A program is said to be «I/O-bound» if faster input/output would reduce its execution time
• variants: memory-bound, disk-bound, network-bound...

Empirical observation
In practice, a thread will be either compute-bound or I/O-bound.
28/39
Distribution of CPU burst durations
[Figure: frequency distribution of CPU burst durations — many short bursts, few long ones]
source: Silberschatz. Operating System Concepts Essentials (2011). p 177
29/39
SJF Scheduling: Shortest Job First
SJF Scheduling: principle
in the Ready Queue, pick the job with smallest execution time
In our example:
[Figure: SJF schedule of the example workload — identical to the “second idea” above: C’s bursts run as often as possible]
Remarks:
• beneficial to I/O-bound processes...
• ...while not harming CPU-bound processes too much
• risk of starvation if many short jobs arrive
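On the same three hypothetical jobs (durations 6, 8, 3, all ready at t=0), non-preemptive SJF simply sorts the Ready Queue by burst duration; the shortest job now goes first and the average wait drops from 20/3 ≈ 6.7 ticks under FCFS to 4 ticks:

```python
def sjf(jobs):
    """Non-preemptive SJF for jobs all ready at t=0: shortest first.
    jobs: list of (name, duration). Returns {name: waiting_time}."""
    t, waits = 0, {}
    for name, duration in sorted(jobs, key=lambda j: j[1]):
        waits[name] = t          # time spent in the Ready Queue
        t += duration            # run the burst to completion
    return waits

waits = sjf([("T1", 6), ("T2", 8), ("T3", 3)])
avg = sum(waits.values()) / len(waits)
print(waits, avg)                # T3 runs first; average wait = 4.0
```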
30/39
SRTF Scheduling: Shortest Remaining Time First
SRTF Scheduling: principle
like SJF but with preemption
→ choice re-evaluated on each transition blocked → ready

[Animation: the schedule is rebuilt step by step at t = 0, 1, 3, 4, 5..., each time picking from the Ready Queue the burst with the shortest remaining time]

Remarks:
• similar to SJF (on our example, the result is identical)
• preemptive: one process can’t monopolize the CPU
• but still prone to starvation
31/39
RR Scheduling: Round Robin

Round Robin Scheduling: principle
• ready processes are each given the CPU in turns...
• ...for a maximum duration q each → time quantum
• a burst which exceeds its quantum gets preempted
In our example, with a quantum duration q = 2 ticks
[Figure: Round-Robin schedule with q = 2 ticks — bursts longer than the quantum (e.g. A1, B1) are preempted by the system timer IRQ]
Remarks:
• kernel tick = regular IRQ from system timer
• new tasks go at the end of the queue
→ naturally fair and immune to starvation
• but how do we choose the value of q?
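Round Robin can be sketched with a double-ended queue (same three hypothetical jobs as before, q = 2 ticks):

```python
from collections import deque

def round_robin(jobs, q):
    """jobs: list of (name, duration), all ready at t=0.
    Returns {name: completion_time} under Round Robin with quantum q."""
    queue = deque(jobs)
    t, finish = 0, {}
    while queue:
        name, remaining = queue.popleft()
        run = min(q, remaining)       # run for at most one quantum
        t += run
        if remaining > run:
            queue.append((name, remaining - run))  # preempted: back of queue
        else:
            finish[name] = t          # burst completed
    return finish

finish = round_robin([("T1", 6), ("T2", 8), ("T3", 3)], q=2)
print(finish)                         # short job T3 finishes first
```

Choosing q is the trade-off asked about above: a smaller q improves responsiveness but increases the fraction of CPU time lost to context switches.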
32/39
In real life: priority scheduling

Priority scheduling: principle
• maintain several ready queues simultaneously
• consider them by decreasing order of priority
• each queue can implement a different policy: RR, SRTF...

Variants:
• fixed priority → real-time scheduling
• variable priority → time sharing AKA best-effort computing
• example: Multi-Level Feedback Queues Scheduling (MLFQ)
• with criteria to promote/demote processes

MLFQ scheduling: example
• high priority: RR q=5ms → interactive processes
• average priority: RR q=50ms → I/O-bound tasks
• low priority: SRTF → run CPU-bound tasks in the background
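A minimal two-level MLFQ can be sketched as follows (an illustration of the demotion idea only, with hypothetical job names; real kernels also promote processes back up, which this sketch omits): new jobs start in the high-priority queue with a short quantum, and a job that consumes its full quantum is demoted to the low-priority queue.

```python
from collections import deque

def mlfq(jobs, q_high=2, q_low=4):
    """Two-level MLFQ: Round Robin at both levels, demotion on full
    quantum. jobs: list of (name, duration), all ready at t=0.
    Returns {name: completion_time}."""
    high, low = deque(jobs), deque()
    t, finish = 0, {}
    while high or low:
        # always serve the high-priority queue first
        level, q = (high, q_high) if high else (low, q_low)
        name, remaining = level.popleft()
        run = min(q, remaining)
        t += run
        if remaining > run:
            low.append((name, remaining - run))  # used full quantum: demote
        else:
            finish[name] = t
    return finish

finish = mlfq([("interactive", 1), ("batch1", 6), ("batch2", 6)])
print(finish)        # the short interactive job finishes immediately
```

The short interactive job never exhausts its quantum, so it stays high priority; the CPU-bound jobs quickly sink to the background level.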
33/39
Evaluating a scheduling policy
Evaluation methodology
• deterministic simulation: on a given scenario
• play out the algorithms, on paper or with a computer
• stochastic modeling
• queueing theory, Markov chains...
• real system instrumentation AKA benchmarking
• impact on performance, choice of the workload...

Example scenarios:

task  duration
T1    6
T2    8
T3    3

task  arrival  duration
T1    0        8
T2    1        4
T3    2        9
T4    3        5
35/39
Evaluation criteria
• CPU utilization rate: proportion of time when the CPU is active
• i.e. executing application code (vs running kernel code, or idling)
• Throughput: number of jobs finished per unit of time
• only makes sense if “jobs” can “finish”
• Fairness in general and non-starvation in particular
• a whole subject by itself
• Turnaround time: time elapsed between arrival and termination
• only makes sense if “jobs” can “finish”
• Waiting time: duration spent in the ready queue
• all time spent in the ready queue really is wasted
• Response time: time elapsed before first “response”
• depends on the definition of response
36/39
Example
Consider this scenario:
task  arrival  duration
T1    0        8
T2    1        4
T3    2        9
T4    3        5
[Figure: Gantt charts of FCFS, SJF, SRTF, and RR (q=3) on this scenario, over time 0–26]

For SRTF:
TT = ((17 − 0) + (5 − 1) + (26 − 2) + (10 − 3)) / 4 = 13
WT = ((10 − 1) + 0 + (17 − 2) + (5 − 3)) / 4 = 6.5
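The SRTF line can be checked with a small tick-by-tick simulator of the scenario above (a sketch: at every tick it runs the ready task with the shortest remaining time):

```python
def srtf(tasks):
    """tasks: {name: (arrival, duration)}.
    Returns {name: completion_time} under SRTF on a single CPU."""
    remaining = {n: d for n, (a, d) in tasks.items()}
    finish, t = {}, 0
    while remaining:
        ready = [n for n in remaining if tasks[n][0] <= t]
        if not ready:
            t += 1                    # CPU idle: no task has arrived yet
            continue
        n = min(ready, key=lambda n: remaining[n])  # shortest remaining time
        remaining[n] -= 1             # run it for one tick
        t += 1
        if remaining[n] == 0:
            del remaining[n]
            finish[n] = t
    return finish

tasks = {"T1": (0, 8), "T2": (1, 4), "T3": (2, 9), "T4": (3, 5)}
finish = srtf(tasks)
tt = sum(finish[n] - a for n, (a, d) in tasks.items()) / 4      # turnaround
wt = sum(finish[n] - a - d for n, (a, d) in tasks.items()) / 4  # waiting
print(finish, tt, wt)   # completions T2=5, T4=10, T1=17, T3=26; TT=13, WT=6.5
```

The completion times it finds (T2 at 5, T4 at 10, T1 at 17, T3 at 26) are exactly the ones plugged into the TT and WT formulas above.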
37/39
Summary

Policy vs Mechanism
• Multitasking vs Multiprocessing
• VCPU vs context switch + scheduling
Important concepts
• Dispatcher, Scheduler, Process Control Block, Preemption, CPU-burst / I/O-burst, process states, Ready Queue...
Scheduling policies
• First Come First Served
• Shortest Job First, Shortest Remaining Time First
• Round Robin with a value for the time quantum
• Priority scheduling, either with fixed or dynamic priorities
• Multi-Level Feedback Queue
Evaluation: Turnaround Time, Waiting Time...
39/39