Transcript

Efficiency Analysis of Decentralized

Grid Scheduling with Job Migration

and Replication

International Conference on Ubiquitous Information Management and Communication (ACM ICUIMC 2013)

January 17–19, 2012, Kota Kinabalu, Malaysia

[email protected], [email protected]

Rzhanov Institute of Semiconductor Physics Siberian Branch of Russian Academy of Sciences

Siberian State University of Telecommunications and Information Sciences

Mikhail Kurnosov, Alexey Paznikov

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Decentralized Grid Scheduling

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 2M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling …

Subsystem1

EC1 EC2 ECn1

…

RMS

Local jobs

Subsystem3

EC1 EC2 ЭМn3

…

RMS

Subsystem2

EC1 EC2 ECn2

…

RMS

Subsystem4

EC1 EC2 ECn4

…

RMS

Local jobs

Local jobs Local jobsJob

r – number of processes,

z1, z2, …, zk – stage-in files sizes,

h1, h2, …, hk – subsystems with stage-in files.

ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Scheduling of Parallel MPI-programs

in Geographically-distributed Computer Systems

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 3

Centralized

scheduling

β€’ GridWay

β€’ GrADS

β€’ AppLeS

β€’ Condor-G

β€’ Nimrod/G

β€’ WMS

Decentralized

scheduling

β€’ GBroker

β€’ (Korneev, 1985)*,

(Monakhov, 2000)**

Korneev V.V. Architecture of Computer Systems with Programmable Structure. Novosibirsk, 1985.

Monakhov O.G., Monakhova E.A. Parallel Systems with Distributed Memory. Resource and Task Management. Novosibirsk, 2001.

*

**

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Decentralized Scheduling of Parallel Programs

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 4

Scheduler1

EC1 EC2 ECn1

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler3

Scheduler2

EC1 EC2 ECn2

…

RMS

EC1 EC2 ECn4

…

RMS

Scheduler4

Local jobs Local jobs

Local jobs Local jobs

Subsystem1

Subsystem3

Subsystem2

Subsystem4

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Algorithms of Decentralized Scheduling

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 5

i j1

j2

ju

Scheduler Scheduler

Scheduler

Scheduler

The algorithms are proposed:

β€’ Locally optimal scheduling (LOS),

β€’ Based on job replication (RS),

β€’ Based on job migration (MS),

β€’ Based on job migration and

job replication (MRS).

Each of the algorithms describes

functioning of the scheduler i after

submitting a job to its queue.

Step 0: tij, cj, sj, qj and nj are obtained from the monitoring system, then

the set S(i) = {j | nj β‰₯ r, j ∈ L(i) βˆͺ {i}} is built up.

L(i) ={ j1, j2, …, ju }

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.

Scheduleri

ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 6

Locally Optimal Scheduling (LOS)

EC1 EC2 ECni

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler2

Scheduler1

EC1 EC2 ECn2

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler3

Job

Local jobs

Local jobs

Local jobs

Local jobs

Subsystemi

Subsystem3

Subsystem2

Subsystem4

++

=

βˆ’

βˆ’

,

,

)(

max

max

1

max

1

max

t

t

w

w

c

c

t

t

jFj

jjjif cj < r or qj > 0,

else.

βˆ‘=

=

k

l

llj zjhtt1

),,(

cj – number of non-used EC on subsystem j;

qj – number of jobs in the queue;

wj = qj / nj – number of jobs in the queue per one EC

tj – time of staging-in to the subsystem j

Step 1. Subsystem j* = argmin{F(j)} is chosed

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.

Scheduleri

ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 7

Locally Optimal Scheduling (LOS)

EC1 EC2 ECni

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler2

Scheduler1

EC1 EC2 ECn2

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler3

Job

Local jobs

Local jobs

Local jobs

Local jobs

Subsystemi

Subsystem2

Subsystem1

Subsystem3

Step 2. Job is submitted to the local RMS

of subsystem j*.

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.

Scheduleri

ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 8

Scheduling Based on Job Replication (RS)

EC1 EC2 ECni

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler2

Scheduler1

EC1 EC2 ECn2

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler3

Job

Local jobs

Local jobs

Local jobs

Local jobs

Subsystemi

Subsystem2

Subsystem1

Subsystem3

Step 1. m subsystem j*1, j*2, …, j*m ∈ S(i)

with the least function F(j) value are chosen

cj – number of non-used EC on subsystem j;

qj – number of jobs in the queue;

wj = qj / nj – number of jobs in the queue per one EC

tj – time of staging-in to the subsystem j

++

=

βˆ’

βˆ’

,

,

)(

max

max

1

max

1

max

t

t

w

w

c

c

t

t

jFj

jjjif cj < r or qj > 0,

else.

βˆ‘=

=

k

l

llj zjhtt1

),,(

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.

Scheduleri

ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 9

Scheduling Based on Job Replication (RS)

EC1 EC2 ECni

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler2

Scheduler1

EC1 EC2 ECn2

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler3

Local jobs

Local jobs

Local jobs

Local jobs

Subsystemi

Subsystem2

Subsystem1

Subsystem3

Step 2. Job are submitted to the queues of

subsystems j*1, j*2, …, j*m.

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

JobJob

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.

Scheduleri

ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 10

Scheduling Based on Job Migration (MS)

EC1 EC2 ECni

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler2

Scheduler1

EC1 EC2 ECn2

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler3

Job

Local jobs

Local jobs

Local jobs

Local jobs

Subsystemi

Subsystem2

Subsystem1

Subsystem3

Step 1. With the interval βˆ†1 locally optimal subsystem

j' search procedure is performed:

j'= argmin{F(j)}

Step 2. If F(j*) – F(j') > Ξ΅, then migration is

performed from subsystem j to the subsystem j'.

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.

Scheduleri

ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 11

Scheduling Based on Combined Approach (MRS)

EC1 EC2 ECni

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler2

Scheduler1

EC1 EC2 ECn2

…

RMS

EC1 EC2 ECn3

…

RMS

Scheduler3

Job

Local jobs

Local jobs

Local jobs

Local jobs

Subsystemi

Subsystem2

Subsystem1

Subsystem3

The algorithm is based on combination of two

approaches – replication to the multiple subsystem

and migration between the subsystems.

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

GBroker Functional Structure

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 12

– software, developed in CPCT SibSUTIS

– third-party software

Local RMS

Globus Toolkit DCSMon

GBroker

NetMon

GClient

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Multicluster Computer System Testbed

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 13

GBroker

NetMon DCSMon

Globus Toolkit 5.0

GBroker

NetMon DCSMon

Globus Toolkit 5.0

GBroker

NetMon DCSMon

Globus Toolkit 5.0

GBroker

NetMon DCSMon

Globus Toolkit 5.0

GBroker

NetMon DCSMon

Globus Toolkit 5.0

GBroker

NetMon DCSMon

Globus Toolkit 5.0

GridWay 5.6.1

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

MPI-programs (SPEC MPI2007):

β€’ Weather Research and Forecasting (WRF) – simulation of climate

processes.

β€’ The Parallel Ocean Program (POP) – simulation of processes in the

ocean.

β€’ LAMMPS – molecular dynamics simulation tools.

β€’ RAxML – bioinformatics modeling.

β€’ Tachyon – graphical rendering.

Jobs flows:

β€’ Simple flows with flux level λλλλ.

β€’ The jobs are chosen randomly with uniform distribution.

β€’ Rank r of parallel program is chosen from the set {1, 2, 4, 8}.

β€’ Stage-in files were on the subsystem Xeon80.

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Experiments Organization

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 14M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

β€’ System bandwidth B

B = M / Ο„

β€’ Mean queue waiting time W

β€’ Mean job service time T

Notations:

β€’ tk – job submitting time to the scheduler k ∈ {1, 2, …, M}.

β€’ – time of job k beginning execution.

β€’ – time of job k finishing execution.

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Indices of Efficiency of Scheduling Algorithms

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 15

ktβ€²

kt β€²β€²

βˆ‘=

βˆ’β€²=

M

k

kk ttM

W1

)(1

βˆ‘=

βˆ’β€²β€²=

M

k

kk ttM

T1

)(1

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Flow of M = 200 jobs was submitted to the subsystem Xeon80

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Efficiency comparison of algorithms LOS and RS

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 16

System bandwidth B

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Efficiency comparison of algorithms LOS and RS

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 17

Mean queue waiting time W Mean job service time T

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Efficiency Comparison LOS, MS, MRS

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 18

Flow of M = 200 jobs was submitted to the subsystem Xeon80

System bandwidth B

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Efficiency Comparison LOS, MS, MRS

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 19

Mean queue waiting time W Mean job service time T

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π° Π± Π²

Π³ Π΄ Π΅

Π° – full graph, Π± – D2(6; 2, 3), Π² – ring, Π³ – mesh, Π΄ – 2D-torus, Π΅ – star

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Schedulers Logical Structures

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 20M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Flows of M = 50 jobs were submitted to all the subsystem (algorithm MS)

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Schedulers Logical Structures

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 21

System bandwidth B

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Schedulers Logical Structures

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 22

Mean queue waiting time W Mean job service time T

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Efficiency comparison of schedulers GBroker and GridWay

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 23

GBroker GBroker GBroker GBroker

GBroker GBroker

NetMon DCSMon

Globus Toolkit 5.0

NetMon DCSMon

Globus Toolkit 5.0

NetMon DCSMon

Globus Toolkit 5.0

NetMon DCSMon

Globus Toolkit 5.0

NetMon DCSMon

Globus Toolkit 5.0

GridWay 5.6.1

NetMon DCSMon

Globus Toolkit 5.0

GridWay configuration(recommended by developers)

SCHEDULING_INTERVAL (interval of scheduling) = 5 s;

DISCOVERY_INTERVAL (interval of resource discovery) = 100 s;

MONITORING_INTERVAL (interval of resource state monitoring) = 2 s;

POLL_INTERVAL (interval of job state query) = 10 s;

DISPATCH_CHUNK (number of jobs, dispatched simultaneously) = 15.

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Efficiency comparison of schedulers GBroker and GridWay

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 24

System bandwidth B

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Efficiency comparison of schedulers GBroker and GridWay

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 25

Mean queue waiting time W Mean job service time T

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Algorithms usage recommendations

β€’ Small stage-in files β‡’ LOS, MS, RS, MRS.

β€’ Large stage-in files β‡’ LOS and MS.

β€’ Low rate of job flows or small stage-in files β‡’ RS and MRS

LOS – locally optimal scheduling,

RS – based on job replication,

MS – based on job migration,

MRS – based on job migration and job replication

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Recommendations

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 26M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Conclusion

β€’ Job migration helps increase system bandwidth.

β€’ More system bandwidth (algorithm MS) of decentralized scheduling as

compared with centralized scheduling (GridWay).

β€’ Usage of non-full graphs with small diameter (e.g. ND-torus, D2-graphs) as

logical structures of decentralized schedulers do not leads to job service

efficiency degradation.

β€’ Decentralized algorithms are independent from the number of subsystems

and allows to robustness job flows service.

Future Works

β€’ Algorithms of scheduling Grid workflows

β€’ Forecasting methods in network and subsystems monitoring

β€’ Generation of optimal local neighborhood structures

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Conclusions and Future Works

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 27M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

Thank you for your attention!

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 28M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

jjjj

jjj

j

nqsw

w

w

c

c

t

t

t

t

iF

/)(

,

,

)(

max

1

max

1

max

max1

+=

++

=

βˆ’

βˆ’

if ci β‰₯ r ΠΈ qi = 0,

else

jjj

jjj

j

nqw

w

w

c

c

t

t

t

t

jF

/

,

,

)(

max

1

max

1

max

max3

=

++

=

βˆ’

βˆ’

if ci β‰₯ r and qi = 0,

else

jjj

jj

j

nqw

w

w

t

t

t

t

iF

/

,

,

)(

maxmax

max2

=

+

=

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

if ci β‰₯ r ΠΈ qi = 0,

else

Subsystem Ranking

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 29M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

1 – LOS, F1; 2 – LOS, F2; 3 – LOS, F3

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Алгоритм локально-ΠΎΠΏΡ‚ΠΈΠΌΠ°Π»ΡŒΠ½ΠΎΠΉ диспСтчСризации.

ΠŸΡ€ΠΎΠΏΡƒΡΠΊΠ½Π°Ρ ΡΠΏΠΎΡΠΎΠ±Π½ΠΎΡΡ‚ΡŒ систСмы

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 30

B, jobs/s

Ξ», s-1

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

1 – LOS, F1; 2 – LOS, F2; 3 – LOS, F3

Π±Π°

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Алгоритм локально-ΠΎΠΏΡ‚ΠΈΠΌΠ°Π»ΡŒΠ½ΠΎΠΉ диспСтчСризации. Π‘Ρ€Π΅Π΄Π½Π΅Π΅ врСмя

оТидания Π² ΠΎΡ‡Π΅Ρ€Π΅Π΄ΠΈ ΠΈ срСднСС врСмя обслуТивания Π·Π°Π΄Π°Ρ‡ΠΈ

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 31

Ξ», s-1 Ξ», s-1

W, s T, s

Mean queue waiting time W Mean job service time T

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19

1 – forecasting by NetMon; 2 – time of GridFTP

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников

Forecasting of File Transfer Time

Π‘Π΅ΠΌΠΈΠ½Π°Ρ€ Β«Π’Ρ‹Ρ‡ΠΈΡΠ»ΠΈΡ‚Π΅Π»ΡŒΠ½Ρ‹Π΅ систСмы», Новосибирск, 29 апрСля 2011 Π³.ΠœΠΈΡ…Π°ΠΈΠ» ΠšΡƒΡ€Π½ΠΎΡΠΎΠ², АлСксСй Пазников 32

Time of transfer file with size

m = 100 Мб between subsystems

Xeon80 ΠΈ L400l1

Time of transfer file with size

m = 1 Π“Π± between subsystems

Xeon80 ΠΈ L400l1

M. Kurnosov, A. Paznikov. Efficiency Analysis of Decentralized Grid Scheduling … ACM ICUIMC, Kota Kinabalu, Malaysia, January 17-19


Recommended