23
Calcul Québec [email protected] MoabCon - April 2012 MOAB: Transitioning from Grid Engine 1

moabcon2012 - Transitioning from Grid Engine

Embed Size (px)

Citation preview

Page 1: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

[email protected] - April 2012

MOAB: Transitioning from Grid Engine

1

Page 2: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Plan2

ColosseRationalTransitionStatus update

Page 3: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Colosse3

Room for 56 racksPower: 1.1 MWCooling: ~1.5 MWUPS and power generator

for filesystems and servers

Page 4: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Colosse4

cold air plenum(32 m2)

hot air core(25 m2)

Page 5: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Colosse5

Sun Constellation System deployed in 2009960 Diskless Compute Nodes

7680 Nehalem CoresQDR Infiniband only, Full Bisection

1PB of Lustre Storage

Page 6: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Colosse : Current architecture

Everything is tied together with custom scriptsAccounting is extracted from Grid Engine and

moved to a SQL databaseHas been working well for 2 years...

6

Provisioning Scheduler + ressource manager

Page 7: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Grid Engine Vs Moab7

Why switch ???

Page 8: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Grid Engine Vs Moab8

SGEUsed on only 1 large Compute Canada systemUnknown vendor commitmentFractured communityLimited support available for large HPC deployment

Page 9: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Grid Engine Vs Moab9

MoabAlready well known to our usersSingle vendorCommercial supportStrong communityKnown to scale

Page 10: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Colosse : basic plan10

Scheduler

Resource Manager

Torque

Scheduler + Resource Manager

Page 11: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Transition planImplement the existing scheduling policyTrain users and get them on the new schedulerTrain staff to work with MoabAdapt/port our management scriptsGive control of the cluster to Moab/Torque

11

Page 12: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Scheduling policyPriority based on share treeDedicated nodes onlyMax 200 jobs per project4 queues

test (15m, 16 cores) - 2 nodesshort (24h, 256 cores) - all nodesmed (48h, 128 cores) - all nodeslong (7 days, ? cores) - 120 nodes

12

Page 13: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Scheduling policy - exceptionsAnalysts use overide tickets on user’s jobsUsers can qualify for more coresNo exception on maximum wallclock times

BLCR is used to allow checkpoint/restart of serial jobs

13

Page 14: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Share Tree80% of system reserved for special allocations20% for groups without an allocation

14

Share tree (100%)

Project1 (20%)

Project2 (15%)

Project9 (5%)

...

Project10 (0.1%)

Project11 (0.1%)

Project13 (0.1%)...

...

Page 15: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Share Tree80% of system reserved for special allocations20% for groups without an allocation

15

Share tree (100%)

Special allocations

(80%)

Default allocations

(20%)

Project1 (20%)

Project2 (15%)

Project9 (5%)

...

Project10 (0.1%)

Project11 (0.1%)

Project13 (0.1%)...

...

Page 16: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

User’s transitionMuch easier than expected for users

Submit files are very similar

Job submission is easy (qsub becomes msub) but more commands to learn to monitor jobsA lot of questions about the difference between the Torque

and Moab commands

16

#!/bin/bash#$ ...#$ ...

mpirun ...

SGE#!/bin/bash#PBS ...#PBS ...

mpirun ...

Moab

Page 17: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Staff ’s transitionHarder than expectedThe workflow for working with users’ issues will

need to be reviewedhow to figure out where is the original submit filewhere are each processes and how much memory they use...

More internal documentation will be required

17

Page 18: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Management scriptsOld habits die hard....Accounting/reporting

Accounting data is read from Moab event filesQueue status

Maintenance related scriptsmonitoring,account creation,node maintenanceprolog/epilog

18

Page 19: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Deployment - progress report

Progressive deploymentGrid engine and Moab will live together for a

couple of months

19

Page 20: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Deployment - progress report

We built a different oneSIS image for Torque compute nodesAlso rebuilt the MPI implementation with Torque support

Rebooting a node in the Torque image switches it over to Moab

20

Page 21: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Deployment - progress report

10% of nodes controlled by Moab right nowOpen to all users to test their workflow

21

Page 22: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Deployment - progress report

The Moab partition will grow over the next weeks

22

Page 23: moabcon2012 - Transitioning from Grid Engine

Calcul Québec

Thank you23

[email protected] - April 2012