20
SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking Center, Poland

SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

Embed Size (px)

Citation preview

Page 1: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

SUN HPC Consortium, Heidelberg 2004

Grid(Lab) Resource Management System (GRMS)

and GridLab ServicesKrzysztof Kurowski

Poznan Supercomputing and Networking Center, Poland

Page 2: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

What can you do with the Grid?

Access: secure, transparent, remote, wireless, …

Visualization: access to computers and services, not server…

On demand: get resources you need, when you need, …

Sharing: share data & resources over the net, …

Failover: migrate and restart applications, …

Balance between distributed and central control…

(by Brian Hammond 4.30 p.m., 21 June 2004, Heidelberg)

Page 3: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GridLab Project

Funded by the EU (5+ M€), January 2002 – March 2005SUN is our commercial partnerOpen source license for our softwareMain goal: to develop a Grid Application Toolkit (a set of high level tools and libraries) together with a set of grid middleware services/systems for:

resource management (GRMS), data management,monitoring, adaptive components, mobile user support, security services,portals,mobile access.

... and test all GridLab technologies/applications on real testbeds...

Page 4: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GridLab Project and GRMS (1)

Page 5: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GridLab Project and GRMS (2)

Page 6: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GridLab Project and GRMS (3)

Page 7: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GRMS and Core Services

Page 8: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GRMS and Core Services

GRMS is using C/Java APIs to Globus 2.X and Pre WS, namely GRAM, GridFTP, GRIS/GISSGRMS stores all historic data in database and various logsMercury Monitoring System is the most low-level service in GridLab (generic monitoring framework for the grid)

Provides instant information about the state of hosts, services and jobs

Provides monitoring data represented as metrics via both pull and push model data access semantics and also supports steering by controls.

Based on the Grid Monitoring Architecture (GMA) as proposed by the GGF

Support application steering (SIGNALS)

Page 9: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GRMS and Middleware Services

Page 10: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GRMS and Middleware Services

MORE...

Page 11: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

Various clients to GRMS

Page 12: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

Various clients to GRMS

Page 13: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

Applications and GRMS

Example Pegasus/Chimera work-flow

Page 14: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GRMS Job Description

XML based language

Job executableFile location

Arguments

File argument (files which have to be present in working directory of running executable)

Environment variables

Standard input

Standard output

Standard error

Checkpoint file (user-level checkpoint)

Page 15: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GRMS Job Description

Resource requirementsName of host for job execution (if provided no scheduling algorithms would be used)

Operating system

Required local resource manager

Network parameters

Lots of constraints:Minimum memory required

Minimum CPUs required

Minimum speed of CPUs

...

Page 16: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

What is the functionality of the GRMS?

11th May, 2004, GRMS v 1.9.0 released !!! to act in behalf of users on resources and meet application requirements concerning resources, data, etc... to stage-in and stage-out files required by jobs before and after executions using Core Services (GridFTP/GASS/FTP) or GridLab Middleware Services (Replica Catalog Service and Data Movement Service).to use GAS for more advanced security scenarios,to run and control batch jobs remotely,to run and control MPI batch jobs remotely,to run Java applications remotely,to register GAT applications and receive unique JOB IDs, to checkpoint GAT applications remotely,to migrate GAT applications remotely,to store all historic information about job statuses and resources which have been used during a job submission process,to contact the Information Service to receive static and dynamic information about resources,to contact an Adaptive Components Service to get additional information about distributed resources and networks,

Page 17: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

GRMS statistics

Page 18: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

Who is using our software?

N*Grid, Cactus, UCoMS, GriPhyN, Griphyn The Grid Infrastructure, Geon, EPhysics Portal, CLUSTERIX, CASPer, GridOneD, GEO 600, Einstein@home, VL-E, GriKS, NRL Protean Group, GEMSS, Ibis, and more...

Page 19: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

What can you do with the Grid?

Access: secure (GSI, GAS), transparent (GridLab Middleware), remote, wireless, (portal, mobile phone)

Visualization: access to computers and services, not server (Vis service, mobile client to GRMS)

On demand: get resources you need, when you need (Job description)

Sharing: share data & resources over the net, (GRMS and data management services)

Failover: migrate and restart applications (user-level checkpointing and migration)

Page 20: SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking

HPC SUN Consortium, Heidelberg 2004Krzysztof Kurowski, PSNC

Please visit:

www.gridlab.org www.gridlab.org/WorkPackages/wp-9/

Thank you!

More Information / Summary