
Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Page 1: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

P. Kacsuk, G. Sipos, A. Toth, Z. Farkas, G. Kecskemeti and G. Hermann

MTA SZTAKI

www.lpds.sztaki.hu/pgportal

[email protected]

Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Page 2: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 2

Outline of the talk

1. Introduction to P-GRADE portal

2. PS-extension motivations

3. Principles of the PS-portal

4. Usage of the PS-portal

5. Distributing workflows among different Grid resources

6. Conclusions

Page 3: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

P-GRADE portal in a nutshell

– Proxy management
– Definition of Grid resources
– Workflow creation
– Job mapping to Grid resources
– Workflow management and execution visualization

Page 4: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

References

P-GRADE Portal service is available for

– Central European VO of EGEE
– GILDA: Training VO of EGEE
– HUNGRID VO of EGEE
– SEE-GRID infrastructure
– GIN VO
– National Grids (UK National Grid Service, HunGrid, etc.)

Under preparation for

– US Open Science Grid, TeraGrid
– Economy-Grid, Swiss BioGrid, Bio and Biomed EGEE VOs, BioInfoGrid, BalticGrid
– Many national Grids

Page 5: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

User’s feedback

Users want parameter study (PS) support at workflow level

It means:

– If the user has an existing workflow in a repository, he would like to run
  • the same workflow (without any change)
  • with many different parameters

Page 6: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Related research

1. PS at job level – main subject is scheduling

2. PS at workflow level – The workflow description supports the creation of PS runs

3. What our users want: executing workflows stored in workflow repositories with many inputs as PS

Page 7: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Principles of the PS-portal

1. Take the existing workflow as a black box

2. Modify only the external input ports to accept many input files

3. Create the cross-product of the input files and run the workflow with all combinations

4. Because of the large number of files both PS inputs and outputs are stored in the Grid!

[Figure: storage elements hold the PS inputs (4 input files and 3 input files) and the 12 output files]
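The cross-product principle can be sketched in a few lines of Python. This is only an illustration, not the portal's implementation: the port names, file names and the `expand_ps_inputs` helper are hypothetical, and the file lists stand in for the contents of the PS input directories on the storage elements.

```python
from itertools import product


def expand_ps_inputs(ps_ports):
    """Return one input binding per normal workflow execution.

    ps_ports maps a (hypothetical) PS port name to the list of input
    files available for that port. Every combination of one file per
    port becomes one execution of the unchanged workflow.
    """
    names = sorted(ps_ports)
    combos = product(*(ps_ports[name] for name in names))
    return [dict(zip(names, combo)) for combo in combos]


# Hypothetical example matching the figures on the slide: 4 files x 3 files.
ports = {
    "port_a": [f"a{i}.dat" for i in range(4)],
    "port_b": [f"b{i}.dat" for i in range(3)],
}
runs = expand_ps_inputs(ports)
print(len(runs))  # 12 workflow executions, hence 12 output files
```

Each entry of `runs` binds one file to every PS port, so the workflow itself can stay a black box.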

Page 8: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Generating cross products and executable workflows (e-WFs)

PS port: 4 instances of the input file

PS port: 3 instances of the input file

1 PS workflow execution = 4 x 3 normal workflow executions

Page 9: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Steps of creating a PS-workflow

1. Start from a tested Workflow

Page 10: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

2. Define the PS Port(s)

Any external input port can be defined as a PS-port

Page 11: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

3. Define Subdirectory of Inputs

A PS port is represented by a directory and not by a single file

Page 12: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

4. Define target Grid and subdirectory for the results

Page 13: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

5. PS Workflow ready to Submit

Page 14: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Progress of submissions in PS detailed view

Total = Init + Submitted + Rescue + Error + Finished
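The bookkeeping behind this view can be illustrated with a small Python sketch. It assumes each generated workflow is in exactly one of the five states named on the slide; the `summarize` helper and the sample numbers are hypothetical and are not taken from the portal.

```python
from collections import Counter

# The five states shown in the PS detailed view.
STATES = ("init", "submitted", "rescue", "error", "finished")


def summarize(workflow_states):
    """Count workflows per state and add the total.

    workflow_states is an iterable with one state name per generated
    workflow. Unknown state names are rejected so that the total always
    equals the sum of the five known counters.
    """
    counts = Counter(workflow_states)
    unknown = set(counts) - set(STATES)
    if unknown:
        raise ValueError(f"unknown states: {sorted(unknown)}")
    summary = {state: counts.get(state, 0) for state in STATES}
    summary["total"] = sum(summary.values())
    return summary


# Hypothetical snapshot of a PS run with 12 generated workflows.
snapshot = summarize(["init"] * 3 + ["submitted"] * 4 + ["finished"] * 5)
print(snapshot)
```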

Page 15: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

How to distribute workflows among different Grid resources?

PS port: 4 instances of the input file

PS port: 3 instances of the input file

1 PS workflow execution

Goal: Assign the 24 jobs to 24 different Grid resources

Page 16: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Single-Grid, Static Assignment

PS port: 4 instances of the input file

PS port: 3 instances of the input file

1 PS workflow execution

Assigns the 24 jobs to 2 Grid Resources within 1 Grid

Assign resource A in Grid X

Assign resource B in Grid X

A A A A

A A A A

A A A A

B B B B

B B B B

B B B B
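A minimal Python sketch of static assignment follows. It assumes, based on the A/B grid above, that the workflow has two jobs and that the 24 jobs are the 12 generated workflows times those two jobs. The job names and the `static_assignment` helper are hypothetical.

```python
from collections import Counter


def static_assignment(n_executions, node_to_resource):
    """Give every job the resource fixed for its workflow node.

    The mapping is chosen once by the user and reused unchanged for
    every generated workflow, so the number of distinct resources equals
    the number of workflow nodes, however many executions there are.
    """
    return [
        (run, node, resource)
        for run in range(n_executions)
        for node, resource in node_to_resource.items()
    ]


# Hypothetical two-job workflow: one job pinned to A, the other to B.
jobs = static_assignment(12, {"job1": "A", "job2": "B"})
usage = Counter(resource for _, _, resource in jobs)
print(len(jobs), dict(usage))
```

With this scheme 24 jobs share only 2 resources, which is the limitation that broker-based assignment is meant to remove.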

Page 17: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Single-Grid, Dynamic Assignment via Broker

PS port: 4 instances of the input file

PS port: 3 instances of the input file

1 PS workflow execution

Assign to broker of Grid X

Assign to broker of Grid X

A B C D

I J K L

R S T V

E F G H

M N O P

U X Y Z

Assigns the 24 jobs to 24 Grid Resources within 1 Grid

Page 18: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

P-GRADE portal is a multi-Grid portal

[Figure labels: P-GRADE Portal; London, Rome, Athens; TeraGrid (GT2/GT4); SEE-Grid (gLite)]

Page 19: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Static Multi-Grid Assignment

PS port: 4 instances of the input file

PS port: 3 instances of the input file

1 PS workflow execution

Assign to broker of Grid X

Assign to broker of Grid Y

XA XB XC XD

XI XJ XK XL

XR XS XT XV

YE YF YG YH

YM YN YO YP

YU YX YY YZ

Assigns the 24 jobs to 24 Grid Resources within 2 Grids

Page 20: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Dynamic Multi-Grid Assignment

PS port: 4 instances of the input file

PS port: 3 instances of the input file

1 PS workflow execution

Assign to meta-broker

Assign to meta-broker

XA XB YC XD

YI XJ XK XL

XR XS XT YV

YE XF YG XH

YM XN YO YP

YU YX YY XZ

Assigns the 24 jobs to 24 Grid Resources within 2 Grids

Page 21: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Conclusions

Our solution:

1. Keeps both the workflow GUI and the parameter study support concept as simple as possible.

2. Enables running any existing workflow with different parameter sets without modifying the structure of the workflow.

3. Manages the execution of the workflows on as many Grid resources as possible.

4. Enables the collection of Grid resources from several Grids even if they are based on different Grid technologies.

Page 22: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Thanks for your attention

?

Further information: www.lpds.sztaki.hu

Page 23: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Distributing workflows among different Grid resources

1. Single-Grid, Static Assignment: Statically assigning each node of the workflow to different resources in the same Grid (done)

2. Single-Grid, Dynamic Assignment: Dynamically assigning each node of the workflow to different resources in the same Grid using the broker of the Grid (done)

3. Static multi-Grid Assignment: Statically assigning different Grids to different nodes (done)

4. Dynamic multi-Grid Assignment: Dynamically assigning the nodes to different Grids using a meta-broker (under investigation)

Page 24: Defining and Running Parametric Study Workflow Applications by the P-GRADE Portal

Multi-Grid portals with meta-broker

EGEE Grid

UK NGS

Broker

Broker

Meta-Broker

Selects Grid for each node of the PS workflow

Provides the largest achievable parallelism for PS workflows
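To make the meta-broker idea concrete, here is a hedged Python sketch. The slides describe the meta-broker as under investigation and do not specify its selection policy, so the greedy "most free slots" rule, the `meta_broker_assign` helper, the grid names and the slot counts are all assumptions made for illustration.

```python
from collections import Counter


def meta_broker_assign(jobs, free_slots):
    """Pick a Grid for each job with a greedy policy (an assumption).

    Each job goes to the Grid that currently has the most free slots.
    Ties are broken by Grid name so the result is deterministic. The
    chosen Grid's own broker would then select the concrete resource.
    """
    remaining = dict(free_slots)
    assignment = {}
    for job in jobs:
        grid = max(sorted(remaining), key=lambda name: remaining[name])
        if remaining[grid] <= 0:
            raise RuntimeError("no free slots left in any Grid")
        assignment[job] = grid
        remaining[grid] -= 1
    return assignment


# Hypothetical run: 24 jobs spread over two Grids with 12 free slots each.
job_names = [f"job{i}" for i in range(24)]
placement = meta_broker_assign(job_names, {"X": 12, "Y": 12})
print(Counter(placement.values()))
```

A real meta-broker would presumably use live information from the underlying brokers rather than fixed slot counts.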