15
INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org Use Case of gLite Services Utilization . Multiple Ligand Trajectory Docking Study Jan Kmuníček CESNET NA4 & JRA1 effort

INFSO-RI-031688 Enabling Grids for E-sciencE Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Embed Size (px)

DESCRIPTION

Enabling Grids for E-sciencE INFSO-RI Joint EGEE and SEE-GRID Summer School on Grid Application Support 3 Scientific context Introduction to the application –interactions between large biomolecules and smaller bio-active ligands –lies on the foundation of many biological properties –interest in the biomolecular/pharmaceutical research –interactions studied through a process called docking –aims to find energetically favorable orientation of the ligand within an active site of a biomolecule

Citation preview

Page 1: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

INFSO-RI-031688

Enabling Grids for E-sciencE

www.eu-egee.org

Use Case of gLite Services Utilization.Multiple Ligand Trajectory Docking Study

Jan Kmuníček

CESNET NA4 & JRA1 effort

Page 2: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 2

Enabling Grids for E-sciencE

INFSO-RI-031688

Contents

• Scientific context

• Grid added value

• Implementation details

• Achievements & plans

Page 3: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 3

Enabling Grids for E-sciencE

INFSO-RI-031688

Scientific context

• Introduction to the application

– interactions between large biomoleculesand smaller bio-active ligands

– lies on the foundation of many biological properties – interest in the biomolecular/pharmaceutical research

– interactions studied through a process called docking

– aims to find energetically favorable orientation of the ligand within an active site of a biomolecule

Page 4: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 4

Enabling Grids for E-sciencE

INFSO-RI-031688

Scientific context

• Introduction to the application

– search is done on snapshots takenfrom molecular dynamic trajectorydescribing dynamic behavior of a biomolecule and a set of candidate ligands

– such computation is infeasible without Grid infrastructure

– Grid environment serves for

performing required computations job submissionmanagement of results provenance service

Page 5: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 5

Enabling Grids for E-sciencE

INFSO-RI-031688

Studied problem

• Biomolecular aspects of chemical weapons

– many organophosphate nerve agents

– interference with the signaling system usedby the nervous system

– these compounds inhibit acetylcholinesterase (AChE)essential chemical that breaks down nerve signals between nerve cells

– looking for universal reactivator able to liberate AChEwhen it is poisoned by the nerve paralytic compounds

Page 6: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 6

Enabling Grids for E-sciencE

INFSO-RI-031688

Grid added value

• Added Value for Application Experts GUI application for solving generic biomolecular parametric jobs

user application metrics evaluation based on targeted parameters

potential extension for biomedical screening

effective collaboration among researchers

computational jobs manipulation (input modification, jobs resubmission)

targeted search & selection of desired jobs (finished / non-finished, aborted)

modular application specific plugins for results presentation (e.g. visualization)

Page 7: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 7

Enabling Grids for E-sciencE

INFSO-RI-031688

Implementation details

The applica tion supports se lection o f s u b s e ts o f b o th t ra je c to ry snapshots and specific ligands, queries JP, and d isp lays a

.

2D array of finished Grid jobs matching the criteria

3 views on the job array are supported: a p p lica t io n -sp e c if ic m etrics of the job re s u l ts , th e u s e r annota tions, and the job sta tus in G rid m iddlew are

Sum m ary of jobs in a se le c ted a rray ce ll

L is t o f d e t a i l e d in fo rm ation abou t s e le c te d jo b a s g a th e re d b y J P service

Page 8: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 8

Enabling Grids for E-sciencE

INFSO-RI-031688

Implementation details

Page 9: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 9

Enabling Grids for E-sciencE

INFSO-RI-031688

Implementation details

• Job Provenance

– Job Provenance Motivation

the information about jobs has longer value • e.g. repeat a submission of a job executed year ago

the information about job control flow and jobexecution environment complements job results

• e.g. to be able to reliably resubmit a job

– Job Provenance Service

preserve information about Grid jobs allow data-mining in this information assist jobs re-submission

Page 10: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 10

Enabling Grids for E-sciencE

INFSO-RI-031688

Implementation details

• Charon Extension Layer

– Charon Extension Layer (CEL) Motivation

uniform and modular approach for (complex) computationaljobs submission and management

generic system for use of application programs in the Gridenvironment (LCG/gLite middleware, …)

– Charon Extension Layer Service

application management• single/parallel execution without job script modification

job management• easy job submission, monitoring, and results retrieval

Page 11: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 11

Enabling Grids for E-sciencE

INFSO-RI-031688

Implementation details

• Virtual Organization for Central Europe (VOCE)

– provides complete grid infrastructure under EGEE wings

VOCE officially registered in the list of EGEE VO the first regional VO in EGEE currently this model is recommended for all regions

– based on regional principle

VOCE spans the whole Central Europe (CE) Federation core services operated by CESNET resources are provided by many institutions across the CE

(these resources are available to all users registered in VOCE)

– documentation http://egee.cesnet.cz/en/voce/index.html

Page 12: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 12

Enabling Grids for E-sciencE

INFSO-RI-031688

Implementation details

• Virtual Organization for Central Europe (VOCE) resources from AustriaCroatiaCzech RepublicHungaryPolandSlovakia

more than 283 registered usersfrom 14 institutes and 6 countries

in total 1248 CPUs (18 CEs)cca 32.6 TB disk space (17 SEs)

Page 13: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 13

Enabling Grids for E-sciencE

INFSO-RI-031688

User Experience

• Advantages

– manual work with results (annotation, comparison, evaluation) easily substituted by semi-automatic toolsupporting in-depth overview of finished jobs

– support typical workflow for day-to-day work

– effective collaboration among researchers highly improved

– simplified integration of research newcomers into studied scientific problem

– leveraging Grid services while hiding Grid complexity

Page 14: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 14

Enabling Grids for E-sciencE

INFSO-RI-031688

User Experience

• Suggested enhancements

– allow not only Grid jobs output annotations but job data item annotations too

– extend the functionality towards generic molecular dynamic trajectory selection (this feature currentlyunder development)

Page 15: INFSO-RI-031688 Enabling Grids for E-sciencE  Use Case of gLite Services Utilization. Multiple Ligand Trajectory Docking Study Jan Kmuníček

Joint EGEE and SEE-GRID Summer School on Grid Application Support 15

Enabling Grids for E-sciencE

INFSO-RI-031688

Information Pointer

• Uncover all details!

– visit available Wiki pages at

http://egee.cesnet.cz/mediawiki/index.php/Job_Provenance_Demo