21
www.eminerals.org Experience of the SRB in support of collaborative grid computing Martin Dove University of Cambridge

A voyage of discovery

  • Upload
    faraji

  • View
    56

  • Download
    0

Embed Size (px)

DESCRIPTION

Experience of the SRB in support of collaborative grid computing Martin Dove University of Cambridge. A voyage of discovery. We aimed to focus on grid computing to support molecular-scale simulations ... ... but discovered the important role of data and information delivery - PowerPoint PPT Presentation

Citation preview

Page 1: A  voyage of discovery

www.eminerals.org

Experience of the SRB in support of

collaborative grid computing

Martin DoveUniversity of Cambridge

Page 2: A  voyage of discovery

www.eminerals.org

A voyage of discovery‣We aimed to focus on grid computing to

support molecular-scale simulations ...‣ ... but discovered the important role of

data and information delivery

‣We thought that the SRB would provide a means to archive data ...

‣ ... but discovered that it could be much more useful than that

The SRB has radically changed our view of how we should carry out the scientific

process

Page 3: A  voyage of discovery

www.eminerals.org

My view of eScience

Computing grids

Data grids Collaborative grids

Page 4: A  voyage of discovery

www.eminerals.org

Science beyond the lab-book‣ Management of too

many tasks

‣ Management of the resultant data deluge

‣ Sharing the information content with collaborators

‣ Maintaining accuracy and verification

Page 5: A  voyage of discovery

www.eminerals.org

Expansion of calciteNeutron diffraction experiments

small decrease in a

5% increase in c

Page 6: A  voyage of discovery

www.eminerals.org

5.0

5.5

6.0

6.5

7.0

7.5

8.0

0 500 1000 1500 2000 2500

Temperature (K)

Un

it c

ell

len

gth

)

abc

R3 c

R3 m

Pm3 m

BaCO3: lattice parameters

Molecular dynamics simulations on the NGS

Molecular dynamics simulations on the NGS

Page 7: A  voyage of discovery

www.eminerals.org

Challenge for the researcher

‣ Short-term collation of the data

‣ Longer-term management of the data

‣ Sharing the data with collaborators

Page 8: A  voyage of discovery

www.eminerals.org

SRB and grid computing

‣ It was important to build the data grid – in our case the SRB – into the heart of the computing grid environment

‣ Then we needed tools to make the integration of the data and compute grids seamless, and which are easy to use – non-intrusive

Page 9: A  voyage of discovery

www.eminerals.org

Profile of our users‣ They want maximum control

over their work processes – they don’t want to access them through portals or GUI’s

‣ They also don’t want their applications pre-wrapped as services: they want to have complete control over their applications, e.g. to add capability

‣ They know what they are doing ...

‣ ... and they don’t want to be told how to do things!

Page 10: A  voyage of discovery

Internet

Researcher

Compute clusters Cluster

JobMgr

Globus

Application server

Access to external facilities and grids

Desktop pools Condor

JobMgr

Globus

Campus grids Condor

JobMgr

Globus

Parallel (HPC) clusters Cluster

JobMgr

Globus

Data vault

Data vault

Data vault

Data vault

Globus is useda) to provide user authentication via digital certificatesb) job submission middleware

Our data grid is based on the San Diego Storage Resource Broker

The application server provides databases and server capabilities for the SRB, metadata tools, and job submission tool

Page 11: A  voyage of discovery

www.eminerals.org

Job submission process‣ We have developed RMCS to run the

job submission process

‣ It integrates with the use of the data grid, specifically with the SRB

‣ RMCS can be run from the user’s desktop via a shell-command client tool

Page 12: A  voyage of discovery

4. Job runs on grid compute resourcesApplication

server

1. Upload data files and application to

data vault

5. Metadata is sent to the application server

6. Output files are transferred

to the data vault

7. Researcher interacts with the metadata

database to extract core

output values 2. Submit job to grid via RMCS

3. Data files and application are transferred to

the grid resource

Researcher

Data vault

Page 13: A  voyage of discovery

www.eminerals.org

Parameter sweeps

‣ implement bulk file upload to the SRB or other data grid

‣ generate set of RMCS input files

‣ submit all the RMCS jobs

We have perl programs that

Bulk job creation and submission is a one-command

procedure

Page 14: A  voyage of discovery

www.eminerals.org

Data and information

?XML data

representation instead

Page 15: A  voyage of discovery

Data vaultUpload XML data files to data

vault for sharing with collaborator

Researcher A

Researcher B

View information content of data files using ccViz

View information content of data files using ccViz

Access Grid with JMAST

Instant messaging

SciSpace.net

Page 16: A  voyage of discovery

www.eminerals.org

SRB: some early positives

‣ When we started, it was the only show in town to facilitate easy data sharing

‣ It was affordable in terms of capital and person ££££

‣ It is easily extended through addition of new vaults

‣ It proved easy to use

Page 17: A  voyage of discovery

www.eminerals.org

Anecdote: Lucy’s project

‣ Lucy learned to use the SRB-based data grid very easily

‣ Using our data tools, she was able to provide me with remote access to the information content of her data very easily

Lucy was a third-year project student, and we let her perform her project using all our grid infrastructure with no compromises

Page 18: A  voyage of discovery

www.eminerals.org

Some caveats‣ We didn’t actually need to federate or

distribute different data sources ...

‣ ... and by distributing our data we discovered that such an approach gives an unnecessary weak link and issues of ownership

‣ We didn’t need the access-control tools, nor the data replication tools, in which case some of the infrastructure was heavier than needed

Page 19: A  voyage of discovery

www.eminerals.org

So what is different now?‣ We now expect to be able to share their

data with collaborators ...

‣ ... and we expect this to be easy (ie not via a multi-stage process)

‣ We now routinely produce complete archives of all files associated with a study easily and automatically, rather than have stuff dumped to our desktops

‣ And we now expect a single place to deposit data, and for this process to be easy and automatic

Page 20: A  voyage of discovery

www.eminerals.org

Summary‣ The SRB was critical to the successes of

the eMinerals project

‣ The SRB was easy to use, and affordable

‣ We have developed some tools on top of the SRB to make access, display of data, and access control easier (eg webdav access, web interface)

‣ The SRB has radically changed the way we think about managing data – but I don’t think that this was an easy change to acquire

Page 21: A  voyage of discovery

www.eminerals.org

Credits

Cambridge: Kat Austen, Richard Bruin, Mark Calleja, Gen-Tao Chiang, Ian Frame, Peter Murray-Rust, Toby White, Andrew Walker

STFC: Kerstin Kleese van Dam, Phil Couch, Tom Mortimer-Jones, Rik Tyer

Funded by NERC