17
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University of California, Los Angeles Kejian Jin ([email protected]) Prakashan Korambath([email protected] )

Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Embed Size (px)

Citation preview

Page 1: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Implementation of the UCLA Grid Using the Globus Toolkit

Grid Center’s 2005 Community Workshop

University of California, Los AngelesKejian Jin ([email protected])

Prakashan Korambath([email protected])

Page 2: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Who Are We?• Support Academic Research Computing at UCLA• Host 10+ clusters in our Data Center. • Support Additional Clusters on Campus• Develop software, High Performance Computing Consulting

4,009

10,020

19,408

632

6,575

10,983

0

2000

4000

6000

8000

10000

12000

14000

16000

18000

20000

22000

Oct-03 Current (Dec. 2004) Projected (2005)

Computational Growth at UCLA (GFLOPS)

Campus - 484%

IDRE - 1738%

October 03 December 04 Projected 05

Page 3: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

https

Uniform browser based interface

Credential mapped to local ID

Credential mapped to local ID

GRID Appliance

GRID Appliance

Head node

Cluster III

Head node

Cluster I

Credential mapped to local ID

GRID Portal

& CA

Single Credential

Head node

Cluster II

Web Server

GRID Appliance

UCLA GRID Architecture

Page 4: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

History

• Started as a UCLA Technology Sandbox project in 2002 using GT2.0.

• Developed UCLA Grid Portal (Web-based) using GT3.x in 2003

• Added the first cluster to the UCLA Grid in June 2004

• Grant from Sun Microsystems for 8 Grid Appliance nodes in 2004

• Currently, there are 6 clusters on the UCLA Grid representing Physics, Astronomy, Chemistry, Biology, Social Sciences, Neuro Imaging, Electrical, Chemical and Mechanical Engineering, Material Sciences.

– Compute Power Available in the UCLA Grid Portal: • Number of Clusters: 6 • Number of Nodes: 383 • Aggregate Peak Performance: 5625 GFlops

• Began to port the UCLA Grid Portal to GT4 in April 2005.

Page 5: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Features• Automatic certificate signing process

– SSH web client is used for verifying the user identity.

• Resource Discovery– IndexService is used to retrieve information from participating clusters

• Job Submission– Generic parallel and serial job submission– Customized application submission services for: Gaussian, Qchem, xmd,

Mathematica, Matlab– GRAM service used for job submission– SGE local scheduler is used by all clusters, other supported.

• Data management– Upload file from local to any target cluster– Cluster File Manager– File transfer between different clusters– GridFTP is used extensively in this service

Page 6: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Resource Discovery

Page 7: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Cluster File Manager

Page 8: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Data Visualization

File Formats supported•Gamess•Gaussian•Q-chem•CML•PDB•Ghemical•XYZ•CIF•HIN•Jaguar•MOL•MOPAC•Spartan

Page 9: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Live Demohttp://grid.ucla.edu

Page 10: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Technologies Used

• Java Servlet• Java COG toolkit• SSH web client API• File Upload client API• Java Web Service• XML• Globus Toolkit

Page 11: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Challenges

• Many clusters on campus• Operational issues

– Different departments– Diverse procedures and resources

• schedulers, different OSs, different processors, apps.• Some clusters will be contributing cycles to the campus

– How to share resources• Users can have (different) ID’s among several clusters

– No common user ID (UID) space• Difficult to get job status and resource information

– From a single cluster– Across clusters

Page 12: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Experiences

• Expert users prefer command line interface– Comfortable with UNIX command line– Need to login to head node in order to compile.

• lack of web-based development environment

• New Users prefer the web interface such as File Manager to edit/create/upload files

• Users with multiple cluster accounts prefer UCLA Grid Portal because of the single-login (transparency).

• Cluster managers and PIs like the resource discovery interface to get a visual feedback of cluster status, usage, job information, etc.

Page 13: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Future Directions

• Web-based Development environment– Edit source code with syntax (C, Fortran, C++, Java, etc )– Compile/Debug code from the web– Highly transparent development environment– Technologies that will be used: XMLHttpRequest, GridFTP, Java CoG.

• Immediate Testing and Feedback

• Additional Visualization support such as visualizing Plasma Physics data

• Add dynamic resource discovery and meta-scheduling for Sun Grid Engine (SGE) using Community Scheduler Framework (CSF).

Page 14: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Future Directions - Continued

• Integrate the UCLA Grid with other Grids by writing an InterGrid Broker Service

– Clusters are usually behind firewalls• In our infrastructure, the appliance node (where GT is installed) is only accessible

from UCLA Grid Portal Web server for security reasons. Head node does not have GT installed.

– User has account in another Grid which is trusted by UCLA. – User has account on a cluster in the UCLA Grid– User wants to submit job from other Grid Portal to one of the participating

cluster in the UCLA Grid where he/she has an account. – But other Grid Portal cannot directly submit jobs to the cluster in the UCLA

Grid because of the firewall.– How do we solve this problem?

Page 15: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

InterGrid Broker Service

UCLA Grid PortalOther Grid Portal

InterGridBroker Service

Cluster A

Cluster B

Cluster C

Cluster 1

Cluster 2

Cluster 3

Page 16: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Why not use other Portals?

• No other portals were available when the project started.• Specific User Requirement

– a large percentage of our users want a specific application service– No customized application service was available

• Only interested in Cluster Computing in a parallel environment.

Page 17: Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University

Institute For Digital Research and Education

Questions?