45

Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Embed Size (px)

Citation preview

Page 1: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000
Page 2: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globus

Grid Middleware Toolkit

Otto Sievert

CSE 225

8 June, 2000

Page 3: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe

and other European Grid Activities

Otto Sievert

CSE 225

8 June, 2000

Page 4: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

EGRID

• European Grid Community

• Collaborative community, not a standards group

• Commercial and Academic Interests

• www.egrid.org

Page 5: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

European Tour

• Netherlands• Germany• Poland• Italy• Sweden

Page 6: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000
Page 7: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000
Page 8: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000
Page 9: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000
Page 10: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Grid Theme 1

• Be very (very) careful when choosing a project name.

Page 11: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Amsterdam: Globe

• Vrije Universiteit– Maarten van Steen– Andrew Tanenbaum

• “Middleware to facilitate

large-scale distributed applications”– Web focus– object-based coherency

Page 12: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Uniqueness

• Too much data, too little resources (bandwidth, etc.)

• Caching helps

• Data Coherency integral to Cache Policy

• Release constraint of a single replication/ distribution policy for all objects– example: web pages

Page 13: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

IMAGES

COUNTS

HTML

Globe Object

• Physically Distributed• Replicated

• Distribution Policy

Page 14: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Local Object

• 4 subobjects (minimum)

• Modularity

IMAGES

SEMANTICS

REPLICATION

CONTROL

COMMUNICATION

Page 15: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Binding

1. Name server

2. Object handle

3. Location service

4. Contact points

5. Choose point

6. Repository

7. Protocol

8. Bind!

NAMINGSERVICE

LOCATIONSERVICE

IMPLEMENTATIONREPOSITORY

CLIENTPROCESS

1

2

3

4

8

6

7

DISTRIBUTEDSHAREDOBJECT

Page 16: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Legion Binding

• Two-stage name resolution

• Binding agent• No implementation

repository

BINDINGAGENT

CLIENTOBJECT

1

2

3

4

5

SERVEROBJECT

Page 17: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Autopilot Binding

1. Sensor registers with the sole manager

2. AP client requests sensors from the manager

3. Manager returns available sensors

4. Client and sensor communicate directly

MANAGERCLIENT

1

2

3

4 SENSOR

Page 18: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Claim: Remote Object Model Lacks Replication

• Globe– objects can be

replicated

– still maintain one state

– allows complex coherency policies

• Legion– in theory, supports

replication

– replicated state

– allows some but not all coherency policies

– in practice is not allowed

Page 19: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

NAMINGSERVICE

LOCATIONSERVICE

IMPLEMENTATIONREPOSITORY

CLIENTPROCESS

1

2

3

4

8

6

7

DISTRIBUTEDSHAREDOBJECT

Globe Architecture

• Why all these servers?– separate naming from

locating– allow extensible

binding protocols (?)

Page 20: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Grid Theme 2: Communication

Centralized– NWS

– Globus MDS*

• Simple Management• Single Point of Failure

Distributed– NetSolve

– Fran’s Sensor Net

• Complex Management

• Scalable

Mixed– Legion

– Globe

– Autopilot

*

***

*** * *

*

*

***

*

Page 21: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Implementation Example

• Set of HTML/image/ Java files

• One semantics subobject• browsers aren’t that

extensible, so …use gateway

SEMANTICS

REPLICATION

CONTROL

COMMUNICATION

browser

gateway

http://globe.foo.edu:8989/dir/file.html

Page 22: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Example (cont’d)

• Replication Policies– Object-based

• “Permanent store”

• “Object-initiated store”

– Client-based• “Client-initiated store”

• How is this any better than what we have now?

Page 23: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Live Demo ...

Page 24: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe Location Service

• Scalability Questions

• Tree Heirarchy– again Legion-like in its worst-case behavior

• Typical sol’n assumes mobile client

• Globe sol’n assumes mobile software

Page 25: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Does This Work?

• Single experiment - 5 wk. web server trace

• compare– no caching– various complex replication/coherency policies– automatic adaptive policy

• Results– (essentially) any replication scheme wins big– individual object adaptivity didn’t perform well

Page 26: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Globe: Conclusion

• Explicit coherency is clearly a Good Thing

• Security?

• Representative implementation?

Page 27: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Germany: Cactus

• Albert Einstein Institute, Potsdam– Thomas Radke

– Ed Seidel

• Distributed Astrophysics• Software Engineering• NCSA “hot code”

Page 28: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Cactus

• Separate CS from disciplinary science– Flesh = CS architecture

– Thorns = science modules

• 2-stage compilation– encapsulation

– modularity

– reuse

Page 29: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Cactus Compilation

• Two stage– Permanently bind thorns [Perl]– Compile binary [C++/F77]

• Efficient– Don’t carry unneeded thorn info

Page 30: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Grid Theme 3: Applications

• Numerical, or Non– Computation vs. Specialization

• Performance Measures:– Execution Time– Scale– Efficiency– Distribution

Page 31: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Grid Theme 4: Transparency

• Ease of Use vs. High Performance– As system becomes opaque, EoU increases– As system becomes opaque, Perf decreases– Where is the balance?

Page 32: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Germany: Gridware

• 1999 San Jose-based merger of two companies: Genias GmbH and Chord (U.S.)

• CoDINE– Compute farm load balancing system– Recently adopted by Sun™

• PaTENT– WinNT MPI

Page 33: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Grid Theme 5: Commoditization

• Reuse is strong in the Grid– Resources (Beowulf)– Middleware (Globus, PaTENT)– Applications (Cactus)

• Industry is influential– Largest grid apps in use today are commercial– Grid-ernet is profitable

Page 34: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

To This Point ...

Resource

Middleware X X

Application X X

Commercial X

Page 35: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Germany: Unicore

• UNIform Computer Resources (German SC access)• Goal is to provide uniform access to high performance

computers - painful to learn– OS details

– Data storage conventions

– Administration policies

• 3 phase project:I self-contained jobs

II remote data access

III concurrent remote execution

Page 36: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Unicore (cont’d)

• How is this done?– Web (Java) user interface– X.509 authentication– Network Job Supervisor

• interprets Abstract Job Objects

• manages jobs and data

• interfaces with local batch systems (like LoadLeveler and CoDINE)

• vs. Globus?

Page 37: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Poland: POL-34

• National Grid• Very like the system used

by Unicore, a collection of widely-distributed parallel computers

• Tree-connected ATM network

Page 38: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

POL-34

• Yellow = 2 Mb/s• Red = 34 Mb/s• Cyan = 155 Mb/s• Single administrative

domain via Virtual Users (skirting the grid issue)

• Use Load Sharing Facility (LSF)

Page 39: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Italy: SARA

• University of Lecce, Italy Giovanni Aloisio (with Roy Williams of Caltech)

• Synthetic Aperature Radar Atlas– Distributed data-intensive app

– Alan Su and the UCSD AppLeS group is involved

Page 40: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

SARA Architecture

• The goal: easy, fast, efficient retrieval and processing of SAR image data

• Issues– data is distributed, stored

in tracks

– complex hierarchical system

• Prototypical grid app

Page 41: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Sweden: Computational Steering

• Parallelldatorcentrum Royal Institute of Technology Per Oster

• Using Globus and the Visualization Toolkit (VTK) to steer a single CFD code.

• Little data available• Eclipsed by Autopilot

Page 42: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Grid Theme 6: Heterogeneity

• Some attempt to hide it– Globus, CORBA, Java/Jini

• Some take advantage of it– Netsolve, Ninf

• Some characterize and manage it– AppLeS (SARA), NWS

Page 43: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

To This Point ...

Resource X

Middleware X X X X

Application X X X X

Commercial X

Page 44: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Conclusion

• Explored Globe, Cactus, and other EuroGrid favorites in the context of– Communication architectures– Grid application characteristics– Grid transparency– Commodity computing influence– Grid heterogeneity

Page 45: Globus Grid Middleware Toolkit Otto Sievert CSE 225 8 June, 2000

Network Weather Service (NWS)

• Rich Wolski, U. Tenn.• Monitors and Predicts

Grid Resources– network latency,

bandwidth

– CPU load, avail. Mem.

• Central NWS data server

• nws.npaci.edu/NWS