Upload
karissa-frye
View
216
Download
1
Tags:
Embed Size (px)
Citation preview
1
Beauty and the BeastRITA meets Beowulf
Richard WallaceMITRE Presentation
6/2/04
6/2/04 RW@MITRE 2
Our Agenda today
• We will cover three big technology areas :– A brief history/overview of Beowulf clusters, the
problems it was designed to solve, the current research and commercial products
– A brief introduction to RITA, what it is, how it works, and how it is applied to Beowulf, especially Grid Computing
• We are time-limited today. We will have ample time at the end of the presentation for questions
3
Beowulf
6/2/04 RW@MITRE 4
Beowulf…
Originally developed by Donald Becker* at NASA. A Beowulf cluster is a grouping of, usually, identical cheap PC computers. They are networked into a small TCP/IP LAN, with libraries and programs installed allowing processing to be duplicated across multiple CPUs.
See Beowulf.org
*Since 1998 at Scyld in Annapolis, MD
6/2/04 RW@MITRE 5
Beowulf name and meaning
Besides being a “cool” name, it does have bearing…An epic poem from circa 10th century Anglo-Saxon Europe. The story traces the life of a heroic king of the Geats called Beowulf, and his great battles with the troll-like monster Grendel, then Grendel's mother, and a fire-breathing dragon, which costs Beowulf his life. It is fundamentally a depiction of a pre-Christian warrior society, in which the relationship between the leader, or king, and his thanes is of paramount importance. This relationship is defined in terms of provision and service. This society also had strongly defined terms of kinship.
6/2/04 RW@MITRE 6
What makes a Beowulf Cluster?[Brown1]
• The nodes are dedicated to the Beowulf and serve no other purpose. • The network, or networks, on which the nodes reside are dedicated to the Beowulf and
serve no other purpose. • The nodes are mass-market COTS computers. An essential part of the Beowulf definition
(that distinguishes it from, for example a vendor-produced massively parallel processor - MPP - system) is that its compute nodes are mass produced commodities, readily available ``off the shelf'', and hence relatively inexpensive.
• The network is also a mass-market COTS entity (if not actually ``mass market'' - some Beowulf networks are sold pretty much only to Beowulf builders), at least to the extent that it must integrate with mass-market COTS computers and hence must interconnect through a standard (e.g. PCI) bus. Again, this is primarily to differentiate it from vendor-produced MPP systems where the network and CPUs are custom-integrated at very high cost.
• The nodes all run open source software. • The resulting cluster is used for High Performance Computing (HPC, also called “parallel
supercomputing” and other names).
6/2/04 RW@MITRE 7
Problem Classes for a Beowulf• Superior use of a Beowulf cluster
– Autonomous Parallel Execution– Particle Physics / Vector processing / Fluid Dynamics, et. al.– Coordinated Service Delivery (computational fan-out, fan-in)– The classic “Bank Teller Problem”
• Inferior use of a Beowulf cluster– Processes sharing common memory (Tight Parallel Execution)– Serial processing and multiple disparate programs running on the nodes in the
system.– Web service, SOA architectures (at least at the front end).– The classic “Dining Philosophers Problem”
6/2/04 RW@MITRE 8
Beowulf Topology[Brown1]
True Beowulf NOWs (network of workstations) or COWs (cluster of workstations) or POPs (Pile of PC's)
6/2/04 RW@MITRE 9
Commercial VendorsAccelerated Servers Penguin Computing Aspen Systems PSSC Labs Custom Fit, Inc. Atipa Turbotek Microway Linux Labs Open Clustering UK Scyld Software Corporation
Plus 115 non-commercial projects!
6/2/04 RW@MITRE 10
Pictures of some real Beowulfs…Physics Department, Drexel University, Philadelphia, PA
6/2/04 RW@MITRE 11
294 Node Los Alamos National Laboratory System
The Space Simulator is a 294-processor Beowulf cluster. It is based on the Shuttle XPC SS51G mini chassis, which uses a heat pipe instead of a CPU fan. The small size of the XPC cases allowed us to fit the cluster in about half the space of the previous 144-processor Avalon cluster. Each node consists of a 2.53 GHz Pentium 4 processor, 1 Gb of 333 MHz DDR SDRAM, an 80 Gbyte Maxtor hard drive, and a 3Com 3C996B-T gigabit ethernet card. The cost of an individual node was less than $1000. The network switch is composed of a Foundry FastIron 1500 switch trunked to another FastIron 800 switch, which provides a total of 304 Gigabit Ethernet ports using the 16-port JetCore modules.
The system was delivered in late September, 2002. It achieved Linpack performance of 665.1 Gflops on 288 processors in October 2002, making it the 85th fastest computer in the world according to the TOP500 list.
12
RITA
6/2/04 RW@MITRE 13
RITA…
Regulated Isomorphic Temporal Architecture – I developed RITA for MCI as a telecommunication resource* grid control. It regulates isomorphic reconfiguration with regard to temporal changes in network topology and changes to client, server, or peer architecture changes. There is a patent pending on the technology.The RITA technology is refined from the original 1984 WPAFB AFWAL/AVSAIL Data-Driven Operating System work by Wallace, McDonald, and Hague.
*Internal and edge router provisioning, computing element message-oriented middleware, 1-N 00 NPA translation, et. al.
6/2/04 RW@MITRE 14
RITA-Beowulf Integration• RITA was developed to solve distributed application issues with dissimilar
computational elements having complicated event interactions based on temporal and conditional values
• Parallel applications for Beowulf systems do not have dissimilar system complexity, but do have temporal and computational complexities
• System testing in any parallel system is difficult. System testing for such systems can lead to exponential time for test case generation. [Butler1, Littlewood1]
• RITA controls canonical event transforms by formal mathematics and a novel condition-event matrix. This produces deterministic execution and few, if any, event interaction errors
• RITA is designed for element level to system-of-systems level interaction
6/2/04 RW@MITRE 15
RITA Basics
Spike Set-At
Transitional
Event Transform
Canonical Events
6/2/04 RW@MITRE 16
RITA Elements
nknn
k
qknqn
k
n R
R
opop
opop
ECEC
ECEC
VG
VG
1
1,1,
1,11,1
,
1111
)()(
)()(
)(
)(
CGG :
))((: 1 kECV
TrueRRRRR nn 211 :
TrueECopECopECopEC kk )()()()( 11,1132,1121,111
Condition-Event Matrix
Guard Elements
Condition Vector
Resultant Conjunction
Realization of Conditions and Operands
Each event matrix M is a matrix of event conditions and operations that are applied to those conditions. Each vector V of conditions is gated by a guard G such that the vector is evaluated if and only if the guard evaluates to True. Each evaluated vector has a resultant R that can be further reduced by conjunction to True. The guard is evaluated first and if True then the vector is evaluated. Optionally the series of resultants are evaluated. If G, V, and optionally R, evaluate to True, the action is initiated. The matrix can be sparse or dense depending on the condition interactions in the event matrix. Conditions can be compounded by applying a Boolean operation matrix comprehended across the event matrix. Evaluation is by vector with the result being true or false.
6/2/04 RW@MITRE 17
Event Operations
Spike Set-At
Transitional
6/2/04 RW@MITRE 18
Event Operations (Cont.)
Spike Set-At
Transitional
6/2/04 RW@MITRE 19
Event Operations (Cont.)
Spike Set-At
Transitional
6/2/04 RW@MITRE 20
RITA Screen shotsEvent Manager
F ile E d it V iew
M atrix
Tes t M a trix
E ven t C ond ition M a trix N am e
Test_S ystem
S ystem A ss ignm ent
S ave C ance l
C ond ition V ecto r G uard
N ew
Another Test M atrix
E d it
Another Test System
Event Manager Main Window
6/2/04 RW@MITRE 21
RITA Screen Shots (Cont.)
Event Manager
F ile E d it V iew
C ond ition V ecto r
S ave C ance l
M atrix G uardM: T est M a trix S: T est_S ystem
System Value1 In itia lC heck ANDF low R ate
C ondition 3C ondition 2C ondition 1
7
65
4
3
21
Vector Variab le
Sys temV alue1
Initia lChec kC ondition
INTEGERType1Vector
ANDO perator
S ize 4 A pp ly
C ance l
switch( SystemValue1 ){
1:return( InitialCheck_TRUE );break;
default:return( InitialCheck_FALSE );
}
Condition Vector Tab with Condition Code
Extremely simplified example!
6/2/04 RW@MITRE 22
RITA Screen Shots (Cont.)
Event Manager
F ile E d it V iew
G uard
S ave C ance l
M a trix C ond ition V ec to rM: T est M a trix S: T est_S ys tem
System Value1 In itia lC heck1GF low R ate
G uard
7
65
4
3
21
Vector Variab le
Sys temV alue1
Init ia lChec k1GG uard
INTEGERType1Vector S ize 4 A pp ly
C ance l
if( SystemValue1 <= 10 && SystemValue1 > 0 ){
return ( InitialCheck1G_TRUE );}else{
return ( InitialCheck1G_FALSE );}
TRANSITIONALEvent Type
Guard Tab
6/2/04 RW@MITRE 23
RITA Usage
System 3/Event 1 T System 1/Event 1
Guard Variableidentified by eventsystem and eventname
Event Type for Guard:"T" is Transitional"A" is Set-At"S" is Spike
Condition Variableidentified by eventsystem and eventname
System 3/Event 1 T
D t A
System 1/Event 1
System 1/Event 2
System 1/Event 3 System 1/Event 3S
System 1/Event 1 T
System 1/Event 2 A
System 2/Event 1
System 1/Event 3 System 1/Event 3S
System 1 System 2
System 2/Event 1 T
System 1/Event 3 S
System 3/Event 1
System 3
An example of the event shorthand being used to layout a non terminating event system. Stimulus starts in system 1. The following slides show the data entry for the generation of the condition event matrix
6/2/04 RW@MITRE 24
RITA Usage (Cont.)
6/2/04 RW@MITRE 25
RITA Usage (Cont.)
Vector / Matrix/ System
Guard
Variable
Event Type
Guard Condition
Condition Variable
Event
Condition
Boolean
1/1/2 S1/E1 T G3( S1/E1 ) S2/E1 C4( S2/E1 ) AND 2/1/2 S1/E2 A G4( S1/E2) --
1/2/2 S S1/E3 C3( S1/E3 ) --
Example Event System 2
System two has two matrices. Matrix one has a conjunction of S1/E1 and S1/E2 producing a new event S2/E1. Guard four is used to verify that S1/E2 is set-at a value and that if S1/E2 is violated, S2/E1 is never generated as it would fail the conjunction. Matrix two is an example of a non-self-stimulating spike event that is in fact a pass-through in event system two. If S2/E1 had been the guard variable for matrix two, vector one, then a cascade of event matrices within an event system would have resulted forcing the generation of the same two events (S2/E1 and S1/E3) but with an explicit order dependency between S2/E1 and S1/E3.
6/2/04 RW@MITRE 26
RITA Usage (Cont.)
6/2/04 RW@MITRE 27
• RITA is not PVM nor MPI. It would use such primitives local to a Beowulf cluster system. RITA leverages well established message-oriented middleware to control communication between systems thus providing an architecture enabling a Beowulf to participate in a Grid
• RITA extends the Beowulf cluster into a Grid system by controlling its temporal and computational behavior allowing heterogeneous system-of-systems to be created
RITA Grid-enables Beowulf
28
Grid Computing
6/2/04 RW@MITRE 29
Grid Computing Time Line
6/2/04 RW@MITRE 30
Where Grids are going...• “Grid is a type of parallel and distributed system that enables the sharing, selection, and
aggregation of geographically distributed "autonomous" resources dynamically at runtime depending on their availability, capability, performance, cost, and users' quality-of-service requirements.” Dr. Rajkumar Buyya, The University of Melbourne, Australia
• “The term 'Grid' is chosen to suggest the idea of a 'power grid': namely that application scientists can plug into the computing infrastructure like plugging into an electrical power grid. It is important to note, however, that the term 'Grid' is sometimes used synonymously with a networked, high performance-computing infrastructure. Obviously this aspect is an important enabling technology for future applications, but in reality it is only part of a much larger scenario that also includes information handling and support for knowledge within the scientific process. It is this broader view of the infrastructure that is now being referred to as the Semantic Grid. The Semantic Grid is characterized by an open system, with a high degree of automation, which supports flexible collaboration and computation on a global scale.” IEEE Distributed Systems On-line
Too many projects to talk about here. See the Backup Slides section
31
Questions?
32
Backup Slides
6/2/04 RW@MITRE 33
American Grid technologyAmerican Projects
Apples. Project Status: Research.Application-Level Scheduling - this is an application-specific approach to scheduling individual parallel applications on production heterogeneous systems.
Bond. Project Status: RES.The Bond system consists of: the Bond shell able to execute a set of Bond commands, a resource database, a knowlege processing system, and a set of utilities.
Bricks. Project Status: RES.Bricks is a performance evaluation system that allows analysis and comparison of various scheduling schemes on a typical high-performance global computing setting.
DOCT. Project Status: GOV.The Distributed Object Computation Testbed (DOCT) is an environment for handling complex documents on geographically distributed data archives and computing platforms.
Entropia.com. Project Status: COM. Desktop software that should provide universal and pervasive source of computing power via the Internet.
Folding@Home. Project Status: RES.The chemistry department of Stanford University is involved in understanding how proteins self-assemble. The group has developed a new method of protein simulation to address
this grand challenge problem. In order to perform protein simulation many processors are required. This project distributes the processing demands across 'volunteer' Internet clients.
GLOBUS. Project Status: RES.This project is developing basic software infrastructure for computations that integrate geographically distributed computational and information resources.
6/2/04 RW@MITRE 34
American Grid technology (Cont.)
GLUE. The Grid Laboratory Uniform Environment (GLUE) effort is sponsored by the High Energy and Nuclear Physics Intergrid Joint Technical and Coordination Boards. It aims to sponsor and enable interoperability between the EU physics grid project efforts (EDG, DataTag, etc.) and the US physics grid project efforts (iVDGL, PPDG, GriPhyN). GLUE addresses each service required by an end-to-end application or experiment system running over the grid, starting from the lowest level functionality for which interoperability between international projects is needed.
Grid Resource Broker (GRB). Project Status: RES. Country: Italy and USA. A grid portal that allows trusted users to create and handle computational grids on the fly exploiting a simple and friendly gui.
HARNESS. Project Status: RES.Harness builds on the concept of the virtual machine and explores dynamic capabilities beyond what PVM can supply. It focused on developing three key capabilities: Parallel plug-ins,
Peer-to-peer distributed control, and multiple virtual machines.
HTC (Condor). Project Status: RES.The Condor project aims is to develop and deploy, and evaluate mechanisms and policies that support high throughput computing (HTC) on large collections of distributed computing
resources.
InfoSpheres. Project Status: RES.The Caltech Infospheres Project researches compositional systems, which are systems built from interacting components
JAM (for Job & Application Manager) is a proof-of-concept, Jini(TM) technology based, graphical interface to Grid Engine. It provides a framework for selecting from a set of registered applications (or specifying one yourself directly), and selecting a Grid Engine queue to which the job is submitted (from a set of available queues originating from one or more clusters or cells), with support for filtering on various queue attributes. Once the job is submitted, it can be monitored and controlled via JAM.
Javelin. Project Status: RES.Javelin: Internet-Based Parallel Computing Using Java
6/2/04 RW@MITRE 35
American Grid technology (Cont.)LEGION. Project Status: RES.Legion is an object-based metasystem. Legion supports transparent scheduling, data management, fault tolerance, site autonomy, and a wide range of security options.
NASA IPG. Project Status: RES.The Information Power Grid is a testbed that provides access to a grid – a widely distributed network of high performance computers, stored data, instruments, and collaboration
environments.
NETSOLVE. Project Status: RES.NetSolve is a project that aims to bring together disparate computational resources connected by computer networks. It is a RPC based client/agent/server system that allows one to
remotely access both hardware and software components
NSF Middleware Initiative. Project Status: RESFeaturing the GRIDS Center Software Suite and NMI-EDIT Components. It integrates key software packages, standards, and best practices for science, engineering, and education.
PARDIS. Project Status: RES.PARDIS is an environment providing support for building PARallel DIStributed applications. It employs the Common Object Request Broker Architecture (CORBA) to implement
application-level interaction of heterogeneous parallel components in a distributed environment.
PUNCH. Project Status: RES.A platform for Internet computing that turns the World Wide Web into a distributed computing portal. Users can access and run programs via standard Web browsers. Applications can
be installed "as is" in as little as thirty minutes. Machines, data, applications, and other computing services can be located at different sites and managed by different entities. Jobs can be automatically routed to Condor, DQS, Globus, Grid Engine, or PBS. PUNCH provides a network operating system, logical user ccounts, a virtual file system service that can access remote data on demand, and an active yellow pages service that can manage resources spread across administrative domains. Together, these capabilities allow PUNCH to manage and broker resources among end users, application service providers, storage warehouses, and CPU farms. PUNCH as been operational for five years, and currently powers three portals and serves 70 engineering applications to about 2,000 users across two dozen countries.
WebFlow. Project Status: RES.WebFlow can be regarded as a high level, visual user interface and job broker for Globus.
WebSubmit. Project Status: RES.A Web-based Interface to High-Performance Computing Resources.
6/2/04 RW@MITRE 36
Asia-Pacific Grid technologyAsia-Pacific Projects
cJVM. Project Status: Research. Country: Israel.cJVM is a Java Virtual Machine (JVM) which provides a single system image of a traditional JVM while executing in a distributed fashion on the nodes of a cluster.
DesignDrug@Home: Molecular Modelling for Drug Design on Peer-to-Peer Grid. Project Status: RES. Country: AU.
DISCWorld. Project Status: RES. Country AU.An infrastructure for service-based metacomputing across LAN and WAN clusters. It allows remote users to login to this environment over the Web and request access to data, and
also to invoke services or operations on the available data.
Gridbus Project. Project Status: RES. Country AU.The key objective of the Gridbus project is to develop fundamental,next-generation cluster and grid technologies that support a true utility-driven service-oriented computing.
GridScape: A tool from the GridBus project for the creation of interactive and dynamic Grid testbed web portals. Features include rapid creation of Grid testbed portals, simple portal management and administration, clear and user-friendly overall view of Grid testbed resources.
HEPGrid. Project Status: RES. Country: AU.This project aims to design and develop a safe, secure, dynamic and adaptive Data Grid Resource Broker for managing and scheduling data intensive science applications on
geographically distributed resources.
Nimrod/G & GRACE. Project Status: RES. Country: AU.A global scheduler (resource broker) for parametric computing over a enterprise wide clusters or computational grids.
NINF. Project Status: RES. Country: JP.Ninf allows users to access computational resources including hardware, software and scientific data distributed across a wide area network with an easy-to-use interface.
6/2/04 RW@MITRE 37
European Grid technologyEuropean Projects
ALiEn@GRID, CERN.
AVO. Project Status: RES. Country: EUThe Astrophysical Virtual Observatory (EVO) will combine astronomical databases and processing capabilities in a virtual observatory.
CERN Data Grid. Project Status: RES. Country: EU.This project aims to develop middleware and tools necessary for the data-intensive applications of high-energy physics.
Covise. Project Status: COM. Country: DE.COVISE - Collaborative, Visualization and Simulation Environment.
CrossGrid. Project Status:RES. Country: EUDeveloping techniques for large-scale grid-enabled real-time simulations and visualisations that require responses in real-time. Addresses issues such as the
distribution of source data, simulation and visualisation, virtual time management, interactive simulation and visualisation rollback and platform-independent virtual reality.
DAMIEN. Project Status: RES. Country: DE.Distributed Application and Middleware for Industrial Use of European Networks, focusing on providing a framework of tools for Grid-Applications.
DAS. Project Status: RES. Country: NL.This is a wide-area distributed cluster, used for research on parallel and distributed computing by five Dutch universities.
6/2/04 RW@MITRE 38
European Grid technology (Cont.)
DataGrid. Projects Status:RES. Country:EUThe objective of DataGrid is to enable next generation scientific exploration which requires intensive computation and the analysis of shared, large-scale databases. DataTAG. Project Status:RES. Country: EUThe objective of DATATAG is to implement a network infrastructure for high-speed interconnection between individual GRID domains in Europe and the US.
EGSO. Project Status:RES. Country: EUThe European Grid of Solar Observations (EGSO) aims to address the problem of combining heterogeneous data from scattered archives of space- and ground-
based observations into a single "virtual" dataset. A new, unified solar feature catalogue will allow the user to search for observations on the basis of events and phenomena, rather than just time and location.
EROPPA. Project Status: COM. Country: EU.Software to design, implement, and experiments with remote/distributed access to 3D graphic applications on high-performance computers for the use of post-
production SMEs. e-Science Environment for the CLRC applies a range of techniques within the UK CLRC to develop an Integrated e-Science Environment which will also be deployed
to support grant-funded collaborations. The project currently has three streams: i. HPC Grid Services Portal; ii. Data Portal; iii. Advanced Visualisation Tools. The UK e-Science Community InfoPortal is designed to increase the effectiveness of users of HPC resources on the Grid.
e-Science OGSA Testbed. Project Status: RES. Country UK.A one-year project funded by EPSRC via the e-Science programme to test and evaluate the first implementation of the OGSA core by deploying the GT3 toolkit in a
service-based Grid testbed spanning organisational boundaries with differing institutional services, security policies and firewalls.
eSecurity Centre. Project Status: RES. Country UK.A joint venture between by the Universities of Manchester and Salford, the eSecuroty Centre provides a point of reference for the pooling their combined Grid
security knowledge.
6/2/04 RW@MITRE 39
European Grid technology (Cont.)
EuroGrid. Project Status:COM. Country EU.Application TEstbed for European Grid Computing. The EUROGRID project will develop core Grid software components and integrate them into an environment
providing fast file transfer, resource brokerage, interfaces for coupled applications and interactive access.
Globe. Project Status: RES. Country: EU.Globe is a research project aiming to study and implement a powerful unifying paradigm for the construction of large-scale wide area distributed systems: distributed
shared objects.
GLUE. The Grid Laboratory Uniform Environment (GLUE) effort is sponsored by the High Energy and Nuclear Physics Intergrid Joint Technical and Coordination Boards. It aims to sponsor and enable interoperability between the EU physics grid project efforts (EDG, DataTag, etc.) and the US physics grid project efforts (iVDGL, PPDG, GriPhyN). GLUE addresses each service required by an end-to-end application or experiment system running over the grid, starting from the lowest level functionality for which interoperability between international projects is needed.
GRIDs for Complex Problem Solving , Information Society Technologies (IST). Objectives: 1. To expand the potential of the Grid and peer-to-peer approaches to solving complex problems which can not be solved with current technologies in application fields such as, but not limited to, industrial design, engineering and manufacturing, health, genomics and drug design, environment, critical infrastructures, energy, business and finance, and new media. 2. To overcome present architectural and design limitations hampering the use and wider deployment of computing and knowledge Grids and to enrich its capabilities by including new functionalities required for complex problem solving. This should help the larger uptake of Grid type architectures and extend the concept from computation Grids to knowledge Grids, eventually leading to a “semantic Grid”.
GRIA. Project Status:RES. Country: EUGrid for Business and Industry will devise business models and processes that make it feasible and cost-effective to offer and use computational services securely in
an open GRID marketplace.
Grid Computing Technology Infoware. Project Status: RES. Country: Intl. GCTI aims to contribute to the development and advancement of technologies that enable us to access computing power and resources with the ease similar to
electrical power.
6/2/04 RW@MITRE 40
European Grid technology (Cont.)
GridLab. Project Status: RES. Country: Poland.Aims to build components for Grid applications (as MatLab does for mathematics), and realistic testbeds for their development.
GRIDSTART. Project Status: RES. Country: EU Information Society Technologies. Ten EU-funded projects have been clustered with the intention to stimulate the wide deployment of appropriate technology and to support the early adoption of best practice. This will be achieved by raising the awareness of potential users of the solutions developed, by connecting technology suppliers with those who will deploy it and by fully identifying and exploiting synergies within the cluster. The projects involved in developing the Grid infrastructure are: AVO, CrossGrid, DAMIEN, DataGrid, DataTAG, EGSO, EuroGrid, GRIA, GridLab, GRIP.
Grid Interoperability Project (GRIP). Project Status: RES. Country: EUR.A 2-year project funded by the European Union to realise the interoperability of Globus and UNICORE and to work towards standards for interoperability in the
Global Grid Forum. The GRIP project will demonstrate interoperability of jobs launched via UNICORE and running on resources controlled purely by Globus. Demonstrations will be given of a portal for biomolecular applications built on UNICORE's intuitive job preparation client and of a relocatable weather forecasting model that can access data from large scale weather prediction models run at a large supercomputing and tailor them to local condition by running on local resources with results displayed to a client's workstation.
Grid Resource Broker (GRB). Project Status: RES. Country: Italy and USA. A grid portal that allows trusted users to create and handle computational grids on the fly exploiting a simple and friendly gui.
JaCo3. Project Status: RES. Country: EU.Java and CORBA based Collaborative Environment for Coupled Simulations. The project aims to assess the effectiveness of a collaborative working environment, in
the scientific simulation design process, by coupling existing codes and visualisation systems.
JaWs. Project Status: RES. Country: GR.JaWS is an economy-based computing model where both resource owners and programs using these resources place bids to a central marketplace that generates
leases of use.
6/2/04 RW@MITRE 41
European Grid technology (Cont.)
LHC Computing Grid Project. Project: RES. Country EU.The goal of the LCG project is to meet the needs of the Large Hadron Collider (LHC), that is being constructed at CERN in Switzerland, by deploying a worldwide
computational grid service, integrating the capacity of scientific computing centres spread across Europe, America and Asia into a virtual computing organisation.
MAMMO Grid, European Federated Mammogram DatabaseImplemented on a GRID Structure. The aim of this project is to, develop a European-wide database of mammograms that will be used to investigate a set of
important healthcare applications as well as the potential of this Grid to support effective co-working between healthcare professionals throughout the EU.
MetaMPI. Project Status: RES. Country: DE.MetaMPI supports the coupling of heterogeneous MPI systems, thus allowing parallel applications developed using MPI to be run on grids without alteration.
METODIS. Project Status: COM. Country: DE.Metacomputing Tools for Distributed Systems - A metacomputing MPI library implemented both on TCP/IP and on ATM will serve as the application programming
model. This project has now finished. The DAMIEN project is METODIS' successor.
MOL. Project Status: RES. Country: DE.Metacomputer OnLine is a toolbox for the coordinated use of WAN/LAN connected systems. MOL aims at utilizing multiple WAN-connected high performance
systems for solving large-scale problems that are intractable on a single supercomputer.
PACX-MPI. Project Status: RES. Country: DE..MPI-implementation to seamlessly run a MPI-application on a Computational Grid.
Poznan Metacomputing. Project Status: RES. Country: PL.Poznan Centre works on development of tools and methods for metacomputing.
6/2/04 RW@MITRE 42
European Grid technology (Cont.)
The Semantic Grid. Project Status: RES. Country: UK.As the Semantic Web is to the Web, so is the Semantic Grid to the Grid.
UK OGSA Evaluation Project. Status: RES. Country UK.The overall aim of the project is to evaluate the strengths and weaknesses of OGSA by developing an experimental OGSA-based grid across organizational
boundaries.
WAMM. Project Status: RES. Country: IT.WAMM (Wide Area Metacomputer Manager) is a graphical tool, built on top of PVM. It provides user with a graphical interface to assist in repetitive and tedious tasks
such as: host add/check/removal, process management, compilation on remote hosts, remote commands execution.
UNICORE. Project Status: RES. Country: DE.The UNiform Interface to Computer Resources aims to deliver software that allows users to submit jobs to remote high performance computing resources.
XtremWeb. Project Status: RES. Country FR.XtremWeb is an academic, non profit, multidisciplinary platform of Global Computing, designed to serve as a substrate for large scale experiments. XtremWeb uses
remote PCs connected to the Internet. Participants cooperate by providing their CPU idle time for outstanding research projects or large contests. The first project using this platform can be found at http://xtremweb.lal.in2p3.fr/. Other projects are under installation at Toronto and Huwan (China).
6/2/04 RW@MITRE 43
References[Brown1] Engineering a Beowulf-style Compute Cluster, 4/3/2003, Robert G. Brown, Duke University Physics Department http://www.phy
.duke.edu/resources/computing/brahma/Resources/beowulf_book/[Miles1] High Throughput Computing in the Beowulf Environment, 8/11/2003, Arnie Miles, Georgetown University Advanced Research Computing
http://www.clusters.arc.georgetown.edu/seminar/miles/[Butler1] Butler, R.W. and Finelli, G.B. The infeasibility of quantifying the reliability of life-critical real-time software. IEEE Transactions on Software
Engineering, (19): 3-12, January, 1993.[Littlewood1] Littlewood B. and Strigini L. Validation of Ultrahigh Dependability for Software-based Systems. Com. ACM, 11(36):69-80,
November 1993.