53
Ian Foster Computation Institute Argonne National Lab & University of Chicago A Sociology of the Grid? Carl Kesselman Information Sciences Institute, University of Southern California

Sociology Of The Grid May 2009

Embed Size (px)

DESCRIPTION

I gave this keynote talk at the 2009 Grid and Parallel Computing conference in Geneva. The goal of the talk was to lay out what might be involved in formalizing the steps involved in creating and operating a virtual organization. I'm not sure the slides make too much sense by themselves. But maybe there are useful pictures.

Citation preview

Page 2: Sociology Of The Grid May 2009

3

“When the network is as fast as the

computer’s internal links,

the machine disintegrates across the net into a set of

special purpose appliances”

(George Gilder, 2001)

Page 3: Sociology Of The Grid May 2009

4

“The Anatomy of the Grid,” 2001 The … problem that underlies the Grid concept is

coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations. The sharing that we are concerned with is not primarily file exchange but rather direct access to computers, software, data, and other resources, as is required by a range of collaborative problem-solving and resource -brokering strategies emerging in industry, science, and engineering. This sharing is, necessarily, highly controlled, with resource providers and consumers defining clearly and carefully just what is shared, who is allowed to share, and the conditions under which sharing occurs. A set of individuals and/or institutions defined by such sharing rules form what we call a virtual organization (VO).

Page 4: Sociology Of The Grid May 2009

5

Examples (from AotG, 2001)

“The application service providers, storage service providers, cycle providers, and consultants engaged by a car manufacturer to perform scenario evaluation during planning for a new factory”

“Members of an industrial consortium bidding on a new aircraft”

“A crisis management team and the databases and simulation systems that they use to plan a response to an emergency situation”

“Members of a large, international, multiyear high-energy physics collaboration”

Page 5: Sociology Of The Grid May 2009

6

Quantitative medicine is the key to reducing healthcare costs and

improving healthcare outcomes

Patients with same diagnosis

Page 6: Sociology Of The Grid May 2009

7

Quantitative medicine is the key to reducing healthcare costs and

improving healthcare outcomes

Patients with same diagnosis

Misdiagnosed

Non-responders,toxic responders

Non-toxic responders

Page 7: Sociology Of The Grid May 2009

8

Virtual organizations in medicine span multiple dimensions

Basic Research

Clinical Practice

Clinical Trials

trial subjects, outcomes

library

Outco

mes

,

tissu

e ba

nksc

reen

ing

test

s

ongoing

investigative

studies

pathways

Page 8: Sociology Of The Grid May 2009

9

From the organizational behavior and management community

“[A] group of people who interact through interdependent tasks guided by common purpose [that] works across space, time, and organizational boundaries with links strengthened by webs of communication technologies”

— Lipnack & Stamps, 1997

Yes—but adding cyber-infrastructure: People computational agents & services Communication technologies IT infrastructure

Collaboration based on rich data & computing capabilities

Page 9: Sociology Of The Grid May 2009

10

NSF Workshops on

Building Effective Virtual

Organizations

[Search “BEVO 2008”]

Page 10: Sociology Of The Grid May 2009

11

Two perspectives Technology used to enhance collaboration

(Computer Supported Collaborative Work)

Collaboration used to enhance technology

Page 11: Sociology Of The Grid May 2009

12

What is an organization?

A organization has an identity and a purpose, which it seeks to fulfill within its environment

The organization’s purpose influences its participants, structure, activities, and deliverables, whether products or services

The organization’s performance can be evaluated with respect to various metrics

Is a virtual organization any different?

Page 12: Sociology Of The Grid May 2009

13

Identity Legal aspects. Credentials.

Purpose [Let’s assume] anything legal

EnvironmentAvailable service & resource providers. Legal & organizational constraints

ParticipantsIdentity-based or attribute-based.People, services, resources, sensors.

Structure Centralized, decentralized, …

Activities Business processes. Workflows.

DeliverablesData products. Services. Instrument operations. …

PerformanceThroughput, responsiveness, growth, happiness, security, …

Page 13: Sociology Of The Grid May 2009

14

“I can’t solve this problem alone—I

need to involve my buddies Sebastien

and Krishna”

It looks like you’re creating a VO

Get help withcreating the VO

Just create the

VO without help

“Where am I going to eat

tonight?”

From: IanTo: Krishna, SebastienSubject: Help me find a restaurant

Page 14: Sociology Of The Grid May 2009

15

Identity Geneva-Dinner-VO. Informal, so no legal status.

Purpose Find a good restaurant in Geneva. Avoid one that speaks to you in English.

Environment News feeds. Satellite data feeds. Analytic services. EGEE, OSG, TeraGrid.

Participants Ian, Krishna, Sebastien. Anyone else we trust to make good recommendations.

Structure Central database and associated services (membership, …)

Activities Restaurant identification workflow. English-speaking staff tracking workflow.

Deliverables Restaurant recommendation for today. Recommendation service. Business plan?

Performance Food quality. Cost. Service response time. VO setup time.

Page 15: Sociology Of The Grid May 2009

16

Building a Virtual Organization Define &implement policy

Negotiation, trust management, credentials Determine and implement membership & roles

Terms of engagement Virtualization & integration of providers

Create VO-wide services Global behaviors

Manage work Collaborative problem solving, workflow

management Manage the VO

Monitor performance, report metrics

Page 16: Sociology Of The Grid May 2009

17

Defining community: Membership & laws Identify VO participants and roles

And map participants to attributes and roles Specify and control actions of members

Empower members delegation Enforce restrictions federate policy

A

1 2

B

1 2

A B

1

10

1

10

1

16

Page 17: Sociology Of The Grid May 2009

20

A set of core security mechanisms

Attribute Assertions C asserts that S has attribute A with value V

Authentication and digital signature Allows signer to assert attributes

Delegation C asserts that S can perform O on behalf of C.

Attribute mapping {A1, A2… An}VO1 {A’1, A’2… A’m}VO2

Policy Entity with attributes A asserted by C may

perform operation O on resource R

Page 18: Sociology Of The Grid May 2009

21

Trust in VOs

Do I “believe” an attribute assertion Used to evaluate cost vs. benefit of performing

an operation E.g., perform untrusted operation with extra

auditing Look at attributes of assertion signer Rooting trust

Externally recognized source, e.g., CA Dynamically via VO structure delegation Dynamically via alternative sources, e.g.,

reputation

Page 19: Sociology Of The Grid May 2009

22

Building blocks

Attribute Authority (ATA): Issue signed attribute assertions

(including identity, delegation, mapping)

Authorization Authority (AZA) Makes decisions based on assertions & policy

ATA

User A is an admin

User B is a member

User B can use service

X

Page 20: Sociology Of The Grid May 2009

23

VO policy at a service

GT4 authorization and delegation services provide implementations

ResourceATA

WSResource

ResourceAZA

VO ATA

WS-Subject

ATA: Attribute AuthorityAZA: Authorization Authority

Page 21: Sociology Of The Grid May 2009

24

Establishing VO-Wide policy

ResourceATA

WSResource

ResourceAZA

VO ATAVO AZA

SubjectATA

SubjectAZA

WS-Subject

ATA: Attribute AuthorityAZA: Authorization Authority

GT4 authorization and delegation services provide implementations

Page 22: Sociology Of The Grid May 2009

26

Protected health information problem

What do we want? Use clinical data for research Share clinical data, make research data available Reuse same infrastructure Image exchange between health providers

Patient authorizes use of data – consent process Intact unmodified DICOM workflow for diagnostics De-identified DICOM workflow for research

(Modality profiles) Group authorization problem: Patient data–to-user

(Physician/Researcher) relationship not manageable!

Page 23: Sociology Of The Grid May 2009

27

Patient

Primary Care Physician

HealthGrid

Pathologist

Radiologist HospitalPhysician

Visit

Medical Images Personal Health Record

Neurosurgeon

Treatment Planning

Pathology Report

Medical Record

PHR Vendor

Page 24: Sociology Of The Grid May 2009

28

Patient-authorized grid image workflow

Patient

Healthcare Provider

Internet2 IdP

Globus OGSA-DAIMeta Catalog Service

PHI safe entries

2.1

2.2

Hippocratic Verification ServicePolicy Decision Point (PDP)

2.3

Globus GridFTPStorage Service Provider

Compressed DICOM Series Records

4.1

4.3

4.4

Physician

Globus RLSReplica Location Service

4.2

GridShib

5.1

1.1

3.1

MEDICUS(Erberich et al.)

Page 25: Sociology Of The Grid May 2009

29

HIPAA-compliant research access

Research Center

Globus OGSA-DAIMeta Catalog Service

PHI safe entries

2.1

Hippocratic Verification ServicePolicy Decision Point (PDP)

2.2

Globus GridFTPStorage Service Provider

Compressed DICOM Series Records

3.2

4.2

Researcher

Globus RLSReplica Location Service

3.1

4.1

DICOM De-Identification ServiceModality Profiles

1.1

MEDICUS(Erberich et al.)

Page 26: Sociology Of The Grid May 2009

30

PHI-safe workflow with patient-centric authorization

Healthcare Provider Healthcare Provider

MCS

HVS RLS

SSP

HVS

Los Angeles CountyCalifornia

Columbia CountyNew York

3.2

4.1

4.2 2.1

IdP

2.2

2.3

3.1

1.1

Page 27: Sociology Of The Grid May 2009

31

Online-CAAuthN Svc

Application Client + PKIClient

App Svc 1.LoginUsername/Password

0. Trusted CA/CRLs

AuthenticationDB

ProvisioningDatabase

Attribute Service

MyProxyLogin with

Provisioning

Page 28: Sociology Of The Grid May 2009

32

Online-CAAuthN Svc

Application Client + PKIClient

App Svc

2. Authentication and Attributes retrieval

0. Trusted CA/CRLs

AuthenticationDB

ProvisioningDatabase

Attribute Service

MyProxyLogin with

Provisioning

Page 29: Sociology Of The Grid May 2009

33

Online-CAAuthN Svc

Application Client + PKIClient

App Svc

3. Short term X509 credentials with

attributes, CAs, CRLs

0. Trusted CA/CRLs

AuthenticationDB

ProvisioningDatabase

Attribute Service

MyProxyLogin with

Provisioning

Page 30: Sociology Of The Grid May 2009

34

Online-CAAuthN Svc

Application Client + PKIClient

App Svc

0. Trusted CA/CRLs

Authentication DB

ProvisioningDatabase

Attribute Service

4. Access using X509 Credentials

MyProxyLogin with

Provisioning

Page 31: Sociology Of The Grid May 2009

35

Online-CAAuthN Svc

Application Client + PKIClient

App Svc

0. Trusted CA/CRLs

Authentication DB

ProvisioningDatabase

Attribute Service

5. Update trust roots

MyProxyLogin with

Provisioning

Page 32: Sociology Of The Grid May 2009

36

Browser

Web SSO using OpenID

Application

ServerService

Provider (SP/RP)

IdentityProvider

(IdP)Authentication

DB

Site Attribute Service

Page 33: Sociology Of The Grid May 2009

37

Browser

Web SSO using OpenID

Application

ServerService

Provider (SP/RP)

IdentityProvider

(IdP)Authentication

DB

Site Attribute Service

1. Client access application

server

Page 34: Sociology Of The Grid May 2009

38

Browser

Web SSO using OpenID

Application

ServerService

Provider (SP/RP)

IdentityProvider

(IdP)Authentication

DB

Site Attribute Service

2. Redirected to Identity Provider

Page 35: Sociology Of The Grid May 2009

39

Browser

Web SSO using OpenID

Application

ServerService

Provider (SP/RP)

IdentityProvider

(IdP)Authentication

DB

Site Attribute Service

3. User authenticates

with IdP

Page 36: Sociology Of The Grid May 2009

40

Browser

Web SSO using OpenID

Application

ServerService

Provider (SP/RP)

IdentityProvider

(IdP)Authentication

DB

Site Attribute Service

4. AuthN completed,

user identity.

Page 37: Sociology Of The Grid May 2009

41

Browser

Web SSO using OpenID

Application

ServerService

Provider (SP/RP)

IdentityProvider

(IdP)Authentication

DB

Site Attribute Service

5. Authenticated Call.

Page 38: Sociology Of The Grid May 2009

42

Making it easy: Social VPNs

Alice

CarolBobSocialNetwork

Web interface

Social network(e.g. Facebook)

Overlay network(IPOP)

carol.facebook.ipop10.10.0.2 node0.alice.facebook.ipop

10.10.0.3

SocialNetwork API

Social network Information system

Alice’s public key certificateBob’s public key certificate

Carol’s public key certificate

Identities are managed with web-based interface profiles;Public key certificates retrieved through API

Symmetric keys exchanged and point-to-point private tunnels created on demand;

Multicast-based resource discovery

Bob: browses Alice’s SMB share

RenatoFigueiredo

Page 39: Sociology Of The Grid May 2009

43

Globus

Page 40: Sociology Of The Grid May 2009

44

As of Oct19, 2008:

122 participants105 services

70 data35

analytical

Page 41: Sociology Of The Grid May 2009

45

Registries(E.g., caBIG)

Core Services

Grid Service

Uses TerminologyDescribed In

Cancer Data Standards Repository

Enterprise Vocabulary

Services

References ObjectsDefined in

Index Service

Service Metadata

Publishes

Subscribes Toand Aggregates

Queries ServiceMetadata Aggregated In

Registers To

Discovery Client API

Page 42: Sociology Of The Grid May 2009

46

ApplnService

Create

Index service

StoreRepository

ServiceAdvertize

Discover

Invoke;get results

Introduce

Container

Transfer GAR

Deploy

Ohio State University and Argonne/U.Chicago

Service oriented medicine:caGrid, Introduce, and gRAVI

Introduce Define service Create skeleton Discover types Add operations Configure security

Grid Remote Application Virtualization Infrastructure Wrap executables

Globus

Page 43: Sociology Of The Grid May 2009

47

Microarray clustering using Taverna

1. Query and retrieve microarray data from a caArray data service:cagridnode.c2b2.columbia.edu:8080/wsrf/services/cagrid/CaArrayScrub

2. Normalize microarray data using GenePattern analytical service node255.broad.mit.edu:6060/wsrf/services/cagrid/PreprocessDatasetMAGEService

3. Hierarchical clustering using geWorkbench analytical service: cagridnode.c2b2.columbia.edu:8080/wsrf/services/cagrid/HierarchicalClusteringMage

Workflow in/output

caGrid services

“Shim” servicesothers

Wei Tan

Page 44: Sociology Of The Grid May 2009

48

VO as a Service (VOaaS)

Virtual organizations integrate participants and resource providers Participants are selected or self assemble Select “best of breed” providers for VO services

Much of this process can be automated Provisioning of enabling services, at least

Function

Resource

Page 45: Sociology Of The Grid May 2009

49

Community

Services Provider

Content

Services

Capacity

1

3

4

5

VOs assemble services

Integrate services from various sources Virtualize external services as VO services

Deploy new services for the VO

Capacity Provider

2

Page 46: Sociology Of The Grid May 2009

51

Providing VO services

Integrate existing services

Delegate and deploy capabilities/services Provision service to deliver defined capability Configure execution environment Host higher-level functions GRAM, Nimbus, EC2, …

Coordinate and compose Build new functions from individual services

Page 47: Sociology Of The Grid May 2009

52

ApplnService

Create

Index service

Store

Repository ServiceAdvertize

Discover

Invoke;get results

Introduce

Container

Transfer GAR

Deploy

gRAVI: Ravi Madduri et al., Argonne/U.Chicago & OSU

Service authoring and deployment grid Remote

Application Virtualization Infrastructure

Builds on Introduce Define service Create skeleton Discover types Add operations Configure security

Wrap arbitrary executables

RaviMadduri

Page 48: Sociology Of The Grid May 2009

53

Pull “missing” files to a storage system

List of required Files

GridFTPLocalReplicaCatalog

ReplicaLocationIndex

Data Replicati

on Service

Reliable File

Transfer Service Local

ReplicaCatalog

GridFTP

Service Composition:Data Replication Service

“Design and Implementation of a Data Replication Service Based on the Lightweight Data Replicator System,” Chervenak et al., 2005

ReplicaLocationIndex

Data MovementData Location

Data Replication

AnnChervenak

Page 49: Sociology Of The Grid May 2009

54

Decomposition EnablesSeparation of Concerns & Roles

User

ServiceProvider

“Provide access to data D at S1, S2,

S3 with performance P”

ResourceProvider

“Provide storage with performance P1, network with

P2, …”

D

S1

S2

S3

D

S1

S2

S3Replica catalog,User-level multicast, …

D

S1

S2

S3

Page 50: Sociology Of The Grid May 2009

55

Policy, revisited

Traditionally policy is enforced at end points, integrated with application E.g., PDP call-out in Globus container

We can also apply policy at the VO level Define interactions between services at the

organizational level Factor policy out of service implementations

Page 51: Sociology Of The Grid May 2009

56

Policy-driven service oriented architecture

Need stand-alone policy engine to coordinate at VO level

Connection between application policy and infrastructure policy (dynamic provisioning)

Policy extension points designed into services allow Coordination at VO level Dynamic policy enforcement

across services and service oriented infrastructure

Web Services 2.0: Policy-driven Service Oriented Architectures Thomas B Winans and John Seely Brown

Page 52: Sociology Of The Grid May 2009

57

2001 view of the “grid problem”

Resource sharing & coordinated problem solving in dynamic, multi-institutional virtual organizations

Too limited a view

Page 53: Sociology Of The Grid May 2009

58

We need an end-to-end perspective

A organization has an identity and a purpose, which it seeks to fulfill within its environment

The organization’s purpose influences its participants, structure, activities, and deliverables, whether products or services

The organization’s performance can be evaluated with respect to various metrics

Then focus on clear identification of roles, separation of concerns, isolation of policy