12
Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago http://www.mcs.anl.gov/~foster Symposium on Knowledge Environments for Science, November 26, 2002

Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

  • View
    217

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

Knowledge Environments for Science:

Representative Projects

Ian Foster

Argonne National Laboratory

University of Chicago

http://www.mcs.anl.gov/~foster

Symposium on Knowledge Environments for Science, November 26, 2002

Page 2: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

2

[email protected] ARGONNE CHICAGO

Comments Informed By Participation in …

E-science/Grid application projects, e.g.– Earth System Grid: environmental science

– GriPhyN, PPDG, EU DataGrid: physics

– NEESgrid: earthquake engineering Grid technology R&D projects

– Globus Project and the Globus Toolkit

– NSF Middleware Initiative Grid infrastructure deployment projects

– Alliance, TeraGrid, DOE Sci. Grid, NASA IPG

– Intl. Virtual Data Grid Laboratory (iVDGL) Global Grid Forum: community & standards

Page 3: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

3

[email protected] ARGONNE CHICAGO

Data Grids for High Energy Physics

Enable community to access & analyze petabytes of data

Coordinated intl projects– GriPhyN, PPDG, iVDGL, EU

DataGrid, DataTAG Challenging computer science

research Real deployments and

applications Defining analysis architecture for

LHC

Page 4: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

4

[email protected] ARGONNE CHICAGO

NEESgrid Earthquake Engineering Collaboratory

2

Network for Earthquake Engineering Simulation

Field Equipment

Laboratory Equipment

Remote Users

Remote Users: (K-12 Faculty and Students)

High-Performance Network(s)

Instrumented Structures and Sites

Leading Edge Computation

Curated Data Repository

Laboratory Equipment (Faculty and Students)

Global Connections(fully developed

FY 2005 –FY 2014)

(Faculty, Students, Practitioners)

U.Nevada Reno

www.neesgrid.org

Page 5: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

5

[email protected] ARGONNE CHICAGO

Size distribution ofgalaxy clusters?

1

10

100

1000

10000

100000

1 10 100

Num

ber

of C

lust

ers

Number of Galaxies

Galaxy clustersize distribution

Chimera Virtual Data System+ GriPhyN Virtual Data Toolkit

+ iVDGL Data Grid (many CPUs)

Communities Need Not be Large:E.g., Astronomical Data Analysis

www.griphyn.org/chimera

Page 6: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

6

[email protected] ARGONNE CHICAGO

A “Knowledge Environment” is a System For …

“Interpersonalcollaboration”

“Integratingdata”

“Accessingspecializeddevices”

“Enablinglarge-scale

computation”

“Sharinginformation”

“Accessingservices”

“Largecommunities”

“Smallteams”

Page 7: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

7

[email protected] ARGONNE CHICAGO

It’s All of the Above: Enabling “Post-Internet Science”

Pre-Internet science– Theorize &/or experiment, in small teams

Post-Internet science– Construct and mine very large databases

– Develop computer simulations & analyses

– Access specialized devices remotely

– Exchange information within distributed multidisciplinary teams

Need to manage dynamic, distributed infrastructures, services, and applications

Page 8: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

8

[email protected] ARGONNE CHICAGO

Enabling Infrastructure for Knowledge Environments for Science

(aka “The Grid”)

“Resource sharing & coordinated problem solving in dynamic, multi-institutional virtual organizations”

Page 9: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

9

[email protected] ARGONNE CHICAGO

Grid Infrastructure What?

– Broadly deployed services in support of fundamental collaborative activities

– Services, software, and policies enabling on-demand access to critical resources

Open standards, software, infrastructure– Open Grid Services Architecture (GGF)

– Globus Toolkit (Globus Project: ANL, USC/ISI)

– NMI, iVDGL, TeraGrid Grid infrastructure R&D&ops is itself a distributed &

international community

Page 10: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

10

[email protected] ARGONNE CHICAGO

Lessons Learned (1)

Importance of standard infrastructure– Software: facilitate construction of systems, and

construction of interoperable systems

– Services: authentication, discovery, …, …

– Needs investment in research, development, deployment, operations, training

Building & operating infrastructure is hard– Challenging technical & policy issues

– Requisite skills not always available

– Can challenge existing organizations

Page 11: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

11

[email protected] ARGONNE CHICAGO

Lessons Learned (2)

Importance of community engagement– “Maine and Texas must have something to

communicate”

– Big science traditions seem to help

– Discipline champions certainly help

– Effective projects often true collaborations between disciplines and computer scientistis

Importance of international cooperation– Science is international, so is expertise

– Challenging, requires incentives & support

Page 12: Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago foster

12

[email protected] ARGONNE CHICAGO

Lessons Learned (3)

Collaborative science/Grids are a wonderful source of computer science problems– E.g., “virtual data grid” (GriPhyN): data,

programs, derivations as community resources

– E.g., security within virtual organizations Work in this space can be of intense interest

to industry– E.g., current rapid uptake of Grid

technologies