22
A PROPOSED EARTH SCIENCE COLLABORATORY K-S Kuo 1,2 , Chris Lynnes 1 , Rahul Ramachandran 3 1 NASA Goddard Space Flight Center, USA 2 Caelum Research Corporation, USA 3 University of Alabama-Huntsville, USA 7/27/11 1 IGARSS 2011, Vancouver, Canada

A PROPOSED EARTH SCIENCE COLLABORATORY

  • Upload
    lyle

  • View
    32

  • Download
    0

Embed Size (px)

DESCRIPTION

A PROPOSED EARTH SCIENCE COLLABORATORY . K-S Kuo 1,2 , Chris Lynnes 1 , Rahul Ramachandran 3 1 NASA Goddard Space Flight Center, USA 2 Caelum Research Corporation, USA 3 University of Alabama-Huntsville, USA. Why ESC?. Data Intensive Science Many forms and sources of data - PowerPoint PPT Presentation

Citation preview

Page 1: A PROPOSED  EARTH SCIENCE COLLABORATORY

IGARSS 2011, Vancouver, Canada 1

A PROPOSED EARTH SCIENCE

COLLABORATORY K-S Kuo1,2, Chris Lynnes1, Rahul Ramachandran3

1NASA Goddard Space Flight Center, USA2Caelum Research Corporation, USA

3University of Alabama-Huntsville, USA

7/27/11

Page 2: A PROPOSED  EARTH SCIENCE COLLABORATORY

Why ESC?

7/27/112IGARSS 2011, Vancouver, Canada

Data Intensive ScienceMany forms and sources of data

In situ measurementsRemote sensing observationsModel simulations

Large volumes of dataEffectiveness as a scientist

Increasing proportion of effort in data managementThreatening:

ReproducibilityCorrectnessProductivity

Page 3: A PROPOSED  EARTH SCIENCE COLLABORATORY

IGARSS 2011, Vancouver, Canada 3

What is an ESC?Vision of a rich model development/simulation and data analysis environment that:

Provides access to various Earth Science modelsFacilitates model and analysis software developmentProvides access across a wide spectrum of Earth Science dataProvides a diverse set of science analysis services and toolsSupports the application of services and tools to dataSupports collaboration, i.e. sharing of data, tools and resultsSupports discovery and publication of all science artifacts

7/27/11

Basically, a new and natural place for Earth scientists to conduct their work and

collaborate with others!

Page 4: A PROPOSED  EARTH SCIENCE COLLABORATORY

7/27/114IGARSS 2011, Vancouver, Canada

The Situation TodayIslands of data and services with selective

connectivity

Data Center A

Data Center C

Data Center B

Page 5: A PROPOSED  EARTH SCIENCE COLLABORATORY

IGARSS 2011, Vancouver, Canada 5

High-Level View

7/27/11

Cyberinfrastructure

Tool Library

Data Library

Laboratory Notebook

Workflow

Mediator

Data Centers

Page 6: A PROPOSED  EARTH SCIENCE COLLABORATORY

7/27/116IGARSS 2011, Vancouver, Canada

Tool Library• Discovery• Social

oSharingoTaggingoDiscussion

• Configuration ManagementoTestingoVersioning

Packager• autoconf• RPM• Web

wrapper

PROVISIONED

• GrADS• IDL• MatLab• ncl• nco• cdat

COMMUNITY• Quality filter• Coincidence• Feature

detection• Event service• Visualization

CONTRIBUTED

• [Tool 1]• [Tool 2]• [Tool 3]• [Tool 4]• [Tool 5]• …

PERSONAL• [Tool 1]• [Tool 2]• [Tool 3]• [Tool 4]• [Tool 5]• …

Page 7: A PROPOSED  EARTH SCIENCE COLLABORATORY

7/27/117IGARSS 2011, Vancouver, Canada

Data Library• Cache• Discovery• Social

oSharingoTaggingoDiscussion

• Configuration ManagementoTestingoVersioning

Packager• data probe• format

check• metadata

wizard

PROVISIONED

• EOSDIS

COMMUNITY• Field

campaigns• MEaSUREs• ACCESS• Validation

CONTRIBUTED

• [Dataset 1]• [Dataset 2]• [Dataset 3]• [Dataset 4]• [Dataset 5]• …

PERSONAL• [Dataset 1]• [Dataset 2]• [Dataset 3]• …

Page 8: A PROPOSED  EARTH SCIENCE COLLABORATORY

7/27/118IGARSS 2011, Vancouver, Canada

Workflow Library• Discovery• Social

oSharingoTaggingoDiscussion

• Configuration ManagementoTestingoVersioning

Packager• Workflow editor

PROVISIONED

• Processing Algorithms

COMMUNITY• GeoBrain• SciFlo• Data Mining• Giovanni

CONTRIBUTED

• [Workflow 1]• [Workflow 2]• [Workflow 3]• [Workflow 4]• [Workflow 5]• …

PERSONAL• [Workflow 1]• [Workflow 2]• [Workflow 3]• …

Page 9: A PROPOSED  EARTH SCIENCE COLLABORATORY

7/27/119IGARSS 2011, Vancouver, Canada

Laboratory Notebook• Discovery• Social

oSharingoTaggingoDiscussion

• Configuration ManagementoVersioning

Packager• Project Manager

• Experiment manager

• Notebook editor

PROVISIONED

• Tutorials• User guides• Example

uses• Educational

packages

COMMUNITY• Project results• Publications• Example

cases• Educational

packages

PROJECT• [Project 1]• [Project 2]• [Project 3]• [Project 4]• [Project 5]• …

PERSONAL• Notes• Journals• …

Page 10: A PROPOSED  EARTH SCIENCE COLLABORATORY

7/27/1110IGARSS 2011, Vancouver, Canada

Mediator• Mediates tool interaction with data• OPeNDAP – a common data model

(accessible by most tools)• Custom modules reformat data for

the rest of the tools• Ontology matches tools with data,

and vice versa.

Page 11: A PROPOSED  EARTH SCIENCE COLLABORATORY

IGARSS 2011, Vancouver, Canada 11

CyberinfrastructureServices used by all other

componentsSecurity

authenticationauthorizationcode audit/padded cell integrity checking

Socialtaggingsharingdiscussionsgroups

Cloudelastic provisioned storage and computing

Discoverydata, tools, workflows, experimentssearch by keyword, variable, time, author

Information Management

provenanceidentifiersarchive

Semantic Webdata ontologytools ontology 7/27/11

Page 12: A PROPOSED  EARTH SCIENCE COLLABORATORY

IGARSS 2011, Vancouver, Canada 12

Key Advantages of ESC

Tool availability will be a force multiplierMore tools will be usable with more datasetsMore tools will be more available to more users

Knowledge sharing evolves from text on paper to a rich mixture of data, tools, workflows and articlesA “wikihow” for Earth Science data analysis

Incorporating live data, services and workflowsESC maintains a record of the analysis process

Share, repeat, build upon analysis techniquesTransparency of the process is built in

7/27/11

Page 13: A PROPOSED  EARTH SCIENCE COLLABORATORY

IGARSS 2011, Vancouver, Canada 13

Prior ArtTalkoot, myExperiment.org – workflow sharing, virtual notebooksEarth System Grid – provisioned tools, format standards/checkersNASA Earth Exchange (NEX)Land Information System – OPeNDAP as access infrastructureEarth Science Modeling Framework – programmatic approach to integrationGiovanni, LAS – community services/toolsCanadian Space Science Data Portal (EOS, Feb. 22, 2011)Nebula – cloud provisioning

7/27/11

Page 14: A PROPOSED  EARTH SCIENCE COLLABORATORY

A Use CaseGPM Precipitation Retrieval Algorithm

Development

7/27/1114IGARSS 2011, Vancouver, Canada

GPM Core Satellite: Dual-Frequency Precipitation Radar (JAXA) and GPM Microwave Imager (NASA)GPM Constellation: International partner satellites with mostly microwave radiometersRetrieval algorithms – 3 types

Radar-onlyRadiometer-onlyRadar-radiometer-combined

Participants in algorithm development are distributed in Japan, NASA centers (GSFC, MSFC, JPL), NCAR, and universities (FSU, Uwisc, etc.)

Page 15: A PROPOSED  EARTH SCIENCE COLLABORATORY

A Use CaseGPM Algorithm Development – Current

Situation

7/27/1115IGARSS 2011, Vancouver, Canada

Interdependence among 3 types of algorithmsCommunication/Coordination – Narrow bandwidth

Periodic workshop meetings and teleconferences

Data access – DuplicativeEach location/group has a copy or subset of required data

Sharing of data/tools – Individual, not concertedthrough ftp/email

Knowledge sharing – Delayed

Page 16: A PROPOSED  EARTH SCIENCE COLLABORATORY

A Use CaseGPM Algorithm Development – with ESC

7/27/1116IGARSS 2011, Vancouver, Canada

Tools

Data

ESC ClientmyS

ci Cat.

A

Tools

Data

ESC ClientmyS

ci Cat.

Z

Cloud

VM ImageTool

s Data

A

VM ImageTool

s Data

B

Community Catalog

ESC

Page 17: A PROPOSED  EARTH SCIENCE COLLABORATORY

A Use CaseGPM Algorithm Development – Multi-level

MembershipDC

B

A

K

J I H

G

F

E

GPMRadar-Only

Radiometer-Only

Combined

Algorithm

M

L

Page 18: A PROPOSED  EARTH SCIENCE COLLABORATORY

A Use CaseGPM Algorithm Development – in ESC

7/27/1118IGARSS 2011, Vancouver, Canada

Enhanced communication/coordination – wide bandwidthEfficient data access – less duplicationImproved sharing – more pervasiveEffective knowledge sharing – immediate

Page 19: A PROPOSED  EARTH SCIENCE COLLABORATORY

Thank you!

7/27/1119IGARSS 2011, Vancouver, Canada

Page 20: A PROPOSED  EARTH SCIENCE COLLABORATORY

Why now?Because we can do it (finally)!

Advances in standards acceptance andimplementation (OPeNDAP, autoconf)A consistent, loosely coupled architecture encapsulates complexity and maximizes flexibilitySocial networking has reached the mainstreamKey lessons can be learned from prior efforts

The need is growingInterest in working with multiple datasets is growingCalls for transparency and reproducibility are growing

7/27/1120IGARSS 2011, Vancouver, Canada

Page 21: A PROPOSED  EARTH SCIENCE COLLABORATORY

What’s New?Macro View (forest-level)

Systematic approach to making data available to services and vice versaIntegration of all major analysis componentsConsistent view of all architectural componentsCyberinfrastructure services for all architectural components

Micro View (tree-level): Nothing!

7/27/1121IGARSS 2011, Vancouver, Canada

Page 22: A PROPOSED  EARTH SCIENCE COLLABORATORY

How to move forward?

Option 1RFC to community on feasibility, challenges, approachFollowed by RFPs for component and integration

Option 2Narrow end-to-end prototypeFollowed by refactoring and broadening

7/27/1122IGARSS 2011, Vancouver, Canada