14
X-SIGMA (An X ML based S imple data I ntegration system for G athering, M anaging and A ccessing scientific experimental data in grid environments) Karpjoo Jeong([email protected] ) Applied Grid Computing Center Department of Advanced Technology Fusion Konkuk University, Seoul, Korea

X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong([email protected])[email protected]

Embed Size (px)

Citation preview

Page 1: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

X-SIGMA(An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid

environments)

Karpjoo Jeong([email protected])

Applied Grid Computing Center

Department of Advanced Technology Fusion

Konkuk University, Seoul, Korea

Page 2: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Motivations

Two Application Projects– KOCED. TeleScience and Data Sharing Environments for

Civil Engineering Research in Korea– Glyco-MGrid. Collaborative Molecular Simulation Grid

Environments for Glycomics

Required to support scientific data management– Multiple data models, but not many. They may be changed

and new models may be added, but not frequently– Legacy data (just files)and analysis software (just code with

input and output files), but they are kind of simple

Conventional Systems– DBMS-based Systems. Too inflexible– Semantic Web Systems. Too advanced and complicated– File based Systems. Too flexible. Custom approach

Page 3: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Scientific Data Management

Data Models– Metadata– Experimental Data

Indexing & Searching– Metadata Management– Data Indexing and Searching

Data Repository– Storage for Experimental Data – Access Management

Data Repository

Data Models

ExperimentalData

Metadata

Sto

re

Register

Search

Explains

•3D Structure•Trajectory

•Scientist: John•Parameter: #$@•Data locationexplain

Page 4: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Goals: Data Models

Multiple Data ModelsData Model EvolutionData Integration

Data Repository

Data ModelA

ExperimentalData

Metadata

Data ModelB

ExperimentalData Metadata

Query

Page 5: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Metadata Model (Context Data)

Experimental Context– Information about experiments (e.g. owners and parameter

settings)

Logical View of Experimental Data– Logical organization of physical experimental data– Their location information– Associated software information

Page 6: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

X-SIGMA Metadata Model Example

< (b) User Interface for Editor >

Page 7: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Logical View of Experimental Data

Location, Format and Associated Software

Page 8: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Goals: Federated Data Repository

Distributed RepositoryLocal Site Autonomic ManagementDecoupling between Metadata and Experimental DataLegacy Data Access Support

Global MetadataManagement

Distributed Access to Experimental Data

Site ASite B

Page 9: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

X-SIGMA System Structure

Global X-SIGMAGlobal X-SIGMA

GlobalSchema

Management

QueryProcessor

ExperimentalData Access

System

Local X-SIGMA

LocalSchema

Management

Local QueryProcessing

System

LocalExperimental

Access System

Storage(File, Legacy)

Schema &Context Data

SchemaIntegration

DistributedQuery System(OGSA-DAI)

DistributedAccess System(SRB,GRID-FTP)

Register/Insert

Search Access

Local X-SIGMA

LocalSchema

Management

Local QueryProcessing

System

LocalExperimental

Access System

Storage(File, Legacy)

Schema &Context Data

Register/Insert

Search Access

Local X-SIGMA

LocalSchema

Management

Local QueryProcessing

System

LocalExperimental

Access System

Storage(File, Legacy)

Schema &Context Data

Register/Insert

Search Access

ContextData

GlobalSchema

Search& Access

Page 10: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

X-SIGMA System Architecture

XMLDatabase

< Site A >

StorageXML

Database

< Site B >

StorageXML

Database

< Site C >

Storage

Global Schema Management

Distributed Query Processor

ExperimentalData Access System

Schema & ContextDataManagement

Query Process

Real Data Access

Schema & ContextDataManagement

Query Process

Real Data Access

Schema & ContextDataManagement

Query Process

Real Data Access

Global X-SIGMA

RDF Database

Local X-SIGMA Local X-SIGMA Local X-SIGMA

OGSA-DAI SRB

Grid Middleware

Integrated Access Layer

ContextData

SearchingContext Data

AccessingReal Data

Page 11: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Interfaces

GUI-based InterfacesWeb Services based Interfaces

Page 12: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Glyco-MGrid

Glyco-MGrid is a molecular simulation computing and data grid portal for glycomics

It provides shared and integrated cyber-environments which support simulation, databases, and trajectory analysis in a collaborative way

Data sharing is based on X-SIGMA

Page 13: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Glyco-MGrid

Glyco-MGrid

Database Simulation TrajectoryActive Projects

MGrid-SDG(X-SIGMA)

MGrid-CG AnalysisComputing

PSE

XMLDocument

GridFTP/RFTRealData

Page 14: X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr)jeongk@konkuk.ac.kr

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

References

“X-SIGMA: XML based Simple data Integration system for Gathering, Managing, and Accessing Scientific Experimental Data in Grid Environments”, 2nd Conference on eScience and Grid Computing, 2006

“Glyco-MGrid : A Collaborative Molecular Simulation Grid for e-Glycomics”, To appear in 3rd Conference on eScience and Grid Computing, 2007