Upload
antonio-montgomery
View
213
Download
0
Tags:
Embed Size (px)
Citation preview
X-SIGMA(An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid
environments)
Karpjoo Jeong([email protected])
Applied Grid Computing Center
Department of Advanced Technology Fusion
Konkuk University, Seoul, Korea
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Motivations
Two Application Projects– KOCED. TeleScience and Data Sharing Environments for
Civil Engineering Research in Korea– Glyco-MGrid. Collaborative Molecular Simulation Grid
Environments for Glycomics
Required to support scientific data management– Multiple data models, but not many. They may be changed
and new models may be added, but not frequently– Legacy data (just files)and analysis software (just code with
input and output files), but they are kind of simple
Conventional Systems– DBMS-based Systems. Too inflexible– Semantic Web Systems. Too advanced and complicated– File based Systems. Too flexible. Custom approach
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Scientific Data Management
Data Models– Metadata– Experimental Data
Indexing & Searching– Metadata Management– Data Indexing and Searching
Data Repository– Storage for Experimental Data – Access Management
Data Repository
Data Models
ExperimentalData
Metadata
Sto
re
Register
Search
Explains
•3D Structure•Trajectory
•Scientist: John•Parameter: #$@•Data locationexplain
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Goals: Data Models
Multiple Data ModelsData Model EvolutionData Integration
Data Repository
Data ModelA
ExperimentalData
Metadata
Data ModelB
ExperimentalData Metadata
Query
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Metadata Model (Context Data)
Experimental Context– Information about experiments (e.g. owners and parameter
settings)
Logical View of Experimental Data– Logical organization of physical experimental data– Their location information– Associated software information
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
X-SIGMA Metadata Model Example
< (b) User Interface for Editor >
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Logical View of Experimental Data
Location, Format and Associated Software
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Goals: Federated Data Repository
Distributed RepositoryLocal Site Autonomic ManagementDecoupling between Metadata and Experimental DataLegacy Data Access Support
Global MetadataManagement
Distributed Access to Experimental Data
Site ASite B
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
X-SIGMA System Structure
Global X-SIGMAGlobal X-SIGMA
GlobalSchema
Management
QueryProcessor
ExperimentalData Access
System
Local X-SIGMA
LocalSchema
Management
Local QueryProcessing
System
LocalExperimental
Access System
Storage(File, Legacy)
Schema &Context Data
SchemaIntegration
DistributedQuery System(OGSA-DAI)
DistributedAccess System(SRB,GRID-FTP)
Register/Insert
Search Access
Local X-SIGMA
LocalSchema
Management
Local QueryProcessing
System
LocalExperimental
Access System
Storage(File, Legacy)
Schema &Context Data
Register/Insert
Search Access
Local X-SIGMA
LocalSchema
Management
Local QueryProcessing
System
LocalExperimental
Access System
Storage(File, Legacy)
Schema &Context Data
Register/Insert
Search Access
ContextData
GlobalSchema
Search& Access
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
X-SIGMA System Architecture
XMLDatabase
< Site A >
StorageXML
Database
< Site B >
StorageXML
Database
< Site C >
Storage
Global Schema Management
Distributed Query Processor
ExperimentalData Access System
Schema & ContextDataManagement
Query Process
Real Data Access
Schema & ContextDataManagement
Query Process
Real Data Access
Schema & ContextDataManagement
Query Process
Real Data Access
Global X-SIGMA
RDF Database
Local X-SIGMA Local X-SIGMA Local X-SIGMA
OGSA-DAI SRB
Grid Middleware
Integrated Access Layer
ContextData
SearchingContext Data
AccessingReal Data
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Interfaces
GUI-based InterfacesWeb Services based Interfaces
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Glyco-MGrid
Glyco-MGrid is a molecular simulation computing and data grid portal for glycomics
It provides shared and integrated cyber-environments which support simulation, databases, and trajectory analysis in a collaborative way
Data sharing is based on X-SIGMA
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
Glyco-MGrid
Glyco-MGrid
Database Simulation TrajectoryActive Projects
MGrid-SDG(X-SIGMA)
MGrid-CG AnalysisComputing
PSE
XMLDocument
GridFTP/RFTRealData
© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.
References
“X-SIGMA: XML based Simple data Integration system for Gathering, Managing, and Accessing Scientific Experimental Data in Grid Environments”, 2nd Conference on eScience and Grid Computing, 2006
“Glyco-MGrid : A Collaborative Molecular Simulation Grid for e-Glycomics”, To appear in 3rd Conference on eScience and Grid Computing, 2007