49
TieJian Luo Ph.D [email protected] Graduate University of Chinese Academy of Sciences Scientific Collaboration Cyberspace for China State Key Labs

TieJian Luo Ph.D [email protected] Graduate University of Chinese Academy of Sciences Scientific Collaboration Cyberspace for China State Key Labs

Embed Size (px)

Citation preview

TieJian Luo [email protected]

Graduate University of Chinese Academy of Sciences

Scientific Collaboration Cyberspace

for China State Key Labs

2

Agenda

• Research Models

• Challenges & Collaboration Services

• Scientific research Projects

• Lessons from the SCC development

3

New Science Paradigms

• Thousand years ago: Experimental Science

- description of natural phenomena• Last few hundred years:

Theoretical Science - Newton’s Laws, Maxwell’s Equations …

• Last few decades: Computational Science

- simulation of complex phenomena• Today:

e-Science or Data-centric Science - unify theory, experiment, and simulation - using data exploration and data mining– Data captured by instruments – Data generated by simulations– Processed by software– Scientist analyzes databases/files(Credit to Jim Gray)

2

22.

3

4

a

cG

a

a

2

22.

3

4

a

cG

a

a

4

The Problem for the e-Scientist

• Data ingest

• Managing a petabyte

• Common schema

• How to organize it?

• How to reorganize it?

• How to coexist & cooperate with others?

• Data Query and Visualization tools

• Support/training

• Performance– Execute queries in a minute

– Batch (big) query scheduling

Experiments &Instruments

Simulationsfacts

facts

answers

questions

?Literature

Other Archives facts

facts

5

Basic Research Paradigm

extract information

data

simulation

Data mining

deduce

model

idea

Scientific computing

result

Verify model( 1 ) problem model

( 2 ) experiment, data collectio

n

( 3 ) get computing resource

( 4 ) computing, verify model

Deduce natural law( 1 ) identify problem domain( 2 ) experiment, data collection ( 3 ) get computing resource( 4 ) analyze data , deduce law

6

Scenario

Site F: Sechedule

Site C : Raw data Site D : Storage

1

2

3

Site E : Computer( simulation, data mining )

4

5

6

7

8

9

cerficiation

Site A userSite B user

10

7

Collaboration Interactive Model and its Service• People to People

– Communication

– Groupware Service

• Access to Facilities– Interaction with the

Physical World

– Access to Instrument online

– Access to Applications

• Access to Information– Digital Libraries, E-Pub

– Search Service

Collaboration Interactive Model

8

What is a scientific Collaboration ?

• Definition• Two or more people work together to create or achieve the same

thing. • Key points

• Different or same discipline scientists• Form a Virtual Team• Need a agreement• Have a common research theme• Divide task into several works• Discuss problems, use instruments and share information• Goal: create new knowledge

9

Collaboration Technologies

• Access to• People

• Instruments

• Stored information

• Synchronous vs asynchronous• Chat, video conferencing, audio streaming

• Shared objects of conversation (like data)

• Threaded discussions

• Electronic notebooks

• Transitions between them

• They are limited to address users’ problems!

10

Primary services for SCC

• Virtual Team

• Access Instrument

• Sharing Data / Expertise

• Activity Management

• Online Discuss

11

Overview of China State Key Labs

SKL disciplines coverage

Total number is 162-----------------------------------------

Disciplines Number

----------------------------------

Chemistry 22

Math and Physics 15

Geognosy 18

Biology 38

Information 26

Material 18

Engineering 25

---------------------------------- Target users more than 100k Potential connected large instruments more than 6k

12

Physical Resources and Virtual Team

people

deviceKnow how

data

SKL A

Info

people

deviceKnow how

data

SKL B

info

people

deviceKnow how

data

SKL D

Info

people

deviceKnow how

data

SKL C

Info

Virtual team C

Virtual team A

Virtual team B

13

Strategy to Create the SCC

• State of the art ---Some efforts of science communities• UK : e-Science• EuroGrid• USA: Cyberinfrastructure---Globus Toolkits• Japan : ITBL ( IT Based Laborarory )• China Grid initiative

• IT and commercial product• Netmeeting Polycom WebEx• Middleware: Workplace 、 SharePoint 、 WebLogic • Platform: .NET , J2EE• Forum and whiteboard , etc.

• Strategy: Choose the mature and simpler technology to address users’ problem.

14

Scie

ntifi

c Colla

bora

tion

Cybersp

ace

Che

Network Infrastructure

LDAPACLAuth SSL

Task

Know How Expert

Network com

Team

Instrument Acc

Pub/Search

Project Metrics

Semantic Data

Phy Math

……

InstrumentsInstruments Data ResourceData ResourceComputingComputing Instrumen

tInstrument

Archie Archie DataData Multimedia Multimedia Raw data Raw data Physical

Network

Services

Collaboration ‘game rules’

SitesResources JointManagement

Mat EngBio Geo

……Services

Domain

15

Virtual Team management

SCC Portal

Set up Project

Support management

Project1 Project2 Project n

Form a team

Collaboration actions

Set up Project

Support management

Form a team

Collaboration actions

Set up Project

Support management

Form a team

Collaboration actions

…… ……

AttributesnamegoaldomainPI infoResource planning ……

Optimize team

Task assignment

Know how

Information share

Instrument using

Online discuss

Optimize team

Task assignment

Know how

Information share

Instrument using

Online discuss

Optimize team

Task assignment

Know how

Information share

Instrument using

Online discuss

16

Virtual team

Registration

[no pass]

[pass]

usersystem PI: Primary Investigator

PHProject Head

Apply create project space

Got space

Verify member

Set up permit

Assign task

Manage task

[pass]

Apply form a team

Team set up

set up PH permit

verify PH

17

Support Roles

• SSC• Project

• PI • Team member• Team leader ( adm ):• Web Site adm ( adm ):• Web site designer• …

• visitors• experts

• Roles :• Visitors, members 、 discuss participant 、 web design 、 adm 、 content adm

18

Virtual Team management

• Problems• Compromise the free and restrict jointing • Contradiction on distribution and centre control• Trust

• solutions• AAA technology (Authentication Authorization

Accounting) • Online registration • Trust metrics• Portal

19

Access to instruments

Logon SSC

[no pass]

[pass]

user

system

Instrument owner

Instrument user

registration

Instrument adm

[pass]

apply

success

using

Schedule

Accounting log

verify

Project HeadAssign adm

20

Access to Instruments

• Problems• Instrument description, registration, publishing,

finding and using • Accounting and Fee

• Solutions• Web Service • LDAP and light weigh instrument access protocol

21

Data sharing

Logon SKLCRE

[link]user

system

Data owner

Data user

Data registration

[push]

find

publishUsing data

Accounting log

[pull]

DownLoad data UpLoad data

Capture result

Up load data

22

Data Sharing

• Problems• Large scale scientific data two-way transfer• Accounting and Fee

• solutions• DBMS support data publish on Internet • Fast FTP automatic re-transfer

23

Online Discuss

Logon SKLCRE

usersystem Conf. host participant

Invite conf.

Text messageConf mem Conf mem

Enter conf.

Audio/vedio

Pub conf notice

reply

Set up conf.

Exit conf

Conf bro

End of conf

Exp. participant

Connect to monitor

reservation

collabortion

Record process

member

participating

24

• Problems• Multi-group’s participants joint the meeting at sam

e time• Co-laboratory environment

• solutions• Software based video and audio tech. • Light weigh desktop sharing tech.• Wiki• Forum

Online Discuss

25

Activity management

Logon SSC

user Data onwerPI Project head

achivement tasks

Personal achivement

member

progress

activity

Instrument adm

Pub info

Device usage

Data usage

Personal time

Personal usage

26

Activity management

• Problems• Global monitor the whole process• Contents index and statistic• Individual performance assessment

• solutions• Access control and monitor • DBMS and Data mining • Rating model

27

SSC Applications

• Project1 : Soil Carbon recycle mechanism database (CNSF,973,863-- grant)• 100+research units• Distribute data across mainland of China• More than 1K scientists

• Project2 : FACE ( Free Air CO2 Enrichment ) (CNSF,973--grant)• Experiment site and office at different place• Raw data, model software share• Many Sub tasks

• Project3 : China neutron experiment center (973,863,MoST -- grant)• Protect the staff from radiation• Monitor the experiment process• Inter-discipline field

• Project4 : Collaboration for Bio-Safety Level 3( MoST – grant)• 6+ institutes • Monitor the experiment process• Inter-discipline field

• Project5 : Collaboration for large scale instruments center (MoST – grant) • 10+ large scientific instruments• Data and expertise sharing• Remote control instruments

28GUCAS/ZGC---Server Farm

GUCAS/YQL---Distance Backup

科技部网络

SKL 1 SKL 162

CSTNet

CERNet

ChinaNet

10M

100M

1G

10M backup

MOST

2.5 G

10 GIPv6

Internet

SCC Network Architecture

29

Web Architecture

Benefits Adaptable

More than 10 different science subjects templates

Scalability Dynamic growing VT does not affect platform performance

ExtensibilityEasy plug in the a new service to the platform

30

SSC for China Key Labs

31

Requirement of FACE project

• FACE ( Free Air CO2 Enrichment )

• Find out the mechanism for the rice growing when the CO2 climate change

• 15 Collaboration org.• 9 national orgs ( 3SKLs ), 7 international , 10

0+ research staff.

• Invest 100m RMB, only one in China

• 30 mini projects

32

Contribution to the FACE science research

• Monitor the farm site by video

• Automatic upload the raw data

• Real time display control pane by Browser

33

FACE Project deployment

34

FACE Project

• New• 5 collaborations use case• Run more than one year• English version is ongoing

• Old• Only one function • Info pub

35Neutron emission facility

Remote monitor in Browser

36Data analysis software

Neutron diffraction remote control interface in Browser

37

·Å ÖÃʵÑéÑùÆ·Set s am p le

ºË ·´ Ó¦ ¶Ñn u clear react o r

ÑùÆ·Ðýת ÒÇs am p le circu m gy rat e d ev ice

ÖÐ×ÓÊøB eam o f n eu t ro n

Æ×ÒÇp ed igree d ev ice

Êý ¾Ý²É ¼¯

Êý ¾Ý²É ¼¯

Êý ¾Ý²É ¼¯

Êý ¾Ý²É ¼¯

ʵÑéÔËÐмà¿Ø»ú co m p u t er fo r m o n it o r exp erim en t s t at u s

ʵ Ñé ÊÒ Lab

Ô¶³Ì ¼à¿Ø·þ Îñ Æ÷M o n it o r Serv er

Éã Ïñ Í·cam era

ʵÑé ¹ý ³Ì ¼à¿Ø»ú co m p u t er fo r m o n it o r exp erim en t p ro ces s

ʵÑé¼à¿Ø´ú Àí »úm o n it o r p ro xy

In t ern et

.......

¼à¿ØʵÑéÔËÐÐm o n it o r exp erim en t s t at u s

¼à¿ØʵÑé¹ý ³Ìm o n it o r exp erim en t p ro ces s

¼à¿ØʵÑéÔËÐÐm o n it o r exp erim en t s t at u s

Benefit 1 staff are isolated from experiment site 2 remote real time monitor the process 3 scientists online discuss

Deployment of neutron diffraction remote access

38

Collaboration for Bio-Safety Level 3

39

Collaboration for Bio-Safety Level 3

40

Collaboration for large instruments center

41

Collaboration for large instruments center

42

BSL3 Lab of FUDAN University

43

BSL3 setting

办公区

生物安全柜

倒置显微镜

监控视频服务器

麦克风

控制输入

摄像机带云台的摄像头

实验室 控制及演播室

交换机

C CC

操控服务器

视频服务器

2M带宽到互联网

防火墙

Mini Camera

44

SSC Portal

user

Org.A

Org.B Org.C Org.D Org.E

Org.F

Portal model

Benefits:Trust access path Single sign-on Delegate permits and Proxy Interoperate

45

Expertise Share Model

Collaboration Cyberspace

Labs

Outside labs

Buy ruleBuy rule

Agreement (1,n)(1,m)

3

Extra members

1.contribution  2.get Knowledge   3.oversee

21

1 2

(1,p)(1,q)

Members

22

ParticipantManagement Agency

Affiliate org

E-community

Instrument

Data storage

Platform

Behavior

ConstraintsInfo flowCapital flow

Data Flow

Communicate

Management

{活动描述}

{规则名称}

LegendEnsure a fair collaboration game !

46

Video and audio interactive component

Benefits1. Multi node access2. Plug in SSC3. Security and efficiency

Benefits1. Multi node access2. Plug in SSC3. Security and efficiency

47

Lessons Learned

• How to address the user’s application simpler? • System development is less and less about coding than

about using things and gluing them together.• Create a flexible enough architecture to allow for

changes.• Customers will not be able to elucidate at the start what

they want; only by using the system will they be able to tell you what you should have done. (Semantic gaps)

• SSC 1.0 rely on MS SharePoint ; SSC 2.0 open source • Individual performance assessment model and fair

agreements (win-win business model). • Future concerns should be e-community trust metrics.

48

Acknowledge

• Dec.2005--Dec.2007, China Bio-Safe Level 3 Labs Collaborative Cyberspace, Grant from MOST of China

• Dec.2005--Dec.2007, China Large Scientific Instruments Collaborative Cyberspace, Grant from MOST of China

• Jul.2005--Jul.2008, EU-Asia Link Programme HPC-Grid Computing Course Model, Grant from EU

49

• URL:• http://crewiki.gucas.ac.cn/• http://cec.gucas.cn/• http://cbl.gucas.cn/• http://co-lab.chinalab.gov.cn/

• Questions?• Comments?