Upload
others
View
5
Download
0
Embed Size (px)
Citation preview
Open Science activities with ICSU‐World Data System, International Enterprise of
Long Term Data Preservation
Dr. Yasuhiro Murayama
Member of Cabinet Office Expert Panel of Open Science
Associate member, Science Council of Japan
ICSU‐World Data System Scientific Committee ex officio
Natl. Inst. Information & Communications Technology
1
International Programme Office Hosted by
Based in Tokyo, Japan
Data Sharing Symposiumas a side event of RDA‐P7 meeting, Tokyo 29 February 2016
What to discuss about data?
• Data issue – Relation with Society, and Mutual Trustworthiness
– Accelerate Science (and Social Activity)
• Data Driven Innovation
• New era of digital information vs. printed records
• What we need to discuss?– We already have TCP/IP, Internet, Web, XML, RDF…
– Is anything else necessary for Information Society & Data Driven Innovation?
• Issue has multi‐facets. One is Open Science.
Shared in the research community
Open discussion and criticism
Paper
Data
Conclusion
Analysis/Discussion
Experiment/Observatoin
Hypothesis
Question
Conventional Science Method
Science and Society: research papers and data
Community consensus
General Society/political decision making
Who does? Immediately? Mandatory?
An article is not sufficient for validating results.Reproducibility issueResearch integrity issue
Data as 1st‐class research outputSocial information asset, provided to the general society
Essential for “irreproducible” natural phenomenaGlobal change, space, living organisms, health . . .
Mutual trust between Science and Society
e.g., IPCC report by >3000 scientists policy makers
[IPCC, 2013]
Approx. 1,300 scientists worked for the IPCC WG1.(3,000‐4,000 scientists for all WG1‐3?)
IPCC (Intergovernmental Panel on Climate Change)
WG1 “Physical Science Basis”
Climate Change Knowledge with Thousands of Scientists
What to discuss about data?
• Data issue – Relation and Trustworthiness with Society
• Data Driven Innovation
• New era of digital information vs. printed records
• What we need to discuss?– We already have technology of Internet, Web, XML, RDF…
– Is anything else necessary for Information Society & Data Driven Innovation?
• Issue has multi‐facets. One is Open Science.
An analogy: distance from technology to service
• For Users
• Application
• Usage Policy
• Professional Use Techniques
• Technology Basis
Weather Radar System
Radar echo from rain
Radar echo data uncorrected
rainfall data
Noise reduction, data cleansingTransform: time distance
Calibrating estimated rainfall data
Visualization
Rainfall Map
http://www.jmbsc.or.jp/hp/offline/sanpl/radar_amedas.png
Persistent Identifier
Data = Digital Object
DOI Metadata, Domain metadata,Data granularityDynamic data…
DOI (Digital Object Identifier)
Value AddedServices
Brokering, Aggregation,managementinfrastructure
Search, analytics
[Kathleen Fontaine, 2015]
“Data and Science”
• RDA Community Capability Model Interest Group – Secretary: Univ. of Bath & Microsoft Research Connections
↑https://www.rd‐alliance.org/filedepot_download/383/230
データポリシーData policy,
Data sharing principles
E‐インフラストラクチャE‐infrastructure,
メータデータ, 識別子、オントロジmetadata, PIDs,
ontology…
データパブリケーション
Data publication‐Cost Recovery
Model
データ引用、被引用度、「データ業績」
Data citation, credit…
研究職採用、昇進評価
Scholar position,evaluation
データマネジメントプラン
Data Management Plan (DMP)
Data repository,Long‐term preservation
Print & Electronic Technologies as Social Info. Infrastructures‐‐‐ 百年の印刷文化の基礎支えと、成長途中のディジタル・サイエンス
351 years
70 years
Public library (paper media) :8c
Printing press/Gutenberg: 1445
First scientific journal: 1665
Intl. Assoc. Academies: 1899
ICSU established: 1931
World Data Center system : 1957
ENIAC, von Neumann: 1946
Hard Disk Drive: 1956
TCP/IP, dial‐up (64kbps): 1982
WWW (CERN): 1991
Broadband internet(>1Mbps):~2000
New global data initiatives: ICSU‐WDS, RDA etc.:2008~2013
Print MediaElectronicMedia
9
Creation of ICSU‐World Data Systemon ICSU 29th General Assembly decision (October 28, 2008):
10
PAST(since 1950’s)
PRESENT(2008~)
ICSU International Scientific Unions data bodiesICSU National Members data bodiesICSU Interdisciplinary Bodies data activities
WDC (World Data Center) : 50 WDSs at max.
FAGS (Federation of Astronomical and Geophysical Data Analysis Services)
60 Regular Data curation & data analysis services
10 Network Networks of Regular Members & umbrella organizations
4 Partner Do not deal directly with data stewardship, but support to ICSU-WDS
18 Associate Organizations interested in the WDS endeavour
92 Members (June 2015)
Toward “Global Community of Trusted Science Data Repository/Services for Long Term Preservation”
supported by
Opening Ceremony of WDS International Programme Office
[Y. Murayama, 2012]
ICSU (Intl. Council for Science) has established ICSU‐WDS (October 2008).
Science Council of Japan (SCJ) cooperated hosting WDS Intl.
Programme Office at NICT, Japan.
ICSU President Prof. Y.T.Lee(2012)
SCJ President Prof. Ohnishi
SCJ former Vice President Prof. Doi
[F. Kasuga, 2013]
May 2012)
WDS‐IPO Executive Director: Dr. Mustapha Mokrane
Scientific Committee
Minister of Internal Affairs & Communications Vice Minister
of MEXT
NICT President Prof. Miyahara
ICSU‐WDS Objectives
• Enable universal and equitable access to quality‐assured multi‐disciplinary scientific data;
• Ensure long term data preservation/ stewardship;
• Foster compliance to agreed‐upon datastandards and conventions;
• Provide mechanisms to facilitate and improve access to data and data products.
12
In harmony with GEOSS open data policy (ICSU/CODATA contirubted to)
Population data
Income data
Health insurance articles
(d) Referential context
How datasets are cited by articles
Inter-university Consortium for Political and Social Research (ICPSR)
(http://www.icpsr.umich.edu/)115 154 citations from OAI PMH
(a) Data collection community
Australian Data Archive (ADA)(http://www.ada.edu.au/)
(b) Data sharing community
Working
Inequality
Attitudes
National SocialScience Survey
Article Data
Pangaea(http://www.pangaea.de/)
384,815 citations from OAI-PMH
Reports of the Deep Sea Drilling Project
Physical properties of Hole ##
[Zettsu et al., 2014]
Dynamics of InfrastructureEdwards, et al. 2007 Understanding Infrastructure: Dynamics, Tensions, andDesign.
• Infrastructures become “ubiquitous, accessible, reliable, andtransparent” as they mature.
• Systems Networks Inter-networks
• “system-building, characterized by the deliberate and successful design of technology-based services.”
• “technology transfer across domains and locations results invariations on the original design, as well as the emergence ofcompeting systems.”
• Finally, “a process of consolidation characterized by gateways thatallow dissimilar systems to be linked into networks.”
オープンサイエンスとデータ出版・引用
15http://www.ands.org.au/cite‐data/index.html
“Research Data Sharing” Today •Data sharing practices in past
• Earth science, high energy physics, genomics,…
• Why is it a topic to discuss, in WDS、G8、RDA、etc.?
• Some thoughts in context of Open Science
– Example of past practices
• Sharing in a specific community or domain: its background culture/knowledge are already shared in the community.
• “Metadata” (data attribution, credit, licensing…) can be minimum. Data reuse culture is based on & depends on the community’s own norm.
– Example of discussion in context of Open Science
• Data: 1st class output of research (cf. research papers)
• Wish Data be findable, citable, and reusable by anybody in future (like research papers) Info. Management as “Information Asset”
• Metadata (data provenance, identifier , etc.) is increasingly important.
• “Information Organization” should the key.16
Landscape of Open Science/Research Data Sharing (from my viewpoint)
17
Earth, Space,Physics, Informatics,…
Space Sci.Computer Sci.Physics
Seismology
LinguisticsHistoryPsychology
日本学術会議
Science Council of Japan
DIAS w/GEOSS(U Tokyo, JAXA、
JAMSTEC, NIES etc)
Future Earth(ICSU, UNESCO, UNEP,
UNU, Belmont Forum,…)
RDA(Research Data Alliance)
G8 Science Minister Meeting (2013.6)
…etc.(96 Member Bodies)
Linguistics
2008‐
2012‐
OECD Open Science WG etc.
文部科学省
Ministry of Education, Science…
科学技術振興機構Japan Sci. &Tech Agency
総合科学技術・イノベーション会議
Council for Sci. Tech. & Innovation
Cabinet Office of JapanSocial Science
IonosphereSpace Weather
WDS Intl. Program Office, Tokyo
UNESCO Ocean Data Exchange
European Open Science Cloud
PLEASE KEEP IN MIND:
• The objective is to promote science and social activity.– Any rules or regulations should not discourage scientists/researchers.
• DON’T regulate what we don’t yet understand.(“The Data Harvest”, RDA Europe, December 2014)
12-16 September 2016in
Denver, Colorado, USA
FIN.
• “Science is built of facts the way a house is built of bricks” (Henri Poincare, 1902)
家がレンガから建てられるように科学は事実から作られる
• “Facts”: Data
• ”…but an accumulation of facts is no more science than a pile of bricks is a house”
しかしレンガの山が家でないように、事実の積み上げだけでは科学でない
• Today’s Science tends to be advanced, specialized, far from what is sensed in your daily life. Fact as basis of science is increasingly important.
Examples of “what is going about scientific data in Japan”
1965~
2006~ 2007~
1958~
1957~
ICSU 29th General Assembly decision (October 28, 2008):
23
PAST(since 1950’s)
PRESENT(2008~) ICSU International Scientific Unions data
bodiesICSU National Members data bodiesICSU Interdisciplinary Bodies data activities
WDC (World Data Center) : 50 WDSs at max.
FAGS (Federation of Astronomical and Geophysical Data Analysis Services)
Opening Ceremony of WDS Intl. Programme Office (May 2012, Tokyo)
WDS‐IPO Hosted by
Based in Tokyo, Japan
ICSU President
Sci. Council Of JapanPresident
Ministers for ICT, Education & Science
ICSU‐World Data SystemAcademic enterprise for long term data preservation
G8 2013 Science Ministers’ Agreement of Open Research Data
“Open Government Data”
Open Access to Open Data and Open ScienceOverview example
Open Access Open Science
Open Data(Open gov.)
Open Research Data
Creative Commons
Open Source
2000’s 2010’s
Science 2.0+
Citizen Science
Self Archiving
Science Commons
Institutional Repository Full OA (mega) journal
Article Research Outputs
Database, Repository
Research ActivityAccess ReUse
Data Sharing
Data journal
Open Innovation
Improve,Incremental
Redesign,Disruptive?
(Scholarly activity)
(Common activity)
Code for X
25
[K. Hayashi, 2015]
Expert Panel on Open Science based on Global Perspectives (Cabinet Office, Japan)
26
Promoting Open Science in Japan Opening up a new era for the advancement of science Report by the Expert Panel on Open Science, based on Global Perspectives Cabinet Office, Government of Japan March 30, 2015
[H. Manago, 2015]
Cabinet Office/CSTI:National Principle of Open ScienceCabinet Office “Expert Panel of Open Science” (Dec, ‘14 ‐‐‐March ‘15)
http://www8.cao.go.jp/cstp/sonota/openscience/
= Final Report was published at the Web site 30 March 2015.
Input to the 5th National Basic S&T Plan
[H. Manago, 2015]
Policy map for Promotion of Open Science
04/03/2015
[H. Manago, 2015]
28
Example of STI Infrastructure in Japan• University‐based Institutional Repository network over the nation
– For Articles– Pilot project for research data started.– JAIRO: Japan Institutional Repositories Online Online Cloud
[K. Yamaji (NII), 2015]
“JAIRO” Cloud service
University On‐premise
Cloud Services for Open Science
SINET
research groupsuniversities
cloud cloudcloud
thesespapersthesespapers
class
Inter-Cloud
openscience/data
researchdata
researchdata
researchlogs
researchlogs
openscience/data
educationdata/logseducationdata/logs
openeducation
cloudproviders
universities
GakuNinCloudresearch
data as evidence
support for procurement
repository hosting support for multi-cloud
Direct Connect Direct Connect high-speed & secure comm.
30[K. Yamaji (NII), 2015]
Visit Tokyo 29 February‐3 March!
Openness is not a final destination, but be interoperable.
Fin.
http://www.icsu‐wds.org/community/webinars/webinar‐2/RDAWDSPublishingDataIGWebinarIntro.pdf
Data Sharing, the Informatics Way:DOI (Digital Object Identifier) for Research Data
[Adapted from Hideaki Takeda (2015) ]
IdentifiersIdentifiers
Domain Science Contents
Domain Science Contents
MetadataMetadata
Identifier’s MetadataIdentifier’s Metadata
DomainMetadataDomainMetadata
ProduceProduce PreservePreserve PublishPublish ReviseRevise DiscardDiscard
ResgisterResgister
ResgisterResgister
ResgisterResgister
ReviseRevise
ReviseReviseProduceProduce
ProduceProduce
LibraryLibraryResearchInstituteResearchInstitute
ResearchProjectResearchProject
ResearchersResearchers
Data Life Cycle, Its Steps, and Its StakeholdersData Life Cycle, Its Steps, and Its Stakeholders
Japanese Metadata framework “IUGONET” (inter‐univ. upper atmos. obs. network)
IUGONET Metadata Databaseand search system
NASA’s Metadata Schema “SPASE”
MD Schema ExtensionFor GroundbasedObservations
MD Schema Partnership
EU‐ESPAS Project Ontology & MD
Japan Link Center’s Experimental Project for minting/registration of dataset’s DOI
• DOI (Digital Object Identifier) for citing an object digitally.
– DOI is managed by “Intl. DOI Foundation” (IDF).
– DOI can be given only by Registration Agency (RA) under IDF.
– Japan Link Center (JaLC) is only a RA in Japan.
• JaLC experiment project: Japan’s first attempt to register dataset’s DOI
• Project steering committee: universities, natl. research institutes, & NDL.
36
DOI SystemJapanese Data Centers
Assign DOI prefix
Register DOI‐URL mapping
9 RAs
doi:10.xxxx (DOI prefix)
Web Interface
[Nose et a., 2014]