1
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( [email protected] ) , Yu Chen 1 ([email protected] ) , Patrick West 1 ([email protected] ) , John S. Erickson 1 ([email protected] ) , Xiaogang Ma 1 ([email protected] ) , Peter Fox 1 ([email protected] ) ( 1 Rensselaer Polytechnic Institute 110 8 th St., Troy, NY, 12180 United States) Deep Carbon Observatory (DCO) is a decade-long scientific endeavor to understand carbon in the complex deep Earth system. Thousands of DCO scientists from institutions across the globe are organized into communities representing four domains of exploration: Extreme Physics and Chemistry, Reservoirs and Fluxes, Deep Energy, and Deep Life. Cross-community and cross- disciplinary collaboration is one of the most distinctive features in DCO's flexible research framework. VIVO is an open-source Semantic Web platform that facilitates cross-institutional researcher and research discovery. it includes a number of standard ontologies that interconnect people, organizations, publications, activities, locations, and other entities of research interest to enable browsing, searching, visualizing, and generating Linked Open (research) Data. The DCO-VIVO solution expedites research collaboration between DCO scientists and communities. Based on DCO's specific requirements, the DCO Data Science team developed a series of extensions to the VIVO platform including extending the VIVO information model, extended query over the semantic information within VIVO, integration with other open source collaborative environments and data management systems, using single sign-on, assigning of unique Handles to DCO objects, and publication and dataset ingesting extensions using existing publication systems. We present here the iterative development of these requirements that are now in daily use by the DCO community of scientists for research reporting, information sharing, and resource discovery in support of research activities and program management. Poster: IN43A-3679 Glossary: CKAN – Data management system, the DCO Data Portal, http://data.deepcarbon.net DCO – Deep Carbon Observatory – https://deepcarbon.net Drupal – Content Management System, the DCO Community Portal, http://deepcarbon.net Handle – resolution services for unique and persistent identifiers, http://dx.deepcarbon.net RPI – Rensselaer Polytechnic Institute TWC – Tetherless World Constellation at Rensselaer Polytechnic Institute VIVO - https://wiki.duraspace.org/display/VIVO/ VIVO , the DCO Information Portal Acknowledgments: The DCO Data Science Team would like to acknowledge the valuable contributions from the DCO Engagement Team, DCO Secretariat, and the Sloan Foundation. Sponsors: Alfred P. Sloan Foundation Abstract Global community of ‘Carbon scientists’ contributing to the Deep Earth Computer (data legacy) comprising: Global Earth Mineral Laboratory Global Inventory of Deep Fluids Global Volcano Gas Emissions Global Census of Deep Microbial Life State of High Pressure and Temperature Carbon and Related Materials Global Inventory of Diamonds with Inclusions Group data deposit and reporting Listings of group content Group management and messaging Listings of group documents VIVO - represents academic research communities Every person, organization, or other data entity in VIVO has a unique identifier VIVO enables the discovery of research and scholarship across disciplines at one institution or across many Records are both human-readable and machine- readable VIVO Extension - we’ve extended (yes, ontologies) VIVO to the science network – datasets, instruments, sites, etc. DCO Statistics: Over 2700 people across 462 organizations. Over 1000 publications Over 1700 datasets And 339 research locations We take all of that information from all those different science domains across all those organizations and we organize it into a knowledge graph, currently over 450,000 triples. Data Information Knowledge Producers Consumers Context Presentation Organization Integration Conversation Creation Gathering Experience We take the raw data, have users augment that with additional information and context, link the data together, then present it to the user from the knowledge store Semantic representation of information stored and maintained in VIVO, a Knowledge Graph info.deepcarbon.net All Linked Together And all resources receive a unique handle, a DCO-ID dx.deepcarbon.net Visualizations within Drupal using twsparql module Community Network Map S2S Faceted Browser DCO Resources (e.g. datasets, documents, images, movies, etc…) stored using CKAN Repository data.deepcarbon.net The Community Semantic Representation of information and data into the VIVO Knowledge Store VIVO Becomes Central Hub of Information Reports, Visualization, Search and Browse all using VIVO Knowledge Store Take Away: 1. Lots of information and data from multiple seemingly disparate sources, pulling it into a common knowledge graph and making it available as one seamless system. 2. Using VIVO as the central point of information store and retrieval, the central Knowledge Store 3. Having multiple integration points, ingest points, and visualizations using the VIVO knowledge store Resource addition links to VIVO, users work in VIVO directly. DCO Community Portal for collaboration using Drupal. deepcarbon.net VIVO Semantic Representation, Ontologies in VIVO

DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( [email protected] ), Yu Chen 1 ([email protected]), Patrick West

Embed Size (px)

Citation preview

Page 1: DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( wangh17@rpi.edu ), Yu Chen 1 (cheny18@rpi.edu), Patrick West

DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science CommunitiesHan Wang1 ([email protected]), Yu Chen1 ([email protected]), Patrick West1 (

[email protected]), John S. Erickson1 ([email protected]), Xiaogang Ma1 (

[email protected]), Peter Fox1 ([email protected]) (1Rensselaer Polytechnic Institute 110 8th St., Troy, NY, 12180 United States)

Deep Carbon Observatory (DCO) is a decade-long scientific endeavor to understand carbon in the complex deep Earth system. Thousands of DCO scientists from institutions across the globe are organized into communities representing four domains of exploration: Extreme Physics and Chemistry, Reservoirs and Fluxes, Deep Energy, and Deep Life. Cross-community and cross-disciplinary collaboration is one of the most distinctive features in DCO's flexible research framework.

VIVO is an open-source Semantic Web platform that facilitates cross-institutional researcher and research discovery. it includes a number of standard ontologies that interconnect people, organizations, publications, activities, locations, and other entities of research interest to enable browsing, searching, visualizing, and generating Linked Open (research) Data.

The DCO-VIVO solution expedites research collaboration between DCO scientists and communities. Based on DCO's specific requirements, the DCO Data Science team developed a series of extensions to the VIVO platform including extending the VIVO information model, extended query over the semantic information within VIVO, integration with other open source collaborative environments and data management systems, using single sign-on, assigning of unique Handles to DCO objects, and publication and dataset ingesting extensions using existing publication systems. We present here the iterative development of these requirements that are now in daily use by the DCO community of scientists for research reporting, information sharing, and resource discovery in support of research activities and program management.

Poster: IN43A-3679Glossary:CKAN – Data management system, the DCO Data Portal, http://data.deepcarbon.net DCO – Deep Carbon Observatory – https://deepcarbon.netDrupal – Content Management System, the DCO Community Portal, http://deepcarbon.net Handle – resolution services for unique and persistent identifiers, http://dx.deepcarbon.netRPI – Rensselaer Polytechnic InstituteTWC – Tetherless World Constellation at Rensselaer Polytechnic InstituteVIVO - https://wiki.duraspace.org/display/VIVO/VIVO, the DCO Information Portal

Acknowledgments:The DCO Data Science Team would like to acknowledge the valuable contributions from the DCO Engagement Team, DCO Secretariat, and the Sloan Foundation.

Sponsors:

Alfred P. Sloan Foundation

Abstract

Global community of ‘Carbon scientists’ contributing to theDeep Earth Computer (data legacy) comprising:

Global Earth Mineral LaboratoryGlobal Inventory of Deep FluidsGlobal Volcano Gas EmissionsGlobal Census of Deep Microbial LifeState of High Pressure and Temperature Carbon and Related MaterialsGlobal Inventory of Diamonds with Inclusions

Group data deposit andreporting

Listings of group content

Group management and

messagingListings of

group documents

VIVO - represents academic research communities• Every person, organization, or other data entity in VIVO has a

unique identifier• VIVO enables the discovery of research and scholarship across

disciplines at one institution or across many• Records are both human-readable and machine-readable• VIVO Extension - we’ve extended (yes, ontologies) VIVO to the

science network – datasets, instruments, sites, etc.

DCO Statistics:• Over 2700 people across 462

organizations.• Over 1000 publications• Over 1700 datasets• And 339 research locations

We take all of that information from all those different science domains across all those organizations and we organize it into a knowledge graph, currently over 450,000 triples.

Data Information Knowledge

Producers Consumers

Context

PresentationOrganization

IntegrationConversation

CreationGathering

Experience

We take the raw data, have users augment that with additional information and context, link the data together, then present it to the user from the knowledge store

Semantic representation of information stored and maintained in VIVO, a Knowledge Graphinfo.deepcarbon.net

All Linked Together

And all resources receive a unique handle, a DCO-IDdx.deepcarbon.net

Visualizations within Drupal using twsparql module

Community Network Map

S2S Faceted Browser

DCO Resources (e.g. datasets, documents, images, movies, etc…) stored using CKAN Repositorydata.deepcarbon.net

The Community Semantic Representation of information and data into the VIVO Knowledge Store

VIVO Becomes Central Hub of Information

Reports, Visualization, Search and Browse all using VIVO Knowledge Store

Take Away:

1. Lots of information and data from multiple seemingly disparate sources, pulling it into a common knowledge graph and making it available as one seamless system.

2. Using VIVO as the central point of information store and retrieval, the central Knowledge Store3. Having multiple integration points, ingest points, and visualizations using the VIVO knowledge store

Resource addition links to VIVO, users work in VIVO directly.

DCO Community Portal for collaboration using Drupal. deepcarbon.net

VIVO

Semantic Representation, Ontologies in VIVO