VIVO: A SEMANTIC WEB NETWORK ENABLING COLLABORATION AMONG SCIENTISTS Mike Conlon, University of Florida John Ruffing, Weill Cornell Medical College Friday 21 October 2011

Page 1: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO: A SEMANTIC WEB NETWORK ENABLING COLLABORATION AMONG

SCIENTISTS

Mike Conlon, University of Florida

John Ruffing, Weill Cornell Medical College

Friday 21 October 2011

Page 2: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

WHAT IS VIVO?

• VIVO is open standards and linked open data regarding science – people, papers/products, funding, events, resources, projects, data, concepts – and the relationships between them

• VIVO is open source, community maintained software tools for research discovery and networking

• VIVO is a world community of collaborators – scientists, implementers, developers

Page 3: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Data, Tools and Community

Page 4: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

OVERVIEW

• Institutional Context
• Data
• Tools
• Consuming Data
• Providing Data
• Community

Page 5: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO HISTORY AT A GLANCE

• 2003 – VIVO created for local use at Cornell University (Ithaca) to support a university-wide life sciences initiative

• 2009 – The National Center for Research Resources (NIH) awards the VIVO Collaboration a two-year, $12.2 million grant for networking of researchers. A parallel grant for collecting and networking research resources is awarded to the eagle-i Consortium.

• 2010 Apr – Version 1.0 released

• 2010 July – Version 1.1 released

• 2010 Aug – First VIVO conference (NYC, 207 attendees)

• 2011 Feb – Version 1.2 and Harvester version 1.0

• 2011 July – Version 1.3 released

• 2011 Aug – Second VIVO conference (D.C., 262 attendees)

• 2012 Aug – Third VIVO conference (Miami)

Page 6: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

CURRENT PILOT IMPLEMENTATION SITES AND COLLABORATORS

Page 7: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

DATA: LINKED AND OPEN

• Semantic Web
• RDF
• Ontology

Page 8: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO’s semantic advantage

Data modeled as bidirectional relationships

All data has standard format

Everything has its own URI

Page 9: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Resource Description Framework (RDF)

• Simple data model for representing information
• Allows anyone to make statements about any resource
• Can be represented in XML
• Based on “triples”:

Subject [Susan Riha] → Predicate [head of] → Object [NYS WRI]

From: http://www.w3.org/TR/rdf-concepts/
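
The triple model can be sketched in a few lines of base R. This is only an illustration: the data frame layout and the `objects_of` helper are assumptions made here, not part of VIVO or of any RDF library, and the second row is an illustrative guess at a statement rather than data taken from a VIVO site.

```r
# Each row is one statement: subject, predicate, object.
# Row 1 is the triple shown on the slide; row 2 is a hypothetical example.
triples <- data.frame(
  subject   = c("Susan Riha", "Susan Riha"),
  predicate = c("head of", "research area"),
  object    = c("NYS WRI", "crop management"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: return the objects of every statement that matches
# the given subject and predicate.
objects_of <- function(triples, subject, predicate) {
  triples$object[triples$subject == subject & triples$predicate == predicate]
}

head_of <- objects_of(triples, "Susan Riha", "head of")
print(head_of)
```

Real RDF identifies subjects, predicates, and most objects by URI rather than by display label; plain strings are used here only to keep the sketch short.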

Page 10: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Semantic representation of data

[Diagram: graph of nodes connected by paired, bidirectional relationships]

Nodes: Andrew McDonald; Susan Riha; Mining the record: Historical evidence for…; crop management; CSS 4830; NYS WRI; Earth and Atmospheric Sciences; Cornell’s supercomputers crunch weather data to help farmers manage chemicals

Relationship pairs: author of / has author; research area / research area for; academic staff in / academic staff; teaches / taught by; head of / headed by; faculty appointment in / faculty members; featured in / features person
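
The paired labels in this diagram (for example "head of" and "headed by") suggest that each relationship can be read in both directions. A minimal base R sketch of that idea follows; the `inverse_of` lookup and the `invert_triple` function are hypothetical names introduced here, and in VIVO itself such pairs would be declared in the ontology rather than in application code.

```r
# Inverse predicate pairs, using the labels that appear in the diagram.
inverse_of <- c(
  "author of"     = "has author",
  "head of"       = "headed by",
  "teaches"       = "taught by",
  "featured in"   = "features person",
  "research area" = "research area for"
)

# Given one statement, build the statement that reads the same link backwards.
# Fails with an error if the predicate has no known inverse.
invert_triple <- function(subject, predicate, object) {
  list(
    subject   = object,
    predicate = inverse_of[[predicate]],
    object    = subject
  )
}

inv <- invert_triple("Susan Riha", "head of", "NYS WRI")
str(inv)
```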

Page 11: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Open data

Page 12: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

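# Requires the XML and RCurl packages.
library(XML)    # xmlParse, getNodeSet, xmlValue, xmlAttrs
library(RCurl)  # getURI

# Recursively read an organization's RDF/XML and collect its sub-organizations.
# `uri` is a URL on the first call; recursive calls pass the text fetched by getURI.
processOrg <- function(uri) {
  x <- xmlParse(uri)
  u <- NULL
  name <- xmlValue(getNodeSet(x, "//rdfs:label")[[1]])
  # "j.1" is the namespace prefix as it appears in the RDF/XML being parsed
  subs <- getNodeSet(x, "//j.1:hasSubOrganization")
  if (length(subs) == 0) {
    list(name = name, subs = NULL)
  } else {
    for (i in seq_along(subs)) {
      # Fetch the RDF for the sub-organization named by the resource attribute
      sub.uri <- getURI(xmlAttrs(subs[[i]])["resource"])
      # c() appends the child's name and subs entries to one flat list
      u <- c(u, processOrg(sub.uri))
    }
    list(name = name, subs = u)
  }
}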

VIVO produces both HTML and RDF

Software reads VIVO RDF and displays
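
One way the result of `processOrg` could be displayed is as an indented outline. The sketch below assumes the shape that function returns: a `name` plus a flat `subs` list of alternating name and subs entries. The `org` data and the `org_lines` function are hypothetical and stand in for a real VIVO query, so the example runs without network access.

```r
# Hypothetical result in the shape processOrg() returns.
org <- list(
  name = "University",
  subs = list(
    name = "College A", subs = list(name = "Department A1", subs = NULL),
    name = "College B", subs = NULL
  )
)

# Walk the alternating name/subs entries and indent each level by two spaces.
org_lines <- function(name, subs, depth = 0) {
  out <- paste0(strrep("  ", depth), name)
  if (length(subs) > 0) {
    for (i in seq(1, length(subs), by = 2)) {
      out <- c(out, org_lines(subs[[i]], subs[[i + 1]], depth + 1))
    }
  }
  out
}

chart <- org_lines(org$name, org$subs)
cat(chart, sep = "\n")
```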

Page 14: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Alignment with eagle-i ontology

Page 15: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO ENABLES AUTHORITATIVE DATA ABOUT RESEARCHERS TO JOIN THE LINKED DATA CLOUD.

http://richard.cyganiak.de/2007/10/lod/lod-datasets_2010-09-22_colored.png

Page 16: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

PROVIDING OPEN LINKED DATA

• VIVO version 1.3 completed. Includes spreadsheet upload. Google Refine. Harvester
• Fifty US schools adopting VIVO
• Harvard Profiles (30 sites) providing data using VIVO ontology and RDF
• SciVal Experts (20 sites) working to provide VIVO ontology data
• American Psychological Association adopts VIVO for its 154,000 members
• USDA adopts VIVO. 40,000 scientists, 80,000 staff, 50 land grant universities
• CTSA SG3 to propose VIVO ontology as a consortium-wide standard
• University of Rochester to provide CTSA-IP as VIVO data
• eagle-i and VIVO working to produce common ontology via RDF
• ORCID, Community of Science interchange with VIVO
• Stony Brook producing UMLS concept linkages to VIVO profiles
• Indiana provides HubZero profiles (3,000) via VIVO. Iowa Loki profiles (1,000) via VIVO.
• Adoptions in Mexico, Costa Rica, Puerto Rico, India, China, UK, Netherlands, Brazil
• Eight major Australian research universities and Australian federal research adopt VIVO
• Thomson Reuters and Elsevier providing data to VIVO
• Wellspring offering individual VIVO profiles
• Wellspring, Elsevier, Symplectic offering VIVO implementation services
• OpenPhacts (EU) proposing VIVO
• Implementation Fest held June 22-23, St. Louis. 12 schools

Page 18: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Full integration with Digital Vita

CV

Page 19: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Faceted search, browse, and ontology hierarchy

Page 20: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VISUALIZATIONS ABOUT PEOPLE

Page 21: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

INTER-INSTITUTIONAL COLLABORATION EXPLORER

http://xcite.hackerceo.org/VIVOviz

Page 22: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Draw organizational charts http://vivoweb.org/files/orgLast.pdf

Page 29: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Repurpose content into Drupal http://bit.ly/gmm8Ng

Page 30: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

RESEARCH DISCOVERY AND NETWORKING TOOLS

• VIVO search – research discovery and networking
• Duke, Florida – web site plug-ins for reuse of VIVO data
• Digital Enterprise Research Institute – analytics for VIVO data
• VIVO Search Light – find experts related to any page on the world wide web
• UCSF – find investigators “like me” across the network
• Harvard – visualize publication collaboration patterns
• Northwestern – C-IKnow Recommender for team building
• APA society portal. Identity management
• CTSA consortium portal
• Pittsburgh – Digital Vita – produce vita and biosketches
• Direct2Experts – get counts of researchers matching criteria and link to them
• Community of Science – use VIVO data for faculty interests, route opportunities to faculty
• Federal Researcher Profile System – avoid duplication of entry, simplify administration
• OpenPhacts (EU) – provide provenance for assertions
• NRN visualization – show data sources and their inventory of data
• VIVO concept – what topic areas are covered by people, departments, universities

Page 31: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Providing Data

Page 32: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

DATA FLOW THROUGH A VIVO SYSTEM

[Diagram] Inputs: local systems of record; external sources; interactive input

[Diagram] Core: data ingest; ontologies (RDF); VIVO (RDF)

[Diagram] Outputs: shared as RDF; RDF harvest; SPARQL endpoint
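
The diagram lists a SPARQL endpoint as one way data leaves a VIVO system. As a hedged sketch of what a consumer might send to such an endpoint, the base R function below only builds the query text. The slides give no endpoint URL or request format, so none is assumed here; `people_query` and its `limit` parameter are hypothetical names, while `foaf:Person` and `rdfs:label` are standard vocabulary terms.

```r
# Build a SPARQL query that lists people and their labels.
# The limit is an arbitrary illustrative value.
people_query <- function(limit = 10) {
  paste(
    "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>",
    "PREFIX foaf: <http://xmlns.com/foaf/0.1/>",
    "SELECT ?person ?name",
    "WHERE {",
    "  ?person a foaf:Person ;",
    "          rdfs:label ?name .",
    "}",
    sprintf("LIMIT %d", as.integer(limit)),
    sep = "\n"
  )
}

query <- people_query(5)
cat(query, "\n")
```

Keeping query construction separate from the HTTP request makes it easy to reuse the same text against different installations, whatever client is used to send it.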

Page 33: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO APPLICATION ARCHITECTURE

[Diagram] Layers: customization, application, delivery, foundation

Components:
• VIVO – servlets, page templates, JavaScript, CSS
• Java, Freemarker & JSPs
• local ontology extensions
• theming & branding, navigation, browse tools
• Apache – web server
• Tomcat – Java servlet container
• MySQL – relational database
• Jena – Java RDF library
• Lucene – Java search library
• Pellet – reasoning engine

Page 34: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO-Cornell: Harvester…and disseminator

Page 35: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO-Cornell as harvester: Content sources

[Diagram] Source labels shown: Manual; Annual faculty reporting; PubMed; Course database; OSP data warehouse; OHR – appointment

Page 36: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Harvester design

Page 37: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

POWER TOOL FOR DIRTY DATA – GOOGLE REFINE + VIVO

Page 38: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

WCMC/CTSA SOURCES OF DATA

• Local Systems of Record
  • HR
  • RASP
• Data Aggregators and Repositories
  • PubMed
  • Web of Science
  • Grants.gov
• Individuals or their Proxies

Page 39: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

TARGETS FOR HARVESTING DATA

Page 41: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

From local to national

[Diagram] Local: local sources; nat’l sources; VIVO; website data; search, browse, visualize; share as RDF

[Diagram] National: aggregating and indexing RDF; search, browse, visualize; share as RDF

Exemplar:
• Cornell University
• University of Florida
• Indiana University
• Ponce School of Medicine
• The Scripps Research Institute
• Washington University, St. Louis
• Weill Cornell Medical College

Page 42: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

LINKED OPEN DATA

RDF Triples

Page 43: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

BUILDING COMMUNITY

• Federal agencies – OSTP, NIH, NLM, NSF, USDA, FDP, FRPS, STAR Metrics, …
• Publishers and Aggregators – Elsevier, Thomson Reuters, ORCID, CiteSeer, arXiv, DSpace, …
• Professional Societies – APA, AAAS, AIRI, AAMC, ABRF, …
• International collaborators – Ireland, Germany, Australia, China, Netherlands, UK, Costa Rica, Iceland, Brazil, Mexico, …
• Semantic Web community – DERI, Tim Berners-Lee, MyExperiment, ConceptWeb, Open Phacts (EU), Linked Data, …
• Research resources – eagle-i, BRO, eBIRT, RDS, …
• Open Source cooperatives – Kuali, Sakai, DuraSpace, …
• Social Network Analysis Community – Northwestern, Davis, UCF, INSNA, …
• Schools and Consortia – CTSAs, CIC, Pitt, Emory, Iowa, Harvard, UCSF, Stanford, MIT, Brown, Michigan, Nebraska, Colorado, Duke, Hunter, OHSU, Minnesota, …
• Software downloads (>10,000) and contact list (>1,600)
• Four annual events – conference, workshop, hackathon, implementation fest
• On-line community http://vivo.sourceforge.net

Page 44: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

VIVO 2012, August 22-24, Hotel Intercontinental, Miami, Florida

Page 45: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

Thank you!

The VIVO Team 2011

Page 46: VIVO: A Semantic Web Network Enabling Collaboration Among Scientists

LEARN MORE ABOUT VIVO

• Project – http://vivoweb.org/
• Sourceforge – http://sourceforge.net/projects/vivo/
• Facebook – http://facebook.com/VIVOcollaboration
• Twitter – http://twitter.com/VIVOcollab
• Multi-site search (beta) – http://vivosearch.org/