Transcript
Page 1: VRA 2014 Brave New World Cataloging, Mixter

VRA Core as Linked Data

03-15-2014

Jeff Mixter

Research Support Specialist

OCLC Research

[email protected]

Page 2: VRA 2014 Brave New World Cataloging, Mixter

• This project evolved out of a master’s thesis project published as partial completion of an MLIS degree from Kent State University

• The emergence of Schema.org as a general Linked Data vocabulary and the publication of WorldCat.org data as Linked Data served as a catalyst for the work

• CIDOC CRM and the Europeana Data Model were examples of how visual resource data models could be represented using Linked Data

Background

Page 3: VRA 2014 Brave New World Cataloging, Mixter

• Collaborative effort between Google, Yahoo, Bing and Yandex

• 2011

• Serves as a general Linked Data vocabulary

• 15% of Web pages use Schema.org markup

• Data using Schema.org is understood and indexed by search engines as structured data

• Using Schema.org has the possibility to help improve SEO

• SEO in context!

• SEO != Google Ranking

• Search engine understanding of your structured data

Schema.org

Page 4: VRA 2014 Brave New World Cataloging, Mixter

• Structured data drives the creation of Google Knowledge Cards

• These “entity cards” are derived from structured data that Google has harvested from the Semantic Web

Structured Data

Page 5: VRA 2014 Brave New World Cataloging, Mixter

Structured Data

vs.

Page 6: VRA 2014 Brave New World Cataloging, Mixter

• Both of these data models have produced Linked Data representations

• Both use a specialized vocabulary

• The models were reviewed and consulted during the modeling/mapping process

CIDOC CRM and Europeana

Page 7: VRA 2014 Brave New World Cataloging, Mixter

• The initial research project sought to map the VRA Core 4 model into Linked Data

• Use Schema.org as the baseline vocabulary

• Helps improve interoperability

• Follow the Linked Data principles outlined and described by Tim Berners-Lee

• Do not reinvent the wheel

• Gov 2.0 Expo 2010

• The project successfully modeled the VRA Core 4 data model as Linked Data and demonstrated this by converting an existing VRA Core 4 compliant dataset (XML) into Linked Data

Initial Project

Page 8: VRA 2014 Brave New World Cataloging, Mixter

• Linked Data model was mapped from the VRA Core 4 Restricted Schema

• The model included:

• 35 Schema.org terms

• 1 DC Terms term

• 2 VoID terms

• 127 Custom VRA Core terms

• Of the 127 custom terms, 88 (66%) were positioned as sub-terms of Schema.org terms

• Overall 74% of the VRA Core terms were directly or indirectly mapped to Schema.org

Project Results

Page 9: VRA 2014 Brave New World Cataloging, Mixter

• After the prototype and results were published, there was interest in using the model

• The VRA Core Oversight Committee subsequently formed a Task Force to design and publish as VRA Linked Data data model

• Current members:

• Esme Crowles

• Trish Rose-Sandler

• Johanna Bauman

• Rebecca Guenther

• Jeff Mixter

Aftermath

Page 10: VRA 2014 Brave New World Cataloging, Mixter

• Adapt the prototype model to create an official VRA Core data model

• Using a VRA namespace!

• In line with the principles of Linked Data, there will be an emphasis on mapping the VRA Core model to other Linked Data vocabularies

• Schema.org

• BIBFRAME

• Ect.

Current Project

Page 11: VRA 2014 Brave New World Cataloging, Mixter

• Prototype of the ontology:

• http://www.essepuntato.it/lode/https://s3.amazonaws.com/VRA/Ontology/VRA_OntologyRevised.owl

• Converted a data sample for review

• The model is just a prototype and we welcome comments, questions, criticism, etc.

Progress

Page 12: VRA 2014 Brave New World Cataloging, Mixter

• Getty vocabularies are being published as Linked Data

• This is very important to the micro-domain of visual/cultural heritage resources

• In particular it is important to VRA since it is a standard authority recommended in the VRA Core 4 Schema

• Collaboration and integration with other Linked Data models

• The Semantic Web relies on interconnected datasets/models

• Data silos are bad

Future Potential

Page 13: VRA 2014 Brave New World Cataloging, Mixter

©2013 OCLC. This work is licensed under a Creative Commons Attribution 3.0 Unported License. Suggested attribution: “This work uses content from [presentation title] © OCLC, used under a Creative Commons Attribution license: http://creativecommons.org/licenses/by/3.0/”

Thanks!

Question?