29
Refactoring HUBzero for Linked Data Michael Witt Yongyang Yu Purdue University JCDL’12, Washington DC

Refactoring HUBzero for Linked Data

Embed Size (px)

DESCRIPTION

This the presentation file from the JCDL'12 which talks about HUBzero and linked data. We refactored HUBzero and exposed the metadata information in a linked data style. One salient contribution of our work is that we implemented OAI-ORE in HUBzero and exposed the aggregation relationships in linked data. To show how user agents and human users can benefit from our work, we design and implement 3 tools, OAI-PMH data provider, RDF crawler and graph browser. The first two tools show how user agents can leverage our refactoring process to make use of the metadata and the last tool can serve as a debugging tool for human users.

Citation preview

Page 1: Refactoring HUBzero for Linked Data

Refactoring HUBzero for Linked

Data

Michael WittYongyang Yu

Purdue UniversityJCDL’12, Washington DC

Page 2: Refactoring HUBzero for Linked Data

Outline HUBzeroLinked DataData ModelingSoftware &ToolsConclusion

Page 3: Refactoring HUBzero for Linked Data

What is HUBzero?

• Web-based virtual research environment

• Publish and reuse resources (software, videos, datasets, etc.)

• LAMP stack (Linux, Apache, MySQL, PHP)

• Joomla! framework

Page 4: Refactoring HUBzero for Linked Data

HUBzero• Platform for scientific collaboration• Create and upload resources by

types• Courses, seminars, tools, downloads,

series, workshops, publications, and teaching materials

• Tag, rank, discuss, etc.

Page 5: Refactoring HUBzero for Linked Data

HUBzero Features• Interactive simulation tools – zoom in

on a graph, rotate a molecule, etc.• Online presentations – deliver online

presentation in flash format• Tool development areas – source

code management and version control

Page 6: Refactoring HUBzero for Linked Data
Page 7: Refactoring HUBzero for Linked Data
Page 8: Refactoring HUBzero for Linked Data
Page 9: Refactoring HUBzero for Linked Data
Page 10: Refactoring HUBzero for Linked Data
Page 11: Refactoring HUBzero for Linked Data

HUBzero Reference• http://hubzero.org• M.McLennan and R. Kennell,

HUBzero: a platform for dissemination and collaboration in computational science and engineering

Page 12: Refactoring HUBzero for Linked Data

What is Linked Data?

Page 13: Refactoring HUBzero for Linked Data

Linked Data

• URIs (names for various things)• HTTP URIs• Publish linked data in RDF format on

the Web• Links to other URIs

Page 14: Refactoring HUBzero for Linked Data

Data Modeling

Linked Data OAI-ORE Dublin Core Creative Commons

Page 15: Refactoring HUBzero for Linked Data

Object Reuse and Exchange (OAI-ORE)• Aggregation• Aggregated Resources (Nested

Aggregations)• Resource Maps

Page 16: Refactoring HUBzero for Linked Data

Dublin Core Mapping to HUBzero

Dublin Core HUBzero tabledc:title jos_resources.title

dc:creator jos_resources.created_bydc:subject jos_tags.raw_tag

dc:date jos_resources.createddc:identifier jos_resources.id

dc:description jos_resources.introtextdc:type jos_resources.type

dc:publisher (statically assigned)dc:rights jos_resources.params

Page 17: Refactoring HUBzero for Linked Data

Hub

downloads

Slimy salamander

Slimy.m4v

RM

RM

RM

<http://hub#agg> <ore:aggregates> <http://hub/downloads#agg>

Example: Modeling HUBzero with OAI-ORE

Page 18: Refactoring HUBzero for Linked Data

Implementation

• Resource component• Exposing metadata from MySQL RDB• Serialized in RDFa+XHTML

Page 19: Refactoring HUBzero for Linked Data

Aggregation

Resource Map

ore:aggregates

Page 20: Refactoring HUBzero for Linked Data

Software & Tools We Have Developed• Implementation of OAI-ORE for HUBzero• OAI-PMH Data Provider• RDF Crawler• Graph Browser

Page 21: Refactoring HUBzero for Linked Data

OAI-PMH Data Provider

• Implemented as a stand-alone component in Joomla

• Serialized as RDF/XML• Each resource type OAI-PMH set• OAI-ORE Resource Maps

ListMetadataFormats• ore:aggregates ListRecords

Page 22: Refactoring HUBzero for Linked Data

RDF Crawler

• A demonstration of utilizing linked data from the Web

• Starting from the top level of the hub• Parsing XHTML and dereferencing URIs• RDF triples to N-Triple file• N-Triple to Jena TDB triple store• Basis of the graph browser

Page 23: Refactoring HUBzero for Linked Data

Graph Browser

Page 24: Refactoring HUBzero for Linked Data

Graph Browser

HTTP URIs

string literalsshared boundaryof a larger graph

Page 25: Refactoring HUBzero for Linked Data

Graph Browser

Page 26: Refactoring HUBzero for Linked Data

Graph Browser

Page 27: Refactoring HUBzero for Linked Data

Summary

• Successfully extended and refactored HUBzero for Linked Data

• Exposed Hub resources with OAI-ORE, Dublin Core, Creative Commons, etc.

• Built software and tools to implement and utilize Linked Data from HUBzero

Page 28: Refactoring HUBzero for Linked Data

Future work• Incorporate richer and more specific

vocabularies e.g. Library of Congress Subject Headings (LCSH)• Enable users to create and publish their own

collections (Aggregations)• Enable semantic search

Page 29: Refactoring HUBzero for Linked Data

Acknowledgements

• National Leadership Grant(LG-05-10-0117-10) from

Institute of Museum and Libraries Services