Upload
bradley-allen
View
849
Download
1
Embed Size (px)
DESCRIPTION
A presentation describing Elsevier's perspective of linked data in STM publishing, presented 2011-12-06 at the W3C Linked Enterprise Data Patterns Workshop in Cambridge, MA.
Citation preview
Linked Data Standards and Infrastructure for Scientific Publishing
Bradley P. Allen Elsevier Labs W3C Workshop on Linked Enterprise Data Patterns 6 December 2011
The role of linked data in STM publishing
Entities, concepts and relationships
Smart Content Delivery
Better understanding through analysis and visualization •Tag clouds •Heatmaps •Streamgraphs •Scatterplots •Time series •Animations
Better discovery through semantic search & navigation •Faceted search & browse •Ontology-driven navigation •Task-specific results •Personalized/localized results •Question answering
New knowledge through aggregation and synthesis •Topic pages •Social network maps •Geolocation maps •Data mashups •Text mining reports
Images
Text
Tables
Scholarly content
Scholarly knowledge organization systems
Linked data from partners and the Web
2
3
Scientific publications as linked data
Linked data
Acquire
Transform, Enhance, Index, Analyze,
Compose
Deliver
Document
Entity record
Media object
4
• Embrace linked data principles while leveraging our existing content production workflow and infrastructure – Find the right balance between production/QA and online
delivery • Leverage partners for content enhancement and
knowledge organization – Reuse Web-standard vocabularies, taxonomies, ontologies
and entity resources where possible • Build out linked data design patterns for application
development • Deliver benefits across the complementary use cases
of researcher and practitioner
Elsevier’s approach
Elsevier work to date
• Standards – RDF named graphs
conformant with use-specific XML schemas for production/QA
– Taxonomies in SKOS • Infrastructure
– Linked Data Repository with CRUD API, Atom feeds for online delivery services
– Virtual Total Warehouse for content repository federation
• Applications – Semantic search for medical
researchers and practitioners
– Lancet, SciVerse app mashups
5
6
• Easing technology adoption by enterprise IT staff
• Best practices for knowledge organization systems management
• Infrastructure for scholarly linked data publishing
LEDP2011: what we want to discuss
7
• Tools and best practices for URL and namespace management and governance
• Best practices for publishing and consuming linked data that address IT concerns rather than legacy RDF issues – 2006 vs. later versions of “Four Principles” – Serialization “impedance mismatch” – RDF APIs vs. SPARQL – HTTP Range-14
Easing technology adoption
8
• Tools and best practices for global/local knowledge organization systems management
• Standards for named entities and registries crucial to accreditation, provenance and trust – e.g. author identifiers and profiles in ORCID
Best practices for knowledge organization
9
• Validators for linked data • Standards supporting scholarly publishing
workflows – Named graphs – Versioning – Access & entitlement
• Standards and best practices for annotation of scholarly content – e.g. CITO, SWAN, SIOC, AO, OAC
• Support for free text search
Infrastructure for scholarly linked data publishing