Upload
datadryad
View
271
Download
2
Embed Size (px)
DESCRIPTION
Presentation by Natalia Manola on OpenAire and data publishing given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
Citation preview
OpenAIREOpen Knowledge & Scientific Information
Infrastructure
Natalia ManolaUniversity of Athens, Greece
Linking
Citation
Classification
De-duplication
Cleaning & Transformation
Validation
Publication repositoriesInstitutional & ThematicOpen Access Journals
Data repositoriesData Journals
CRIS systems
Funding information
Registries
OpenAIRE in a nutshell
Publication in context
Statistics
Learning Material Objects
Public Sector Information
Semantic publishing for OpenAIRE
• Linked entities • Beyond a flat data model – CERIF compliant
• Overlapping efforts in data modelling basic entities
• Using multiple identifier schemes• Discipline specific best practices (DOIs, PIDs, URI/URN’s, db
ids, …)
• Contextualizing by relationships • Multiple types and vocabularies
Publications in context
The future of data publishing. Oxford May 22, 2013 3
Semantic enrichment services
• Citation discovery• Text mining – lots of it…
• Discipline specific algorithms
• Classification• Supervised
• Discipline specific vocabularies – library oriented
• Training sets – hard to find
• Unsupervised classification• Interdisciplinary complexity
• Finding trends
Citation, classification, clustering
The future of data publishing. Oxford May 22, 2013 4
Zenodo
• Metadata general enough not to capture discipline
semantics
• Different types of material• Supplementary data or …?
• Context in relation to funding and publication
• Community regulated quality
• To be linked to OpenAIRE text mining services for
metadata enrichment
An all purpose data repository – www.zenodo.org
The future of data publishing. Oxford May 22, 2013 5
Challenges•Implementation of guidelines/standards
•OpenAIRE guidelines for literature, data, CRIS• Global alignment and adoption (RDA, WDS, W3C, …)
•Uniform vocabularies to support• Interdisciplinary classification
• Multilinguality (e.g., EUROVOC)
• Links to other domains
•Links to other domains• Mapping of data models (DCAT, LOM, …)
• Existing projects (e.g., fp7 ENGAGE)
•Tools for semantic enrichment at publishing time
The future of data publishing. Oxford May 22, 2013 6
www.openaire.eu@openaire_eufacebook.com/groups/openaire linkedin.com/groups/OpenAIRE-3893548
Thank you!OpenAIRE / LIBER workshop @ Ghent May 28, 2013
Dealing with data – what’s the role for the library?
The future of data publishing. Oxford May 22, 2013 7