Research Data
Management (RDM)19.10.16Ben Mollitt,
Research Data Manager
This session will cover:
what is research data and RDM?
the benefits of RDM and making research data openly available
policy environment and funders’ expectations
research data lifecycle
support and resources for managing your data
demonstrations of useful tools / services
Overview
What is research data?
Any recorded information necessary to support or validate a research project’s observations, findings
or outputs, regardless of format.
https://www.liverpool.ac.uk/media/livacuk/computingservices/research-data-management/researchdatamanagementpolicy.pdf
What is research data management?
“an explicit process covering the creation and stewardship of research materials to enable
their use for as long as they retain value.”
Data Management is part of good research practice
Benefits of managing research data and making it openly available
- Integrity and efficiency of research- Impact and visibility of research- To stop yourself drowning in irrelevant information- In case you need the data later- Increased citations- Data sharing and re-use- Avoid duplication- Interdisciplinary collaborations - Data security and preservation- Enables easier location and understanding of files - Compliance with funder policies
Trend towards coalescence on open research data
Principle 1 - enabler of high quality research, facilitator of innovation and safeguards good practice Principle 2 - sound reasons why openness may need to be restricted but must be justified and justifiablePrinciple 3 - carries significant cost which must be respected by all partiesPrinciple 4 - right of the creators to reasonable first use is recognisedPrinciple 5 - use of others’ data should always conform to legal, ethical and regulatory frameworkPrinciple 6 - good data management is fundamental to all stages of research process and should be established at outsetPrinciple 7 - data curation is vital to make data useful and for long-term preservationPrinciple 8 - data supporting publications should be accessible by the publication date and citeablePrinciple 9 - support for development of appropriate data skills is recognisedPrinciple 10 - regular reviews of progress towards open data should be taken.
http://www.rcuk.ac.uk/documents/documents/concordatonopenresearchdata-pdf/
What do funders ultimately expect?
Data management plan
Timely release of data
Open data sharing
Preservation of data
The Research Data Lifecycle
PLAN
RE-USE
PUBLISH & SHARE
CREATE & ORGANISE
STORE
PRESERVE
The Data Lifecycle - Plan
PLAN
RE-USE
PUBLISH & SHARE
CREATE & ORGANISE
STORE
PRESERVE
Data Management Plans
Ethics
Intellectual Property
DMPonline
dmponline.dcc.ac.uk
• A web-based tool to help researchers write data management plans
• Contains templates and guidance for many of the main funders
• Allows DMPs to be shared with collaborators as they are being developed
• Facility to export DMP in a variety of different formats for submission with a grant application
• Funder templates will be customised with Liverpool-specific guidance and best practice examples
The Data Lifecycle - Store
PLAN
RE-USE
PUBLISH & SHARE
CREATE & ORGANISE
STORE
PRESERVE
Storage options
Access options
Security
Back up
Liverpool’s Active DataStore• Fast, high quality, high
capacity storage with guaranteed backup and resilience
• Data is conveniently accessible from wherever and whenever required using DatAnywhere
• 1TB of storage per research project by default. Additional storage available on request https://liverpool.service-now.com/ess/order_rdm.do
The Data Lifecycle – Publish & Share
PLAN
RE-USE
PUBLISH & SHARE
CREATE & ORGANISE
STORE
PRESERVE
Where to publish
Data access statements
Licensing
Domain data repository
General data repository e.g. Figshare, Zenodo, Dryad
Institutional data repository – Liverpool’s DataCat
Journal supplementary material
Project or departmental web page
Options for publishing data
What is metadata?
Data about data
Metadata for DISCOVERY
Metadata enabling REUSE
It can be helpful to define research metadata by its use:
DataCat: Liverpool’s Research Data Cataloguedatacat.liverpool.ac.uk/
• Create records of information about finalised research data, and save data in a secure online environment
• Two types of record:Discovery-only – data is held
elsewhere but a record is provided to help people find it
Discovery and data – data is also deposited into DataCat, which creates a Digital Object Identifier (DOI) for citations
• Not for storing of active data, i.e. data being added to, or not yet cleaned and processed – see Active DataStore
Data Access Statements
Data Access Statement: All data supporting this study are openly available from https://dx.doi.org/10.17638/datacat.liverpool.ac.uk/171
The Data Lifecycle – Re-use
PLAN
RE-USE
PUBLISH & SHARE
CREATE & ORGANISE
STORE
PRESERVE
• Data Citation
• Innovative reuse
• Secondary/Meta Analysis
• Teaching and Learning
• A dataset citation includes all of the same components as any other citation
• Creator (Publication Year): Title. Publisher. Identifier
Data Citation
Example:
Irino, T; Tada, R (2009): Chemical and mineral compositions of sediments from ODP Site 127-797. Geological Institute, University
of Tokyo. http://dx.doi.org/10.1594/PANGAEA.726855
Summary of RDM services at Liverpool
Data Management Planning DMPonline – customised for Liverpool
Active DataStore Storage infrastructure and secure backups for active data
Data Catalogue Preservation and sharing of finalised research data Dataset validation – checking of the metadata DOI minting – to provide a persistent identifier for datasets
Further information and contact details- Liverpool RDM web pages:www.liverpool.ac.uk/csd/research-data-management/
- Liverpool RDM policy:www.liverpool.ac.uk/media/livacuk/computingservices/ research-data- management/Research,Data,Management, Policy,04.pdf
- RDM queries:[email protected] Mollitt (Research Data Manager)[email protected] Gary Jeffers (Research Data Management Officer)[email protected] Victoria Barragan (Research Data Management Assistant)[email protected]
Thank you.