Upload
research-support-team-it-services-university-of-oxford
View
158
Download
0
Tags:
Embed Size (px)
DESCRIPTION
This presentation was delivered at IT Services, University of Oxford on 2014-05-28, as part of the 'Things To Do With Data' series of lunchtime talks. It offers an overview of resources available for management and support staff whose responsibilities include planning and implementing data management strategies.
Citation preview
Resources for Research Data Managers Things To Do With Data – Trinity Term 2014
Slides provided by Research Support Team, IT Services, University of Oxford
What does data include?
“A reinterpretable representation of information in a formalized manner suitable for communication, interpretation, or processing.”
Digital Curation Centre
Slide adapted from the PrePARe Project
What does data include?
Any information used in research
Slide adapted from the PrePARe Project
What does research data management cover?
Storage
Organizing
Preservation
Documenting
Sharing
Choosing technology
Versioning
Structuring
Backing up
Curation
Security
Elements of a data management strategy – questions to ask
What sort of data are we dealing with? How much? What format(s) is it in?
Is data pre-existing or newly generated? How will data be stored? How will it be backed up? Who needs to be able to access it? Are there any
security requirements? What records are being kept? How will the data be
documented? What are the long-term plans for the data?
Will it be published or shared?
Carrots and sticks
Enable efficient day-to-day work
More time for the meat of the research process
Avoid problems in the future
University of Oxford Policy on the Management of Research Data and Records
Funding body requirements
University of Oxford policy
Introduced July 2012
University of Oxford policy
The full policy can be viewed on the University of Oxford Research Data Management website
Research data is defined as the information needed ‘to support or validate a research project’s observations, findings or outputs’
Research data should be: Accurate, complete, identifiable,
retrievable, and securely stored Able to be made available to others
Funders’ requirements
Funding bodies are taking an increasing interest in what happens to research data
Researchers may be required to make data publicly available at the end of a project
Many funders require a data management plan as part of grant applications
Oxford’s RDM website provides a summary of requirements
KNOWING WHAT YOU’VE GOT
Data audits
A comprehensive overview of data holdings What is the data? How much of it is there?
Where and how is it stored? Who owns it? Most useful where there’s already an
established body of data
But can involve a lot of work
Needs input from researchers Tools can help structure the process
http://www.data-audit.eu/
http://cardio.dcc.ac.uk/
KNOWING WHERE YOU’RE GOING
Data management plans
A document created early on in a project While planning, applying for funding, or setting up
An initial plan may be expanded later Details plans and expectations for data
Nature of data and its creation or acquisition
Storage and security
Preservation and sharing
Benefits of data management plans
A framework for asking key questions before problems arise
A framework for ongoing review of data management practices
A useful way of harvesting information about current and planned research data
Lower overheads than a full data audit – especially if a DMP is being produced anyway for the funder
Data management planning tools
Online templates for data management plans
DMP Online can be customized according to funding body
https://dmponline.dcc.ac.uk/
http://www.miidi.org/dmp/
RESOURCES TO HELP YOU GET THERE
Storage
Departmental IT support may be able to provide server space or a shared drive
IT Services’ NSMS offers server rental and management, storage on the University’s private cloud, and other storage solutions
What storage media are in use?
How about file formats?
Slide adapted from the PrePARe Project
Data security
Is there data that needs special treatment?
Sensitive or confidential information
Commercial potential InfoSec at IT Services can provide advice –
see http://www.it.ox.ac.uk/infosec/ for more details
http://blogs.ch.cam.ac.uk/pmr/2011/08/01/why-you-need-a-data-management-plan/
Backing up is easier than replacing lost data…
Slide adapted from the PrePARe Project
LOCKSS – Lots Of Copies Keeps Stuff Safe
Keep copies in different places
Can the process be automated?
Slide adapted from the PrePARe Project
IT Services: data back-up on the HFS
HFS is Oxford’s central back-up and archiving service
Free of charge to University staff and postgraduates
Automated back-ups of machines connected to University network
Copies kept in multiple places
Documentation and metadata
Documentation is the contextual information required to make data intelligible and aid interpretation
A users’ guide to data
May be given at study level or data level
Metadata is similar, but usually more structured
Conforms to set standards
Machine readable<tit
le>2014
HBS Surv
ey
Results<
/title>
<author>
Joe Blog
gs
</author
>
M. F
arinelli et al. (2012) P
LoS O
NE
7(3): e34047
• Who created the data, when and why
• Description of the item• Methodology and methods• Units of measurement• Definitions of jargon,
acronyms and code• References to related data
Documentation – what needs to be included
?www.texample.net
Slide adapted from the PrePARe Project
Maintaining consistency
Agree a set of standard working practices as early as possible in a project
Method of recording what’s been done to data – and who did it
File naming conventions
Version information Have these clearly documented, and
store the documentation centrally
Help with metadata
The DCC provides advice on disciplinary metadata standards
ISA tools is a suite of open source metadata tracking tools for the life sciences
Data management tools and systems
LabTrove – an electronic lab notebook system NeuroHub – an information environment for
managing data from lab-based research DataStage – a secure personalized file
management environment myExperiment – record and
share scientific workflows Taverna – for managing
scientific workflows
Long term solutions
Data repositories or archives offer a secure long-term home for research data
Data can be embargoed if needed Databib and Re3Data.org offer searchable
catalogues of repositories Figshare offers a DIY option
Planning ahead
Data sharing needs to be planned from the beginning of a project
With sensitive data, consent may be needed
Third party data may come with restrictions If data is destined for a particular archive, they
may have specific requirements
Do they use a specific metadata schema, for example?
Data licensing
A licence clarifies the conditions for accessing and making use of a dataset
User knows what’s allowed without asking further permission
Doesn’t exclude possibility of specific requests to go beyond the terms of the licence
Licences used for data include Creative Commons and Open Data Commons
‘How to License Research Data’
One of several guides from the Digital Curation Centre
http://www.dcc.ac.uk/resources/how-guides/
Digital Curation Centre
A national service providing advice and resources for the whole research data lifecycle
http://www.dcc.ac.uk/
UK Data Archive
Largest UK collection of social sciences and humanities data
Advice on best practice for creating, preparing, storing and sharing data
http://www.data-archive.ac.uk/
UNIVERSITY SERVICES
ORA-Data (formerly known as DataBank)
University of Oxford’s institutional data archive Will work alongside ORA-Publications to form a
composite University archive Long term preservation for datasets without
another natural home Plus records for data
archived elsewhere
ORDS – Online Research Database Service
Specifically designed for academic research data Cloud-hosted and automatically backed up Web interface makes collaboration straightforward If desired, databases can easily be made public Designed to permit easy archiving Currently being used by a small group of test users –
will become more widely available later in 2014
http://ords.ox.ac.uk/
IT Services: Research Support Team
Can assist with technical aspects of research projects at all stages of the project lifecycle
Help with DMPs, selecting software or storage, etc.
But the earlier you seek advice, the better For more information, see: http
://research.it.ox.ac.uk/
Research data management website
Oxford’s central advisory website
University policy is available
Questions? Email [email protected]
http://researchdata.ox.ac.uk/
Rights and re-use
This presentation is part of a series of research data management training resources prepared by the Research Support Team at IT Services, University of Oxford.
Parts of this slideshow draw on material produced as part of the Oxford-based DaMaRO Project, and on resources produced by the PrePARe Project
With the exception of clip art used with permission from Microsoft, and commercial logos and trademarks, and images specifically credited to other sources, the slideshow is made available under a Creative Commons Attribution Non-Commercial Share-Alike License
Within the terms of this licence, we actively encourage sharing, adaptation, and re-use of this material