Upload
kfear
View
13
Download
1
Tags:
Embed Size (px)
Citation preview
Goals of this session
• Learn how the DMPTool can help you generate a DMP
• Learn the basic components of a DMP
• Understand how good data management practices translate to a good DMP
What is a DMP?
A formal plan outlining how you will handle your data throughout and after your project…
…which is now required by many funders…
…and which is a good idea anyhow, even if it’s not required.
Data Products
Describe the kind of data you’re collecting or using, whether it’s digital…
…or physical.
(or all of the above)
Data Products: What to specify
• What are your data products, both primary and derived?
• When will you collect / produce each data product?
• How much data will you generate?
Documenting data
• Data are machine readable, but must also be understandable to humans
• What information would someone else (or you, long in the future) need to understand the data?
Data and Metadata Standards: What to specify
• File formats: – Open or proprietary?• If you need special software to open a file, how will you
ensure its accessibility over time?
– Standard or non-standard?
Data and Metadata Standards: What to specify
• Naming standards: – Can you tell what a file is and what it contains
without opening it? How do your files relate to one another?
Data and Metadata Standards: What to specify
• Metadata: Contextualizing information about an object, physical or digital
• Some fields have defined standards; some repositories ask for a specific set of metadata
A DMP does NOT:
Require that you share all data with anyone who wants it
“at no more than incremental cost and within a reasonable time” (NSF)
“indicate the criteria for deciding who can receive your data” (NIH)
Access and sharing: what to specify
• What data products will you share freely? When? How?– Data necessary for replication of public results– Other data?
• What data products won’t you share freely? Why not?
• How will you resolve ethical or privacy issues?
• Consider restrictions, embargo, etc. for data that can’t be immediately shared freely
Access and sharing: What to specify
• Backup: – Where? (and what?)
• Local (hard drive, dept/local server, personal laptop, flash drive) vs. distant (PDC, hard drive at home)
• Central (PDC, UR Research) vs. cloud (Amazon, Box, CrashPlan, Google Drive)
– How often?– Who’s responsible?
• Security: Locked cabinets? Password-protected computer? Non-networked storage?
Access and sharing: Placing data in a repository
• Long-term commitment to data preservation• Higher visibility for your data• Permanent URL / DOI enables data citation• Reuse tracking and usage statistics
Access and sharing: Placing data in a repository
• UR Research: https://urresearch.rochester.edu/home.action– Example: STOP-ROP Clinical Trial
Access and sharing: Placing data in a repository
• UR Research: https://urresearch.rochester.edu/home.action
• Repository directories: re3data.org; biosharing.org
• Integration with journal submission processes
• Link to data held elsewhere• Not free: $80/submission…• …but talk to us about a voucher
Reuse and distribution
• Who is the audience for your data?• What possible uses might someone make of
your data?
• Are there any permissions restrictions necessary?
Plans for archiving and preservation
• How long should data be retained for?• Where will the data be placed for long-term
preservation? What policies are in place there to guarantee its preservation?
• How will you ensure accessibility and usability over the long term?– Data transformations?– Archiving associated information?
Revisiting Metadata and Documentation
• Information about data processing, collection details: the ‘story’ of the data
(…but it’s all in the paper!)
• Are your variable names meaningful? It is clear how different parts of the dataset relate to each other? Is it in a format others can use?
A little help: consultation
• Call me! (Or email, or drop by.)5-6882
Carlson [email protected]
• DMP consultation & review; trainings; data archiving support; etc.
A request
• When you get a grant funded, send me your DMP.
• If you’re comfortable, if you get negative feedback on your DMP, share it with me.