DATA CURATION ISSUES Michelle Hudson SCOPA Forum 9.21.11

DESCRIPTION

A very short, very minimal presentation I prepared for the Yale Libraries' SCOPA event to introduce librarians in diverse disciplines to the concepts and challenges of data curation.

WHAT IS DATA?

Definition varies by discipline and can include experimental, observational, and computational data.

In general “research data” refers to raw or processed products of a research project.

These products can be video, images, or numeric files in the form of geographic information, spreadsheets, and other formats.

WHAT IS DATA CURATION?

“Data curation is the active and ongoing management of research data through its lifecycle of interest and usefulness to scholarship, science, and education.” – Carole Palmer, UIUC GSLIS

“Curation” includes selection, appraisal, maintenance, and preservation.

WHY IS DATA CURATION IMPORTANT FOR US?

According to Paul F. Uhlir, Director of the Board on Research Data and Information, researchers are “contributing to a networked information enterprise where data are a fundamental infrastructural component of the modern research system.”

Increasingly, data itself is a product and record of scholarship.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

No standards.

Lack of interoperability.

Controlled vocabularies are missing.

Storage space is limited.

Domain of stewardship/responsibility is unclear.

Individual repositories make silos of content.

IDEAS FOR SOLUTIONS!

Experiment tracking software and electronic notebooks.

Automatic metadata.

Integrating curation early into the researcher workflow.

Educating graduate students on proper data management.

DataONE and the Data Conservancy.
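
As a rough illustration of the "automatic metadata" idea, here is a minimal sketch, assuming Python and only its standard library. The helper name `describe_file` and the field names are hypothetical, chosen for this example; they are not taken from any standard or from the projects named above. It records technical details the computer can work out on its own, so nobody has to type them in by hand.

```python
import hashlib
import mimetypes
import os
from datetime import datetime, timezone


def describe_file(path):
    """Return basic technical metadata for one file (hypothetical helper)."""
    stat = os.stat(path)

    # Hash the file in chunks so large data files are not read into memory at once.
    sha256 = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            sha256.update(chunk)

    # Guess the media type from the file extension; fall back to a generic type.
    media_type, _ = mimetypes.guess_type(path)

    return {
        "filename": os.path.basename(path),
        "size_bytes": stat.st_size,
        "modified_utc": datetime.fromtimestamp(
            stat.st_mtime, tz=timezone.utc
        ).isoformat(),
        "media_type": media_type or "application/octet-stream",
        "sha256": sha256.hexdigest(),
    }
```

A record like this only covers what the file system can report. Descriptive metadata (who collected the data, why, and under what conditions) still has to come from the researcher.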

OTHER STUFF!

Data citation

Data sharing

Reward models

Identity control (ORCID, EZID)

Semantic web and linked data

Cyberinfrastructure

QUESTIONS?

[email protected]

@michellehudson

in person for coffee @ kbt cafe