Data citation... Who cares? Heather Piwowar, DataONE postdoc with Dryad and NESCent. DataONE summer internship meeting, July 7, 2010

Data citations: who cares?

DESCRIPTION

Who cares how research data is attributed and cited? Lots of people. Presented by Heather Piwowar to DataONE summer internship 2010 group on data citatio

Page 1: Data citations:  who cares?

Data citation...Who cares?

Heather Piwowar

DataONE postdoc with Dryad and NESCent

DataONE summer internship meeting

July 7, 2010

Page 2: Data citations:  who cares?

http://www.metmuseum.org/toah/ho/09/euwf/ho_24.45.1.htm

Page 3: Data citations:  who cares?

http://www.flickr.com/photos/jsmjr/62443357/

Page 4: Data citations:  who cares?

http://www.flickr.com/photos/camilleharrington/3587294608/

Page 5: Data citations:  who cares?

http://www.flickr.com/photos/rkuhnau/3318245976/

Page 6: Data citations:  who cares?

http://www.flickr.com/photos/conformpdx/1796399674/

Page 7: Data citations:  who cares?

http://www.flickr.com/photos/rkuhnau/3317418699/

Page 8: Data citations:  who cares?

http://www.flickr.com/photos/zemlinki/261617721/

Page 9: Data citations:  who cares?

http://www.flickr.com/photos/tracenmatt/3020786491/

Page 10: Data citations:  who cares?

http://www.flickr.com/photos/the-o/2078239333/

Page 11: Data citations:  who cares?

Probably.

Page 12: Data citations:  who cares?

In theory.

Page 13: Data citations:  who cares?

?

Page 14: Data citations:  who cares?

• Genbank

• PDB

Page 15: Data citations:  who cares?

http://www.oxfordjournals.org/nar/database/cap/

Page 16: Data citations:  who cares?

http://www.flickr.com/photos/archeon/2941655917/

Page 17: Data citations:  who cares?

Data citation...

Page 18: Data citations:  who cares?

[Diagram: papers and datasets]

Page 19: Data citations:  who cares?

• Alas, no unique standard identifier
• URL
• accession number
• DOI
• citation to paper
• citation to database
• reference to supplementary material
• search strategy
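Because data can be pointed to in so many forms, finding mentions in full text means looking for several patterns at once. Below is a minimal Python sketch, not a method from this talk: the three regular expressions are simplified assumptions, the sample sentence is invented, and the accession number, DOI suffix, and URL in it are made up for illustration.

```python
import re

# Simplified, illustrative patterns (assumptions); real identifier
# formats vary by repository and publisher.
PATTERNS = {
    "doi": re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+"),
    "url": re.compile(r"https?://[^\s\"<>]+"),
    "geo_accession": re.compile(r"\bGSE\d+\b"),
}


def find_data_mentions(text):
    """Map each identifier type to the list of matches found in text."""
    return {
        # Drop sentence punctuation that the greedy patterns may swallow.
        kind: [match.rstrip(".,;)") for match in pattern.findall(text)]
        for kind, pattern in PATTERNS.items()
    }


# Invented sample text; none of these identifiers refer to real records.
sample = (
    "Expression data were downloaded from GEO (GSE12345). "
    "The alignment is available at doi:10.5061/dryad.example "
    "and at http://example.org/data/alignment.nex."
)
mentions = find_data_mentions(sample)
for kind, found in mentions.items():
    print(kind, found)
```

Citations to a paper or database, references to supplementary material, and search strategies have no fixed pattern, so a sketch like this would miss them entirely.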

Page 20: Data citations:  who cares?

Example: full-text phrases containing “... accessed”

Page 21: Data citations:  who cares?

“submitted”

Page 22: Data citations:  who cares?

“downloaded”

Page 23: Data citations:  who cares?

• Citations are indexed and machine-extractable

Page 24: Data citations:  who cares?

[Diagram: papers and datasets]

Page 25: Data citations:  who cares?

• understand current practice
• articulate the best best-practices

Page 26: Data citations:  who cares?

[Diagram: papers and datasets]

Page 27: Data citations:  who cares?

Who cares?

Page 28: Data citations:  who cares?

1.  Data creators

• personal reward
• motivation:
  • “if it really helped”
  • even esoteric datasets are useful
• how prevalent is scooping?
• alert to possible misuses
• grounded requirements

Page 29: Data citations:  who cares?

2.  Data reusers

• clear guidelines are helpful
• what has been reused, for what?
• what hasn’t?

Page 30: Data citations:  who cares?

3.  Repository creators, maintainers

• funding
• how much metadata
• how to format
• what additional tools are useful
• lifecycle of data

Page 31: Data citations:  who cares?

4.  Funders

• most, best science for their money
• cost/benefit of mandate
• inform funding decisions:
  • what has been extra useful?
  • what hasn’t?
• what support is needed

Page 32: Data citations:  who cares?

5.  Journals

• increasingly called upon to mandate or fund:
  • how to decide
  • how to rationalize
• another avenue to compete

Page 33: Data citations:  who cares?

6.  Information scientists

• extension of citation analysis for studying information behaviour

Page 34: Data citations:  who cares?

6.  Me

Page 35: Data citations:  who cares?
Page 36: Data citations:  who cares?

Articles published in journals with a strong data-sharing policy are more likely to have publicly available datasets

Page 37: Data citations:  who cares?

Reuse estimate

• 2703 submissions in 2007
• GSE* in PubMed Central
• Exclude author overlap
• Exclude data creation

• automatically, manually

• 139

• 520

Page 38: Data citations:  who cares?
Page 39: Data citations:  who cares?
Page 40: Data citations:  who cares?
Page 41: Data citations:  who cares?

7.  You

Page 42: Data citations:  who cares?

8.  Your mom

Page 43: Data citations:  who cares?

9.  These mice

http://www.flickr.com/photos/ryanr/142455033/

Page 44: Data citations:  who cares?

10.  Scientific progress

• trace errors, fraud• increase transparency• more efficient and effective

Page 45: Data citations:  who cares?

you can not manage what you do not measure

quote: Lord Kelvin

http://www.flickr.com/photos/archeon/2941655917/

Page 46: Data citations:  who cares?

science about our science

Page 47: Data citations:  who cares?

http://www.flickr.com/photos/druclimb/293046352/

Page 48: Data citations:  who cares?

questions?

Thanks to:

NSF, DataONE, NESCent, Dryad

UBC Dept of Zoology

NLM, U of Pittsburgh Dept of Biomedical Informatics

Open science online community and those who release their articles, datasets and photos openly