45
Save the Cows! Cyberinfrastructure for the rest of us Dorothea Salo Digital Repository Librarian University of Wisconsin 11 March 2009

Save the Cows! Cyberinfrastructure for the Rest of Us

Embed Size (px)

DESCRIPTION

Expanded version of the Save the Cows presentation, for a mixed librarian/IT-professional audience.

Citation preview

Page 1: Save the Cows! Cyberinfrastructure for the Rest of Us

Save the Cows!

Cyberinfrastructure for the rest of us

Dorothea SaloDigital Repository Librarian

University of Wisconsin11 March 2009

Page 2: Save the Cows! Cyberinfrastructure for the Rest of Us

E-ScienceE-Science

Cyberinfrastructure

Data Curation

Grid Computing

Metadata

IT? Libraries?Faculty?

Data mining

EXABYTESPetabytes

TerabytesE-Research

Standards

AAAARGH!Collaboration Identity

Page 3: Save the Cows! Cyberinfrastructure for the Rest of Us

It’s simpler than that.

(thank goodness!)

Page 4: Save the Cows! Cyberinfrastructure for the Rest of Us

Scholars use

in their research

Page 5: Save the Cows! Cyberinfrastructure for the Rest of Us

This produces

DATA.

Page 6: Save the Cows! Cyberinfrastructure for the Rest of Us

In addition to

DATA.

Page 7: Save the Cows! Cyberinfrastructure for the Rest of Us

So now we have to support that.

Data generation

Data management

Data storage

Data certification

Data discovery and reuse

Page 8: Save the Cows! Cyberinfrastructure for the Rest of Us

That’s all this is about. Really.

Page 9: Save the Cows! Cyberinfrastructure for the Rest of Us

What I will not talk about today

• Collaboration technology

• Identity-management, authentication, authorization, etc.

• Grid computing

• Instrument science

• Open Notebook Science

Of course these are important. I’m just not competent to opine. Fortunately, you have Melissa!

Page 10: Save the Cows! Cyberinfrastructure for the Rest of Us

What I’m on about

DATA.

Page 11: Save the Cows! Cyberinfrastructure for the Rest of Us

Data?

Page 12: Save the Cows! Cyberinfrastructure for the Rest of Us

Charts and graphs are DEAD data

Killed! Cut in pieces!

Ground up! Unrecognizable!

Not revivable! Not reusable!

Page 13: Save the Cows! Cyberinfrastructure for the Rest of Us

Okay, what’s data, then?

We have to save the cows!

Page 14: Save the Cows! Cyberinfrastructure for the Rest of Us

In case you’re wondering...

“Converting PDF to XML is a bit li

ke

converting hamburgers into cows.”

—Michael Kay

<http://l

ists.xml.

org/arch

ives/xm

l-dev/20

0607/

msg0050

9.html>

Page 15: Save the Cows! Cyberinfrastructure for the Rest of Us

Do we have to keep data?

SOMETIMES.(but it’s often a good idea even if

you don’t have to)

Page 16: Save the Cows! Cyberinfrastructure for the Rest of Us

Funders may require it.

Page 17: Save the Cows! Cyberinfrastructure for the Rest of Us

Journals may require it.

Page 18: Save the Cows! Cyberinfrastructure for the Rest of Us

Here’s the catch

Some of these placeshave built barns

for the cows.Many haven’t.

Page 19: Save the Cows! Cyberinfrastructure for the Rest of Us

Guess who’s on if they don’t?

Page 20: Save the Cows! Cyberinfrastructure for the Rest of Us

What can be done with data?

• Experimental validation

• Meta-analysis, data-mining, mashups

• Interdisciplinary investigation

• Historical investigation

• Modeling and model validation

• ... the possibilities are endless—IF we have the cows the data.

Page 21: Save the Cows! Cyberinfrastructure for the Rest of Us

Is all data from “BIG SCIENCE”?

Page 22: Save the Cows! Cyberinfrastructure for the Rest of Us

Absolutely not.

(they don’t even need our help)

Page 23: Save the Cows! Cyberinfrastructure for the Rest of Us

“Small Science”

Less money

Less know-how

In aggregate? MORE COWS.

Page 24: Save the Cows! Cyberinfrastructure for the Rest of Us

Arts & Humanities

Page 25: Save the Cows! Cyberinfrastructure for the Rest of Us

Here’s the catch.

Page 26: Save the Cows! Cyberinfrastructure for the Rest of Us

Nobody knowshow to do all this.

(yet)

Page 27: Save the Cows! Cyberinfrastructure for the Rest of Us

But we do know a few things...

Page 28: Save the Cows! Cyberinfrastructure for the Rest of Us

Cows are dumb.

They will not save themselves.

Page 29: Save the Cows! Cyberinfrastructure for the Rest of Us

It takes a village

to save the cows.

Page 30: Save the Cows! Cyberinfrastructure for the Rest of Us

ResearchersCan you tell a Holstein from an Angus?

Me neither.

But researchers know their cows.

Page 31: Save the Cows! Cyberinfrastructure for the Rest of Us

Information Technologists

Page 32: Save the Cows! Cyberinfrastructure for the Rest of Us

Librarians

But what I see happening is .

.. this b

eautiful

combination of understanding the str

ucture of

information, and understanding the code that goes

behind it, and how to make it u

sable to the people

who want to access it

. I think that w

e used to talk

about blended, or the hybrid lib

rarian — now that’s

the librarian.

“Librarian 15”

Palmer et al., “Identify

ing Factors of Success...

Page 33: Save the Cows! Cyberinfrastructure for the Rest of Us

Grant administrators

Cows don’t corral themselves.Neither do researchers.

Page 34: Save the Cows! Cyberinfrastructure for the Rest of Us

The big gray area

Informaticists?

Researchers who code?

IT pros who grok metadata?

Librarians who model data?

Page 35: Save the Cows! Cyberinfrastructure for the Rest of Us

Great. So now what?

Page 36: Save the Cows! Cyberinfrastructure for the Rest of Us

Find use cases

Page 37: Save the Cows! Cyberinfrastructure for the Rest of Us

Plan for infrastructure

Page 38: Save the Cows! Cyberinfrastructure for the Rest of Us

Build alliances

Page 39: Save the Cows! Cyberinfrastructure for the Rest of Us

Start conversations

Page 40: Save the Cows! Cyberinfrastructure for the Rest of Us

Ten Questions1. What is the story of your data?2. What form and format are the data in?3. What is the expected lifecycle of your data?4. How could your data be used, reused, and repurposed?5. How large is your dataset, and what is its rate of

growth?6. Who are the potential audiences for your data?7. Who owns the data?8. Does the dataset include any sensitive information?9. What publications or discoveries have resulted from the

data?10.How should the data be made accessible?

—Michael Witt and Jake Carlson, Purdue University

Page 41: Save the Cows! Cyberinfrastructure for the Rest of Us

Keep an eye out

Page 42: Save the Cows! Cyberinfrastructure for the Rest of Us

If this seems like common sense...

... good! It mostly is!

Page 43: Save the Cows! Cyberinfrastructure for the Rest of Us

Thank you!

(and save a cow today!)

Page 44: Save the Cows! Cyberinfrastructure for the Rest of Us

• Title slide: http://www.flickr.com/photos/flikr/131673772/• Server rack: http://www.flickr.com/photos/dumbledad/3276756770/• Command centre: http://www.flickr.com/photos/soundman1024/2054512893/

• Laptop: http://www.flickr.com/photos/arbron/56216464/• Dual-monitor setup: http://www.flickr.com/photos/blakespot/2372432028/

• Photo-data: http://www.flickr.com/photos/51114580@N00/1597765466/• Word cloud: http://www.flickr.com/photos/55772089@N00/3291287830/• Internet map: http://www.flickr.com/photos/jurvetson/63009926/

• Dhaka image: http://www.flickr.com/photos/ahaqueusa/1268467179/• Plant cross-section: http://www.flickr.com/photos/tonios-pics/387510805/

• Journals: http://www.flickr.com/photos/emdot/56157732/• Books: http://www.flickr.com/photos/guwashi999/2635608241/• Manuscript: http://www.flickr.com/photos/86624586@N00/10187684/

• Hamburger: http://www.flickr.com/photos/nadya/1019816514/• Row of cows: http://www.flickr.com/photos/flikr/230379411/

• Beware of cow: http://www.flickr.com/photos/tm-tm/2339539399/• Cowboys: http://www.flickr.com/photos/bistrosavage/30710414/• Hands: http://www.flickr.com/photos/iandesign/1204632335/

• Money: http://www.flickr.com/photos/emraya/2867188734/• Barn: http://www.flickr.com/photos/efleming/2814015008/

• Hook: http://www.flickr.com/photos/28481088@N00/2077768050/• Large Hadron Collider: Fanny Schertzer, Wikimedia Commons• One Size Fits: http://www.flickr.com/photos/hmk/2280657662/

• Herd: http://www.flickr.com/photos/krossbow/2530875540/• Angus: http://www.flickr.com/photos/royalty-free-images/139138902/

• Digital libraries: http://www.flickr.com/photos/dullhunk/3272867908/• Holstein: http://www.flickr.com/photos/jdickert/539619160/• Green tech: http://www.flickr.com/photos/jurvetson/2126204366/

• Permission in advance: http://www.flickr.com/photos/mikeblogs/2762543380/• Librarian: http://flickr.com/photos/webchicken/1352009526/

• Org chart: http://www.flickr.com/photos/mwichary/2356663850/• Conversation: http://flickr.com/photos/eggybird/97707771/• Rodeo: http://www.flickr.com/photos/omaromar/49239249/

• Thumbs up: http://www.flickr.com/photos/striatic/2135057566/• Cow eye: http://www.flickr.com/photos/foxypar4/918567682/

Credits

Page 45: Save the Cows! Cyberinfrastructure for the Rest of Us

Thank you!

(and save a cow today!)