"Data in the Digital Age" - Hadoop Big Data Meetup

Preview:

DESCRIPTION

 

Citation preview

made available by Paul Keller under a CC-BY-2.0 license

Thursday, 14 July 2011

made available by Paul Keller under a CC-BY-2.0 license

well .. sort of.

Thursday, 14 July 2011

data in the digital age

kaitlin thaneyaustin big data user group, 13 july 2011

austin, texas

Thursday, 14 July 2011

xi. background

Thursday, 14 July 2011

about me

about me

sameAs

expattechnologistopen science

Thursday, 14 July 2011

Thursday, 14 July 2011

technology companypublisher link

london, nyc, tokyo

Thursday, 14 July 2011

investment armincubator rolein-house dev

Thursday, 14 July 2011

tiered approachbuild to scale

researcher-focused

Thursday, 14 July 2011

<text>

Thursday, 14 July 2011

about

1815

Thursday, 14 July 2011

first geological map“strata”

reputation

Thursday, 14 July 2011

data ... metadata / markup

experimentationmetrics

Thursday, 14 July 2011

1. science is changing

Thursday, 14 July 2011

1. science is changing(and the research workflow)

Thursday, 14 July 2011

research

idea

experiment

lit review

materials

publish

share results

retestanalyze

collect data

Thursday, 14 July 2011

blocking points

idea

experiment

lit review

materials

publish

share results

retestanalyze

collect data

(to name a few ... )

Thursday, 14 July 2011

types of information

idea

protocolsparameters

content

the non-digital “stuff”

articlesproceedings

share results

retestanalysissynthesis

datasets

(will revisit later)

prof activitiesmentorship

patents

Thursday, 14 July 2011

text texttext

Thursday, 14 July 2011

Thursday, 14 July 2011

remaining roadblocksspecialisation of tools (+/-)

interoperabilityaccessibility

design decisionsthe “social issue”

Thursday, 14 July 2011

2. focus areas

Thursday, 14 July 2011

(3)

Thursday, 14 July 2011

knowledge discovery

software applications

research management

Thursday, 14 July 2011

knowledge discovery

software applications

research management

Thursday, 14 July 2011

data ...content, compounds,

collections

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

text texttext

Thursday, 14 July 2011

can fine-tune

Thursday, 14 July 2011

name disambiguation

Thursday, 14 July 2011

10,11-dihydro-5-methyl-5H-dibenzo[b,e][1,4]diazepin-11-one

Thursday, 14 July 2011

knowledge discovery

software applications

research management

Thursday, 14 July 2011

gigabytes, not terabytes

Thursday, 14 July 2011

CC-BY-2.0 - Plaxco Lab - http://www.flickr.com/photos/34857812@N04/

Thursday, 14 July 2011

Thursday, 14 July 2011

better trackingis needed

Thursday, 14 July 2011

the non-digital

+ordering

processing

Thursday, 14 July 2011

protocols parameterscalibrationliterature

Thursday, 14 July 2011

parameters

the non-digital

Thursday, 14 July 2011

about

the non-digital

Thursday, 14 July 2011

not just tracking, but organisation +

analysis

Thursday, 14 July 2011

Thursday, 14 July 2011

000s100s200s300s400s500s600s

Thursday, 14 July 2011

000s100s200s300s400s500s600s

... philosophy/psychreligionsocial sci...lang, natsci, mathstech/appliedsci

Thursday, 14 July 2011

000s100s200s300s400s500s600s

computersciphilosophy/psychreligionsocial sci...lang, natsci, mathstech/appliedsci

problems

Thursday, 14 July 2011

spatial, topical mappingarbitrary, heuristic

difficult to edit

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

knowledge discovery

software applications

research management

Thursday, 14 July 2011

data capture(of a different sort)

Thursday, 14 July 2011

tools for decision makers (research admin / funders)

using technology to spur cultural shift

Thursday, 14 July 2011

existing system is imperfect

Thursday, 14 July 2011

“Right now we're going through a Cambrian explosion of metrics.”

- Johan Bollen

Nature 465, 864-866 (2010) | doi:10.1038/465864a

Thursday, 14 July 2011

citation / impact factorh - index

weighted citations (eigenfactor, sjr)“betweenness centrality”

alt-metrics, etc.

a wealth of mechanisms exist ...

Thursday, 14 July 2011

challenges :harmonisation

track /maintainjudgement calls

external pressures

Thursday, 14 July 2011

measurements still stuck in the paper

metaphor

Thursday, 14 July 2011

what do we want on the back of our (science) baseball

cards?

“- paul groth (et al.)

UK folks, think Top Trumps

*

*

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

Thursday, 14 July 2011

4. our goal

Thursday, 14 July 2011

software that understands science

Thursday, 14 July 2011

software that understands scientists

Thursday, 14 July 2011

reflect changes in digital research

account for new “data”

Thursday, 14 July 2011

more efficient researchincrease productivityaccelerate discovery

shift culture

Thursday, 14 July 2011

thank you.

k.thaney@digital-science.comwww.digital-science.com

Thursday, 14 July 2011