Big Data = Bigger Metadata, O’Reilly Strata Conference, February 29, 2012

Big Data = Bigger Metadata

Page 1: Big Data = Bigger Metadata

Big Data = Bigger Metadata

O’Reilly Strata Conference

February 29, 2012

Page 2: Big Data = Bigger Metadata

Pivot/Skate, etc…

Founded 2003

Poor man’s GIS

Panamap

Refounded 2006

Neighborhood boundaries

Mass transit data

Refocused 2009

SaaS for mapping + on-demand data

Page 3: Big Data = Bigger Metadata

Achtung!

NoSQL is no panacea

Big Data isn’t about data

Big Data isn’t new

Big Data doesn’t present a Boolean quandary

With power comes responsibility

AWS bills

Lady Gaga tweets

Innumeracy (correlation v causation)

Page 4: Big Data = Bigger Metadata

Big v Important

Big

Heterogeneous

Raw

Distributed

Streaming/real time

Search for meaning

Time-sensitive

Philosophical

Important

Well-defined schema

High value (not free)

Test-driven

Relational

Historical

Enterprise-focused

Page 5: Big Data = Bigger Metadata

Data Exhaust

Analytics Probes

Gov 2.0

Social Media

Page 6: Big Data = Bigger Metadata

Platforms

Commoditization of compute and storage

Page 7: Big Data = Bigger Metadata

A Brief History of Metadata

Callimachus, Library of Alexandria, Egypt

Page 8: Big Data = Bigger Metadata

A Brief History of Metadata

“Pinakes” (lists)

Title

Category

Author

Author birthplace

Father

Word count

Callimachus
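
The fields listed on this slide amount to a small metadata schema. As a rough sketch only, they could be modeled as a record type like the one below. The class name `PinakesEntry`, the field types, and the sample values are all assumptions made for illustration; none of them come from the talk or from historical data.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class PinakesEntry:
    """One catalog record, using the field names listed on the slide."""
    title: str
    category: str
    author: str
    author_birthplace: str
    father: str
    word_count: int


# Placeholder values for illustration only, not historical data.
entry = PinakesEntry(
    title="Example Work",
    category="Example Category",
    author="Example Author",
    author_birthplace="Example City",
    father="Example Father",
    word_count=1000,
)

# The metadata travels as a plain dictionary, separate from the work itself.
record = asdict(entry)
print(list(record))
```

The record describes the work without containing it, which is the separation of metadata and artifact that the later slides return to.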

Page 9: Big Data = Bigger Metadata

A Brief History of Metadata

Page 10: Big Data = Bigger Metadata

A Brief History of Metadata

Page 11: Big Data = Bigger Metadata

A Brief History of Metadata

Card catalog room, Library of Congress, c. 1920

Page 12: Big Data = Bigger Metadata

A Brief History of Metadata

Dewey Decimal System goes electronic in 1967

Page 13: Big Data = Bigger Metadata

Out with the Old, in with the New

Archiving card catalogs after digitization

Page 14: Big Data = Bigger Metadata

Why Can’t We Be Together?

Metadata Data

Page 15: Big Data = Bigger Metadata

Exponential Growth in Data

Pinakes: 300 BC

Catalog: 1595 AD

Taxonomy: 1876

Database: 1970

Data

Unprecedented rate of data creation, 1995-today

Page 16: Big Data = Bigger Metadata

Oh, How I’ve Missed You

The reunification of metadata and the artifact

Page 17: Big Data = Bigger Metadata

Together At Last

Page 18: Big Data = Bigger Metadata

GIS Data is Unevolved

Page 19: Big Data = Bigger Metadata

Enter the Data Curator

Part social scientist, part librarian, part statistician, part RDBMS wiz

Page 20: Big Data = Bigger Metadata

DIKW Model

Data

Fact, Signal, Symbol

Information

Structural v Functional

Symbolic v Subjective

Knowledge

Processed

Procedural

Propositional

Page 21: Big Data = Bigger Metadata

Popularity (Google Trends)

Page 22: Big Data = Bigger Metadata

Words to Live By

dx/dt

Page 23: Big Data = Bigger Metadata

Thank you!

[email protected]

@urbanmapping

R.I.P.

Schema