Transcript
Page 1: Data Mashups -Data Science Summit

Data Mashups

Turning Data Exhaust into Insights

May 12, 2011
Data Scientist Summit
Pete Skomoroch, LinkedIn
@peteskomoroch

Page 2: Data Mashups -Data Science Summit

We have an explosion of data

• DataWrangling

• InfoChimps

• Data.gov

• Factual

• SimpleGeo

Page 3: Data Mashups -Data Science Summit

And the tools to make sense of it

• Hadoop

• NoSQL

• R

• Python

• Mechanical Turk

Page 4: Data Mashups -Data Science Summit

Diverse datasets = better signal

Page 5: Data Mashups -Data Science Summit
Page 6: Data Mashups -Data Science Summit
Page 7: Data Mashups -Data Science Summit

Find a meaningful problem

http://www.flickr.com/photos/aloshbennett/

• Identify pain points

• Work on stuff that matters

• Focus on underutilized data

Page 8: Data Mashups -Data Science Summit

Trendingtopics.org @hourlytrends

Page 9: Data Mashups -Data Science Summit

LinkedIn Skills

Page 10: Data Mashups -Data Science Summit

The best mashups are actionable

• Reveal patterns

• Enable predictions

• Recommendations

Page 11: Data Mashups -Data Science Summit

Mashup: Skills & Cities

Page 12: Data Mashups -Data Science Summit

Yuba City, California: 21.3% Unemployment

Page 13: Data Mashups -Data Science Summit

Ames, Iowa: 4.7% Unemployment
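
A minimal sketch of what a "Skills & Cities" style mashup could look like in Python, assuming the two datasets share a city name as their join key. The unemployment figures are the two quoted on the slides; the skill names and counts are hypothetical placeholders, not LinkedIn data, and the `mashup` helper is an illustrative name rather than anything from the talk.

```python
# Unemployment rates (percent) as quoted on the slides.
unemployment = {
    "Yuba City, California": 21.3,
    "Ames, Iowa": 4.7,
}

# Hypothetical placeholder skill counts per city (NOT real data).
skill_counts = {
    "Yuba City, California": {"skill_a": 120, "skill_b": 45},
    "Ames, Iowa": {"skill_a": 30, "skill_b": 210},
}


def mashup(unemployment, skill_counts):
    """Join the two datasets on city name and keep each city's top skill."""
    rows = []
    # Only cities present in both datasets can be joined.
    for city in sorted(unemployment.keys() & skill_counts.keys()):
        skills = skill_counts[city]
        top_skill = max(skills, key=skills.get)  # skill with the highest count
        rows.append(
            {
                "city": city,
                "unemployment_pct": unemployment[city],
                "top_skill": top_skill,
            }
        )
    return rows


rows = mashup(unemployment, skill_counts)
for row in rows:
    print(row)
```

Joining on a shared key is the simplest form of mashup; real datasets would likely need the city names normalized before they line up.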

Page 14: Data Mashups -Data Science Summit

Make data mashups work for you

• Open Data = powerful mashups

• Mashup > sum of its parts

• Focus on meaningful problems

• Actionable mashups are better

