Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Introduction to Data ScienceWeek 1, Lecture 1
Jeff HammerbacherJanuary 18, 2011
1
Wednesday, January 19, 2011
2
Lecture Outline▪ Course Content
▪ What we’ll cover▪ What we won’t cover
▪ Course Logistics
▪ Meeting time and location▪ Prerequisites
▪ Course Motivations
▪ Personal▪ Putting data to work▪ The emergence of Data Science
▪ Homework!
Wednesday, January 19, 2011
Course ContentWhat We’ll Cover
▪ Data Collection and Integration
▪ Data Presentation
▪ Experimentation
▪ Longitudinal Analysis
▪ Data Products
▪ Final Project
3
Wednesday, January 19, 2011
Course ContentWhat We Won’t Cover
▪ Data Mining
▪ Artificial Intelligence
▪ Statistics
▪ Machine Learning
▪ Knowledge Discovery in Databases
▪ Big Data
▪ Relational Databases
▪ NoSQL
4
Wednesday, January 19, 2011
Course Logistics▪ Course Website: http://datascienc.es
▪ Instructors: Jeff Hammerbacher, Mike Franklin
▪ Course Times: 12:30 pm - 2:00 pm, Tuesday and Thursday (Citris 240)
▪ Office Hours: 2:00 pm - 4:00 pm, Thursday (Soda Hall 449)
▪ Mailing List: [email protected]
▪ Prerequisites
▪ Python▪ Web Programming▪ Statistics
5
Wednesday, January 19, 2011
6
Course MotivationsPersonal
Wednesday, January 19, 2011
7
Course MotivationsPersonal
Wednesday, January 19, 2011
8
Course MotivationsPersonal
Wednesday, January 19, 2011
9
Course MotivationsPersonal
Wednesday, January 19, 2011
10
Course MotivationsPersonal
“Information Platforms and the Rise of the Data Scientist”
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
11
1935: “The Design of Experiments”
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
12
1955: “Artificial Intelligence”
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
13
1958: “A Business Intelligence System”
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
14
1977: “Exploratory Data Analysis”
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
15
1989: “Business Intelligence”
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
16
1995: TDWI
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
17
1996: “From Data Mining to Knowledge Discovery in Databases”
Wednesday, January 19, 2011
Course MotivationsPu!ing Data to Work
18
1997: “Machine Learning”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
19
1994: “Managing Gigabytes”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
20
1996: Google
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
21
2007: “The Fourth Paradigm”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
22
2007: “The Case for DISC”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
23
2008: “More Data Usually Beats Better Algorithms”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
24
2009: “The Unreasonable Effectiveness of Data”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
25
2007: “Competing on Analytics”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
26
2007: “Super Crunchers”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
27
2007: “The Coming Exaflood”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
28
2008: “The End of Science”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
29
2010: “The Data Deluge”
Wednesday, January 19, 2011
Course MotivationsThe Emergence of Data Science
30
2010: “What is Data Science?”
Wednesday, January 19, 2011
Homework!▪ 1. How does X work?
▪ 2. How would you build X if you had to start from scratch?
▪ 3. Why is X useful?
▪ Where X can be:
▪ Google Analytics▪ 23 and Me▪ Standard and Poor’s bond ratings
31
Wednesday, January 19, 2011