23
1 Baseball, Shakespeare, and Modern Statistical Theory Bradley Efron Stanford

Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

Embed Size (px)

Citation preview

Page 1: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

1

Baseball, Shakespeare, andModern Statistical Theory

Bradley EfronStanford

Page 2: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

2

What Is “Statistics”?

Page 3: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

3

PROSTATE CANCER DATA (Microarray)(Singh et al. 2002)

-0.91-0.790.00-0.80-0.80-0.70-0.67-0.09-0.25gene6033

0.100.09-0.89-0.88-0.87-0.91-0.881.33-0.90gene6032

-1.18-0.82-1.18-1.17-0.92-0.91-0.790.100.35gene6031

.

.

-0.14-0.14-0.10-1.080.941.701.050.18-1.12gene5

-1.130.43-0.19-0.36-0.13-1.13-0.102.42-0.36gene4

-0.03-1.100.094.040.11-1.160.220.100.06gene3

3.57-0.82-0.27-0.830.25-0.75-0.16-0.85-0.84gene2

1.470.732.77-1.09-0.58-0.99-1.08-0.75-0.93gene1

“z”pat102pat101pat52pat51pat50pat49pat2pat1

TESTSTATISTICS

PROSTATE CANCERHEALTHY

Page 4: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

4

The Puzzled Physicist

Page 5: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

5

BAYES’ RULE (1763)

Page 6: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

6

Corbet’s Butterflies(Malaysia 1943)

Page 7: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

7

Page 8: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

8

Proving the Magic Formula

Page 9: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

9

Empirical Bayes (1952)

Page 10: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

10

Shakespeare’s Word Counts

837 …104314632292434314376?

6 …543210

Page 11: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

11

Shakespeare’s Missing Words

Page 12: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

12

“Shall I Die?”

Page 13: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

13

Eighteen Baseball Players

0.265.265.265Grand Average0.239.200.1567/4518. Alvis0.244.316.1788/4517. Munson0.249.286.2009/4516. Campaneris0.254.226.22210/4515. E Rodriguez0.254.264.22210/4514. Petrocelli

:::::0.277.222.33315/454. Johnstone0.281.276.35616/453. F Howard0.286.298.37817/452. F Robinson0.290.346.40018/451. Clemente

James-Stein“TRUTH”Avehits/ABNameObserved

Page 14: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

14

Page 15: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

15

Stein Estimation (1956)

Page 16: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

16

Page 17: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

17

Frequentist Hypothesis Testing

Page 18: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

18

PROSTATE CANCER DATA (Microarray)(Singh et al. 2002)

-0.91-0.790.00-0.80-0.80-0.70-0.67-0.09-0.25gene6033

0.100.09-0.89-0.88-0.87-0.91-0.881.33-0.90gene6032

-1.18-0.82-1.18-1.17-0.92-0.91-0.790.100.35gene6031

.

.

-0.14-0.14-0.10-1.080.941.701.050.18-1.12gene5

-1.130.43-0.19-0.36-0.13-1.13-0.102.42-0.36gene4

-0.03-1.100.094.040.11-1.160.220.100.06gene3

3.57-0.82-0.27-0.830.25-0.75-0.16-0.85-0.84gene2

1.470.732.77-1.09-0.58-0.99-1.08-0.75-0.93gene1

“z”pat102pat101pat52pat51pat50pat49pat2pat1

TESTSTATISTICS

PROSTATE CANCERHEALTHY

Page 19: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

19

Page 20: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

20

False Discovery Rates (1995)

Page 21: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

21

Modern Statistical Theory (2000+)

Page 22: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

22

Page 23: Baseball, Shakespeare, and Modern Statistical Theorystatweb.stanford.edu/~ckirby/brad/talks/2006Baseball.pdf · gene2 -0.84 -0.85 -0.16 -0.75 0.25 -0.83 -0.27-0.82 3.57 gene1 -0.93

23

References