22
Stat 217 – Day 10 Measures of spread, position (Topic 9)

Stat 217 – Day 10 Measures of spread, position (Topic 9)

  • View
    215

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Stat 217 – Day 10

Measures of spread, position (Topic 9)

Page 2: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Thought Question

Three landmarks of baseball achievement are Ty Cobb’s batting average of .420, Ted William’s .406 in 1941, and George Brett’s .390 in 1980. Which performance was the most impressive?

Page 3: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Last Time – Measures of Spread Variability is an important property of the

distribution of a quantitative variable. You have seen several ways to measure this property: Range = max – min Interquartile range = Upper quartile – lower quartile

Width of middle 50% Standard deviation = square root of sum of squared

deviations divided by sample size minus 1 “Typical” deviation of observations from the mean Not resistant (best with symmetric data, no outliers)

Page 4: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Activity 9-3 (p. 166)

Class F (n = 24)2 2 2 3 4 4 4 4 4 5 5 5 5 5 5 5 6 6 7 7 7 7 8 8

IQR = 6.5 -4 = 2.5 Class G (n = 25)

1 2 2 3 3 3 4 4 4 4 5 5 5 5 5 6 6 6 6 7 7 7 8 8 9

IQR = 6.5 – 3.5 = 3 Standard deviations: F = 1.77, G = 2.04

Page 5: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Class F Class G Class H Class I Class J

Range 6 8 8 8 8

IQR 2.5 3 0 8 4

Std Dev 1.769 2.041 1.18 4 2.657

Activity 9-3

H vs. I vs. Jn=24 n=25

n=18

Because there are two clumps on both ends of the graph and very few in the middle.

Page 6: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Activity 9-3

“Spread” looks at “horizontal width”/”distance from center” Not bumpiness Not variety

(i) 10 ratings between 1 and 9 inclusive with standard deviation as small as possible

(j) 10 ratings with standard deviation as large as possible

Page 7: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Try this (Example 1)

Open Dotplot Summaries applet BB: Course Materials > Applets

Click Draw Samples until you get a fairly symmetric, mound-shaped distribution

Then check both Guess Std Dev and Show actual (SD) boxes.

Drag out the Guess SD edges to match the actual SD

Click Show Percentages

Page 8: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Activity 9-4

Roughly symmetric and mound-shaped Within one standard deviation of the mean

(6.36, 14.08) or really (7-14): 146/213 = .685

Page 9: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Activity 9-4

Roughly symmetric and mound-shaped Within one standard deviation of the mean

(6.36, 14.08) or really (7-14): 146/213 = .685

Within two standard deviations of the mean (2.50, 17.9): 202/213 = .948

All within 3 standard deviations

Page 10: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Turns out (p. 170)

These percentages apply in many, many situations Mound-shaped, symmetric distributions

Page 11: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Another interpretation of SD

With mound-shaped symmetric distributions, standard deviation tells you the width of the middle 68% of the observations IQR = width of middle 50%

Page 12: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Activity 9-5: SATs and ACTs

College admissions…

(a) Bobby: 240 points

(b) Kathy: 9 points

(d) Bobby: (1740-1500)/240 = 1

(e) Kathy: (30-21)/6 = 1.50

Page 13: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Z-score

z = (observation – mean)standard deviation

Unitless Positive when observation is above mean,

negative when observation is below mean The standard deviation also provides a

convenient “yard stick” for measuring the position of observations within a distribution

Interpretation: number of standard deviations away from the mean of the distribution

Page 14: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Best Hitter

Ty Cobb .420 Williams .406 Brett .390

Williams’ performance was further above his peers All pretty impressive, better than 99.85% of the

populations

Mean SD

.266 .0371

.267 .0326

.261 .0317

Mean SD z

.266 .0371 4.15

.267 .0326 4.26

.261 .0317 4.07

Page 15: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Boxplot

Page 16: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Example 2

From the Stat 217 Data Files page (Course Materials), choose bumpus.xls Select the 3rd column and copy

From Stat 217 Applets page click on the Creating Boxplots link Scroll to the bottom to Enter your data below, one

per line and replace the data there by pasting in the bumpus data.

Update boxplot Check Graph by Category radio button

Page 17: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Surviving Sparrows

Min QL med Qu max

Page 18: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Surviving Sparrows

Min QL med Qu max

Page 19: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Surviving Sparrows

Page 20: Stat 217 – Day 10 Measures of spread, position (Topic 9)

Activity 10-1

Underneath “identify outliers,” use the scroll bar to make “modified boxplots.”

Page 21: Stat 217 – Day 10 Measures of spread, position (Topic 9)

To Turn In, with partner

To Turn In with partner Activity 10-2

For Wednesday Lab 3 See review sheet online Submit review question in discussion board

Page 22: Stat 217 – Day 10 Measures of spread, position (Topic 9)

To Turn In, with partner

To Turn In with partner Activity 10-1 parts (j) and (k)

For Wednesday Activity 10-2 See review sheet online Submit review question in discussion board Bring HW 2, Lab 2 to class