30
Normal and Standard Normal and Standard Normal Normal Distributions Distributions June 29, 2004 June 29, 2004

Normal and Standard Normal Distributions June 29, 2004

Embed Size (px)

Citation preview

Page 1: Normal and Standard Normal Distributions June 29, 2004

Normal and Standard Normal Normal and Standard Normal DistributionsDistributions

June 29, 2004June 29, 2004

Page 2: Normal and Standard Normal Distributions June 29, 2004

HistogramHistogram

80 90 100 110 120 130 140 150 1600

5

10

15

20

25

Percent

POUNDS

Data are divided into 10-pound groups (called “bins”).

With only one woman <100 lbs, this bin represents <1% of the total 120-women sampled.

Percent of total that fall in the 10-pound interval.

85-95

95-105

105-115

115-125125-135

135-145

145-155155-165

Page 3: Normal and Standard Normal Distributions June 29, 2004

What’s the shape of the What’s the shape of the distribution?distribution?

80 90 100 110 120 130 140 150 1600

5

10

15

20

25

Percent

POUNDS

Page 4: Normal and Standard Normal Distributions June 29, 2004

~ Normal Distribution ~ Normal Distribution

80 90 100 110 120 130 140 150 160 0

5

10

15

20

25

P e r c e n t

POUNDS

Page 5: Normal and Standard Normal Distributions June 29, 2004

The Normal DistributionThe Normal DistributionEquivalently the shape is described as:

“Gaussian” or “Bell Curve”Every normal curve is defined by 2

parameters:– 1. mean - the curve’s center – 2. standard deviation - how fat the curve is

(spread)X ~ N (, 2)

Page 6: Normal and Standard Normal Distributions June 29, 2004

Examples:Examples: height weight age bone density IQ (mean=100; SD=15) SAT scores blood pressure ANYTHING YOU

AVERAGE OVER A LARGE ENOUGH #

Page 7: Normal and Standard Normal Distributions June 29, 2004

A Skinny Normal DistributionA Skinny Normal Distribution

Page 8: Normal and Standard Normal Distributions June 29, 2004

More Spread Out...More Spread Out...

Page 9: Normal and Standard Normal Distributions June 29, 2004

Wider Still...Wider Still...

Page 10: Normal and Standard Normal Distributions June 29, 2004

The Normal Distribution:The Normal Distribution:as mathematical function as mathematical function

2)(2

1

2

1)(

x

exf

Note constants:=3.14159e=2.71828

Page 11: Normal and Standard Normal Distributions June 29, 2004

Integrates to 1Integrates to 1

12

1 2)(2

1

dxex

Page 12: Normal and Standard Normal Distributions June 29, 2004

Expected ValueExpected Value

E(X)= =dxexx

2)(

2

1

2

1

Page 13: Normal and Standard Normal Distributions June 29, 2004

VarianceVariance

Var(X)= = 22)(

2

12 )

2

1(

2

dxexx

Standard Deviation(X)=

Page 14: Normal and Standard Normal Distributions June 29, 2004

normal curve with =3 and =1

Page 15: Normal and Standard Normal Distributions June 29, 2004

**The beauty of the normal curve:

No matter what and are, the area between - and + is about 68%; the area between -2 and +2 is about 95%; and the area between -3 and +3 is about 99.7%. Almost all values fall within 3 standard deviations.

Page 16: Normal and Standard Normal Distributions June 29, 2004

68-95-99.7 Rule68-95-99.7 Rule

68% of the data

95% of the data

99.7% of the data

Page 17: Normal and Standard Normal Distributions June 29, 2004

How good is rule for real data?How good is rule for real data?

Check the example data:

The mean of the weight of the women = 127.8

The standard deviation (SD) = 15.5

Page 18: Normal and Standard Normal Distributions June 29, 2004

80 90 100 110 120 130 140 150 160 0

5

10

15

20

25

P e r c e n t

POUNDS

127.8 143.3112.3

68% of 120 = .68x120 = ~ 82 runners

In fact, 79 runners fall within 1-SD (15.5 lbs) of the mean.

Page 19: Normal and Standard Normal Distributions June 29, 2004

80 90 100 110 120 130 140 150 160 0

5

10

15

20

25

P e r c e n t

POUNDS

127.896.8

95% of 120 = .95 x 120 = ~ 114 runners

In fact, 115 runners fall within 2-SD’s of the mean.

158.8

Page 20: Normal and Standard Normal Distributions June 29, 2004

80 90 100 110 120 130 140 150 160 0

5

10

15

20

25

P e r c e n t

POUNDS

127.881.3

99.7% of 120 = .997 x 120 = 119.6 runners

In fact, all 120 runners fall within 3-SD’s of the mean.

174.3

Page 21: Normal and Standard Normal Distributions June 29, 2004

ExampleExample

Suppose SAT scores roughly follows a normal distribution in the U.S. population of college-bound students (with range restricted to 200-800), and the average math SAT is 500 with a standard deviation of 50, then:– 68% of students will have scores between 450 and 550– 95% will be between 400 and 600 – 99.7% will be between 350 and 650

Page 22: Normal and Standard Normal Distributions June 29, 2004

ExampleExampleBUT…What if you wanted to know the math SAT

score corresponding to the 90th percentile (=90% of students are lower)?

P(X≤Q) = .90

90.2)50(

1

200

)50

500(

2

1 2

Q x

dxe

Solve for Q?….Yikes!

Page 23: Normal and Standard Normal Distributions June 29, 2004

The Standard Normal The Standard Normal Distribution Distribution

“Universal Currency” “Universal Currency” Standard normal curve: =0 and =1

Z ~ N (0, 1)

2

2

1

2

1)(

zezf

Page 24: Normal and Standard Normal Distributions June 29, 2004

The Standard Normal Distribution (ZThe Standard Normal Distribution (Z) )

All normal distributions can be converted into the standard normal curve by subtracting the mean and dividing by the standard deviation:

X

Z

Somebody calculated all the integrals for the standard normal and put them in a table! So we never have to integrate!

Even better, computers now do all the integration.

Page 25: Normal and Standard Normal Distributions June 29, 2004

ExampleExample For example: What’s the probability of getting a math SAT score of 575 or less, =500 and =50?

5.150

500575

Z

i.e., A score of 575 is 1.5 standard deviations above the mean

5.1

2

1575

200

)50

500(

2

1 22

2

1

2)50(

1)575( dzedxeXP

Zx

Yikes! But to look up Z= 1.5 in standard normal chart (or enter into SAS) no problem! = .9332

Page 26: Normal and Standard Normal Distributions June 29, 2004

Use SAS to get areaUse SAS to get area

You can also use also use SAS:

data _null_;theArea=probnorm(1.5);

put theArea;run;

0.9331927987

This function gives the area to the left of X standard deviations in a standard normal curve.

Page 27: Normal and Standard Normal Distributions June 29, 2004

In-Class ExerciseIn-Class Exercise

If birth weights in a population are normally distributed with a mean of 109 oz and a standard deviation of 13 oz,

a. What is the chance of obtaining a birth weight of 141 oz or heavier when sampling birth records at random?

b. What is the chance of obtaining a birth weight of 120 or lighter?

Page 28: Normal and Standard Normal Distributions June 29, 2004

AnswerAnswer

a. What is the chance of obtaining a birth weight of 141 oz or heavier when sampling birth records at random?

46.213

109141

Z

From the chart Z of 2.46 corresponds to a right tail (greater than) area of: P(Z≥2.46) = 1-(.9931)= .0069 or .69 %

Page 29: Normal and Standard Normal Distributions June 29, 2004

AnswerAnswer

b. What is the chance of obtaining a birth weight of 120 or lighter?

From the chart Z of .85 corresponds to a left tail area of: P(Z≤.85) = .8023= 80.23%

85.13

109120

Z

Page 30: Normal and Standard Normal Distributions June 29, 2004

Reading for this weekReading for this week

Walker: 1.3-1.6 (p. 10-22), Chapters 2 and 3 (p. 23-54)