47
CHAPTER 6 SAMPLING DISTRIBUTION OBJECTIVES: After completing this chapter, student should be able to: 1. Form a sampling distribution for a mean and proportion based on a small, finite population. 2. Understand that a sampling distribution is a probability distribution for a sample statistic. 3. Present and describe the sampling distribution of sample means and the central limit theorem. 1

Statistik Chapter 6

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Statistik Chapter 6

CHAPTER 6

SAMPLING DISTRIBUTION

OBJECTIVES:

After completing this chapter, student should be able to:

1. Form a sampling distribution for a mean and proportion based on a small, finite population.

2. Understand that a sampling distribution is a probability distribution for a sample statistic.

3. Present and describe the sampling distribution of sample means and the central limit theorem.

4. Explain the relationship between the sampling distributions with the central limit theorem.

5. Compute, describe and interpret z-scores corresponding to known values of

6. Compute z-scores and probabilities for applications of the sampling distribution of sample means and sample proportions.

1

Page 2: Statistik Chapter 6

1.0 POPULATION AND SAMPLING DISTRIBUTION

1.1 Population Distribution

Definition: The probability distribution of the population

data.

Example 1:

Suppose there are only five students in an advanced

statistics class and the midterm scores of these five students

are:

70 78 80 80 95

Let x denote the score of a student.

2

Page 3: Statistik Chapter 6

Mean for Population:

μ=∑ x

N=

70+78+80+80+955

=80 . 6

Standard Deviation for Population:

σ=√∑ x2−(∑ x )N

2

N

¿√32809−( 403 )5

2

5=8 .0895

1.2 Sampling Distribution

Definition: The probability distribution of a sample

statistic.

Sample statistic such as median, mode, mean and standard

deviation

3

Page 4: Statistik Chapter 6

1.2.1 Sampling Distribution of Mean Sample

Definition: The sampling distribution of x̄ is a distribution

obtained by using the means computed from random

samples of a specific size taken from a population.

Example 2:

Reconsider the population of midterm scores of five

students given in example 1.

Let say we draw all possible samples of three numbers each

and compute the mean.

Total number of samples = 5C3 =

5 !3! (5−3 )!

=10

Suppose we assign the letters A, B, C, D and E to scores of the five students, so that

A = 70, B = 78, C=80, D = 80, E = 95

Then the 10 possible samples of three scores each are

ABC, ABD, ABE, ACD, ACE, ADE, BCD, BCE, BDE, CDE

4

Page 5: Statistik Chapter 6

5

Page 6: Statistik Chapter 6

x

Sampling Error

Sampling error is the difference between the value of a

sample statistic and the value of the corresponding

population parameter. In the case of mean,

Sampling error =x̄−μ

Mean (μ x̄ ) for the Sampling Distribution ofx̄ :

Based on example 2,

μ x̄=76+76+81+76 .67+81 . 67+81 .67+79 .33+84 .33+84 .33+8510

=80 .6

“The mean of the sampling distribution of x̄ is always equal to the mean of the population”

6

Page 7: Statistik Chapter 6

Standard Deviation (σ x ) for the Sampling Distribution

of x̄ :

σx

= σ

√n

where: σ is the standard deviation of the population n is the sample size

This formula is used when

nN

≤0 .05

When N is the population size

σ x=σ√n √ N−n

N−1

where √ N−nN−1 is the finite population correction factor

Formula 1

Formula 2

7

Page 8: Statistik Chapter 6

This formula is used when

nN

>0. 05

The spread of the sampling distribution of x̄ is smaller than the spread of the corresponding

population distribution, σ x̄<σ

.

The standard deviation of the sampling distribution of x̄ decreases as the sample size increase.

The standard deviation of the sample means is called the standard error of the mean.

1.3 Sampling from a Normally Distributed Population

μ x̄=μ

σ x=

σ

√n

The shape of the sampling distribution of x̄ is normal,

whatever the value of n.

8

Page 9: Statistik Chapter 6

Shape of the sampling distribution

9

Page 10: Statistik Chapter 6

1.4 Sampling from a Not Normally Distributed Population

Most of the time the population from which the

samples are selected is not normally distributed. In

such cases, the shape of the sampling distribution of x̄

is inferred from central limit theorem.

Central limit theorem:

for a large sample size, the sampling distribution of x̄ is approximately normal, irrespective of the population distribution.

the sample size is usually considered to be large

ifn≥30 .

μ x̄=μ

σ x=

σ

√n

10

Page 11: Statistik Chapter 6

Shape of the sampling distribution

11

Page 12: Statistik Chapter 6

1.5 Applications of the Sampling Distribution of x̄

The z value for a value of x̄ is calculated as:

z= x̄−μσ x̄

Example 3:

In a study of the life expectancy of 500 people in a certain

geographic region, the mean age at death was 72 years and

the standard deviation was 5.3 years. If a sample of 50

people from this region is selected, find the probability that

the mean life expectancy will be less than 70 years.

Solution:

μ x̄=72nN

=50500

=0.1>0 .05 need correction factor

σ x=σ√n √ N−n

N−1= 5.3

√50 √500−50500−1

=0 .7118

12

Page 13: Statistik Chapter 6

P( x̄<70 )=P(z<x̄−μ x̄

σ x̄)

=P(z<70−720 . 7118 )

=P( z<−2 . 81) =0 . 0025

Example 4:

Assume that the weights of all packages of a certain brand

of cookies are normally distributed with a mean of 32

ounces and a standard deviation of 0.3 ounce. Find the

probability that the mean weight, x̄ of a random sample of

20 packages of this brand of cookies will be between 31.8

and 31.9 ounces.

Solution:

13

Page 14: Statistik Chapter 6

Although the sample size is small (n<30 ), the shape of the

sampling distribution of x̄ is normal because the

population is normally distributed.

μ x̄=μ=32

σ x̄=σ

√n= 0 .3

√20=0 . 0671

P (31.8< x̄<31 . 9 )=P(31 . 8−μσ x̄

<z<31. 9−μσ x̄

) =P(31 . 8−32

0. 0671< z<31. 9−32

0. 0671 ) =P (−2 .98< z<−1.49 ) =P( z<−1 . 49)−P( z<−2. 98 ) =0 . 0681−0 . 0014 =0 . 0667

14

Page 15: Statistik Chapter 6

Example 5:

A Bulletin reported that children between the ages of 2 and

5 watch an average of 25 hours of television per week.

Assume the variable is normally distributed and the

standard deviation is 3 hours. If 20 children between the

ages of 2 and 5 are randomly selected, find the probability

that the mean of the number of hours they watch television

will be:

a) greater than 26.3 hours.

b) less than 24 hours

c) between 24 and 26.3 hours.

Solution:

μ x̄=μ=25

15

Page 16: Statistik Chapter 6

σ x̄=σ

√n= 3

√20=0 . 6708

a) greater than 26.3 hours

P( x̄>26 . 3)=P (z>x̄−μ x̄

σ x̄)

=P(z>26 . 3−250 . 6708 )

=P( z>1 .94 ) =1−P (z<1 . 94 ) =1−0. 9738 =0 . 0262

b) less than 24 hours

16

Page 17: Statistik Chapter 6

P( x̄<24 )=P(z<x̄−μ x̄

σ x̄)

=P(z<24−250 . 6708 )

=P( z<−1 . 49) =0 . 0681

c) between 24 and 26.3 hours

P (24< x̄<26 .3 )=P (24−μσ x̄

<z<26 . 3−μσ x̄

) =P(24−25

0. 6708< z<26 . 3−25

0. 6708 ) =P (−1 . 49<z<1. 94 ) =P( z<1 .94 )−P( z<−1.49 ) =0 . 9738−0. 0681 =0 . 9057

OR

17

Page 18: Statistik Chapter 6

P (−1 . 49<z<1. 94 ) =1−(0 .0681+0 .0262 ) =0 . 9057

Remember!Sometimes you have difficulty deciding

whether to use

z= x̄−μσ x̄

or

z= x−μσ

18

Page 19: Statistik Chapter 6

The formula z= x̄−μ

σ x̄ should be used to gain information about a sample mean.

The formula z= x−μ

σ is used to gain information about an individual data value obtained from the population.

Example 6:

The average number of pounds of meat that a person

consumes a year is 218.4 pounds. Assume that the standard

deviation is 25 pounds and the distribution is approximately

normal.

a) Find the probability that a person selected at

random consumes less than 224 pounds per year.

19

Page 20: Statistik Chapter 6

b) If a sample of 40 individuals selected, find the

probability that the mean of the sample will be

less than 224 pounds per year.

Solution:

a) The question asks about an individual person

z= x−μ

σ .μ=218. 4 ,σ=25

P( x<224 )=P( z<x−μσ )

=P(z<224−218 . 425 )

=P( z<0 .22) =0 . 5871

b) The question concerns the mean of a sample with a size of 40

z=

x̄−μ x̄

σ x̄

20

Page 21: Statistik Chapter 6

μ x̄=μ=218 . 4 ,

σ x̄=σ

√n=25

√40=3 . 9528

P( x̄<224 )=P( z<x̄−μ x̄

σ x̄)

=P(z<224−218 . 43 . 9528 )

=P( z<1 .42) =0 . 9222

2.0 POPULATION AND SAMPLE PROPORTIONS

Formula for:

Population proportion

Sample proportion

p= XN

p̂= xn

Where N = total number of element in the population

n = total number of element in the sample

21

Page 22: Statistik Chapter 6

X = number of element in the population that possess a specific characteristic

x = number of element in the sample that possess a specific characteristic

Example 7

Suppose a total of 789 654 families live in a city and 563

282 of them own homes. A sample of 240 families is

selected from the city and 158 of them own homes. Find

the proportion of families who own homes in the

population and in the sample.

Solution:

The proportion of all families in this city who own homes is

p= XN

=563282789654

=0 .71

The sample proportion is

p= x

n=158

240=0 .66

22

Page 23: Statistik Chapter 6

2.1 Sampling Distribution of p̂

Example 8

Boe Consultant Associates has five employees. Table

below gives the names of these five employees and

information concerning their knowledge of statistics.

Where: p = population proportion of employees who know statistics

p= X

N=3

5=0 .6

Let say we draw all possible samples of three employees each and compute the proportion.

Total number of samples = 5C3 =

5 !3! (5−3 )!

=10

23

Page 24: Statistik Chapter 6

Mean of the sample proportion:

μ p̂=p Standard deviation of the sample proportion:

σ p̂=√ pqn

σ p̂=√ pq

n √ N−nN−1

If

nN

≤0 .05,

then use formula

If

nN

>0. 05,

then use formula

24

Page 25: Statistik Chapter 6

where p = population proportion q = 1 – p n = sample size

Example 9

Based on example Boe Consultant Associates,

nN

=35=0. 6>0. 05

σ p̂=√ pqn √ N−n

N−1=√(0 . 601)(0 . 399)

3 √ 5−35−1

=(0 . 2828)(0 .7071)=0 . 1999

Shape of the sampling distribution of p̂

According to the central limit theorem, the sampling

distribution of p̂ is approximately normal for a sufficiently large sample size.

In the case of proportion, the sample size is considered to be sufficiently large if np > 5 and nq > 5

Example 10

25

μ p̂=3(0 . 33)+6 (0 .67 )+110

=0 . 601

Page 26: Statistik Chapter 6

A binomial distribution has p = 0.3. How large must sample size be such that a normal distribution can be used

to approximate sampling distribution of p̂ .

Solution:

np>5n( 0. 3 )>5

n>50. 3

=16 .6667≈17

so , n>17

2.2 Applications Of The Sampling Distribution of p̂

z value for a value of p̂

z= p̂−pσ p̂

Example 11:

The Dartmouth Distribution Warehouse makes deliveries of a large number of products to its customers. It is known

26

Page 27: Statistik Chapter 6

that 85% of all the orders it receives from its customers are delivered on time.

a) Find the probability that the proportion of orders in a random sample of 100 are delivered on time:i. less than 0.87ii. between 0.81 and 0.88

b) Find the probability that the proportion of orders in a random sample of 100 are not delivered on time greater than 0.1.

Solution:

a) p = 85% = 0.85 np = 85 >5q = 1-p = 0.15 nq = 15 > 5n = 100 approximately normal

μ p̂=p=0 .85

σ p̂=√ pqn

=√ (0 . 85)(0 . 15)100

=0 . 0357

i.

27

Page 28: Statistik Chapter 6

P( p̂<0 . 87 )=P(z<p̂−pσ p̂

) =P(z<0 . 87−0 .85

0 . 0357 ) =P( z<0 . 56) =0 . 7123

ii.

P(0 . 81< p̂<0 .88)=P ( p̂−pσ p̂

< z<p̂−pσ p̂

) =P(0. 81−0. 85

0. 0357< z<0 .88−0 . 85

0 .0357 ) =P(−1. 12<z<0 . 84 ) =P( z<0 .84 )−P( z<−1 .12 ) =0 . 7995−0 . 1314 =0 . 6681

b) proportion are not delivered on time = p

28

Page 29: Statistik Chapter 6

P( p̂>0 . 1)=P (z>p̂−pσ p̂

) =P(z>0 . 1−0 . 15

0 . 0357 ) =P( z>−1 . 40) =1−P (z<−1 . 40 ) =1−0. 0808 =0 .9192

Example 12

The machine that is used to make these CDs is known to produce 6% defective CDs. The quality control inspector selects a sample of 100 CDs every week and inspects them for being good or defective. If 8% or more of the CDs in the sample are defective, the process is stopped and the machine is readjusted. What is the probability that based on a sample of 100 CDs the process will be stopped to readjust the mashine?

Solution:

29

Page 30: Statistik Chapter 6

p = 6% = 0.06 np = 6 >5q = 1-p = 0.94 nq = 94 > 5n = 100 approximately normal

μ p̂=p=0 .06

σ p̂=√ pqn

=√ (0 .06)(0 . 94 )100

=0 .0237

P(process is stopped)

=P ( p̂≥0 . 08)

=P( z≥ p̂−pp̂ )

=P( z≥0 . 08−0 .060 . 0237 )

=P (z≥0 .84 )=1−P( z<0 .84 )=1−0 .7995=0.2005

EXERCISES

30

Page 31: Statistik Chapter 6

¯¯¯

1. Given a population with mean, µ = 400 and standard deviation, σ = 60.

a) If the population is normally distributed, what is the shape for the sampling distribution of sample mean with random sample size of 16

b) If we do not know the shape of the population in 1(a), Can we answer 1(a)? Explain.

c) Can we answer 1(a) if we do not know the population distribution but we have been given random sample with size 36? Explain.

2. A random sample with size, n = 30, is obtained from a normal distribution population with µ = 13 and s = 7.

a) What are the mean and the standard deviation for the sampling distribution of sample mean.

b) What is the shape of the sampling distribution? Explain.

c) Calculatei) P ( x < 10)ii) P ( x < 19)iii) P ( x < 16)

3. Given a population size of 5000 with standard deviation 25, Calculate the standard error of mean sample for:

a) n = 300b) n = 100

4. Given X ~ N (5.55, 1.32). If a sample size of 50 is randomly selected, find the sampling distribution for ¯. X

31

Page 32: Statistik Chapter 6

X

XX

(Hint: Give the name of distribution, mean and variance).Then, Calculate:

a) P ( 5.25 ≤ X ≤ 5.90)b) P (5.45 ≤ X ≤ 5.75)

5. Given ¯ ~ N (5, 16). Find the value of:

a) P ( ¯ > 3)b) P( -4 < ¯ < 4)

6. 64 units from a population size of 125 is randomly selected with mean 105 and variance 289, Find:

a) the standard error of the sampling distribution above

b) P( -4 < ¯ < 4)

7. The serving time for clerk at the bank counter is normally distributed with mean 8 minutes and standard deviation 2 minutes. If 36 customers is randomly selected:

a) Calculate σx

b) The probability that the mean of serving time of a clerk at the bank counter is between 7.7 minutes and 8.3 minutes

8. The workers at the walkie-talkie factory received salary at an average of RM3.70 per hour and the standard deviation is RM0.80. If a sample of 100 is randomly selected, find the probability the mean of sample is:

a) at least RM 3.50 per hour

¯

X

32

Page 33: Statistik Chapter 6

b) between RM 3.20 and R3.60 per hour

9. 1,000 packs of pistachio nut have been sent to one of hyper supermarket in Puchong. The weight of pistachio nut packs is normally distributed with mean 99.3g and standard deviation is 1.8g.

a) If a random sample with 300 packs of pistachio nut is selected, find the probability that the mean of the sample will be between 99.2g and 99.5g.

b) Find the probability that mean of sample 300 packs of pistachio nut is between 99.2g and 99.5g with delivery of

i) 2,000 packsii) 5,000 packs

10. An average age of 1500 staffs Tebrau Co. Limited is 38 years old with standard deviation 6.2 years old. If the company selects 50 staffs at random,

a) Do we need correction factor in this situation? Justify your answer.

b) Find the probability of average age for the group of this staff is between 35 and 40 years old.

11. A research has been conducted by an independent research committee about the efficiency of wire harness, A12-3 production at the P.Tex Industries Sdn. Bhd. An average number of wire harness that has been produced a day is 60 pieces with standard deviation 10. A random sample of 90 pieces of wire harness is selected.

33

Page 34: Statistik Chapter 6

a) Find mean and standard error for the wire harness that has been produced a day.

b) Find the probability of wire harness that can be produced in a day is between 58 pieces and 62 pieces.

12. A test of string breaking strength that has been

produced by Z factory shows that the strength of string is only 60%. A random sample of 200 pieces of string is selected for the test.

a) State the shape of sampling distributionb) Calculate the probability of string strength is at

lest 42%

13. Mr. Jay is a teacher at the Henry Garden School. He has conducted a research about bully case at his school. 61.6% students said that they are ever being a bully victim. A random sample of 200 students is selected at random. Find the proportion of bully victim is

a) between 60% and 66%b) more than 64%

14. The information given below shows the response of 40 college students for the question, “Do you work during semester break time?” (The answer is Y=Yes or N=No).

N N Y N N Y N Y N Y N N Y N Y Y N N N Y

34

Page 35: Statistik Chapter 6

N Y N N N N Y N N Y Y N N N Y N N Y N N

If the proportion fo population is 0.30,

a) Find the proportion of sampling for the college student who works during semester break.

b) Calculate the standard error for the proportion in (a).

15. A credit officer at the Tiger Bank believes that 25% from the total credit card users will not pay their minimum charge of credit card debt at the end of every month. If a sample 100 credit card user is randomly selected:

a) What is the standard error for the proportion of the customer who does not pay their minimum charge of credit card debt at the end of every month?

b) Find the probability that the proportion of customer in a random sample of 100 do not pay their minimum charge of credit card debt:

i) less than 0.20ii)more than 0.30

c) What is the consequence of the incremental in population size toward the probability value on the 6(b)?

35