SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)


SADC Course in Statistics

Estimating population characteristics with simple random sampling

(Session 06)


Learning Objectives

By the end of this session, you will be able to

• explain exactly what is meant by a simple random sample

• distinguish between “with” and “without” replacement sampling

• estimate the population mean or total using a sample where the sampling has been by simple random sampling

• compute measures of precision for such an estimate, recognising the need for a finite population correction.


Simple random sampling: definition

• The exact definition of simple random sampling is a procedure whereby every sample of size n has an equal chance of being selected.

• In practice, this is achieved by picking one unit at a time without replacement from a list of population members.

• “Without replacement” means that once a unit is chosen, it is not returned to the population list until all the necessary units have been sampled.
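The selection procedure described above can be sketched in Python. This is only an illustration: the unit labels "A" to "F" and the seed are assumptions made for the example, not part of the session material.

```python
import random

# Hypothetical sampling frame: labels for a population of size N = 6.
population = ["A", "B", "C", "D", "E", "F"]
n = 2  # required sample size

rng = random.Random(2024)  # seeded only so the draw can be reproduced

# random.sample draws n units without replacement,
# so no unit can appear in the sample more than once.
srs = rng.sample(population, n)
print(srs)
```

Because `random.sample` never returns the same list element twice, each possible set of n units is equally likely to be drawn.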


An illustration:

• Suppose population size is N = 6 with the observable values of the six members being 10, 4, 17, 6, 8, 15. Suppose the values are observed accurately without any error.

• Suppose we want a sample of size 2.

• How many possibilities are there to choose 2 out of 6 members?

• A list of all such pairs appears below.


Illustration continued…

(10, 4) (10, 17) (10, 6) (10, 8) (10, 15)

(4, 17) (4, 6) (4, 8) (4, 15)

(17, 6) (17, 8) (17, 15)

(6, 8) (6, 15)

(8, 15)

In simple random sampling, each of the above pairs has an equal chance of selection, i.e. probability of selection = 1/15.
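The count of 15 possible samples can be checked by enumeration. The sketch below uses the six population values from the illustration; the variable names are chosen for this example only.

```python
from itertools import combinations
from math import comb

population = [10, 4, 17, 6, 8, 15]
n = 2

# Every unordered pair of distinct population members is one possible sample.
samples = list(combinations(population, n))
num_samples = len(samples)

# Under simple random sampling each sample is equally likely.
prob_each = 1 / num_samples

print(num_samples, comb(len(population), n))  # both counts should agree
print(prob_each)
```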


“With replacement” sampling

• Taking a simple random sample is done using “without replacement” sampling.

• “With replacement” involves noting the value for the unit drawn, and returning the unit to the population.

• This means there is potential for the same unit to be selected more than once!

• Is this sensible?

Note: In multi-stage sampling, there is often a valid reason for doing “with replacement” sampling at the first stage of selection. More on this later!
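To see how often a repeat can occur under “with replacement” sampling, the following sketch enumerates every ordered pair of draws from a population of six units. The units are identified by index (an assumption made for the example) so that a repeated unit is easy to detect.

```python
from itertools import product

units = range(6)  # units identified by position in the population list

# With replacement, the second draw is made from the full list again,
# so every ordered pair (first draw, second draw) is possible.
ordered_draws = list(product(units, repeat=2))

# Draws in which the same unit was selected twice.
repeats = [d for d in ordered_draws if d[0] == d[1]]

prob_repeat = len(repeats) / len(ordered_draws)
print(len(ordered_draws), len(repeats), prob_repeat)
```

With N = 6 and n = 2, one draw in six selects the same unit twice, which adds no new information about the population.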


Estimation with SRS

Suppose a sample of size n (x1, x2, …, xn) is selected from a population of size N whose true mean is $\bar{X}$ ($\bar{X}$ = 10 for our 6-member population).

Then the best estimator of the population mean is the sample mean given by

$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$$

Note: Lower case letters will be used for sample values, and upper case for population values.
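One way to see why the sample mean is a sensible estimator is to average it over all 15 possible samples of size 2 from the 6-member population. The sketch below does this by enumeration; the result should coincide with the true population mean of 10.

```python
from itertools import combinations
from statistics import mean

population = [10, 4, 17, 6, 8, 15]
pop_mean = mean(population)  # true population mean

# Sample mean for each of the 15 equally likely samples of size 2.
sample_means = [mean(s) for s in combinations(population, 2)]

# Averaging over all possible samples gives the expected value of the estimator.
expected_sample_mean = mean(sample_means)
print(pop_mean, expected_sample_mean)
```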


Variance of the SRS estimator

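The variance of the sample mean is given by

$$V(\bar{x}) = \left(1 - \frac{n}{N}\right)\frac{S^2}{n}, \quad \text{where } S^2 = \frac{\sum_{i=1}^{N} (X_i - \bar{X})^2}{N-1} = 26.0$$

using population values 10, 4, 17, 6, 8, 15. Hence

$$V(\bar{x}) = \left(1 - \frac{2}{6}\right)\frac{26.0}{2} = 8.67$$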
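The value 8.67 can be checked in two ways: from the formula, and directly as the variance of the sample mean across all 15 possible samples. The sketch below, using only the Python standard library, computes both.

```python
from itertools import combinations
from statistics import mean, pvariance, variance

population = [10, 4, 17, 6, 8, 15]
N, n = len(population), 2

# S^2 uses the divisor N - 1, which is what statistics.variance computes.
S2 = variance(population)

# Variance of the sample mean from the formula (1 - n/N) * S^2 / n.
var_formula = (1 - n / N) * S2 / n

# Variance of the sample mean over all equally likely samples
# (divisor = number of samples, hence pvariance).
sample_means = [mean(s) for s in combinations(population, n)]
var_enumerated = pvariance(sample_means)

print(S2, round(var_formula, 2), round(var_enumerated, 2))
```

The agreement of the two values shows that the finite population correction is needed to describe the actual spread of the sample mean in a small population.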


Notes concerning the variance

• Compared to the variance of a sample mean used in Module H2 (assuming an infinite population), the formula here is similar except for the inclusion of the term (1-n/N).

• This multiplier is called the finite population correction. It may be ignored if the population is very large since n/N is then nearly zero.

• The quantity n/N is called the sampling fraction, often denoted by f. Thus f=n/N.
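The effect of population size on the finite population correction can be seen with a short calculation. The helper name `fpc` and the population sizes tried below are illustrative assumptions.

```python
def fpc(n, N):
    """Finite population correction 1 - f, where f = n/N is the sampling fraction."""
    return 1 - n / N

# Same sample size, increasingly large populations.
for N in (6, 100, 10_000, 1_000_000):
    print(N, fpc(2, N))
```

For N = 6 the multiplier is about 0.67, while for very large N it is close to 1 and has almost no effect on the variance.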


Example

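• Suppose the sample values were 6 and 15.

• The population mean is then estimated by

$$\bar{x} = \frac{6 + 15}{2} = 10.5$$

• Its variance is estimated by

$$V(\bar{x}) = \left(1 - \frac{n}{N}\right)\frac{s^2}{n}, \quad \text{where } s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1} = 40.5,$$

i.e.

$$V(\bar{x}) = \left(1 - \frac{2}{6}\right)\frac{40.5}{2} = 13.5, \quad \text{std. error} = 3.7$$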
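The arithmetic of this example can be reproduced as follows. This is a minimal sketch using the sample values 6 and 15 and N = 6 from the illustration.

```python
from math import sqrt
from statistics import mean, variance

N = 6             # population size
sample = [6, 15]  # observed sample values
n = len(sample)

x_bar = mean(sample)   # estimate of the population mean
s2 = variance(sample)  # sample variance, divisor n - 1

# Estimated variance of the sample mean, including the finite population correction.
var_x_bar = (1 - n / N) * s2 / n
std_error = sqrt(var_x_bar)

print(x_bar, s2, var_x_bar, round(std_error, 1))
```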


Estimating population total, $X_T$

• The appropriate estimate is given by $x_T = N\bar{x}$ = (6)(10.5) = 63 in the example above

• The variance of this estimator is $V(x_T) = N^2 V(\bar{x}) = 6^2 (13.5) = 486$

Hence std. error = 22.0

• Confidence intervals for both the population mean and the population total can be obtained in the usual way (refer to methods covered in Module H2).
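The estimate of the total follows directly from the estimate of the mean. The sketch below carries over the sample mean (10.5) and its estimated variance (13.5) from the sample with values 6 and 15, so the estimated total is 6 × 10.5 = 63.

```python
from math import sqrt

N = 6             # population size
x_bar = 10.5      # sample mean from the sample (6, 15)
var_x_bar = 13.5  # estimated variance of the sample mean

# Scale the mean by N to estimate the total; the variance scales by N squared.
total_estimate = N * x_bar
var_total = N**2 * var_x_bar
se_total = sqrt(var_total)

print(total_estimate, var_total, round(se_total, 1))
```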


Estimating population proportion

• Results below are for use when the denominator for the proportion is fixed, e.g. proportion of households (HHs) with at least 1 child aged 12-23 months. The denominator (total no. of HHs) is fixed by the investigator.

• The appropriate estimate for the population proportion is the sample proportion p = r/n, where r = number of sampled units having the attribute and n = sample size.

• The standard error of this estimate is the square root of (1-f)p(1-p)/(n-1), where f = n/N.
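As an illustration of these formulas, the sketch below computes a sample proportion and its standard error. The figures used (N = 1000 households, n = 100 sampled, r = 30 with the attribute) are hypothetical and chosen only to show the calculation; the function name is likewise an assumption.

```python
from math import sqrt

def proportion_estimate(r, n, N):
    """Return the sample proportion p = r/n and its standard error under SRS."""
    p = r / n
    f = n / N  # sampling fraction
    se = sqrt((1 - f) * p * (1 - p) / (n - 1))
    return p, se

# Hypothetical survey: 30 of 100 sampled households out of 1000 have the attribute.
p, se = proportion_estimate(r=30, n=100, N=1000)
print(p, round(se, 4))
```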


Further notes

• It is important not to confuse estimating a population proportion with estimating a population ratio.

• An example of a ratio is the ratio of male children to female children in the population; here the denominator is not fixed by the investigator but varies from sample to sample.

• You will briefly meet the estimation of a ratio through the practical exercise “To the Woods” done in Sessions 8, 9, 10.


Further notes

• In computing confidence intervals for the estimators considered above, large sample sizes are usually assumed, so z-values from a standard normal distribution are used, i.e.

$$\bar{x} \pm Z_{\alpha/2}\,\frac{s}{\sqrt{n}}\,\sqrt{1-f}$$

• However, if n is small, t-values should be used, i.e.

$$\bar{x} \pm t_{\alpha/2,\,n-1}\,\frac{s}{\sqrt{n}}\,\sqrt{1-f}$$
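The two intervals can be compared on the sample with values 6 and 15 from the 6-member population. In this sketch the function name is an assumption; the z-value comes from the standard library's `NormalDist`, and the t-value for 1 degree of freedom is entered from a t-table because the standard library has no t-distribution.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

def srs_confidence_interval(sample, N, critical_value):
    """Interval x_bar +/- critical_value * (s / sqrt(n)) * sqrt(1 - f), with f = n/N."""
    n = len(sample)
    x_bar = mean(sample)
    half_width = critical_value * stdev(sample) / sqrt(n) * sqrt(1 - n / N)
    return x_bar - half_width, x_bar + half_width

alpha = 0.05
z = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96
t = 12.706  # tabulated t-value for alpha/2 = 0.025 and n - 1 = 1 degree of freedom

sample = [6, 15]
z_interval = srs_confidence_interval(sample, 6, z)
t_interval = srs_confidence_interval(sample, 6, t)
print(z_interval)
print(t_interval)
```

With only two observations the t-based interval is far wider than the z-based one, which is why the large-sample formula should not be used when n is small.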


Some practical work follows…