35
Stratified and Cluster Sampling Lecture 8 Sections 2.6, 2.8 Robb T. Koether Hampden-Sydney College Tue, Jan 26, 2010 Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 1 / 23

Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified and Cluster SamplingLecture 8

Sections 2.6, 2.8

Robb T. Koether

Hampden-Sydney College

Tue, Jan 26, 2010

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 1 / 23

Page 2: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Outline

1 Introduction

2 Stratified Random Samples

3 Estimating Parameters

4 Cluster Samples

5 Stratified vs. Cluster

6 Assignment

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 2 / 23

Page 3: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Outline

1 Introduction

2 Stratified Random Samples

3 Estimating Parameters

4 Cluster Samples

5 Stratified vs. Cluster

6 Assignment

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 3 / 23

Page 4: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

Suppose we want to measure support for the recent Senatehealth-care bill in Massachusetts.

Suppose further that we know that the Massachusetts populationis 36% Democrat, 12% Republican, and 52% Independent.Suppose even further that we suspect that party affiliation is arelevant variable.We plan to take a sample of 100 individuals.What might go wrong if we take a simple random sample?How can we be assured that each party affiliation will besufficiently represented in our sample?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 4 / 23

Page 5: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

Suppose we want to measure support for the recent Senatehealth-care bill in Massachusetts.Suppose further that we know that the Massachusetts populationis 36% Democrat, 12% Republican, and 52% Independent.

Suppose even further that we suspect that party affiliation is arelevant variable.We plan to take a sample of 100 individuals.What might go wrong if we take a simple random sample?How can we be assured that each party affiliation will besufficiently represented in our sample?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 4 / 23

Page 6: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

Suppose we want to measure support for the recent Senatehealth-care bill in Massachusetts.Suppose further that we know that the Massachusetts populationis 36% Democrat, 12% Republican, and 52% Independent.Suppose even further that we suspect that party affiliation is arelevant variable.

We plan to take a sample of 100 individuals.What might go wrong if we take a simple random sample?How can we be assured that each party affiliation will besufficiently represented in our sample?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 4 / 23

Page 7: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

Suppose we want to measure support for the recent Senatehealth-care bill in Massachusetts.Suppose further that we know that the Massachusetts populationis 36% Democrat, 12% Republican, and 52% Independent.Suppose even further that we suspect that party affiliation is arelevant variable.We plan to take a sample of 100 individuals.

What might go wrong if we take a simple random sample?How can we be assured that each party affiliation will besufficiently represented in our sample?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 4 / 23

Page 8: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

Suppose we want to measure support for the recent Senatehealth-care bill in Massachusetts.Suppose further that we know that the Massachusetts populationis 36% Democrat, 12% Republican, and 52% Independent.Suppose even further that we suspect that party affiliation is arelevant variable.We plan to take a sample of 100 individuals.What might go wrong if we take a simple random sample?

How can we be assured that each party affiliation will besufficiently represented in our sample?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 4 / 23

Page 9: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

Suppose we want to measure support for the recent Senatehealth-care bill in Massachusetts.Suppose further that we know that the Massachusetts populationis 36% Democrat, 12% Republican, and 52% Independent.Suppose even further that we suspect that party affiliation is arelevant variable.We plan to take a sample of 100 individuals.What might go wrong if we take a simple random sample?How can we be assured that each party affiliation will besufficiently represented in our sample?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 4 / 23

Page 10: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

We could choose 36 Democrats, 12 Republicans, and 52Independents.

What if we chose to survey 25 Democrats, 25 Republicans, and50 Independents?Could we accurately estimate the population proportion?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 5 / 23

Page 11: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

We could choose 36 Democrats, 12 Republicans, and 52Independents.What if we chose to survey 25 Democrats, 25 Republicans, and50 Independents?

Could we accurately estimate the population proportion?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 5 / 23

Page 12: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Introduction

We could choose 36 Democrats, 12 Republicans, and 52Independents.What if we chose to survey 25 Democrats, 25 Republicans, and50 Independents?Could we accurately estimate the population proportion?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 5 / 23

Page 13: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Outline

1 Introduction

2 Stratified Random Samples

3 Estimating Parameters

4 Cluster Samples

5 Stratified vs. Cluster

6 Assignment

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 6 / 23

Page 14: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sample

Definition (Homogeneous)A group is homogeneous if its member all have similar characteristicswith regard to a variable of interest.

Definition (Stratum)A stratum is a homogeneous subset of the population.

Definition (Stratified random sampling)Stratified random sampling is a sampling method in which thepopulation is first divided into strata. Then a simple random sample istaken from each stratum. The combined results constitute the sample.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 7 / 23

Page 15: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Examples

Possible strata:I Male and female strata.I Resident and non-resident strata.I White, Black, Hispanic, and Asian strata.I Protestant, Catholic, Jewish, Muslim, etc., strata.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 8 / 23

Page 16: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sampling

R

DI

The strata

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 9 / 23

Page 17: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sampling

R

I DY

N

N

Select 3 Republicans

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 9 / 23

Page 18: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sampling

R

I DY

N

N

Y

N

Y

Select 3 Democrats

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 9 / 23

Page 19: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sampling

R

I DY

N

N

Y

N

Y

Y

N

Select 2 Independents

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 9 / 23

Page 20: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Outline

1 Introduction

2 Stratified Random Samples

3 Estimating Parameters

4 Cluster Samples

5 Stratified vs. Cluster

6 Assignment

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 10 / 23

Page 21: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Estimating Parameters

We observed a total of 4 yes’s and 4 no’s in our sample of 8.Our sample proportion is 4

8 = 0.50.Can we do better than that?

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 11 / 23

Page 22: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Estimating Parameters

Base on our sample, support isI Republicans: 1/3 of 12%.I Democrats: 2/3 of 36%.I Independents: 1/2 of 52%.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 12 / 23

Page 23: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Estimating Parameters

We need to compute a weighted average.

wgt. avg. =13(0.12) +

23(0.36) +

12(0.52)

= 0.04 + 0.24 + 0.26= 0.54.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 13 / 23

Page 24: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Outline

1 Introduction

2 Stratified Random Samples

3 Estimating Parameters

4 Cluster Samples

5 Stratified vs. Cluster

6 Assignment

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 14 / 23

Page 25: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Cluster Sampling

Definition (Heterogeneous)A group is heterogeneous if is members vary in regard to the variablesof interest in the same way that the population varies.

Definition (Cluster)A cluster is a heterogeneous subset of the population.

Definition (Cluster random sampling)Cluster random sampling is a sampling method in which the populationis first divided into clusters. Then a simple random sample of clustersis taken. All the members of the selected clusters together constitutethe sample.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 15 / 23

Page 26: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Cluster Sampling

Note that it is the clusters that are selected at random, not theindividuals.It is hoped that each cluster by itself is representative of thepopulation, i.e., each cluster is heterogeneous.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 16 / 23

Page 27: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sampling

Row 1

Row 2 Row 3

The clusters

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 17 / 23

Page 28: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sampling

Row 1

Row 2 Row 3

Y

N

Y

N

Y

Y

Select Row 2

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 17 / 23

Page 29: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Random Sampling

Row 1

Row 2 Row 3

Y

N

Y

N

Y

Y

Y

Y

Y

N

N

N

Select Row 3

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 17 / 23

Page 30: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Outline

1 Introduction

2 Stratified Random Samples

3 Estimating Parameters

4 Cluster Samples

5 Stratified vs. Cluster

6 Assignment

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 18 / 23

Page 31: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Stratified Sampling vs. Cluster Sampling

In stratified samplingI From all of the strata we take randomly selected individuals.

In cluster samplingI From randomly selected clusters we take all of the individuals.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 19 / 23

Page 32: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Outline

1 Introduction

2 Stratified Random Samples

3 Estimating Parameters

4 Cluster Samples

5 Stratified vs. Cluster

6 Assignment

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 20 / 23

Page 33: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Assignment

HomeworkRead Sections 2.6, 2.8, pages 108 - 115, 122 - 126.Let’s Do It! 2.6, 2.8.Page 115, exercises 19 - 23, 25.Page 126, exercises 35 - 38.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 21 / 23

Page 34: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Answers to Even-numbered Problems

Page 115, Problems 20, 222.20 (a) Good. Textbook prices tend to vary dramatically across different

majors. A stratified sample will guarantee that all majors arerepresented.

(b) Not so good. Textbook prices do not vary by the gender of thestudent, except that females are slightly less likely to be takingscience courses, where the textbooks are very expensive. But thattakes us back to part (a).

(c) Not so good. Textbook prices may vary somewhat by class rank,but not much.

2.22 (a) 100200 .

(b) 1001000 .

(c) 0.10. No. Not all samples are equally likely. Only samplescontaining 100 males and 20 females are possible.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 22 / 23

Page 35: Stratified and Cluster Sampling - people.hsc.edupeople.hsc.edu/faculty-staff/robbk/Math121/Lectures... · 2010-01-26 · Stratified Sampling vs. Cluster Sampling In stratified

Answers to Even-numbered Problems

Page 126, Problems 36, 382.36 (a) Physics and Mathematics.

(b) 60.2.38 (a) Cluster sampling.

(b) No. The more classes a student takes, the more likely he is to be inthe sample.

(c) Poor design together with bad luck. The procedure is not biased,except for the small effect described in part (b). However, theindividual classes, especially the education classes, are notheterogeneous, so cluster sampling should not be used.

Robb T. Koether (Hampden-Sydney College) Stratified and Cluster Sampling Tue, Jan 26, 2010 23 / 23