32
The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Embed Size (px)

Citation preview

Page 1: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

The Practice of Statistics

Chapter 9:9.1 Sampling Distributions

Copyright © 2008 by W. H. Freeman & Company

Daniel S. Yates

Page 2: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Essential Questions for 9.1

• What is a parameter? What is a statistics?

• What is sampling variability?

• What is a sampling distribution?

• How do you describe a sampling distribution?

• What is an unbiased statistic and an unbiased estimator?

Page 3: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Definitions

• parameter: – a number that describes the population– a parameter is a fixed number– in practice, we do not know its value because we

cannot examine the entire population

Page 4: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Definitions

• statistic: – a number that describes a sample– the value of a statistic is known when we have

taken a sample, but it can change from sample to sample

– we often use a statistic to estimate an unknown parameter

Page 5: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
Page 6: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Compare

• parameter– mean: μ– standard deviation: σ– proportion: p

• Sometimes we call the parameters “true”; true mean, true proportion, etc.

• statistic– mean: x-bar

– standard deviation: s

– proportion: (p-hat)

• Sometimes we call the statistics “sample”; sample mean, sample proportion, etc.

Page 7: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Sampling Variability

• Question: What would happen if we took many sample?• To answer this question, we would do the following:

– Take a large number of samples from the same population.– Calculate the sample means or the sample proportion for each

sample.– Make a histogram.– Examine the distribution for shape, center, spread and outliers.

• In Practice it is too expensive to take many samples from a population. Simulation may be used instead of many samples to approximate the sampling distribution.

• Probability may be used to obtain an exact sampling distribution without simulation.

Page 8: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Sampling Distributions

• The sampling distribution of a statistic is the distribution of values taken by the statistic in all possible samples of the same size from the same population.

Page 9: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

• http://www.ruf.rice.edu/~lane/stat_sim/sampling_dist/

Page 10: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
Page 11: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

The population used to construct the random number table (Table B) can be described by the probability function below.

Example – Using an Exact Sampling Distribution

Consider taking an SRS of size 2 from this population and computing the means for the sample.

Page 12: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Instead of doing a simulation we can construct the actual sampling distribution displayed below.

Using this table we can construct a sampling distribution.

Page 13: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

The sampling distribution of the means for sample size n = 2. μ = ?. Does this agree with E(x) of the distribution?

Page 14: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Example –Survivor Fan?

• Television executives and companies who advertise on TV are interested in how many viewers watch particular television shows. According to 2001 Nielsen ratings, Survivor II was one of the most watched television shows in the US during every week that is aired.

• Suppose that true proportion of US adults who watched Survivor II is p=.37.

• Suppose we did a survey with n=100.• Suppose we did this survey 1000 times.

Page 15: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Sampling Distribution for SRSs of size n = 100

Page 16: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Sampling distribution for SRSs of size n = 1000.

Page 17: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Sampling distribution for SRSs of size n = 1000 with scale change.

Page 18: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Simulations

Means (n=120)432

Means (n=60)432

Means (n=30)432

To illustrate the general behavior of samples of fixed size n, 10000 samples each of size 30, 60 and 120 were generated from this uniform distribution and the means calculated. Probability histograms were created for each of these (simulated) sampling distributions.

Notice all three of these look to be essentially normally distributed. Further, note that the variability decreases as the sample size increases.

Page 19: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Simulations

Skewed distribution

To further illustrate the general behavior of samples of fixed size n, 10000 samples each of size 4, 16 and 32 were generated from the positively skewed distribution pictured below.

Notice that these sampling distributions all all skewed, but as n increased the sampling distributions became more symmetric and eventually appeared to be almost normally distributed.

Page 20: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Back to Survivor Fan Problem

Notice both distributions are centered at p = .37

Page 21: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Survivor Fan

• Because the sampling distribution is centered at the true value, there is no systematic tendency to overestimate or underestimate the paramater p.

ˆThe sample proportion p from a SRS is

an unbiased estimator of the population

proportion p

Page 22: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
Page 23: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
Page 24: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Examples of errors in bias and variability

Page 25: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
Page 26: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
Page 27: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
Page 28: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Large bias and large variability.

Label according to bias and variability.

Page 29: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Small bias and small variability.

Page 30: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Small bias and large variability.

Page 31: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Large bias and small variability.

Page 32: The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates

Essential Questions for 9.1

• What is a parameter? What is a statistics?

• What is sampling variability?

• What is a sampling distribution?

• How do you describe a sampling distribution?

• What is an unbiased statistic and an unbiased estimator?