23
Estimation 1. Appreciate the importance of random sampling 2. Understand the concept of estimation from samples 3. Understand the Central Limit Theorem 4. Be able to determine unbiased estimates of the variance

Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Embed Size (px)

Citation preview

Page 1: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Estimation

1. Appreciate the importance of random sampling

2. Understand the concept of estimation from samples

3. Understand the Central Limit Theorem

4. Be able to determine unbiased estimates of the variance

Page 2: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

What is so important about these words?

• Population– A Population is the set of all elements of interest for a

particular study. Quantities such as the population mean μ are know as population parameters

– You can’t normally get information on every element in a population

• Sample– A subset of the population. The sample mean ,

sample statistics & are estimates of the population parameters.

– Inferences need to be made from a sample – Vital any sample is representative of the population

Page 3: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

If everyone was the same their would be no need for statistics or

statisticians!

Page 4: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

How will ‘first time’ voters vote?If I take a survey how will I know they are representative?

As the sample gets bigger the results should get closer to the population true mean

Page 5: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Getting estimates of the mean

We have come across estimating means before: Example: Mean height f mid=x f x160 ≤ x < 165 1 162.5 162.5165 ≤ x < 170 5 167.5 837.5170 ≤ x < 175 13 172.5 2242.5175 ≤ x < 180 10 177.5 1775180 ≤ x < 185 1 182.5 182.5Total 30 5200

Estimated mean is 5200 ÷ 30 = 173.3

Or when all values in a set of data are known then the mean of a set is got by

1

n

ii

XX

n

Page 6: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

How will they differ from older voters If I take a survey how will I know they are representative?

XIs an UNBIASED ESTIMATOR of the population mean μ

What inferences can be made from samples and can they be trusted – that is the subject of this lesson.

Page 7: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Which issues concern voters most? If I take a survey how will I know they are representative?

In this chapter we learn some statistical magic.

The distribution of all sample means is normal, as long as the sample size are large enough. And it doesn't matter what the distribution of the original population looks like. (CLT)

Page 8: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

What this means is that we can get an unbiased estimate of the

population mean without doing a census

The mean of the distribution of sample means is approximately equal to the population mean.

XIs an UNBIASED ESTIMATOR of the population mean μ

Page 9: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

And the distribution of the means is normal

That means that the normal distribution tables can be used to solve problems.

But first lets

Similarities and differences of between population and sample distribution

Population Number of Samples

Mean μ Mean of means. Used for population mean estimate – unbiased estimator

Distribution

σ distribution is measured in standard deviations from the mean

distribution is measured in standard errors which is thesquare root of the (variance ÷ sample size)

Variance σ² This is the unbiased estimator of population variance from sample

Normal Tables

X ̴ N (μ, σ2) IF X can be modelled by aNormal distribution then :

X

2

n

2

2

1ix x

Sn

2

,X Nn

Page 10: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Sample means have standard errors in the same way as population means have

standard deviations from the mean.

n

2n

=

n n n n n n

Standard Error of sample means

Standard deviation from population mean

Page 11: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Task

EXERCISE B

Page 94

Answers page 171

Page 12: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Just as we standardised for a population when the when the mean

was not zero and the standard deviation was not one.

xz

We standardise samples too but this We standardise samples too but this time we divide by the standard error time we divide by the standard error

2

standard errorn n

Xz

n

Page 13: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

IF random samples of size n are taken from a population which can be modelled by a normal distribution N(μ,σ²) then we can

assume 3 facts about the sampling distribution X

1. The mean of a sample is μ

2. The variance of the sampling distribution of is

3. The sampling distribution of is a normal distribution.

These can be summarised as:

X ̴ N(μ,σ²) then ̴ N(μ, )

X

X

X2

n

X2

n

X

Page 14: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Task

Follow example 2 & 3 page 96. These give you practice working with samples and the

normal distribution and percentage points tables.

Then do Exercise C page 97 Always do a sketch of the problem.

What you don’t finish in class do for homework.

Page 15: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

The Central Limit Theorem (CLT)(Statistical Magic)

The CLT says that if a random sample of size n is taken from any distribution (it does not

need to be normal) with a mean μ and a variance σ² then, provided n is large enough:

2

, approximatelyX Nn

This is important as in real life we don’t always know the distribution of the population.

Page 16: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Example 4 (page 100)

A packaging machine produces packs of butter with a μ weight of 250g and a σ of 5g. Random samples of 10 packs are taken regularly from the production line and the mean weight of the sample is found. Use the CLT to find the approximate proportion of samples that should have a mean weight > 253g.

The distribution of the population is not known but it is implied that n is big enough to use CLT

From CLT so sample means follow a normal distribution where

the mean is 250 and the standard error is so

from the tables Φ(1.90)=0.9713 and since we want the probability of packets being above that we take it away

form 1

25250,

10X N

252.5 1.58

10 253 250

1.90 1.58

z

Page 17: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Task

• Exercise D page 100

• Questions 1 to 3 and do the rest for homework

Page 18: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Estimating the Variance

All the questions that you have done so far the variance or standard deviation has been know. In most real cases however it wont be known and you will have to use the samples to work out an unbiased estimator of the variance.

The unbiased estimator (S2) of the population variance (σ²) from a sample of size n is given by:

2

2 1

( )

1

n

ii

X XS

n

Many people get confused when to

use n and when to use n-1 as the denominator.•If the data entered is for the whole population use n•If the data entered is for a sample use n-1

Page 19: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Task

Exercise E page 104

Page 20: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Warning!!!

IF the population distribution is not known and the sample size is less than 30 the normal distribution tables can’t be used.

The normal distribution tables can be used if the sample size is grater than 30 (from CLT)

Page 21: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Summary

Similarities and differences of between population and sample distribution

Population Number of Samples

Mean μ Mean of means. Used for population mean estimate – unbiased estimator

Distribution

σ distribution is measured in standard deviations from the mean

distribution is measured in standard errors which is thesquare root of the (variance ÷ sample size)

Variance σ² This is the unbiased estimator of population variance from sample

Normal Tables

X ̴ N (μ, σ2) IF X can be modelled by aNormal distribution then :

X

2

n

2

2

1ix x

Sn

2

,X Nn

Page 22: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Estimation

1. Appreciate the importance of random sampling

2. Understand the concept of estimation from samples

3. Understand the Central Limit Theorem

4. Be able to determine unbiased estimates of the variance

Page 23: Estimation 1.Appreciate the importance of random sampling 2.Understand the concept of estimation from samples 3.Understand the Central Limit Theorem 4.Be

Task and homework

• Exercise F page 106

• Homework mixed questions and test yourself pages 107 & 108