26
Economics 105: Statistics Review #1 due next Tuesday in class.

Economics 105: Statistics

  • Upload
    lindley

  • View
    28

  • Download
    1

Embed Size (px)

DESCRIPTION

Economics 105: Statistics. Review # 1 due next Tuesday in class. Mean & Variance. are set of random variables from an i.i.d . sample of size n, what are the mean and variance of sample average? Things we would like to know … What does the p.d.f . of look like? - PowerPoint PPT Presentation

Citation preview

Page 1: Economics 105: Statistics

Economics 105: Statistics• Review #1 due next Tuesday in class.

Page 2: Economics 105: Statistics

Mean & Variance• are set of random variables from an i.i.d. sample of size n, what are the mean and variance of sample average?• Things we would like to know …

1.

2.

3. What does the p.d.f. of look like?

• What does the pdf of look like?

Page 3: Economics 105: Statistics

Central Limit Theorem• Rough statement of CLT:“Sample means are eventually, approximately normally distributed.”

• Formal statement of CLT:

Let X1, X2, X3, . . . ,Xn, where Xi is a random variable denoting the outcome of the ith observation, be an i.i.d. sample from ANY population distribution with mean and variance then as n becomes large

• Graphically (page 236 in BLK, 10th edition, has a nice visual)

Page 4: Economics 105: Statistics

CLT!

Page 5: Economics 105: Statistics

Point and Interval Estimates• A point estimate is a single number, • a confidence interval provides additional

information about variability

Point Estimate

Lower

Confidence

Limit

Upper

Confidence

Limit

Width of confidence interval

Page 6: Economics 105: Statistics

We can estimate a Population Parameter …

Point Estimates

with a SampleStatistic

(a Point Estimate)

Mean

Proportion pπ

Page 7: Economics 105: Statistics

Confidence Level, (1-)• Suppose confidence level = 95% • Also written (1 - ) = 0.95• A relative frequency interpretation:

– In repeated samples, 95% of all the confidence intervals that can be constructed are expected to contain the unknown true parameter

• A specific interval either will contain or will not contain the true parameter– No probability involved in a specific interval

Page 8: Economics 105: Statistics

Confidence Intervals

Population Mean

σ Unknown

ConfidenceIntervals

PopulationProportion

σ Known

Page 9: Economics 105: Statistics

Confidence Intervals for • First, assume 2 is known & X ~ N , so •Things are different when these are not true.• Random sample of n observations • We will use to make inferences about •

Page 10: Economics 105: Statistics

Confidence Interval for μ(σ Known)

• Assumptions– Population standard deviation σ is known– Population is normally distributed

• Confidence interval estimate:

Page 11: Economics 105: Statistics

Finding the Critical Value, Z• Consider a 95% confidence interval:

Z= -1.96 Z= 1.96

Point EstimateLower Confidence Limit

UpperConfidence Limit

Z units:

X units: Point Estimate

0

Page 12: Economics 105: Statistics

Common Levels of Confidence• Commonly used confidence levels are 90%,

95%, and 99%

Confidence Level

Confidence Coefficient,

Z value

1.28

1.645

1.96

2.33

2.58

3.08

3.27

0.80

0.90

0.95

0.98

0.99

0.998

0.999

80%

90%

95%

98%

99%

99.8%

99.9%

Page 13: Economics 105: Statistics

The Most Common Confidence Level is 95%

+/- 1.96

Page 14: Economics 105: Statistics

Intervals and Level of Confidence

Confidence Intervals

Intervals extend from

to

(1-)x100%of intervals constructed contain μ;

()x100% do not.

Sampling Distribution of the Sample Mean

x

x1

x2

Page 15: Economics 105: Statistics

Example• A sample of 11 circuits from a large

normal population has a mean resistance of 2.20 ohms. We know from past testing that the population standard deviation is 0.35 ohms.

• Determine a 95% confidence interval for the true mean resistance of the population.

Page 16: Economics 105: Statistics

Example• A sample of 11 circuits from a large normal

population has a mean resistance of 2.20 ohms. We know from past testing that the population standard deviation is 0.35 ohms.

• Solution:

(continued)

Page 17: Economics 105: Statistics

Interpretations• We are 95% confident that the true mean

resistance is between 1.9932 and 2.4068 ohms

• True mean may or may not be in this interval

• In repeated samples, 95% of confidence intervals formed in this manner are expected to contain the true mean

Page 18: Economics 105: Statistics

Confidence Intervals

Population Mean

σ Unknown

ConfidenceIntervals

PopulationProportion

σ Known

Page 19: Economics 105: Statistics

• If the population standard deviation σ is unknown, we can substitute the sample standard deviation, sx

• This introduces extra uncertainty, since sx varies from sample to sample

• Use t distribution instead of the normal distribution

Confidence Interval for μ(σ Unknown)

Page 20: Economics 105: Statistics

Student’s t distribution• William Sealy Gosset was an Irish statistician who worked for Guinness Brewery in Dublin in the early 1900s. He was interested in the effects of various ingredients and temperature on beer, but only had a few batches of each “formula” to analyze. Thus, he needed a way to correctly treat SMALL SAMPLES in statistical analysis.

• Not supposed to be publishing, so used the pseudonym, “Student”

Page 21: Economics 105: Statistics

Student’s t distribution• The t is a family of distributions

• The shape depends on degrees of freedom (d.f.)– Number of observations that are free to vary after

sample mean has been calculated

d.f. = n - 1

Page 22: Economics 105: Statistics

Student’s t distribution

t0

t (df = 5)

t (df = 13)t-distributions are bell-shaped and symmetric, but have ‘fatter’ tails than the normal

Standard Normal

(t with df = ∞)

Note: t Z as n increases

Page 23: Economics 105: Statistics

• Assumptions– Population standard deviation is unknown– Population is normally distributed

• Use Student’s t distribution• Confidence Interval Estimate:

(where t is the critical value of the t distribution with n -1 degrees of freedom and an area of α/2 in each tail)

Confidence Interval for μ(σ Unknown)

(continued)

Page 24: Economics 105: Statistics

Student’s t Table

Upper Tail Area

df .25 .10 .05

1 1.000 3.078 6.314

2 0.817 1.886 2.920

3 0.765 1.638 2.353

t0 2.920The body of the table contains t values, not probabilities

Let: n = 3 df = n - 1 = 2 = 0.10 /2 = 0.05

/2 = 0.05=TINV(0.1,2) 2.91998558

=TDIST(A3,2,1) 0.05=TDIST(2.92,2,1) 0.049999578

Page 25: Economics 105: Statistics

t distribution valuesWith comparison to the Z value

Confidence t t t Z Level (10 d.f.) (20 d.f.) (30 d.f.) ____

0.80 1.372 1.325 1.310 1.28

0.90 1.812 1.725 1.697 1.645

0.95 2.228 2.086 2.042 1.96

0.99 3.169 2.845 2.750 2.58

Note: t Z as n increases

Page 26: Economics 105: Statistics

Confidence Intervals for • A manufacturer produces bags of flour whose weights are normally distributed. A random sample of 25 bags was taken and their mean weight was 19.8 ounces with a sample standard deviation of 1.2 ounces. • Find and interpret a 99% confidence interval for the true average weight for all bags of flour produced by the company.