92

6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

Embed Size (px)

Citation preview

Page 1: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y
Page 2: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y
Page 3: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y
Page 4: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

Page 5: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

Page 6: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

Page 7: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is related to x by the following straight-line relationship:

where the slope and intercept of the line are called regression coefficients.The simple linear regression model is given by

where is the random error term.

Page 8: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

We think of the regression model as an empirical model.

Suppose that the mean and variance of are 0 and 2, respectively, then

The variance of Y given x is

6-1 Introduction To Empirical Models

Page 9: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

• The true regression model is a line of mean values:

where 1 can be interpreted as the change in the mean of Y for a unit change in x.• Also, the variability of Y at a particular value of x is determined by the error variance, 2.• This implies there is a distribution of Y-values at each x and that the variance of this distribution is the same at each x.

6-1 Introduction To Empirical Models

Page 10: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

Page 11: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

A Multiple Regression Model:

Page 12: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

Page 13: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-1 Introduction To Empirical Models

Page 14: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation• The case of simple linear regression considers a single regressor or predictor x and a dependent or response variable Y.

• The expected value of Y at each level of x is a random variable:

• We assume that each observation, Y, can be described by the model

Page 15: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation• Suppose that we have n pairs of observations (x1, y1), (x2, y2), …, (xn, yn).

•The method of least squares is used to estimate the parameters, 0 and 1 by minimizing the sum of the squares of the vertical deviations in Figure 6-6.

Page 16: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

• Using Equation 6-8, the n observations in the sample can be expressed as

• The sum of the squares of the deviations of the observations from the true regression line is

Page 17: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Page 18: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Page 19: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Page 20: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Page 21: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Notation

n

x

xxxS

n

iin

ii

n

iixx

2

1

1

2

1

2)(

n

yx

yxyyxxS

n

ii

n

iin

iii

n

iiixy

11

11

))((

Page 22: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression6-2.1 Least Squares Estimation

Page 23: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression6-2.1 Least Squares Estimation

Page 24: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Page 25: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Page 26: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression6-2.1 Least Squares Estimation

Page 27: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.1 Least Squares Estimation

Page 28: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

Regression Assumptions and Model Properties

Page 29: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

Regression Assumptions and Model Properties

Page 30: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

Regression and Analysis of Variance

Page 31: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

Regression and Analysis of Variance

Page 32: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

Use of t-TestsSuppose we wish to test

An appropriate test statistic would be

Page 33: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

Use of t-Tests

We would reject the null hypothesis if

Page 34: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

Use of t-Tests

Suppose we wish to test

An appropriate test statistic would be

Page 35: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

Use of t-Tests

We would reject the null hypothesis if

Page 36: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

Use of t-Tests

An important special case of the hypotheses of Equation 6-23 is

These hypotheses relate to the significance of regression.

Failure to reject H0 is equivalent to concluding that there is no linear relationship between x and Y.

Page 37: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

Use of t-Tests

Page 38: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

Use of t-Tests

Page 39: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

The Analysis of Variance Approach

Page 40: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.2 Testing Hypothesis in Simple Linear Regression

The Analysis of Variance Approach

Page 41: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.3 Confidence Intervals in Simple Linear Regression

Page 42: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.3 Confidence Intervals in Simple Linear Regression

Page 43: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

Page 44: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.4 Prediction of New Observations

Page 45: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.4 Prediction of New Observations

Page 46: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.5 Checking Model Adequacy

• Fitting a regression model requires several assumptions.

1. Errors are uncorrelated random variables with mean zero;

2. Errors have constant variance; and,

3. Errors be normally distributed.

• The analyst should always consider the validity of these assumptions to be doubtful and conduct analyses to examine the adequacy of the model

Page 47: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.5 Checking Model Adequacy

• The residuals from a regression model are ei = yi - ŷi , where yi is an actual observation and ŷi is the corresponding fitted value from the regression model.

• Analysis of the residuals is frequently helpful in checking the assumption that the errors are approximately normally distributed with constant variance, and in determining whether additional terms in the model would be useful.

Page 48: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.5 Checking Model Adequacy

Page 49: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.5 Checking Model Adequacy

Page 50: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.5 Checking Model Adequacy

Page 51: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.5 Checking Model Adequacy

Page 52: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.5 Checking Model Adequacy

Page 53: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.6 Correlation and Regression

The sample correlation coefficient between X and Y is

Page 54: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.6 Correlation and Regression

The sample correlation coefficient is also closely related to the slope in a linear regression model

Page 55: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-2 Simple Linear Regression

6-2.6 Correlation and Regression

It is often useful to test the hypotheses

The appropriate test statistic for these hypotheses is

Reject H0 if |t0| > t/2,n-2.

Page 56: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression6-3.1 Estimation of Parameters in Multiple Regression

Page 57: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression6-3.1 Estimation of Parameters in Multiple Regression

• The least squares function is given by

• The least squares estimates must satisfy

Page 58: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression6-3.1 Estimation of Parameters in Multiple Regression• The least squares normal equations are

• The solution to the normal equations are the least squares estimators of the regression coefficients.

Page 59: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

Page 60: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

Page 61: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

Page 62: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

Page 63: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

Page 64: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.1 Estimation of Parameters in Multiple Regression

Page 65: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.2 Inferences in Multiple RegressionTest for Significance of Regression

Page 66: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.2 Inferences in Multiple RegressionInference on Individual Regression Coefficients

•This is called a partial or marginal test

Page 67: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.2 Inferences in Multiple RegressionConfidence Intervals on the Mean Response and Prediction Intervals

Page 68: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.2 Inferences in Multiple RegressionConfidence Intervals on the Mean Response and Prediction Intervals

Page 69: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.2 Inferences in Multiple RegressionConfidence Intervals on the Mean Response and Prediction Intervals

Page 70: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.2 Inferences in Multiple RegressionA Test for the Significance of a Group of Regressors

Page 71: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model AdequacyResidual Analysis

Page 72: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model AdequacyResidual Analysis

Page 73: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model AdequacyResidual Analysis

Page 74: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model AdequacyResidual Analysis

Page 75: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model AdequacyResidual Analysis

Page 76: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model AdequacyInfluential Observations

Page 77: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model AdequacyInfluential Observations

Page 78: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression

6-3.3 Checking Model Adequacy

Page 79: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-3 Multiple Regression6-3.3 Checking Model AdequacyMulticollinearity

Page 80: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.1 Polynomial Models

Page 81: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.1 Polynomial Models

Page 82: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.1 Polynomial Models

Page 83: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.1 Polynomial Models

Page 84: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.1 Polynomial Models

Page 85: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.2 Categorical Regressors

• Many problems may involve qualitative or categorical variables.

• The usual method for the different levels of a qualitative variable is to use indicator variables.

• For example, to introduce the effect of two different operators into a regression model, we could define an indicator variable as follows:

Page 86: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.2 Categorical Regressors

Page 87: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.2 Categorical Regressors

Page 88: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.3 Variable Selection ProceduresBest Subsets Regressions

Page 89: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.3 Variable Selection ProceduresBackward Elimination

Page 90: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.3 Variable Selection ProceduresForward Selection

Page 91: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y

6-4 Other Aspects of Regression

6-4.3 Variable Selection ProceduresStepwise Regression

Page 92: 6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y