17
Get out p. 193 HW and notes

Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Embed Size (px)

DESCRIPTION

Interpreting Computer Regression Output A number of statistical software packages produce similar regression output. Be sure you can locate the slope b the y intercept a the values of s and r 2

Citation preview

Page 1: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Get out p. 193 HW and notes

Page 2: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

LEAST-SQUARES REGRESSION3.2 Interpreting Computer Regression Output

Page 3: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Interpreting Computer Regression Output

A number of statistical software packages produce similar regression output. Be sure you can locate

• the slope b• the y intercept a• the values of s and r2

Page 4: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Interpreting Computer Regression Output

A number of statistical software packages produce similar regression output. Be sure you can locate

• the slope b• the y intercept a• the values of s and r2

Page 5: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Example, p. 181 & 182A random sample of 15 high school students was selected from the U.S. CensusAtSchool database. The foot length (in cm) and height (in cm) of each student in the sample were recorded.

Page 6: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Example, p. 181 & 182(a) What is the equation of the least-squares regression line that describes the

relationship between foot length and height? Define any variables that you use.

where x = foot length and y = height.

OR

Page 7: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Example, p. 181 & 182(c) Find the correlation.

Take the square root of r2 = .486.

Because the scatterplot showed a positive relationship, r = 0.697.

Page 8: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Regression to the Mean

We have data on an explanatory variable x and a response variable y for n individuals. From the data, calculate the means and the standard deviations of the two variables and their correlation r.

The least-squares regression line is the line ŷ = a + bx with slope

And y intercept

How to Calculate the Least-Squares Regression Line

Page 9: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Example, p. 183Using Feet to Predict Height. The mean and standard deviations of the foot lengths are cm and cm. The mean and standard deviation of the heights are cm and cm. The correlation between foot length and height is .

Problem: Find the equation for the least-squares regression line for predicting height from foot length. Show your work.

Slope:

Y-intercept:

LSRL: , where x = foot length and y = height.

Page 10: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Correlation and Regression Wisdom

1. The distinction between explanatory and response variables is important in regression.

Page 11: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Correlation and Regression Wisdom2. Correlation and regression lines describe only linear relationships.

r = 0.816. r = 0.816.

r = 0.816.r = 0.816.

Page 12: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Correlation and Regression Wisdom3. Correlation and least-squares regression lines are not resistant.

Page 13: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Correlation and Regression Wisdom4. Association does not imply causation.

Page 14: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Outliers and Influential Observations in Regression

An outlier is an observation that lies outside the overall pattern of the other observations. Points that are outliers in the y direction but not the x direction of a scatterplot have large residuals. Other outliers may not have large residuals.

An observation is influential for a statistical calculation if removing it would markedly change the result of the calculation. Points that are outliers in the x direction of a scatterplot are often influential for the least-squares regression line.

Page 15: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Outliers Influential

Page 16: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

Example p. 190• The strong influence of Child 18 makes the original

regression of Gesell score on age at first word misleading. The original data have r2 = 0.41, which means the age a child begins to talk explains 41% of the variation on a later test of mental ability. This relationship is strong enough to be interesting to parents. If we leave out Child 18, r2 drops to only 11%. The apparent strength of the association was largely due to a single influential observation.

Page 17: Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output

HW Due: Friday• P. 196 #59, 61 a, 63, 72, 73