23
Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S.

Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Embed Size (px)

Citation preview

Page 1: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Linear Correlation

To accompany Hawkes lesson 12.1Original content by D.R.S.

Page 2: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Linear Correlation

• Input: A bunch of data points• Take two measurements from each

member in your sample.• Example: Weight and Blood Pressure

• Output: “There is / is not a significant linear relationship between and .

Page 3: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Visual Assessment of Correlation

• A Scatter Plot of the (x,y) ordered pairs in your sample data can give you a notion of what the relationship might be.

• Do the points line up in a straight line?– Or in sort-of a straight-ish line?– Or all over the place with no apparent relationship

between x and y?– Or in a curvy curve pattern?

Page 4: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Types of Relationships:

HAWKES LEARNING SYSTEMS

math courseware specialists

Regression, Inference, and Model Building

12.1 Scatter Plots and Correlation

Strong Linear

Relationship

Non-LinearRelationship

NoRelationship

Weak LinearRelationship

Page 5: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

The Horses Example

• Some horses were measured– Height (in hands?), Girth (inches), Length (inches),

Weight (pounds)– Put these data

values into your TI-84 lists L1, L2, L3, L4.

• Original data source and idea for this problem is “Elementary Statistics” by Johnson & Kuby, 10th Edition, © Brooks-Cole-Thomson, Page 702.

Page 6: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Question: “Is Girth related to Weight?”

• We wonder: is the girth of a horse related to its weight? Significantly so?

• ρ (Greek letter rho) is the population parameter for the Correlation Coefficient

• r (our alphabet’s letter r) is the sample statistic for the Correlation Coefficient

• We use our sample r to estimate the population’s parameter ρ

Page 7: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

The Correlation Coefficient

• If , it means there is absolutely no relationship between Girth () and Height ()

• If , it means there is perfect positive correlation between girth and height.

• If , it means there is perfect negative correlation between girth and height.

• There’s an awful formula to compute .• Remember: sample estimates population .

Page 8: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

• Pearson Correlation Coefficient, – the parameter that measures the strength of a linear relationship for the population.

• Correlation Coefficient, r – measures how strongly one variable is linearly dependent upon the other for a sample.

Correlation coefficient:

HAWKES LEARNING SYSTEMS

math courseware specialists

Regression, Inference, and Model Building

12.1 Scatter Plots and Correlation

When calculating the correlation coefficient, round your answers to three decimal places.

Page 9: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

HAWKES LEARNING SYSTEMS

math courseware specialists

Regression, Inference, and Model Building

12.1 Scatter Plots and Correlation

• –1 ≤ r ≤ 1

• Close to –1 means a strong negative correlation.

• Close to 0 means no correlation.

• Close to 1 means a strong positive correlation.

Page 10: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Hypothesis Test for significant

• Null Hypothesis: “No relationship”• Alternative:

“But there IS a significant relationship!”• There’s some level of significance specified in

advance, like or • It involves calculating a value and finding “what

is the -value of this ?”• And if -value < , reject the null hypothesis

– If so, then we say “Yes, significant relationship!”

Page 11: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Hypothesis Test for significant

• Usually we do this two-tailed test:– Null Hypothesis : “No relationship”– Alternative Hypothesis: , “There is a significant

linear relationship.”• Be aware of a couple one-tailed variations:

– Test for significant POSITIVE correlation only:using and

– Test for significant NEGATIVE correlation only:using and

Page 12: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

“Is a horse’s Girth significantly correlated to its Weight?”

• Here’s how we do the Hypothesis Test for

• Let’s suppose that level of significance , requiring strong evidence.

• STATS, TESTS, F:LinRegTTest– Shortcut instead of scrolling: ALPHA F directly.– But it might be option E on TI-83/Plus.

Page 13: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

LinRegTTest inputs

• Here are the inputs:

• Xlist and Ylist – where you put the data– Shortcut: 2ND 2 puts L2

• Freq: 1 (unless…)

• β & : ≠ 0– This is the Alternative

Hypothesis

• RegEq: VARS, right arrow to Y-VARS, 1, 1– Just put it in for later

• Highlight “Calculate”• Press ENTER

Page 14: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

LinRegTTest Outputs, first screen

• t= the t statistic value for this test (the formula is in the book)

• p = the p-value for this t test statistic

• in this kind of a test• later – for regression

Page 15: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

LinRegTTest Outputs, second screen

• b later, for Regression• s much later, for

advanced Regression

• r2 = how much of the output variable (weight) is explained by the input variable (girth)

• r = the correlation coefficient for the sample– Close to – strong

positive relationship– Or – strong negative

Page 16: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Making the Decision

• We will use the p-value method.• Compare the -value (as calculated by the TI-84)

to the Level Of Significance value for this experiment.

• In this example, (it was chosen during the design of the experiment) and the calculator computed p=5.3448432E-5 ,

• Since , reject the null hypothesis. There IS a significant linear rel. between girth and height.

Page 17: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

How did the calculator get r and r2?

• Here is the awful formula:

Page 18: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

How did the calculator compute t ?

• Here is the awful formula:

Page 19: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Another test: Girth and Length

• Is there a significant relationship between a horse’s girth and length?

• What do you expect?– Think about people: do you expect a significant

relationship between waist size and height?

Page 20: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

TI-84 Inputs and Outputsfor the Girth and Length question

Inputs• (Data already in lists)

OutputsFirst screen

Second screen

Page 21: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Girth and Length conclusions

Conclusions• What does the tell you in

this particular case?

• At the level of significance, is there a significant relationship between a horse’s girth and his length?

OutputsFirst screen

Second screen

Page 22: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

An extra problem type in Hawkes

• They tell you three pieces of information:– The Level of Significance chosen, – The correlation coefficient calculated, – The sample size,

• They ask you “Is this significant?”• Use Table I on Page 777 to determine this.

Lookup in column and row.• If , then Yes, significant.

Page 23: Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S

Determine the significance:

HAWKES LEARNING SYSTEMS

math courseware specialists

Regression, Inference, and Model Building

12.1 Scatter Plots and Correlation

a. r = 0.52, n = 19, a = 0.05

r = 0.456, Yes

b. r = 0.52, n = 19, a = 0.01

r = 0.575, No

c. r = –0.44, n = 35, a = 0.01

r = 0.430, Yes

Determine whether the following values of r are statistically significant.