Linear Regression with One Predictor variable

Linear Regression with One Predictor variable

KNNL – Chapter 1

Model – Error Distribution Unspecified

0 1

0 1

2 2

1,...,

Response on the trial, parameters (intercept and slope of line)

known constant, value of predictor variable on the trial

Random error term 0 ,

i i i

thi

thi

i i i i j

Y X i n

Y i

X i

E

0 1 0 1 0 1 0 1

2 2 2 20 1

0 1 0 1

*0 1 1 0 1

0

(Based on Rules in Appendix A)0

, , , 0

Alternative Form:

i i i i i i i

i i i i

i j i i j j i j

i i i i i

i j

E Y E X X E X X

Y X

Y Y X X i j

Y X X X X X

*0 0 1X

Least Squares Estimation - I

0 1

0 1

0 1

220 1

1 1

0 1 0 1

0 10

1,...,

function of unknown , and observed data

Goal: select values of , that minimize and label them as ,

: 2 1

i i i

i i i

n n

i i ii i

i ii

Y X i n

Y X

Q Y X

Q b bQi Y X

set

0 11 1 1

set2

0 1 0 11 1 1 11

0

: 2 0

n n n

i ii i

n n n n

i i i i i i ii i i i

Y nb b X

Qii Y X X X Y b X b X

Least Squares Estimation - II

1

22

11 1 1 1 1

2

121 11

1 1

1

Solving (by multiplying by and by and taking ) :n

ii

n n n n n

i i i i i ii i i i i

nn n

ii in nii i

i i ii i

i

i X ii n ii i

n X Y X Y b n X X

XX YX Y b X

n n

X Yb

1 1

1 12 2

1 11

2 1

1

0 11 1

1From :

n n

i in ni i

i i i n nii i XYi i inn i iXX XXiiin

ii

i

n nii i i

i iXX

X YX X Y Y X XSSn Y k Y

SS SSX XXX

n

X X Xi b Y b X Y lY

n SS

Fitted Values and Residuals 0 1 0 1

^

0 1

^

0 1

True Regression Function: (Unknown, since , parameters)

Estimated Regression Function (Fitted):

For the observation: 1,...,

Residuals:Differences between ob

thi i

E Y X

Y b b X

i Y b b X i n

^

0 1

1

1

^

0 1 0 11 1 1 1

served and fitted (predicted) values:

1,...,

Properties of Residuals:

0 (From LS eq )

0 (From LS eq )

0

ii i i i

n

ii

n

i ii

n n n n

i i i i i i ii i i i

e Y Y Y b b X i n

e i

X e ii

Y e b b X e b e b X e

Estimating Error Variance 2

2 22 2 2

0 1

^

0 1

2

2^2

2 1 1

0

unobservable since

We use residual to "estimate"

Obtain the "average" squared residual to estimate :

2 2 2

n n

ii ii i

E E E E

Y X

e

e Y Y Y b b X

e Y YSSEs M

n n n

SE

Normal Error Model

20 1

2

0 1

2/2 0 12

0 111

2^ ^ ^

0 1

1,..., ~ 0, (independent)

1 1exp 1,...,22

1, , 2 exp2

Goal: Choose values , , th

i i i i

i ii i

n nn i in

iii

Y X i n N

y Xf y f i n

y XL f

2

0 12

1

2

0 10 1

1

^ ^

0 10 1

2 2

at maximize (or equivalently ln( )) :

1ln 2 ln2 2 2

Note: maximizing wrt , is same as minimizing

,

1 112 2

ni i

i

ni i

i

L l L

y Xn nl

y Xl

b b

yl n

2^ ^22 0 12^

0 1 21 1221

20

n n

i i in seti i i i

i

y X eX n sn n n

Documents

Linear Regression with One Predictor variable