A Brief Introduction to Linear Regression


General Definitions

x : input, features, independent variable
y : output, response, labels, dependent variable
(xi, yi) : known (input, output) pairs used for training and evaluation

The mapping f(x) : X ⟶ Y is unknown.

[Diagram: the unknown f maps inputs x1, x2, x3, …, xi in X to outputs y1, y2, y3, …, yi in Y]

The problem:

• How do we approximate the unknown f(x)?
• We assume there is some hypothesis h(x) that can approximate it.
• Find the best function h(x):

h(x) ≈ f(x)


• We let the hypothesis h be linear:

hθ(x) = θ0 + θ1x1 + θ2x2 + θ3x3 + … + θNxN

• Find the parameters θ0, θ1, θ2, …, θN that define our hypothesis hθ.


Solution: Linear Regression

hθ(x) = θ0 + θ1x1 + θ2x2 + … + θNxN = ∑i θixi (sum from i = 0 to N, with the convention x0 = 1)

Cleaning things up

Considering x and θ to be vectors, we get:

hθ(x) = θ0x0 + θ1x1 + θ2x2 + … + θNxN

As a row vector: θᵀ = [θ0, θ1, θ2, …, θN]

hθ(x) = θᵀx (the signal)
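To make the vector form concrete, here is a minimal NumPy sketch (the names x, theta and h, and the explicit x0 = 1 intercept entry, are illustrative assumptions rather than anything from the slides):

```python
import numpy as np

# Hypothetical feature vector with N = 3 features, plus x0 = 1 for the intercept term.
x = np.array([1.0, 2.0, 0.5, -1.0])        # [x0, x1, x2, x3]
theta = np.array([0.3, 1.2, -0.7, 0.05])   # [theta0, theta1, theta2, theta3]

def h(theta, x):
    """Linear hypothesis h_theta(x) = theta^T x."""
    return theta @ x

print(h(theta, x))  # a single scalar prediction
```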

Now let's find θ.

The Cost/Loss function:

We define:

J(θ) = ½ ∑i ( hθ(xi) − yi )²

This method is ordinary least squares (OLS).

J(θ) outputs the cost/error of our hypothesis in terms of θ.

Since the J(θ) we chose is quadratic, we are guaranteed the existence of a minimum.
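A minimal sketch of computing J(θ) over a training set, assuming the examples are stacked row-wise in a design matrix X whose first column is all ones (the names cost, X, y, theta are illustrative):

```python
import numpy as np

def cost(theta, X, y):
    """Ordinary least squares cost J(theta) = 1/2 * sum_i (h_theta(x_i) - y_i)^2."""
    residuals = X @ theta - y      # h_theta(x_i) - y_i for every training example
    return 0.5 * np.sum(residuals ** 2)

# Tiny made-up example: 3 examples, 1 feature plus the intercept column of ones.
X = np.array([[1.0, 1.0],
              [1.0, 2.0],
              [1.0, 3.0]])
y = np.array([1.0, 2.0, 3.0])
print(cost(np.array([0.0, 1.0]), X, y))  # 0.0, since y = x exactly
```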

How do we find the minimum?

Gradient Descent:

We start from an initial guess for θ and then iterate as follows:

θj := θj − α ∂J(θ)/∂θj

θj := θj − α ∂/∂θj [ ½ ∑i ( hθ(xi) − yi )² ]

After differentiating (for a single training example (xi, yi)) we get:

θj := θj + α ( yi − hθ(xi) ) xij

where xij is the j-th feature of example xi.

α is the learning rate.
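For reference, here is the differentiation step written out for a single training example; it is the standard chain-rule computation that the slide leaves implicit:

```latex
\frac{\partial}{\partial \theta_j}\,\tfrac{1}{2}\bigl(h_\theta(x) - y\bigr)^2
  = \bigl(h_\theta(x) - y\bigr)\,\frac{\partial h_\theta(x)}{\partial \theta_j}
  = \bigl(h_\theta(x) - y\bigr)\, x_j ,
\qquad\text{so}\qquad
\theta_j := \theta_j - \alpha\,\bigl(h_\theta(x) - y\bigr)\, x_j
          = \theta_j + \alpha\,\bigl(y - h_\theta(x)\bigr)\, x_j .
```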

Two ways to descend:

Stochastic descent:

Repeat {
    for i = 1 to n {
        θj := θj + α ( yi − hθ(xi) ) xij    (for every j)
    }
}

Batch descent:

Repeat until convergence {
    θj := θj + α ∑i ( yi − hθ(xi) ) xij    (for every j)
}
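Here is a small NumPy sketch of both variants (the learning rate, iteration counts and data are illustrative assumptions; the update rule itself is the one derived above):

```python
import numpy as np

def sgd(X, y, alpha=0.01, epochs=100):
    """Stochastic gradient descent: update theta after looking at a single example."""
    theta = np.zeros(X.shape[1])
    for _ in range(epochs):
        for i in range(X.shape[0]):
            error = y[i] - X[i] @ theta       # y_i - h_theta(x_i)
            theta += alpha * error * X[i]     # update every theta_j at once
    return theta

def batch_gd(X, y, alpha=0.01, iters=1000):
    """Batch gradient descent: sum the gradient over the whole training set."""
    theta = np.zeros(X.shape[1])
    for _ in range(iters):
        errors = y - X @ theta                # vector of y_i - h_theta(x_i)
        theta += alpha * (X.T @ errors)       # sum_i (y_i - h_theta(x_i)) * x_ij
    return theta

# Tiny made-up data: y = 1 + 2*x, with an intercept column of ones.
X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([1.0, 3.0, 5.0, 7.0])
print(sgd(X, y), batch_gd(X, y))  # both approach [1., 2.]
```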

Batch vs Stochastic:

• Batch Gradient Descent (BGD) has to scan through the entire training set before making any progress.
• BGD is very costly for large data sets.
• Stochastic Gradient Descent (SGD) can start making progress right away and typically approaches the minimum faster.
• SGD might not converge exactly, though: it can keep oscillating around the minimum.

A Closed form solution:

We define the design matrix X, whose i-th row is xiᵀ, and the target vector y = (y1, …, yn)ᵀ.

Then we solve ∇J(θ) = 0 for θ, which gives the normal equation:

θ = (XᵀX)⁻¹ Xᵀ y
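A sketch of the closed-form solution, assuming the same design-matrix convention as above (for ill-conditioned XᵀX, np.linalg.lstsq is the more robust choice in practice):

```python
import numpy as np

def normal_equation(X, y):
    """Solve grad J(theta) = 0 exactly: theta = (X^T X)^{-1} X^T y."""
    return np.linalg.solve(X.T @ X, X.T @ y)

X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([1.0, 3.0, 5.0, 7.0])
print(normal_equation(X, y))   # approximately [1., 2.]
```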

Visualising things:

[Plots: the quadratic cost surface J(θ) and the path gradient descent takes toward its minimum]

Where to go from here?

✘ Change the form of h(x) (Logistic Regression).
✘ Change the Cost/Loss J(θ) (e.g. Locally Weighted Regression, which is non-parametric).
✘ Consider probability.
✘ Go from regression to classification.
✘ Preprocess the data (Dimensionality Reduction).

THANKS! Any questions?
You can find me at: contact@nidhalselmi.com

June 2015
