
CS 330 - Artificial Intelligence - Logistic and linear regression

Instructor: Renzhi Cao Computer Science Department

Pacific Lutheran University Fall 2018


Special appreciation to Tom Mitchell, Ian Goodfellow, Yoshua Bengio, Aaron Courville, Michael Nielsen, Andrew Ng, Katie Malone, Sebastian Thrun, Ethem Alpaydın, and Christopher Bishop.

Announcement

• The decision tree homework is due next Tuesday.
• Lab 3 is due on Sakai.
• Quiz next week; a study guide will be posted on Sakai.
• Practical machine learning session next Tuesday; bring your laptop.

Gaussian Naive Bayes - Big Picture

Logistic Regression

Idea:
• Naive Bayes allows computing P(Y|X) by learning P(Y) and P(X|Y).
• Why not learn P(Y|X) directly?
• What would be w0 and w1?
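A brief sketch of the parametric form logistic regression typically assumes for P(Y|X) (standard sigmoid parameterization, not taken from this slide; in the one-feature case, w_0 is the intercept and w_1 the feature weight):

    P(Y = 1 | X) = 1 / (1 + exp(-(w_0 + Σ_i w_i X_i)))
    P(Y = 0 | X) = 1 - P(Y = 1 | X)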

Lecture Notes for E Alpaydın 2010 Introduction to Machine Learning 2e © The MIT Press (V1.0)

Gradient-Descent

Δw_i = -η ∂E/∂w_i, for all i
w_i ← w_i + Δw_i

[Figure: one step of gradient descent on the error surface E(w), moving from w^t to w^(t+1) with step size η and decreasing the error from E(w^t) to E(w^(t+1)).]

Number of parameters to estimate (n features): Gaussian Naive Bayes needs 4n + 1, logistic regression needs n + 1.
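A minimal Python sketch of the gradient-descent update rule above, using a made-up one-dimensional error function E(w) = (w - 3)^2; the function, step size, and iteration count are illustrative assumptions, not from the slides:

    import numpy as np

    def gradient_descent_step(w, grad_E, eta=0.1):
        # One update: delta_w_i = -eta * dE/dw_i, then w_i <- w_i + delta_w_i
        return w - eta * grad_E(w)

    # Toy error function: E(w) = (w - 3)^2, so dE/dw = 2 * (w - 3); minimum at w = 3.
    grad_E = lambda w: 2.0 * (w - 3.0)

    w = np.array([0.0])
    for t in range(50):
        w = gradient_descent_step(w, grad_E)
    print(w)  # converges toward 3.0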

Regression

So far, we've been interested in learning P(Y|X) where Y has discrete values (called 'classification').

What if Y is continuous? (called 'regression')
• predict weight from gender, height, age, …
• predict Google stock price today from Google, Yahoo, MSFT prices yesterday
• predict each pixel intensity in the robot's current camera image, from the previous image and previous action

Regression

Wish to learn f: X → Y, where Y is real-valued, given training data {<x1,y1>, …, <xn,yn>}.

Approach:
1. choose some parameterized form for P(Y|X; θ) (θ is the vector of parameters)
2. derive a learning algorithm as the MCLE or MAP estimate for θ
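As a sketch of what steps 1-2 amount to, with superscript l indexing training examples (notation assumed here, not shown on the slide):

    θ_MCLE = argmax_θ ∏_l P(y^l | x^l, θ)
    θ_MAP  = argmax_θ P(θ) ∏_l P(y^l | x^l, θ)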

1. Choose parameterized form for P(Y|X; θ)

Assume Y is some deterministic f(X), plus random noise ε:

Y = f(X) + ε, where ε ~ N(0, σ²)

Therefore Y is a random variable that follows the distribution

P(y|x) = N(f(x), σ²)

and the expected value of y for any given x is f(x).

[Figure: training pairs (X, Y) scattered around the curve f(X).]

Consider Linear Regression

E.g., assume f(x) is a linear function of x.

Notation: to make our parameters explicit, let's write

f(x) = w_0 + Σ_i w_i x_i, with W = <w_0, w_1, …, w_n>
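For instance, with hypothetical weights w_0 = 1 and w_1 = 2 on a single feature x, f(x) = 1 + 2x, so f(3) = 7.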

Training Linear Regression

How can we learn W from the training data?

Training Linear Regression

Learn the Maximum Conditional Likelihood Estimate!

W_MCLE = argmax_W ∏_l P(y^l | x^l, W)

where

P(y^l | x^l, W) = N(f(x^l), σ²)

so maximizing the conditional likelihood is equivalent to minimizing the sum of squared prediction errors:

W_MCLE = argmin_W Σ_l (y^l − f(x^l))²

Can we derive a gradient descent rule for training?
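A minimal Python sketch of such a training rule, assuming the squared-error objective E(W) = ½ Σ_l (y^l − W·x^l)²; the synthetic data, learning rate, and epoch count are illustrative assumptions:

    import numpy as np

    # Synthetic data (illustrative): y = f(x) + Gaussian noise, with f linear in x.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 3))                 # 100 examples, 3 features
    true_w = np.array([2.0, -1.0, 0.5])
    y = X @ true_w + rng.normal(scale=0.1, size=100)

    # Gradient descent on E(W) = 0.5 * sum_l (y^l - W . x^l)^2 (intercept omitted for brevity).
    w = np.zeros(3)
    eta = 0.1
    for epoch in range(500):
        residual = y - X @ w                      # y^l - W . x^l for every example
        grad = -(X.T @ residual) / len(y)         # averaged dE/dW, for a stable step size
        w = w - eta * grad                        # delta W = -eta * dE/dW
    print(w)                                      # approaches true_w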

Summary

• Learning is an optimization problem once we choose our objective function:
  • maximize data likelihood
  • maximize posterior probability of W
• We use gradient descent as a general learning algorithm to learn the weights.

Discussion about progress of literature review

• Around 20 minutes of discussion between groups.
• One group member presents the current progress, plan, and issues.
• (https://www.cs.plu.edu/~caora/cs330/Materials/fall2018/groups)
• (https://www.cs.plu.edu/~caora/cs330/Materials/fall2018/LiteratureReview_requirement.pdf)

Extra slides (Not required to understand)

Linear Equations

Y = mX + b, where m = slope = (change in Y) / (change in X) and b = Y-intercept.

[Figure: a straight line illustrating the slope m and intercept b.]
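For example, with hypothetical values m = 2 and b = 1, Y = 2X + 1: increasing X by 1 increases Y by 2, and the line crosses the Y axis at 1.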

Regression - Summary

Under general assumptions:

1. MLE corresponds to minimizing the sum of squared prediction errors
2. MAP estimate minimizes SSE plus the sum of squared weights
3. Again, learning is an optimization problem once we choose our objective function
   • maximize data likelihood
   • maximize posterior probability of W
4. Again, we can use gradient descent as a general learning algorithm
   • as long as our objective function is differentiable with respect to W
   • though we might learn local optima instead of global optima
5. Almost nothing we said here required that f(x) be linear in x

How about MAP instead of MLE estimate?
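A sketch of what the MAP estimate looks like, assuming a zero-mean Gaussian prior on the weights (this yields the "SSE plus sum of squared weights" objective from point 2 above; λ is a constant determined by the prior's variance):

    W_MAP = argmin_W [ Σ_l (y^l − f(x^l))² + λ Σ_i w_i² ]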
