
CSC411: Midterm Review

Xiaohui Zeng

February 7, 2019

Built from slides by James Lucas, Joe Wu and others

Agenda

1. A brief overview

2. Some sample questions


Basic ML Terminology

- Regression

- Overfitting

- Generalization

- Stochastic Gradient Descent (SGD)

- Classification

- Underfitting

- Regularization

- Bayes Optimal

Basic ML Terminology

- Training Data

- Validation Data

- Test Data

- Optimization

- 0-1 Loss

- Linear classifier

- Features

- Model


Some Questions

Question

1. Bagging improves performance by reducing variance.

2. Given discrete random variables X and Y, the Information Gain in Y due to X is

   IG(Y|X) = H(Y) − H(Y|X),

   where H is the entropy.
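To make the definition concrete, here is a minimal Python sketch (my own illustration, not from the slides) that computes IG(Y|X) from a table of joint counts; the toy count matrix is hypothetical.

```python
import numpy as np

def entropy(p):
    """Entropy in bits of a probability vector p (zero entries are skipped)."""
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

def information_gain(joint_counts):
    """IG(Y|X) = H(Y) - H(Y|X), from a |X| x |Y| table of joint counts."""
    joint = joint_counts / joint_counts.sum()   # P(X, Y)
    p_x = joint.sum(axis=1)                     # marginal P(X)
    p_y = joint.sum(axis=0)                     # marginal P(Y)
    # H(Y|X) = sum_x P(x) * H(Y | X = x)
    h_y_given_x = sum(p_x[i] * entropy(joint[i] / p_x[i])
                      for i in range(len(p_x)) if p_x[i] > 0)
    return entropy(p_y) - h_y_given_x

# Hypothetical counts: rows index values of X, columns index values of Y.
counts = np.array([[8.0, 2.0],
                   [1.0, 9.0]])
print(information_gain(counts))   # positive: knowing X reduces uncertainty about Y
```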

Some Questions

Question 2

Take labelled data (X, y).

1. Why should you use a validation set?

2. How do you know if your model is overfitting?

3. How do you know if your model is underfitting?
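A minimal sketch of how these checks look in practice, assuming scikit-learn is available (the data, model, and split sizes below are all hypothetical): hold out a validation set that the model never trains on, then compare training and validation error.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error

# Hypothetical labelled data (X, y).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = X @ rng.normal(size=5) + 0.1 * rng.normal(size=200)

# Hold out a validation set that the model never trains on.
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.25, random_state=0)

model = Ridge(alpha=1.0).fit(X_tr, y_tr)
train_err = mean_squared_error(y_tr, model.predict(X_tr))
val_err = mean_squared_error(y_val, model.predict(X_val))

# Rough diagnostics:
#   training error much lower than validation error -> likely overfitting
#   both errors high                                -> likely underfitting
print(f"train MSE = {train_err:.3f}, val MSE = {val_err:.3f}")
```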


ML Models

1. Nearest Neighbours

2. Decision Trees

3. Ensembles

4. Linear Regression

5. Logistic Regression

6. SVMs


Nearest Neighbours

1. Decision Boundaries

2. Choice of k vs. Generalization

3. Curse of dimensionality
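For reference, a minimal NumPy sketch of k-nearest-neighbour classification (my own illustration; the toy data are hypothetical).

```python
import numpy as np

def knn_predict(X_train, y_train, x_query, k=3):
    """Classify x_query by majority vote among its k nearest training points."""
    dists = np.linalg.norm(X_train - x_query, axis=1)   # Euclidean distances
    nearest = np.argsort(dists)[:k]                      # indices of the k closest points
    values, counts = np.unique(y_train[nearest], return_counts=True)
    return values[np.argmax(counts)]

# Toy usage: two clusters labelled 0 and 1.
X_train = np.array([[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]])
y_train = np.array([0, 0, 1, 1])
print(knn_predict(X_train, y_train, np.array([0.95, 0.9]), k=3))   # -> 1
```

Larger k gives smoother decision boundaries (lower variance, higher bias); in high dimensions, distances concentrate and nearest neighbours become less informative.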

Decision Trees

1. Entropy: H(X), H(Y|X)

2. Information Gain

3. Decision Boundaries

Bayes Optimality

Starting with squared error: E[(y − t)² | x] = E[y² − 2yt + t² | x]

1. Choose a single value y* based on p(t|x):

   E[(y − t)² | x] = (y − E[t|x])² + Var(t|x)

2. Treat y as a random variable:

   - bias term
   - variance term
   - Bayes error
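Written out, the decomposition that the second item refers to is the standard bias-variance split (a reconstruction, assuming the prediction y, which depends on the random training set, is independent of t given x):

$$
\mathbb{E}\bigl[(y - t)^2 \mid x\bigr]
  = \underbrace{\bigl(\mathbb{E}[y] - \mathbb{E}[t \mid x]\bigr)^2}_{\text{bias}^2}
  + \underbrace{\operatorname{Var}(y)}_{\text{variance}}
  + \underbrace{\operatorname{Var}(t \mid x)}_{\text{Bayes error}}
$$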


Ensembles

1. Bagging / bootstrap aggregation

2. Boosting

   - decision stumps
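A minimal sketch of bootstrap aggregation (my own illustration, assuming scikit-learn; the toy regression data are hypothetical): fit the same base model on bootstrap resamples and average the predictions, which reduces variance.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X[:, 0]) + 0.3 * rng.normal(size=200)   # noisy toy data

def bagged_predict(X_train, y_train, X_test, n_models=25):
    """Average the predictions of trees fit on bootstrap resamples of the data."""
    n = len(X_train)
    preds = []
    for _ in range(n_models):
        idx = rng.integers(0, n, size=n)                   # sample n points with replacement
        tree = DecisionTreeRegressor().fit(X_train[idx], y_train[idx])
        preds.append(tree.predict(X_test))
    return np.mean(preds, axis=0)                          # aggregate by averaging

X_test = np.linspace(-3, 3, 5).reshape(-1, 1)
print(bagged_predict(X, y, X_test))
```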

Linear Regression

1. Loss function

2. Direct solution

3. (Stochastic) Gradient Descent

4. Regularization

   - L1 vs. L2 norm
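The direct solution and (stochastic) gradient descent can be compared on a toy problem; the sketch below is my own illustration of L2-regularized least squares, with an arbitrary regularization strength and learning rate.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
t = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=100)
lam = 0.1   # L2 regularization strength

# Direct solution of the regularized normal equations:
#   w = (X^T X + lam * I)^{-1} X^T t
w_direct = np.linalg.solve(X.T @ X + lam * np.eye(3), X.T @ t)

# Stochastic gradient descent on the same objective, one example at a time.
w_sgd = np.zeros(3)
lr = 0.01
for epoch in range(50):
    for i in rng.permutation(len(X)):
        grad = 2 * (X[i] @ w_sgd - t[i]) * X[i] + 2 * lam * w_sgd / len(X)
        w_sgd -= lr * grad

print(w_direct, w_sgd)   # the two estimates should roughly agree
```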

Logistic Regression

1. Loss functions

   - 0-1 loss?
   - L2 loss?
   - cross-entropy loss?

2. Binary vs. Multi-class

3. Decision Boundaries

   - p = 1 / (1 + e^(−θᵀx))
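A minimal sketch of binary logistic regression trained with the cross-entropy loss (my own illustration; the 1-D toy data, learning rate, and iteration count are hypothetical). The decision boundary is where σ(θᵀx) = 0.5, i.e. θᵀx = 0.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cross_entropy(p, t):
    """Average binary cross-entropy between predicted probabilities p and labels t in {0, 1}."""
    eps = 1e-12
    return -np.mean(t * np.log(p + eps) + (1 - t) * np.log(1 - p + eps))

# Toy data: label 1 when the feature is positive.
rng = np.random.default_rng(0)
x = rng.normal(size=(200, 1))
t = (x[:, 0] > 0).astype(float)
X = np.hstack([x, np.ones((200, 1))])        # append a bias column

theta = np.zeros(2)
lr = 0.5
for _ in range(200):                         # full-batch gradient descent
    p = sigmoid(X @ theta)
    theta -= lr * (X.T @ (p - t) / len(t))   # gradient of the cross-entropy loss

print(theta, cross_entropy(sigmoid(X @ theta), t))
```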

SVMs

1. Hinge loss

2. Margins

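For reference, a small sketch of the soft-margin SVM objective with the hinge loss (my own illustration; the weights and toy points are arbitrary). Points with t(wᵀx + b) ≥ 1 lie on the correct side of the margin and contribute zero loss.

```python
import numpy as np

def svm_objective(w, b, X, t, C=1.0):
    """Soft-margin objective: ||w||^2 / 2 + C * sum(max(0, 1 - t * (Xw + b))), labels t in {-1, +1}."""
    margins = t * (X @ w + b)
    return 0.5 * np.dot(w, w) + C * np.sum(np.maximum(0.0, 1.0 - margins))

# Three points with label +1: outside the margin, inside the margin, misclassified.
X = np.array([[2.0, 0.0], [0.2, 0.0], [-1.0, 0.0]])
t = np.array([1.0, 1.0, 1.0])
print(svm_objective(np.array([1.0, 0.0]), 0.0, X, t))   # 0.5 + (0 + 0.8 + 2.0) = 3.3
```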

Sample Question 1

First, we use a linear regression method to model this data. To test our linear regressor, we choose at random some data records to be a training set, and choose at random some of the remaining records to be a test set. Now let us increase the training set size gradually. As the training set size increases, what do you expect will happen with the mean training and mean testing errors? (No explanation required)

- Mean Training Error: A. Increase; B. Decrease

- Mean Testing Error: A. Increase; B. Decrease

Q1 Solution

- The training error tends to increase. As more examples have to be fitted, it becomes harder to 'hit', or even come close to, all of them.

- The test error tends to decrease. As we take into account more examples when training, we have more information, and can come up with a model that better resembles the true behavior. More training examples lead to better generalization.


Sample Question 2

If variables X and Y are independent, is I(X|Y) = 0? If yes, prove it. If no, give a counterexample.

Q2 Solution

Recall that

- Two random variables X and Y are independent if, for all x ∈ Values(X) and all y ∈ Values(Y), P(X = x, Y = y) = P(X = x)P(Y = y).

- H(X) = −∑_x P(x) log₂ P(x)

Q2 Solution

$$
\begin{aligned}
I(X \mid Y) &= H(X) - H(X \mid Y) && (1) \\
&= -\sum_x P(x)\log_2 P(x) - \Bigl(-\sum_y \sum_x P(x, y)\log_2 P(x \mid y)\Bigr) && (2) \\
&= -\sum_x P(x)\log_2 P(x) - \Bigl(-\sum_y P(y) \sum_x P(x)\log_2 P(x)\Bigr) && (3) \\
&= -\sum_x P(x)\log_2 P(x) - \Bigl(-\sum_x P(x)\log_2 P(x)\Bigr) && (4) \\
&= 0 && (5)
\end{aligned}
$$

Step (3) uses independence, P(x, y) = P(y)P(x) and P(x|y) = P(x); step (4) uses ∑_y P(y) = 1.
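A quick numerical check of this result (my own addition, with an arbitrary pair of marginals): building the joint as an outer product of the marginals makes X and Y independent, and the information gain comes out as zero up to floating-point error.

```python
import numpy as np

def entropy(p):
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

# Independent joint distribution: P(x, y) = P(x) * P(y).
p_x = np.array([0.2, 0.5, 0.3])
p_y = np.array([0.6, 0.4])
joint = np.outer(p_x, p_y)

# H(X|Y) = sum_y P(y) * H(X | Y = y)
h_x_given_y = sum(p_y[j] * entropy(joint[:, j] / p_y[j]) for j in range(len(p_y)))

print(entropy(p_x) - h_x_given_y)   # ~0.0, matching the proof
```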

Sample Question 3

Given input x ∈ R^D and target t ∈ R, consider a linear model of the form

$$ y(\mathbf{x}, \mathbf{w}) = \sum_{i=1}^{D} w_i x_i. $$

Now suppose a noisy perturbation ε_i is added independently to each of the input variables x_i, i.e. x̃_i = x_i + ε_i. Assume

- E[ε_i] = 0
- E[ε_i ε_j] = 0 for i ≠ j
- E[ε_i²] = λ

We define the following objective that tries to be robust to noise:

$$ \mathbf{w}^* = \arg\min_{\mathbf{w}} \; \mathbb{E}_{\varepsilon}\bigl[(\mathbf{w}^\top \tilde{\mathbf{x}} - t)^2\bigr]. \tag{6} $$

Show that it is equivalent to minimizing L2-regularized linear regression, i.e.

$$ \mathbf{w}^* = \arg\min_{\mathbf{w}} \; \bigl[(\mathbf{w}^\top \mathbf{x} - t)^2 + \lambda \lVert \mathbf{w} \rVert^2\bigr]. $$

Q3 Solution

Let

$$ \tilde{y} = \sum_{i=1}^{D} w_i (x_i + \varepsilon_i) = y + \sum_{i=1}^{D} w_i \varepsilon_i, $$

where y = wᵀx = ∑_{i=1}^D w_i x_i.

Q3 Solution

Then we start with

w* = arg min_w E_ε[(wᵀx̃ − t)²] = arg min_w E_ε[(ỹ − t)²].

The inner term expands as

$$
\begin{aligned}
(\tilde{y} - t)^2 &= \tilde{y}^2 - 2\tilde{y}\,t + t^2 && (7) \\
&= \Bigl(y + \sum_{i=1}^{D} w_i \varepsilon_i\Bigr)^2 - 2t\Bigl(y + \sum_{i=1}^{D} w_i \varepsilon_i\Bigr) + t^2 && (8) \\
&= y^2 + 2y\sum_{i=1}^{D} w_i \varepsilon_i + \Bigl(\sum_{i=1}^{D} w_i \varepsilon_i\Bigr)^2 - 2ty - 2t\sum_{i=1}^{D} w_i \varepsilon_i + t^2 && (9)
\end{aligned}
$$

Then we take the expectation under the distribution of ε.

Q3 Solution

That is,

$$ \mathbb{E}_{\varepsilon}\Bigl[\,y^2 + 2y\sum_{i=1}^{D} w_i \varepsilon_i + \Bigl(\sum_{i=1}^{D} w_i \varepsilon_i\Bigr)^2 - 2ty - 2t\sum_{i=1}^{D} w_i \varepsilon_i + t^2\Bigr]. $$

The second and fifth terms have zero expectation since E[ε_i] = 0, while the third term becomes E_ε[(∑_i w_i ε_i)²] = λ ∑_i w_i², using E[ε_i ε_j] = 0 for i ≠ j and E[ε_i²] = λ.

Finally, we see

$$ \mathbb{E}_{\varepsilon}\bigl[(\tilde{y} - t)^2\bigr] = y^2 + \lambda \sum_{i=1}^{D} w_i^2 - 2ty + t^2 = (y - t)^2 + \lambda \sum_{i=1}^{D} w_i^2 = (\mathbf{w}^\top \mathbf{x} - t)^2 + \lambda \lVert \mathbf{w} \rVert^2, $$

which is exactly the L2-regularized objective.
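As a sanity check (my own addition, not part of the review; the dimensions and constants are arbitrary), a quick Monte Carlo sketch comparing the noise-averaged squared error against the L2-regularized one:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 5
x = rng.normal(size=D)
w = rng.normal(size=D)
t = 1.3
lam = 0.25   # noise variance, which plays the role of the L2 coefficient

# Monte Carlo estimate of E_eps[(w^T (x + eps) - t)^2] with E[eps_i^2] = lam.
eps = rng.normal(scale=np.sqrt(lam), size=(1_000_000, D))
noisy_obj = np.mean(((x + eps) @ w - t) ** 2)

# L2-regularized squared error on the clean input.
ridge_obj = (w @ x - t) ** 2 + lam * np.sum(w ** 2)

print(noisy_obj, ridge_obj)   # should agree up to Monte Carlo error
```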
