Linear Discriminant Functions

Wen-Hung Liao, 11/25/2008

Introduction: LDF

Assume we know the proper form of the discriminant functions, instead of the underlying probability densities. Use samples to estimate the parameters of the classifier (statistical or non-statistical). We will be concerned with discriminant functions that are either linear in the components of x, or linear in some given set of functions of x.

Why LDF?

Simplicity vs. accuracy
Attractive candidates for initial, trial classifiers
Related to neural networks

Approach

Find the LDF by minimizing a criterion function, using a gradient descent procedure for the minimization. Issues to consider:

Convergence properties
Computational complexity

Example of a criterion function: the sample risk, or training error. (Not appropriate, why?) Because a small training error does not guarantee a small test error.

LDF and Decision Surfaces

A linear discriminant function:

g(x) = w^t x + w0

where w is the weight vector and w0 is the bias or threshold weight.

Two-Category Case
Decision rule: Decide w1 if g(x) > 0; decide w2 if g(x) < 0.

In other words, x is assigned to w1 if the inner product wtx exceeds the threshold –w0.
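The two-category rule above can be sketched in a few lines of numpy; the weight values here are illustrative, not taken from the slides:

```python
import numpy as np

# Two-category decision rule: decide class 1 if g(x) = w^t x + w0 > 0,
# class 2 otherwise.
def decide(x, w, w0):
    g = w @ x + w0          # g(x) = w^t x + w0
    return 1 if g > 0 else 2

w = np.array([1.0, -2.0])   # illustrative weight vector
w0 = 0.5                    # illustrative bias
print(decide(np.array([3.0, 0.0]), w, w0))   # g = 3.5 > 0 -> class 1
print(decide(np.array([0.0, 1.0]), w, w0))   # g = -1.5 < 0 -> class 2
```

Equivalently, x is assigned to class 1 exactly when the inner product w^t x exceeds the threshold -w0.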

Decision Boundary

A hyperplane H is defined by g(x) = 0. If x1 and x2 are both on the decision surface, then:

w^t x1 + w0 = w^t x2 + w0
w^t (x1 - x2) = 0

so w is normal to any vector lying on the hyperplane.

Distance Measure

For any x,

x = xp + r (w / ||w||)

where xp is the normal projection of x onto H, and r is the algebraic distance. Since g(xp) = 0,

g(x) = w^t x + w0 = r ||w||

and therefore:

r = g(x) / ||w||
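A minimal sketch of the signed distance r = g(x)/||w||, with illustrative weights chosen so the arithmetic is easy to check by hand:

```python
import numpy as np

# Signed distance from x to the hyperplane g(x) = w^t x + w0 = 0:
# r = g(x) / ||w||. Positive r means x lies on the side that w points toward.
def signed_distance(x, w, w0):
    return (w @ x + w0) / np.linalg.norm(w)

w = np.array([3.0, 4.0])     # ||w|| = 5 (illustrative values)
w0 = -5.0
x = np.array([3.0, 4.0])     # g(x) = 9 + 16 - 5 = 20, so r = 4
print(signed_distance(x, w, w0))
```

Subtracting r·w/||w|| from x gives the projection xp, which satisfies g(xp) = 0.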

Multi-category Case

General case:

c-1 2-class c(c-1)/2 linear discriminant

Use c linear discriminants,,...1,)( 0 ciwxwxg i

tii
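With c linear discriminants, x is assigned to the class whose gi(x) is largest. A small sketch with illustrative weights (three classes, two features):

```python
import numpy as np

# Multi-category linear machine: assign x to the class i with the
# largest gi(x) = wi^t x + wi0. Rows of W are the wi vectors.
W = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [-1.0, -1.0]])          # illustrative weight vectors
w0 = np.array([0.0, 0.0, 0.5])       # illustrative biases

def classify(x):
    return int(np.argmax(W @ x + w0))

print(classify(np.array([2.0, 0.0])))   # g = [2, 0, -1.5] -> class 0
print(classify(np.array([0.0, 2.0])))   # g = [0, 2, -1.5] -> class 1
```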

Distance Measure

wi - wj is normal to Hij. The distance from x to Hij is given by:

rij = (gi(x) - gj(x)) / ||wi - wj||

Quadratic DF

Add terms involving products of pairs of components of x to obtain the quadratic discriminant function:

g(x) = w0 + Σ_{i=1}^{d} wi xi + Σ_{i=1}^{d} Σ_{j=1}^{d} wij xi xj

The separating surface defined by g(x) = 0 is a hyperquadric surface.
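Writing the quadratic terms as x^t W x with W = [wij], the quadratic discriminant can be evaluated as below; all values are illustrative (this choice of W and w0 makes g(x) = 0 the unit circle):

```python
import numpy as np

# Quadratic discriminant g(x) = w0 + w^t x + x^t W x, with W = [wij].
def g_quad(x, w0, w, W):
    return w0 + w @ x + x @ W @ x

w0 = -1.0
w = np.array([0.0, 0.0])
W = np.eye(2)                # W proportional to identity -> hypersphere boundary
x = np.array([1.0, 1.0])
print(g_quad(x, w0, w, W))   # -1 + 0 + 2 = 1
```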

Hyperquadric Surfaces

If W = [wij] is not singular, then the linear terms in g(x) can be eliminated by translating the axes. Define a scaled matrix:

W̄ = W / (w^t W^{-1} w - 4 w0)

Depending on W̄, the separating surface is a:
Hypersphere
Hyperellipsoid
Hyperhyperboloid

Generalized LDF

Polynomial discriminant functions. Generalized LDF:

g(x) = Σ_{i=1}^{d̂} ai yi(x)

where the yi(x) are arbitrary functions of x.

Augmented Vectors

Augmented feature vector: y = [1, x1, ..., xd]^t

Augmented weight vector: a = [w0, w1, ..., wd]^t

Then:

g(x) = w0 + Σ_{i=1}^{d} wi xi = a^t y

This maps the d-dimensional x-space to a (d+1)-dimensional y-space.
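The augmentation trick folds the bias into the weight vector, so g(x) = w^t x + w0 = a^t y. A small sketch with illustrative values:

```python
import numpy as np

# Augmentation: y = [1, x1, ..., xd]^t and a = [w0, w1, ..., wd]^t,
# so that g(x) = w^t x + w0 = a^t y.
def augment(x):
    return np.concatenate(([1.0], x))

w = np.array([1.0, -2.0])    # illustrative weights
w0 = 0.5
a = np.concatenate(([w0], w))
x = np.array([3.0, 0.0])
y = augment(x)
print(a @ y == w @ x + w0)   # the two forms agree
```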

2-Category Separable Case

Look for a weight vector that classifies all of the samples correctly. If such a weight vector exists, the samples are said to be linearly separable.

Gradient Descent Procedure

Define a criterion function J(a) that is minimized if a is a solution vector.
Step 1: Randomly pick a(1), and compute the gradient vector ∇J(a(1)).
Step 2: a(2) is obtained by moving some distance from a(1) in the direction of steepest descent. In general:

a(k+1) = a(k) - η(k) ∇J(a(k))
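The update rule can be sketched with a fixed learning rate on a toy criterion; J(a) = ||a - t||² and the target t are illustrative, not from the slides:

```python
import numpy as np

# Fixed-learning-rate gradient descent: a(k+1) = a(k) - eta * grad J(a(k)),
# on the toy criterion J(a) = ||a - t||^2, whose gradient is 2(a - t).
t = np.array([2.0, -1.0])    # illustrative minimizer of J

def grad_J(a):
    return 2.0 * (a - t)

a = np.zeros(2)              # a(1): arbitrary starting point
eta = 0.1
for _ in range(100):
    a = a - eta * grad_J(a)
print(np.round(a, 6))        # converges toward t
```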

Setting the Learning Rate

Second-order expansion of J(a) around a(k):

J(a) ≈ J(a(k)) + ∇J^t (a - a(k)) + (1/2) (a - a(k))^t H (a - a(k))

where H is the Hessian matrix, with elements hij = ∂²J/∂ai∂aj, evaluated at a(k).

Substituting a(k+1) = a(k) - η(k) ∇J(a(k)):

J(a(k+1)) ≈ J(a(k)) - η(k) ||∇J||² + (1/2) η²(k) ∇J^t H ∇J

This is minimized when:

η(k) = ||∇J||² / (∇J^t H ∇J)
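For a quadratic criterion the optimal step η(k) = ||∇J||² / (∇J^t H ∇J) can be computed exactly at every iteration. A sketch with an illustrative J(a) = ½ a^t H a - b^t a (so ∇J = Ha - b and the minimizer is H^{-1}b):

```python
import numpy as np

# Steepest descent with the optimal learning rate for a quadratic
# criterion J(a) = 1/2 a^t H a - b^t a. H and b are illustrative.
H = np.array([[3.0, 1.0], [1.0, 2.0]])   # positive-definite Hessian
b = np.array([1.0, 1.0])

a = np.zeros(2)
for _ in range(50):
    g = H @ a - b                        # gradient of J at a
    if g @ g < 1e-24:                    # stop once the gradient vanishes
        break
    eta = (g @ g) / (g @ H @ g)          # optimal eta for this quadratic
    a = a - eta * g
print(np.round(a, 6))                    # approaches H^{-1} b
```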

Newton Descent

For nonsingular H:

a(k+1) = a(k) - H^{-1} ∇J

Converges in fewer steps, but each step is more expensive to compute.
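For the same illustrative quadratic criterion used above (J(a) = ½ a^t H a - b^t a, an assumption for this sketch), Newton descent reaches the minimum in a single step:

```python
import numpy as np

# Newton descent: a(k+1) = a(k) - H^{-1} grad J. On a quadratic criterion
# J(a) = 1/2 a^t H a - b^t a, one step lands exactly on the minimizer.
H = np.array([[3.0, 1.0], [1.0, 2.0]])   # illustrative nonsingular Hessian
b = np.array([1.0, 1.0])

a = np.zeros(2)
g = H @ a - b                            # gradient at a
a = a - np.linalg.solve(H, g)            # one Newton step (solve, don't invert)
print(np.round(a, 6))                    # equals H^{-1} b
```

Using `np.linalg.solve` instead of forming H^{-1} explicitly is the standard numerically stable choice.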

Perceptron Criterion Function

Jp(a) = Σ_{y ∈ Y(a)} (-a^t y)

where Y(a) is the set of samples misclassified by a. Since

∇Jp = Σ_{y ∈ Y(a)} (-y)

the update rule is:

a(k+1) = a(k) + η(k) Σ_{y ∈ Yk} y
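The batch perceptron update can be sketched on a tiny separable set. The samples below are illustrative: each is augmented (leading 1), and class-2 samples are negated so that a solution vector satisfies a^t y > 0 for every y:

```python
import numpy as np

# Batch perceptron: a(k+1) = a(k) + eta * sum of misclassified samples.
Y = np.array([[1.0, 2.0, 1.0],     # class 1 samples (augmented)
              [1.0, 1.0, 2.0],
              [-1.0, 0.0, -1.0],   # class 2 samples (augmented, negated)
              [-1.0, -1.0, 0.0]])

a = np.zeros(3)
eta = 1.0
for _ in range(100):
    mis = Y[Y @ a <= 0]            # Y(a): currently misclassified samples
    if len(mis) == 0:              # all samples correct: done
        break
    a = a + eta * mis.sum(axis=0)
print(np.all(Y @ a > 0))           # True once the data are separated
```

On linearly separable data this loop terminates with a solution vector in a finite number of updates, which is what the convergence proof establishes.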

Convergence Proof

Refer to pages 229-232 of the textbook.
