21
FUZZ-IEEE 2003 1 Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation Yixin Chen and James Z. Wang The Pennsylvania State University http://www.cse.psu.edu/~yixchen

Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

Embed Size (px)

DESCRIPTION

Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation. Yixin Chen and James Z. Wang The Pennsylvania State University http://www.cse.psu.edu/~yixchen. Outline. Introduction VC theory and Support Vector Machines Additive fuzzy systems and kernel machines - PowerPoint PPT Presentation

Citation preview

Page 1: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 1

Kernel Machines and Additive Fuzzy Systems:Classification and Function Approximation

Yixin Chen and James Z. Wang

The Pennsylvania State Universityhttp://www.cse.psu.edu/~yixchen

Page 2: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 2

Outline

IntroductionVC theory and Support Vector MachinesAdditive fuzzy systems and kernel

machinesSupport vector learning for a class of

additive fuzzy systemsExperimental resultsConclusions and future work

Page 3: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 3

Introduction

Building a fuzzy systemStructure identificationParameter estimationModel validation

Do we get a good fuzzy model?How capable can a fuzzy model be?How well can the model generalize?

Page 4: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 4

Introduction (continued)

Several types of fuzzy models are “universal approximators”

Generalization performanceStructural risk minimizationBias variance dilemmaOverfitting phenomena

A “right” tradeoff between training accuracy and model complexity

Page 5: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 5

Introduction (continued)

Two approaches to find a “right” tradeoffCross-validation for model selectionModel reduction to simplify the model

Vapnik-Chervonenkis (VC) theoryA general measure of model set complexityBounds on generalizationSupport Vector Machines (SVM)

Page 6: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 6

Outline

IntroductionVC theory and Support Vector MachinesAdditive fuzzy systems and kernel

machinesSupport vector learning for a class of

additive fuzzy systemsExperimental resultsConclusions and future work

Page 7: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 7

VC Theory and Support Vector Machines

One result from VC TheoryBinary classification: given a set of training samples

drawn independently from some unknown distribution , with probability , the probability of misclassification for any decision function is bounded above by

Page 8: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 8

VC Theory and Support Vector Machines (continued)

Support Vector Machines (SVMs)Optimal separating hyperplane

Page 9: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 9

VC Theory and Support Vector Machines (continued)

Kernel trick

A Mercer kernel is a function,

,

satisfying

where is sometimes referred to as the Mercer features

Page 10: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 10

VC Theory and Support Vector Machines (continued)

Quadratic programming

Decision function

Page 11: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 11

Outline

IntroductionVC theory and Support Vector MachinesAdditive fuzzy systems and kernel

machinesSupport vector learning for a class of

additive fuzzy systemsExperimental resultsConclusions and future work

Page 12: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 12

Additive Fuzzy Systems and Kernel Machines

Kernel Machines

A class of additive fuzzy systems is functionally equivalent to a class of kernel machines

Page 13: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 13

Additive Fuzzy Systems and Kernel Machines (continued)

Additive Fuzzy System (AFS) m fuzzy rules of the form

Product as fuzzy conjunction operatorAddition for fuzzy rule aggregationFirst order moment defuzzificationReference functionKernel is the product of reference functions

Page 14: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 14

Additive Fuzzy Systems and Kernel Machines (continued)

Positive definite fuzzy systems (PDFS)Reference functions are positive definite

functions Mercer kernels

Examples:

Gaussian Symmetric triangle

Cauchy Hyperbolic secant

Laplace Squared sinc

Page 15: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 15

Outline

IntroductionVC theory and Support Vector MachinesAdditive fuzzy systems and kernel

machinesSupport vector learning for a class of

additive fuzzy systemsExperimental resultsConclusions and future work

Page 16: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 16

Support Vector Learning for a Class of Additive Fuzzy Systems

SVM PDFSKernel Reference functions

Support vectors IF-part of fuzzy rules

Lagrange multiplier THEN-part of fuzzy rules

Page 17: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 17

Outline

IntroductionVC theory and Support Vector MachinesAdditive fuzzy systems and kernel

machinesSupport vector learning for a class of

additive fuzzy systemsExperimental resultsConclusions and future work

Page 18: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 18

Experimental Results

USPS data setTraining data

(7291), testing data (2007)

5-fold cross-validation to determine parameters

Page 19: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 19

Experimental Results (continued)

Linear SVM: 91.3%k-nearest neighbor: 94.3%

Page 20: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 20

Outline

IntroductionVC theory and Support Vector MachinesAdditive fuzzy systems and kernel

machinesSupport vector learning for a class of

additive fuzzy systemsExperimental resultsConclusions and future work

Page 21: Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation

FUZZ-IEEE 2003 21

Conclusions and Future Work

KernelMachines

FuzzySystemsPDFS