32
Marco Gori - IEEE Expert Now Course An Introduction to Multilayered Neural Networks Introduction to Multilayer Perceptrons Marco Gori University of Siena

Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

  • Upload
    hathuan

  • View
    226

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An Introduction to Multilayered Neural Networks

Introduction toMultilayer Perceptrons

Marco GoriUniversity of Siena

Page 2: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Outline of the course

• Motivations and biological inspiration• Multilayer perceptrons: architectural

issues• Learning as function optimization• Backpropagation• The applicative perspective

Page 3: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

A difficult problem for knowledge-based solutions

An “A” perceived bya webcam

How can I provide a satisfatorystatement to associate the picturewith an “A”?

Page 4: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Emulation of the brain?

... Or simply inspiration?

I just want to point out that the componentry used in The memory may be entirely different from the one thatUnderlines the basic active organs.John von Neumann, 1958

... Inspiration at the level of neurons

... Hard to go beyond!

Page 5: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Artificial neurons

Sigmoidal units

Radial basis units

Page 6: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Supervised and reinforcement learning

Reinforcement info: reward/punishmentSupervised info: specific target

Page 7: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Directed Acyclic Graph Architecture

Feedforward architecture Multilayer architecture

Partial ordering on the nodes

Page 8: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Forward PropagationLet be any topological sorting of the nodes and let Be the parents of node

Page 9: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Boolean Functions

Page 10: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Boolean Functions by MLP

• Every Boolean function can be expressed in the firstcanonical form

• Every minterm is a linearly-separable function (oneon a hypercube’s vertex)

• OR is linearly-separable• Similar conclusions using the second canonical form.

Every Boolean function can be represented by an MLP with two layers: minterms at the first layer, OR at the second one

Page 11: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Membership functions

A set function is defined by

for all

Convex set by MLP

Page 12: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Membership for comlex domainsnon-connected domains

non-convex domains

Page 13: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Membership functions (Lippman ASSP’87)

• Every hidden unit is associated with a hyperplane• Every convex set is associated with units in the first

hidden layer• Every non-connected or non-convex set can be

represented by a proper combination (at the secondhidden layer) of units representing convex sets in thefirst hidden layer

Basic statement: Two hidden layer to approximate anyset function

Page 14: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Universal ApproximationGiven

and

find

One hidden layer (with “enough” hidden units)!

Page 15: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Supervised LearningConsider the triple

where

Error due to the mismatch between

Page 16: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Gradient Descent

• The optimization may involve a huge number of paramters –even one million (Bourlard 1997)

• The gradient heuristics is the only one which is meaningful insuch huge spaces

• The trajectory ends up in local minima of the error function.• How is the gradient calculated? (very important issue!)

Page 17: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

BackpropagationBryson & Ho (1969), Werbos (1974), le Cun (1995), Rumerlhart-Hinton-Williams (1986)

Error accumulation:

DAG hypothesis:

Page 18: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Backpropagation (con’t)

if thenelse

Page 19: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Backpropagation (con’t)any topologic sorting induced bytopologic sorting induced by the inverse

Page 20: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Batch-learningWe calculate the “true-gradient”

The weight updating takes place according within The classic framework of numerical analysis

Page 21: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

On-line learning

Weight updating after presenting each example …

momentum term to filter out abrupt changes …

Page 22: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Backprop Heuristics

• Weight initialization– Avoid small weights (no Backprop of the delta errors– Avoid large weights (neuron saturation)

• Learning rate– The learning rate can change during the learning (higher

when the gradient is small)– There is no “magic solution”!

Page 23: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Backprop Heuristics (con’t)

• Activation function: symmetric vsasymmetric

• Target values to avoid saturation• Input normalization• The input variables (coordinates) should be

incorrelated• Normalization w.r.t. to the fan-in of the first

hidden layer

Page 24: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

simple example ofsub-optimal learning

local minima as symmetricconfigurations ...

The problem of local minima

Page 25: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Basic results

• No local minima in the case of linearly separablepatterns (Gori & Tesi, IEEE-PAMI92)

• No local m inima if the number of hidden units isequal to the number of examples (Yu et al, IEEE-TNN95)

• A general comment:– The theoretical investigations on this problem have not

relevant results for the design of the networks.

Page 26: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Complexity issues: Backprop is optimal

Classic numerical algorithms requiresfor the computation of a single partial derivative and

for the whole gradient computation

Backprop requires for the whole gradient computation

e.g. in the case of 10,000 parameters, we need 10,000 FPO versus 100 millions FPO!!! This is one of the main reasons of the success of Backpropagation!

Page 27: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

penalties to limit large weights

Courtesy MathWorks

crossvalidation

Page 28: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An Introduction to Multilayered Neural Networks

The applicative perspective:pattern recognition

Page 29: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

MLP as classifiers:a simple pre-processing

input Preprocessedinput

output

Page 30: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

The cascade

Page 31: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Autoassociator-based classifiersMotivation: better behavior for pattern verificationGori & Scarselli, IEEE-PAMI-98

Page 32: Introduction to Multilayer Perceptrons - LIA Neurali Teoria e... · Introduction to Multilayer Perceptrons Marco Gori University of Siena. Marco Gori - IEEE Expert Now Course

Marco Gori - IEEE Expert Now Course

An introduction to Multilayered Neural Networks

Some successful applications

• Airline market assistance, BehavHeuristics Inc• Automated Real Estate Appraisal Systems - HNCSoftware• OCR - Caere, Audre Recognition System• Path planning, NeuroRoute - Protel• Electronic nose - AromaScan Inc• Quality control - Anheuser-Busch, Dunlop• Banknote acceptor BANK, DF Elettronica Florence