
Artificial Intelligence

Krzysztof Ślot
Institute of Applied Computer Science,
Technical University of Lodz, Poland

Introduction to computing with neural networks – feedforward nets

Introduction

Motivation

• To solve “hard” problems – computationally intense, with unclear rules, etc.
• To mimic “higher” brain functions – recognition, classification, etc.

[Figure: a sample recognition task]

Researching neural networks

• Neurons are living cells that consume energy to operate
• Another illusion, based on the same mechanism, reveals the three-channel visual input

Neural Networks: background

Biological reference – architecture of a neuron

[Figure: neuron anatomy – dendrites, cell body with nucleus, axon with myelin sheath and nodes of Ranvier, synapses]

Neuron’s operation

[Figure: action potential – an activity spike of about 1 ms rising from the −70 mV resting potential towards 0 V, followed by a refraction period]

• Firing frequency is proportional to total excitation
• A neuron acts as a simple multi-input, single-output unit

Modeling a neuron

• Physical modeling
  – Pulse-propagation phenomenon: the Hodgkin–Huxley model (Nobel prize)
• Functional modeling
  – Of interest to AI: provides means for simulating/emulating neural nets

Perceptron

McCulloch–Pitts model

[Figure: inputs x_1 … x_n with weights w_1 … w_n, plus a constant input 1 with weight w_0 = −T, summed into s and passed through an activation function f]

$$ s = \sum_{i=0}^{n} w_i x_i, \qquad y = f(s) = f(\mathbf{w}^T \mathbf{x}') $$

where $\mathbf{x}'$ is the input vector augmented with the constant 1, so that

$$ y = f\left(\sum_{i=0}^{n} w_i x_i\right) = f\left(\sum_{i=1}^{n} w_i x_i - T\right) $$

Activation functions

• Linear: $f(s) = s$
• Non-linear, differentiable
  – Hyperbolic tangent: $f(s) = \tanh(\beta s)$, with $f(s) \in (-1 \ldots 1)$
  – Sigmoid: $f(s) = \dfrac{1}{1 + e^{-\beta s}}$, with $f(s) \in (0 \ldots 1)$
• Non-linear, non-differentiable
  – Step: $f(s) = \mathbf{1}(s)$, with $f(s) \in (0 \ldots 1)$
  – Sign: $f(s) = \mathrm{sgn}(s)$, with $f(s) \in (-1 \ldots 1)$
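As a quick sketch, the unit and the activation functions above can be written in a few lines of Python (NumPy); the weights and inputs below are made-up illustration values, not taken from the slides:

```python
import numpy as np

# Activation functions from the slide
def linear(s):              return s
def sigmoid(s, beta=1.0):   return 1.0 / (1.0 + np.exp(-beta * s))   # (0...1)
def tanh_act(s, beta=1.0):  return np.tanh(beta * s)                 # (-1...1)
def step(s):                return 1.0 if s >= 0 else 0.0            # {0, 1}
def sign(s):                return 1.0 if s >= 0 else -1.0           # {-1, 1}

def neuron(x, w, f):
    """McCulloch-Pitts unit: y = f(w^T x'), x' = x augmented with a constant 1."""
    s = w[0] + np.dot(w[1:], x)    # w[0] = -T plays the role of the threshold
    return f(s)

w = np.array([-1.0, 0.8, 0.4])     # assumed weights: w0 = -T, w1, w2
x = np.array([1.0, 2.0])
for f in (linear, sigmoid, tanh_act, step, sign):
    print(f.__name__, neuron(x, w, f))
```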

Neuron’s function

• How to interpret a neuron’s outcome?
  – Outcome: assessment of the activation level (a dot product)
  – Assume only two inputs; the activation is a linear equation: $s = w_1 x_1 + w_2 x_2 - T$
  – Sample parameters: $w_1 = 1$, $w_2 = 1$, $T = 2.5$, so $f(x_1, x_2) = x_1 + x_2 - 2.5 = 0$ defines a line
  – With a step activation, $y = f(s) = 1$ for $s \geq 0$ and $y = 0$ otherwise:
    • point $A = (1.5,\, 3)$: $s > 0$, so $y = 1$
    • point $B = (0.3,\, 0.2)$: $s < 0$, so $y = 0$
• Interpretation
  – Outcome: a decision (e.g. hunt or not)
  – Data classification

[Figure: “Frog’s world” – objects plotted by size and distance, separated by the decision line]
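To make the classification example concrete, here is the same computation in Python, using the slide’s parameters (w1 = w2 = 1, T = 2.5):

```python
# Frog's-world neuron: hunt (1) or not (0) from two features
w1, w2, T = 1.0, 1.0, 2.5            # decision line: x1 + x2 - 2.5 = 0

def classify(x1, x2):
    s = w1 * x1 + w2 * x2 - T        # activation level (dot product minus threshold)
    return 1 if s >= 0 else 0        # step activation turns it into a decision

print(classify(1.5, 3.0))   # point A: s =  2.0 -> y = 1
print(classify(0.3, 0.2))   # point B: s = -2.0 -> y = 0
```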

Training neural networks

Supervised learning algorithms

Objective: determine weights which provide the desired network operation

Given:
• Training vector set: $\{\mathbf{x}^i\}$
• Desired network responses: $\{d^i\}$ – the expected output for the i-th training vector
• Actual neuron outputs: $\{y^i\}$, where $y = f\left(\sum_{k=0}^{n} x_k w_k\right)$

Error for the i-th sample:
$$ e^i = c\,\left(d^i - y^i\right)^2 $$
For fixed data this is a function of the weights: $e^i = e^i(\mathbf{w})$

Supervised learning

Basic idea – adjust the weights so as to minimize the error

Gradient-descent methods (for a differentiable error function):
$$ \Delta w_k = -\eta\, \frac{\partial e^i}{\partial w_k}, \qquad e^i = c\,(d^i - y^i)^2, \qquad y = f(s) = f\left(\mathbf{w}^T \mathbf{x}\right) $$

Applying the chain rule (constants absorbed into $\eta$):
$$ \Delta w_k = -\eta\, \frac{\partial e^i}{\partial y^i}\, \frac{\partial y^i}{\partial s^i}\, \frac{\partial s^i}{\partial w_k} = \eta\, \left(d^i - y^i\right) f'\!\left(s^i\right) x_k^i $$

Delta rule

Linear activation function: $y = \sum_l w_l x_l$
$$ \Delta w_k = -\eta\, \frac{\partial e^i}{\partial w_k} = -\eta\, \frac{\partial (d^i - y^i)^2}{\partial w_k} = \eta\, \left(d^i - y^i\right) \frac{\partial}{\partial w_k} \sum_{l} w_l x_l^i = \eta\, \left(d^i - y^i\right) x_k^i $$

Non-linear activation functions

Sigmoid: $f(s) = \dfrac{1}{1 + e^{-2\beta s}}$, whose derivative can be expressed through the function value itself:
$$ f'(s) = 2\beta\, f\,(1 - f) $$
$$ \Delta w_k = \eta\, f'\!\left(s^i\right) \left(d^i - y^i\right) x_k^i = 2\beta\eta\, \left(d^i - y^i\right) f\,(1 - f)\, x_k^i $$
No differentiation required!

Step and sign functions: the same update is applied directly,
$$ \Delta w_k = \eta\, \left(d^i - y^i\right) x_k^i $$
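A minimal sketch of delta-rule training for a single step-activation neuron (NumPy); the toy dataset, learning rate, and epoch count are assumptions chosen for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy linearly separable data: class 1 iff x1 + x2 > 2.5 (assumed for illustration)
X = rng.uniform(0, 4, size=(100, 2))
d = (X[:, 0] + X[:, 1] > 2.5).astype(float)

X_aug = np.hstack([np.ones((len(X), 1)), X])   # constant input for the bias weight
w = rng.normal(scale=0.1, size=3)
eta = 0.1

for epoch in range(50):
    for x_i, d_i in zip(X_aug, d):
        y_i = 1.0 if np.dot(w, x_i) >= 0 else 0.0   # step activation
        w += eta * (d_i - y_i) * x_i                # delta rule: eta (d - y) x

y = (X_aug @ w >= 0).astype(float)
print("learned weights:", w, " training accuracy:", (y == d).mean())
```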

Delta-rule learning – geometrical interpretation

$$ \Delta \mathbf{w} = \eta\, \left(d^i - y^i\right) \mathbf{x}^i $$

[Figure: the weight vector is rotated from w(i−1) to w(i); the direction of change is the vector x^i. Note: in the figure, d^i − y^i is negative]

Data classification

Application domain: linearly-separable tasks

Minsky and Papert (1969): their analysis of perceptron limitations triggered a recess in ANN research

[Figure: a pattern that is not linearly separable – what neuron function could classify it?]

Multi-layer ANN

[Figure: the XOR problem over binary inputs x_0, x_1 (truth table: output 0 for (0,0) and (1,1), output 1 for (0,1) and (1,0)). Two first-layer neurons with outputs y_0, y_1 each cut the plane with one line; combining the two half-planes (+ =) yields the XOR decision regions, read out by a second-layer neuron y]
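The construction can be checked numerically; below is one workable hand-picked set of weights (an assumption, the slides do not give values): the first layer computes OR and AND, and the output neuron fires for OR-and-not-AND, which is XOR:

```python
import numpy as np

def step(s):
    return (s >= 0).astype(float)

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)

y0 = step(X @ np.array([1.0, 1.0]) - 0.5)   # OR:  fires if any input is 1
y1 = step(X @ np.array([1.0, 1.0]) - 1.5)   # AND: fires only if both are 1
y  = step(1.0 * y0 - 2.0 * y1 - 0.5)        # OR and not AND -> XOR

print(y)   # [0. 1. 1. 0.]
```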

Multi-layer ANN

Neuron function – a linearly separable function of binary inputs, e.g. logical OR or AND

[Figure: hidden neurons y_0 … y_i … y_n over inputs x_0, x_1 feed a single output Y; each hidden neuron contributes one line (half-plane) and their combination bounds the decision region]

Decision regions of such two-layer networks are convex

Multi-layer ANN

ML ANN decision regions in classification tasks

[Figure source: R. Lippmann, “An Introduction to Computing with Neural Nets”]

Multi-layer ANN

Data processing in multi-layer ANNs

Input vector X with components x_0 … x_M

Input-layer neurons:
$$ y_i^1 = f\left(s_i^1\right) = f\left(\sum_{k=0}^{M} w_{ik}^1\, x_k\right), \qquad i = 1, \ldots, N_1 $$

Output-layer (layer N) neurons:
$$ Y_i = y_i^N = f\left(s_i^N\right) = f\left(\sum_{j=0}^{N_{N-1}} w_{ij}^N\, y_j^{N-1}\right) = f\left(\sum_j w_{ij}^N\, f\left(\sum_k w_{jk}^{N-1}\, f\big(\cdots\big)\right)\right) $$

The network output Y is thus a nested composition of weighted sums and activations.
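The nested composition corresponds to a simple loop over layers; a sketch (NumPy, biases folded in as constant inputs; the layer sizes are arbitrary illustration values):

```python
import numpy as np

def sigmoid(s):
    return 1.0 / (1.0 + np.exp(-s))

def forward(x, weights):
    """Apply y^l = f(W^l [1; y^(l-1)]) layer by layer."""
    y = x
    for W in weights:                       # W has shape (n_out, n_in + 1)
        y = sigmoid(W @ np.concatenate(([1.0], y)))
    return y

rng = np.random.default_rng(0)
sizes = [3, 5, 4, 2]                        # input, two hidden layers, output
weights = [rng.normal(scale=0.5, size=(m, n + 1))
           for n, m in zip(sizes[:-1], sizes[1:])]
print(forward(rng.normal(size=3), weights)) # two output activations
```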

Learning in multi-layer ANNs – notation

[Figure: a two-layer network. Inputs x_1 … x_k (plus a constant −1 bias input) feed hidden-layer neurons through weights w_{jk}, producing hidden outputs y_1 … y_j … y_N; these (plus a −1 bias) feed output-layer neurons (activation f or linear) through weights W_{ij}, producing network outputs Y_1 … Y_i … Y_M. Desired outputs are denoted D]

MLP training

• Supervised setup
• Criterion: mean-squared error (MSE)
$$ E = \sum_{\mu=1}^{m} \left(t_\mu - Y_\mu\right)^2 $$

Weight update for output neurons ($W_{j,i}$): the delta rule
$$ \Delta W_{j,i} = \eta\, \left(t_j - Y_j\right) f'(S_j)\, y_i $$

For hidden units ($w_{i,k}$) the error cannot be directly estimated.
Solution: basic calculus. Since
$$ Y_\mu = f\left(\sum_{j} W_{\mu,j}\, y_j\right), \qquad y_j = f\left(\sum_{k} w_{j,k}\, x_k\right), $$
the error is a compound function of the hidden weights, $E = g(w_{i,k})$, and
$$ \Delta w_{i,k} = -\eta\, \frac{\partial E}{\partial w_{i,k}} $$

MLP training

Derivative of a compound function: the chain rule
$$ \frac{\partial E}{\partial w_{i,k}} = \sum_{\mu=1}^{m} \frac{\partial E}{\partial Y_\mu}\, \frac{\partial Y_\mu}{\partial y_i}\, \frac{\partial y_i}{\partial w_{i,k}} $$

with the individual factors:
$$ E = \sum_{\mu=1}^{m} (t_\mu - Y_\mu)^2 \;\Rightarrow\; \frac{\partial E}{\partial Y_\mu} = -2\,(t_\mu - Y_\mu) $$
$$ Y_\mu = f\Big(\sum_j W_{\mu,j}\, y_j\Big) \;\Rightarrow\; \frac{\partial Y_\mu}{\partial y_i} = f'(S_\mu)\, W_{\mu,i} $$
$$ y_i = f\Big(\sum_l w_{i,l}\, x_l\Big) \;\Rightarrow\; \frac{\partial y_i}{\partial w_{i,k}} = f'(s_i)\, x_k $$

Combining (constants absorbed into the learning rate $\eta$):
$$ \Delta w_{i,k} = \eta\, \Big[\sum_{\mu=1}^{m} (t_\mu - Y_\mu)\, f'(S_\mu)\, W_{\mu,i}\Big]\, f'(s_i)\, x_k $$

Weight update interpretation
• Analogous to the delta rule
• The error is back-projected from the upper layer:
$$ \delta_i = \sum_{\mu=1}^{m} (t_\mu - Y_\mu)\, f'(S_\mu)\, W_{\mu,i} $$

[Figure: output errors E_1 … E_μ … E_m propagated backwards through f'(·) and the weights W_{μ,i} to hidden unit i]

Error Back-Propagation algorithm (BP)
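Putting the derivation together, a compact NumPy sketch of BP for one hidden layer, trained on XOR; the learning rate, initialization, and epoch count are assumptions (a different seed may need more epochs):

```python
import numpy as np

rng = np.random.default_rng(1)
f  = lambda s: 1.0 / (1.0 + np.exp(-s))     # sigmoid
df = lambda v: v * (1.0 - v)                # f'(s) expressed via v = f(s)

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
t = np.array([[0], [1], [1], [0]], dtype=float)     # XOR targets

w = rng.normal(size=(2, 3))     # hidden weights w_{j,k} (first column: bias)
W = rng.normal(size=(1, 3))     # output weights W_{i,j} (first column: bias)
eta = 0.5

for epoch in range(20000):
    for x_i, t_i in zip(X, t):
        x_a = np.concatenate(([1.0], x_i))
        y   = f(w @ x_a)                          # hidden outputs y_j
        y_a = np.concatenate(([1.0], y))
        Y   = f(W @ y_a)                          # network outputs Y_mu

        delta_out = (t_i - Y) * df(Y)                  # (t - Y) f'(S)
        delta_hid = (W[:, 1:].T @ delta_out) * df(y)   # error back-projected

        W += eta * np.outer(delta_out, y_a)       # delta rule for output layer
        w += eta * np.outer(delta_hid, x_a)       # BP update for hidden layer

for x_i in X:
    y_a = np.concatenate(([1.0], f(w @ np.concatenate(([1.0], x_i)))))
    print(x_i, f(W @ y_a).item())                 # approaches 0, 1, 1, 0
```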

Networks with radial-basis units

Radial functions – distance-dependent output:
$$ y = f(r) = f\left(\|\mathbf{x} - \mathbf{p}\|\right) $$

Typical example: Gaussian
$$ y = C\, e^{-(\mathbf{x}-\mathbf{m})^{T} \Sigma^{-1} (\mathbf{x}-\mathbf{m})} $$

Net’s architecture: the input vector x feeds a layer of RBF units whose outputs y_i are combined by a linear output unit with weights w_1 … w_N:
$$ Y = \sum_i W_i\, y_i $$
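A small sketch of the Gaussian radial unit and the linear combination at the output (NumPy; the centers, covariance, and weights are made-up values):

```python
import numpy as np

def gaussian_rbf(x, m, Sigma, C=1.0):
    """y = C * exp(-(x - m)^T Sigma^{-1} (x - m))"""
    d = x - m
    return C * np.exp(-d @ np.linalg.solve(Sigma, d))

# Illustration: three RBF units in 2D feeding one linear output Y = sum_i W_i y_i
centers = [np.array([0.0, 0.0]), np.array([1.0, 1.0]), np.array([2.0, 0.0])]
Sigma = 0.5 * np.eye(2)
W = np.array([1.0, -0.5, 2.0])

x = np.array([0.8, 0.9])
y = np.array([gaussian_rbf(x, m, Sigma) for m in centers])
print("RBF outputs:", y, " network output:", W @ y)
```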

Feed-forward ANN applications

Regression (function approximation)

[Figure: a target function y(x) approximated by three hidden units (Neuron 1, Neuron 2, Neuron 3) whose outputs are summed by a linear output neuron; shown once with RBF hidden units (localized bumps) and once with sigmoid hidden units (overlapping steps)]
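To see the approximation idea numerically: with the hidden RBF units fixed, the linear output neuron’s weights can be found by least squares (the target function, centers, and widths below are assumptions):

```python
import numpy as np

x = np.linspace(0, 2 * np.pi, 40)
y_target = np.sin(x)                         # function to approximate (assumed)

mu = np.linspace(0, 2 * np.pi, 7)            # fixed 1D Gaussian centers
sigma = 0.8
Phi = np.exp(-(x[:, None] - mu[None, :])**2 / (2 * sigma**2))

w, *_ = np.linalg.lstsq(Phi, y_target, rcond=None)   # linear output neuron
print("max abs error:", np.abs(Phi @ w - y_target).max())
```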

Network training

• Parameters to be determined
  – Number of hidden neurons – the number of approximating functions
  – RBF function parameters (means, covariance matrices) if RBF neurons are used in the hidden layer
  – Sigmoid parameters if sigmoid units are used
  – Weights of the output neuron
• Learning strategies
  – Supervised
  – Mixed: unsupervised learning of hidden-layer units’ parameters, supervised learning of output weights

RBF supervised training

• Training criterion – minimize the approximation error
$$ E = \sum_i \sum_{k=1}^{N} \left(d_k^i - Y_k^i\right)^2, \qquad Y_k = \sum_{j=1}^{M} w_{kj}\, \varphi\left(\|\mathbf{x} - \boldsymbol{\mu}_j\|\right) $$
where N is the number of output units, M the number of hidden units, and i the training-sample index.

• Sample network
  – 1D RBF units (e.g. Gaussian)
  – One output unit
$$ Y = \sum_{s=1}^{M} w_s\, e^{-\frac{(x - \mu_s)^2}{2\sigma_s^2}} $$

Gradient-descent approach

Output weights:
$$ \Delta w_s = -\eta c\, \frac{\partial E}{\partial Y^i} \frac{\partial Y^i}{\partial w_s} = \eta \sum_i \left(d^i - Y^i\right) e^{-\frac{(x^i - \mu_s)^2}{2\sigma_s^2}} $$

RBF parameters (means):
$$ \Delta \mu_s = -\eta c\, \frac{\partial E}{\partial Y^i} \frac{\partial Y^i}{\partial \mu_s} = \eta \sum_i \left(d^i - Y^i\right) w_s\, e^{-\frac{(x^i - \mu_s)^2}{2\sigma_s^2}}\, \frac{x^i - \mu_s}{\sigma_s^2} $$
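The two update formulas translate directly into code; a sketch for the 1D Gaussian network with fixed widths (the target function, learning rate, and unit count are assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-3, 3, 60)
d = np.exp(-x**2) * np.cos(2 * x)            # target function (assumed)

M = 6
mu = np.linspace(-3, 3, M)                   # centers mu_s
sigma = np.full(M, 0.8)                      # widths sigma_s (kept fixed here)
w = rng.normal(scale=0.1, size=M)            # output weights w_s
eta = 0.05

for epoch in range(2000):
    phi = np.exp(-(x[:, None] - mu[None, :])**2 / (2 * sigma**2))  # unit outputs
    err = d - phi @ w
    w  += eta * phi.T @ err / len(x)         # dw_s  ~ sum_i (d - Y) phi
    mu += eta * w * ((phi * (x[:, None] - mu[None, :]) / sigma**2).T @ err) / len(x)  # dmu_s

print("final MSE:", np.mean((phi @ w - d)**2))
```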

Feedforward NNs: problems

• Overfitting
  – For an overly complex net and an insufficient amount of data, a model learns the training samples, not a rule. A model should generalize well

[Figure: 20 training samples fitted with 5 hidden RBF units (smooth fit) vs. 50 hidden RBF units (overfitted)]

Limitations of ML-FF ANNs

• Local minima
  – Only if the error function is convex can one expect a correct training outcome (gradient descent gets us to the minimum). Unfortunately, error functions for multiple-layer feedforward ANNs are rarely convex …
  – Possible ways to alleviate the problem:
    • Boltzmann machines
    • Multiple initial points
    • Regularization
• Overfitting
  – If the learning set is not significantly larger than the parameter set, the network learns the examples, not the rule (there are many well-fitting units)
• Capabilities of multiple-layer networks trained using the BP algorithm and its descendants for solving real-life problems are limited

Summary of multilayer feed-forward ANNs

• Drawbacks
  – Learning is a challenge: local minima of the error function result in non-optimal solutions, as gradient-descent methods cannot find global minima of non-convex functions. Possible solution – stochastic methods (simulated annealing – global minima search)
  – Convergence speed of the BP algorithm. Possible solution: consider second-order derivatives in the error approximation (Levenberg–Marquardt)
  – Fundamental difficulties with VLSI implementations of nets
  – ANNs are hard to analyze (feedback nets)
• Advantages
  – Theoretically, capable of solving hard problems
  – Extremely fast execution (if implemented in hardware, but also if simulated)
  – Can constantly learn and improve, even after deployment
• Practical applications
  – Rare …
  – Until recently …

Deep Neural Networks and Deep Learning

• Deep neural networks: a breakthrough in the performance of intelligent data processing
  – Recognition of contents of R^n data: images (object recognition, scene analysis, image classification)
  – Recognition of contents of R^n data sequences: video (action recognition), speech (recognition, transcription, translation), NLP (document classification, analysis)
  – Generation of R^n data: image objects, textures
  – Generation of R^n data sequences: control, description, speech

Recognition

• Classification of image objects – DNNs perform better than humans

                                        Humans    CNN
  Categories: 40, examples: 30 000      96%       99.6%
  Categories: 100, examples: 400 000    82%       86.1%

Recognition and generation

Application: autonomous vehicles – scene understanding, vehicle control

Nvidia: https://www.youtube.com/watch?v=qhUvQiKec2U

Generation

Robot motion control

Boston Dynamics: https://www.youtube.com/watch?v=-e9QzIkP5qI

Learning abstract concepts

[Figure: DCGAN creations – an input image repainted in the style of Van Gogh and in the style of Munch]
Source: http://www.boredpanda.com/computer-deep-learning-algorithm-painting-masters/

Convolutional Neural Networks

• Automated image annotation

Deep Learning and Convolutional Neural Networks

• Deep
  – Multiple layers (dozens, hundreds, thousands)
  – Huge numbers of parameters
  – Appropriate measures for training
• Convolutional neural networks (a sketch of the pipeline follows below)

[Figure: a typical CNN pipeline – Data → Conv 1 (filters S_i, S_j) → ReLU → Pooling 1 (MAX) → Conv 2 → ReLU → Pooling 2 (MAX) → … → Conv n → ReLU → Fully-connected ANN → Output]
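As a concrete illustration of this Conv → ReLU → MaxPool → … → fully-connected pipeline, a minimal PyTorch sketch; the layer sizes, filter counts, and the 28×28 single-channel input are assumptions, not taken from the slides:

```python
import torch
import torch.nn as nn

# A small CNN following the Conv -> ReLU -> MaxPool pattern sketched above
model = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1),    # Conv 1: 8 learned filters
    nn.ReLU(),
    nn.MaxPool2d(2),                              # Pooling 1 - MAX
    nn.Conv2d(8, 16, kernel_size=3, padding=1),   # Conv 2
    nn.ReLU(),
    nn.MaxPool2d(2),                              # Pooling 2 - MAX
    nn.Flatten(),
    nn.Linear(16 * 7 * 7, 10),                    # fully-connected ANN -> output
)

x = torch.randn(1, 1, 28, 28)     # one dummy 28x28 image (assumed input size)
print(model(x).shape)             # torch.Size([1, 10])
```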

Recommended