
Modeling of the Nervous System (Idegrendszeri modellezés)

Orbán Gergő

http://golab.wigner.mta.hu


Neurons and neural networks I.


High-level vs. low-level modeling

Previously in this course:

• Mechanistic description of neurons
• Descriptive models of neuronal operations

Goals of the next classes:

Are these models compatible with the neural architecture?
How can neural computations be interpreted in this framework?


Outline

• single neuron
• neural network
• supervised learning
• unsupervised learning


Neural networks, basic concepts

The goal of a neural network can be phrased as:
• learning about its input
• forming memories

Address-based memory
• not associative
• not fault tolerant
• not distributed

Biological memory
• associative
• error tolerant
  • cue error
  • hardware fault
• parallel and distributed


Terminology

Architecture: what variables are involved and how they interact (weights, activities)

Activity rule: short time-scale behavior; how the activity of neurons depends on their inputs

Learning rule: long time-scale behavior; how weights change as a function of the activities of other neurons, time, or other factors


Terminology

Supervised learning: labelled data is provided, i.e. an input-output relationship is given; the ‘teacher’ tells whether the response of the neuron to a given input was correct.

Unsupervised learning: unlabelled data is available; the network learns the statistics of the input, either by generating a table of occurrences or by learning some more sophisticated representation.


Single neuron as a classifier: Definitions

Architecture: I inputs x_i, weights w_i

Activity rule:
1. the activation is calculated
2. the output is calculated from the activation, using one of:
• linear
• logistic
• tanh
• threshold
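The equations themselves are not reproduced in this text version of the slides; a standard sketch of the activity rule for this architecture (the symbols a for the activation and f for the output nonlinearity are assumptions here, not taken from the slides) is:

    a = \sum_{i=1}^{I} w_i x_i, \qquad y = f(a)

    f(a) = a                                    (linear)
    f(a) = \frac{1}{1 + e^{-a}}                 (logistic)
    f(a) = \tanh(a)                             (tanh)
    f(a) = \Theta(a) = \begin{cases} 1 & a > 0 \\ 0 & a \le 0 \end{cases}    (threshold)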

Single neuron as a classifier: Concepts

[Figures: the input space and the weight space of the neuron]

Single neuron as a classifier: Concepts

The supervised learning paradigm: given example inputs x and target outputs t, learn the mapping between them.

The trained network is supposed to give the ‘correct’ response for any given input stimulus.

Training is equivalent to learning the appropriate weights.

In order to achieve this, an objective function (or error function) is defined, which is minimized during training.

Binary classification in a neuron: Error function

The error function can be read in two ways:
1. the information content of the outcomes
2. the relative entropy of the distributions (t, 1-t) and (y, 1-y)

Learning: the back-propagation algorithm
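The error function itself is not reproduced here; the standard cross-entropy form consistent with the relative-entropy reading above (a reconstruction, not copied from the slides) is

    G(\mathbf{w}) = -\sum_n \left[ t^{(n)} \ln y^{(n)} + \left(1 - t^{(n)}\right) \ln\left(1 - y^{(n)}\right) \right], \qquad y^{(n)} = y(\mathbf{x}^{(n)}; \mathbf{w})

and its gradient, \partial G / \partial w_i = \sum_n (y^{(n)} - t^{(n)})\, x_i^{(n)}, is the error signal that back-propagation sends to the weights.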

Binary classification: The algorithm

1. Get the data.
2. Calculate the activations.
3. Calculate the output of the neuron.
4. Calculate the error.
5. Update the weight(s).

Alternatively, batch learning can be done:
• accumulating the gradient over the whole data set before each update: gradient descent
• updating after every single example: stochastic gradient descent
(a runnable sketch of this loop follows below)
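As an illustration, here is a minimal Python sketch of this loop for a logistic neuron, written in the per-example (stochastic gradient descent) style; the function names, learning rate, and synthetic data are assumptions, not taken from the slides:

    import numpy as np

    def sigmoid(a):
        return 1.0 / (1.0 + np.exp(-a))

    def train_single_neuron(X, t, eta=0.1, alpha=0.0, n_epochs=100):
        """Stochastic gradient descent for a single logistic neuron.
        X: (N, I) inputs, t: (N,) binary targets, eta: learning rate,
        alpha: weight decay (the regularizer discussed on later slides)."""
        N, I = X.shape
        w = np.zeros(I)
        for epoch in range(n_epochs):
            for n in np.random.permutation(N):      # 1. take one data point
                a = w @ X[n]                        # 2. activation
                y = sigmoid(a)                      # 3. output of the neuron
                e = y - t[n]                        # 4. error signal
                w -= eta * (e * X[n] + alpha * w)   # 5. weight update
        return w

    # usage on synthetic data: two Gaussian clusters with labels 0 and 1
    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(-1.0, 1.0, (50, 2)), rng.normal(1.0, 1.0, (50, 2))])
    X = np.hstack([X, np.ones((100, 1))])           # constant input acts as a bias
    t = np.concatenate([np.zeros(50), np.ones(50)])
    print("learned weights:", train_single_neuron(X, t))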

Binary classification: Demonstration

[Figures: evolution of the weights and evolution of the error function during training]
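Plots of this kind can be reproduced with a short script; the following batch gradient descent sketch (synthetic data and parameter values are assumed, not from the slides) records both quantities:

    import numpy as np
    import matplotlib.pyplot as plt

    rng = np.random.default_rng(1)
    X = np.vstack([rng.normal(-1.0, 1.0, (50, 2)), rng.normal(1.0, 1.0, (50, 2))])
    X = np.hstack([X, np.ones((100, 1))])          # bias input
    t = np.concatenate([np.zeros(50), np.ones(50)])

    w, eta = np.zeros(3), 0.05
    w_hist, G_hist = [], []
    for epoch in range(200):
        y = 1.0 / (1.0 + np.exp(-X @ w))           # outputs for all data points
        G = -np.sum(t * np.log(y + 1e-12) + (1 - t) * np.log(1 - y + 1e-12))
        w_hist.append(w.copy()); G_hist.append(G)
        w -= eta * X.T @ (y - t)                   # batch gradient descent step

    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 3))
    ax1.plot(np.array(w_hist)); ax1.set_title("Evolution of weights")
    ax2.plot(G_hist); ax2.set_title("Evolution of the error function")
    plt.show()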

Binary classification: Are we happy?

The w's seem to grow unbounded.

Is this bad for us?

Generic painkiller: regularization (weight decay)
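The regularizer is not reproduced here either; the usual weight-decay form, consistent with the 'learning as inference' slides that follow (a reconstruction, with alpha denoting the assumed decay strength), is

    E_W(\mathbf{w}) = \frac{1}{2} \sum_i w_i^2, \qquad M(\mathbf{w}) = G(\mathbf{w}) + \alpha E_W(\mathbf{w})

so each gradient step gains an extra term -\eta \alpha w_i that pulls the weights back toward zero.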

Binary classification: Effect of regularization

[Figures illustrating the effect of regularization]

Capacity of the neuron

How much information can the neuron transmit?

Interpreting learning as inference

So far: optimization with respect to an objective function M(w), which combines the data error with the regularizer.

What's this quirky regularizer, anyway?

Interpreting learning as inference

Let's interpret y(x, w) as a probability:
• y(x, w) is read as the probability of the target given the input, which can be written in a compact form;
• the likelihood of the input data can then be expressed with the original error function;
• the regularizer has the form of a prior;
• what we get in the objective function M(w) is the posterior distribution of w.
(the corresponding equations are spelled out below)
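The corresponding equations, in standard notation (a reconstruction consistent with the error function G and regularizer E_W sketched above, not copied from the slides):

    P(t = 1 \mid \mathbf{x}, \mathbf{w}) = y(\mathbf{x}; \mathbf{w}), \qquad P(t = 0 \mid \mathbf{x}, \mathbf{w}) = 1 - y(\mathbf{x}; \mathbf{w})

    \text{compact form:} \quad P(t \mid \mathbf{x}, \mathbf{w}) = y^{t} (1 - y)^{1 - t}

    \text{likelihood:} \quad P(D \mid \mathbf{w}) = \prod_n P(t^{(n)} \mid \mathbf{x}^{(n)}, \mathbf{w}) = e^{-G(\mathbf{w})}

    \text{prior:} \quad P(\mathbf{w} \mid \alpha) = \frac{1}{Z_W(\alpha)}\, e^{-\alpha E_W(\mathbf{w})}

    \text{posterior:} \quad P(\mathbf{w} \mid D, \alpha) = \frac{P(D \mid \mathbf{w})\, P(\mathbf{w} \mid \alpha)}{P(D \mid \alpha)} = \frac{1}{Z_M}\, e^{-M(\mathbf{w})}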

Interpreting learning as inference: Relationship between M(w) and the posterior

Interpretation: minimizing M(w) leads to finding the maximum a posteriori estimate w_MP.

The log-probability interpretation of the objective function retains the additivity of errors while keeping the multiplicativity of probabilities.
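In the notation sketched above, this relationship reads (again a reconstruction rather than the slide's own formula)

    M(\mathbf{w}) = -\ln P(\mathbf{w} \mid D, \alpha) + \text{const}, \qquad \mathbf{w}_{\mathrm{MP}} = \arg\max_{\mathbf{w}} P(\mathbf{w} \mid D, \alpha) = \arg\min_{\mathbf{w}} M(\mathbf{w})

so sums of error terms in M(w) correspond to products of probabilities in the posterior.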

Interpreting learning as inference: Properties of the Bayesian estimate

• The probabilistic interpretation makes our assumptions explicit: with the regularizer we imposed a soft constraint on the learned parameters, which expresses our prior expectations.

• An additional plus: beyond obtaining w_MP, we get a measure of the uncertainty of the learned parameters.
