
Neural Networks: An Introduction

Warith HARCHAOUI

MAP5, UMR 8145, Université Paris-Descartes, Sorbonne Paris Cité
&
Oscaro.com, Research and Development

March 2017

Outline

Supervised Classification and Regression: Classification, Regression

One Neuron: for Regression, for Classification

Gradient Descent: Batch Gradient Descent, Stochastic Gradient Descent

Several Neurons

Convolutional Neural Networks for Images

Adversarial Networks

Conclusion


Supervised Classification: The Binary Case

Given a training set that consists of:

- $x_i \in \mathbb{R}^D$
- $y_i \in \{0, 1\}$

for $i = 1, \dots, n$, find $F$ s.t. $F(x_i) \simeq y_i$.

Example: $x_i$ is an image; $y_i = 1$ corresponds to "cat", $y_i = 0$ corresponds to "non-cat".

Supervised Classification: More than 2 Classes

Given a training set that consists of:

- $x_i \in \mathbb{R}^D$
- $y_i \in \{0, 1\}^K$ (one-hot representation)

for $i = 1, \dots, n$, find $F$ s.t. $F(x_i) \simeq y_i$.

Example: $x_i$ is an image; $y_i = [1, 0, 0]$ corresponds to "cat", $y_i = [0, 1, 0]$ to "dog", $y_i = [0, 0, 1]$ to "elephant".

Regression

Given a training set that consists of:

- $x_i \in \mathbb{R}^D$
- $y_i \in \mathbb{R}^K$

for $i = 1, \dots, n$, find $F$ s.t. $F(x_i) \simeq y_i$.

Example: $x_i$ is a building; $y_i$ is the rent value of the building.


One Neuron: An Input-Output Machine

Figure: One Neuron (inputs $x_1, x_2, x_3$ with weights $w_1, w_2, w_3$, activation $a$, output $y$)

$$y = a(w_1 x_1 + w_2 x_2 + w_3 x_3 + b)$$
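To make this concrete, here is a minimal NumPy sketch of a single neuron (the particular weights, bias, and sigmoid activation are arbitrary choices for the example):

```python
import numpy as np

def sigmoid(a):
    """Sigmoid activation: squashes any real number into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-a))

def neuron(x, w, b, activation=sigmoid):
    """One neuron: weighted sum of the inputs plus a bias,
    passed through a non-linearity a."""
    return activation(np.dot(w, x) + b)

# y = a(w1*x1 + w2*x2 + w3*x3 + b) with three inputs
x = np.array([0.5, -1.0, 2.0])   # inputs x1, x2, x3
w = np.array([0.1, 0.4, -0.2])   # weights w1, w2, w3
b = 0.3                          # bias
print(neuron(x, w, b))           # a single scalar output y
```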

One Neuron for Regression: Least Mean Squares

Prediction:

$$F(x_i) = \hat{y}_i = W x_i + b$$

Loss:

$$L(W, b) = \frac{1}{n} \sum_{i=1}^{n} \| \hat{y}_i - y_i \|_2^2$$
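A minimal NumPy sketch of this prediction and loss, on synthetic data (the dimensions and random inputs are arbitrary choices for the example):

```python
import numpy as np

def predict(X, W, b):
    """Linear prediction: y_hat_i = W x_i + b for each row x_i of X."""
    return X @ W.T + b

def lms_loss(Y_hat, Y):
    """Least-mean-squares loss: (1/n) * sum_i ||y_hat_i - y_i||^2."""
    return np.mean(np.sum((Y_hat - Y) ** 2, axis=1))

# Toy data: n=100 examples, D=3 input dims, K=2 output dims
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
Y = rng.normal(size=(100, 2))
W = rng.normal(size=(2, 3))
b = np.zeros(2)
print(lms_loss(predict(X, W, b), Y))
```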

One Neuron for Binary Classification: Logistic Function

Prediction:

$$\mathrm{score}_i = w^\top x_i + b$$

$$P(y_i = 1) = p_i = \mathrm{Sigmoid}(\mathrm{score}_i) = \frac{1}{1 + \exp(-\mathrm{score}_i)}$$

Loss:

$$\ell(w, b) = \prod_{i : y_i = 1} p_i \prod_{i : y_i = 0} (1 - p_i) = \prod_{i=1}^{n} p_i^{y_i} (1 - p_i)^{1 - y_i}$$

$$L(w, b) = -\frac{1}{n} \log(\ell(w, b)) = -\frac{1}{n} \sum_{i=1}^{n} \left[ y_i \log(p_i) + (1 - y_i) \log(1 - p_i) \right]$$
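A minimal NumPy sketch of the prediction and this negative log-likelihood loss (data and parameters are arbitrary; the small epsilon guards against log(0)):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def binary_cross_entropy(X, y, w, b, eps=1e-12):
    """L(w, b): negative mean log-likelihood of the logistic model."""
    p = sigmoid(X @ w + b)  # p_i = P(y_i = 1)
    return -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))

# Toy data: n=100 points in D=3 dimensions with binary labels
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = rng.integers(0, 2, size=100)
print(binary_cross_entropy(X, y, np.zeros(3), 0.0))  # log(2) for w=0, b=0
```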

One Neuron for Binary Classification: Logistic Function

Figure: The Sigmoid Function

$$\mathrm{Sigmoid}(a) = \frac{1}{1 + \exp(-a)}$$

One Neuron for Classification of $K > 2$ Classes: SoftMax Function

Prediction:

$$\mathrm{score}_{ik} = w_k^\top x_i + b_k$$

$$p_{i,k} = \mathrm{SoftMax}(\mathrm{score}_i)_k = \frac{\exp(\mathrm{score}_{ik})}{\sum_{k'=1}^{K} \exp(\mathrm{score}_{ik'})}$$

$y_{i,k} = 1 \Leftrightarrow x_i$ belongs to the $k$-th class
$y_{i,k} = 0 \Leftrightarrow x_i$ does not belong to the $k$-th class

Loss:

$$\ell(W, b) = \prod_{i=1}^{n} \prod_{k=1}^{K} p_{i,k}^{y_{i,k}}$$

$$L(W, b) = -\frac{1}{n} \log(\ell(W, b)) = -\frac{1}{n} \sum_{i=1}^{n} \sum_{k=1}^{K} y_{i,k} \log(p_{i,k})$$
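The same idea for $K$ classes, as a minimal NumPy sketch (subtracting the row maximum before exponentiating is a standard numerical-stability trick, not part of the formula):

```python
import numpy as np

def softmax(scores):
    """Row-wise SoftMax: p_{i,k} = exp(s_{ik}) / sum_{k'} exp(s_{ik'})."""
    z = np.exp(scores - scores.max(axis=1, keepdims=True))
    return z / z.sum(axis=1, keepdims=True)

def cross_entropy(X, Y, W, b, eps=1e-12):
    """L(W, b): negative mean log-likelihood for one-hot labels Y."""
    P = softmax(X @ W.T + b)  # P[i, k] = p_{i,k}
    return -np.mean(np.sum(Y * np.log(P + eps), axis=1))

# Toy data: n=100 points, D=3 dims, K=4 classes with one-hot labels
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
Y = np.eye(4)[rng.integers(0, 4, size=100)]
print(cross_entropy(X, Y, np.zeros((4, 3)), np.zeros(4)))  # log(4) here
```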


Batch Gradient Descent: The Common Problem

Loss function:

$$L(W, b) = \frac{1}{n} \sum_{i=1}^{n} L_i(W, b)$$

Problem:

$$\min_{W, b} L(W, b)$$

Batch Gradient Descent: A Universal Learning Procedure

$$\min_{w} \frac{1}{n} \sum_{i=1}^{n} L_i(w)$$

1. Choose a random $w$ and a constant $\alpha > 0$
2. Iterate:

$$w_{\text{new}} = w_{\text{old}} - \alpha \nabla L(w_{\text{old}}), \qquad \nabla L(w_{\text{old}}) = \frac{1}{n} \sum_{i=1}^{n} \nabla L_i(w_{\text{old}})$$
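A minimal NumPy sketch of this procedure on the least-mean-squares neuron, where the per-example gradients have a simple closed form (data, learning rate, and iteration count are arbitrary):

```python
import numpy as np

def batch_gradient_descent(X, y, alpha=0.1, n_iters=500):
    """Minimize L(w, b) = (1/n) * sum_i (w . x_i + b - y_i)^2
    by following the full-batch gradient at every step."""
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(n_iters):
        residual = X @ w + b - y                   # shape (n,)
        w -= alpha * (2.0 / n) * (X.T @ residual)  # w_new = w_old - alpha * grad_w
        b -= alpha * (2.0 / n) * residual.sum()    # same update for the bias
    return w, b

# Toy data generated with w_true = [1, -2], b_true = 0.5
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = X @ np.array([1.0, -2.0]) + 0.5
print(batch_gradient_descent(X, y))  # recovers roughly ([1, -2], 0.5)
```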

Stochastic Gradient Descent: A Universal Learning Procedure

$$\min_{w} \frac{1}{n} \sum_{i=1}^{n} L_i(w)$$

1. Choose a random $w$ and a constant $\alpha > 0$
2. Iterate:
   2.1 Choose a random subset $J \subset \{1, \dots, n\}$ (sometimes reduced to a singleton)
   2.2 Update:

$$w_{\text{new}} = w_{\text{old}} - \frac{\alpha}{|J|} \sum_{j \in J} \nabla L_j(w_{\text{old}})$$
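The stochastic variant only changes the inner step: each update uses a small random subset $J$ instead of all $n$ examples. A minimal NumPy sketch on the same least-mean-squares problem:

```python
import numpy as np

def sgd(X, y, alpha=0.05, batch_size=8, n_iters=3000):
    """Stochastic gradient descent: each step estimates the gradient
    from a random subset J of the training set."""
    rng = np.random.default_rng(0)
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(n_iters):
        J = rng.choice(n, size=batch_size, replace=False)  # random subset J
        residual = X[J] @ w + b - y[J]
        w -= alpha * (2.0 / batch_size) * (X[J].T @ residual)
        b -= alpha * (2.0 / batch_size) * residual.sum()
    return w, b

# Same toy problem as above: each step now costs O(|J|) instead of O(n),
# which is what makes the method scale to very large training sets.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 2))
y = X @ np.array([1.0, -2.0]) + 0.5
print(sgd(X, y))  # close to ([1, -2], 0.5), up to gradient noise
```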


Several Neurons: The Power of Back-Propagation

Figure: A Multi-Layer Perceptron (input layer $x_i^1, \dots, x_i^4$, one hidden layer, output layer producing $p_i$)

Several Neurons: The Power of Back-Propagation

Back-Propagation is just an iterated version of the Chain Rule, applied to compositions of many functions:

$$(F \circ G)' = \left( F' \circ G \right) \times G'$$

NB: $(F \circ G)(x) = F(G(x))$
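A minimal NumPy sketch of back-propagation through one hidden layer: the forward pass composes the layers, and the backward pass applies the chain rule above to each one in reverse (architecture, target, and learning rate are arbitrary choices for the example):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))                        # n=100 inputs, D=4
y = (X[:, 0] * X[:, 1] > 0).astype(float)            # a non-linear target
W1, b1 = 0.5 * rng.normal(size=(8, 4)), np.zeros(8)  # hidden layer
W2, b2 = 0.5 * rng.normal(size=8), 0.0               # output neuron

alpha = 0.5
for _ in range(2000):
    # Forward pass: the network is a composition F(G(x))
    h = sigmoid(X @ W1.T + b1)        # hidden activations, G(x)
    p = sigmoid(h @ W2 + b2)          # output, F(G(x))
    # Backward pass: chain rule from the squared loss down to each weight
    dp = (2.0 / len(X)) * (p - y)                # dL/dp
    ds = dp * p * (1 - p)                        # through the output sigmoid
    dh = np.outer(ds, W2) * h * (1 - h)          # through the hidden sigmoid
    W2 -= alpha * (h.T @ ds); b2 -= alpha * ds.sum()
    W1 -= alpha * (dh.T @ X); b1 -= alpha * dh.sum(axis=0)

print(((p > 0.5) == y).mean())  # training accuracy after the last pass
```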

Three Remarks

1. Non-linearity: Sigmoid, SoftMax, ReLU, with $\mathrm{ReLU}(x) = \max(x, 0)$
2. Automatic Differentiation thanks to: Theano, Torch, Caffe, TensorFlow, PyTorch (see the sketch after this list)
3. GPU Acceleration
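As an illustration of remark 2, a minimal sketch with PyTorch (any of the frameworks above would do; the composed function being differentiated is an arbitrary example):

```python
import torch

# Automatic differentiation: build a computation from tensors that
# require gradients, call backward(), and the framework runs the
# chain rule for you -- no hand-derived gradients needed.
w = torch.tensor([1.0, -2.0], requires_grad=True)
x = torch.tensor([0.5, 3.0])

loss = torch.sigmoid(w @ x).pow(2)  # an arbitrary composition of functions
loss.backward()                     # back-propagation in one call

print(w.grad)  # dloss/dw, computed automatically
```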


Convolutional Neural Networks for Images: Convolutions

- $x$: a pixel
- $I$: an image in gray levels
- $K$: a kernel = a filter
- $I * K$: convolution of image $I$ by filter $K$
- $\mathrm{nonLinearity}(I * K)$: element-wise non-linearity on the convolution result, producing a feature map

$$(I * K)(x) = \sum_{y \in \mathrm{Supp}(K)} I(x - y) K(y)$$

The same neuron of weights $K$ is applied many times (as many times as there are pixels in $I$), producing a new image called a feature map.
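A minimal NumPy sketch of this convolution (border handling varies between libraries; here the kernel only slides over positions where it fully fits, a "valid" convolution, and the kernel flip implements the $I(x - y) K(y)$ indexing):

```python
import numpy as np

def conv2d(I, K):
    """Convolve gray-level image I with kernel K ('valid' borders):
    the same weights K are applied at every pixel position."""
    kh, kw = K.shape
    Kf = K[::-1, ::-1]  # flip the kernel: sum of I(x - y) * K(y)
    H, W = I.shape[0] - kh + 1, I.shape[1] - kw + 1
    out = np.empty((H, W))
    for i in range(H):
        for j in range(W):
            out[i, j] = np.sum(I[i:i + kh, j:j + kw] * Kf)
    return out

# Example: a vertical-edge filter on a random "image"; the element-wise
# ReLU on the result gives one feature map.
rng = np.random.default_rng(0)
I = rng.random((28, 28))
K = np.array([[1.0, 0.0, -1.0]] * 3)         # 3x3 edge-detection kernel
feature_map = np.maximum(conv2d(I, K), 0.0)  # nonLinearity(I * K)
print(feature_map.shape)                     # (26, 26)
```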

Convolutional Neural Networks for Images: Convolutions

Figure: LeNet architecture


Adversarial Networks: A Desired Network

Figure: Scheme for a Desired Network ($x$ or $z$ → Generator → $\hat{y}$)

Adversarial Networks: Binary Classification Networks

Figure: Scheme for Binary Classification Networks ($y$ or $\hat{y}$ → Discriminator → $p$)

Adversarial Networks: The Full System

Figure: Scheme for Adversarial Networks ($x$ or $z$ → Generator → $\hat{y}$; $\hat{y}$ or $y$ → Discriminator → $p$)

Adversarial Networks: A New Kind of Loss

- $G$: Generator (e.g. of images) from random noise or a real image
- $D$: Discriminator that distinguishes fake examples from real examples

$$\min_{w_D} \max_{w_G} L$$

Figure: Adversarial Networks Example
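A minimal PyTorch sketch of this min-max game on 1-D data (the architectures, data distribution, and hyper-parameters are arbitrary choices; the generator step uses the common "non-saturating" reformulation of its maximization):

```python
import torch
import torch.nn as nn

# G maps noise z to fake samples; D outputs p = P(example is real).
# D minimizes the binary classification loss L; G tries to maximize it.
G = nn.Sequential(nn.Linear(4, 16), nn.ReLU(), nn.Linear(16, 1))
D = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1), nn.Sigmoid())
opt_G = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_D = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCELoss()

for step in range(2000):
    real = 0.5 * torch.randn(32, 1) + 2.0   # "real" data: N(2, 0.5^2)
    fake = G(torch.randn(32, 4))            # generated samples
    # Discriminator step: push D(real) towards 1 and D(fake) towards 0
    d_loss = bce(D(real), torch.ones(32, 1)) + \
             bce(D(fake.detach()), torch.zeros(32, 1))
    opt_D.zero_grad(); d_loss.backward(); opt_D.step()
    # Generator step: fool D by pushing D(fake) towards 1
    g_loss = bce(D(G(torch.randn(32, 4))), torch.ones(32, 1))
    opt_G.zero_grad(); g_loss.backward(); opt_G.step()

print(G(torch.randn(1000, 4)).mean().item())  # should drift towards ~2
```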


Conclusion: A Great Book

Figure: The Deep Learning Book (Goodfellow, Bengio and Courville, MIT Press, 2016)

Recommended.