
Linear Regression, Neural Networks, etc.

CS444
Nathan Sprague

James Madison University

Neurons

● Neurons communicate using discrete electrical signals called "spikes" (or action potentials).
  – Spikes travel along axons.
  – Spikes reach the axon terminals.
  – Terminals release neurotransmitters.
  – Postsynaptic neurons respond by allowing current to flow in (or out).
  – If the voltage crosses a threshold, a spike is created.

Creative Commons by-nc-sa 3.0
Beginning Psychology (v. 1.0).
http://2012books.lardbucket.org/books/beginning-psychology/

Multivariate Linear Regression

● Multi-dimensional input vectors:

  $h(x_1, x_2, \ldots, x_n) = w_0 + w_1 x_1 + \ldots + w_n x_n$

● Or:

  $h(\mathbf{x}) = \mathbf{w}^T \mathbf{x}$

[Diagram: a single unit with inputs x_1, ..., x_n plus a constant input 1, weights w_1, ..., w_n and w_0, and output h(x).]
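A minimal sketch, assuming NumPy, of evaluating h(x) = w^T x with the bias weight w_0 paired with a constant input of 1; the weights and input below are invented for illustration:

    import numpy as np

    # Hypothetical weights and input: w[0] is the bias weight w0,
    # matched by the constant 1 at the front of the input vector.
    w = np.array([0.5, 2.0, -1.0])   # [w0, w1, w2]
    x = np.array([1.0, 3.0, 4.0])    # [1,  x1, x2]

    h = w @ x                        # h(x) = w^T x
    print(h)                         # 0.5 + 2.0*3.0 + (-1.0)*4.0 = 2.5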

Linear Regression – The Neural View

● Input = x, desired output = y, weight = w.

● h(x) = wx

● We are given a set of inputs, and a corresponding set of outputs, and we need to choose w.

● What's going on geometrically?

[Diagram: a single unit with input x, weight w, and output h(x).]

Lines

[Plot: x and y axes.]

● h(x) = wx is the equation of a line with a y intercept of 0.

● What is the best value of w?

● How do we find it?

Bias Weights

● We need to use the general equation for a line: $h(x) = w_1 x + w_0$

● This corresponds to a new neural network with one additional weight, and an input fixed at 1.

[Diagram: a unit with input x (weight w_1) and constant input 1 (weight w_0), and output h(x).]

Error Metric

● Sum squared error (y is the desired output):

  $Error_E = \sum_{e \in E} \frac{1}{2} \left( y_e - h(\mathbf{x}_e) \right)^2$

● The goal is to find a w that minimizes $Error_E$. How?
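A short sketch (assuming NumPy and invented toy data) of computing this sum squared error for a candidate w, with each row of X holding one input x_e:

    import numpy as np

    def sum_squared_error(w, X, y):
        # Error_E = sum over examples of 0.5 * (y_e - h(x_e))^2, with h(x) = w^T x
        residuals = y - X @ w
        return 0.5 * np.sum(residuals ** 2)

    # Toy data: a leading column of ones supplies the bias input.
    X = np.array([[1.0, 0.0],
                  [1.0, 1.0],
                  [1.0, 2.0]])
    y = np.array([1.0, 3.0, 5.0])
    print(sum_squared_error(np.array([1.0, 2.0]), X, y))   # 0.0 -- this w fits exactly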

Gradient Descent

http://en.wikipedia.org/wiki/File:Glacier_park1.jpg

Attribution-Share Alike 3.0 Unported

Gradient Descent

● One possible approach (maximization):
  1) Take the derivative of the function: f'(w)
  2) Guess a value of w
  3) Move a little bit according to the derivative:
     $w \leftarrow w + \eta f'(w)$
  4) Go to step 3 and repeat.

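The loop above can be sketched directly in Python; the function below is an invented example, f(w) = -(w - 3)^2 with derivative f'(w) = -2(w - 3), and the learning rate is an arbitrary choice:

    # Gradient ascent on an invented 1-D function, following steps 1-4 above.
    def f_prime(w):
        return -2.0 * (w - 3.0)      # derivative of f(w) = -(w - 3)^2

    w = 0.0                          # step 2: guess a value of w
    eta = 0.1                        # learning rate (eta)
    for _ in range(100):
        w = w + eta * f_prime(w)     # step 3: w <- w + eta * f'(w)

    print(w)                         # approaches 3.0, the maximizer of f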

Partial Derivatives

● Derivative of a function of multiple variables, with all but the variable of interest held constant.

  $f(x, y) = x^2 + x y^2$

  $f_x(x, y) = 2x + y^2$   OR   $\frac{\partial f(x, y)}{\partial x} = 2x + y^2$

  $f_y(x, y) = 2 x y$   OR   $\frac{\partial f(x, y)}{\partial y} = 2 x y$
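These partials are easy to sanity-check numerically; the sketch below uses central finite differences (with an arbitrarily chosen step size) to approximate both partial derivatives of the example function:

    # Finite-difference check of the partial derivatives above.
    def f(x, y):
        return x**2 + x * y**2

    eps = 1e-6
    x, y = 1.5, 2.0
    df_dx = (f(x + eps, y) - f(x - eps, y)) / (2 * eps)   # ~ 2x + y^2 = 7.0
    df_dy = (f(x, y + eps) - f(x, y - eps)) / (2 * eps)   # ~ 2xy     = 6.0
    print(df_dx, df_dy)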

Gradient

● The gradient is just the generalization of the derivative to multiple dimensions:

  $\nabla f(\mathbf{w}) = \left[ \frac{\partial f(\mathbf{w})}{\partial w_1}, \frac{\partial f(\mathbf{w})}{\partial w_2}, \ldots, \frac{\partial f(\mathbf{w})}{\partial w_n} \right]$

● Gradient descent update:

  $\mathbf{w} \leftarrow \mathbf{w} - \eta \nabla f(\mathbf{w})$
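A minimal sketch of the vector update on an invented quadratic f(w) = ||w - c||^2, whose gradient is 2(w - c); the learning rate and iteration count are arbitrary:

    import numpy as np

    c = np.array([1.0, -2.0, 0.5])       # minimizer of the invented f
    w = np.zeros(3)                      # initial guess
    eta = 0.1
    for _ in range(200):
        grad = 2.0 * (w - c)             # gradient of f at w
        w = w - eta * grad               # w <- w - eta * grad f(w)

    print(w)                             # approaches c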

Gradient Descent for MVLR

● Error for the multi-dimensional case:

  $Error_E(\mathbf{w}) = \sum_{e \in E} \frac{1}{2} \left( y_e - \mathbf{w}^T \mathbf{x}_e \right)^2$

  $\frac{\partial Error_E(\mathbf{w})}{\partial w_i} = \sum_{e \in E} (y_e - \mathbf{w}^T \mathbf{x}_e)(-x_{e,i}) = -\sum_{e \in E} (y_e - \mathbf{w}^T \mathbf{x}_e) \, x_{e,i}$

● The new update rule:

  $w_i \leftarrow w_i + \eta \sum_{e \in E} (y_e - \mathbf{w}^T \mathbf{x}_e) \, x_{e,i}$

● Vector version:

  $\mathbf{w} \leftarrow \mathbf{w} + \eta \sum_{e \in E} (y_e - \mathbf{w}^T \mathbf{x}_e) \, \mathbf{x}_e$
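Putting the pieces together, here is a sketch of batch gradient descent for multivariate linear regression on invented data (NumPy assumed; the leading column of ones supplies the bias input, and eta is an arbitrary choice):

    import numpy as np

    rng = np.random.default_rng(0)
    X = np.column_stack([np.ones(50), rng.uniform(-1, 1, size=(50, 2))])
    true_w = np.array([0.5, 2.0, -1.0])
    y = X @ true_w                       # noiseless targets for illustration

    w = np.zeros(3)
    eta = 0.01
    for _ in range(1000):
        w = w + eta * X.T @ (y - X @ w)  # vector form of the summed update rule

    print(w)                             # approaches true_w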

Analytical Solution

  $\mathbf{w} = (X^T X)^{-1} X^T \mathbf{y}$

● Where X is a matrix with one input per row, and y is the vector of target values.
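A sketch of this closed-form solution on invented data; np.linalg.solve is applied to the normal equations (in practice np.linalg.lstsq is often preferred for numerical stability):

    import numpy as np

    rng = np.random.default_rng(1)
    X = np.column_stack([np.ones(20), rng.normal(size=(20, 2))])
    y = X @ np.array([0.5, 2.0, -1.0]) + 0.01 * rng.normal(size=20)

    w = np.linalg.solve(X.T @ X, X.T @ y)   # w = (X^T X)^{-1} X^T y
    print(w)                                # close to [0.5, 2.0, -1.0]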

Notice that we get Polynomial Regression for Free

$y = w_1 x^2 + w_2 x + w_0$
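One way to see this, sketched on invented data: build the feature columns x^2, x, and 1 by hand and fit them with ordinary linear least squares; the recovered weights match the polynomial's coefficients:

    import numpy as np

    x = np.linspace(-2, 2, 25)
    y = 3.0 * x**2 - 1.0 * x + 0.5             # invented target polynomial

    # Columns [x^2, x, 1] match y = w1*x^2 + w2*x + w0 from the slide.
    X = np.column_stack([x**2, x, np.ones_like(x)])
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    print(w)                                   # approximately [3.0, -1.0, 0.5]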