
Optimization methods

Morten Nielsen
Department of Systems Biology, DTU
IIB-INTECH, UNSAM, Argentina

*Adapted from slides by Chen Keasar, Ben-Gurion University

Minimization

The path to the closest local minimum = local minimization

Minimization

The path to the global minimum

Outline

• Optimization procedures
  – Gradient descent
  – Monte Carlo

• Overfitting
  – Cross-validation

• Method evaluation

Linear methods. Error estimate

[Figure: two-input linear unit with inputs I1, I2, weights w1, w2, and linear output o = w1·I1 + w2·I2]

Gradient descent (from Wikipedia)

Gradient descent is based on the observation that if the real-valued function F(x) is defined and differentiable in a neighborhood of a point a, then F(x) decreases fastest if one goes from a in the direction of the negative gradient of F at a. It follows that if

b = a − γ∇F(a)

for γ > 0 a small enough number, then F(b) < F(a).
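A minimal Python sketch of this observation (the function F(x) = x², its gradient, and the step size γ are illustrative choices, not from the slides):

# Repeatedly step from a in the direction of the negative gradient of F.
def gradient_descent(grad, a, gamma=0.1, steps=100):
    for _ in range(steps):
        a = a - gamma * grad(a)   # b = a - gamma * grad_F(a), so F(b) < F(a)
    return a

# F(x) = x**2 has gradient 2x and a single minimum at x = 0
print(gradient_descent(lambda x: 2 * x, a=5.0))  # converges towards 0.0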

Gradient descent (example)

Gradient descent

Weights are changed in the opposite direction of the gradient of the error.

Gradient descent (linear function)

[Figure: two-input linear unit with inputs I1, I2, weights w1, w2, and linear output o]

Weights are changed in the opposite direction of the gradient of the error.

Gradient descent. Example
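A sketch of the update-rule derivation for the linear unit, in LaTeX; the squared-error form is an assumption, but it is consistent with the worked table below:

o = w_1 I_1 + w_2 I_2, \qquad E = \tfrac{1}{2}(o - t)^2

\frac{\partial E}{\partial w_i} = (o - t)\, I_i, \qquad \Delta w_i = -\varepsilon \frac{\partial E}{\partial w_i} = -\varepsilon\,(o - t)\, I_i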

Gradient descent. Doing it yourself

Weights are changed in the opposite direction of the gradient of the error.

[Figure: linear unit with inputs I1 = 1, I2 = 0 and initial weights w1 = 0.1, w2 = 0.1]

What are the weights after 2 forward (calculate predictions) and backward (update weights) iterations with the given input, and has the error decreased? (Use ε = 0.1 and t = 1.)

Fill out the table:

itr   W1     W2     O
0     0.1    0.1
1
2

What are the weights after 2 forward/backward iterations with the given input, and has the error decreased? (Use ε = 0.1, t = 1.)

Fill out the table:

itr   W1     W2     O
0     0.1    0.1    0.1
1     0.19   0.1    0.19
2     0.27   0.1    0.27
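A short Python check of these numbers, assuming the update rule Δwᵢ = −ε(o − t)Iᵢ derived above:

# Two forward/backward iterations on the linear unit o = w1*I1 + w2*I2
I = [1.0, 0.0]        # inputs I1, I2
w = [0.1, 0.1]        # initial weights
eps, t = 0.1, 1.0     # learning rate epsilon and target value

for itr in (1, 2):
    o = w[0] * I[0] + w[1] * I[1]              # forward: prediction
    w = [wi - eps * (o - t) * Ii               # backward: gradient step
         for wi, Ii in zip(w, I)]
    print(itr, round(w[0], 2), w[1], round(w[0] * I[0] + w[1] * I[1], 2))

# Prints 1 0.19 0.1 0.19 and 2 0.27 0.1 0.27; the gap |o - t| shrinks from
# 0.9 to 0.73, so yes, the error has decreased.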


Monte Carlo

Because of their reliance on repeated computation of random or pseudo-random numbers, Monte Carlo methods are most suited to calculation by a computer. Monte Carlo methods tend to be used when it is infeasible or impossible to compute an exact result with a deterministic algorithm. Or when you are too stupid to do the math yourself?

Example: Estimating π by independent Monte Carlo samples

Suppose we throw darts randomly (and uniformly) at the square:

Algorithm:
  for i = 1..ntrials
    x = (random number in [0..r])
    y = (random number in [0..r])
    distance = sqrt(x^2 + y^2)
    if distance ≤ r then hits++
  end
Output: π ≈ 4 × hits / ntrials

Adapted from course slides by Craig Douglas

http://www.chem.unl.edu/zeng/joy/mclab/mcintro.html
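A runnable Python version of the algorithm above (r = 1, and ntrials is an arbitrary choice):

import random

def estimate_pi(ntrials=1_000_000, r=1.0):
    hits = 0
    for _ in range(ntrials):
        x = random.uniform(0, r)            # random dart in the square
        y = random.uniform(0, r)
        if (x * x + y * y) ** 0.5 <= r:     # inside the quarter circle
            hits += 1
    # hits/ntrials estimates the area ratio (pi r^2 / 4) / r^2 = pi / 4
    return 4 * hits / ntrials

print(estimate_pi())   # ~3.14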

Estimating π

Monte Carlo (Minimization)

[Figure: Metropolis acceptance. dE < 0: accept the move; dE > 0: accept with probability P = exp(−dE/T)]
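A sketch of the acceptance step in Python; the Metropolis form P = exp(−dE/T) is the standard choice and my assumption here:

import math, random

def accept(dE, T):
    # Metropolis criterion for minimization:
    # downhill moves (dE < 0) are always accepted,
    # uphill moves (dE > 0) with probability exp(-dE/T)
    return dE < 0 or random.random() < math.exp(-dE / T)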

The Traveling Salesman

Adapted from www.mpp.mpg.de/~caldwell/ss11/ExtraTS.pdf

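A compact Python sketch of Monte Carlo minimization applied to the traveling salesman problem; the random cities, the swap move, and the fixed temperature are all illustrative assumptions, not details from the slides:

import math, random

def tour_length(tour, cities):
    # total length of the closed tour
    return sum(math.dist(cities[tour[i]], cities[tour[(i + 1) % len(tour)]])
               for i in range(len(tour)))

def mc_tsp(cities, T=0.1, steps=100_000):
    tour = list(range(len(cities)))
    E = tour_length(tour, cities)
    for _ in range(steps):
        i, j = random.sample(range(len(tour)), 2)    # propose: swap two cities
        tour[i], tour[j] = tour[j], tour[i]
        dE = tour_length(tour, cities) - E
        if dE < 0 or random.random() < math.exp(-dE / T):
            E += dE                                   # accept the move
        else:
            tour[i], tour[j] = tour[j], tour[i]       # reject: undo the swap
    return tour, E

cities = [(random.random(), random.random()) for _ in range(20)]
print(mc_tsp(cities)[1])   # tour length after the Monte Carlo search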

Gibbs sampler. Monte Carlo simulations

RFFGGDRGAPKRGYLDPLIRGLLARPAKLQVKPGQPPRLLIYDASNRATGIPA GSLFVYNITTNKYKAFLDKQ SALLSSDITASVNCAK GFKGEQGPKGEPDVFKELKVHHANENI SRYWAIRTRSGGITYSTNEIDLQLSQEDGQTIE

RFFGGDRGAPKRGYLDPLIRGLLARPAKLQVKPGQPPRLLIYDASNRATGIPAGSLFVYNITTNKYKAFLDKQ SALLSSDITASVNCAK GFKGEQGPKGEPDVFKELKVHHANENI SRYWAIRTRSGGITYSTNEIDLQLSQEDGQTIE

E1 = 5.4 → E2 = 5.7: dE > 0; P_accept = 1

E1 = 5.4 → E2 = 5.2: dE < 0; 0 < P_accept < 1

Note the sign. Maximization

Monte Carlo Temperature

• What is the Monte Carlo temperature?

• Say dE = −0.2 and T = 1

• What about T = 0.001? (See the worked numbers below.)
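Worked numbers for this exercise, assuming the maximization convention from the previous slides (a move with dE < 0 is accepted with P = exp(dE/T)):

P = e^{dE/T}: \quad T = 1:\; e^{-0.2} \approx 0.82 \qquad T = 0.001:\; e^{-0.2/0.001} = e^{-200} \approx 10^{-87}

At high temperature most moves are accepted; as T drops, only improving moves survive.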

MC minimization

Monte Carlo - Examples

• Why a temperature? To escape local minima.

Stabilization matrix method

• A prediction method contains a very large set of parameters

  – A matrix for predicting binding of 9-meric peptides has 9×20 = 180 weights

• Overfitting is a problem

Data driven method training

[Figure: temperature measurements plotted against years]

Regression methods. The mathematics

y = ax + b: 2-parameter model

Good description, poor fit

y = ax^6 + bx^5 + cx^4 + dx^3 + ex^2 + fx + g: 7-parameter model

Poor description, good fit

Model over-fitting

Stabilization matrix method (Ridge regression). The mathematics

y = ax + b: 2-parameter model

Good description, poor fit

y = ax^6 + bx^5 + cx^4 + dx^3 + ex^2 + fx + g: 7-parameter model

Poor description, good fit

SMM training

Evaluate on 600 MHC:peptide binding data:
λ = 0: PCC = 0.70
λ = 0.1: PCC = 0.78

Stabilization matrix method. The analytic solution

Each peptide is represented as 9×20 numbers (180). H is a stack of such vectors of 180 values. t is the target value (the measured binding). λ is a parameter introduced to suppress the effect of noise in the experimental data and lower the effect of overfitting.
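The analytic solution sketched here is standard ridge regression, w = (HᵀH + λI)⁻¹Hᵀt; a NumPy sketch under that assumption:

import numpy as np

def smm_analytic(H, t, lam):
    # H:   (N, 180) matrix, one row of 9x20 encoded values per peptide
    # t:   (N,) vector of measured binding values
    # lam: regularization parameter suppressing noise and overfitting
    n = H.shape[1]
    # solve (H^T H + lam*I) w = H^T t instead of forming the inverse
    return np.linalg.solve(H.T @ H + lam * np.eye(n), H.T @ t)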

SMM - Stabilization matrix method

[Figure: two-input linear unit with inputs I1, I2, weights w1, w2, and linear output o]

Per target error: E = 1/2 (o − t)^2 + λ sum_i w_i^2   (the second term is a sum over weights)

Global error: E = sum_data 1/2 (o − t)^2 + λ sum_i w_i^2   (a sum over data points plus a sum over weights)

SMM - Stabilization matrix method. Do it yourself

[Figure: two-input linear unit with inputs I1, I2, weights w1, w2, and linear output o]

λ per target

SMM - Stabilization matrix method

[Figure: two-input linear unit with inputs I1, I2, weights w1, w2, and linear output o]

λ per target
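A sketch of the derivation the "do it yourself" slide asks for, assuming the per-target ridge-style error given above:

E = \tfrac{1}{2}(o - t)^2 + \lambda \sum_i w_i^2

\frac{\partial E}{\partial w_i} = (o - t)\, I_i + 2\lambda w_i, \qquad \Delta w_i = -\varepsilon\left[(o - t)\, I_i + 2\lambda w_i\right]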

SMM - Stabilization matrix method. Monte Carlo

[Figure: two-input linear unit with inputs I1, I2, weights w1, w2, and linear output o]

Global:

• Make a random change to the weights

• Calculate the change in the "global" error

• Update the weights if the MC move is accepted

Note the difference between MC and GD in the use of the "global" versus the "per target" error.
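A minimal Python sketch of this Monte Carlo training loop; the Gaussian move size, temperature, and error function are illustrative assumptions:

import math, random

def mc_train(data, w, lam=0.1, T=0.01, steps=10_000):
    # Global error: squared error summed over ALL data points, plus the
    # regularization term (contrast with the per-target error used by GD)
    def global_error(w):
        sq = sum(0.5 * (sum(wi * Ii for wi, Ii in zip(w, I)) - t) ** 2
                 for I, t in data)
        return sq + lam * sum(wi * wi for wi in w)

    E = global_error(w)
    for _ in range(steps):
        i = random.randrange(len(w))
        old = w[i]
        w[i] += random.gauss(0, 0.05)          # random change to one weight
        dE = global_error(w) - E
        if dE < 0 or random.random() < math.exp(-dE / T):
            E += dE                             # accept the MC move
        else:
            w[i] = old                          # reject: restore the weight
    return w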

Training/evaluation procedure

• Define method

• Select data

• Deal with data redundancy
  – In method (sequence weighting)
  – In data (Hobohm)

• Deal with over-fitting either
  – in method (SMM regularization term) or
  – in training (stop fitting on test-set performance)

• Evaluate method using cross-validation

A small doit script: /home/user1/bin/doit_ex

#! /bin/tcsh
# Loop over all alleles listed in 'allelefile'
foreach a ( `cat allelefile` )
  mkdir -p $a
  cd $a
  # Scan the regularization parameter lambda
  foreach l ( 0 1 2.5 5 10 20 30 )
    mkdir -p l.$l
    cd l.$l
    # 5-fold cross-validation: train on train.$n, predict eval.$n
    foreach n ( 0 1 2 3 4 )
      smm -nc 500 -l $l train.$n > mat.$n
      pep2score -mat mat.$n eval.$n > eval.$n.pred
    end
    # Pool the five prediction files and report the correlation
    # between predicted and measured values (xycorr)
    echo $a $l `cat eval.?.pred | grep -v "#" | gawk '{print $2,$3}' | xycorr`
    cd ..
  end
  cd ..
end
