- Slide 1
- Machine Learning Week 4 Lecture 1
- Slide 2
- Hand-In: Data is coming online later today. I keep a test set with
approx. 1000 test images; that will be your real test. You are most
welcome to add regularization as we discussed last week, but it is
not a requirement. Hand-in Version 4 is available.
- Slide 3
- Recap: what is going on, and ways to fix it
- Slide 4
- Overfitting: Data increases -> overfitting decreases. Noise
increases -> overfitting increases. Target complexity increases
-> overfitting increases.
- Slide 5
- Learning Theory Perspective: In-sample error + model complexity.
Instead of picking a simpler hypothesis set, prefer simpler
hypotheses h from H. Define what "simple" means in a complexity
measure Omega(h), and minimize E_in(h) + Omega(h).
- Slide 6
- Regularization: In-sample error + model complexity. Weight Decay:
minimize E_in(w) + (lambda/N) w^T w. The gradient descent update
becomes w <- (1 - 2*eta*lambda/N) w - eta * grad E_in(w): every
round we take a step towards the zero vector.
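The weight-decay update above can be sketched as follows; the step size `eta`, penalty `lam`, and sample count `N` are illustrative values, not from the slides:

```python
import numpy as np

def weight_decay_step(w, grad_E_in, eta=0.1, lam=0.01, N=100):
    """One gradient step on E_in(w) + (lam/N) * w^T w.

    The gradient of the penalty term is (2*lam/N) * w, so the update
    first shrinks w toward the zero vector, then follows -grad_E_in.
    """
    return (1 - 2 * eta * lam / N) * w - eta * grad_E_in

# With a zero data gradient, every step shrinks w toward 0:
w = np.array([1.0, -2.0])
w = weight_decay_step(w, grad_E_in=np.zeros(2))
```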
- Slide 7
- Why are small weights better? Practical perspective: because in
practice we believe that noise is "noisy": stochastic noise is
high-frequency, and deterministic noise is also non-smooth, so
small weights (smoother hypotheses) fit less of it. Sometimes the
weights are weighed differently; the bias term gets a free ride
(it is usually not penalized).
- Slide 8
- Regularization Summary: More art than science; use VC and
bias-variance as guides. Weight decay is a universal technique,
based on the practical belief that noise is noisy (non-smooth).
Question: which regularizer to use? Many other regularizers exist.
Extremely important. Quote from the book: a "necessary evil".
- Slide 9
- Validation: Regularization estimates the complexity penalty;
validation estimates the out-of-sample error directly. Remember
the test set.
- Slide 10
- Model Selection: t models m_1, ..., m_t. Which is better? Compute
E_val(m_1), E_val(m_2), ..., E_val(m_t) and pick the minimum one.
Train on D_train, validate on D_val. Use this to find lambda for
my weight decay.
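A minimal sketch of validation-based selection of the weight-decay parameter, on a toy ridge-regression problem (the data, sizes, and lambda grid are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative train/validation split.
X_train, y_train = rng.normal(size=(80, 3)), rng.normal(size=80)
X_val, y_val = rng.normal(size=(20, 3)), rng.normal(size=20)

def train(X, y, lam):
    # Closed-form minimizer of ||Xw - y||^2 + lam * ||w||^2.
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def E_val(w):
    return np.mean((X_val @ w - y_val) ** 2)

lambdas = [0.01, 0.1, 1.0, 10.0]
errors = [E_val(train(X_train, y_train, lam)) for lam in lambdas]
best_lam = lambdas[int(np.argmin(errors))]  # pick the minimum E_val
```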
- Slide 11
- Validation Dilemma: with K validation points, increasing K
tightens the E_val estimate, but E_val itself increases (less data
is left for training). Small K vs. large K: we would like to have
both. Answer: cross validation.
- Slide 12
- K-Fold Cross Validation: split the data into N/K parts of size K.
Train on all but one part, test on the remaining one. Pick the
model that is best on average over the N/K partitions. Usual
choice: K = N/10, i.e. 10 folds (we do not have all day).
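The procedure above can be sketched as follows, using the slide's convention that each held-out part has K points; the least-squares model and the toy data are illustrative:

```python
import numpy as np

def cross_validation_error(X, y, train, error, K):
    """Average validation error over the N/K partitions of size K."""
    N = len(y)
    errs = []
    for start in range(0, N, K):
        held = np.arange(start, start + K)       # one part of size K
        rest = np.setdiff1d(np.arange(N), held)  # train on the rest
        w = train(X[rest], y[rest])
        errs.append(error(w, X[held], y[held]))
    return np.mean(errs)

# Tiny illustration with least squares (names are illustrative):
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))
y = X @ np.array([1.0, -1.0]) + 0.1 * rng.normal(size=100)
lstsq = lambda X, y: np.linalg.lstsq(X, y, rcond=None)[0]
mse = lambda w, X, y: np.mean((X @ w - y) ** 2)
cv_err = cross_validation_error(X, y, lstsq, mse, K=10)  # K = N/10
```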
- Slide 13
- Today: Support Vector Machines. Margins intuition; the
optimization problem; convex optimization; Lagrange multipliers;
Lagrange for SVM. WARNING: linear algebra and functional analysis
coming up.
- Slide 14
- Support Vector Machines: covered today and next time.
- Slide 15
- Notation: Target y is in {-1,+1}. We write the parameters as w
and b. The hyperplane we consider is w^T x + b = 0. Data
D = {(x_i, y_i)}. For now, assume D is linearly separable.
- Primal Problem: minimize f(x) subject to g_i(x) <= 0 and
h_i(x) = 0, with Lagrangian L(x, lambda, nu) = f(x) +
sum_i lambda_i g_i(x) + sum_i nu_i h_i(x). x is primal infeasible
if g_i(x) > 0 for some i or h_i(x) != 0 for some i. If x is primal
infeasible: when g_i(x) > 0 for some i, maximizing over
lambda_i >= 0 makes lambda_i g_i(x) unbounded; when h_i(x) != 0
for some i, maximizing over nu_i makes nu_i h_i(x) unbounded. So
max_{lambda >= 0, nu} L(x, lambda, nu) = infinity for infeasible x.
- Slide 31
- If x is primal feasible: g_i(x) <= 0 for all i, so maximizing
over lambda_i >= 0 gives the optimum at lambda_i = 0; h_i(x) = 0
for all i, so nu_i h_i(x) = 0 and nu_i is irrelevant. Hence
max_{lambda >= 0, nu} L(x, lambda, nu) = f(x) for feasible x.
- Slide 32
- Primal Problem: we made the constraints into a value in the
objective: p* = min_x max_{lambda >= 0, nu} L(x, lambda, nu).
Which is what we are looking for!!! An x attaining this minimum
is an optimal x.
- Slide 33
- Dual Problem: d* = max_{lambda >= 0, nu} min_x L(x, lambda, nu).
(lambda, nu) are dual feasible if lambda_i >= 0 for all i. This
implies that min_x L(x, lambda, nu) <= p* for every dual feasible
(lambda, nu).
- Slide 34
- Weak and Strong Duality: weak duality says d* <= p*. Question:
when are they equal?
- Slide 35
- Strong Duality: Slater's Condition. If f and the g_i are convex,
the h_i are affine, and the problem is strictly feasible (i.e.
there exists a primal feasible x such that g_i(x) < 0 for all i),
then d* = p* (strong duality). Assume from now on that this is
the case.
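Weak and strong duality can be checked numerically on a one-dimensional convex toy problem that satisfies Slater's condition; the problem below is illustrative, not from the slides:

```python
import numpy as np

# Toy problem: minimize f(x) = x^2 subject to g(x) = 1 - x <= 0.
# Slater holds: x = 2 is strictly feasible (g(2) = -1 < 0).
# Primal optimum: x* = 1, p* = 1.
# Lagrangian: L(x, lam) = x^2 + lam*(1 - x); minimizing over x gives
# x = lam/2, so the dual function is d(lam) = lam - lam^2 / 4.

def dual(lam):
    return lam - lam ** 2 / 4

lams = np.linspace(0, 4, 401)
p_star = 1.0
d_star = max(dual(l) for l in lams)
# Weak duality: every dual value lower-bounds p*.
# Strong duality (Slater): d* = p*, attained at lam* = 2.
```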
- Slide 36
- Complementary Slackness: let x* be primal optimal and
(lambda*, nu*) dual optimal with p* = d*. Then
lambda_i* g_i(x*) = 0 for all i: the terms -lambda_i* g_i(x*) are
all non-negative, and they sum to zero, so each one must be zero.
This is complementary slackness.
- Slide 37
- Karush-Kuhn-Tucker (KKT) Conditions: let x* be primal optimal
and (lambda*, nu*) dual optimal with p* = d*. Then:
g_i(x*) <= 0 and h_i(x*) = 0 for all i (primal feasibility);
lambda_i* >= 0 for all i (dual feasibility);
lambda_i* g_i(x*) = 0 for all i (complementary slackness);
and, since x* minimizes L(x, lambda*, nu*), grad f(x*) +
sum_i lambda_i* grad g_i(x*) + sum_i nu_i* grad h_i(x*) = 0
(stationarity). For convex problems satisfying Slater's condition,
the KKT conditions are necessary and sufficient for optimality.
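The four conditions can be verified by hand on a small constrained problem; the 1-D example below is illustrative, not from the slides:

```python
# Toy problem: minimize f(x) = (x - 2)^2 subject to g(x) = x - 1 <= 0.
# The constraint is active at the optimum, so x* = 1.
# Stationarity: f'(x*) + lam* g'(x*) = 2(x* - 2) + lam* = 0 => lam* = 2.
x_star, lam_star = 1.0, 2.0

primal_feasible = (x_star - 1) <= 0           # g(x*) <= 0
dual_feasible = lam_star >= 0                 # lam* >= 0
comp_slack = lam_star * (x_star - 1) == 0     # lam* g(x*) = 0
stationary = 2 * (x_star - 2) + lam_star == 0
```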
- Slide 38
- Finally Back To SVM: minimize (1/2) ||w||^2 subject to
y_i (w^T x_i + b) >= 1 for all i. Define the Lagrangian
L(w, b, alpha) = (1/2) ||w||^2 - sum_i alpha_i [y_i (w^T x_i + b)
- 1] (no nu is required: there are no equality constraints).
- Slide 39
- SVM Dual Form: we need to minimize L over w and b. Take the
derivative with respect to w and solve for 0:
grad_w L = w - sum_i alpha_i y_i x_i = 0, so
w = sum_i alpha_i y_i x_i. w is a vector that is a specific linear
combination of the input points.
- Slide 40
- SVM Dual Form: the derivative with respect to b is
-sum_i alpha_i y_i, which must be 0. We get the constraint
sum_i alpha_i y_i = 0.
- Slide 41
- SVM Dual Form: insert w = sum_i alpha_i y_i x_i into the
Lagrangian above.
- Slide 42
- SVM Dual Form: insert the constraint sum_i alpha_i y_i = 0 above;
the terms involving b vanish.
- Slide 43
- SVM Dual Form: L(alpha) = sum_i alpha_i
- (1/2) sum_i sum_j alpha_i alpha_j y_i y_j x_i^T x_j.
- Slide 44
- SVM Dual Problem: we found the minimum over w and b; now maximize
over alpha: maximize sum_i alpha_i
- (1/2) sum_i sum_j alpha_i alpha_j y_i y_j x_i^T x_j subject to
alpha_i >= 0 for all i and sum_i alpha_i y_i = 0. Remember
w = sum_i alpha_i y_i x_i.
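On a tiny two-point data set the dual can be solved by a simple grid search (the data are illustrative; with one point per class, the constraint sum_i alpha_i y_i = 0 forces both multipliers to be equal):

```python
import numpy as np

# Tiny separable data set: one point per class.
X = np.array([[1.0, 0.0], [-1.0, 0.0]])
y = np.array([1.0, -1.0])

def dual_objective(alpha):
    # sum_i alpha_i - 1/2 sum_ij alpha_i alpha_j y_i y_j x_i^T x_j
    Z = y[:, None] * X
    return alpha.sum() - 0.5 * alpha @ (Z @ Z.T) @ alpha

# The constraint forces alpha_1 = alpha_2 = a here, so maximize
# over a single scalar a >= 0 on a grid.
grid = np.linspace(0, 1, 1001)
a_best = grid[np.argmax([dual_objective(np.array([a, a])) for a in grid])]
alpha = np.array([a_best, a_best])

w = (alpha * y) @ X   # w = sum_i alpha_i y_i x_i
b = y[0] - w @ X[0]   # from the tight constraint y_1 (w^T x_1 + b) = 1
```

For these two points the max-margin hyperplane is x_1 = 0, i.e. w = (1, 0) and b = 0, which the grid search recovers.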
- Slide 45
- Intercept b*: for a support vector the constraint is tight,
y_i (w*^T x_i + b*) = 1. Case y_i = 1: b* = 1 - w*^T x_i. Case
y_i = -1: b* = -1 - w*^T x_i. In both cases b* = y_i - w*^T x_i.
- Slide 46
- Making Predictions: predict sign(w*^T x + b*) =
sign(sum_{i in SV} alpha_i* y_i x_i^T x + b*); only the support
vectors contribute.
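A sketch of the prediction rule using only the support vectors; the support vectors, multipliers, and intercept below are made-up illustrative values:

```python
import numpy as np

# f(x) = sign( sum_{i in SV} alpha_i y_i x_i^T x + b )
X_sv = np.array([[1.0, 0.0], [-1.0, 0.0]])  # support vectors
y_sv = np.array([1.0, -1.0])                # their labels
alpha = np.array([0.5, 0.5])                # their multipliers
b = 0.0                                     # intercept

def predict(x):
    return np.sign((alpha * y_sv) @ (X_sv @ x) + b)

pred = predict(np.array([2.0, 3.0]))
```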
- Slide 47
- Support Vectors: by complementary slackness, alpha_i* > 0 implies
y_i (w*^T x_i + b*) = 1, so such x_i lie exactly on the margin,
and w is determined by these points alone. The support vectors are
the vectors that support the plane.
- Slide 48
- SVM Summary: minimize (1/2) ||w||^2 subject to
y_i (w^T x_i + b) >= 1; solve via the dual;
w = sum_i alpha_i y_i x_i is a combination of the support vectors.