Supervised Learning

SupervisedLearning

RobotImageCredit:Viktoriya Sukhanova©123RF.com

TheseslideswereassembledbyEricEaton,withgratefulacknowledgementofthemanyotherswhomadetheircoursematerialsfreelyavailableonline.Feelfreetoreuseoradapttheseslidesforyourownacademicpurposes,providedthatyouincludeproperattribution.PleasesendcommentsandcorrectionstoEric.

TheBadgesGame

Background:• Pre-registeredattendeesatthe1994MachineLearningConferencereceivedanamebadgelabeledwitha"+"or"-"

• Thelabelisbasedonly uponthename• Thereare294examples(210positiveand84negative)

Whatfunctionwasusedtogeneratethe+/- labeling?

+NaokiAbe - EricBaum

TrainingData

+NaokiAbe- Myriam Abramson+DavidW.Aha+KamalM.Ali- EricAllender+DanaAngluin- Chidanand Apte+MinoruAsada+LarsAsker+Javed Aslam+JoseL.Balcazar- CristinaBaroglio

+PeterBartlett- EricBaum+Welton Becket- Shai Ben-David+GeorgeBerg+NeilBerkman+Malini Bhandaru+Bir Bhanu+Reinhard Blasig- Avrim Blum- AnselmBlumer+JustinBoyan

+CarlaE.Brodley+NaderBshouty- WrayBuntine- Andrey Burago+TomBylander+BillByrne- ClaireCardie+JohnCase+JasonCatlett- PhilipChan- Zhixiang Chen- ChrisDarken

TestData

?Shivani Agarwal?ChrisCallison-Burch?EricEaton?PeterStone?MatthewTaylor

LabeledTestData

- Shivani Agarwal- ChrisCallison-Burch- EricEaton+PeterStone+MatthewTaylor

WhatisLearning?• TheBadgesGameisanexampleofakeylearningprotocol:supervisedlearning

• Firstquestion:Areyousureyougotit?Why?• Issues:–Whichproblemwaseasier: predictionormodeling?– Representation– Problemsetting– BackgroundKnowledge–Whendidlearningtakeplace?

Algorithm:canyouwriteaprogramthattakesthisdataasinputandpredictsthelabelforyourname?

Output

y∈YAnitemy

drawnfromanoutputspaceY

x∈XAnitemx

drawnfromaninputspaceX

Systemy =f(x)

SupervisedLearning

• Weconsidersystemsthatapplyanunknownfunctionf()toinputitemsxandreturnanoutputy =f(x).

Output

y∈YAnitemy

drawnfromanoutputspaceY

x∈XAnitemx

drawnfromaninputspaceX

Systemy =f(x)

SupervisedLearning

• In(supervised)machinelearning,ourgoalistolearnafunctionh()fromexamplesthatapproximatesf()

Output

Anitemydrawnfromalabel

spaceY

AnitemxdrawnfromaninstancespaceX

LearnedModely=h(x)

Supervisedlearning

Targetfunctiony=f(x)

y = h(x)

Supervisedlearning:Training

• GivethelearnerexamplesinD train

• Thelearnerreturnsamodelh(x)11

LabeledTrainingDataD train

(x1,y1)(x2,y2)…

(xN,yN)

Learnedmodelh(x)

LearningAlgorithm

Canyousuggestotherlearningprotocols?

h(x)isthemodelwe’lluseinourapplication

FunctionApproximationProblemSetting• Setofpossibleinstances• Setofpossiblelabels• Unknowntargetfunction• Setoffunctionhypotheses

Input:Trainingexamplesofunknowntargetfunctionf

Output:Hypothesisthatbestapproximatesf

f : X ! YH = {h | h : X ! Y}

BasedonslidebyTomMitchell

{hxi, yii}ni=1 = {hx1, y1i , . . . , hxn, yni}

SampleDataset• ColumnsdenotefeaturesXi

• Rowsdenotelabeledinstances• Classlabeldenoteswhetheratennisgamewasplayed

hxi, yii

Supervisedlearning:Testing

• Reservesomelabeleddatafortesting14

LabeledTestData

D test

(x’1,y’1)(x’2,y’2)

…(x’M,y’M)

Supervisedlearning:Testing

LabeledTestData

D test

(x’1,y’1)(x’2,y’2)

…(x’M,y’M)

TestLabelsY test

y’1y’2...y’M

RawTestDataX test

x’1x’2….x’M

TestLabelsY test

y’1y’2...y’M

RawTestDataX test

x’1x’2….x’M

Supervisedlearning:Testing• Applythemodeltotherawtestdata• Evaluatebycomparingpredictedlabelsagainstthetestlabels

Learnedmodelh(x)

PredictedLabelsh(X test)h(x’1)h(x’2)….

h(x’M)

Canyouuse thetestdataotherwise?

SupervisedLearning:Examples

§ Diseasediagnosis§ x:Propertiesofpatient(symptoms,labtests)§ f:Disease(ormaybe:recommendedtherapy)

§ Part-of-Speechtagging§ x:AnEnglishsentence(e.g.,Thecanwillrust)§ f:Thepartofspeechofawordinthesentence

§ Facerecognition§ x:Bitmappictureofperson’sface§ f:Nametheperson(ormaybe:apropertyof)

§ AutomaticSteering§ x:Bitmappictureofroadsurfaceinfrontofcar§ f:Degreestoturnthesteeringwheel

Manyproblemsthatdonotseemlikeclassificationproblemscanbedecomposedintoclassificationproblems.

KeyIssuesinMachineLearning• Modeling

– Howtoformulateapplicationproblemsasmachinelearningproblems?– Howtorepresentthedata?– LearningProtocols(whereisthedata&labelscomingfrom?)

• Representation– Whatfunctions shouldwelearn(hypothesisspaces)?– Howtomaprawinput toaninstancespace?– Anyrigorouswaytofindthese?Anygeneralapproach?

• Algorithms– Whataregoodalgorithms?– Howdowedefinesuccess?– Generalizationvs.overfitting– Thecomputationalproblem

Usingsupervisedlearning

§ Whatisourinstancespace?§ Whatkindoffeaturesareweusing?

§ Whatisourlabelspace?§ Whatkindoflearningtaskarewedealingwith?

§ Whatisourhypothesisspace?§ Whatkindoffunctions(models)arewelearning?

§ Whatlearningalgorithmdoweuse?§ Howdowelearnthemodelfromthelabeleddata?

§ Whatisourlossfunction/evaluationmetric?§ Howdowemeasuresuccess?Whatdriveslearning?

Output

y∈YAnitemy

drawnfromalabelspaceY

x∈XAnitemx

drawnfromaninstancespaceX

LearnedModelh(x)

1.TheinstancespaceX

• DesigninganappropriateinstancespaceX iscrucialforhowwellwecanpredicty.

1.TheinstancespaceX§ Whenweapplymachinelearningtoatask,wefirst

needtodefinetheinstancespaceX.§ Instancesx∈ X aredefinedbyfeatures:

§ Booleanfeatures:§ Isthereafoldernamedafterthesender?§ Doesthisemailcontainstheword‘class’?§ Doesthisemailcontainstheword‘waiting’?§ Doesthisemailcontainstheword‘class’andtheword‘waiting’?

§ Numericalfeatures:§ Howoftendoes‘learning’occurinthisemail?§ Whatlongisemail?§ HowmanyemailshaveIseenfromthissenderoverthelastday/week/month?

§ Bagoftokens§ Justlistallthetokens intheinput 21

Doesitaddanything?

What’sX fortheBadgesgame?

§ Possiblefeatures:§ Gender§ Name’scountry-of-origin§ Lengthoftheirfirstorlastname§ Doesthenamecontainletter‘x’?§ Howmanyvowelsdoestheirnamecontain?§ Isthen-th letteravowel?§ Doesthenamehavethesamenumberofvowelsandconsonants?

X asavectorspace

§ X isanN-dimensionalvectorspace(e.g.<N)§ Eachdimension=onefeature.

§ Eachx isafeaturevector(hencetheboldfacex).§ Thinkofx =[x1 …xN]asapointinX :

Goodfeaturesareessential§ Thechoiceoffeaturesiscrucial forhowwellataskcanbelearned

§ Inmanyapplicationareas(language,vision,etc.),alotofworkgoesintodesigningsuitablefeatures

§ Thisrequiresdomainexpertise

§ Thinkaboutthebadgesgame– whatifyouwerefocusingonvisualfeatures?

§ Wecan’tteachyouwhatspecificfeaturestouseforyourtask§ Butwewilltouchonsomegeneralprinciples

Output

y∈YAnitemy

x∈XAnitemx

LearnedModelh(x)

2.ThelabelspaceY

• ThelabelspaceY determineswhatkind ofsupervisedlearningtask wearedealingwith

SupervisedlearningtasksI

§ Outputlabelsy∈Y arecategorical:§ Binaryclassification:Twopossiblelabels§ Multi-classclassification:kpossiblelabels

§ Outputlabelsy∈Y arestructuredobjects (sequencesoflabels,parsetrees,etc.)

§ Structurelearning

SupervisedlearningtasksII

§ Outputlabelsy∈Y arenumerical:§ Regression(linear/polynomial):

§ Labelsarecontinuous-valued§ Learnalinear/polynomialfunctionf(x)

§ Ranking:§ Labelsareordinal§ Learnanorderingf(x1)>f(x2)overinput

Output

y∈YAnitemy

x∈XAnitemx

LearnedModelh(x)

3.Themodelh(x)

• Weneedtochoosewhatkind ofmodelwewanttolearn

ALearningProblem

y = f (x1, x2, x3, x4)Unknownfunction

x1x2x3x4

Example x1 x2 x3 x4 y1 0 0 1 0 0

3 0 0 1 1 14 1 0 0 1 15 0 1 1 0 06 1 1 0 0 07 0 1 0 1 0

2 0 1 0 0 0Canyoulearnthis

function?Whatisit?

HypothesisSpaceCompleteIgnorance:Thereare216 =65536possiblefunctionsoverfourinputfeatures.

Wecan’tfigureoutwhichoneiscorrectuntilwe’veseeneverypossibleinput-outputpair.

Afterobservingsevenexampleswestillhave29 possibilitiesfor f

IsLearningPossible?

Example x1 x2 x3 x4 y

16 1 1 1 1 ?

1 0 0 0 0 ?

1 0 0 0 ?

1 0 1 1 ?1 1 0 0 01 1 0 1 ?

1 0 1 0 ?1 0 0 1 1

0 1 0 0 00 1 0 1 00 1 1 0 00 1 1 1 ?

0 0 1 1 10 0 1 0 0

2 0 0 0 1 ?

1 1 1 0 ?

q Thereare|Y||X| possiblefunctionsf(x)fromtheinstancespaceX tothelabelspaceY.

q Learnerstypicallyconsideronlyasubset ofthefunctionsfromX toY,calledthehypothesisspaceH .H⊆|Y||X|

GeneralstrategiesforMachineLearning

§ Developflexiblehypothesisspaces:§ Decisiontrees,neuralnetworks,nestedcollections.§ Constrainingthehypothesisspaceisdonealgorithmically

§ Developrepresentationlanguagesforrestrictedclassesoffunctions:§ Servetolimittheexpressivityofthetargetmodels§ E.g.,Functionalrepresentation(n-of-m);Grammars;linearfunctions;stochasticmodels;

§ Getflexibilitybyaugmentingthefeaturespace§ Ineithercase:

§ Developalgorithmsforfindingahypothesisinourhypothesisspace,thatfitsthedata

§ Andhopethattheywillgeneralizewell

KeyIssuesinMachineLearning• Modeling

– Howtoformulateapplicationproblemsasmachinelearningproblems?– Howtorepresentthedata?– LearningProtocols(whereisthedata&labelscomingfrom?)

• Representation– Whatfunctions shouldwelearn(hypothesisspaces)?– Howtomaprawinput toaninstancespace?– Anyrigorouswaytofindthese?Anygeneralapproach?

• Algorithms– Whataregoodalgorithms?– Howdowedefinesuccess?– Generalizationvs.overfitting– Thecomputationalproblem

Supervised Learning - Penn Engineering · 2019. 1. 22. · Supervised Learning : Examples §...

Documents

Self-supervised Learning

Supervised learning network

Iterative Attention Mining for Weakly Supervised Thoracic Disease ...lelu/publication/MICCAI2018_ChestXRay_IAM.pdf · Iterative Attention Mining for Weakly Supervised Thoracic Disease

Supervised Experiential Learning

Weakly Supervised Deep Learning for Brain Disease

Alzheimer's Disease Early Diagnosis Using …sharif.edu/~hoda/papers/Alzheimer.pdfAlzheimer’s Disease Early Diagnosis Using Manifold-BasedSemi-Supervised Learning Moein Khajehnejad,ForoughHabibollahiSaatlouand

Supervised Learning - wnzhang.netwnzhang.net/teaching/ee448/slides/5-supervised-learning-2.pdf · Content of Supervised Learning •Introduction to Machine Learning •Linear Models

Nonparametric Supervised Learning - cda.psych.uiuc.educda.psych.uiuc.edu/multivariate_fall_2013/matlab_help/...learning.pdf · Supervised Learning (Machine Learning) Workflow and

Federated Semi-Supervised Learning with Inter-Client ...2.1. Preliminaries Semi-Supervised Learning Semi-Supervised Learning (SSL) refers to the problem of learning with partially

CS583 Supervised Learning

Semi-Supervised Learning for Optical Flow with Generative … › paper › 6639-semi-supervised-learning-for-opti… · Semi-Supervised Learning for Optical Flow with Generative

Lecture 1: Supervised Learning - ISyE Home | ISyE ...tzhao80/Lectures/Lecture_1.pdf · Lecture 1: Supervised Learning ... CS229 Lecture notes Andrew Ng Supervised learning LetÕs

Graph-BasedSemi-Supervised Learningsemi-supervised learning, graph-based semi-supervised learning, manifold learn- ing, graph-based learning, transductive learning, inductive learning,

Supervised learning, classification

OA Text - A supervised machine learning algorithm SKVMs ...Belgacem R (2018) A supervised machine learning algorithm SKVMs used for both classification and screening of glaucoma disease

SUPERVISED LEARNING IN R REGRESSION - Amazon S3 · DataCamp Supervised Learning in R: Regression Logistic regression to predict probabilities SUPERVISED LEARNING IN R: REGRESSION

PrediksiCuacadiKotaPalembangBerbasis Supervised Learning

Semi-Supervised Learning

Semi-Supervised Learningzhuxj/tmp/book.pdf1.1.2 Semi-Supervised Learning Semi-supervised learning (SSL) is half way between supervised and unsupervised learning. In addition to unlabeled