
Development in Object Detection

Junyuan Lin May 4th

Line of Research

[1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005.

[2] P. Felzenszwalb, D. McAllester, and D. Ramanan. A discriminatively trained, multiscale, deformable part model, CVPR 2008.

[3] C. Desai, D. Ramanan, and C. Fowlkes. Discriminative models for multi-class object layout, ICCV 2009.

Each paper pairs a representation with a learning method:

- HOG feature template: Linear SVM
- Deformable part model: Latent SVM
- Multi-class object relationship: Structural SVM (structured output Y)

All three share a max-margin formulation.

- Concatenate the HOG features in a search window into a single feature vector.
- Formulate object detection as a binary linear classification problem.

- Sliding-window scanning over the HOG feature pyramid.
- Output the locations with the highest score f(x).
- Possible post-processing (non-maximum suppression).
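The sliding-window scoring described above can be sketched as follows. This is a minimal NumPy sketch under assumptions of my own: the feature-map and filter shapes, the `bias`, and the function names are illustrative, not from the slides.

```python
import numpy as np

def sliding_window_scores(feat_map, filt):
    """Score every window position: dot product of a learned linear
    filter with the window's concatenated HOG-like features."""
    H, W, d = feat_map.shape
    h, w, _ = filt.shape
    scores = np.empty((H - h + 1, W - w + 1))
    for i in range(H - h + 1):
        for j in range(W - w + 1):
            scores[i, j] = np.sum(feat_map[i:i + h, j:j + w] * filt)
    return scores

def detect(feat_map, filt, bias=0.0):
    """Return the location with the highest score f(x) = w.x - b."""
    scores = sliding_window_scores(feat_map, filt) - bias
    i, j = np.unravel_index(np.argmax(scores), scores.shape)
    return (i, j), float(scores[i, j])
```

In the full detector this scan is repeated at every level of the feature pyramid, and the resulting detections are filtered by non-maximum suppression.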

Dalal & Triggs Detector

Linear SVM classifier

Learned weight filter

[Figure: a training sample, the positive weights, and the negative weights of the learned filter.]

Deformable Part Model

The multiscale model captures features at two resolutions; part locations are specified relative to the root location p0.

Slide by P. Felzenszwalb

Score of a hypothesis: filter responses minus deformation costs, where the part placements are latent variables specified relative to their anchor positions.

Slide by P. Felzenszwalb
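The score of a hypothesis referred to above can be written out; this follows the deformable part model papers (notation of the later journal version: H is the HOG feature pyramid, p_i the placement of filter i, F_i the filters, d_i the deformation parameters), not a formula preserved in this deck.

```latex
\mathrm{score}(p_0,\dots,p_n)
  \;=\; \sum_{i=0}^{n} F_i \cdot \phi(H, p_i)
  \;-\; \sum_{i=1}^{n} d_i \cdot \bigl(dx_i,\; dy_i,\; dx_i^2,\; dy_i^2\bigr)
```

Here (dx_i, dy_i) is the displacement of part i relative to its anchor position, and the part placements p_1, ..., p_n are the latent variables.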

Latent SVM Training

Hinge loss over the classifier f_β(x) = max_z β·Φ(x, z); learn β.

Not convex in general!

Semi-convexity: f_β is a maximum of linear functions of β and hence convex in β, so the hinge-loss term is convex for negative examples but not for positive examples.

For positive examples, make the objective convex by fixing the latent variable z for each positive training example.

Latent SVM Training cont’d

Coordinate descent optimization approach: initialize β and iterate:

- Fix β; pick the best latent value z for each positive example.

- Fix the latent values z; optimize β by quadratic programming or gradient descent.
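The two alternating steps above can be sketched on a toy problem. This is an illustrative sketch only: the toy data, step sizes, and the simplification that the latent variable merely selects among candidate feature vectors (rather than filter placements in an image) are my assumptions.

```python
import numpy as np

def latent_svm_train(positives, negatives, dim, C=1.0,
                     outer_iters=5, inner_iters=200, lr=0.01):
    """Coordinate descent for a latent SVM (toy sketch).
    positives: per example, a list of candidate feature vectors,
    one per latent value z. negatives: plain feature vectors."""
    w = np.zeros(dim)
    for _ in range(outer_iters):
        # Step 1: fix w, pick the best latent value z for each positive.
        pos = [max(cands, key=lambda f: float(w @ f)) for cands in positives]
        # Step 2: fix z, minimize the (now convex) regularized hinge
        # loss over w by subgradient descent.
        X = np.array(pos + list(negatives))
        y = np.array([1.0] * len(pos) + [-1.0] * len(negatives))
        for _ in range(inner_iters):
            margin = y * (X @ w)
            viol = margin < 1  # examples violating the margin
            grad = w - C * (y[viol, None] * X[viol]).sum(axis=0)
            w = w - lr * grad
    return w
```

With positives whose best candidate lies near (1, 1) and negatives near (-1, -1), the learned w separates the two clusters.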

Model training procedure:

- Initialization:
  - Root filter: train an initial root filter F0 using a linear SVM without latent variables.
  - Part filters: initialize six parts from the root filter F0 with the same shape, placed at the anchor positions with the most positive energy in F0; the size of each part filter is chosen so the parts cover 80% of the area of F0.
- Train the latent SVM model.

Example Results

Slide by P. Felzenszwalb

Quantitative Results

Multi-class Object Layout

PASCAL dataset: multiple objects of multiple classes in a single image. Problems with the traditional binary classification + sliding window approach:
- Intra-class: requires ad-hoc non-maximum suppression as post-processing.
- Inter-class: each class model is searched independently over the image, with mutual exclusion enforced heuristically.

We need a principled way to model the spatial statistics between objects.
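The ad-hoc intra-class step mentioned above is typically a greedy non-maximum suppression pass. A minimal sketch, where the (x1, y1, x2, y2) box format and the 0.5 overlap threshold are assumptions of this example:

```python
def iou(a, b):
    """Intersection-over-union of two boxes (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    return inter / (area(a) + area(b) - inter)

def nms(boxes, scores, iou_thresh=0.5):
    """Greedy non-maximum suppression: keep the highest-scoring box,
    drop boxes overlapping a kept box, repeat. Returns kept indices."""
    order = sorted(range(len(boxes)), key=lambda k: -scores[k])
    keep = []
    for k in order:
        if all(iou(boxes[k], boxes[j]) <= iou_thresh for j in keep):
            keep.append(k)
    return keep
```

The structured model in [3] replaces this heuristic with learned pairwise terms.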

Formulation

Formulate detection as a structured prediction problem:
- X: a collection of overlapping windows at various scales covering the entire image.
- Y: the entire label vector for the set of all windows in an image.

The score combines a pairwise term, which captures the spatial relationships between predictions through a spatial context descriptor, and a unary term given by the output scores of the local detectors.

The parameter learning and inference can be formulated as a structural SVM framework.

Structural SVM

Treating each structured output as its own class in a multi-class SVM would require exponentially many parameters and exponentially many constraints. Instead, use a joint feature map: represent Y using features extracted from (X, Y), and learn the weights w so that w·Ψ(X, Y) is maximal for the correct Y.

Structural SVM cont'd

n-Slack Formulation (margin rescaling):

1-Slack Formulation:

Slide by T. Joachims
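The two quadratic programs behind these slide titles can be written out; this is the standard SVMstruct formulation, reconstructed rather than copied from the deck.

```latex
% n-slack (margin rescaling): one slack variable per training example
\min_{w,\,\xi \ge 0}\;\tfrac{1}{2}\|w\|^2 + \tfrac{C}{n}\sum_{i=1}^{n}\xi_i
\quad\text{s.t.}\quad
\forall i,\ \forall \bar y \in \mathcal{Y}:\;
w \cdot \bigl[\Psi(x_i, y_i) - \Psi(x_i, \bar y)\bigr] \ge \Delta(y_i, \bar y) - \xi_i

% 1-slack: a single slack variable shared across all examples
\min_{w,\,\xi \ge 0}\;\tfrac{1}{2}\|w\|^2 + C\,\xi
\quad\text{s.t.}\quad
\forall (\bar y_1, \dots, \bar y_n) \in \mathcal{Y}^n:\;
\tfrac{1}{n}\sum_{i=1}^{n} w \cdot \bigl[\Psi(x_i, y_i) - \Psi(x_i, \bar y_i)\bigr]
\ge \tfrac{1}{n}\sum_{i=1}^{n}\Delta(y_i, \bar y_i) - \xi
```

The 1-slack form combines one label choice per example into a single joint constraint, which is what makes the cutting-plane algorithm below efficient.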

Cutting-Plane Algorithm [Jo06] [JoFinYu08]

- Input: S = ((x1, y1), ..., (xn, yn)), C, ε
- Working set W ← ∅
- REPEAT
  - FOR i = 1, ..., n:
    - Compute the most violated constraint: ŷi ← argmax_ŷ [Δ(yi, ŷ) + w·Ψ(xi, ŷ)]
  - ENDFOR
  - IF the resulting constraint is violated by more than ε:
    - Add the constraint to the working set W
    - (w, ξ) ← optimize the StructSVM objective over W
  - ENDIF
- UNTIL W has not changed during an iteration
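The "find the most violated constraint" step is loss-augmented inference: argmax over outputs of loss plus score. A sketch for a toy multiclass case; the block-wise feature map, the 0-1 loss, and the example numbers are assumptions of this illustration.

```python
import numpy as np

def psi(x, y, n_classes):
    """Multiclass joint feature map: x copied into class y's block."""
    out = np.zeros(n_classes * x.size)
    out[y * x.size:(y + 1) * x.size] = x
    return out

def most_violated(w, x, y_true, n_classes):
    """argmax over labels y of Delta(y_true, y) + w . psi(x, y),
    using the 0-1 loss for Delta."""
    scores = [(0.0 if y == y_true else 1.0) + float(w @ psi(x, y, n_classes))
              for y in range(n_classes)]
    return int(np.argmax(scores))
```

The constraint for the returned label is added to the working set when its violation exceeds the current slack by more than ε.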

Theoretical Bound

Theorem: For any ε > 0, the number of constraints added to the working set S is bounded by

    ⌈log2(Δ̄ / (4R²C))⌉ + ⌈16CR² / ε⌉

where Δ̄ = max Δ(yi, ȳ) and R = max ||Ψ(xi, yi) − Ψ(xi, ȳ)||, so that the result (w, ξ + ε) is a feasible solution.

- The number of constraints is independent of the number of training examples.
- This yields a linear-time training algorithm.

Summary of Structural SVM

- Training: cutting-plane algorithm
- Prediction: ŷ = argmax_y w·Ψ(x, y)
- Application-specific design of the model:
  - Loss function Δ(y, ŷ)
  - Representation: joint feature map Ψ(x, y)
  - Algorithm to compute the argmax (loss-augmented inference)

Component: formulation in this paper

- Loss function: 0-1 loss
- Representation: joint feature map
- Inference: greedy forward search algorithm

Slide by D. Ramanan
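The greedy forward search used for inference can be sketched as follows. The unary/pairwise score tables are hypothetical stand-ins for the local detector scores and spatial-context terms; the function name and stopping rule are assumptions of this sketch.

```python
def greedy_forward_search(unary, pairwise):
    """Greedily turn on detection windows while the total layout score
    improves. unary: window id -> local detector score. pairwise:
    (i, j) with i < j -> spatial interaction score between windows."""
    selected = set()
    while True:
        best, best_gain = None, 0.0
        for i in unary:
            if i in selected:
                continue
            # marginal gain of adding window i to the current layout
            gain = unary[i] + sum(
                pairwise.get((min(i, j), max(i, j)), 0.0) for j in selected)
            if gain > best_gain:
                best, best_gain = i, gain
        if best is None:  # no window improves the score: stop
            return selected
        selected.add(best)
```

A strong negative pairwise term between two overlapping windows makes the search keep only the better one, which is how the learned model subsumes non-maximum suppression.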

