15
Project 3 Results

Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Embed Size (px)

Citation preview

Page 1: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Project 3 Results

Page 2: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

base pipeline

Spatial pyramids and additional

features

base pipeline + careful

parameter choices

Page 3: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Example Results

• Tala Huhe• Betsy Hilliard• Hang Su• Yun Zhang• Seth Goldenberg• Paul Sastrasinh• Andy Loomis• Emanuel Zgraggen

Page 4: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Object Category Detection: Sliding Windows (continued)

Computer VisionCS 143, Brown

James Hays

10/3111

Many Slides from Kristen Grauman

Page 5: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Previously

• Category recognition (proj3)– Bag of words over not-so-invariant local

features.

• Instance recognition– Local invariant features: interest point

detection and feature description– Local feature matching, spatial verification– Scalable indexing

Page 6: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Today (continued from Wed.)

• Window-based generic object detection– basic pipeline– boosting classifiers– face detection as case study

Page 7: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

What other categories are amenable to window-based representation?

Page 8: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gV

isu

al O

bje

ct

Recog

nit

ion

Tu

tori

al

Vis

ual O

bje

ct

Recog

nit

ion

Tu

tori

al

Pedestrian detection• Detecting upright, walking humans also possible

using sliding window’s appearance/texture; e.g.,

SVM with Haar wavelets [Papageorgiou & Poggio, IJCV 2000]

Space-time rectangle features [Viola, Jones & Snow, ICCV 2003]

SVM with HoGs [Dalal & Triggs, CVPR 2005]

Kristen Grauman

Page 9: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gV

isu

al O

bje

ct

Recog

nit

ion

Tu

tori

al

Vis

ual O

bje

ct

Recog

nit

ion

Tu

tori

al

Window-based detection: strengths

• Sliding window detection and global appearance descriptors:

Simple detection protocol to implement Good feature choices critical Past successes for certain classes

Kristen Grauman

Page 10: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gV

isu

al O

bje

ct

Recog

nit

ion

Tu

tori

al

Vis

ual O

bje

ct

Recog

nit

ion

Tu

tori

al

Window-based detection: Limitations

• High computational complexity For example: 250,000 locations x 30 orientations x 4

scales = 30,000,000 evaluations! If training binary detectors independently, means

cost increases linearly with number of classes

• With so many windows, false positive rate better be low

Kristen Grauman

Page 11: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gV

isu

al O

bje

ct

Recog

nit

ion

Tu

tori

al

Vis

ual O

bje

ct

Recog

nit

ion

Tu

tori

al

Limitations (continued)

• Not all objects are “box” shaped

Kristen Grauman

Page 12: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gV

isu

al O

bje

ct

Recog

nit

ion

Tu

tori

al

Vis

ual O

bje

ct

Recog

nit

ion

Tu

tori

al

Limitations (continued)

• Non-rigid, deformable objects not captured well with representations assuming a fixed 2d structure; or must assume fixed viewpoint

• Objects with less-regular textures not captured well with holistic appearance-based descriptions

Kristen Grauman

Page 13: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gV

isu

al O

bje

ct

Recog

nit

ion

Tu

tori

al

Vis

ual O

bje

ct

Recog

nit

ion

Tu

tori

al

Limitations (continued)

• If considering windows in isolation, context is lost

Figure credit: Derek Hoiem

Sliding window Detector’s view

Kristen Grauman

Page 14: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gV

isu

al O

bje

ct

Recog

nit

ion

Tu

tori

al

Vis

ual O

bje

ct

Recog

nit

ion

Tu

tori

al

Limitations (continued)

• In practice, often entails large, cropped training set (expensive)

• Requiring good match to a global appearance description can lead to sensitivity to partial occlusions

Image credit: Adam, Rivlin, & Shimshoni Kristen Grauman

Page 15: Project 3 Results. base pipeline Spatial pyramids and additional features base pipeline + careful parameter choices

Summary

• Basic pipeline for window-based detection

– Model/representation/classifier choice

– Sliding window and classifier scoring

• Viola-Jones face detector

– Exemplar of basic paradigm

– Plus key ideas: rectangular features, Adaboost for feature selection, cascade, hard negatives.

• Pros and cons of window-based detection