44
7/31/2019 Beckman Background Knowledge http://slidepdf.com/reader/full/beckman-background-knowledge 1/44  Using Background Knowledge to Improve Visual Learning Derek Hoiem Beckman Director’s Seminar March 11, 2009 Work with: Ali Farhadi, Ian Endres, Gang Wang, Santosh Divvala, James Hays David Forsyth, Alexei Efros, Martial Hebert

Beckman Background Knowledge

  • Upload
    zukun

  • View
    218

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 1/44

 

Using Background Knowledge

to Improve Visual Learning

Derek Hoiem

Beckman Director’s Seminar 

March 11, 2009

Work with: Ali Farhadi, Ian Endres, Gang Wang, Santosh Divvala, James Hays

David Forsyth, Alexei Efros, Martial Hebert

Page 2: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 2/44

What I’d like to make possible with computer vision 

Household Robot Intelligent Vehicle

Security Photo Organization

Page 3: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 3/44

What we can do (with the right dataset)

• Recognize faces

• Categorize scenes

• Detect, segment, and track objects

• 3D from multiple images or stereo

• Classify actions

Page 4: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 4/44

But we’re a long way from “Rosie”

Computer vision has been divided into many

task- and dataset-specific problems

 – Difficult to coordinate pieces – Poor generalization to unfamiliar environments

 – Massive engineering and data collection effort

required for every task/dataset

Page 5: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 5/44

Goal

Use background knowledge: generalize known

solutions to new problems or dataset

Page 6: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 6/44

The Challenge

How can we use what we know to make learning

new things easier and more robust?

Page 7: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 7/44

This Talk

• Three uses of background knowledge

 – Contextual knowledge

 – Compositional knowledge

 – Organizational knowledge

Page 8: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 8/44

I. Contextual Knowledge

Goal: Use knowledge of objects and spatial

layout to better detect a new object.

Work with Santosh Divvala, James Hays, Alexei Efros, Martial Hebert

Page 9: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 9/44

Object Detection without Context

Search over many positions and scales

Page 10: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 10/44

Object Detection without Context

Cat?

Cat?

Cat?

In each window: is this a cat?

Page 11: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 11/44

Training a Detector

Color

Texture

Edges

xx

x x

x

x

x

xx

oo

oo

o

Classifier FeaturesExamples

Page 12: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 12/44

Object Detection without Context

In each window: is this a cat?

,

Page 13: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 13/44

Object Detection without Context

• Top five cat detections in a challenging dataset

Detector: Felzenszwalb et al. CVPR 2008 Dataset: PASCAL VOC 2008

Page 14: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 14/44

What do we know that can help us?

Page 15: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 15/44

What do we know that can help us?

Knowledge of Other Objects and Scenes

Similar ImagesLarge Set of 

Loosely

AnnotatedImages

 Associated Keywords

BabyPuppy

Sand

House KittenHelps tell us howlikely the object is to

appear in this image.

Page 16: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 16/44

.

.

Occlusion Boundaries Depth EstimatesSurface Layout

Helps tell us where and how

big the object is likely to be.

Knowledge of Spatial Layout

What do we know that can help us?

Hoiem et al. 2005,2007

Page 17: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 17/44

Context: Likelihood of Presence

1. Object presence

Contains Cat No Cat

Page 18: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 18/44

Context: Likelihood of Presence

Image

Gist

Surface Layout

 Associated Keywords

BabyPuppy

Sand

House Kitten

gist: Torralba Oliva 2003

Likely to contain a cat?

Page 19: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 19/44

Page 20: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 20/44

Context: Likelihood of Size

• Predict height of object based on depth, surface orientations,gist, and image position

Predicted Height

Candidate

Bounding Box

Size from Gist: Torralba Oliva 2003

Page 21: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 21/44

Rescoring Candidate Objects

Presence Scores

Position Scores

Size Scores

Appearance Score(from detector)

Independently Trained

Classifiers

Bounding BoxScore

Linear Weights L1-Regularized Logistic

Regression

Page 22: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 22/44

Context improves detection

Top 5: Before Context

Top 5: After Context

Page 23: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 23/44

Context improves detection accuracy

 Average Precision (Higher is Better)

Page 24: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 24/44

Context changes the error patterns

•More confusion

 – Cats and Dogs

 – Dogs and Sheep

 – Motorbike and Bicycle

• Less confusion

 – Objects and background

Page 25: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 25/44

II. Compositional Knowledge

Goal: Describe new objects using attributes

learned from other objects.

Work with Ali Farhadi, Ian Endres, David Forsyth

Page 26: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 26/44

A name doesn’t tell us much 

Name: Cat

Name: Dog

Name: Horse

Known Objects New Object

Name: Unknown

Page 27: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 27/44

But what if we learn attributes?

Name: Cat

Properties: four legs, tail,

eyes, ears, furry, has

stripes, gray

Name: Dog

Name: Horse

Known Objects New Object

Properties: four legs,

eyes, ears, snout, tan,

muscular 

Properties: four legs, tail,

mane, eyes, ears, snout,

tan

Name: Unknown

Page 28: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 28/44

We can infer what object is like

Name: Cat

Properties: four legs, tail,

eyes, ears, furry, hasstripes, gray

Name: Dog

Name: Horse

Known Objects

Name: Unknown

Properties: four legs, eyes,

snout, tan, muscular 

Properties: four legs, tail,

mane, eyes, ears, snout,

tan

Properties: four legs,eyes, ears, snout,

stripes, mane

New Object

Page 29: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 29/44

Learning Attributes

• Learn to distinguish between things that have

an attribute and things that don’t 

• Train one classifier per attribute

Page 30: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 30/44

Learning Correlated Attributes

Problem• Many attributes are strongly correlated through the

object category

Most cars are “made of metal” and have “wheels” 

When we try to learn “has

wheels”, we may accidentally

learn “made of metal” 

Has Wheels, Made of Metal?

Page 31: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 31/44

Decorrelating Attributes

Solution• Select features that can distinguish between two

classes

 – Things that have wheels

 – Things that do not, but have other attributes in common

“Has Wheels”  “No Wheels” 

Vs.

Page 32: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 32/44

Learning to Describe Objects

Page 33: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 33/44

Describing New Objects

Page 34: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 34/44

Identifying Unusual Attributes

Absence of Typical Attributes

Presence of Atypical Attributes

752 reports

68% are correct

951 reports

47% are correct

Page 35: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 35/44

Recognition from Description

• Learn new classes by describing them to the

algorithm

• Goat = “Is furry, four legged, has snout, has horn” 

• 12-Class Classification Accuracy = 32.5%

• Chance = 8%

• As good as having 8 visual examples with original imagefeatures

Page 36: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 36/44

III. Organizational Knowledge

Goal: Help a person organize his photos usingimage similarity learned from Flickr groups.

Work with Gang Wang, David Forsyth

Page 37: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 37/44

Taming the Digital Explosion

Photos are easy to take and store.

But it’s still difficult to organize them. 

Page 38: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 38/44

Solution: Learn from photo sharing sites

• Billions of images in Flickr

• Hundreds of thousands of categories

Page 39: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 39/44

Learn similarity

• Downloads hundreds of groups, each

containing thousands of photos

• Train classifier to predict whether a photo islikely to belong in each group

 – Gang Wang created super-fast online training

method for kernelized SVMs

• Images are similar if they are likely to belong

to the same group

Page 40: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 40/44

We can find similar images

Query Image

Retrieved Images Using

Feature Similarity

Retrieved Images Using

Similarity Learned from Flickr 

Page 41: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 41/44

We can say how two images are similar

Fireworks (15.6)Christmas (7.6)

Rain (4.0)

Water drops (2.5)

Candles (2.0)

Sports (2.6)

Dances (2.0)

Weddings (1.0)

Toys (0.5)

Horses (0.5)

Painting (2.4)

 Art (1.2)

Macro-flowers (0.9)

Hands (0.9)

Skateboarding (0.6)

Page 42: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 42/44

Conclusions

• Background knowledge is a key missing component in today’s

computer vision algorithms

• Existing knowledge can make learning easier

 – Provides new abilities (say two things are similar or different)

 – More complete visual models (better accuracy, more reasonable

mistakes)

 – Better able to handle new objects and situations

• We need to start designing systems that accumulate

visual knowledge

Page 43: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 43/44

Thank you

Page 44: Beckman Background Knowledge

7/31/2019 Beckman Background Knowledge

http://slidepdf.com/reader/full/beckman-background-knowledge 44/44