38
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna University of Technology, Austria Carsten Rother Microsoft Research Cambridge, UK Pushmeet Kohli Microsoft Research Cambridge, UK Daniel Scharstein Middlebury College, USA Sudipta Sinha Microsoft Research Redmond, USA 1

Object Stereo- Joint Stereo Matching and Object Segmentation

  • Upload
    kevork

  • View
    71

  • Download
    0

Embed Size (px)

DESCRIPTION

Object Stereo- Joint Stereo Matching and Object Segmentation. Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna University of Technology, Austria Carsten Rother Microsoft Research Cambridge, UK - PowerPoint PPT Presentation

Citation preview

Page 1: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Object Stereo- Joint Stereo Matching and Object Segmentation

Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on

Michael Bleyer Vienna University of Technology, AustriaCarsten Rother Microsoft Research Cambridge, UKPushmeet Kohli Microsoft Research Cambridge, UKDaniel Scharstein Middlebury College, USASudipta Sinha Microsoft Research Redmond, USA 1

Page 2: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Outline• Introduction• Proposed Model• Energy Minimization• Result• Conclusion

2

Page 3: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Introduction• A 3D scene is represented as a collection of

visually distinct and spatially coherent objects.

• Each object is characterized by three different aspects: • color model• 3D plane• 3D connectivity

3

Page 4: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Introduction• The proposed method employs object-level color

models as a soft constraint to aid depth estimation.

• The proposed method can recover the depth of regions that are fully occluded in one input view.

4

Page 5: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Introduction• The proposed method models a 3D scene as a

collection of 3D objects, assume that1. each object is compact.2. each object is connected.3. all visible parts of an object share a similar

appearance.4. scene interpretations with a few large objects.

5

Page 6: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Introduction• Compactness• objects are coherent.• depth variations within an object are smooth.• objects have a bias towards being planar in 3D.

6

Page 7: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Introduction• 3D Connectivity• disconnected 2D regions and separated by smaller

depth.

7

Page 8: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Introduction• Similar Appearance• use color as the only appearance cue.• each object in a scene has a compact distribution

of colors.

• Scene Interpretation• with few objects.• prevent single pixels from being explained as

individual objects.8

Page 9: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Introduction• Color models introduce a color segmentation into

the stereo matching process.• assign untextured regions to the same object.• extend disparities into untextured regions.• capture disparity discontinuities more precisely.

• Assign disparities to small disconnected background regions in complex occlusions.

9

Page 10: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Outline• Introduction• Proposed Model• Energy Minimization• Result• Conclusion

10

Page 11: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Proposed Model• Scene Representation, assume that• disparity map is a collection of 3D planes (depth

planes).• estimate object’s depth by a 3D plane (object

plane).• compute a parallax value obtained by subtracting

p’s disparity at each pixel p within an object op.

11

Page 12: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Parallax Model• Enforce parallax values have a compact

distribution within object op.

• The parallax model provides the probability of the occurrence of a specific parallax in object op.

• The proposed model avoid parallaxes that have low probabilities.

12

Page 13: Object  Stereo-  Joint Stereo Matching and Object Segmentation

• An object o ∈ O contains the following parameters: 1. a color model2. a parallax model3. an object plane

• F : I → F that assigns each pixel to a depth plane.• .

• O : I → O that assigns each pixel to an object.

Energy Function

13

Page 14: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Energy Function• Energy function evaluates the quality of F and O.

• Minimize the energy to obtain a “good” approximation to the Maximum a Posteriori (MAP) solution of the model.

• .

14

Page 15: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Photo Consistency Term Epc

• .

• Measures the pixel dissimilarity of corresponding points and accounts for occlusion handling.

• Ensures that corresponding pixels are assigned to the same depth plane and object.

15

Page 16: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Photo Consistency Term Epc

• .

16

Page 17: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Object-Coherency Term Eoc

• .

• Encourages neighboring pixels in the image to take the same object label.

• . [19]

17

[19] C. Rother, V. Kolmogorov, and A. Blake. Grabcut: Interactive foreground extraction using iterated graph cuts. ACM Trans. Graph., 23:309–314, 2004.

Page 18: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Depth Plane-Coherency Term Edc

• .

• Depth plane assignments within an object shall be spatially coherent.

• .

18

Page 19: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Object-Color Term Ecol

• .

• Each object contains a color model implemented as a Gaussian Mixture Model (GMM).

• The GMM gives the probability that a pixel lies inside the object according to its color value.

19

Page 20: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Object-Color Term Ecol

• . [19]

20

[19] C. Rother, V. Kolmogorov, and A. Blake. Grabcut: Interactive foreground extraction using iterated graph cuts. ACM Trans. Graph., 23:309–314, 2004.

Page 21: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Object-Parallax Term Epar

• .

• The disparity at pixel p according to op’s object plane by .

• The parallax is then computed as .

21

Page 22: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Object-Parallax Term Epar

• Distribution of the parallax within same object is likely to be compact.

• .

22

Page 23: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Object-MDL Term Emdl

• .

• The term puts a penalty on the occurrence of an object [4].

• .

23

[4] M. Bleyer, C. Rother, and P. Kohli. Surface stereo with soft segmentation. In CVPR, 2010.

Page 24: Object  Stereo-  Joint Stereo Matching and Object Segmentation

3D Connectivity Econ

• .

• An object is considered connected • a path connects all pixels with the same object label.

• The path are either 1. pixels belong to the same object.2. pixels belong to different objects.

24

Page 25: Object  Stereo-  Joint Stereo Matching and Object Segmentation

3D Connectivity Econ

• .

• .

25

Page 26: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Outline• Introduction• Proposed Model• Energy Minimization• Proposal Generator

• Result• Conclusion

26

Page 27: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Energy Minimization• Proposed model is formulated as an energy

function that is optimized via fusion moves [16].

• In the fusion move, a new solution generated by “selecting” • depth planes and objects from S• others from S’

27

[16] V. Lempitsky, C. Rother, and A. Blake. Logcut - efficient graph cut optimization for Markov Random Fields. In ICCV, 2007.

Page 28: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Energy Minimization• Start with an initial solution S that consists of a

disparity map F and an object map O.

• Obtain a proposal S’ from a proposal generator.

• S and S’ are fused to produce a new solution S*.• S := S*

28

Page 29: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Proposal Generator S’

• Initial Proposals : • initialize the disparity map.• color segmentation by mean-shift.• derive F, O.• estimate parameters.• derive a large variety of initial proposals

(approximately 30 ).

29

Page 30: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Proposal Generator S’

• Refit Proposals :• compute a new color model, object plane, parallax

model.

• 〈 F’, O’〉 is derived by refitting the object parameters of the current solution〈 F, O〉 .

30

Page 31: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Proposal Generator S’

• Expansion Proposals : • select one depth plane f present in F and one object

o present in O.

•〈 F’, O’〉 is derived by setting all pixels of F’ to f and all pixels of O’ to o.

31

Page 32: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Optimal Fusion• Use quadratic pseudo-boolean optimization

function (QPBO-F) [11] to the fusion move problem.

• Reduces the problem with multi-valued variables to a sequence of minimization sub-problems with binary variables.

32

[11] V. Kolmogorov and C. Rother. Minimizing non-submodular functions with graph cuts - a review. PAMI, 29(7):1274–1279, 2007.

Page 33: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Outline• Introduction• Proposed Model• Energy Minimization• Result• Conclusion

33

Page 34: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Result

34

Page 35: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Result

35

Page 36: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Result

36

Page 37: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Outline• Introduction• Proposed Model• Energy Minimization• Result• Conclusion

37

Page 38: Object  Stereo-  Joint Stereo Matching and Object Segmentation

Conclusion• The object level enables our algorithm to utilize

color segmentation as a soft constraint and to handle difficult occlusion cases.

• A 3D connectivity constraint that enforces consistency of object assignments with stereo geometry.

• Currently, our algorithm is slow, i.e., it takes approximately 20 minutes to obtain results on images.

38