9
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050 051 052 053 054 055 056 057 058 059 060 061 062 063 064 065 066 067 068 069 070 071 072 073 074 075 076 077 078 079 080 081 082 083 084 085 086 087 088 089 090 091 092 093 094 095 096 097 098 099 100 101 102 103 104 105 106 107 CVPR #1625 CVPR #1625 CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE. Predicting Complete 3D Models of Indoor Scenes supplementary material Anonymous CVPR submission Paper ID 1625 1. Sample Results For more illustrations of the performance of our method, below we show 15 randomly selected examples of 3D models estimated automatically from the input RGBD images. Those examples are all from the 654 testing images in NYU v2 dataset. In each figure: (1) Input represents the original input image; (2) Raw Depth means the original raw depth provide by kinect; (3) Smoothed Depth represents inpainted depth map based on raw depth; (4) Object Probability shows the object probability map automatically computed based on the method in [1]. Lighter part in the figure represents a higher probability of being an object. This is used in the scene composition step to help encouraging rendered object and layout pixels to match the object probability; (5) 3D Annotation is the manual annotated 3D model for the given input image; (6) Rendered Segmentation illustrates the 2D segmen- tation result based on the rendered object computed by our method; (7) Automatic 3D Model provides results of automati- cally estimated 3D model based on our method. Two views are shown. 2. Video For a more intuitive view of our proposed method and correlated results, please refer to the attached movie in the uploaded supplementary material package. References [1] N. Silberman, D. Hoiem, P. Kohli, and R. Fergus. In- door segmentation and support inference from rgbd images. In ECCV, 2012. 1 1

Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

000001002003004005006007008009010011012013014015016017018019020021022023024025026027028029030031032033034035036037038039040041042043044045046047048049050051052053

054055056057058059060061062063064065066067068069070071072073074075076077078079080081082083084085086087088089090091092093094095096097098099100101102103104105106107

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Predicting Complete 3D Models of Indoor Scenessupplementary material

Anonymous CVPR submission

Paper ID 1625

1. Sample Results

For more illustrations of the performance of our method,below we show 15 randomly selected examples of 3Dmodels estimated automatically from the input RGBDimages. Those examples are all from the 654 testingimages in NYU v2 dataset. In each figure:

(1) Input represents the original input image;(2) Raw Depth means the original raw depth provide by

kinect;(3) Smoothed Depth represents inpainted depth map

based on raw depth;(4) Object Probability shows the object probability

map automatically computed based on the method in [1].Lighter part in the figure represents a higher probability ofbeing an object. This is used in the scene composition stepto help encouraging rendered object and layout pixels tomatch the object probability;

(5) 3D Annotation is the manual annotated 3D modelfor the given input image;

(6) Rendered Segmentation illustrates the 2D segmen-tation result based on the rendered object computed by ourmethod;

(7) Automatic 3D Model provides results of automati-cally estimated 3D model based on our method. Two viewsare shown.

2. VideoFor a more intuitive view of our proposed method and

correlated results, please refer to the attached movie in theuploaded supplementary material package.

References[1] N. Silberman, D. Hoiem, P. Kohli, and R. Fergus. In-

door segmentation and support inference from rgbdimages. In ECCV, 2012. 1

1

Page 2: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161

162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Automatic 3D Model (two views) Rendered Segmentation

Figure 1: Occupancy Precision: 0.2669, Occupancy Recall: 0.5412, Freespace Precision: 0.9866, Freespace Recall: 0.9486,Depth Error Overall: 0.0227. The table is broken into two pieces, but otherwise this is a fairly accurate representation of thescene.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 2: Occupancy Precision: 0.5221, Occupancy Recall: 0.2418, Freespace Precision: 0.8853, Freespace Recall: 0.9381,Depth Error Overall: 0.2569. Large pieces of furniture are well-modeled, though some small objects on them are missed.

2

Page 3: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269

270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 3: Occupancy Precision: 0.8943, Occupancy Recall: 0.6758, Freespace Precision: 0.9528, Freespace Recall: 0.9851,Depth Error Overall: 0.1444. Decent model of a cluttered scene.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 4: Occupancy Precision: 0.8206, Occupancy Recall: 0.7717, Freespace Precision: 0.9860, Freespace Recall: 0.9852,Depth Error Overall: 0.0193. Solid furniture is well-modeled, but glass tables are missed due to missing depth information(and transparency).

3

Page 4: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377

378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 5: Occupancy Precision: 0.7337, Occupancy Recall: 0.7628, Freespace Precision: 0.9790, Freespace Recall: 0.9713,Depth Error Overall: 0.0160. Some mistakes are due to missing depth information. We use the smoothed depth image (fromSilberman et al.), but depth is sometimes incorrectly completed.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Automatic 3D Model (two views) Rendered Segmentation

Figure 6: Occupancy Precision: 0.4954, Occupancy Recall: 0.2583, Freespace Precision: 0.9632, Freespace Recall: 0.9727,Depth Error Overall: 0.0.0558. Use of a prior that sets of tables and chairs are likely to have the same models and to bearranged in an orderly way could substantially improve results in some cases.

4

Page 5: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485

486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Automatic 3D Model (two views) Rendered Segmentation

Figure 7: Occupancy Precision: 0.4662, Occupancy Recall: 0.1150, Freespace Precision: 0.9393, Freespace Recall: 0.9688,Depth Error Overall: 0.1015. Incorporation of support relationships would help ground objects and possibly improve seg-mentation and modeling.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Automatic 3D Model (two views) Rendered Segmentation

Figure 8: Occupancy Precision: 0.6472, Occupancy Recall: 0.6330, Freespace Precision: 0.9645, Freespace Recall: 0.8361,Depth Error Overall: 0.2578.

5

Page 6: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593

594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 9: Occupancy Precision: 0.9180, Occupancy Recall: 0.4806, Freespace Precision: 0.7749, Freespace Recall: 0.9758,Depth Error Overall: 0.0158.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 10: Occupancy Precision: 0.2985, Occupancy Recall: 0.1327, Freespace Precision: 0.9065, Freespace Recall: 0.9217,Depth Error Overall: 0.0853.

6

Page 7: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678679680681682683684685686687688689690691692693694695696697698699700701

702703704705706707708709710711712713714715716717718719720721722723724725726727728729730731732733734735736737738739740741742743744745746747748749750751752753754755

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 11: Occupancy Precision: 0.9714, Occupancy Recall: 0.6227, Freespace Precision: 0.9489, Freespace Recall: 0.9667,Depth Error Overall: 0.1394.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 12: Occupancy Precision: 0.7183, Occupancy Recall: 0.0989, Freespace Precision: 0.9035, Freespace Recall: 0.9696,Depth Error Overall: 0.1513.

7

Page 8: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

756757758759760761762763764765766767768769770771772773774775776777778779780781782783784785786787788789790791792793794795796797798799800801802803804805806807808809

810811812813814815816817818819820821822823824825826827828829830831832833834835836837838839840841842843844845846847848849850851852853854855856857858859860861862863

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 13: Occupancy Precision: 0.7150, Occupancy Recall: 0.2085, Freespace Precision: 0.9776, Freespace Recall: 0.8347,Depth Error Overall: 0.2530.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 14: Occupancy Precision: 0.6706, Occupancy Recall: 0.4353, Freespace Precision: 0.9979, Freespace Recall: 0.9973,Depth Error Overall: 0.0448.

8

Page 9: Predicting Complete 3D Models of Indoor Scenes ...dhoiem.cs.illinois.edu/publications/guo_3dscene_arxiv2015_supp.pdf · 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

864865866867868869870871872873874875876877878879880881882883884885886887888889890891892893894895896897898899900901902903904905906907908909910911912913914915916917

918919920921922923924925926927928929930931932933934935936937938939940941942943944945946947948949950951952953954955956957958959960961962963964965966967968969970971

CVPR#1625

CVPR#1625

CVPR 2015 Submission #1625. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE.

Input Raw Depth Smoothed Depth Object Probability

3D Annotation Rendered Segmentation Automatic 3D Model (two views)

Figure 15: Occupancy Precision: 0.4330, Occupancy Recall: 0.1800, Freespace Precision: 0.9099, Freespace Recall: 0.8911,Depth Error Overall: 0.4816.

9