1
Set of training patches 3D offset from patch centre central pixel raw depth map 3 Euler angles with : aligned reference patches for modality m : discrete set of pairs made up of 2D offsets and modalities Latent-Class Hough Forests Main Idea & Goals Latent-Class Hough Forests for 3D Object Detection and Pose Estimation Alykhan Tejani, Danhang Tang, Rigas Kouskouridas, and Tae-Kyun Kim Imperial College London Motivation Holistic Template Matching (Hinterstoisser et al. ACCV 2012) (+) Invariant to background clutter (-) Suffer from partial occlusions (-) Not-tested yet for multi-instance objects Hough Forests (Gall et al. PAMI 2010) (+) State-of-the-art patch-based detector (-) Explicitly exploit classification labels during training Integrate Holistic Template Matching into Random Forests framework Hough Forests as a patch-based detector (+) LINEMOD as a 3D descriptor for the patches Goals: (1) Make LINEMOD scale-invariant (Depth check) (2) Guarantee efficient data split at node levels (Novel split function) (3) Class distributions are treated as latent variables (One-class training) (1) + (2) + (3) = Latent-Class Hough Forests ( , ) i i x y { ( , , , )} i i i i i P cTD ( , , , , , ) x y z ya pi ro ({ } , ) m i mM i O m i O i ( ( )) (, ) ( , ) max ( , ,) ( ( ( )), ( )) ( , ,) ( ( ( )) ( ) ( ( )) ( ) ) ( ,) ( ) i j j m m i j i j m i i i j t R c r rm i j i i i i i j j j j j PP PPr f c r t PPr D c r Dc D c r D c z r c r c D c Blue patch: true positive match Red patch: false positive match A random patch T (with red frame) is compared with all other patches to produce the correct splits Inference Split Function Training l fg p foreground probability at leaf node l () E random event of object’s existence under 6 DoF For a single tree ( ( )| ; ) ( ( ), 1| ) ( ( ), 1| ) ( 1| ) l l l fg fg fg pE PT pE p P pE p P pp P ( ( ), 1| ) l fg pE p P pass patch to forest & accumulate votes at the leaf ( 1| ) l fg pp P train with positive patches 1 l fg p all leaf nodes repeat Randomly partition the forest two partitions (classifiers) First partition: obtain consensus patch set Further reduced to consensus pixel set Π (object diameter) All pixels in Π are labelled as foreground and the rest background (two labelled datasets) Second partition: accumulate patches and update leaf probability distribution Image segmentation mask until Maximum iteration Key Contributions Latent-Class Hough Forests – Novel patch-based approach to 3D object detection and pose estimation Joint 3D pose estimation and pixel wise visibility map New dataset Multi-instance objects, near and far range 2D and 3D clutter, foreground occlusions (available at http://www.iis.ee.ic.ac.uk/icvl/) Experimental Results F1-Scores for the 13 objects in the dataset of Hinterstoisser et al. Matching score ˆ ˆ ( )-( ) m avg R T R T 2 1 1 2 ˆ ˆ min ( )-( ) m avg R T R T Non-symmetric objects Symmetric objects 3D object model M ˆ T R T ground truth rotation ground truth translation ˆ R estimated rotation estimated translation correct estimation: m m kd m k chosen coefficient d diameter of object Average Precision-Recall curves over all objects in the dataset of LINEMOD (left) and our dataset (right) Augmented 3D axis Vote map Segmentation mask This project was supported by the Omron Corporation

Alykhan Tejani, Danhang Tang, Rigas Kouskouridas, and Tae-Kyun …rkouskou.gitlab.io/publications/docs/ECCV_2014_poster.pdf · 2018-02-09 · Alykhan Tejani, Danhang Tang, Rigas Kouskouridas,

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Alykhan Tejani, Danhang Tang, Rigas Kouskouridas, and Tae-Kyun …rkouskou.gitlab.io/publications/docs/ECCV_2014_poster.pdf · 2018-02-09 · Alykhan Tejani, Danhang Tang, Rigas Kouskouridas,

Set of training patches

3D offset from patch centre

central pixel raw depth map 3 Euler angles

with : aligned reference patches for modality m

: discrete set of pairs made up of 2D offsets and modalities

Latent-Class Hough Forests

Main Idea & Goals

Latent-Class Hough Forests for 3D Object Detection and Pose Estimation

Alykhan Tejani, Danhang Tang, Rigas Kouskouridas, and Tae-Kyun Kim

Imperial College London

Motivation Holistic Template Matching (Hinterstoisser et al. ACCV 2012) (+) Invariant to background clutter (-) Suffer from partial occlusions (-) Not-tested yet for multi-instance objects Hough Forests (Gall et al. PAMI 2010) (+) State-of-the-art patch-based detector (-) Explicitly exploit classification labels during training

Integrate Holistic Template Matching into Random Forests framework Hough Forests as a patch-based detector (+) LINEMOD as a 3D descriptor for the patches Goals: (1) Make LINEMOD scale-invariant (Depth check) (2) Guarantee efficient data split at node levels (Novel split function) (3) Class distributions are treated as latent variables (One-class training) (1) + (2) + (3) = Latent-Class Hough Forests

( , )i ix y

{ ( , , , )}i i i i iP c T D ( , , , , , )x y z ya pi ro

({ } , )m

i m M iO m

iO

i

( ( ))( , )

( , ) max ( , , ) ( ( ( )), ( ))

( , , ) ( ( ( )) ( ) ( ( )) ( ) )

( , )( )

i

j j

m m

i j i j m i i i jt R c r

r m

i j i i i i i j j j j j

P P P P r f c r t

P P r D c r D c D c r D c z

rc r c

D c

Blue patch: true positive match Red patch: false positive match

A random patch T (with red frame) is compared with all other patches to produce the correct splits

Inference

Split Function

Training

l

fgp foreground probability at leaf node l

( )E random event of object’s existence under 6 DoF

For a single tree

( ( ) | ; ) ( ( ), 1| ) ( ( ), 1| ) ( 1| )l l l

fg fg fgp E P T p E p P p E p P p p P

( ( ), 1| )l

fgp E p P pass patch to forest & accumulate votes at the leaf

( 1| )l

fgp p P train with positive patches 1l

fgp all leaf nodes

repeat Randomly partition the forest two partitions (classifiers) First partition: obtain consensus patch set

Further reduced to consensus pixel set Π (object diameter) All pixels in Π are labelled as foreground and the rest background

(two labelled datasets)

Second partition: accumulate patches and update leaf probability distribution

Image segmentation mask

until Maximum iteration

Key Contributions Latent-Class Hough Forests – Novel patch-based approach to 3D object detection

and pose estimation Joint 3D pose estimation and pixel wise visibility map New dataset – Multi-instance objects, near and far range 2D and 3D clutter,

foreground occlusions (available at http://www.iis.ee.ic.ac.uk/icvl/)

Experimental Results

F1-Scores for the 13 objects in the dataset of Hinterstoisser et al.

Matching score ˆ ˆ( ) - ( )m avg R T R T

21

1 2ˆ ˆmin ( ) - ( )m avg R T R T

Non-symmetric objects

Symmetric objects

3D object model M

R

T

ground truth rotation ground truth translation

R̂ estimated rotation estimated translation

correct estimation: mm k dmk chosen coefficient

d diameter of object

Average Precision-Recall curves over all objects in the dataset of LINEMOD (left) and our dataset (right)

Augmented 3D axis

Vote map

Segmentation mask

This project was supported by the Omron Corporation