105
Deep Learning for Image Instance Segmentation ----Mask R-CNN Jianping Fan Dept of Computer Science UNC-Charlotte Course Website: http://webpages.uncc.edu/jfan/itcs5152.html

Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

  • Upload
    others

  • View
    9

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Deep Learning for Image Instance Segmentation----Mask R-CNN

Jianping Fan Dept of Computer Science

UNC-Charlotte

Course Website: http://webpages.uncc.edu/jfan/itcs5152.html

Page 2: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Definition of Image Instance Segmentation

Instance segmentation = object detection + semantic segmentation?

Page 3: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 4: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 5: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Background Review

Page 6: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

R-CNN

• R-CNN [4]: The Region-based CNN (R-CNN)• Replace sliding windows with “selective search” region proposals(Uijilings et

al. IJCV 2013)

• Extract rectangles around regions and resize to 227x227 pixels

• Extract features with fine-tuned CNN (that was initialized with network trained on ImageNet before training)

• Classify last layer of network features with SVM, refine bounding box localization (bbox regression) simultaneously

6Slide credit to Ke-Shuan Cheng

Page 7: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

R-CNN

7

• R-CNN: The Region-based CNN (R-CNN)

Slide credit to Ke-Shuan Cheng

Region warping is performed for fixed size features

Page 8: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 9: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

R-CNN Architecture

Page 10: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Regional Proposal Network (RPN)

● Foreground vs Background ● Bounding Box regression● Feed bounding boxes into Fast RCNN

Page 11: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 12: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 13: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 14: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

(0,0)

(0,0)(1,3)

(16,48)

Mapping the center of the receptive fields

Page 15: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

● k anchor boxes ○ 3 scales (8,16, 32)○ 3 aspect ratios (.5, 1, 2)○ Stride 16

● WHk anchors

Anchor Boxes

W

H Center(x,y)

Feature Map

No

thin

g to

See here.

Page 16: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Anchor Boxes

Page 17: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Anchor Boxes

Page 18: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

RPN(Region Proposal Network) Object vs Not an Object

Anchor

Object = 1 to:a) Anchors with the highest

Intersection-over-Union(IoU)b) IoU > 0.7 with any ground truth

box.Not object = -1

a) If IoU <0.3

Page 19: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

RPN

512-d

(x,y)

(Sx,Sy)

Mapping the center of the receptive fields

Page 20: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

RPN

SigmoidCrossEntropyLoss SmoothL1Loss

(512 × (2 + 4) × 9) parameters for VGG-16)

512

Page 21: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 22: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Multi-task loss:

RPN

Only if p*= 1Hyper parameter =10

Mini batch size =256

Number of Anchor locations

Page 23: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 24: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 25: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Fast R-CNN Architecture

Page 26: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Fast R-CNN Architecture

Page 27: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Fast R-CNN

• Fast R-CNN [5]• Improvement: It only feed the whole image into CNN only once! Then crop

features instead of image itself.

27Slide credit to Ke-Shuan Cheng

Page 28: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Fast R-CNN

• RoI Pooling• The RoI pooling layer uses max pooling to convert the features inside any valid

region of interest into a small feature map with a fixed spatial extent of H × W. (e.g., 2*2 )

28

0.85

0.97

0.84

0.96

Slide credit to Ke-Shuan Cheng

Page 29: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

RoI Pooling in Fast R-CNN

RoI Pooling

Slide credit to Silvio Galesso

Page 30: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

For each proposalNMS RoI Pooling

Fully connected layers

softmax

Bbox regression

Fast R-CNN

label

bbox

Page 31: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Region of Interest (RoI):

Fast R-CNN

Page 32: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Region of Interest (RoI):

Fast R-CNN

.74 | .39 | .34

.2 | .16 | .73

.83 | .97 | .88

3X3 RoI pooling

Page 33: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Region of Interest (RoI):

Fast R-CNN

.74 | .39 | .34

.2 | .16 | .73

.83 | .97 | .88

7X7 RoI poolingPer proposal

Only a problem for segmentation

Page 34: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 35: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Faster R-CNN

• Faster R-CNN [6]• Improvement: Generate RoI by Region Proposal Network.

35Slide credit to Ke-Shuan Cheng

Page 36: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

● Two main parts○ Region Proposal

Network○ Fast R-CNN○ (also this) Pre-

trained network

Page 37: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Comparison

• Compare with 3 model

37Slide credit to Ke-Shuan Cheng

Page 38: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 39: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask-RCNN

Page 40: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN• Mask RCNN is a simple, flexible, and general framework for

object instance segmentation.

• Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks on each Region of Interest (RoI), in parallel with the existing branch for classification and bounding box regression.

• Mask R-CNN is simple to trained and adds only a small overhead to Faster R-CNN.

• The mask branch is a small Fully Convolutional Network (FCN) applied to each RoI, predicting a segmentation mask in a pixel-to-pixel manner.

40Slide credit to Ke-Shuan Cheng

Page 41: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 42: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 43: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN Architecture

Page 44: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 45: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 46: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 47: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN

• Mask R-CNN [3] is conceptually simple: Faster R-CNN has two outputs for each candidate object, a class label and a bounding-box offset; to this R-CNN added a third branch that outputs the object mask.

47

Faster R-CNN

3rd Branch Object Mask

Slide credit to Ke-Shuan Cheng

Page 48: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 49: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 50: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to Ross Girshick

Page 51: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 52: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 53: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 54: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 55: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 56: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 57: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 58: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 59: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 60: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 61: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 62: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 63: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 64: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 65: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 66: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 67: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 68: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to Silvio Galesso

RoI Align in Mask R-CNN

Page 69: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to Silvio Galesso

RoI Align in Mask R-CNN

RoI Align

Page 70: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to Silvio Galesso

RoI Align in Mask R-CNN

RoI Align

Page 71: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 72: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 73: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 74: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 75: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 76: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 77: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 78: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 79: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 80: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 81: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to T. Kim

Page 82: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 83: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 84: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 85: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 86: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 87: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to Ross Girshick

Page 88: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to Ross Girshick

Page 89: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Slide credit to Ross Girshick

Page 90: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN

• RoI Align is Improves miss-align problems of RoI pooling• RoI Align use bilinear interpolation to generate new feature map.

• Do RoI Pooling with aligned feature map

90

RoI Pooling

Bilinear interpolation

Slide credit to Ke-Shuan Cheng

Page 91: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 92: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 93: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 94: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

FPN (Feature Pyramid Network)

Page 95: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 96: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 97: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Kaiming He, ICCV 2017 Tutorial

Page 98: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN

• They proposed 2 architecture and compared they for Object

Mask branch

• ResNet

• Branch from last Convolutional layer

• Feature Pyramid Network(FPN)

• Branch from RoI

98Slide credit to Ke-Shuan Cheng

Page 99: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN

99Slide credit to Ke-Shuan Cheng

Page 100: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN

• Loss function is defined as below:𝐿 = 𝐿𝑐𝑙𝑠 + 𝐿𝑏𝑜𝑥 + 𝐿𝑚𝑎𝑠𝑘

• 𝐿𝑐𝑙𝑠: Cross-Entropy

• 𝐿𝑏𝑜𝑥: IoU𝐴𝑟𝑒𝑎 𝑜𝑓 𝑂𝑣𝑒𝑟𝐿𝑎𝑝

𝐴𝑟𝑒𝑎 𝑜𝑓 𝑈𝑛𝑖𝑜𝑛• 𝐿𝑚𝑎𝑠𝑘: Cross-Entropy between pixel-to-pixel

100Slide credit to Ke-Shuan Cheng

Page 101: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 102: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

Mask R-CNN• Result

102Slide credit to Ke-Shuan Cheng

Page 103: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 104: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks
Page 105: Deep Learning for Image Instance Segmentation ----Mask R-CNN · object instance segmentation. •Mask R-CNN, extends Faster R-CNN by adding a branch for predicting segmentation masks

References https://arxiv.org/pdf/1506.01497.pdf (Faster R-CNN)https://arxiv.org/pdf/1504.08083.pdf (Fast R-CNN)https://arxiv.org/pdf/1506.06981.pdf (R-CNN minus R)https://koen.me/research/pub/uijlings-ijcv2013-draft.pdf (Selective Search for Object Detection)https://arxiv.org/pdf/1703.06870.pdf (Mask R-CNN)http://host.robots.ox.ac.uk/pascal/VOC/https://www.dropbox.com/s/xtr4yd4i5e0vw8g/iccv15_tutorial_training_rbg.pdf?dl=0http://kaiminghe.com/iccv15tutorial/iccv2015_tutorial_convolutional_feature_maps_kaiminghe.pdfhttps://lovesnowbest.site/2018/02/27/Intro-to-Object-Detection/https://blog.deepsense.ai/region-of-interest-pooling-explained/https://tryolabs.com/blog/2018/01/18/faster-r-cnn-down-the-rabbit-hole-of-modern-object-detection/