Recurrent Scene Parsing with Perspective Understanding In the Loop


Shu Kong, CS, ICS, UCI


Outline

1. Background

2. Attention to Perspective: Depth-aware Gating

3. Recurrent Refining

4. Attentional Mechanism

5. Conclusion and Future Work

1. Background

Semantic Segmentation with Deep Convolutional Neural Networks

Keywords: skip connection, multi-scale, upsampling


DeepLab [1] is a strong baseline (based on the ResNet architecture), yet simple and straightforward.

It sums up feature maps at different scales using atrous convolution, i.e. convolution with various dilation rates.

[1] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

1. à trous (French) -- with holes (English)

2. Atrous convolution (inserting zeros between kernel taps, i.e. skipping input positions)
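A minimal PyTorch sketch of one atrous convolution (shapes and channel counts are illustrative, not DeepLab's exact configuration):

```python
import torch
import torch.nn as nn

# A 3x3 atrous (dilated) convolution: dilation=2 inserts one zero ("hole")
# between kernel taps, enlarging the receptive field to 5x5 without adding
# parameters or reducing resolution. padding=dilation keeps the spatial size.
x = torch.randn(1, 256, 64, 64)
conv = nn.Conv2d(256, 256, kernel_size=3, dilation=2, padding=2)
y = conv(x)
print(y.shape)  # torch.Size([1, 256, 64, 64])
```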

DeepLab fuses responses from multiple atrous kernels of different rates.
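A hedged sketch of this multi-rate fusion (the class name, rates, and channel counts here are illustrative choices, not the paper's exact head):

```python
import torch
import torch.nn as nn

class MultiRateFusion(nn.Module):
    """Parallel atrous convolutions at several dilation rates, summed."""
    def __init__(self, channels, num_classes, rates=(1, 2, 4, 8, 16)):
        super().__init__()
        # padding=rate keeps every branch at the input resolution
        self.branches = nn.ModuleList(
            nn.Conv2d(channels, num_classes, 3, padding=r, dilation=r)
            for r in rates
        )

    def forward(self, x):
        # element-wise sum of the per-rate score maps
        return sum(branch(x) for branch in self.branches)

scores = MultiRateFusion(256, 19)(torch.randn(1, 256, 64, 64))
print(scores.shape)  # torch.Size([1, 19, 64, 64])
```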

That's all about the baseline.

Large Perspective Images

The fusion of multi-scale feature maps exhibits some degree of scale invariance, but it is not obvious that this invariance covers the large range of scale variation present in perspective images.

(Figure: large range of scale variation in perspective images; labeled examples include a car, a pole, a white/black board, and chairs at very different apparent sizes.)

2. Attention to Perspective: Depth-aware Gating

Disparity, or depth, conveys scale information.

Depth-aware pooling module

Select the right scale with depth.

Quantize the disparity into five scales with dilation rates {1, 2, 4, 8, 16}.
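A minimal sketch of the gating idea, assuming PyTorch; the bin thresholds below are hypothetical placeholders, not the paper's calibrated ones:

```python
import torch

def depth_gated_pool(feat, disparity, branches):
    """Hard depth-aware gating over multi-rate branches.

    feat:      (N, C, H, W) shared feature map
    disparity: (N, 1, H, W) disparity map, larger = closer = larger scale
    branches:  five convs with dilation rates {1, 2, 4, 8, 16}
    """
    # hypothetical thresholds quantizing disparity into five bins
    edges = torch.tensor([0.1, 0.2, 0.4, 0.8])
    bins = torch.bucketize(disparity, edges)        # values in {0, ..., 4}
    out = 0.0
    for k, branch in enumerate(branches):
        mask = (bins == k).float()                  # (N, 1, H, W) hard gate
        out = out + mask * branch(feat)             # broadcast over channels
    return out
```

A soft variant would replace the hard mask with per-pixel probabilities over the bins.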

Alternatively, learn a depth estimator alongside the segmentation network and test without a depth input; this requires reliable monocular depth estimation.
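A hedged sketch of that alternative (the head's design here is hypothetical): supervise a small depth branch during training, then let its prediction, rather than an input depth map, drive the gating at test time.

```python
import torch.nn as nn

class DepthHead(nn.Module):
    """Predicts disparity from the shared features (hypothetical design)."""
    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Conv2d(channels, 1, kernel_size=1)

    def forward(self, feat):
        return self.conv(feat)

# training: total = seg_loss + lam * nn.L1Loss()(depth_head(feat), gt_disp)
# testing:  gating bins come from depth_head(feat); no depth input needed
```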

More configurations to compare --

1. sharing the parameters in this pooling module (MultiPool)

2. averaging the features vs. depth-aware gating (contrasted in the sketch after this list)

3. MultiPool vs. MultiScale (input)
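To make comparison 2 concrete, here is an illustrative contrast (function names are mine): uniform averaging versus a per-pixel depth-gated combination of the same branch outputs.

```python
import torch

def average_pool(branch_outs):
    # branch_outs: list of K (N, C, H, W) score maps, one per dilation rate
    return torch.stack(branch_outs).mean(dim=0)

def depth_gate_pool(branch_outs, gate):
    # gate: (N, K, H, W) per-pixel weights over the K scales, e.g. a
    # one-hot encoding of the quantized disparity bin (hard gating)
    stacked = torch.stack(branch_outs, dim=1)        # (N, K, C, H, W)
    return (gate.unsqueeze(2) * stacked).sum(dim=1)  # (N, C, H, W)
```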

Qualitative Results -- street images

Qualitative Results -- panorama images

3. Recurrent Refining

Recurrent Refinement Module

Recurrently refine the results by adapting the predicted depth:

1. unrolling the recurrent module during training

2. adding a loss to each unrolled loop (see the sketch after this list)

3. embedding the depth-aware gating module in the loops
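A minimal training-loop sketch of these three points (assumed PyTorch; `backbone` and `refine_step` are hypothetical stand-ins for the paper's modules):

```python
import torch.nn as nn

def unrolled_loss(backbone, refine_step, images, labels, num_loops=3):
    """Unroll the recurrent refinement and attach a loss to every loop."""
    criterion = nn.CrossEntropyLoss()
    feat = backbone(images)              # shared convolutional features
    logits, total = None, 0.0
    for _ in range(num_loops):
        # refine_step re-applies depth-aware gating and refines the
        # prediction given the features and the previous-loop logits
        feat, logits = refine_step(feat, logits)
        total = total + criterion(logits, labels)   # per-loop supervision
    return total
```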


Qualitative Results -- NYU-depth-v2 indoor dataset

Qualitative Results -- Cityscapes

yellow --> closer --> larger pooling size

Qualitative Results -- Stanford-2D-3D (panoramas)

blue --> closer --> larger pooling size

4. Attentional Mechanism

Some slides from this point on are removed due to research conflicts; they will be disclosed in the future.

Attention to Scale Again
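The deck's own mechanism is withheld above; for context only, the generic attention-to-scale idea of Chen et al. (CVPR 2016) can be sketched as learned per-pixel softmax weights over the scale branches. This is a sketch of that general idea, not the removed slides' method:

```python
import torch
import torch.nn as nn

class ScaleAttention(nn.Module):
    """Soft per-pixel weighting over K scale branches."""
    def __init__(self, channels, num_scales):
        super().__init__()
        self.head = nn.Conv2d(channels, num_scales, kernel_size=1)

    def forward(self, feat, branch_outs):
        # feat: (N, C, H, W); branch_outs: list of K (N, C', H, W) maps
        attn = torch.softmax(self.head(feat), dim=1)       # (N, K, H, W)
        stacked = torch.stack(branch_outs, dim=1)          # (N, K, C', H, W)
        return (attn.unsqueeze(2) * stacked).sum(dim=1)    # (N, C', H, W)
```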

5. Conclusion and Future Work

1. The attentional module is powerful.

2. Such an attentional module should also be useful in various pixel-level tasks, e.g. pixel embedding for instance grouping, depth estimation, surface normal estimation, etc.

Thanks
