22

Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew Shepherd Mnih, V., Heess, N., Graves, A., & kavukcuoglu, K. (2014). Recurrent Models of Visual Attention. Advances in Neural Information …, 2204–2212.

Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Download PDF Report

Upload
others
View
5
Download
0

Embed Size (px)

Citation preview

Page 1: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Visual Attention With Neural Networks

Main Paper: Recurrent Models of Visual Attention

Presentation by Matthew Shepherd

Mnih, V., Heess, N., Graves, A., & kavukcuoglu, K. (2014). Recurrent Models of Visual Attention. Advances in Neural Information …, 2204–2212.

Page 2: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Full image processing is computationally expensive

Regions are can be selected intelligently but time still scales with the size of the image

Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. … And Pattern Recognition …, 580–587. http://doi.org/10.1109/CVPR.2014.81

Page 3: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Humans focus on specific regions in their FOV

https://www.youtube.com/watch?v=vJG698U2Mvo

Page 4: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

“Vision as a sequential decision task”

The model sequentially chooses small windows of information on the data

Integrates information from all past windows to make its next decision.

Page 5: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

The sensor only provides limited information about the scene, Xt, focused at

a location, lt-1

Referred to as a “Glimpse”

retina-like representation

Page 6: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

The “glimpse network” incorporates sensor and location information into a

single vector

Page 7: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

The core network fh(𝛳h) incorporates the sensor information as well as past information

Page 8: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

The output of the core network is then used to deploy the sensor and make a

classification

Page 9: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

The network is recurrent

Page 10: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Partially Observed Markov Decision Problem (POMDP)

- Glimpse can be seen as a partial view of the state - Network must learn a policy

π((lt, at)|s1:t; θ)

- The policy is determined by the NN

- State history is encapsulated by the hidden state of the network

Page 11: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

So how do we train it?

The REINFORCE rule

Page 12: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Additional training details

Useful for determining fl but fa can be determined more directly by minimizing

cross-entropy loss.

A baseline value, bt, is added to the gradient approximation to reduce variance

Page 13: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Experiments

Page 14: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

RAM performs well on translated MNIST

Page 15: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

RAM performs very well on cluttered translated images

Page 16: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Meaningful policies are learned

Page 17: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

RAM performs in a dynamic environment

Page 18: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Other models of attention

Graves, A. (2014). Generating Sequences with Recurrent Neural Networks, 1–43.

Page 19: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

DRAW: A Recurrent Neural Network For Image Generation

• Combines an attention mechanism with a sequential variational auto-encoder

• Reading and writing are now both sequential tasks

Gregor, K., Danihelka, I., Graves, A., Rezende, D. J., & Wierstra, D. (2015). DRAW: A Recurrent Neural Network For Image Generation , 1–10.

Page 20: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Differentiable RAM

Page 21: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

Differentiable RAM performance

Page 22: Visual Attention With Neural Networksfidler/teaching/2015/slides/... · Visual Attention With Neural Networks Main Paper: Recurrent Models of Visual Attention Presentation by Matthew

DRAW-ing with attention

https://www.youtube. com/watch?v=Zt-7MI9eKEo

Learning Multi-Attention Convolutional Neural Network for ...openaccess.thecvf.com/content_ICCV_2017/papers/Zheng_Learning_Multi... · Learning Multi-Attention Convolutional Neural

Learning Multi-Attention Convolutional Neural Network for ...openaccess.thecvf.com/content_ICCV_2017/papers/Zheng_Learning_Multi... · Learning Multi-Attention Convolutional Neural

Documents

Visual Question Answering and Visual Reasoning · Pixel-BERT [1] Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, ICML 2015 [2] Stacked Attention Networks

Visual Question Answering and Visual Reasoning · Pixel-BERT [1] Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, ICML 2015 [2] Stacked Attention Networks

Documents

Understanding Biological Visual Attention Using ...€¦ · Understanding Biological Visual Attention Using Convolutional Neural Networks Grace W. Lindsay a,b, Kenneth D. Miller a

Understanding Biological Visual Attention Using ...€¦ · Understanding Biological Visual Attention Using Convolutional Neural Networks Grace W. Lindsay a,b, Kenneth D. Miller a

Documents

Show, Attend and Tell: Neural Image …Show, Attend and Tell: Neural Image Caption Generation with Visual Attention Kelvin Xu KELVIN.XU@UMONTREAL.CA Jimmy Lei Ba JIMMY@PSI.UTORONTO.CA

Show, Attend and Tell: Neural Image …Show, Attend and Tell: Neural Image Caption Generation with Visual Attention Kelvin Xu [email protected] Jimmy Lei Ba [email protected]

Documents

Attention II Selective Attention & Visual Search

Attention II Selective Attention & Visual Search

Documents

visual attention reading - MITweb.mit.edu/zoya/www/visual_attention_reading.pdf · Visual attention with deep neural nets Paper discussions: “SALICON: Reducing the Semantic Gap

visual attention reading - MITweb.mit.edu/zoya/www/visual_attention_reading.pdf · Visual attention with deep neural nets Paper discussions: “SALICON: Reducing the Semantic Gap

Documents

Top-down Neural Attention

Top-down Neural Attention

Documents

Neural Synchrony in Attention and Consciousness

Neural Synchrony in Attention and Consciousness

Documents

CHAPTER 3. VISUAL ATTENTION AND VISUAL AWARENESS

CHAPTER 3. VISUAL ATTENTION AND VISUAL AWARENESS

Documents

Show, Attend and Tell: Neural Image CaptionGeneration with … · 2016-04-20 · Neural Image Caption Generation with Visual Attention with images,Donahue et al.(2014) also apply

Show, Attend and Tell: Neural Image CaptionGeneration with … · 2016-04-20 · Neural Image Caption Generation with Visual Attention with images,Donahue et al.(2014) also apply

Documents

A simple circuit model of visual cortex explains neural and ......Selective visual attention modulates neural activity in the visual system and leads to enhanced performance on di

A simple circuit model of visual cortex explains neural and ......Selective visual attention modulates neural activity in the visual system and leads to enhanced performance on di

Documents

Visual Attention › ~se4d03 › lectures.new › 4d3lec30.pdf · SE4D03 – Computer User Interfaces • The Science of Visualization – continued – Visual Attention • Attention

Visual Attention › ~se4d03 › lectures.new › 4d3lec30.pdf · SE4D03 – Computer User Interfaces • The Science of Visualization – continued – Visual Attention • Attention

Documents

Show, Attend and Tell: Neural Image Caption Generation with …zemel/documents/captionAttnIcml2015.pdf · 2015-07-08 · Neural Image Caption Generation with Visual Attention tive

Show, Attend and Tell: Neural Image Caption Generation with …zemel/documents/captionAttnIcml2015.pdf · 2015-07-08 · Neural Image Caption Generation with Visual Attention tive

Documents

Structured Attention Guided Convolutional Neural …openaccess.thecvf.com/content_cvpr_2018/papers/Xu...Structured Attention Guided Convolutional Neural Fields for Monocular Depth

Structured Attention Guided Convolutional Neural …openaccess.thecvf.com/content_cvpr_2018/papers/Xu...Structured Attention Guided Convolutional Neural Fields for Monocular Depth

Documents

Deep Attention Neural Tensor Network for Visual Question ...€¦ · Deep Attention Neural Tensor Network for Visual QuestionAnswering Yalong Bai1,2, Jianlong Fu3, Tiejun Zhao1, and

Deep Attention Neural Tensor Network for Visual Question ...€¦ · Deep Attention Neural Tensor Network for Visual QuestionAnswering Yalong Bai1,2, Jianlong Fu3, Tiejun Zhao1, and

Documents

The Neural Basis for Visual Selective Attention in Young ...research.clps.brown.edu/.../08/Schlesinger_Amso_Johnson_2007_22.pdf · 135 The Neural Basis for Visual Selective Attention

The Neural Basis for Visual Selective Attention in Young ...research.clps.brown.edu/.../08/Schlesinger_Amso_Johnson_2007_22.pdf · 135 The Neural Basis for Visual Selective Attention

Documents

Parental neural responsivity to infants’ visual attention

Parental neural responsivity to infants’ visual attention

Documents

Show, Attend, and Tell Neural Image Caption Generation ... · Show, Attend, and Tell Neural Image Caption Generation with Visual Attention Kelvin Xu, Jimmy Lei Ba, Ryan Kiros, Kyunghyun

Show, Attend, and Tell Neural Image Caption Generation ... · Show, Attend, and Tell Neural Image Caption Generation with Visual Attention Kelvin Xu, Jimmy Lei Ba, Ryan Kiros, Kyunghyun

Documents

Selective Visual Attention Visual Search What is …psiexp.ss.uci.edu/research/teachingP140C/Lectures/2_2.pdfSelective Visual Attention & Visual Search ... larger distance) • compatible

Selective Visual Attention Visual Search What is …psiexp.ss.uci.edu/research/teachingP140C/Lectures/2_2.pdfSelective Visual Attention & Visual Search ... larger distance) • compatible

Documents

Recursive Visual Attention in Visual Dialogopenaccess.thecvf.com/content_CVPR_2019/papers/Niu... · Recursive Visual Attention in Visual Dialog ... communication, such as vision-and-language

Recursive Visual Attention in Visual Dialogopenaccess.thecvf.com/content_CVPR_2019/papers/Niu... · Recursive Visual Attention in Visual Dialog ... communication, such as vision-and-language

Documents

Electrophysiology of Visual Attention. Does Visual Attention Modulate Visual Evoked Potentials? The theory is that Visual Attention modulates visual information

Electrophysiology of Visual Attention. Does Visual Attention Modulate Visual Evoked Potentials? The theory is that Visual Attention modulates visual information

Documents

Deep Attention Neural Tensor Network for Visual Question Answering · 2018-10-15 · Deep Attention Neural Tensor Network for Visual Question Answering Yalong Bai 1; 2, Jianlong Fu3,

Deep Attention Neural Tensor Network for Visual Question Answering · 2018-10-15 · Deep Attention Neural Tensor Network for Visual Question Answering Yalong Bai 1; 2, Jianlong Fu3,

Documents

Deep Attention Neural Tensor Network for Visual … › openaccess › content_ECCV_2018 › papers › ...Deep Attention Neural Tensor Network for Visual QuestionAnswering Yalong

Deep Attention Neural Tensor Network for Visual … › openaccess › content_ECCV_2018 › papers › ...Deep Attention Neural Tensor Network for Visual QuestionAnswering Yalong

Documents

Oddball distractors demand attention: neural and

Oddball distractors demand attention: neural and

Documents

Common neural substrates for visual working memory and ...dept.wofford.edu/neuroscience/NeuroSeminar/pdfsFall2007/MayerN… · The conceptual link between visual WM and attention

Common neural substrates for visual working memory and ...dept.wofford.edu/neuroscience/NeuroSeminar/pdfsFall2007/MayerN… · The conceptual link between visual WM and attention

Documents

Dissociating the Neural Mechanisms of Visual Attention …cognitrn.psych.indiana.edu/busey/forJP/locID2004/fMRIChange... · occurswhenlow-levelmotiontransientsareavailable, suchaswhentheintervalbetweenstimuliislessthan

Dissociating the Neural Mechanisms of Visual Attention …cognitrn.psych.indiana.edu/busey/forJP/locID2004/fMRIChange... · occurswhenlow-levelmotiontransientsareavailable, suchaswhentheintervalbetweenstimuliislessthan

Documents

Deep Attention Neural Tensor Network for Visual Question ...openaccess.thecvf.com/.../Yalong_Bai_Deep_Attention...Deep Attention Neural Tensor Network for Visual QuestionAnswering

Deep Attention Neural Tensor Network for Visual Question ...openaccess.thecvf.com/.../Yalong_Bai_Deep_Attention...Deep Attention Neural Tensor Network for Visual QuestionAnswering

Documents

Attention II Theories of Attention Visual Search

Attention II Theories of Attention Visual Search

Documents

The Neural Basis for Visual Selective Attention in Young Infants: A … · 2016. 9. 23. · Schlesinger et al. Visual Selective Attention 137 or competing stimuli—is not only a

The Neural Basis for Visual Selective Attention in Young Infants: A … · 2016. 9. 23. · Schlesinger et al. Visual Selective Attention 137 or competing stimuli—is not only a

Documents

Topic: Attention & Visual Search Selective Attention: Visual sampling Visual Target Search Structured Search VS Experiment Divided and Focused Attention

Topic: Attention & Visual Search Selective Attention: Visual sampling Visual Target Search Structured Search VS Experiment Divided and Focused Attention

Documents