Tutorial Overview - MIT Saliency...

New directions in saliency research: Developments in architectures, datasets, and evaluation

Tutorial OverviewZoya Bylinskii & Tilke Judd, ECCV Oct. 8, 2016

Ali BorjiCenter for Research in

Computer Vision,University of Central Florida

Zoya BylinskiiComputer Science and

Artificial Intelligence Lab,MIT

Tilke JuddProduct Manager,

GooglePreviously: MIT

MIT Saliency Benchmark Team

Tutorial organized by:

Judd et al. “A Benchmark of Computational Models of Saliency to Predict Human Fixations” [MIT Tech Report 2012]

• 10 saliency models

• 3 baslines

• 3 metrics (AUC, SIM, EMD)

• 1 dataset

• 66 saliency models

• 5 baslines

• 8 metrics

• 2 datasets

2011 2016

large boosts in performance

in recent yearsdriven by neural

network architectures

tracking progress with the MIT Saliency Benchmark

• 66 models evaluated on MIT benchmark (as of Sept 30, 2016)

• top 8/10 are neural networks

• neural networks make up 25% of all submissions (17 models since 2014)

What are the common architectures used for neural network models of saliency?

How are these models trained?

PDP - Jetley et al. [CVPR 2016]

DeepGaze - Kümmerer et al. [ICLR 2015 workshop]

ML-Net - Cornia et al. [ICPR 2016]

SALICON - Huang et al. [ICCV 2015]

DeepFix - Kruthiventi et al. [ArXiv 2015]

eDN - Vig et al. [CVPR 2014]

loss functions & evaluation

datasets

architectures

applications

semanticsegmentation

objectdetection

objectproposals

image clustering & retrieval

cognitivesaliency

Tutorial overview“Deep networks for

saliency map prediction” Naila Murray

• 66 models evaluated on MIT benchmark

• using 8 evaluation metrics

• on 2 datasets (MIT300, CAT2000)

0 0.2 0.4 0.6 0.8 1159

1317212529333741454953576165

AUC-Judd

0 0.2 0.4 0.6 0.8159

1317212529333741454953576165

0 0.5 1 1.5 2 2.5159

1317212529333741454953576165

0 0.2 0.4 0.6 0.8 11591317212529333741454953576165

0 0.2 0.4 0.6 0.8159

1317212529333741454953576165

gradual improvementover time

but saturating

tracking progress with the MIT Saliency Benchmarkm

0 0.2 0.4 0.6 0.8 1159

1317212529333741454953576165

AUC-Judd

0 0.2 0.4 0.6 0.8159

1317212529333741454953576165

0 0.5 1 1.5 2 2.5159

1317212529333741454953576165

0 0.2 0.4 0.6 0.8 11591317212529333741454953576165

0 0.2 0.4 0.6 0.8159

1317212529333741454953576165

0 0.2 0.4 0.6 0.8 1159

1317212529333741454953576165

AUC-Judd

0 0.2 0.4 0.6 0.8159

1317212529333741454953576165

0 0.5 1 1.5 2 2.5159

1317212529333741454953576165

0 0.2 0.4 0.6 0.8 11591317212529333741454953576165

0 0.2 0.4 0.6 0.8159

1317212529333741454953576165

large boosts in performance

in recent yearsdriven by neural

network architectures

As model performances continue to approach ground truth, how can we have

more meaningful evaluation?

datasets

architectures

applications

objectdetection

objectproposals

cognitivesaliency

Tutorial overview“Evaluating saliency models in a

probabilistic framework”Matthias Kümmerer

New data collection methodologies

How can we collect enough training data for neural network models of saliency?

Krafka et al. [CVPR 2016]

Papoutsaki et al. [IJCAI 2016]

Xu et al. [ArXiv 2015]

Webcam-based

Krafka et al. [CVPR 2016]

Papoutsaki et al. [IJCAI 2016]

Xu et al. [ArXiv 2015]

Kim et al. [CHI EA 2015]

Jiang et al. [CVPR 2015]

iSUN SALICON

Click-basedWebcam-based

What kind of new applications are possiblewith state-of-the-art saliency architectures

and big data?

datasets

architectures

applications

objectdetection

objectproposals

cognitivesaliency

Tutorial overview“Saliency for image

understanding and manipulation”Ming-Ming Cheng

Is saliency solved?

Is saliency solved?NO!

“Towards cognitive saliency”Zoya Bylinskii

Tutorial overview

Gaze Text Face Action Main Focus

“Towards cognitive saliency”Zoya Bylinskii

Tutorial overview

Tutorial Overview - MIT Saliency...

Documents

Analysis of Scores, Datasets, and Models in Visual …ilab.usc.edu/borji/papers/SaliencyModeling2013ICCV.pdfAnalysis of scores, datasets, and models in visual saliency prediction Ali

David Holmes, John Williams Civil and Environmental Engineering Massachusetts Institute of Technology Peter Tilke Mathematics and Modeling Department Schlumberger-Doll

What Makes a Patch Distinct?ayellet/Ps/13-MTZ.pdfWe test our method on the recently-published benchmark of Borji et al. [4]. This benchmark consists of ﬁve well-known datasets of

Paying Attention to Descriptions Generated by Image ......Paying Attention to Descriptions Generated by Image Captioning Models Hamed R. Tavakoli† Rakshith Shetty⋆ Ali Borji‡

Towards Attentive Robotsilab.usc.edu/borji/cvpr2013/AttentionApplicationsInRobotics.pdf · KTH, Stockholm: a KUKA arm is controlled by bottom-up and top-down attention for detecting,

Download Tilke Portfolio

Human vs. computer in scene and object recognitionilab.usc.edu/borji/papers/borjiIttiCVPR2014.pdfcomputer vision has long been inspired by human vision, we believe systematic efforts,

Robert Scrimger kevin wolford Anthony tilke MCSEindex-of.es/Tutorials/files/005/MCSE/MCSE Training Guide TCP-IP.pdfTable of Contents iii P1/Vet MCSE Tr Gde: TCP/IP 747-2 Lori 12.01.97

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE …ilab.usc.edu/publications/doc/Borji_Itti13pami.pdf · 2013-10-09 · State-of-the-Art in Visual Attention Modeling Ali Borji,

Computer Vision: Summary and Discussion Ali Borji UWM

Tilke Presentation 18.9.12

Google I/O 2015 - Massachusetts Institute of Technologyweb.mit.edu/zoya/www/googleIO2015_small.pdf · Google I/O 2015 Zoya Bylinskii ... need a personal search engine to navigate

Learning to predict where people look · Tilke Judd, Antonio Torralba, Frédo Durand Learning to predict where people look How do we do this? Image database We collected a large database

Curriculum Vitaeisp.tums.ac.ir/content/board_members/46/Dr H Borji CV 2019.pdf · 2 Research and Publications (Full paper) 1- Shayan P, Borji H., Eslami A.Zakeri S. (2007) Isolation

Formula One Azerbaijani Grand Prix - BAM Motorsports · 2019-02-22 · European Grand Prix in 2016. A fascinating street circuit devised by German architect Hermann Tilke carves its

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. XXX, NO. …ilab.usc.edu/borji/papers/TiP-Borji-2Col.pdf · U.S. Government work not protected by U.S. copyright. This article has been

Human vs. Computer in Scene and Object Recognitionilab.usc.edu/borji/papers/cvpr2014Poster_lowRes.pdfLearned Lessons Summary Supported by the National Science Foundation, the General

Predicting observers' task from their scanpaths on natural scenesilab.usc.edu/borji/papers/borjiVSS2014lowres.pdf · 2014. 5. 14. · 9 - 0.2828 [0.2825] 10 - 0.4162 [0.3905] 11 -

New IPA Design For 500 MeV TRIUMF Cyclotron A. Mitra Yuri Bylinskii, Z. Bjelic, V. Zvyagintsev, TRIUMF, Vancouver, M. Battig, University of Waterloo, S

Robust Handwritten Character Recognition with Features ...ilab.usc.edu/borji/papers/NPL.pdf · written digit recognition is proposed. A benchmark analysis for state-of-the-art handwritten