
A critical analysis of self-supervision, or what we can learn from a single image

Yuki M. Asano

CDT Annual Meeting Oct 2019

Work with Christian Rupprecht and Andrea Vedaldi at VGG

Outline

• Self-supervised learning saga

• Or is it?

Self-supervised learning like we do

1. A large, unlabelled collection of images

2. Train your network without labels

3. Use the image representations (vectors) for new tasks
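A minimal PyTorch-style sketch of this three-step recipe; the names `backbone`, `proxy_head` and `make_proxy_batch` are placeholders for whatever proxy task one picks, not anything specific from the talk:

import torch
import torch.nn as nn
import torch.nn.functional as F

def train_self_supervised(backbone, proxy_head, make_proxy_batch, loader, epochs=10):
    """Step 2: train without human labels; the 'labels' come from the proxy task."""
    model = nn.Sequential(backbone, proxy_head)
    opt = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
    for _ in range(epochs):
        for images in loader:                # step 1: batches of unlabelled images
            # make_proxy_batch turns raw images into (inputs, targets),
            # e.g. rotated copies + rotation labels, or cluster pseudo-labels.
            inputs, targets = make_proxy_batch(images)
            opt.zero_grad()
            F.cross_entropy(model(inputs), targets).backward()
            opt.step()
    return backbone                          # step 3: reuse its representations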

Self-supervised learning like we do?

[Diagram: unlabelled data + transformations → CNN → proxy task / loss]

e.g. DeepCluster

• Run k-means on features

• Train classifier on k classes

• Repeat for 200 epochs

Deep Clustering for Unsupervised Learning of Visual Features. M. Caron, P. Bojanowski, A. Joulin, and M. Douze. ECCV, 2018.
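A rough sketch of the loop in the bullets above, assuming a loader that yields batches of images in a fixed (unshuffled) order and a `classifier` that is a linear layer over the flattened features. The real DeepCluster also PCA-whitens features, re-initialises the classifier and handles empty or unbalanced clusters, so treat this only as the skeleton:

import torch
import torch.nn as nn
from sklearn.cluster import KMeans

def deepcluster_round(backbone, classifier, loader, k=10000, lr=0.05):
    """One round: k-means on current features, then train on the pseudo-labels."""
    # 1) Cluster the features of the whole (unlabelled) dataset into k groups.
    backbone.eval()
    with torch.no_grad():
        feats = torch.cat([backbone(x).flatten(1) for x in loader]).cpu().numpy()
    pseudo = torch.as_tensor(KMeans(n_clusters=k, n_init=1).fit_predict(feats)).long()

    # 2) Train backbone + classifier to predict the k pseudo-classes.
    backbone.train()
    opt = torch.optim.SGD(list(backbone.parameters()) + list(classifier.parameters()),
                          lr=lr, momentum=0.9)
    loss_fn = nn.CrossEntropyLoss()
    for i, x in enumerate(loader):           # relies on the loader order matching step 1
        y = pseudo[i * loader.batch_size:(i + 1) * loader.batch_size]
        opt.zero_grad()
        loss_fn(classifier(backbone(x).flatten(1)), y).backward()
        opt.step()

# 3) Repeat deepcluster_round for ~200 epochs, re-clustering each time.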

e.g. RotNet

• Create 4 classes based on rotations

• Exploits photographer bias

• Simple but works

Unsupervised Representation Learning by Predicting Image Rotations. S. Gidaris, P. Singh, and N. Komodakis. ICLR, 2018.
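The rotation proxy is simple to write down. A sketch, assuming `model` is a CNN whose head outputs 4 logits (one per rotation):

import torch
import torch.nn.functional as F

def rotation_batch(images):
    """Turn a batch of images (N, C, H, W) into a 4-class rotation-prediction problem."""
    rotated, labels = [], []
    for k in range(4):                                   # 0, 90, 180, 270 degrees
        rotated.append(torch.rot90(images, k, dims=(2, 3)))
        labels.append(torch.full((images.size(0),), k, dtype=torch.long))
    return torch.cat(rotated), torch.cat(labels)

def rotnet_step(model, images, optimizer):
    """One training step: the 'labels' are simply the rotations we applied ourselves."""
    x, y = rotation_batch(images)
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    optimizer.step()
    return loss.item()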

Or colorizing images

Colorful Image Colorization. R. Zhang, P. Isola, and A. A. Efros. ECCV, 2016.
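A hedged sketch of the colourisation idea: the grayscale image is the input and the colour information is the free target. For simplicity this regresses a colour residual in RGB, whereas the paper classifies quantised ab values in Lab space; `model` is assumed to map a 1-channel input to a 3-channel output.

import torch
import torch.nn.functional as F

def colorization_step(model, rgb_images, optimizer):
    """Predict colour from luminance; the image itself provides the target for free."""
    gray = rgb_images.mean(dim=1, keepdim=True)     # crude luminance, stand-in for Lab "L"
    target = rgb_images - gray                      # crude "colour" residual (paper: ab bins)
    optimizer.zero_grad()
    loss = F.mse_loss(model(gray), target)          # paper uses a classification loss instead
    loss.backward()
    optimizer.step()
    return loss.item()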

Hypothesis

[Diagram: what/how humans learn (“priors”) vs. transformations?]

Getting there, but not quite yet

Where are we?

[Diagram: what/how humans learn (“priors”) vs. transformations?]

“Learn” from one image… using multiple transformations

A Critical Analysis of Self-Supervision, or What We Can Learn From a Single Image. Y. M. Asano, C. Rupprecht, and A. Vedaldi. arXiv:1904.13132.
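The single-image setting turns one picture into a large training set purely through transformations. A sketch with torchvision-style augmentation; the crop size, jitter strengths and dataset length below are illustrative, not the paper's exact recipe:

import torch
from PIL import Image
from torchvision import transforms

# Heavy augmentation: random crops/rescaling, flips and colour jitter turn a
# single source image into an effectively unlimited stream of training patches.
augment = transforms.Compose([
    transforms.RandomResizedCrop(64, scale=(0.05, 1.0)),
    transforms.RandomHorizontalFlip(),
    transforms.ColorJitter(0.4, 0.4, 0.4, 0.1),
    transforms.ToTensor(),
])

class OneImageDataset(torch.utils.data.Dataset):
    """A 'dataset' made of nothing but one image plus transformations."""
    def __init__(self, path, length=100_000):
        self.image = Image.open(path).convert("RGB")
        self.length = length              # how many augmented samples to draw per epoch
    def __len__(self):
        return self.length
    def __getitem__(self, idx):
        return augment(self.image)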

Learned first convolutional layer – from one image

Performance

Comparison of random, DeepCluster (1 & 1M images) and supervised:

              Conv1   Conv2   Conv3   Conv4   Conv5
  Random       11.6    17.1    16.9    16.3    14.1
  1-image      20.7    31.5    32.5    28.5    21
  1M images    18      32.5    39.2    37.2    30.6
  Supervised   19.3    36.3    44.2    48.3    50.5
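Numbers like these are typically obtained by training a linear classifier on top of frozen features from each conv layer (the standard linear-probing protocol; see the paper for the exact evaluation details). A sketch, with `frozen_features` standing in for the network truncated at the layer being probed:

import torch
import torch.nn as nn
import torch.nn.functional as F

def linear_probe(frozen_features, num_classes, labelled_loader, epochs=30):
    """Train only a linear classifier on top of a frozen feature extractor."""
    frozen_features.eval()
    with torch.no_grad():                               # infer feature dim from one batch
        x0, _ = next(iter(labelled_loader))
        dim = frozen_features(x0).flatten(1).size(1)
    probe = nn.Linear(dim, num_classes)
    opt = torch.optim.SGD(probe.parameters(), lr=0.01, momentum=0.9)
    for _ in range(epochs):
        for x, y in labelled_loader:
            with torch.no_grad():
                f = frozen_features(x).flatten(1)       # backbone stays fixed
            opt.zero_grad()
            F.cross_entropy(probe(f), y).backward()
            opt.step()
    return probe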

Conclusion

1. Early layers of deep networks contain limited information about natural images

2. These can be learned through self-supervision or supervised learning

3. Notably, only one image + transformations are necessary for this

4. There is still much room to push self-supervised learning in the right direction

Style transfer with a 1-image trained CNN

yuki@robots.ox.ac.uk · @y_m_asa

Appendix
