
Applications of GANs

● Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

● Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks

● Generative Adversarial Text to Image Synthesis


Using GANs for Single Image Super-Resolution

Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi


Problem

How do we obtain a high-resolution (HR) image from a single low-resolution (LR) image?

Answer: We use super-resolution (SR) techniques.

http://www.extremetech.com/wp-content/uploads/2012/07/super-resolution-freckles.jpg

Previous Attempts


SRGAN


SRGAN - Generator

● G: generator that takes a low-resolution image I^LR and outputs its super-resolved counterpart I^SR (sketched below)

● θ_G: parameters of G, {W_1:L, b_1:L}

● l^SR: loss function that measures the difference between the two high-resolution images
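Below is a minimal PyTorch sketch of this generator's overall shape (residual blocks followed by sub-pixel upsampling). It is an illustration, not the paper's exact architecture: the block count, channel widths, and the omitted long skip connection are simplifying assumptions.

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Conv-BN-PReLU-Conv-BN with an identity skip connection."""
    def __init__(self, channels=64):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.PReLU(),
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
        )

    def forward(self, x):
        return x + self.body(x)

class Generator(nn.Module):
    """G_theta_G: maps a low-res image I^LR to a 4x super-resolved I^SR."""
    def __init__(self, num_blocks=16):
        super().__init__()
        self.head = nn.Sequential(nn.Conv2d(3, 64, 9, padding=4), nn.PReLU())
        self.blocks = nn.Sequential(*[ResidualBlock() for _ in range(num_blocks)])
        # Two sub-pixel (PixelShuffle) stages, each doubling resolution: 2x * 2x = 4x.
        self.upsample = nn.Sequential(
            nn.Conv2d(64, 256, 3, padding=1), nn.PixelShuffle(2), nn.PReLU(),
            nn.Conv2d(64, 256, 3, padding=1), nn.PixelShuffle(2), nn.PReLU(),
        )
        self.tail = nn.Conv2d(64, 3, 9, padding=4)

    def forward(self, lr):
        return self.tail(self.upsample(self.blocks(self.head(lr))))
```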


SRGAN - Discriminator

● D: discriminator that classifies whether a high-resolution image is a real I^HR or a generated I^SR (sketched below)

● θ_D: parameters of D
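A matching sketch of D, assuming the usual pattern of strided convolutions that halve resolution while increasing channel depth, ending in a sigmoid probability; the exact filter schedule and the adaptive pooling are assumptions made to keep the example self-contained.

```python
import torch.nn as nn

def conv_block(in_ch, out_ch, stride):
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, stride=stride, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.LeakyReLU(0.2),
    )

class Discriminator(nn.Module):
    """D_theta_D: probability that the input is a real I^HR rather than a generated I^SR."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 64, 3, padding=1), nn.LeakyReLU(0.2),
            conv_block(64, 64, 2),
            conv_block(64, 128, 1), conv_block(128, 128, 2),
            conv_block(128, 256, 1), conv_block(256, 256, 2),
            conv_block(256, 512, 1), conv_block(512, 512, 2),
        )
        self.classifier = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(512, 1024), nn.LeakyReLU(0.2),
            nn.Linear(1024, 1), nn.Sigmoid(),
        )

    def forward(self, img):
        return self.classifier(self.features(img))
```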


SRGAN - Perceptual Loss Function

The loss is calculated as a weighted combination of the following terms (written out after the list):

➔ Content loss

➔ Adversarial loss

➔ Regularization loss
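In the paper's notation this is a weighted sum; the adversarial weight 10^{-3} is the paper's, while the total-variation weight shown follows the commonly cited early version of the paper and should be read as an assumption:

l^{SR} = l^{SR}_{X} + 10^{-3}\, l^{SR}_{Gen} + 2 \times 10^{-8}\, l^{SR}_{TV}

where l^{SR}_{X} is the content loss (pixel-wise MSE or VGG-based).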


SRGAN - Content Loss

Instead of pixel-wise MSE, use a loss function based on the ReLU activation layers of a pre-trained VGG network. This enforces similarity of content rather than per-pixel agreement (the loss is written out below the list).

● φ_{i,j}: feature map obtained by the j-th convolution before the i-th max-pooling layer

● W_{i,j} and H_{i,j}: dimensions of the feature maps within the VGG network
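Written out, the VGG content loss is the MSE between feature maps of the reference image and the super-resolved image:

l^{SR}_{VGG/i,j} = \frac{1}{W_{i,j} H_{i,j}} \sum_{x=1}^{W_{i,j}} \sum_{y=1}^{H_{i,j}} \left( \phi_{i,j}(I^{HR})_{x,y} - \phi_{i,j}\left(G_{\theta_G}(I^{LR})\right)_{x,y} \right)^{2}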


SRGAN - Adversarial Loss

Encourages the network to favour images that reside on the manifold of natural images.
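Concretely, the generative loss is defined over the discriminator's outputs on all N super-resolved training samples, using -\log D rather than \log(1 - D) for better gradient behaviour:

l^{SR}_{Gen} = \sum_{n=1}^{N} -\log D_{\theta_D}\!\left( G_{\theta_G}(I^{LR}_{n}) \right)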


SRGAN - Regularization Loss

Encourages spatially coherent solutions, based on the total variation of the output.
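A minimal sketch of a total-variation penalty in PyTorch; the paper uses a gradient-magnitude formulation, so the anisotropic absolute-difference form below is an assumed simplification:

```python
import torch

def tv_loss(img: torch.Tensor) -> torch.Tensor:
    """Anisotropic total variation for a batch of images (N, C, H, W):
    penalizes differences between neighbouring pixels, encouraging
    spatially smooth, coherent outputs."""
    dh = (img[:, :, 1:, :] - img[:, :, :-1, :]).abs().mean()
    dw = (img[:, :, :, 1:] - img[:, :, :, :-1]).abs().mean()
    return dh + dw
```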


SRGAN - Examples


SRGAN - Examples


Conditional Generative Adversarial Nets (CGAN)

Mirza and Osindero (2014)

Architecture diagrams: GAN vs. CGAN (in the CGAN, both G and D also receive the conditioning input y)
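Mirza and Osindero's change to the original minimax game is simply to condition both players on the side information y:

\min_{G} \max_{D} V(D, G) = \mathbb{E}_{x \sim p_{data}(x)}[\log D(x \mid y)] + \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z \mid y)))]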

Generative Adversarial Text to Image Synthesis

Authors’ code available at: https://github.com/reedscot/icml2016

Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee


Motivation

Current deep learning models enable us to...

➢ Learn feature representations of images & text

➢ Generate realistic images & text

e.g., pull out images based on captions, generate descriptions based on images, answer questions about image content


Problem - Multimodal distribution

• Many plausible images can be associated with a single text description

• A previous attempt used variational recurrent autoencoders to generate images from text captions, but the images were not realistic enough (Mansimov et al., 2016)


What GANs can do

• CGAN: use side information (e.g., class labels) to guide the learning process

• Minimax game: Adaptive loss function

➢ Multi-modality is a property that GANs are very well suited to learn.


The Model - Basic CGAN

A pre-trained char-CNN-RNN learns a compatibility function between images and text -> a joint embedding
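A minimal sketch of how this conditioning enters the generator, following the paper's description (the text embedding φ(t) is compressed and concatenated with the noise vector); all dimensions and the fully connected stand-in for the deconvolutional stack are illustrative assumptions:

```python
import torch
import torch.nn as nn

class TextConditionedGenerator(nn.Module):
    def __init__(self, z_dim=100, embed_dim=1024, proj_dim=128):
        super().__init__()
        # Compress the char-CNN-RNN text embedding phi(t) to a small vector.
        self.project = nn.Sequential(nn.Linear(embed_dim, proj_dim), nn.LeakyReLU(0.2))
        # Stand-in for the deconvolutional image synthesis stack.
        self.net = nn.Sequential(nn.Linear(z_dim + proj_dim, 64 * 64 * 3), nn.Tanh())

    def forward(self, z, text_embedding):
        t = self.project(text_embedding)
        x = self.net(torch.cat([z, t], dim=1))  # condition G on the text
        return x.view(-1, 3, 64, 64)
```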


The Model - Variations

GAN-CLS

In order to distinguish different error sources:

Present the discriminator network with three different types of input (instead of two): {real image, right text}, {real image, wrong text}, and {fake image, right text}.

Algorithm (sketched below)
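The core of the GAN-CLS discriminator step (Algorithm 1 in the paper) can be sketched as below; the two-argument D(image, embedding) signature is a hypothetical stand-in for the paper's text-conditioned discriminator:

```python
import torch

def gan_cls_d_loss(D, real_img, fake_img, right_emb, wrong_emb):
    """Discriminator loss over the three input types, so D can separate
    the two error sources: unrealistic images vs. mismatched text.
    The paper maximizes this quantity; it is negated here to give a
    loss suitable for gradient descent."""
    s_r = D(real_img, right_emb)   # {real image, right text}
    s_w = D(real_img, wrong_emb)   # {real image, wrong text}
    s_f = D(fake_img, right_emb)   # {fake image, right text}
    eps = 1e-8  # numerical stability (an added assumption)
    score = (torch.log(s_r + eps)
             + 0.5 * (torch.log(1 - s_w + eps) + torch.log(1 - s_f + eps)))
    return -score.mean()
```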


The Model - Variations cont.

GAN-INT

In order to generalize the output of G:

Interpolate between training-set embeddings to generate new text embeddings and hence fill in the gaps on the image data manifold.

Updated Equation
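The generator objective gains a term over interpolated text embeddings t_1, t_2 drawn from the training set (the authors report that β = 0.5 works well):

\mathbb{E}_{t_1, t_2 \sim p_{data}} \left[ \log \left( 1 - D\!\left( G(z,\; \beta t_1 + (1-\beta)\, t_2) \right) \right) \right]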

GAN-INT-CLS: combination of both previous variations; the interpolated embeddings provide {fake image, fake text} pairs for the discriminator.


Disentangling

❖ Style: background, position & orientation of the object, etc.

❖ Content: shape, size & colour of the object, etc.

● Introduce S(x), a style encoder trained with a squared loss function:
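\mathcal{L}_{style} = \mathbb{E}_{t,\, z \sim \mathcal{N}(0,1)} \left\| z - S\!\left( G(z, \varphi(t)) \right) \right\|_{2}^{2}

i.e., S is trained to recover the noise (style) vector z from a generated image, inverting the generator with respect to style.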

● Useful for generalization: encoding style and content separately allows for new combinations of the two


Training - Data (separated into class-disjoint train and test sets)

Caltech-UCSD Birds

MS COCO

Oxford Flowers


Training – Results: Flower & Bird


Training – Results: MS COCO (compared with Mansimov et al.)


Training – Results: Style disentangling


Thoughts on the paper

• Image quality

• Generalization

• Future work

