Perceptual Losses for Real-Time Style Transfer and Super-Resolution: Supplementary Material
Justin Johnson, Alexandre Alahi, Li Fei-Fei
{jcjohns, alahi, feifeili}@cs.stanford.edu
Department of Computer Science, Stanford University
1 Network Architectures
Our style transfer networks use the architecture shown in Table 1 and our super-resolution networks use the architectures shown in Table 2. In these tables "C × H × W conv" denotes a convolutional layer with C filters of size H × W which is immediately followed by spatial batch normalization [1] and a ReLU nonlinearity.
Our residual blocks each contain two 3 × 3 convolutional layers with the same number of filters in both layers. We use the residual block design of Gross and Wilber [2] (shown in Figure 1), which differs from that of He et al. [3] in that the ReLU nonlinearity following the addition is removed; this modified design was found in [2] to perform slightly better for image classification.
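As a rough illustration, the modified residual block (no ReLU after the addition) can be sketched in plain NumPy. This is our own single-channel toy, not the paper's implementation: `conv3x3` is a naive zero-padded convolution standing in for a learned layer, and batch normalization is omitted for brevity.

```python
import numpy as np

def conv3x3(x, w):
    """Naive same-padded 3x3 convolution on a single-channel 2D array.
    x: (H, W) input, w: (3, 3) kernel. Stands in for a learned conv layer."""
    H, W = x.shape
    xp = np.pad(x, 1)  # zero padding keeps the spatial size unchanged
    out = np.zeros((H, W), dtype=float)
    for i in range(H):
        for j in range(W):
            out[i, j] = np.sum(xp[i:i + 3, j:j + 3] * w)
    return out

def residual_block(x, w1, w2):
    """Residual block in the style of Gross and Wilber [2]:
    conv -> ReLU -> conv, then add the input. Note there is NO ReLU after
    the addition (the He et al. [3] design would apply one); batch norm
    is omitted in this sketch."""
    h = np.maximum(conv3x3(x, w1), 0.0)  # first conv + ReLU
    h = conv3x3(h, w2)                   # second conv, no nonlinearity yet
    return x + h                         # identity shortcut, no final ReLU
```

With all-zero kernels the block reduces exactly to the identity map, which illustrates why residual networks are easy to optimize early in training.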
Layer                          Activation size
Input                          3 × 256 × 256
32 × 9 × 9 conv, stride 1      32 × 256 × 256
64 × 3 × 3 conv, stride 2      64 × 128 × 128
128 × 3 × 3 conv, stride 2     128 × 64 × 64
Residual block, 128 filters    128 × 64 × 64
Residual block, 128 filters    128 × 64 × 64
Residual block, 128 filters    128 × 64 × 64
Residual block, 128 filters    128 × 64 × 64
Residual block, 128 filters    128 × 64 × 64
64 × 3 × 3 conv, stride 1/2    64 × 128 × 128
32 × 3 × 3 conv, stride 1/2    32 × 256 × 256
3 × 9 × 9 conv, stride 1       3 × 256 × 256
Table 1. Network architecture used for style transfer networks.
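The spatial sizes in Table 1 follow from simple arithmetic: a stride-2 convolution halves H and W, a stride-1/2 (fractionally strided) convolution doubles them, and stride-1 convolutions with appropriate padding leave them unchanged. A quick sanity check, using a small helper of our own (not from the paper's code):

```python
def activation_sizes(input_hw, strides):
    """Track the spatial size through a chain of convolutions.
    strides: sequence of 1 (same size), 2 (halve H and W), or 0.5 (double)."""
    sizes = [input_hw]
    for s in strides:
        sizes.append(int(sizes[-1] / s))
    return sizes

# Table 1: one stride-1 conv, two stride-2 convs, five residual blocks
# (each stride 1), two stride-1/2 convs, and a final stride-1 conv.
strides = [1, 2, 2] + [1] * 5 + [0.5, 0.5, 1]
print(activation_sizes(256, strides)[-1])  # 256: output matches input size
```

The bulk of the network (the residual blocks) runs at the downsampled 64 × 64 resolution, which is what makes the architecture cheap at inference time.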
2 Residual vs non-Residual Connections
We performed preliminary experiments comparing residual networks for style transfer with non-residual networks. We trained a style transfer network using The Great Wave Off Kanagawa as a style image, replacing each residual block
×4 network:
Layer                          Activation size
Input                          3 × 72 × 72
64 × 9 × 9 conv, stride 1      64 × 72 × 72
Residual block, 64 filters     64 × 72 × 72
Residual block, 64 filters     64 × 72 × 72
Residual block, 64 filters     64 × 72 × 72
Residual block, 64 filters     64 × 72 × 72
64 × 3 × 3 conv, stride 1/2    64 × 144 × 144
64 × 3 × 3 conv, stride 1/2    64 × 288 × 288
3 × 9 × 9 conv, stride 1       3 × 288 × 288

×8 network:
Layer                          Activation size
Input                          3 × 36 × 36
64 × 9 × 9 conv, stride 1      64 × 36 × 36
Residual block, 64 filters     64 × 36 × 36
Residual block, 64 filters     64 × 36 × 36
Residual block, 64 filters     64 × 36 × 36
Residual block, 64 filters     64 × 36 × 36
64 × 3 × 3 conv, stride 1/2    64 × 72 × 72
64 × 3 × 3 conv, stride 1/2    64 × 144 × 144
64 × 3 × 3 conv, stride 1/2    64 × 288 × 288
3 × 9 × 9 conv, stride 1       3 × 288 × 288
Table 2. Network architectures used for ×4 and ×8 super-resolution.
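Both super-resolution networks keep all computation at the low input resolution and upsample only at the end: each stride-1/2 convolution doubles H and W, so the ×4 network uses two of them (72 → 144 → 288) and the ×8 network uses three (36 → 72 → 144 → 288). The overall factor is simply 2^n for n upsampling layers; a small check under that reading of Table 2:

```python
def upscale_factor(num_half_stride_convs):
    """Each stride-1/2 conv doubles the spatial size, so n of them give 2**n."""
    return 2 ** num_half_stride_convs

assert upscale_factor(2) == 4   # x4 network: 72 -> 288
assert upscale_factor(3) == 8   # x8 network: 36 -> 288
assert 72 * upscale_factor(2) == 288 and 36 * upscale_factor(3) == 288
```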
[Figure 1 diagrams. Left: 3 × 3 conv → batch norm → ReLU → 3 × 3 conv → batch norm → addition (+) with the input. Right: 3 × 3 conv → batch norm → ReLU → 3 × 3 conv → batch norm → ReLU.]
Fig. 1. Left: Residual block design used in our networks. Right: An equivalent convolutional block.
[Figure 2 panels omitted; columns show the style image, the content image, the residual network output, and the non-residual network output.]
Fig. 2. A comparison of residual vs non-residual networks for style transfer.
in Table 1 with an equivalent non-residual block consisting of a pair of 3 × 3 convolutional layers with the same number of filters, as shown in Figure 1.
Figure 2 shows the training losses for a residual and non-residual network, both trained using Adam [4] for 40,000 iterations with a learning rate of 1 × 10^-3. We see that the residual network trains faster, but that both networks eventually achieve similar training losses. Figure 2 also shows a style transfer example from the trained residual and non-residual networks; both learn to apply similar transformations to input images.
Our style transfer networks are only 16 layers deep, which is relatively shallow compared to the networks in [3]. We hypothesize that residual connections may be more crucial for training deeper networks.
3 Super-Resolution Examples
We show additional examples of ×4 single-image super-resolution in Figure 4 and additional examples of ×8 single-image super-resolution in Figure 3.
[Figure 3 images omitted. PSNR / SSIM per example (ground truth serves as reference):
 Example 1: Bicubic 24.92 / 0.6694, Ours (ℓpixel) 25.48 / 0.6810, Ours (ℓfeat) 24.70 / 0.6757;
 Example 2: Bicubic 24.37 / 0.5718, Ours (ℓpixel) 24.97 / 0.5889, Ours (ℓfeat) 23.34 / 0.5879.]
Fig. 3. Additional examples of ×8 single-image super-resolution on the BSD100 dataset.
[Figure 4 images omitted. PSNR / SSIM per example (ground truth serves as reference):
 Set5:   Bicubic 30.18 / 0.8737, Ours (ℓpixel) 29.96 / 0.8760, SRCNN [5] 32.00 / 0.9026, Ours (ℓfeat) 27.80 / 0.8053;
 Set14:  Bicubic 29.84 / 0.8144, Ours (ℓpixel) 29.69 / 0.8113, SRCNN [5] 31.20 / 0.8394, Ours (ℓfeat) 28.18 / 0.7757;
 BSD100: Bicubic 32.48 / 0.8575, Ours (ℓpixel) 32.30 / 0.8568, SRCNN [5] 33.49 / 0.8741, Ours (ℓfeat) 30.85 / 0.8125.]
Fig. 4. Additional examples of ×4 single-image super-resolution on examples from the Set5 (top), Set14 (middle) and BSD100 (bottom) datasets.
References
1. Ioffe, S., Szegedy, C.: Batch normalization: Accelerating deep network training by reducing internal covariate shift. In: Proceedings of The 32nd International Conference on Machine Learning. (2015) 448–456
2. Gross, S., Wilber, M.: Training and investigating residual nets. http://torch.ch/blog/2016/02/04/resnets.html (2016)
3. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385 (2015)
4. Kingma, D., Ba, J.: Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
5. Dong, C., Loy, C.C., He, K., Tang, X.: Learning a deep convolutional network for image super-resolution. In: Computer Vision–ECCV 2014. Springer (2014) 184–199