55
Recent Progress in Generative Modeling Ilya Sutskever

Recent Progress in Generative Modeling - · PDF file• Deep supervised learning ... What is a generative model?

  • Upload
    letuyen

  • View
    217

  • Download
    3

Embed Size (px)

Citation preview

Page 1: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Recent Progress in Generative Modeling

Ilya Sutskever

Page 2: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Goal of OpenAI

• Make sure that AI is actually good for humanity

Page 3: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Goal of OpenAI

• Prevent concentration of AI power

• Build AI to benefit as many people as possible

• Build AI that will do what we want it to do

Page 4: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

ML: what works?

• Deep supervised learning

• Vision, speech, translation, language, ads, robotics

Page 5: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

ML: what works?• Deep supervised learning:

• Get lots of input-output examples

• Train a very large deep neural network

• Convolutional or seq2seq with attention

• Great results are likely

Page 6: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

What’s next?

• Agents that achieve goals

• Systems that build a holistic understanding of the world

• Creative problem solving

• etc

Page 7: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Generative models

• Critical for many of the upcoming problems

Page 8: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

What is a generative model?

• Learn your data distribution

• Assign high probability to it

• Learn to generate plausible structure

• Discover the “true” structure of the data

Page 9: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Generative models

• What are they good for?

• What can we do with them?

Page 10: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Conventional applications• Good generative models will definitely enable the

following:

• Structured prediction (e.g., output text)

• Much more robust prediction

• Anomaly detection

• Model-based RL

Page 11: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Speculative applications• Really good feature learning

• Exploration in RL

• Inverse RL

• Good dialog that actually works

• “Understanding the world”

• Transfer learning

Page 12: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Generative models

• Three broad categories of generative models:

• Variational Autoencoders

• Generative adversarial networks

• Autoregressive models

Page 13: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Improved techniques for training GANs

• Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, Xi Chen

Page 14: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Generative adversarial networks

• A generator G(z) and a discriminator D(x)

• Discriminator aims to separate real data from generator samples

• Generator tries to fool the discriminator

• GANs often produce best samples so far

Page 15: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Generative adversarial networks

• Yann LeCun: The most important [recent development], in my opinion, is adversarial training (also called GAN for Generative Adversarial Networks) — from Quora Q&A session

Page 16: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Promising early results• Best high-resolution image samples of any model

so far:

• Deep generative image models using a Laplacian pyramid of adversarial networks. — Denton et al.

• DCGAN — Radford et al.

Page 17: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Hard to train

• The model is defined in terms of a minimax problem

• No cost function

• Hard to tell if progress is being made

Page 18: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Simple ideas for improving GAN training

• GANs fail to learn due to the collapse problem:

• The generator becomes degenerate and the learning gets stuck

• Solution: the discriminator should see the entire mini batch

• If all the cases are identical, it will be easier to discern

Page 19: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Results

Page 20: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 21: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

–Johnny Appleseed

“Type a quote here.”

Page 22: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 23: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 24: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 25: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Semi supervised learning with GANs

• Semi supervised learning is the problem of getting better classification using unlabelled data

• A good generic semi supervised learning algorithm will improve all ML applications

Page 26: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Semi supervised learning with GANs

• Discriminator should both tell the class of the training samples, and tell real samples from fake samples apart

• The specific way in which it is done is important, but it is technical, and I will not explain it

• The GAN training algorithm is also different here. Details are available offline.

Page 27: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Results

• MNIST: 50 supervised training cases + ensemble of 10 models = 1.4% test error

• CIFAR 10: 4000 supervised training cases = 18.5% test error

• Both results are new state of the art

Page 28: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Conclusions

• We have better methods for training GANs

• New simple way of using GANs to improve discriminative models

• New level of sample quality and semi-supervised learning accuracy

Page 29: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

InfoGAN

• Xi Chen, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel

Page 30: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Disentangled representations

• Holy grail of representation learning

Page 31: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

InfoGAN

• Train a GAN

• such that: a small subset of its variables is accurately predictable from the generated sample

• Straightforward to add this constraint

Page 32: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Actually works!

Page 33: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 34: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 35: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 36: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 37: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 38: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 39: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 40: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Exploration with generative models

• Rein Houthooft, Xi Chen, John Schulman, Filip De Turck, Pieter Abbeel

Page 41: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

The problem

• In reinforcement learning, we take random actions

• Sometimes the actions do us good

• Then we do more of these actions in the future

Page 42: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Exploration

• Are random actions the best we can do?

• Surely not

Page 43: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Curiosity

• Key idea: take actions to maximize “information gain”

Page 44: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Formally

• Learn a Bayesian generative model of the environment

• For the action taken, calculate the amount of information gained about the environment by the generative model

• Add the amount of information to the reward

Page 45: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Actually works

• Extremely well on low-D environments

• Many unsolvable problems become solvable

• Current work: scaling up to high-D environments

Page 46: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 47: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 48: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Improving Variational Autoencoders with Inverse Autoregressive Flow

• Durk Kingma, Tim Salimans, Max Welling

Page 49: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

The Helmholtz Machine

• Latent variable model

• Use an approximate posterior

• Maximize a lower bound to the likelihood

• Has been impossible to train

Page 50: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Reparameterization Trick

• The Helmholtz machine has been forever impossible to train

• The reparameterization trick of Kingma and Welling fixes this problem, whenever the latent variables are continuous

Page 51: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

High-quality posterior

• Approximate posteriors matter

• Typical approximate posteriors are very simple

• Normal way of doing powerful posteriors is very expensive

• IAF = a new cheap way of getting extremely powerful posteriors

Page 52: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Results

• Best non-pixel-CNN log probabilities on CIFAR-10

• Excellent samples

• Currently training huge ImageNet models

Page 53: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 54: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?
Page 55: Recent Progress in Generative Modeling -   · PDF file• Deep supervised learning ... What is a generative model?

Questions?