9
Deep learning, 2017 Lyle Ungar University of Pennsylvania

Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

Deep learning, 2017 Lyle Ungar

University of Pennsylvania

Page 2: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

What’s hot in Deep Learning u Solving problems

l  Breast cancer diagnosis; chatbots; optimizing data centers… l  Replacing more and more hand-crafted features

u GANS (Generative Adversarial Networks) l  Learn generative models

u Reinforcement Learning u DeepMind/google

Page 3: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

Power Utilization u 15% reduction

https://www.theverge.com/2016/7/21/12246258/google-deepmind-ai-data-center-cooling

Page 4: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

A simple neural network module for relational reasoning

https://deepmind.com/blog/deepmind-papers-nips-2017/

There is a tiny rubber thing that is the same colour as the large cylinder; what shape is it?

Page 5: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play, and given no domain knowledge except the game rules, AlphaZero achieved within 24 hours a superhuman level of play in the games of chess and shogi (Japanese chess) as well as Go, and convincingly defeated a world-champion program in each case.

https://arxiv.org/pdf/1712.01815.pdf

Page 6: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

Lots of fancy network structures

googlenet

Convolutional (different sizes) Or fully connected Maxpool Concatination Softmax

Some layers use dropout

input

outputs

Page 7: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

Design a really smart computer

https://research.googleblog.com/2017/05/using-machine-learning-to-explore.html

Using Machine Learning to Explore Neural Network Architecture

Use RL to search for the ‘best’ neural net architecture

Page 8: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

AutoML learns network structure

Human built Learned by RL https://research.googleblog.com/2017/05/using-machine-learning-to-explore.html

Page 9: Lyle Ungar University of Pennsylvania - Penn …cis520/lectures/deep...Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Starting from random play,

Big open directions (an opinion) u Multitask learning / domain transfer u One shot learning u  Integrating deep learning with external data

(e.g. results of search)