48
Bayesian Inference and Latent Variable Models in Machine Learning Dmitry P. Vetrov Head of Bayesian methods research group http://bayesgroup.ru, Faculty of Computer Science, HSE Skoltech

Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Embed Size (px)

Citation preview

Page 1: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Bayesian Inference and Latent Variable Models in

Machine LearningDmitry P. Vetrov

Head of Bayesian methods research group

http://bayesgroup.ru,

Faculty of Computer Science, HSE

Skoltech

Page 2: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Outline

Today

• Probabilistic modeling in Machine Learning

• Exponential class of distributions

• Learning with latent variables

• EM-algorithm

Next time

• Examples of models with discrete and continuous latent variables

• Extensions of EM-algorithm

• Stochastic optimization in EM framework

Page 3: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

What is machine learning?

Page 4: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Simple example

Page 5: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Conditional and marginal distributions

Page 6: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Bayesian Framework

Page 7: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Frequentist vs. Bayesian frameworks

Page 8: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Bayesian Learning and Inference

Page 9: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Combining models

Page 10: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Maximal a posteriori (MAP) learning

Page 11: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Exponential class of distributions

Page 12: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Log-concavity of exponential class

Page 13: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Log-concavity of exponential class

Page 14: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Example: Gaussian distribution

Page 15: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Incomplete likelihood

Page 16: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Variational lower bound

Page 17: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm

Page 18: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm

Page 19: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm

Page 20: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm

Page 21: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm

Page 22: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm

Page 23: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm

Page 24: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Discrete T

Page 25: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of gaussians

Page 26: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of gaussians

Page 27: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of gaussians

Page 28: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of gaussians

Page 29: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of gaussians

Page 30: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of gaussians

Page 31: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of gaussians: formal description

Page 32: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

EM-algorithm for mixture of gaussians

Page 33: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Continuous T

Page 34: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Example: PCA model

Page 35: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Advantages of EM PCA

Page 36: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Mixture of PCA

Page 37: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Example: Latent Dirichlet Allocation

Page 38: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

LDA: formal description

Page 39: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

General nature of EM-framework

Page 40: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Extending E-step

Page 41: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Examples of conjugate distributions

Page 42: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Crisp E-step

Page 43: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Variational E-step

Page 44: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Stochastic optimization

Page 45: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Stochastic EM

Page 46: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Summary: extensions of basic EM

Page 47: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Conclusion

Page 48: Bayesian inference and latent variable models in machine learning (by Dmitry Vetrov)

Challenge

For those who’s interested

• Help Nick Carter to find the criminal who kidnapped lady Thun’s dog http://cmp.felk.cvut.cz/cmp/courses/recognition/Labs/em/index_en.html