
Approximate Inference in Bayes Nets
Sampling-based methods

Mausam

(Based on slides by Jack Breese and Daphne Koller)

A Bayes net is a generative model

• We can easily generate samples from the distribution represented by the Bayes net
  – generate one variable at a time, in topological order
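The topological-order sampling above can be sketched as follows. This is a minimal illustration, not the network from the slides: the two-node net Cloudy → Rain and its CPT numbers are hypothetical, chosen only so the estimate can be checked by hand.

```python
import random

# Forward (prior) sampling sketch on a hypothetical net Cloudy -> Rain.
# Sample variables in topological order, each conditioned on its
# already-sampled parents.  CPT values are made up for illustration.
P_CLOUDY = 0.5
P_RAIN_GIVEN = {True: 0.8, False: 0.1}   # P(rain | Cloudy)

def prior_sample(rng):
    cloudy = rng.random() < P_CLOUDY
    rain = rng.random() < P_RAIN_GIVEN[cloudy]
    return {"cloudy": cloudy, "rain": rain}

rng = random.Random(0)
samples = [prior_sample(rng) for _ in range(100_000)]
# Marginal P(rain) = 0.5*0.8 + 0.5*0.1 = 0.45; the estimate should be close.
est_rain = sum(s["rain"] for s in samples) / len(samples)
```

Because each variable is drawn only after its parents, every complete assignment is produced with exactly its probability under the joint distribution.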

(Slides 3–18 step through prior sampling on an example network, building up an estimate of P(B|C) one sample at a time.)

Rejection Sampling

• Sample from the prior
  – reject samples that do not match the evidence
• Returns consistent posterior estimates
• Hopelessly expensive if P(e) is small
  – P(e) drops off exponentially with the number of evidence variables
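A sketch of rejection sampling, again on a hypothetical two-node net Cloudy → Rain with made-up CPTs (not the slides' network): draw from the prior, discard samples that contradict the evidence Rain=true, and read the query off the survivors.

```python
import random

# Rejection sampling sketch: estimate P(cloudy | rain) on a hypothetical
# net Cloudy -> Rain.  Samples inconsistent with the evidence are rejected.
P_CLOUDY = 0.5
P_RAIN_GIVEN = {True: 0.8, False: 0.1}   # P(rain | Cloudy)

def estimate_cloudy_given_rain(n, seed=0):
    rng = random.Random(seed)
    accepted = cloudy_count = 0
    for _ in range(n):
        cloudy = rng.random() < P_CLOUDY
        rain = rng.random() < P_RAIN_GIVEN[cloudy]
        if not rain:          # sample contradicts evidence Rain=true: reject
            continue
        accepted += 1
        cloudy_count += cloudy
    return cloudy_count / accepted, accepted

# Exact: P(c|r) = 0.5*0.8 / (0.5*0.8 + 0.5*0.1) = 8/9 ~ 0.889.
est, accepted = estimate_cloudy_given_rain(200_000)
```

Here P(rain) = 0.45, so over half the work is thrown away; with many or unlikely evidence variables the accepted fraction collapses, which is exactly the cost blow-up the bullet above describes.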

Likelihood Weighting

• Idea:
  – fix the evidence variables
  – sample only the non-evidence variables
  – weight each sample by the likelihood of the evidence

(Slides 21–30 repeat the P(B|C) example using likelihood weighting, one weighted sample per slide.)

Likelihood Weighting

• Sampling probability: S(z,e) = ∏_i P(z_i | Parents(Z_i))
  – neither the prior nor the posterior
• Weight for a sample <z,e>: w(z,e) = ∏_i P(e_i | Parents(E_i))
• Weighted sampling probability:
  S(z,e) · w(z,e) = ∏_i P(z_i | Parents(Z_i)) · ∏_i P(e_i | Parents(E_i)) = P(z,e)
• Returns consistent estimates
• Performance degrades with many evidence variables
  – a few samples carry nearly all the total weight
  – late-occurring evidence variables do not guide sample generation
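The weighting scheme above can be sketched on the same hypothetical Cloudy → Rain toy net used earlier (CPT numbers are made up for illustration): the evidence Rain=true is clamped rather than sampled, and each sample carries weight P(rain | cloudy).

```python
import random

# Likelihood-weighting sketch: estimate P(cloudy | rain) on a hypothetical
# net Cloudy -> Rain.  Evidence Rain=true is fixed; each sample's weight is
# the likelihood of that evidence given the sampled parents.
P_CLOUDY = 0.5
P_RAIN_GIVEN = {True: 0.8, False: 0.1}   # P(rain | Cloudy)

def lw_estimate_cloudy_given_rain(n, seed=0):
    rng = random.Random(seed)
    w_cloudy = w_total = 0.0
    for _ in range(n):
        cloudy = rng.random() < P_CLOUDY   # sample the non-evidence variable
        w = P_RAIN_GIVEN[cloudy]           # weight = likelihood of evidence
        w_total += w
        if cloudy:
            w_cloudy += w
    return w_cloudy / w_total

# Exact: P(c|r) = 8/9 ~ 0.889; the weighted estimate converges to it,
# consistent with S(z,e)*w(z,e) = P(z,e) above.
est = lw_estimate_cloudy_given_rain(200_000)
```

No sample is rejected, but when the evidence is unlikely most weights are tiny and a few samples dominate the estimate, which is the degradation noted in the last bullet.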


MCMC with Gibbs Sampling

• Fix the values of the observed variables
• Set the values of all non-observed variables randomly
• Perform a random walk through the space of complete variable assignments. On each move:
  1. Pick a variable X
  2. Calculate Pr(X=true | all other variables)
  3. Set X to true with that probability
• Repeat many times. The frequency with which any variable X is true is its posterior probability.
• Converges to the true posterior when the frequencies stop changing significantly
  – stable distribution, mixing

Markov Blanket Sampling

• How do we calculate Pr(X=true | all other variables)?
• Recall: a variable is independent of all others given its Markov blanket:
  – parents
  – children
  – other parents of its children
• So the problem becomes calculating Pr(X=true | MB(X))
  – We solve this sub-problem exactly
  – Fortunately, it is easy to solve:

  P(x | MB(X)) ∝ P(x | Parents(X)) · ∏_{Y ∈ Children(X)} P(y | Parents(Y))

Example

Network: A → X → B ← C (A is X's parent; B is X's child, with other parent C). Then:

  P(X | A,B,C) = P(X,A,B,C) / P(A,B,C)
               = P(A) P(X|A) P(C) P(B|X,C) / P(A,B,C)
               = [P(A) P(C) / P(A,B,C)] · P(X|A) P(B|X,C)
               = α · P(X|A) P(B|X,C)

where α = P(A)P(C)/P(A,B,C) does not depend on X, so it can be found by normalizing over X's values.

Example

Network: Smoking (S) → Heart disease (H); Smoking → Lung disease (G); Heart disease and Lung disease → Shortness of breath (B).

  P(s) = 0.2

  S  | P(h)        S  | P(g)
  s  | 0.6         s  | 0.8
  ~s | 0.1         ~s | 0.1

  H  G  | P(b)
  h  g  | 0.9
  h  ~g | 0.8
  ~h g  | 0.7
  ~h ~g | 0.1

• Evidence: s, b
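With these CPTs, the Markov-blanket formula can be evaluated numerically. A minimal sketch for the conditional used in the walkthrough, P(H | s, g, b) ∝ P(H | s) · P(b | H, g) (H's blanket is {S, G, B}; CPT values are from the tables above):

```python
# Markov-blanket conditional for H given s, g, b, using the slides' CPTs:
# P(H | s, g, b) is proportional to P(H | s) * P(b | H, g).
p_h_given_s = {True: 0.6, False: 0.4}                    # P(H | s)
p_b_given_hg = {(True, True): 0.9, (False, True): 0.7}   # P(b | H, g)

unnorm = {h: p_h_given_s[h] * p_b_given_hg[(h, True)] for h in (True, False)}
z = sum(unnorm.values())          # 0.54 + 0.28 = 0.82
p_h = unnorm[True] / z            # 0.54 / 0.82 ~ 0.659
```

So when the Gibbs walk resamples H in state (s, g, b), it sets H=true with probability about 0.66.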

Example walkthrough (slides 37–43 repeat the network and CPTs above, advancing one Gibbs step per slide):

• Evidence: s, b
• Randomly set the non-evidence variables: h, g
• Sample H using P(H|s,g,b); suppose the result is ~h
• Sample G using P(G|s,~h,b); suppose the result is g
• Sample G again using P(G|s,~h,b); suppose the result is ~g
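The whole walkthrough can be sketched as a runnable Gibbs sampler on this network. The CPT values are from the slides; the loop structure (alternating H and G updates, no burn-in) is an illustrative simplification, not the slides' exact schedule.

```python
import random

# Gibbs sampling sketch for the smoking network: S -> H, S -> G, (H,G) -> B.
# Evidence: s=true, b=true.  Resample each hidden variable from its
# Markov-blanket conditional and count how often H is true.
P_H = {True: 0.6, False: 0.1}                     # P(h | S)
P_G = {True: 0.8, False: 0.1}                     # P(g | S)
P_B = {(True, True): 0.9, (True, False): 0.8,
       (False, True): 0.7, (False, False): 0.1}   # P(b | H, G)

def sample_h(g, rng):
    # P(h | s, g, b) proportional to P(h | s) * P(b | h, g)
    ph = P_H[True] * P_B[(True, g)]
    pnh = (1 - P_H[True]) * P_B[(False, g)]
    return rng.random() < ph / (ph + pnh)

def sample_g(h, rng):
    # P(g | s, h, b) proportional to P(g | s) * P(b | h, g)
    pg = P_G[True] * P_B[(h, True)]
    png = (1 - P_G[True]) * P_B[(h, False)]
    return rng.random() < pg / (pg + png)

rng = random.Random(0)
h, g = True, True            # random initial assignment, as on the slides
n = 200_000
count_h = 0
for _ in range(n):
    h = sample_h(g, rng)
    g = sample_g(h, rng)
    count_h += h
est = count_h / n
# Exact check: P(h,g | s,b) proportional to P(h|s)P(g|s)P(b|h,g), so
# P(h | s,b) = (0.432 + 0.096) / 0.76 ~ 0.695.
```

The fraction of sweeps with H=true converges to P(h | s, b), matching the "frequency = posterior" claim in the MCMC slide.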

Gibbs MCMC Summary

• Advantages:
  – No samples are discarded
  – No problem with samples of low weight
  – Can be implemented very efficiently
    • ~10,000 samples per second
• Disadvantages:
  – Can get stuck if the relationship between two variables is deterministic
  – Many variations have been devised to make MCMC more robust

  P(X=x | E) ≈ (number of samples with X=x) / (total number of samples)

Other inference methods

• Exact inference
  – Junction tree
• Approximate inference
  – Belief propagation
  – Variational methods
