Pattern Recognition and Machine Learning: Graphical Models


Machine Learning

Sourangshu Bhattacharya

Bayesian Networks

Directed Acyclic Graph (DAG)

Bayesian Networks

General Factorization

Curve Fitting Re-visited

Maximum Likelihood

Determine $\mathbf{w}_{\mathrm{ML}}$ by minimizing the sum-of-squares error, $E(\mathbf{w})$.
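As an illustration (not from the slides), a minimal NumPy sketch of the maximum-likelihood fit: build the polynomial design matrix and solve the least-squares problem for $\mathbf{w}_{\mathrm{ML}}$. The synthetic $\sin(2\pi x)$ data, noise level, and polynomial order are assumed values.

```python
import numpy as np

# Synthetic data: noisy samples of sin(2*pi*x) (assumed example setup)
rng = np.random.default_rng(0)
N, M = 10, 3                      # number of points, polynomial order (assumed)
x = np.linspace(0, 1, N)
t = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=N)

# Design matrix Phi[n, j] = x_n^j
Phi = np.vander(x, M + 1, increasing=True)

# Maximum-likelihood weights minimize the sum-of-squares error E(w)
w_ml, *_ = np.linalg.lstsq(Phi, t, rcond=None)
print("w_ML =", w_ml)
```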

Curve Fitting

Polynomial

Curve Fitting

Plate

Predictive Distribution

MAP: A Step towards Bayes

Determine $\mathbf{w}_{\mathrm{MAP}}$ by minimizing the regularized sum-of-squares error, $\widetilde{E}(\mathbf{w})$.
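Continuing the same illustration, a hedged sketch of the MAP estimate: a Gaussian prior on the weights turns the problem into ridge regression, $\mathbf{w}_{\mathrm{MAP}} = (\lambda I + \Phi^{T}\Phi)^{-1}\Phi^{T}\mathbf{t}$. The regularization strength $\lambda$ is an assumed value.

```python
import numpy as np

# Same synthetic data and design matrix as in the ML sketch above
rng = np.random.default_rng(0)
x = np.linspace(0, 1, 10)
t = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=10)
Phi = np.vander(x, 4, increasing=True)

# MAP estimate with a Gaussian prior = ridge regression:
# w_MAP = (lambda*I + Phi^T Phi)^{-1} Phi^T t   (lambda assumed for illustration)
lam = 1e-3
w_map = np.linalg.solve(lam * np.eye(Phi.shape[1]) + Phi.T @ Phi, Phi.T @ t)
print("w_MAP =", w_map)
```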

Bayesian Curve Fitting (3)

Input variables and explicit hyperparameters

Bayesian Curve Fitting—Learning

Condition on data
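A hedged sketch of the resulting predictive distribution, using standard Bayesian linear regression with polynomial basis functions: the predictive mean and variance at a new input follow from the posterior covariance $S$. The precisions $\alpha$, $\beta$ and the data are assumed example values.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 10)
t = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=10)
M, alpha, beta = 3, 2e-3, 25.0          # polynomial order, prior/noise precisions (assumed)

Phi = np.vander(x, M + 1, increasing=True)
S = np.linalg.inv(alpha * np.eye(M + 1) + beta * Phi.T @ Phi)   # posterior covariance

def predictive(x_new):
    """Predictive mean m(x) and variance s^2(x) of p(t | x, data)."""
    phi = np.vander(np.atleast_1d(x_new), M + 1, increasing=True).T  # shape (M+1, 1)
    mean = beta * phi.T @ S @ Phi.T @ t
    var = 1.0 / beta + phi.T @ S @ phi
    return mean.item(), var.item()

print(predictive(0.5))
```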

Generative Models

Causal process for generating images

Discrete Variables (1)

General joint distribution: $K^2 - 1$ parameters

Independent joint distribution: $2(K - 1)$ parameters

Discrete Variables (2)

General joint distribution over $M$ variables: $K^M - 1$ parameters

$M$-node Markov chain: $K - 1 + (M - 1)K(K - 1)$ parameters
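A quick numeric check of these counts (the example values $K = 2$, $M = 10$ are assumptions): the full joint needs $2^{10} - 1 = 1023$ parameters, the Markov chain only $1 + 9 \cdot 2 \cdot 1 = 19$.

```python
K, M = 2, 10                                    # states per variable, number of variables (example)
full_joint = K**M - 1                           # general joint distribution
markov_chain = (K - 1) + (M - 1) * K * (K - 1)  # M-node Markov chain
print(full_joint, markov_chain)                 # 1023 19
```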

Conditional Independence

a is independent of b given c:

$p(a \mid b, c) = p(a \mid c)$

Equivalently:

$p(a, b \mid c) = p(a \mid b, c)\, p(b \mid c) = p(a \mid c)\, p(b \mid c)$

Notation: $a \perp\!\!\!\perp b \mid c$

Conditional Independence: Example 1

Conditional Independence: Example 1

Conditional Independence: Example 2

Conditional Independence: Example 2

Conditional Independence: Example 3

Note: this is the opposite of Example 1, with c unobserved.

Conditional Independence: Example 3

Note: this is the opposite of Example 1, with c observed.

“Am I out of fuel?”

B = Battery (0 = flat, 1 = fully charged)
F = Fuel Tank (0 = empty, 1 = full)
G = Fuel Gauge Reading (0 = empty, 1 = full)

and hence

“Am I out of fuel?”

The probability of an empty tank is increased by observing G = 0.

“Am I out of fuel?”

The probability of an empty tank is reduced by observing B = 0. This is referred to as "explaining away".
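A minimal sketch of this calculation by direct enumeration. The probability tables below are assumed illustrative values in the spirit of the example; the qualitative effect, that observing $B = 0$ lowers $p(F = 0 \mid G = 0)$ again, holds for any tables with this structure.

```python
from itertools import product

# Assumed illustrative tables for the fuel-gauge network
pB = {1: 0.9, 0: 0.1}                      # battery charged?
pF = {1: 0.9, 0: 0.1}                      # tank full?
pG = {(1, 1): 0.8, (1, 0): 0.2,            # p(G=1 | B, F)
      (0, 1): 0.2, (0, 0): 0.1}

def joint(b, f, g):
    pg1 = pG[(b, f)]
    return pB[b] * pF[f] * (pg1 if g == 1 else 1 - pg1)

def posterior_F0(**obs):
    """p(F=0 | observations) by brute-force enumeration."""
    num = den = 0.0
    for b, f, g in product([0, 1], repeat=3):
        if any(val != {'B': b, 'F': f, 'G': g}[k] for k, val in obs.items()):
            continue
        p = joint(b, f, g)
        den += p
        if f == 0:
            num += p
    return num / den

print(posterior_F0())                 # prior p(F=0)
print(posterior_F0(G=0))              # increases after seeing an empty gauge
print(posterior_F0(G=0, B=0))         # decreases again: "explaining away"
```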

D-separation
• A, B, and C are non-intersecting subsets of nodes in a directed graph.
• A path from A to B is blocked if it contains a node such that either
  a) the arrows on the path meet either head-to-tail or tail-to-tail at the node, and the node is in the set C, or
  b) the arrows meet head-to-head at the node, and neither the node, nor any of its descendants, are in the set C.

• If all paths from A to B are blocked, A is said to be d-separated from B by C.

• If A is d-separated from B by C, the joint distribution over all variables in the graph satisfies $A \perp\!\!\!\perp B \mid C$.
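A minimal sketch of a d-separation test via the equivalent moralized ancestral graph criterion: keep the ancestors of $A \cup B \cup C$, marry co-parents and drop edge directions, delete $C$, and check whether $A$ and $B$ remain connected. The graph encoding (a dict mapping each node to its parents) and the example graph are assumptions of this sketch.

```python
from collections import deque

def d_separated(parents, A, B, C):
    """True if A is d-separated from B by C in the DAG given as {node: set(parents)}."""
    A, B, C = set(A), set(B), set(C)

    # 1. Ancestral set of A ∪ B ∪ C
    anc, stack = set(), list(A | B | C)
    while stack:
        n = stack.pop()
        if n not in anc:
            anc.add(n)
            stack.extend(parents.get(n, ()))

    # 2. Moralize the induced subgraph: undirected parent-child edges,
    #    plus edges between co-parents of a common child
    nbrs = {n: set() for n in anc}
    for n in anc:
        ps = [p for p in parents.get(n, ()) if p in anc]
        for p in ps:
            nbrs[n].add(p); nbrs[p].add(n)
        for i, p in enumerate(ps):
            for q in ps[i + 1:]:
                nbrs[p].add(q); nbrs[q].add(p)

    # 3. Remove C and test reachability from A to B
    seen, queue = set(A), deque(A)
    while queue:
        n = queue.popleft()
        if n in B:
            return False
        for m in nbrs.get(n, ()):
            if m not in seen and m not in C:
                seen.add(m); queue.append(m)
    return True

# Head-to-head example: a -> c <- b, with descendant c -> d
g = {'c': {'a', 'b'}, 'd': {'c'}}
print(d_separated(g, {'a'}, {'b'}, set()))   # True: path blocked when c is unobserved
print(d_separated(g, {'a'}, {'b'}, {'d'}))   # False: observing a descendant of c unblocks it
```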

D-separation: Example

D-separation: Example

D-separation: I.I.D. Data

Markov Random Fields

Markov Blanket

Cliques and Maximal Cliques

Clique

Maximal Clique

Joint Distribution

$p(\mathbf{x}) = \frac{1}{Z} \prod_C \psi_C(\mathbf{x}_C)$

where $\psi_C(\mathbf{x}_C)$ is the potential over clique $C$ and

$Z = \sum_{\mathbf{x}} \prod_C \psi_C(\mathbf{x}_C)$

is the normalization coefficient; note: for $M$ $K$-state variables there are $K^M$ terms in $Z$.
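A small sketch making the $K^M$ cost of $Z$ concrete: a chain of $M$ binary variables with pairwise clique potentials, normalized by brute-force enumeration over all $2^M$ configurations. The potential values are assumed for illustration.

```python
from itertools import product
import math

M = 4                                   # number of binary variables (example value)

def psi(xi, xj):
    """Pairwise clique potential favouring agreement between neighbours (assumed values)."""
    return math.exp(1.0 if xi == xj else -1.0)

def unnorm(x):
    return math.prod(psi(x[i], x[i + 1]) for i in range(M - 1))

# Normalization constant: 2^M terms in the sum
Z = sum(unnorm(x) for x in product([0, 1], repeat=M))

def p(x):
    return unnorm(x) / Z

print(Z, p((0, 0, 0, 0)), p((0, 1, 0, 1)))
```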

Energies and the Boltzmann distribution

Illustration: Image De-Noising (1)

[Figure: original image (left) and noisy image (right)]

Illustration: Image De-Noising (2)

Illustration: Image De-Noising (3)

[Figure: noisy image (left) and restored image via ICM (right)]
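A hedged sketch of ICM for this kind of de-noising, assuming the Ising-style energy $E(\mathbf{x}, \mathbf{y}) = h\sum_i x_i - \beta\sum_{\{i,j\}} x_i x_j - \eta\sum_i x_i y_i$ over pixels $x_i \in \{-1, +1\}$ with noisy observations $y_i$: each pixel is repeatedly set to whichever value lowers the energy. The coefficients and the synthetic image are assumed values.

```python
import numpy as np

rng = np.random.default_rng(0)
h, beta, eta = 0.0, 1.0, 2.0          # energy coefficients (assumed values)

# Synthetic binary image in {-1, +1} with 10% flip noise
clean = -np.ones((32, 32), int)
clean[8:24, 8:24] = 1
y = np.where(rng.random(clean.shape) < 0.1, -clean, clean)

def local_energy(x, y, i, j, v):
    """Energy terms involving pixel (i, j) when it takes value v."""
    nb = sum(x[a, b] for a, b in [(i-1, j), (i+1, j), (i, j-1), (i, j+1)]
             if 0 <= a < x.shape[0] and 0 <= b < x.shape[1])
    return h * v - beta * v * nb - eta * v * y[i, j]

x = y.copy()                          # initialize the restored image at the noisy one
for _ in range(5):                    # a few ICM sweeps
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            # keep whichever value of this pixel gives the lower energy
            x[i, j] = min((+1, -1), key=lambda v: local_energy(x, y, i, j, v))

print("pixels differing from the clean image:", int((x != clean).sum()))
```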

Converting Directed to Undirected Graphs (1)

Converting Directed to Undirected Graphs (2)

Additional links

Moralization

Moral Graph

Directed vs. Undirected Graphs (1)

Directed vs. Undirected Graphs (2)

Inference in Graphical Models

Inference in Graphical Models

• Posterior marginalization: $P(x_n \mid y_1, \dots, y_M) = \sum_{x_1, \dots, x_{n-1}, x_{n+1}, \dots, x_N} P(x_1, \dots, x_N \mid y_1, \dots, y_M)$

• MAP: $(x_1^*, \dots, x_N^*) = \arg\max_{x_1, \dots, x_N} P(x_1, \dots, x_N \mid y_1, \dots, y_M)$
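A tiny sketch of both queries by brute-force enumeration over an explicit (assumed) joint table; the message-passing algorithms later in the section compute the same quantities efficiently.

```python
import numpy as np

# Toy conditional joint p(x1, x2, x3 | y) as an explicit table (assumed values)
rng = np.random.default_rng(1)
table = rng.random((2, 2, 2))
table /= table.sum()

# Posterior marginal p(x2 | y): sum out all other variables
p_x2 = table.sum(axis=(0, 2))
print("p(x2|y) =", p_x2)

# MAP assignment: argmax over all joint configurations
x_map = np.unravel_index(table.argmax(), table.shape)
print("MAP (x1, x2, x3) =", x_map)
```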

Conditional Probability:

$P(X \mid Y) = \frac{1}{Z} \exp\!\left(w^{T} f(X, Y)\right)$

$f(X, Y) = \left[\psi_{12}(x_1, x_2, Y), \dots, \psi_{n-1,n}(x_{n-1}, x_n, Y)\right]$
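A minimal sketch of this definition on a short binary chain: score each labelling by $w^{T} f(X, Y)$ with one pairwise feature per edge and normalize by enumerating all labellings. The feature form, observations, and weights are assumptions for illustration.

```python
from itertools import product
import numpy as np

n = 4                                          # chain length (example value)
Y = np.array([0.2, 1.5, -0.3, 0.8])            # observation sequence (assumed)
w = np.ones(n - 1)                             # one weight per edge potential (assumed)

def psi(i, xi, xj, Y):
    """Edge feature psi_{i,i+1}(x_i, x_{i+1}, Y): assumed form, rewarding
    neighbouring labels that agree and labels matching the sign of Y_i."""
    return float(xi == xj) + xi * Y[i]

def f(X, Y):
    # f(X, Y) = [psi_12(x1, x2, Y), ..., psi_{n-1,n}(x_{n-1}, x_n, Y)]
    return np.array([psi(i, X[i], X[i + 1], Y) for i in range(n - 1)])

# P(X | Y) = exp(w^T f(X, Y)) / Z, with Z computed by brute-force enumeration
scores = {X: np.exp(w @ f(X, Y)) for X in product([0, 1], repeat=n)}
Z = sum(scores.values())
X_best = max(scores, key=scores.get)
print(X_best, scores[X_best] / Z)
```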

Conditional Random Fields

[Figure: linear-chain graph over $X_1, X_2, \dots, X_n$ with observations $Y_1, Y_2, \dots, Y_n$]

• Observation sequence: $Y_1, \dots, Y_n$; state sequence: $X_1, \dots, X_n$
• Transition prob.: $P(X_{i+1} \mid X_i)$; emission prob.: $P(Y_i \mid X_i)$

$P(X, Y) = P(X_1) \prod_{i=1}^{n-1} P(X_{i+1} \mid X_i) \prod_{j=1}^{n} P(Y_j \mid X_j)$
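A small sketch evaluating this factorization for a toy HMM; the initial, transition, and emission tables are assumed example values.

```python
import numpy as np

pi = np.array([0.6, 0.4])                  # P(X1)             (assumed)
A = np.array([[0.7, 0.3],                  # P(X_{i+1} | X_i)  (assumed)
              [0.4, 0.6]])
B = np.array([[0.9, 0.1],                  # P(Y_i | X_i)      (assumed)
              [0.2, 0.8]])

def joint(X, Y):
    """P(X, Y) = P(X1) * prod_i P(X_{i+1}|X_i) * prod_j P(Y_j|X_j)."""
    p = pi[X[0]]
    for i in range(len(X) - 1):
        p *= A[X[i], X[i + 1]]
    for j in range(len(X)):
        p *= B[X[j], Y[j]]
    return p

print(joint(X=[0, 0, 1], Y=[0, 1, 1]))
```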

Hidden Markov Model

[Figure: linear-chain graph over hidden states $X_1, X_2, \dots, X_n$ with observations $Y_1, Y_2, \dots, Y_n$]

Inference on a Chain

Inference on a Chain

Inference on a Chain

Inference on a Chain

Inference on a Chain

To compute local marginals:
• Compute and store all forward messages, $\mu_\alpha(x_n)$.
• Compute and store all backward messages, $\mu_\beta(x_n)$.
• Compute $Z$ at any node $x_m$.
• Compute
$p(x_n) = \frac{1}{Z}\,\mu_\alpha(x_n)\,\mu_\beta(x_n)$
for all variables required.
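A minimal sketch of this procedure on a discrete chain with pairwise potentials $\psi_{n-1,n}(x_{n-1}, x_n)$ (assumed random values): forward messages $\mu_\alpha$, backward messages $\mu_\beta$, $Z$ from any single node, and local marginals from their product.

```python
import numpy as np

K, N = 3, 5                                              # states per node, chain length (example)
rng = np.random.default_rng(0)
psi = [rng.random((K, K)) + 0.1 for _ in range(N - 1)]   # pairwise potentials (assumed)

# Forward messages: mu_a[n](x_n) = sum_{x_{n-1}} psi_{n-1,n}(x_{n-1}, x_n) * mu_a[n-1](x_{n-1})
mu_a = [np.ones(K)]
for n in range(1, N):
    mu_a.append(psi[n - 1].T @ mu_a[-1])

# Backward messages: mu_b[n](x_n) = sum_{x_{n+1}} psi_{n,n+1}(x_n, x_{n+1}) * mu_b[n+1](x_{n+1})
mu_b = [np.ones(K) for _ in range(N)]
for n in range(N - 2, -1, -1):
    mu_b[n] = psi[n] @ mu_b[n + 1]

# Z can be computed at any node; local marginal p(x_n) = mu_a[n] * mu_b[n] / Z
Z = float(mu_a[2] @ mu_b[2])
marginals = [mu_a[n] * mu_b[n] / Z for n in range(N)]
print(np.round(marginals[2], 3), [round(float(m.sum()), 6) for m in marginals])
```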

Trees

[Figure: an undirected tree, a directed tree, and a polytree]

Factor Graphs

Factor Graphs from Directed Graphs

Factor Graphs from Undirected Graphs

The Sum-Product Algorithm (1)

Objective:

i. to obtain an efficient, exact inference algorithm for finding marginals;

ii. in situations where several marginals are required, to allow computations to be shared efficiently.

Key idea: Distributive Law

The Sum-Product Algorithm (2)

The Sum-Product Algorithm (3)

The Sum-Product Algorithm (4)

The Sum-Product Algorithm (5)

The Sum-Product Algorithm (6)

The Sum-Product Algorithm (7)

Initialization

The Sum-Product Algorithm (8)

To compute local marginals:
• Pick an arbitrary node as root.

• Compute and propagate messages from the leaf nodes to the root, storing received messages at every node.

• Compute and propagate messages from the root to the leaf nodes, storing received messages at every node.

• Compute the product of received messages at each node for which the marginal is required, and normalize if necessary.
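For reference, the standard sum-product message updates that this schedule propagates (following Bishop's notation):

$\mu_{f \to x}(x) = \sum_{x_1, \dots, x_M} f(x, x_1, \dots, x_M) \prod_{m \in \mathrm{ne}(f) \setminus \{x\}} \mu_{x_m \to f}(x_m)$

$\mu_{x \to f}(x) = \prod_{l \in \mathrm{ne}(x) \setminus \{f\}} \mu_{f_l \to x}(x)$

$p(x) \propto \prod_{s \in \mathrm{ne}(x)} \mu_{f_s \to x}(x)$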

Sum-Product: Example (1)

Sum-Product: Example (2)

Sum-Product: Example (3)

Sum-Product: Example (4)

The Max-Sum Algorithm (1)

Objective: an efficient algorithm for finding

i. the value xmax that maximises p(x);

ii. the value of p(xmax).

In general, maximum marginals ≠ joint maximum.
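A concrete check with an assumed toy joint over two binary variables: maximizing each marginal separately picks a configuration that is not the joint maximizer.

```python
import numpy as np

# Toy joint p(x, y) over {0,1}^2 (assumed values)
p = np.array([[0.3, 0.3],      # p(x=0, y=0), p(x=0, y=1)
              [0.4, 0.0]])     # p(x=1, y=0), p(x=1, y=1)

x_marg_max = p.sum(axis=1).argmax()                  # maximizes p(x): x = 0 (0.6 vs 0.4)
y_marg_max = p.sum(axis=0).argmax()                  # maximizes p(y): y = 0 (0.7 vs 0.3)
joint_max = np.unravel_index(p.argmax(), p.shape)    # joint maximizer: (x, y) = (1, 0)

print((x_marg_max, y_marg_max), joint_max)           # (0, 0) vs (1, 0): they differ
```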

The Max-Sum Algorithm (2)

Maximizing over a chain (max-product)

The Max-Sum Algorithm (3)

Generalizes to a tree-structured factor graph, maximizing as close to the leaf nodes as possible.

The Max-Sum Algorithm (4)

Max-Product → Max-Sum

For numerical reasons, use $\ln p(\mathbf{x})$: $\ln \max_{\mathbf{x}} p(\mathbf{x}) = \max_{\mathbf{x}} \ln p(\mathbf{x})$.

Again, use the distributive law: $\max(a + b, a + c) = a + \max(b, c)$.

The Max-Sum Algorithm (5)

Initialization (leaf nodes)

Recursion

The Max-Sum Algorithm (6)

Termination (root node)

Back-track, for all nodes i with l factor nodes to the root (l=0)
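A minimal max-sum sketch on a discrete chain in log space: a forward recursion of max-messages with back-pointers, termination at the root, then back-tracking; the result is compared against brute-force maximization. The pairwise log-potentials are assumed random values.

```python
import numpy as np
from itertools import product

K, N = 3, 5                                                         # states, chain length (example)
rng = np.random.default_rng(0)
log_psi = [np.log(rng.random((K, K)) + 0.1) for _ in range(N - 1)]  # pairwise log-potentials (assumed)

# Forward max-sum recursion with back-pointers
msg = np.zeros(K)                            # initialization at the leaf node x_1
back = []
for n in range(N - 1):
    scores = msg[:, None] + log_psi[n]       # indexed [x_n, x_{n+1}]
    back.append(scores.argmax(axis=0))       # best x_n for each value of x_{n+1}
    msg = scores.max(axis=0)

# Termination at the root, then back-track
x = [int(msg.argmax())]
for bp in reversed(back):
    x.append(int(bp[x[-1]]))
x_max = x[::-1]

# Check against brute-force maximization of the (unnormalized) log-probability
def logp(cfg):
    return sum(log_psi[n][cfg[n], cfg[n + 1]] for n in range(N - 1))

brute = max(product(range(K), repeat=N), key=logp)
print(x_max, tuple(x_max) == brute, logp(x_max))
```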
