
Hidden Markov Models
Sargur Srihari
srihari@cedar.buffalo.edu

HMM Overview

Role of HMMs in ML
• A ubiquitous tool for modeling time-series data
• Used in almost all speech recognition systems
• Computational molecular biology, e.g., grouping amino acid sequences into proteins
• An HMM is a Bayesian network for representing probability distributions over sequences of observations
• An HMM has two defining properties:
• The observation xt at time t was generated by some process whose state zt is hidden from the observer
• The state zt depends only on the previous state zt-1 and is independent of all prior states (first-order Markov assumption)
• Example: z is a sequence of phonemes, x is a sequence of acoustic observations


Graphical Model of an HMM
• Has the state space model shown below, in which the latent variables are discrete
• The joint distribution has the form

p(x_1, \ldots, x_N, z_1, \ldots, z_N) = p(z_1) \left[ \prod_{n=2}^{N} p(z_n \mid z_{n-1}) \right] \prod_{n=1}^{N} p(x_n \mid z_n)


Mixture Viewed as HMM
• A single time slice corresponds to a mixture distribution with component densities p(x|z)
• The HMM is an extension of the mixture model: the choice of mixture component depends on the choice of component for the previous observation
• The latent variables are multinomial variables zn that describe which component is responsible for generating xn
• A 1-of-K coding scheme can be used for zn

Transition Probability Matrix
• In the joint distribution we allow zn to depend on zn-1 via p(zn|zn-1)
• Because the latent variables are 1-of-K coded, this conditional distribution is specified by a transition probability matrix A with Ajk = p(zn,k = 1 | zn-1,j = 1)
• The conditional distribution is stated below
• The exponent zn-1,j zn,k, a product of two binary indicators, is either 0 or 1
• Hence the product evaluates to a single factor Ajk for each setting of the values of zn and zn-1 (a small numerical sketch follows the transition-diagram notes below)
• Thus p(zn = 3 | zn-1 = 2) = A23

p(z_n \mid z_{n-1}, A) = \prod_{k=1}^{K} \prod_{j=1}^{K} A_{jk}^{\, z_{n-1,j} \, z_{n,k}}

• The transition diagram below shows the case where the latent variable has three possible states (K = 3)
• It is not a graphical model, since the nodes are not separate variables but states of a single variable
[Figure: transition diagram linking the states of zn-1 to the states of zn via the entries of the A matrix]
• Since each row of A sums to one, the matrix A has K(K - 1) independent parameters
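As a minimal numerical sketch (the matrix values and the one_hot helper are illustrative, not from the slides), the product form of p(zn|zn-1, A) collapses to a single entry of A, e.g. p(zn = 3 | zn-1 = 2) = A23:

```python
import numpy as np

# Illustrative K = 3 transition matrix; row j holds p(z_n = k | z_{n-1} = j).
# Each row sums to 1, so A has K*(K-1) = 6 independent parameters.
A = np.array([[0.90, 0.05, 0.05],
              [0.10, 0.80, 0.10],
              [0.05, 0.15, 0.80]])
assert np.allclose(A.sum(axis=1), 1.0)

def one_hot(k, K=3):
    """1-of-K coding: a binary vector with a single 1 at position k."""
    v = np.zeros(K)
    v[k] = 1.0
    return v

# z_{n-1} in state 2 and z_n in state 3 (0-based indices 1 and 2).
z_prev, z_now = one_hot(1), one_hot(2)

# Product form prod_k prod_j A_jk^{z_{n-1,j} z_{n,k}}: the exponent is 1 only
# for the single active (j, k) pair, so the whole product reduces to A[1, 2].
prod_form = np.prod(A ** np.outer(z_prev, z_now))
print(prod_form, A[1, 2])   # both print 0.10
```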

The joint distribution then factorizes as before:

p(x_1, \ldots, x_N, z_1, \ldots, z_N) = p(z_1) \left[ \prod_{n=2}^{N} p(z_n \mid z_{n-1}) \right] \prod_{n=1}^{N} p(x_n \mid z_n)


Initial Variable Probabilities
• The initial latent node z1 does not have a parent node
• Its marginal distribution p(z1) is represented by a vector of probabilities π with elements πk = p(z1k = 1)
• Note that π is an HMM parameter, representing the probabilities of each state for the first latent variable
• The distribution is written as

p(z_1 \mid \pi) = \prod_{k=1}^{K} \pi_k^{\, z_{1k}}, \qquad \text{where} \; \sum_k \pi_k = 1


Unfolding the State Transitions over Time
• Lattice or trellis representation of the latent states
[Figure: the state transition diagram unfolded over time; each column corresponds to one of the latent variables zn]


Emission Probabilities p(xn|zn)
• The specification of the probabilistic model is completed by defining the conditional distributions of the observed variables, p(xn|zn, ϕ), where ϕ is a set of parameters
• These are known as emission probabilities; they can be continuous (e.g., Gaussians) or discrete (e.g., tables)
• Because xn is observed, the distribution p(xn|zn, ϕ) consists, for a given ϕ, of a vector of K numbers corresponding to the K possible states of the binary vector zn
• The emission probabilities can be represented as

p(x_n \mid z_n, \phi) = \prod_{k=1}^{K} p(x_n \mid \phi_k)^{\, z_{nk}}
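A hedged sketch of evaluating this emission probability for a Gaussian emission model (the means and covariances below are invented for illustration): only the density of the active state contributes, since the other exponents are zero.

```python
import numpy as np
from scipy.stats import multivariate_normal

K = 3
# Illustrative Gaussian emission parameters phi_k = (mean_k, cov_k), one per state.
means = [np.array([0.0, 0.0]), np.array([3.0, 3.0]), np.array([6.0, 0.0])]
covs  = [np.eye(2), np.eye(2), np.eye(2)]

def emission_prob(x_n, z_onehot):
    """p(x_n | z_n, phi) = prod_k p(x_n | phi_k)^{z_nk}."""
    p = 1.0
    for k in range(K):
        p *= multivariate_normal.pdf(x_n, means[k], covs[k]) ** z_onehot[k]
    return p

x_n = np.array([2.5, 2.8])
z_n = np.array([0.0, 1.0, 0.0])    # state 2 is active
# Equals the state-2 Gaussian density evaluated at x_n.
print(emission_prob(x_n, z_n))
```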

Homogeneous Models
• All of the conditional distributions governing the latent variables share the same parameters A
• All of the emission distributions share the same parameters ϕ


Joint Distribution over Latent and Observed Variables
• The joint distribution can be expressed in terms of the parameters:

p(X, Z \mid \theta) = p(z_1 \mid \pi) \left[ \prod_{n=2}^{N} p(z_n \mid z_{n-1}, A) \right] \prod_{m=1}^{N} p(x_m \mid z_m, \phi), \quad \text{where} \; X = \{x_1, \ldots, x_N\}, \; Z = \{z_1, \ldots, z_N\}, \; \theta = \{\pi, A, \phi\}

• Most of the discussion of HMMs is independent of the form of the emission probabilities
• It is tractable for discrete tables, Gaussians, and mixtures of Gaussians
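A minimal sketch (the function name and toy numbers are mine, not from the slides) that evaluates log p(X, Z | θ) for a fully observed state sequence, following the factorization above with a discrete emission table:

```python
import numpy as np

def hmm_joint_log_prob(z, x, pi, A, emission_log_prob):
    """log p(X, Z | theta) = log p(z_1 | pi) + sum_{n>=2} log p(z_n | z_{n-1}, A)
    + sum_n log p(x_n | z_n, phi), with z given as integer state indices."""
    logp = np.log(pi[z[0]]) + emission_log_prob(x[0], z[0])
    for n in range(1, len(z)):
        logp += np.log(A[z[n - 1], z[n]]) + emission_log_prob(x[n], z[n])
    return logp

# Toy HMM with 2 states and 2 discrete observation symbols.
pi = np.array([0.6, 0.4])
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.9, 0.1],        # B[k, s] = p(x = s | z = k)
               [0.2, 0.8]])
emis = lambda x_n, k: np.log(B[k, x_n])

print(hmm_joint_log_prob(z=[0, 0, 1], x=[0, 1, 1], pi=pi, A=A, emission_log_prob=emis))
```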

HMM from a generative viewpoint
• We can get a better understanding of HMMs by considering the model from a generative viewpoint
• First choose the latent variable z1 with probabilities governed by πk, then sample the corresponding observation x1
• Next choose the state of the variable z2 according to the transition probabilities p(z2|z1), conditioned on the already instantiated value of z1, and so on
• This is an example of ancestral sampling for a directed PGM


Example of sampling from an HMM
• Generative viewpoint
• Three states of the latent variable
• Gaussian emission model p(x|z) with two-dimensional x
• 50 data points generated
• Transition probabilities: 5% probability of making a transition to each of the other states, 90% probability of remaining in the same state
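A hedged sketch of the ancestral-sampling procedure for a setup like this one (the emission means, initial distribution, and seed are invented; only the 3 states, 2-D Gaussian emissions, 50 points, and 0.90/0.05 transition structure come from the slide):

```python
import numpy as np

rng = np.random.default_rng(0)
K, N = 3, 50

pi = np.full(K, 1.0 / K)                      # uniform initial state distribution (assumed)
A = np.full((K, K), 0.05) + 0.85 * np.eye(K)  # 0.90 on the diagonal, 0.05 elsewhere
means = np.array([[0.0, 0.0],                 # illustrative 2-D emission means, one per state
                  [5.0, 5.0],
                  [10.0, 0.0]])
cov = np.eye(2)

# Ancestral sampling: draw z_1 from pi, then z_n from row z_{n-1} of A,
# sampling x_n from the active state's Gaussian after each latent draw.
z = np.empty(N, dtype=int)
x = np.empty((N, 2))
z[0] = rng.choice(K, p=pi)
x[0] = rng.multivariate_normal(means[z[0]], cov)
for n in range(1, N):
    z[n] = rng.choice(K, p=A[z[n - 1]])
    x[n] = rng.multivariate_normal(means[z[n]], cov)

print(z[:10])   # long runs of the same state, reflecting the 0.90 self-transition probability
```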

A variant of HMM: Left-to-Right
• Obtained by imposing restrictions on the transition matrix A
• Left-to-right HMM: set the elements Ajk = 0 if k < j
• Once a state has been vacated, it cannot be re-entered
• The corresponding lattice has links only to states with the same or a higher index
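A minimal sketch (the base matrix is arbitrary) of imposing the left-to-right constraint by zeroing Ajk for k < j and renormalizing each row; the commented-out extra mask shows the stricter "stay or advance by one" restriction used in the digit example on the next slide.

```python
import numpy as np

K = 4
rng = np.random.default_rng(1)

# Start from an arbitrary matrix of positive values (illustrative only).
A = rng.random((K, K))

# Left-to-right constraint: A_jk = 0 whenever k < j (keep the upper triangle),
# so once a state has been vacated it can never be re-entered.
A = np.triu(A)

# Stricter variant used for the digit model: allow only staying in the same
# state or incrementing the state index by one.
# A = A * (np.eye(K) + np.eye(K, k=1))

A = A / A.sum(axis=1, keepdims=True)   # renormalize each row to sum to 1
print(np.round(A, 2))
```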


Left-to-Right HMM Applied to Digits
• Examples of on-line handwritten digits
• K = 16 states, each corresponding to a line segment of fixed length in one of 16 possible angles
• Emission probabilities: a 16 x 16 table of probabilities associated with the allowed angle values for each state
• Transition probabilities set to zero except for those that keep the state index k the same or increment it by one
• Parameters optimized by 25 iterations of EM

Top row: training set of 45 examples of the digit '2'
Bottom row: generated set

HMM invariance to time warping
• Time warping: compression and stretching of the time axis
• On-line handwriting recognition:
• A typical digit consists of two strokes: an arc starting at the top left and running down to a cusp or loop at the bottom left, followed by a roughly straight sweep ending at the bottom right
• The relative sizes of the two sections, and hence the location of the cusp or loop, vary from example to example
• The HMM accommodates this through the number of transitions to the same state versus transitions to the successive state
• Speech recognition:
• Warping of the time axis corresponds to variations in the speed of speech
• The HMM accommodates such distortion
