Hidden Markov Model in Automatic Speech Recognition Z. Fodroczi Pazmany Peter Catholic. Univ

Hidden Markov Model in Automatic Speech Recognition

Z. Fodroczi

Pazmany Peter Catholic. Univ.

Outline

Discrete Markov processHidden Markov Model

Vitterbi algorithm Forward algorithm Parameter estimation Type of them

HMMs in ASR SystemsPopular modelsState of the art and limitation

Discrete Markov Processes

a11 a22

a23a32

States={S1,S2,S3} P(qt=Si|qt-1=Sj)=aij Sum(j=1..N)

(aij)=1

Consider the following model of weather:•S1: rain or snow•S2: cloudy•S3: sunny

8.01.01.0

2.06.02.0

3.03.04.0

What is the probability that the weather for the next 3 days will be “sun,sun,rain”.O={S3,S3,S1} – observation sequence

P(O|Model)=P(S3,S3,S1|Model)=P(S3)*P(S3|S3)*P(S3|S1)=1*0.8*0.3=0.24

Hidden Markov Model

The observation is a probabilistic function of the state.

Teacher-mood-modelSituation: Your school teacher gave three dierent types of daily homework assignments:

A: took about 5 minutes to complete B: took about 1 hour to complete C: took about 3 hours to complete

Your teacher did not reveal openly his mood to you daily, but you knew that your teacher was either in a bad, neutral, or a good mood for a whole day.Mood changes occurred only overnight.

Question: How were his moods related to the homework type assigned that day?

Hidden Markov Model

Model parameters:

S - states {good, neutral, bad}akl,- probability that state l is

followed by kΣ – alphabet {A,B,C} ek(x) –probability that state k

emit symbol x€Σ

Hidden Markov Model

One week, your teacher gave the following homework assignments: Monday: A Tuesday: C Wednesday: B Thursday: A Friday: C

Questions: What did his mood curve look like most likely that week?

(Searching for the most probable path – Viterbi algorithm)

What is the probability that he would assign this order of homework assignments? (Probability of a sequence - Forward algorithm)

How do we adjust the model parameters λ(S,aij,ei(x)) to maximize P(O| λ)(create a HMM for a given sequence set)

HMM – Viterbi algorithm

Given: Hidden Markov model:

S, akl ,Σ ,ek(x)

Observed symbol sequence E = x1x2, … xn.

Most probable path of states that resulted in symbol sequence E.

Let vk(i) be the probability of the most probable path of the symbol sequence x1, x2, …. xi ending in state k. Then:

Matrix vk(i), where k€S and 1 <= I <= n.

Initialization: vk(1) = ek(x1)/#states for all states k€S .

vl(i) = el(xi)maxk(vk(i - 1)akl) for all states k€S , i >=2.

Algorithm Iteratively build up matrix vk(i). Store pointers to chosen path. Probability of most probable path in maximum entry in

last column. Reconstruct path along pointers.

Empty table

Question: What did his mood curve look like most likely that week?

Answer: Most probable mood curve:

Day: Mon Tue Wed Thu FriAssignment: A C B A CMood: good bad neutral good bad

HMM – Forward algorithm

Used to test the probability of a sequence.

Given: Hidden Markov model: S, akl, , el(x). Observed symbol sequence E = x1; : : : ; xn.

What is the probability of E.

Let fk(i) be the probability of the symbol sequence x1, x2, …. xi ending in state k. Then:

Matrix fk(i), where k€S and 1 <= i <= n.

Initialization: fk(1) = ek(x1)=#states for all states k€S . fl(i) = el(xi)Pk(fk(i - 1)akl) for all states k€S , i <=2.

Algorithm Iteratively build up matrix fk(i).

Probability of symbol sequence is sum of entries in last column.

HMM – Parameter estimation

Type of HMMs

Till now I was speaking about HMMs with discrete output (finite |Σ|).

Extensions: continuous observation probability density function mixture of Gaussian pdfs

HMMs in ASR

Separation Feature extraction

HMMdecoder

UnderstandingApplication

HMMs in ASR

How HMM can used to classify feature sequences to known classes.

Make a HMM to each class.

By determineing the probability of a sequence to the HMMs, we can decide which HMM could most probable generate the sequence.

There are several idea what to model: Isolated word recognition ( HMM for each known word)

Usable just on small dictionaries. (digit recognition etc.)Number of states usually >=4. Left-to-rigth HMM

Monophone acoustic model ( HMM for each phone)~50 HMM

Triphone acoustic model (HMM for each three phone sequence)50^3 = 125000 triphoneseach triphone has 3 state

HMMs in ASR

Hierarchical system of HMMs

HMM of a triphone HMM of a triphone HMM of a triphone

Higher level HMM of a word

Language model

ASR state of the art

Typical state-of-the-art large-vocabulary ASR system:- speaker independent

- 64k word vocabulary

- trigram (2-word context) language model

- multiple pronunciations for each word

- triphone or quinphone HMM-based acoustic model

- 100-300X real-time recognition

- WER 10%-50%

HMM Limitations

Data intensive Computationally intensive

50 phones = 125000 possible triphones 3 states per triphone 3 Gaussian mixture for each state

64k word vocabulary 262 trillion trigrams 2-20 phonemes per word in 64k vocabulary

39 dimensional feature vector sampled every 10ms 100 frame per second

Hidden Markov Model in Automatic Speech Recognition Z. Fodroczi Pazmany Peter Catholic. Univ

Documents

Markov Processes and Controlled Markov Chains

Hidden Markov Models in Bioinformaticscsatol/mach_learn/bemutato/Mate_Korosi_HMMpr… · Outline ˜ Markov Chain ˜ HMM (Hidden Markov Model) ˜ Hidden Markov Models in Bioinformatics

L13: hidden Markov modelscourses.cs.tamu.edu/rgutier/csce630_f14/l13.pdf · L13: hidden Markov models • Discrete Markov processes • Hidden Markov models • Forward and Backward

A Markov Chain Financial Market - Department of Statisticsaldous/206-RWG/RWG... · A Markov Chain Financial Market Ragnar Norberg Univ. Copenhagen/London School of Economics/Univ

9 Markov Chains Regular Markov Chains Absorbing Markov Chains Game Theory and Strictly Determined Games Games with Mixed Strategies Markov Chains

9 Markov chains and Hidden Markov Models - Freie … · 9 Markov chains and Hidden Markov Models We will discuss: Markov chains Hidden Markov Models (HMMs) Algorithms: Viterbi, forward,

Markov Chains on Countable State Space 1 Markov Chains ...stevel/565/lectures/5c Markov chain.pdf · Markov Chains on Countable State Space 1 Markov Chains Introduction 1. Consider

Markov Systems and Markov Decision Processes...States, Actions, Observations Passive Controlled Fully Observable Markov Model Markov Decision Process (MDP) Hidden State Hidden Markov

Hidden Markov Models€¦ · •Hidden Markov models 1 The “Markov”swe have learned so far. Markov Models 2 •A Markov model is a chain-structured BN –Conditional probabilities

AUTHORS FR: MARKOV, G.L. TO: MARKOV, V.A. · Title: AUTHORS FR: MARKOV, G.L. TO: MARKOV, V.A. Subject: AUTHORS FR: MARKOV, G.L. TO: MARKOV, V.A. Keywords: Tho IL-28 kircrtSt (Technical

Markov Chains. Summary Markov Chains Discrete Time Markov Chains Homogeneous and non-homogeneous Markov chains Transient and steady state Markov chains

1 Algebraic Structure in Almost-Symmetries Igor Markov, Univ. of Michigan Presented by Ian Gent, St. Andrews

Nov. 2005 M. Huang Northwestern Univ. 1 Markov Chain Population Models in Medical Decision Making Gordon Hazen Min Huang Northwestern University

Markov Chains and Hidden Markov Models - Cornell …physiology.med.cornell.edu/people/banfelder/qbio/resources_2012... · Conclusion: Introduction to Markov Chains and Hidden Markov

Markov Chains - 1 Markov Chains Chapter 16. Markov Chains - 2 Overview Stochastic Process Markov Chains Chapman-Kolmogorov Equations State classification

Part1 Markov Models for Pattern Recognition – Introduction CSE717, SPRING 2008 CUBS, Univ at Buffalo

Stochastic processes and Markov chains (part I)Markov ... · Stochastic processes and Markov chains (part I)Markov chains (part I) Wessel van Wieringen ... a Markov chain is an acceptable

2 1 Discrete Markov Processes (Markov Chains) 3 1 First-Order Markov Models

Markov Chains - UTKweb.eecs.utk.edu/.../cs594_spring2017/presentations/markov-chains.pdf · Overview of Markov Chains •What is a Markov Chain? •A discrete-time stochastic process

Pazmany Aircraft Corporationpazmany.com/wp/pdf/PL-9_profile.pdfPazmany Aircraft Corporation P.O. Box 60577 San Diego, California 92166 Fax (619) 224-7358 The Pazmany PL-9 Stork can