Natural Language Processing
Spring 2007
V. “Juggy” Jagannathan
Course Book: Foundations of Statistical Natural Language Processing
By Christopher Manning & Hinrich Schütze
Chapter 9
Markov Models
March 5, 2007
Markov models

• Markov assumption – Suppose X = (X1, …, XT) is a sequence of random variables taking values in some finite set S = {s1, …, sN}. The Markov properties are:
• Limited horizon – P(Xt+1 = sk|X1,…,Xt) = P(Xt+1 = sk|Xt)
  – i.e. the value at t+1 depends only on the value at t
• Time invariant (stationary) – the transition probabilities do not change with t
• Stochastic transition matrix A:
  – aij = P(Xt+1 = sj|Xt = si), where aij ≥ 0 for all i, j and Σj=1..N aij = 1 for all i
Markov model example

P(X1, …, XT) = P(X1) P(X2|X1) P(X3|X1,X2) … P(XT|X1,…,XT-1)
             = P(X1) P(X2|X1) P(X3|X2) … P(XT|XT-1)
             = πX1 Πt=1..T-1 aXtXt+1

Example: P(t, i, p) = P(X1 = t) P(X2 = i|X1 = t) P(X3 = p|X2 = i)
                    = 1.0 × 0.3 × 0.6 = 0.18
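The chain-rule product above can be sketched in a few lines of Python. Only the probabilities the P(t, i, p) example actually uses are filled in below; a full model would specify the complete transition matrix, so treat this as a hypothetical partial model.

```python
# Probability of a state sequence under a first-order Markov chain:
# P(X1..XT) = pi[X1] * product of a[Xt][Xt+1] over t = 1..T-1.
def chain_probability(seq, pi, a):
    p = pi[seq[0]]
    for prev, cur in zip(seq, seq[1:]):
        p *= a[prev][cur]
    return p

# Partial model: only the entries used by the slide's example.
pi = {"t": 1.0}
a = {"t": {"i": 0.3}, "i": {"p": 0.6}}
print(chain_probability(["t", "i", "p"], pi, a))  # ≈ 0.18
```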
Probability of the output sequence {lem, ice_t} given the machine starts in CP:
0.3 × 0.7 × 0.1 + 0.3 × 0.3 × 0.7 = 0.021 + 0.063 = 0.084
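The sum above enumerates the two hidden-state paths that can produce {lem, ice_t} from CP. A sketch of that enumeration follows; the transition and emission values beyond the three factors shown in the slide are assumed from the course book's soft drink machine example (CP/IP states emitting cola, ice_t, lem).

```python
# Assumed parameters (course book's soft drink machine example);
# only the factors 0.3, 0.7, 0.1, 0.3, 0.7 appear in the slide itself.
trans = {"CP": {"CP": 0.7, "IP": 0.3}, "IP": {"CP": 0.5, "IP": 0.5}}
emit = {"CP": {"cola": 0.6, "ice_t": 0.1, "lem": 0.3},
        "IP": {"cola": 0.1, "ice_t": 0.7, "lem": 0.2}}

def output_probability(outputs, start):
    """Sum P(outputs, path) over all hidden paths starting in `start`."""
    paths = [(start, emit[start][outputs[0]])]  # (current state, prob so far)
    for o in outputs[1:]:
        paths = [(nxt, p * trans[s][nxt] * emit[nxt][o])
                 for s, p in paths for nxt in trans[s]]
    return sum(p for _, p in paths)

print(output_probability(["lem", "ice_t"], "CP"))  # ≈ 0.084
```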
Hidden Markov Model Example

Why use HMMs?
• Underlying hidden events generate the surface observable events
• E.g. predicting weather based on the dampness of seaweed:
  http://www.comp.leeds.ac.uk/roger/HiddenMarkovModels/html_dev/main.html
• Linear interpolation in n-gram models:
  Pli(wn|wn-1,wn-2) = λ1 P1(wn) + λ2 P2(wn|wn-1) + λ3 P3(wn|wn-1,wn-2)

Look at the notes from David Meir Blei [UC Berkeley], slides 1-13:
http://www-nlp.stanford.edu/fsnlp/hmm-chap/blei-hmm-ch9.ppt
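The interpolation formula can be sketched directly from maximum-likelihood n-gram estimates. The toy corpus and lambda weights below are invented for illustration; in practice the weights are tuned on held-out data.

```python
# Hedged sketch: linear interpolation of unigram, bigram and trigram
# estimates, P_li = l1*P1(wn) + l2*P2(wn|wn-1) + l3*P3(wn|wn-1,wn-2).
from collections import Counter

corpus = "the cat sat on the mat the cat ran".split()  # toy data
uni = Counter(corpus)
bi = Counter(zip(corpus, corpus[1:]))
tri = Counter(zip(corpus, corpus[1:], corpus[2:]))
N = len(corpus)

def p_uni(w):
    return uni[w] / N

def p_bi(w, prev):
    return bi[(prev, w)] / uni[prev] if uni[prev] else 0.0

def p_tri(w, prev2, prev1):
    return tri[(prev2, prev1, w)] / bi[(prev2, prev1)] if bi[(prev2, prev1)] else 0.0

def p_li(w, prev2, prev1, lambdas=(0.2, 0.3, 0.5)):
    l1, l2, l3 = lambdas  # illustrative weights; must sum to 1
    return l1 * p_uni(w) + l2 * p_bi(w, prev1) + l3 * p_tri(w, prev2, prev1)

print(p_li("sat", "the", "cat"))
```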
Forward Procedure

αi(t) = P(o1 … ot-1, Xt = i | μ)

Initialization: αi(1) = πi, 1 ≤ i ≤ N
Induction: αj(t+1) = Σi=1..N αi(t) aij bijot, 1 ≤ t ≤ T, 1 ≤ j ≤ N
Total computation: P(O|μ) = Σi=1..N αi(T+1)
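The forward recursion can be sketched as below, in the arc-emission form used here (bijo is the probability of emitting o while moving from i to j). The two-state model is invented for illustration.

```python
# Hedged sketch of the forward procedure (arc-emission form):
# alpha_j(t+1) = sum_i alpha_i(t) * a[i][j] * b[i][j][o_t].
def forward(obs, pi, a, b):
    N = len(pi)
    alpha = [pi[:]]                      # alpha_i(1) = pi_i
    for o in obs:
        prev = alpha[-1]
        alpha.append([sum(prev[i] * a[i][j] * b[i][j][o] for i in range(N))
                      for j in range(N)])
    return sum(alpha[-1])                # P(O|mu) = sum_i alpha_i(T+1)

# Invented toy model (two states, outputs "x" and "y").
pi = [0.6, 0.4]
a = [[0.7, 0.3], [0.4, 0.6]]
b = [[{"x": 0.5, "y": 0.5}, {"x": 0.9, "y": 0.1}],
     [{"x": 0.2, "y": 0.8}, {"x": 0.6, "y": 0.4}]]
print(forward(["x", "y"], pi, a, b))
```

As a sanity check, the probabilities of all length-2 observation sequences should sum to 1.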
Backward Procedure

βi(t) = P(ot … oT | Xt = i, μ)

Initialization: βi(T+1) = 1, 1 ≤ i ≤ N
Induction: βi(t) = Σj=1..N aij bijot βj(t+1), 1 ≤ t ≤ T, 1 ≤ i ≤ N
Total computation: P(O|μ) = Σi=1..N πi βi(1)
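The backward recursion is the mirror image of the forward one, walking the observations right to left. Same arc-emission convention and the same invented two-state toy model as before.

```python
# Hedged sketch of the backward procedure (arc-emission form):
# beta_i(t) = sum_j a[i][j] * b[i][j][o_t] * beta_j(t+1).
def backward(obs, pi, a, b):
    N = len(pi)
    beta = [1.0] * N                     # beta_i(T+1) = 1
    for o in reversed(obs):
        beta = [sum(a[i][j] * b[i][j][o] * beta[j] for j in range(N))
                for i in range(N)]
    return sum(pi[i] * beta[i] for i in range(N))   # P(O|mu)

# Invented toy model (two states, outputs "x" and "y").
pi = [0.6, 0.4]
a = [[0.7, 0.3], [0.4, 0.6]]
b = [[{"x": 0.5, "y": 0.5}, {"x": 0.9, "y": 0.1}],
     [{"x": 0.2, "y": 0.8}, {"x": 0.6, "y": 0.4}]]
print(backward(["x", "y"], pi, a, b))
```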
Combining both – forward and backward

P(O, Xt = i | μ) = P(o1 … ot-1, Xt = i | μ) P(ot … oT | Xt = i, μ)
                 = αi(t) βi(t)

P(O|μ) = Σi=1..N αi(t) βi(t), 1 ≤ t ≤ T+1
Finding the best state sequence

To determine the state sequence that best explains the observations, let:

γi(t) = P(Xt = i | O, μ) = P(Xt = i, O | μ) / P(O|μ) = αi(t) βi(t) / Σj=1..N αj(t) βj(t)

Individually the most likely state is:

X̂t = argmax1≤i≤N γi(t), 1 ≤ t ≤ T+1

This approach, however, does not correctly estimate the most likely state sequence.
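The per-time posteriors γi(t) combine the full forward and backward tables. A sketch, again with an invented two-state arc-emission toy model:

```python
# Hedged sketch: gamma_i(t) = alpha_i(t)*beta_i(t) / sum_j alpha_j(t)*beta_j(t).
def posteriors(obs, pi, a, b):
    N = len(pi)
    alpha = [pi[:]]                      # forward table, alpha_i(1) = pi_i
    for o in obs:
        alpha.append([sum(alpha[-1][i] * a[i][j] * b[i][j][o] for i in range(N))
                      for j in range(N)])
    beta = [[1.0] * N]                   # backward table, beta_i(T+1) = 1
    for o in reversed(obs):
        beta.insert(0, [sum(a[i][j] * b[i][j][o] * beta[0][j] for j in range(N))
                        for i in range(N)])
    gammas = []
    for al, be in zip(alpha, beta):      # one row per t = 1..T+1
        z = sum(x * y for x, y in zip(al, be))
        gammas.append([x * y / z for x, y in zip(al, be)])
    return gammas

# Invented toy model (two states, outputs "x" and "y").
pi = [0.6, 0.4]
a = [[0.7, 0.3], [0.4, 0.6]]
b = [[{"x": 0.5, "y": 0.5}, {"x": 0.9, "y": 0.1}],
     [{"x": 0.2, "y": 0.8}, {"x": 0.6, "y": 0.4}]]
print(posteriors(["x", "y"], pi, a, b))
```

Each row is a distribution over states at one time step, so each row sums to 1.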
Finding the best state sequence – Viterbi algorithm

X̂ = argmaxX P(X | O, μ)

δj(t) = maxx1…xt-1 P(x1 … xt-1, o1 … ot-1, Xt = j | μ)

Store the most probable path that leads to a given node.

Initialization: δj(1) = πj, 1 ≤ j ≤ N
Induction: δj(t+1) = max1≤i≤N δi(t) aij bijot, 1 ≤ j ≤ N
Store backtrace: ψj(t+1) = argmax1≤i≤N δi(t) aij bijot, 1 ≤ j ≤ N
Termination:
X̂T+1 = argmax1≤i≤N δi(T+1)
P(X̂) = max1≤i≤N δi(T+1)
Parameter Estimation

Probability of traversing an arc from state i to state j at time t, given observation sequence O:

pt(i, j) = P(Xt = i, Xt+1 = j | O, μ)
         = P(Xt = i, Xt+1 = j, O | μ) / P(O|μ)
         = αi(t) aij bijot βj(t+1) / Σm=1..N αm(t) βm(t)

With γi(t) = Σj=1..N pt(i, j):

Σt=1..T γi(t) = expected number of transitions from state i in O
Σt=1..T pt(i, j) = expected number of transitions from state i to j in O
Parameter Estimation

Re-estimation formulas:

âij = Σt=1..T pt(i, j) / Σt=1..T γi(t)
    = expected number of transitions from i to j / expected number of transitions from i

b̂ijk = Σ{t : ot = k, 1 ≤ t ≤ T} pt(i, j) / Σt=1..T pt(i, j)
    = expected number of transitions from i to j with k observed / expected number of transitions from i to j
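One re-estimation step for the transition matrix can be sketched by accumulating the pt(i, j) expected counts and normalizing. The toy model is invented; a full Baum-Welch implementation would also re-estimate the emission probabilities and iterate to convergence.

```python
# Hedged sketch of one Baum-Welch step for a_ij:
# a_hat[i][j] = sum_t p_t(i,j) / sum_t gamma_i(t).
def reestimate_a(obs, pi, a, b):
    N = len(pi)
    alpha = [pi[:]]                      # forward table
    for o in obs:
        alpha.append([sum(alpha[-1][i] * a[i][j] * b[i][j][o] for i in range(N))
                      for j in range(N)])
    beta = [[1.0] * N]                   # backward table
    for o in reversed(obs):
        beta.insert(0, [sum(a[i][j] * b[i][j][o] * beta[0][j] for j in range(N))
                        for i in range(N)])
    num = [[0.0] * N for _ in range(N)]  # sum_t p_t(i,j)
    den = [0.0] * N                      # sum_t gamma_i(t)
    for t, o in enumerate(obs):
        z = sum(alpha[t][m] * beta[t][m] for m in range(N))   # P(O|mu)
        for i in range(N):
            for j in range(N):
                p = alpha[t][i] * a[i][j] * b[i][j][o] * beta[t + 1][j] / z
                num[i][j] += p
                den[i] += p
    return [[num[i][j] / den[i] for j in range(N)] for i in range(N)]

# Invented toy model (two states, outputs "x" and "y").
pi = [0.6, 0.4]
a = [[0.7, 0.3], [0.4, 0.6]]
b = [[{"x": 0.5, "y": 0.5}, {"x": 0.9, "y": 0.1}],
     [{"x": 0.2, "y": 0.8}, {"x": 0.6, "y": 0.4}]]
print(reestimate_a(["x", "y", "x"], pi, a, b))
```

By construction the new rows normalize, since each âi· is a ratio of expected counts from the same state.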