70
Finite State Morphology Assignment

Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Embed Size (px)

Citation preview

Page 1: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Finite State Morphology Assignment

Page 2: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Finite State MachineConsume input; move to next state

StartGot

selectiondispense

Got 10 cents

Got 5 cents

nicklenickle

dime

Push button

Improve the machine?• Eliminate one state (more elegant)• Add error message if you push the

button too soon• What to do if someone keeps

putting money in after the final state.

Page 3: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Finite State TransducerConsume input; produce output: move to next state

Start EndGot 10 cents

Got 5 cents

nickle: cha-ching

dime:cha-ching

Push-button:dispense

Improve the machine?• Eliminate one state (more elegant)• Add error message if you push the

button too soon• What to do if someone keeps

putting money in after the final state.

nickle: cha-ching

Page 4: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

An English Word• From David Crystal (Cambridge Encyclopedia of English)

• Grace (noun): graces– Graceful

• Ungraceful– Ungracefully– Ungracefulness

• Gracefully• Gracefulness

– Grace (verb): graces, graced, gracing– Disgrace (noun): disgraces

• Disgraceful– Disgracefully– Disgracefulness

• Disgrace (verb): disgraces, disgraced, disgracing

– Graceless• Gracelessly• Gracelessness

– Gracious• Graciously• Graciousness• Ungracious

– Ungraciously– Ungraciousness

Page 5: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Start

ADVADJ

V N

VgVdVs

Ns

dis

NN/N

grace

grace

un

ious

less

ful

ly

ness

eses

Ø

eding

es

1. Trace all the words based on “grace” to make sure that they all get through the machine.

2. What is wrong with the machine? (Overgenerate? Undergenerate? Redundant? Doesn’t capture structure and ambiguity?)

Adj/Asuff/N Adj/Asuffgrace

ful ious

Page 6: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Start

ADVADJ

V N

VgVdVs

Ns

dis:dis

NN/N

nstem:grace

un:un

ious:ious

less:less

ful:ful

ly:ly

ness: ness

pl:espl:es

Ø:Ø

pst:edprpart:ing3sgpres:es

1. input: dis nstem Ø pst2. output: disgraceed (we will fix it later)

Adj/Asuff/N Adj/Asuffnstem:grace

ful:ful ious:ious

nstem:grace

dis nstem Ø pst underlying form or upper formdisgraceed surface form or lower form

Page 7: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Underlying forms and surface forms

• Phonology:

– Underlying: INkᴧmplit (incomplete)

– Surface: Iŋkᴧmplit

• Orthography

– Underlying: happy+ness

– Surface: happiness

Page 8: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

A terrible twist of terminology in XFST

• The underlying form is called the upper form.

• The surface form is called the lower form.

• Think of it this way:

– in phonology class, if you are going to derive a word using a sequence of rules, you write the underlying form at the top of the page/board and write the derivation under it.

Page 9: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Underlying and surface forms

• Underlying forms can be glosses of morphemes.

• Underlying: happy[AdjCmpr]

• Surface: happier

• Underlying: happy[NDer][Npl]

• Surface: happinesses

Page 10: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Where do those things in square brackets come from?

• You make them up.

• I decided to call –er [AdjCmpr] for “adjective, comparative”

• You could call it something else.

Page 11: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

You have to make up labels for your morphemes

Page 12: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

It will look like code instead of circles and arrows

Multichar_Symbols [NSg] [NPl] [Adj] [AdjCmpr] [AdjSupr] [NDer]

LEXICON Root0:0 NRoot ;0:0 AdjRoot ;

LEXICON AdjRoothappy[Adj]:happy # ;happy:happi AdjSuffs ;

LEXICON AdjSuffs[AdjCmpr]:er # ;[AdjSupr]:est # ;[NDer]:ness NSuffs-es ;

LEXICON NRoot

book:book NSuffs-s ;

pencil:pencil NSuffs-s ;

child[NSg]:child #;

child[NPl]:children #;

rash:rash NSuffs-es;

LEXICON NSuffs-s

[NSg]:0 # ;

[NPl]:s # ;

LEXICON NSuffs-es

[NSg]:0 # ;

[NPl]:es # ;

Page 13: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Another way

LEXICON NRoot

book:book NSuffs;

pencil:pencil NSuffs;

child[NSg]:child #;

child[NPl]:children #;

rash:rash NSuffs;

LEXICON NSuffs

[NSg]:0 # ;

[NPl]:s # ;

s -> e s || s h _ .#. ;

Page 14: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

What will you do for this assignment?

• You will use a program called XFST (Xerox Finte State Transducer).

• If you give it an underlying form, it will give you the surface form.

• If you give it a surface form, it will give you the underlying form.

• XFST software: http://www.stanford.edu/~laurik/.book2software/

• Tutorial slides by Ken Beesley and Lauri Karttunen: http://www.stanford.edu/~laurik/fsmbook/lecture-notes/Beesley2004/index.html

Page 15: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 16: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

linux7:~/xfst/linux32> ~/xfst/linux32/xfst -l three-eng-plural-script.xfstOpening input file 'three-eng-plural-script.xfst'March 17, 2015 04:23:34 GMTOpening file eng-sibilant-plural.regex...Last line is not terminated.2.4 Kb. 6 states, 18 arcs, Circular.Closing file eng-sibilant-plural.regex...Defined 'R': 2.4 Kb. 6 states, 18 arcs, Circular.Opening input file 'three-eng-plurals-lexc.txt'March 03, 2015 04:31:01 GMTReading UTF-8 text from 'three-eng-plurals-lexc.txt'NRoot...5, NSuffs...2Building lexicon...Minimizing...Done!2.8 Kb. 21 states, 25 arcs, 8 paths.Closing 'three-eng-plurals-lexc.txt'Defined 'L': 2.8 Kb. 21 states, 25 arcs, 8 paths.2.9 Kb. 23 states, 28 arcs, 8 paths.Closing file three-eng-plural-script.xfst...Copyright © Palo Alto Research Center 2001-2015Xerox Finite-State Tool, version 2.10.23 (cfsm 2.12.18) (svn 25211)

Type "help" to list all commands available or "help help" for further help.

xfst[1]: up rashesrash[NPl]xfst[1]: down rash[NPl]rashesxfst[1]: up childrenchild[NPl]xfst[1]: up pencilpencil[NSg]xfst[1]: down child[NSg]childxfst[1]: down child[NPl]childrenxfst[1]:

Page 17: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Break the job down into small pieces

• Start with your hundred sentences and hundred noun phrases• Make a regression test suite (upper and lower forms)

– Include ill-formed words to be sure they fail

• Give a square bracket name to each morpheme• Install XFST and learn to run it.• Write a file called a lexc file.• Optional: write a rule file. • Test:

– If you input a word that is well-formed in your language, the output is the underlying form.

– If the word is not ambiguous, you should get one underlying form.– If the word is ambiguous (e.g., [unlock]able, un[lockable]), you should

get one underlying form for each meaning. – If you input an ill-formed word (e.g., happyness), there should be no

underlying form and the FST will output three question marks (???).

Page 18: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Input/Output behavior

• Input: well-formed upper• Output: corresponding lower form

• Input: lower (orthographic) form• Output: upper (lemma plus names of morphological

categories that are present)

• Input: illformed or not covered yet– *disful– *ungrace– cat

• Output: nothing

Page 19: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

The regression test suite

• Order of morphemes (morphotactics)– Include infixing

• Morpho-phonological alternations– Insertions, deletions, assimilations

• The test suite shows the language your machine accepts (in computer science terms)– E.g., your machine accepts (ab)n

– Test suite includes ab, abab, ababab, etc.

– Test suite includes aa, aba, etc.

Page 20: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Review of Karttunen and Beesleytutorial on lexc and fst

Page 21: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Look for more examples at

• http://web.stanford.edu/~laurik/fsmbook/LSA-207/

Page 22: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

The tutorial by Beesley and Karttunenis divided

• Monday morning

• Monday afternoon

• Tuesday morning

• etc.

• I referenced them as Xday { morning | afternoon }, slide N

Page 23: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Computational morphology

Analysis

leaves

leaf N Pl leave N Pl leave V Sg3

Generation

hang V Past

hanged hung

Karttunen, LSA 2005, day 1, slide 10

Page 24: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Four notations

• lexc: lexicon compiler– declarative

• FST: finite state transducer– declarative

• XFST command line– Xerox finite state transducer– procedural: something happens every time you type a

command

• Linux command line– procedural

Page 25: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Declarative vs procedural

• Your LEXC file is not a program.

• It is data.

• But you have to understand what the program will do with it.

• Monday morning, slide 13

Page 26: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon: Lexc (lexicon compiler)

• Make a lexc file

• at the xfst prompt (xfst command line)

• read lexc < filename

• apply up word

Page 27: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Format of the LEXC file

Page 28: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 6

Page 29: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 8

Page 30: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 9

Page 31: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 10

Page 32: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 11

Page 33: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Slide 14, Tuesday afternoon

Page 34: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 17

Page 35: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 15

Design and software engineering issues: keep it clean and modular

Page 36: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Some common morphotacticoperations

• Skip a morpheme

• Loop

Page 37: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 18

Page 38: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 19

Page 39: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 20

Page 40: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 21

Page 41: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 22

Page 42: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Monday afternoon, 39-42 replace rules

Page 43: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Tuesday afternoon, slide 24

Adding morpho-phonological rules

Page 44: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Monday afternoon, slide 39

Page 45: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 46: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Wednesday morning, slide 27

Page 47: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Wednesday morning, slide 29

Page 48: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 49: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Wednesday morning, slide 30

Page 50: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Wednesday morning, slide 31

The string “ab” is replaced by the string “x” when it is preceded by “ab” and followed by “a”.

Page 51: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 52: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Wednesday morning, slide 32

Page 53: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 54: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Mon. pm, slide 30

Ordering of multiple morpho-phonological rules

Page 55: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Sequential application in detail

N:m

N

?

? 0

2

1

p

N:mm

pN

m

p:m

?? 0 1

mp

m

k a N p a n

k a m p a n

k a m m a n

0 0 0 2 0 0 0

0 0 0 1 0 0 0

Karttunen, LSA 2005, day 1, slide 24

N becomes m before p

p becomes m after m

Page 56: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Composition

N:m

N

?? 0

3

1

N:m

m

p

N

?

m2

p:m

p:m

N m

N:mk a N p a n

k a m m a n

0 0 0 3 0 0 0

Karttunen, LSA 2005, day 1, slide 25

Page 57: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 58: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Mon. pm.Slide 41

Page 59: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Mon. pm.Slide 42

Page 60: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Palatalization and Vowel Raising

• Palatalization– tim --> cim

• Vowel Raising– memi --> mimi

• Interaction– temi --> cimi

– tememi --> cimimi

Karttunen, LSA 2005, day 2, slide 22

Page 61: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Vowel Raising & Palatalizationdefine C [ b | c | d | f | g | h | j | k | l |

m | n | p | q | r | s | t | v |x | y | z ];

define Raising e -> i \\ _ C* i ;define Palatalization t -> c || _ i;

regex Raising .o. Palatalization;

down memimimidown timcimdown temicimidown tememicimimi

t e m e m i

t i m i m i

c i m i m i

Karttunen, LSA 2005, day 2, slide 23

Page 62: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 63: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Shortening script

define V [ a | e | i | o | u | a ];define C [ b | c | d | f | g | h | j | k | l |

m | n | p | q | r | s | t | v |x | y | z ];

define SlovakShortening %: -> 0 || V %: C* V _ ;

define GidabalShortening %: -> 0 // V %: C* V _ ;

push SlovakShorteningdown vola:va:me:vola:vame

push GidabalShorteningdown gunu:mba:da:ngbe:gunu:mbada:ngbe

Karttunen, LSA 2005, day 2, slide 21

Page 64: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Stop

• Extra slides follow

Page 65: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 66: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the
Page 67: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Making a lexical transducer

LexiconFST

RuleFSTs

CompilerLexical Transducer

(a single FST)composition

LexiconRegular Expression

RulesRegular Expressions

Morphotactics

Alternations

Karttunen, LSA 2005, day 2, slide 24

Page 68: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

The language that the machine accepts

• Our pop machine accepts a language consisting of the strings:– nickle nickle button-press– dime button-press

• Our “grace” machine accepts the language consisting of the strings:– grace, graces, graced, gracing, graceful, gracefully, gracefulness,

etc.

• Monday morning, slide 23

Page 69: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Monday morning, slide 24

Page 70: Finite State Morphology Assignmenttts.speech.cs.cmu.edu/11-823/slides/xfst.pdf · –Grace (verb): graces, graced, gracing ... 2.9 Kb. 23 states, ... the underlying form. –If the

Monday morning, slide 25