24
NAACL HLT – Rochester, NY – April 2007 Chunk-level Reordering of Source Language Sentences with Automatically Learned Rules for Statistical Machine Translation Yuqi Zhang, Richard Zens and Hermann Ney Human Language Technology and Pattern Recognition Lehrstuhl für Informatik 6 Computer Science Department RWTH Aachen University, Germany Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 1

NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

NAACL HLT – Rochester, NY – April 2007

Chunk-level Reordering of Source Language Sentenceswith Automatically Learned Rules for

Statistical Machine Translation

Yuqi Zhang, Richard Zens and Hermann Ney

Human Language Technology and Pattern RecognitionLehrstuhl für Informatik 6

Computer Science DepartmentRWTH Aachen University, Germany

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20071

Page 2: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Overview

• Introduction

• Baseline system

• Chunk parsing

• Rules extraction

• Reordering lattice generation

• Results

• Conclusions and outlook

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20072

Page 3: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Introduction

goal:improve MT utilizing syntactic knowledge

idea:reordering at the chunk level

approach:

1. chunk source sentence

2. reorder chunks

3. represent alternative reorderings in a lattice

4. translate lattice

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20073

Page 4: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Phrase-based SMT

log-linear combination of several model:

Pr(eI1|fJ

1 ) =exp

(∑Mm=1 λmhm(eI

1, fJ1 )

)∑

I′,e′I′1

exp(∑M

m=1 λmhm(e′I′1 , fJ

1 ))

models:• phrase translation model

• phrase count features

• word-based translation model

• word and phrase penalty

• target language model (6-gram)

• distortion penalty modelZhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20074

Page 5: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

System Architecture

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20075

Page 6: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Example

source ke yi dan shi wo men chu zu che bu duoPOS v c r v n d mchunks v c r NP VPEnglish gloss yes but we taxi not many

reordering rulesNP VP → VP NPr NP VP → r VP NPr NP VP → VP r NP

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20076

Page 7: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Example (cont’d)

• reordering lattice:

0 1ke yi

2dan shi

3wo men

11

bu

4chu zu

8bu

5che

9duo

10chu zu

7che

6bu

duo

12duo

wo men

• translation result:reference yes, but there are not many rental cars herebaseline yes , but we do rent car is notchunk-reordering yes , but we do not have much rental car

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20077

Page 8: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Chunk Parsing

• POS tagging + word segmentation with ICTCLAS toolInstitute of Computing Technology, Chinese Academy ofSciences

• training data for chunker: Chinese Treebank (LDC2005T01)

• 24 chunk types

• MaxEnt tagger

– input features: word + POS tag– output: chunk types + chunk boundary

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20078

Page 9: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Reordering Rules Extraction

• convert word-to-word alignment to chunk-to-word alignment

• run standard phrase extraction on chunk-to-word alignment

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 20079

Page 10: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Reordering Rules Extraction (cont’d)

(a) monotone phrase, (b) reordering phrase, (c) cross phrase

• extract rules from monotone phrases and reordering phrases

– e.g. NP0NP1# NP0 NP1 NP0NP1# NP1 NP0

– within a subsentence, not across punctuationsZhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200710

Page 11: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Reordering Lattice Generation I

• apply reordering rules to chunked source sentence

• represent alternative reorderings as a lattice

• example:

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200711

Page 12: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Reordering Lattice Generation II

• chunk-level lattice:

0

4

NP1

1NP0

6

v

5v

2

NP0

3

NP0

NP1

7v

v

NP0

NP1

• word-level lattice:

0

1

f2

8f0

14

f6

2f3 3f4 4

f5

5

f6

20f0

6

f0

7

f1

9f1 10f2

16

f6

11f3

12f4

13f5f6

15f0 f1

17f2

18f3 19f4

f5

f1

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200712

Page 13: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Reordering Model

• use language model to weigh lattice

• training:

– chunk source training data– generate chunk-to-word alignment– reorder source chunks to monotonize alignment– train LM on reordered source training data

• word-level LM

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200713

Page 14: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Chunking Result

• corpus statistics (Chinese Treebank LDC2005T01):

train testsentences 17 785 1 000words 486 468 21 851chunks 105 773 4 680

• tagging results:

word-level chunk-levelaccuracy [%] precision [%] recall [%] F-measure [%]

74.51 65.2 61.5 63.3

• number of chunk types: 24

• both chunk type and boundary have to be correct

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200714

Page 15: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Corpus Statistics

Chinese EnglishTrain Sentences 40k

Words 308k 377kDev(dev4) Sentences 489

Words 5 478 6 008Test Sentences 500IWSLT04 Words 3 866 3 581Test Sentences 506IWSLT05 Words 3 652 3 579Test Sentences 500IWSLT06 Words 5 846 –

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200715

Page 16: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Statistics of Reordering Rules

0

10

20

30

40

50

0 2 4 6 8 10 12 14

num

ber

of r

ules

(k)

rule length

totalsingletons

reorder rules

total: 184k, singletons: 88%, reorder rules: 34%Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200716

Page 17: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Translation Results

WER [%] PER [%] NIST BLEU [%]IWSLT04 baseline 47.3 38.2 7.78 39.1

chunk reordering 46.3 37.2 7.70 40.9IWSLT05 baseline 45.0 37.3 7.40 41.8

chunk reordering 44.6 36.8 7.51 42.3IWSLT06 baseline 67.4 50.0 6.65 22.4

chunk reordering 65.6 50.4 6.46 23.3

• evaluation without punctuation marks and in lower case

• baseline: RWTH IWSLT 2006 system without rescoringZhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200717

Page 18: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Chunk-level vs. POS-level

Translation performance (IWSLT 2004):WER [%] PER [%] NIST BLEU [%]

Baseline 47.3 38.2 7.78 39.1POS 46.9 37.5 7.38 39.7

Chunk 46.3 37.2 7.70 40.9

Lattice statistics:avg. density used translation

per sent rules time [min:sec]Baseline - - 1:22

POS 15.7 6 868 7:08Chunk 8.2 3 685 3:47

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200718

Page 19: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Translation Examples (IWSLT04)

reference about twenty-five secondsbaseline seconds about twenty-five

chunk reorder about twenty five secondsreference could n’t you make it a little cheaperbaseline could not you some better

chunk reorder can’t you make it a little cheaper onesreference how much is admissionbaseline admission fees how much is it

chunk reorder how much is the admissionreference may i have that gift wrapped pleasebaseline wrap can i have a gift

chunk reorder can i have a gift wrapped please

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200719

Page 20: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Summary

• idea:

1. chunk input sentence2. reorder chunks3. represent alternative reorderings as lattice4. translate lattice

• nice improvements on IWSLT task

• chunk-level reordering better than POS-level reordering

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200720

Page 21: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Outlook

• large data task (e.g. NIST)

• other language pairs

• improve chunk parsing

• better reordering model

• analyze what kind of rules work well

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200721

Page 22: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

THANK YOU FOR YOUR ATTENTION!

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200722

Page 23: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

ICTCLAS POS Tag Set

n noun r pronnr person name rg pron morphemens location name m numberng noun morpheme q quantityt time d adverbs location p prepf position word c conjunctionv verb u auxiliary

vd verb adv e interjectionvn noun verb y modal particlevg verb morpheme o onomatopoeiaa adj h prefix

ad adv adj k suffixan adj noun w punctuationag adj morpheme b determiner

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200723

Page 24: NAACL HLT – Rochester, NY – April 2007dekai/ssst1/slides/ZhangZensNey_SSST2007.pdff5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1

Syntactic Tag Set of Chunks

ADJP adjective phraseADVP adverbial phrase headed by AD (adverb)CLP classifier phraseCP clause headed by C (complementizer)

DNP phrase formed by ′XP + DEG′

DP determiner phraseDVP phrase formed by ′XP + DEV ′

FRAG fragmentIP simple clause headed by I (INFL)

LCP phrase formed by ′XP + LC′

LST list markerNP noun phrasePP preposition phrase

PRN parentheticalQP quantifier phrase

UCP unidentical coordination phraseVP verb phrase

Details are in "The bracketing gudelines for penn chinese treebank 3.0", Technical Re-port 00-08, University of Pennsylvania(2000) IRCS Report.

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 200724