28
Hybrid Example-Based – Rule-Based MT: Feeding Apertium with Bilingual Chunks Felipe S´ anchez-Mart´ ınez Dept. Llenguatges i Sistemes Inform ` atics Universitat d’Alacant E-03071 Alacant, Spain [email protected] Work done in collaboration with Andy Way (DCU) and Mikel L. Forcada (UA) at the Centre for Next Generation Localisation – DCU 8th July 2009 Felipe S´ anchez-Mart´ ınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 1 / 27

Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Hybrid Example-Based – Rule-Based MT:Feeding Apertium with Bilingual Chunks

Felipe Sanchez-Martınez

Dept. Llenguatges i Sistemes InformaticsUniversitat d’Alacant

E-03071 Alacant, [email protected]

Work done in collaboration withAndy Way (DCU) and Mikel L. Forcada (UA)

at the Centre for Next Generation Localisation – DCU

8th July 2009

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 1 / 27

Page 2: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Outline

1 Motivation & goal

2 The Apertium free/open-source MT platformApertium rule-based MT engineApertium: example of translation

3 Integration of bilingual chunks into ApertiumConsiderationsTranslation approachComputation of the best coverage

4 ExperimentsExperimental setupResults: marker-based chunksResults: tree-based chunks

5 Discussion

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 2 / 27

Page 3: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Motivation & goal

Motivation & goal

Motivation:

Usually rule-based machine translation (RBMT) systems do notbenefit from the post-edition effort of professional translators

Some RBMT may benefit from the translation units found intranslation memories (usually whole sentences)

Goal:

To integrate sub-sentential translation units into the Apertiumfree/open-source MT platform

Test the approach with bilingual chunks automatically obtainedusing the example-based methods implemented in Matrex

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 3 / 27

Page 4: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

The Apertium free/open-source MT platform Apertium rule-based MT engine

Apertium rule-based MT engine

source text→ de-formatter↓

morph. analyser↓

PoS tagger↓

structural transfer ↔ lexical transfer↓

morph. generator↓

Post-generator↓

Re-formatter → target text

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 4 / 27

Page 5: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

The Apertium free/open-source MT platform Apertium: example of translation

Apertium: Example of execution /1

Source text:Francis’ <strong>car</strong> is broken

De-formatter:Francis’[ <strong>]car[</strong> ]is broken

Morphological analyser:ˆFrancis’/Francis<np><ant><m><sg>+’s<gen>$[ <strong>]ˆcar/car<n><sg>$[</strong> ]ˆis/be<vbser><pri><p3><sg>$ˆbroken/break<vblex><pp>$

Part-of-speech tagger:ˆFrancis<np><ant><m><sg>$ ˆ’s<gen>$[ <strong>]ˆcar<n><sg>$[</strong> ]ˆbe<vbser><pri><p3><sg>$ˆbreak<vblex><pp>$

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 5 / 27

Page 6: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

The Apertium free/open-source MT platform Apertium: example of translation

Apertium: Example of execution /2

Structural transfer (prechunk) + Lexical transfer:ˆ nom <SN><UNDET><m><sg> {ˆFrancis<np><ant><3><4>$}$ˆ pr <GEN> {}$[ <strong>]ˆ nom <SN><UNDET><m><sg> {ˆcoche<n><3><4>$}$ [</strong> ]ˆ be pp <Vcop><vblex><pri><p3><sg><GD> {ˆestar<vblex><3><4><5>$ ˆromper<vblex><pp><6><5>$}$

Structural transfer (interchunk):[<strong>]ˆ nom <SN><PDET><m><sg> {ˆcoche<n><3><4>$}$[</strong> ]ˆ pr <PREP> { ˆde<pr>$}$ˆ nom <SN><PDET><m><sg> { ˆFrancis<np><ant><3><4>$}$ˆ be pp <Vcop><vblex><pri><p3><sg><m> {ˆestar<vblex><3><4><5>$ ˆromper<vblex><pp><6><5>$}$

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 6 / 27

Page 7: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

The Apertium free/open-source MT platform Apertium: example of translation

Apertium: Example of execution /3

Structural transfer (postchunk):[<strong>]ˆel<det><def><m><sg>$ ˆcoche<n><m><sg>$[</strong>] ˆde<pr>$ ˆFrancis<np><ant><m><sg>$ˆestar<vblex><pri><p3><sg>$ ˆromper<vblex><pp><m><sg>$

Morphological generator and post-generator:[<strong>]el coche[</strong> ]de Francis esta roto

De-formatter:<strong>el coche</strong> de Francis esta roto

Target text:<strong>el coche</strong> de Francis esta roto

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 7 / 27

Page 8: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Integration of bilingual chunks into Apertium Considerations

Considerations

To take into account:Not break the application of structural transfer rulesUse the longest possible chunks

How can the application of rules be preserved?Introducing chunks delimiters as format information. . . is [BCH 12 0]the chunk detected[ECH 12 0] by . . .

Chunks can be then recognised after the translation. . . es [BCH 12 0]el segmento detectado[ECH 12 0]por . . .

Known problem:As a result of the structural transfer rules, format information maybe moved aroundSome rules also delete format information (known bug)

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 8 / 27

Page 9: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Integration of bilingual chunks into Apertium Translation approach

Translation approach

1 apply a dynamic-programming algorithm to compute the bestcoverage of the input sentence

2 translate the input sentence as usual by Apertium

3 use a language model to choose one of the possible translationsfor each of the bilingual chunks detected

One source-language chunk may have different target-languagetranslationsAlso consider Apertium translation

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 9 / 27

Page 10: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Integration of bilingual chunks into Apertium Computation of the best coverage

Computation of the best coverage: data structure

Store source-language chunks in a trie of strings

the

sessionadjourned

interest

shown

...

...with

interestthe

...

...

...

1 2

3 4

5

It allows to compute the best coverage in O(l) time, where l is thelength of the input sentence

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 10 / 27

Page 11: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Integration of bilingual chunks into Apertium Computation of the best coverage

Computation of the best coverage: algorithm

... the session adjourned with the interest of ...inlike

A set of alive states in the trie is maintained to compute all thepossible ways to cover the input sentence

A new search is started at every word

At each position the best coverage until that position is stored

Is applied to text segments shorter than sentencesThe best coverage can be retrieved when there are no more alivestates

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 11 / 27

Page 12: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Integration of bilingual chunks into Apertium Computation of the best coverage

Computation of the best coverage

The best coverage:

is the one that uses the least possible number of chunkslongest possible chunks

each not covered word counts like one chunk

if two coverages use the same number of chunks, the one thatuses the most frequent chunks is used

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 12 / 27

Page 13: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Experimental setup

Experimental setup /1

Data used:

Corpora distributed for the WMT 09 Workshop for MTLanguage pairs: Spanish–English (es-en),English–Spanish (en-es)Linguistic data: apertium-en-es; SVN revision 9284

Software used:

ApertiumGiza++ and Moses to calculate word alignments and lexicalprobabilitiesSRILM to train 5-gram language modelsMatrex to segment training corpora and to align chunks

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 13 / 27

Page 14: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Experimental setup

Experimental setup /2

Training corpus:

Max. sentence length: 45 wordsMax. word ratio: 1.5 words (mean ration + std. dev.)

# sent: 1,187,905; # en words: 26,983,025; # es words: 27,951,388

Development corpus:# sent: 2,050; # en words: 49,884; # es words: 52,719

Test corpus:# sent: 3,027; # en words: 77,438; # es words: 80,580

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 14 / 27

Page 15: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Experimental setup

Experimental setup /3

Methods used to extract bilingual chunks:

Marker-based bilingual chunks (using Matrex)

Parse-tree based bilingual chunks (thanks to John Tinsley)Preliminary results using previously compute chunks using an oldversion of the Europarl parallel corpus

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 15 / 27

Page 16: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: marker-based chunks

Results: marker-based chunks

Bilingual chunks filtering:

There must be at least one word aligned in each side

Chunks not seen at least N times are discardedTested values for N: 5 . . . 80

Chunks containing punctuation marks and numbers are discarded

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 16 / 27

Page 17: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: marker-based chunks

Results: marker-based chunks — Spanish→English /1

Development corpus (BLEU):

17.7

17.75

17.8

17.85

17.9

10 20 30 40 50 60 70 80

Freq. cut-off

ApertiumApertium+chunks

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 17 / 27

Page 18: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: marker-based chunks

Results: marker-based chunks — Spanish→English /2

Test corpus (BLEU):

Apertium+chunks: 19.14Apertium: 18.81

# of chunks: 6,600 (Freq. cut-off: 28)# of applications: 6,321

Words covered by chunks: ≈ 17%# of no Apertium: 2,559 (40%)

Words really covered by chunks: ≈ 6%

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 18 / 27

Page 19: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: marker-based chunks

Results: marker-based chunks — English→Spanish /1

Development corpus (BLEU):

17.1

17.15

17.2

17.25

17.3

17.35

17.4

10 20 30 40 50 60 70 80

Freq. cut-off

ApertiumApertium+chunks

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 19 / 27

Page 20: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: marker-based chunks

Results: marker-based chunks — English→Spanish /2

Test corpus (BLEU):

Apertium+chunks: 18.94Apertium: 18.51

# of chunks: 16,395 (Freq. cut-off: 11)# of applications: 6,812

Words covered by chunks: ≈ 18%# of no Apertium: 2,884 (42%)

Words really covered by chunks: ≈ 8 %

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 20 / 27

Page 21: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: marker-based chunks

Some Spanish→English examples /1

S: desde hace muchos anos un fenomeno misterioso ...R: for years , a mysterious phenomenon ...A: from does a lot of years a mysterious phenomenon ...

A+C: for many years a mysterious phenomenon ...S: olmert devolverıa casi todas las zonas ocupadas a cambio de la pazR: olmert would return ... territories in exchange for peaceA: olmert it would give back ... zones to change of the peace

A+C: olmert it would give back ... zones in exchange for peaceS: pero hay una cosa que nos une :R: but there is one thing that connects us :A: but there is a thing that joins us :

A+C: but there is one thing that joins us :

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 21 / 27

Page 22: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: tree-based chunks

Results: tree-based chunks

Bilingual chunks filtering:

There must be at least one target word aligned with a source wordwith p(target|source) > 0.01

Chunks not seen at least N times are discardedTested values for N: 5 . . . 120

Chunks containing punctuation marks and numbers are discarded

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 22 / 27

Page 23: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: tree-based chunks

Results: tree-based chunks — Spanish→English /1

Development corpus (BLEU):

17.7

17.75

17.8

17.85

17.9

20 40 60 80 100 120

Freq. cut-off

ApertiumApertium+chunks

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 23 / 27

Page 24: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Experiments Results: tree-based chunks

Results: tree-based chunks — Spanish→English /2

Test corpus (BLEU):

Apertium+chunks: 0.1911Apertium: 0.1881

# of chunks: 7,466 (Freq. cut-off: 74)# of applications: 4,650

Words covered by chunks: ≈ 12%# of no Apertium: 3,075 (66%)

Words really covered by chunks: ≈ 4%

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 24 / 27

Page 25: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Discussion

Discussion /1

Small improvement in both the development set and the test setBetter improvement in the test set

marker-based chunks set improvement covered words

es-endev + 0.20 18% (7%)test + 0.33 17% (6%)

en-esdev + 0.31 17% (7%)test + 0.43 18% (8%)

Noise introduced due to how Apertium manages formatinformation

Some chunks are not applied because chunk delimiters are lostSome chunk delimiters are moved and the detected sequence ofwords after translation is not correct

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 25 / 27

Page 26: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Discussion

Discussion /2

Possible way of improvement when computing the best coverage andtwo coverages uses the same number of chunks:

Use the bilingual chunk that would produce the most-likely TLtranslation instead of the most frequent oneHow? Using a language model with gaps

© in the session © © with the interest .

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 26 / 27

Page 27: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Discussion

Discussion /3

Future work:

Try tree-based chunks obtained from the WMT 09 corpus usingFreeLing to parse Spanish (in both directions)Perform a manual evaluationCombine both Spanish→English and English→Spanish chunks

IntersectionUnion

Try to know how Matrex is helping Apertium

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 27 / 27

Page 28: Hybrid Example-Based -- Rule-Based MT: Feeding Apertium ...nclt.computing.dcu.ie/slides/felipeS.pdf3 use a language model to choose one of the possible translations for each of the

Discussion

Hybrid Example-Based – Rule-Based MT:Feeding Apertium with Bilingual Chunks

Felipe Sanchez-Martınez

Dept. Llenguatges i Sistemes InformaticsUniversitat d’Alacant

E-03071 Alacant, [email protected]

Work done in collaboration withAndy Way (DCU) and Mikel L. Forcada (UA)

at the Centre for Next Generation Localisation – DCU

8th July 2009

Felipe Sanchez-Martınez (Univ. d’Alacant) Hybrid MT: Feeding Apertium with chunks 8th July 2009 27 / 27