Presentation

Preview:

DESCRIPTION

 

Citation preview

Translation Memory Retrieval Methods[Bloodgood and Strauss, 2014] in Proc of 14th EACL

Koichi Akabe and Philip Arthur

NAIST MT Study

2014-07-03

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 1 / 27

Introduction

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 2 / 27

Translation Memory (TM)

▶ Most widely used computer-assisted translation (CAT) tool

▶ Suggest translations using other translations

En The dog opened the door.

Ja 犬がドアを開けた。

En I saw a girl with a telescope.

Ja 僕は望遠鏡で少女を見た。

En John opened the door.

Ja

1. Find the nearest source sentence

2. Suggest a translation

3. Post-editing

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27

Translation Memory (TM)

▶ Most widely used computer-assisted translation (CAT) tool

▶ Suggest translations using other translations

En The dog opened the door.

Ja 犬がドアを開けた。

En I saw a girl with a telescope.

Ja 僕は望遠鏡で少女を見た。

En John opened the door.

Ja

1. Find the nearest source sentence

2. Suggest a translation

3. Post-editing

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27

Translation Memory (TM)

▶ Most widely used computer-assisted translation (CAT) tool

▶ Suggest translations using other translations

En The dog opened the door.

Ja 犬がドアを開けた。

En I saw a girl with a telescope.

Ja 僕は望遠鏡で少女を見た。

En John opened the door.

Ja

1. Find the nearest source sentence

2. Suggest a translation

3. Post-editing

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27

Translation Memory (TM)

▶ Most widely used computer-assisted translation (CAT) tool

▶ Suggest translations using other translations

En The dog opened the door.

Ja 犬がドアを開けた。

En I saw a girl with a telescope.

Ja 僕は望遠鏡で少女を見た。

En John opened the door.

Ja

1. Find the nearest source sentence

2. Suggest a translation

3. Post-editing

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27

Translation Memory (TM)

▶ Most widely used computer-assisted translation (CAT) tool

▶ Suggest translations using other translations

En The dog opened the door.

Ja 犬がドアを開けた。

En I saw a girl with a telescope.

Ja 僕は望遠鏡で少女を見た。

En John opened the door.

Ja 犬がドアを開けた。 (fuzzy)

1. Find the nearest source sentence

2. Suggest a translation

3. Post-editing

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27

Translation Memory (TM)

▶ Most widely used computer-assisted translation (CAT) tool

▶ Suggest translations using other translations

En The dog opened the door.

Ja 犬がドアを開けた。

En I saw a girl with a telescope.

Ja 僕は望遠鏡で少女を見た。

En John opened the door.

Ja 犬がドアを開けた。 (fuzzy)

1. Find the nearest source sentence

2. Suggest a translation

3. Post-editing

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27

Translation Memory (TM)

▶ Most widely used computer-assisted translation (CAT) tool

▶ Suggest translations using other translations

En The dog opened the door.

Ja 犬がドアを開けた。

En I saw a girl with a telescope.

Ja 僕は望遠鏡で少女を見た。

En John opened the door.

Ja ジョンがドアを開けた。

1. Find the nearest source sentence

2. Suggest a translation

3. Post-editing

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 3 / 27

How to find the nearest source sentence?

TM finds the nearest source sentence using similarity metrics

▶ Edit distance (Leven-shtein distance)−→ Widely used metric

▶ MT evaluation metrics [Simard and Fujita, 2012]−→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics

▶ This paper

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27

How to find the nearest source sentence?

TM finds the nearest source sentence using similarity metrics

▶ Edit distance (Leven-shtein distance)−→ Widely used metric

▶ MT evaluation metrics [Simard and Fujita, 2012]−→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics

▶ This paper

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27

How to find the nearest source sentence?

TM finds the nearest source sentence using similarity metrics

▶ Edit distance (Leven-shtein distance)−→ Widely used metric

▶ MT evaluation metrics [Simard and Fujita, 2012]−→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics

▶ This paper

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27

How to find the nearest source sentence?

TM finds the nearest source sentence using similarity metrics

▶ Edit distance (Leven-shtein distance)−→ Widely used metric

▶ MT evaluation metrics [Simard and Fujita, 2012]−→ WER, BLEU, NIST, VMeteor, Meteor as TM metrics

▶ This paper

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 4 / 27

Threshold of helpfulness

Matching algorithm always returns the nearest sentenceHowever, low score suggestions should not be shown

TM softwares set the threshold at 70% in practice

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27

Threshold of helpfulness

Matching algorithm always returns the nearest sentenceHowever, low score suggestions should not be shown

TM softwares set the threshold at 70% in practice

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27

Threshold of helpfulness

Matching algorithm always returns the nearest sentenceHowever, low score suggestions should not be shown

TM softwares set the threshold at 70% in practice −→ Why?

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 5 / 27

Translation Memory Similarity Metrics

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 6 / 27

Definitions

TM Similarity Metrics compare M and C.M : workload sentenceC: source language side of a candidate pre-existing translation

En The dog opened the door .

Ja 犬がドアを開けた。

En I saw a girl with a telescope .

Ja 僕は望遠鏡で少女を見た。

En John opened the door .

Ja 犬がドアを開けた。 (fuzzy)

M =John opened the door .C1 =The dog opened the door .C2 =I saw a girl with a telescope ....

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 7 / 27

Definitions

TM Similarity Metrics compare M and C.M : workload sentenceC: source language side of a candidate pre-existing translation

En The dog opened the door .

Ja 犬がドアを開けた。

En I saw a girl with a telescope .

Ja 僕は望遠鏡で少女を見た。

En John opened the door .

Ja 犬がドアを開けた。 (fuzzy)

M =John opened the door .C1 =The dog opened the door .C2 =I saw a girl with a telescope ....

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 7 / 27

Translation Memory Similarity Metrics

Compare the following metrics:

▶ Percent Match

▶ Weighted Percent Match

▶ Edit Distance

▶ N-gram Precision

▶ Weighted N-gram Precision

▶ Modified Weighted N-gram Precision

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 8 / 27

Percent Match (PM)

The simplest metric

PM(M,C) =|Munigrams ∩ Cunigrams|

|Munigrams|

e.g.

M =John opened the door .C =The dog opened the door .

PM(M,C) =4

5= 0.80

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27

Percent Match (PM)

The simplest metric

PM(M,C) =|Munigrams ∩ Cunigrams|

|Munigrams|

e.g.

M =John opened the door .C =The dog opened the door .

PM(M,C) =4

5= 0.80

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27

Percent Match (PM)

The simplest metric

PM(M,C) =|Munigrams ∩ Cunigrams|

|Munigrams|

e.g.

M =John opened the door .C =The dog opened the door .

PM(M,C) =4

5= 0.80

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27

Percent Match (PM)

The simplest metric

PM(M,C) =|Munigrams ∩ Cunigrams|

|Munigrams|

e.g.

M =John opened the door .C =The dog opened the door .

PM(M,C) =4

5= 0.80

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 9 / 27

Weighted Percent Match (WPM)

We want to know translation of rare words

PM with IDF weighting

WPM(M,C) =

∑u∈{Munigrams∩Cunigrams}

idf(u,D)

∑u∈Munigrams

idf(u,D)

where D is a set of all source sentences in the parallel corpus

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27

Weighted Percent Match (WPM)

We want to know translation of rare words

PM with IDF weighting

WPM(M,C) =

∑u∈{Munigrams∩Cunigrams}

idf(u,D)

∑u∈Munigrams

idf(u,D)

where D is a set of all source sentences in the parallel corpus

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27

Weighted Percent Match (WPM)

We want to know translation of rare words

PM with IDF weighting

WPM(M,C) =

∑u∈{Munigrams∩Cunigrams}

idf(u,D)

∑u∈Munigrams

idf(u,D)

where D is a set of all source sentences in the parallel corpus

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 10 / 27

Problem of PM and WPM

PM and WPM only consider coverage of words

−→ They cannnot see any context

We show methods that consider contexts in next slides

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27

Problem of PM and WPM

PM and WPM only consider coverage of words−→ They cannnot see any context

We show methods that consider contexts in next slides

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27

Problem of PM and WPM

PM and WPM only consider coverage of words−→ They cannnot see any context

We show methods that consider contexts in next slides

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 11 / 27

Edit Distance (ED)

Widely used metric

ED = max

(1− edit-dist(M,C)

|Munigrams|, 0

)where edit-dist(M,C) is the number of word insertions, deletions,and substitutions required to transform M into C

e.g.

M =John opened the door .C =The dog opened the door .substitution: 1insertion: 1

ED(M,C) = 1− 2

5= 0.60

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27

Edit Distance (ED)

Widely used metric

ED = max

(1− edit-dist(M,C)

|Munigrams|, 0

)where edit-dist(M,C) is the number of word insertions, deletions,and substitutions required to transform M into C

e.g.

M =John opened the door .C =The dog opened the door .substitution: 1insertion: 1

ED(M,C) = 1− 2

5= 0.60

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27

Edit Distance (ED)

Widely used metric

ED = max

(1− edit-dist(M,C)

|Munigrams|, 0

)where edit-dist(M,C) is the number of word insertions, deletions,and substitutions required to transform M into C

e.g.

M =John opened the door .C =The dog opened the door .

substitution: 1insertion: 1

ED(M,C) = 1− 2

5= 0.60

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27

Edit Distance (ED)

Widely used metric

ED = max

(1− edit-dist(M,C)

|Munigrams|, 0

)where edit-dist(M,C) is the number of word insertions, deletions,and substitutions required to transform M into C

e.g.

M =John opened the door .C =The dog opened the door .substitution: 1

insertion: 1

ED(M,C) = 1− 2

5= 0.60

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27

Edit Distance (ED)

Widely used metric

ED = max

(1− edit-dist(M,C)

|Munigrams|, 0

)where edit-dist(M,C) is the number of word insertions, deletions,and substitutions required to transform M into C

e.g.

M =John opened the door .C =The dog opened the door .substitution: 1insertion: 1

ED(M,C) = 1− 2

5= 0.60

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27

Edit Distance (ED)

Widely used metric

ED = max

(1− edit-dist(M,C)

|Munigrams|, 0

)where edit-dist(M,C) is the number of word insertions, deletions,and substitutions required to transform M into C

e.g.

M =John opened the door .C =The dog opened the door .substitution: 1insertion: 1

ED(M,C) = 1− 2

5= 0.60

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 12 / 27

N-gram Precision (NGP)

Mean of N-gram precision (like the BLEU metric)However, BLEU → 0 when the precision of longer N-grams is 0

This work uses arithmetic mean instead of geometric mean

NGP =1

N

N∑n=1

pn

pn =|Mn-grams ∩ Cn-grams|

Z ∗ |Mn-grams|+ (1− Z) ∗ |Cn-grams|

where Z is a parameter to control normalization,and N is the maximum length of N-gramN = 4 and Z = 0.75 in main experiments (discuss later)

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27

N-gram Precision (NGP)

Mean of N-gram precision (like the BLEU metric)

However, BLEU → 0 when the precision of longer N-grams is 0

This work uses arithmetic mean instead of geometric mean

NGP =1

N

N∑n=1

pn

pn =|Mn-grams ∩ Cn-grams|

Z ∗ |Mn-grams|+ (1− Z) ∗ |Cn-grams|

where Z is a parameter to control normalization,and N is the maximum length of N-gramN = 4 and Z = 0.75 in main experiments (discuss later)

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27

N-gram Precision (NGP)

Mean of N-gram precision (like the BLEU metric)However, BLEU → 0 when the precision of longer N-grams is 0

This work uses arithmetic mean instead of geometric mean

NGP =1

N

N∑n=1

pn

pn =|Mn-grams ∩ Cn-grams|

Z ∗ |Mn-grams|+ (1− Z) ∗ |Cn-grams|

where Z is a parameter to control normalization,and N is the maximum length of N-gramN = 4 and Z = 0.75 in main experiments (discuss later)

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27

N-gram Precision (NGP)

Mean of N-gram precision (like the BLEU metric)However, BLEU → 0 when the precision of longer N-grams is 0

This work uses arithmetic mean instead of geometric mean

NGP =1

N

N∑n=1

pn

pn =|Mn-grams ∩ Cn-grams|

Z ∗ |Mn-grams|+ (1− Z) ∗ |Cn-grams|

where Z is a parameter to control normalization,and N is the maximum length of N-gramN = 4 and Z = 0.75 in main experiments (discuss later)

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 13 / 27

Weighted N-gram Precision (WNGP)

NGP with IDF weighting

WNGP =

N∑n=1

1

Nwpn

wpn =

∑i∈{Mn-grams∩Cn-grams}

w(i)

Z ∗

∑i∈Mn-grams

w(i)

+ (1− Z) ∗

∑i∈Cn-grams

w(i)

w(i) =∑

1-gram∈iidf(1-gram,D)

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 14 / 27

Weighted N-gram Precision (WNGP)

NGP with IDF weighting

WNGP =

N∑n=1

1

Nwpn

wpn =

∑i∈{Mn-grams∩Cn-grams}

w(i)

Z ∗

∑i∈Mn-grams

w(i)

+ (1− Z) ∗

∑i∈Cn-grams

w(i)

w(i) =∑

1-gram∈iidf(1-gram,D)

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 14 / 27

Modified Weighted N-gram Precision (MWNGP)

Shorter N-grams may help translators more than longer N-grams

WNGP =

N∑n=1

1

Nwpn

MWNGP =2N

2N − 1

N∑n=1

1

2nwpn

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 15 / 27

Modified Weighted N-gram Precision (MWNGP)

Shorter N-grams may help translators more than longer N-grams

WNGP =

N∑n=1

1

Nwpn

MWNGP =2N

2N − 1

N∑n=1

1

2nwpn

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 15 / 27

Experiment

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 16 / 27

Experiment

Two different technicals domains with Two different language pairs(Fr-En, Zn-En).

▶ Zn-En: OpenOffice3

▶ Fr-En: EMEA

Preprocessing is performed on both source sides to produce validsegment.

Some sentences are randomly sampled from corpus as M and C.

▶ Zn-En: 400 M and 10.000 C.

▶ Fr-En: 300 M and 10.000 C.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27

Experiment

Two different technicals domains with Two different language pairs(Fr-En, Zn-En).

▶ Zn-En: OpenOffice3

▶ Fr-En: EMEA

Preprocessing is performed on both source sides to produce validsegment.

Some sentences are randomly sampled from corpus as M and C.

▶ Zn-En: 400 M and 10.000 C.

▶ Fr-En: 300 M and 10.000 C.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27

Experiment

Two different technicals domains with Two different language pairs(Fr-En, Zn-En).

▶ Zn-En: OpenOffice3

▶ Fr-En: EMEA

Preprocessing is performed on both source sides to produce validsegment.

Some sentences are randomly sampled from corpus as M and C.

▶ Zn-En: 400 M and 10.000 C.

▶ Fr-En: 300 M and 10.000 C.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27

Experiment

Two different technicals domains with Two different language pairs(Fr-En, Zn-En).

▶ Zn-En: OpenOffice3

▶ Fr-En: EMEA

Preprocessing is performed on both source sides to produce validsegment.

Some sentences are randomly sampled from corpus as M and C.

▶ Zn-En: 400 M and 10.000 C.

▶ Fr-En: 300 M and 10.000 C.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27

Experiment

Two different technicals domains with Two different language pairs(Fr-En, Zn-En).

▶ Zn-En: OpenOffice3

▶ Fr-En: EMEA

Preprocessing is performed on both source sides to produce validsegment.

Some sentences are randomly sampled from corpus as M and C.

▶ Zn-En: 400 M and 10.000 C.

▶ Fr-En: 300 M and 10.000 C.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 17 / 27

Evaluation

Evaluation is performed with Human Evaluation using AmazonMechanical Turk.

The Score is ranging from 1 to 5 (Not Helpful until ExtremelyHelpful).

Each segment M is rated by 5 Turkers and we keep track whichmetric performs best (ties is allowed).

The scores of each M are averaged as Mean Opinion Score(MOS).

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27

Evaluation

Evaluation is performed with Human Evaluation using AmazonMechanical Turk.

The Score is ranging from 1 to 5 (Not Helpful until ExtremelyHelpful).

Each segment M is rated by 5 Turkers and we keep track whichmetric performs best (ties is allowed).

The scores of each M are averaged as Mean Opinion Score(MOS).

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27

Evaluation

Evaluation is performed with Human Evaluation using AmazonMechanical Turk.

The Score is ranging from 1 to 5 (Not Helpful until ExtremelyHelpful).

Each segment M is rated by 5 Turkers and we keep track whichmetric performs best (ties is allowed).

The scores of each M are averaged as Mean Opinion Score(MOS).

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27

Evaluation

Evaluation is performed with Human Evaluation using AmazonMechanical Turk.

The Score is ranging from 1 to 5 (Not Helpful until ExtremelyHelpful).

Each segment M is rated by 5 Turkers and we keep track whichmetric performs best (ties is allowed).

The scores of each M are averaged as Mean Opinion Score(MOS).

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27

Evaluation

Evaluation is performed with Human Evaluation using AmazonMechanical Turk.

The Score is ranging from 1 to 5 (Not Helpful until ExtremelyHelpful).

Each segment M is rated by 5 Turkers and we keep track whichmetric performs best (ties is allowed).

The scores of each M are averaged as Mean Opinion Score(MOS).

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 18 / 27

Result and Analysis

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 19 / 27

Result: Which metric performs best?

Table OO3 Zn-En

Metric Found Best Total C

PM 178 400WPM 200 400

ED 193 400NGP 251 400

WNGP 271 400MWNGP 282 400

Table EMEA Fr-En

Metric Found Best Total C

PM 166 300WPM 184 300

ED 148 300NGP 188 300

WNGP 198 300MWNGP 201 300

Modified Weighted N-Gram Precision (MWNGP) achieved thebest result compared to any other metrics.

There are slight different between WNGP and Modified-WNGP.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27

Result: Which metric performs best?

Table OO3 Zn-En

Metric Found Best Total C

PM 178 400WPM 200 400

ED 193 400NGP 251 400

WNGP 271 400MWNGP 282 400

Table EMEA Fr-En

Metric Found Best Total C

PM 166 300WPM 184 300

ED 148 300NGP 188 300

WNGP 198 300MWNGP 201 300

Modified Weighted N-Gram Precision (MWNGP) achieved thebest result compared to any other metrics.

There are slight different between WNGP and Modified-WNGP.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27

Result: Which metric performs best?

Table OO3 Zn-En

Metric Found Best Total C

PM 178 400WPM 200 400

ED 193 400NGP 251 400

WNGP 271 400MWNGP 282 400

Table EMEA Fr-En

Metric Found Best Total C

PM 166 300WPM 184 300

ED 148 300NGP 188 300

WNGP 198 300MWNGP 201 300

Modified Weighted N-Gram Precision (MWNGP) achieved thebest result compared to any other metrics.

There are slight different between WNGP and Modified-WNGP.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27

Result: Which metric performs best?

Table OO3 Zn-En

Metric Found Best Total C

PM 178 400WPM 200 400

ED 193 400NGP 251 400

WNGP 271 400MWNGP 282 400

Table EMEA Fr-En

Metric Found Best Total C

PM 166 300WPM 184 300

ED 148 300NGP 188 300

WNGP 198 300MWNGP 201 300

Modified Weighted N-Gram Precision (MWNGP) achieved thebest result compared to any other metrics.

There are slight different between WNGP and Modified-WNGP.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 20 / 27

Scatterplot: OO3 Percent Match

1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

MOS

0.0

0.2

0.4

0.6

0.8

1.0

Me

tric

Va

lue

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 21 / 27

Scatterplot: OO3 Edit Distance

1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

MOS

0.0

0.2

0.4

0.6

0.8

1.0

Me

tric

Va

lue

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 22 / 27

Scatterplot: OO3 Modified N-Gram Precision

1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

MOS

0.0

0.2

0.4

0.6

0.8

1.0

Me

tric

Va

lue

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 23 / 27

The effect of Z: Adjusting for length preferences

Many of the metrics are using Z as parameters.

Z parameter can be used to control for length preferences.

Table EMEA Fr-En

Z Value Avg Length0.00 9.92980.25 13.2040.50 16.01340.75 19.63551.00 27.8829

Table OO3 Zn-En

Z Value Avg Length0.00 7.24750.25 9.56000.50 11.12500.75 14.18251.00 25.0875

Smaller Z prefered shorter match that are more precise andincreased precision.

Larger Z prefers longer match that contains many correcttranslations and increased recall.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27

The effect of Z: Adjusting for length preferences

Many of the metrics are using Z as parameters.

Z parameter can be used to control for length preferences.

Table EMEA Fr-En

Z Value Avg Length0.00 9.92980.25 13.2040.50 16.01340.75 19.63551.00 27.8829

Table OO3 Zn-En

Z Value Avg Length0.00 7.24750.25 9.56000.50 11.12500.75 14.18251.00 25.0875

Smaller Z prefered shorter match that are more precise andincreased precision.

Larger Z prefers longer match that contains many correcttranslations and increased recall.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27

The effect of Z: Adjusting for length preferences

Many of the metrics are using Z as parameters.

Z parameter can be used to control for length preferences.

Table EMEA Fr-En

Z Value Avg Length0.00 9.92980.25 13.2040.50 16.01340.75 19.63551.00 27.8829

Table OO3 Zn-En

Z Value Avg Length0.00 7.24750.25 9.56000.50 11.12500.75 14.18251.00 25.0875

Smaller Z prefered shorter match that are more precise andincreased precision.

Larger Z prefers longer match that contains many correcttranslations and increased recall.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27

The effect of Z: Adjusting for length preferences

Many of the metrics are using Z as parameters.

Z parameter can be used to control for length preferences.

Table EMEA Fr-En

Z Value Avg Length0.00 9.92980.25 13.2040.50 16.01340.75 19.63551.00 27.8829

Table OO3 Zn-En

Z Value Avg Length0.00 7.24750.25 9.56000.50 11.12500.75 14.18251.00 25.0875

Smaller Z prefered shorter match that are more precise andincreased precision.

Larger Z prefers longer match that contains many correcttranslations and increased recall.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27

The effect of Z: Adjusting for length preferences

Many of the metrics are using Z as parameters.

Z parameter can be used to control for length preferences.

Table EMEA Fr-En

Z Value Avg Length0.00 9.92980.25 13.2040.50 16.01340.75 19.63551.00 27.8829

Table OO3 Zn-En

Z Value Avg Length0.00 7.24750.25 9.56000.50 11.12500.75 14.18251.00 25.0875

Smaller Z prefered shorter match that are more precise andincreased precision.

Larger Z prefers longer match that contains many correcttranslations and increased recall.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 24 / 27

Conclusion

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 25 / 27

Conclusion

▶ This paper compares TM similarity metrics.

▶ The best method is Modified Weighted N-Gram Precision.

▶ All the discussed metrics only consider source sides in thecalculation.

▶ Z parameter is used to adjust the length preferences of theretrieved TM.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27

Conclusion

▶ This paper compares TM similarity metrics.

▶ The best method is Modified Weighted N-Gram Precision.

▶ All the discussed metrics only consider source sides in thecalculation.

▶ Z parameter is used to adjust the length preferences of theretrieved TM.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27

Conclusion

▶ This paper compares TM similarity metrics.

▶ The best method is Modified Weighted N-Gram Precision.

▶ All the discussed metrics only consider source sides in thecalculation.

▶ Z parameter is used to adjust the length preferences of theretrieved TM.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27

Conclusion

▶ This paper compares TM similarity metrics.

▶ The best method is Modified Weighted N-Gram Precision.

▶ All the discussed metrics only consider source sides in thecalculation.

▶ Z parameter is used to adjust the length preferences of theretrieved TM.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27

Conclusion

▶ This paper compares TM similarity metrics.

▶ The best method is Modified Weighted N-Gram Precision.

▶ All the discussed metrics only consider source sides in thecalculation.

▶ Z parameter is used to adjust the length preferences of theretrieved TM.

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 26 / 27

Thank you for your attention!

2014-07-03 Koichi Akabe and Philip Arthur (MT Study) 27 / 27