16
Automating Bilingual Terminology Creation TAUS Annual Conference 2015 – San Jose

Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Embed Size (px)

Citation preview

Page 1: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Automating Bilingual Terminology Creation TAUS Annual Conference 2015 – San Jose

Page 2: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Democratizing Quality

Love your language

“Translation buyers shouldn’t have to buy yet more overpriced technology from vendors who fail to take ultimate responsibility for quality”

Christian Arno, CEO Lingo24

Page 3: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

Terminology and Its Impact

Page 4: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Terminology and Its Impact

Love your language

Upstream effects on content creation

Higher quality documentation as authors create less variation based on validated terms

Less time as authors require less research of what term is appropriate

Cost of translation

Consistently created source texts are easier to translate

More classic TM matches Shorten translation review time as QA checks capture

errors at source Dramatic increase in consistency = high quality MT

training material

Overall quality Building TermBank becomes easier once there is a

qualified base to build from Linguist feedback is more relevant as they help

validate the terms in context

Terminology is an invaluable asset in effective translation and localization…

Page 5: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Terminology and Its Impact

Love your language

High entry barriers

Need for enterprise terminology strategy How to build a TermBank? Who approves terms? How to maintain a TermBank?

Need to implement terminology tools

Central storage/repository Application of terms to translations Quality assurance to ensure consistency Cost of tool

We have found terminology is not always used to its potential given…

Page 6: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

Lingo24’s TermFinder

Page 7: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

How It Works? An abridged version…

Love your language

Lingo24 uses a refreshed statistical approach by combining techniques that look at the data differently than traditional

frequency based methods

Sentence Aligned Content

(TMX, XLIFF, XLSX)

Bilingual TermBank(TMX, XLIFF,

XLSX)

IdentifyApplication of log likelihood

comparison method to extract monolingual terminology from

source and target.

RankUse phrase-based statistical Machine Translation to align,

rank and trim bilingual terminology.

< Log likelihood Comparison + MT Pipeline + Custom Features >

Page 8: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Results That Speak For Themselves

Love your language

Page 9: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

How It Works? In-depth…

Love your language

https://www.aclweb.org/anthology/W/W14/W14-4806.pdf

Page 10: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

The Power of Terms in Your Hands

Page 11: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

Powers Our Approach

Log likelihood of TM vs generic terms Statistical machine translation alignment Custom feature ranking

Evaluation of automated extraction results (source and target candidates)

No/Yes interface for human review of unqualified term candidates Research and add quality meta-data Approval of results for inclusion in TermBank

Access to TM and concordances Access to comments and meta information Scalable authoring and approval structure All work performed based on relevant and validated data

Client head terminologist Client language terminologists Qualified Lingo24 linguists Casual terminologists / SMEs

Automated Terms

Right People

TailoredTasks

One Location

Page 12: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

Terminology in Action

Page 13: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

The Benefits

Page 14: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

The Benefits

Automatic creation of high-quality, domain-specific bilingual glossaries User-friendly reviewing and editing of automatically created glossary Monolingual glossary creation from source documents Reduced cost and time of terminology creation compared to industry norms Increased consistency and accuracy across projects involving multiple

linguists Ability to create terminology-aware Machine Translation output High quality glossaries immediately in the hands of linguists and reviewers

Page 15: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

Love your language

Thanks / Mulțumesc / Gracias / Merçi

Page 16: Automating Bilingual Terminology Creation. David Meikle (Lingo24)

www.lingo24.com