Automating Bilingual Terminology Creation TAUS Annual Conference 2015 – San Jose
Democratizing Quality
Love your language
“Translation buyers shouldn’t have to buy yet more overpriced technology from vendors who fail to take ultimate responsibility for quality”
Christian Arno, CEO Lingo24
Love your language
Terminology and Its Impact
Terminology and Its Impact
Love your language
Upstream effects on content creation
Higher quality documentation as authors create less variation based on validated terms
Less time as authors require less research of what term is appropriate
Cost of translation
Consistently created source texts are easier to translate
More classic TM matches Shorten translation review time as QA checks capture
errors at source Dramatic increase in consistency = high quality MT
training material
Overall quality Building TermBank becomes easier once there is a
qualified base to build from Linguist feedback is more relevant as they help
validate the terms in context
Terminology is an invaluable asset in effective translation and localization…
Terminology and Its Impact
Love your language
High entry barriers
Need for enterprise terminology strategy How to build a TermBank? Who approves terms? How to maintain a TermBank?
Need to implement terminology tools
Central storage/repository Application of terms to translations Quality assurance to ensure consistency Cost of tool
We have found terminology is not always used to its potential given…
Love your language
Lingo24’s TermFinder
How It Works? An abridged version…
Love your language
Lingo24 uses a refreshed statistical approach by combining techniques that look at the data differently than traditional
frequency based methods
Sentence Aligned Content
(TMX, XLIFF, XLSX)
Bilingual TermBank(TMX, XLIFF,
XLSX)
IdentifyApplication of log likelihood
comparison method to extract monolingual terminology from
source and target.
RankUse phrase-based statistical Machine Translation to align,
rank and trim bilingual terminology.
< Log likelihood Comparison + MT Pipeline + Custom Features >
Results That Speak For Themselves
Love your language
How It Works? In-depth…
Love your language
https://www.aclweb.org/anthology/W/W14/W14-4806.pdf
Love your language
The Power of Terms in Your Hands
Love your language
Powers Our Approach
Log likelihood of TM vs generic terms Statistical machine translation alignment Custom feature ranking
Evaluation of automated extraction results (source and target candidates)
No/Yes interface for human review of unqualified term candidates Research and add quality meta-data Approval of results for inclusion in TermBank
Access to TM and concordances Access to comments and meta information Scalable authoring and approval structure All work performed based on relevant and validated data
Client head terminologist Client language terminologists Qualified Lingo24 linguists Casual terminologists / SMEs
Automated Terms
Right People
TailoredTasks
One Location
Love your language
Terminology in Action
Love your language
The Benefits
Love your language
The Benefits
Automatic creation of high-quality, domain-specific bilingual glossaries User-friendly reviewing and editing of automatically created glossary Monolingual glossary creation from source documents Reduced cost and time of terminology creation compared to industry norms Increased consistency and accuracy across projects involving multiple
linguists Ability to create terminology-aware Machine Translation output High quality glossaries immediately in the hands of linguists and reviewers
Love your language
Thanks / Mulțumesc / Gracias / Merçi
www.lingo24.com