Upload
peregrine-terry
View
217
Download
0
Embed Size (px)
DESCRIPTION
Authoring Tools Software: Software: Integrated Development Environments (IDEs), such as MS Visual Studio Integrated Development Environments (IDEs), such as MS Visual Studio Documents: Documents: Traditional: MS Word, Adobe Framemaker, etc. Traditional: MS Word, Adobe Framemaker, etc. XML-based: Arbortext, XMetal, AuthorIT, Madcap Flare, etc. XML-based: Arbortext, XMetal, AuthorIT, Madcap Flare, etc. Publishing: PDF (Adobe Acrobat or proprietary system) Publishing: PDF (Adobe Acrobat or proprietary system) Help (User Assistance): Help (User Assistance): Traditional: MS HTML Help Workshop Traditional: MS HTML Help Workshop Webhelp: Adobe RoboHelp; Webhelp: Adobe RoboHelp; XML/single-souring tools are increasingly used for both docs and Help (i.e., Madcap Flare) XML/single-souring tools are increasingly used for both docs and Help (i.e., Madcap Flare)
Citation preview
Software Software LocalizationLocalization
Dr. Keiran J. DunneDr. Keiran J. DunneInstitute for Applied LinguisticsInstitute for Applied Linguistics
Lecture 5
Tools for I18n and L10nTools for I18n and L10nInternationalization and localization Internationalization and localization can be facilitated by a number of can be facilitated by a number of specialized software tools:specialized software tools:Authoring toolsControlled language toolsCounting /costing toolsDocument / version management toolsDTP toolsEngineering tools (software) Resource extract / find / replace toolsGrammar and spelling toolsLocalization tools
Machine translation systemsQA / testing / validation toolsProject management toolsTerm extraction toolsTerminology management systemsText alignment /comparison toolsText analysis tools Text editors Translation memory systemsWorkflow management tools
Authoring ToolsAuthoring Tools Software:Software:
Integrated Development Environments (IDEs), Integrated Development Environments (IDEs), such as MS Visual Studiosuch as MS Visual Studio
Documents:Documents: Traditional: MS Word, Adobe Framemaker, etc.Traditional: MS Word, Adobe Framemaker, etc. XML-based: Arbortext, XMetal, AuthorIT, Madcap XML-based: Arbortext, XMetal, AuthorIT, Madcap
Flare, etc.Flare, etc. Publishing: PDF (Adobe Acrobat or proprietary Publishing: PDF (Adobe Acrobat or proprietary
system)system) Help (User Assistance):Help (User Assistance):
Traditional: MS HTML Help WorkshopTraditional: MS HTML Help Workshop Webhelp: Adobe RoboHelp; Webhelp: Adobe RoboHelp; XML/single-souring tools are increasingly used for XML/single-souring tools are increasingly used for
both docs and Help (i.e., Madcap Flare)both docs and Help (i.e., Madcap Flare)
Controlled Languages & Controlled Languages & Computer-Assisted Computer-Assisted
Authoring (CAA) ToolsAuthoring (CAA) Tools Controlled language: a form of usage Controlled language: a form of usage
restricted by grammar and vocabulary rulesrestricted by grammar and vocabulary rules Controlled Languages + CAA systems = a Controlled Languages + CAA systems = a
standardized approach to document standardized approach to document authoring (facilitates i18n)authoring (facilitates i18n)
Specialized style checkers, monolingual Specialized style checkers, monolingual glossaries and source text memories help glossaries and source text memories help internationalize software documents (help, internationalize software documents (help, manuals, etc.)manuals, etc.)
Machine TranslationMachine Translation MT is only as good as the MT is only as good as the
globalization/i18n strategies of the globalization/i18n strategies of the software publisher:software publisher:“Machine translation has not been used widely in the localization industry, and this is largely due to the way in which software publishers develop their documentation” (Esselink, 2000 p. 360).
I18N
controlled languagesauthoring systems
software documents
help text
technical writers
L10Nlocalization
vendor
machine translation
dependent
Software Engineering Software Engineering ToolsTools
Used by i18n engineering as part of efforts Used by i18n engineering as part of efforts to internationalize the GUIto internationalize the GUI
Include IDE and specialized tools for Include IDE and specialized tools for finding, extracting, and sometimes finding, extracting, and sometimes replacing localizable resources (bitmaps, replacing localizable resources (bitmaps, strings, dialogs, menus)strings, dialogs, menus)
Note: it is a good idea to familiarize Note: it is a good idea to familiarize yourself with a few such tools! (Resource yourself with a few such tools! (Resource Hacker, Resource Tuner, PE Explorer, Hacker, Resource Tuner, PE Explorer, etc.)etc.)
Example: Extraction ToolExample: Extraction ToolNon-internationalized C code:Non-internationalized C code:#include <stdio.h>
main() {
int n; char y[5]; printf("This program converts decimal numbers to hexadecimal\n\
n"); while(1) { printf("\nEnter decimal number: ");
scanf("%d",&n); printf("\nNumber entered is <%d> decimal and <%x> hexa",n,n); printf("\nDo you want to continue? "); scanf("%s",y); if(strcmp(y,"yes")) { printf("\n exiting ..\n"); exit(); } } }
Extract Strings to Extract Strings to Message FileMessage File
#include <stdio.h> extern unsigned char *intl_m_msg(),
*intl_f_msg(); main() {
int n; char y[5]; printf(intl_m_msg("","mypg",1)); while(1) { printf(intl_m_msg("","mypg",2));
scanf("%d",&n); printf(intl_m_msg("","mypg",3),n,n); printf(intl_m_msg("","mypg",4)); scanf("%s",y); if(strcmp(y, (intl_m_msg("","mypg",6))) { printf(intl_m_msg("","mypg",5)); exit(); } } source.c}
mypg.en• "This program converts
decimal numbers to hexadecimal\n\n"
• "\nEnter decimal number:" • "\nNumber entered is <
%d> decimal and <%x> hexadécimal "
• "\nDo you want to continue?"
• "\nexiting ..\n" • "yes"
EXTRACTION TOOL
Translate Extracted Translate Extracted Message FileMessage File
#include <stdio.h> extern unsigned char *intl_m_msg(),
*intl_f_msg(); main() {
int n; char y[5]; printf(intl_m_msg("","mypg",1)); while(1) { printf(intl_m_msg("","mypg",2));
scanf("%d",&n); printf(intl_m_msg("","mypg",3),n,n); printf(intl_m_msg("","mypg",4)); scanf("%s",y); if(strcmp(y, (intl_m_msg("","mypg",6))) { printf(intl_m_msg("","mypg",5)); exit(); } } source.c}
mypg.fr• "Ce logiciel convertit les
nombres décimaux en hexadécimal\n\n"
• "\nSaisir le nombre décimal :"
• "\nLe nombre saisi est <%d> en décimal et <%x> en hexadécimal"
• "\nVoulez-vous continuer ?" • "\nSortie...\n" • "oui"
Extract & Translate
I18n
controlled languagesauthoring systems
software internationalizationtools
software documents
help text
resources
technical writers
L10n
localizablesoftware
distribution
localizationvendor
software engineers
Localization Project Localization Project ManagementManagement
In addition to translators’ tools, l10n vendors In addition to translators’ tools, l10n vendors use:use:
Project Management softwareProject Management software Enterprise editions of localization tools Enterprise editions of localization tools
(Catalyst)(Catalyst) Document/version management software: to Document/version management software: to
manage and track versions, updates, changesmanage and track versions, updates, changes Workflow management: to identify where in the Workflow management: to identify where in the
system a given piece of work is at a given timesystem a given piece of work is at a given time
L10n localizationproject
localizablesoftware
distribution
projectmanagement
tool
localizationtool
(enterprise)
workflowmanagement
document / versioncontrol
translators / localizers
project managerlocalization engineer
QA / testing /validation tools
Localization WorkflowLocalization Workflowlocalizablesoftware
distributiontranslate/localize
deliverlocalizedsoftware
extract
integrate
proofread
test
proofread
edit
remediate files/update TM
remediate files/update TM
Localizer/Translator Localizer/Translator ToolsTools
The PM creates localizable distribution for The PM creates localizable distribution for localizers (Catalyst toolkit, Passolo bundle, Word localizers (Catalyst toolkit, Passolo bundle, Word files)files)
Localizers use enterprise tools available in Localizers use enterprise tools available in translator versions (Alchemy Catalyst Lite translator versions (Alchemy Catalyst Lite Edition, Passolo Translator Edition, SDLX Lite Edition, Passolo Translator Edition, SDLX Lite Edition)Edition)
Localizers also use Localizers also use Computer-Assisted Computer-Assisted Translation (CAT)Translation (CAT) tools, including translation tools, including translation memory (TM) and terminology management toolsmemory (TM) and terminology management tools
localizationproject
localizablesoftware
distribution
localizationtool
(enterprise)
translators / localizers
project managerlocalization engineer
localizationtoolkit
(distribution)
localizationtool
(translator)
translationmemory
terminologymanager
Localization Tools: Localization Tools: AdvantagesAdvantages
L10n tools offer a number of advantages:L10n tools offer a number of advantages: Edit resources without using an IDEEdit resources without using an IDE Integrate TM and terminology managementIntegrate TM and terminology management Protect software code that shouldn’t be Protect software code that shouldn’t be changedchanged Support workflow functions that integrate with Support workflow functions that integrate with L10n L10n PM (review, sign-off) PM (review, sign-off)
The focus is on supporting resource editing of *.rc, *.res, and binary (*.dll, *.exe) resource files. Some tools (Catalyst) will support web page and WinHelp file localization as well.
Widely Used Localization Widely Used Localization ToolsTools
Alchemy Catalyst (www.alchemysoftware.ie)Alchemy Catalyst (www.alchemysoftware.ie) SDL Passolo [formerly PASS Engineering SDL Passolo [formerly PASS Engineering
Passolo PASS SOftware LOcalizer] Passolo PASS SOftware LOcalizer] (www.lspzone.com/(www.lspzone.com/en/products/software-localization/sdl-en/products/software-localization/sdl-passolo.asp)passolo.asp)
Multilizer (www.multilizer.com)Multilizer (www.multilizer.com) RC WinTrans (www.schaudin.com)RC WinTrans (www.schaudin.com) Visual Localize (www.visloc.com)Visual Localize (www.visloc.com)
Translation MemoryTranslation Memory A TM is a database containing A TM is a database containing
aligned source- and target-language aligned source- and target-language segments segments
A TM allows you to A TM allows you to leverageleverage previous work and ensure optimum previous work and ensure optimum consistencyconsistencyTRANSLATION
MEMORY
Segment 1Segment 2Segment 3, etc.
Localization & TM toolsLocalization & TM tools TM tools compare source/target TM tools compare source/target
segments and pull segments that meet segments and pull segments that meet minimum match criteriaminimum match criteria
L10n ToolString 1String 2String 3
TM tool compares source segments to contents of TM
TMTM pulls & inserts
matches into L10n tool
Translation Memory & Translation Memory & LocalizationLocalization
Localization is usually performed using a translation memory.
The decision to use a translation memory has to be made on the basis of volume and repeatability: do you have an expectation of continued commissions from the same client, and of stable domain and text type?
A translation memory may be made available to you by the localization vendor. (Relative quality of previous work TM =two-edged sword.)
The Power of LeveragingThe Power of Leveraging
48% of this file’s segments can be leveraged in whole or in part.
Translation Memory and Translation Memory and L10nL10n
Translation memories are useful in localization because of the short life span of versions / web sites and significant amounts of reusable material.
Version 1
Version 2 Version 3
translationmemory
new version uses 70% same text
initial translation with TM tool
30%change
latest version uses 80% same text as
previous
20%change
REUSABILITY / LEVERAGING
TM ToolsTM ToolsWidely-used TM tools include:Widely-used TM tools include: SDL Trados SDL Trados
(www.translationzone.com)(www.translationzone.com) STAR Transit (www.star-STAR Transit (www.star-
solutions.net)solutions.net) Atril Déjà Vu (www.atril.com)Atril Déjà Vu (www.atril.com) Wordfast (www.wordfast.net)Wordfast (www.wordfast.net)
TM & Terminology Tools TM & Terminology Tools (cont.) (cont.)
Term extraction Term extraction build termbase build termbase Alignment tool Alignment tool create a TM from legacy create a TM from legacy
translations not performed using TM toolstranslations not performed using TM tools
TM and termbase may require TM and termbase may require processing:processing:
translators / localizers
localizationtoolkit
(distribution)
localizationtool
(translator)
translationmemory
terminologymanager
term extractiontool
text alignmenttool
AlignmentAlignment Alignment : establishing equivalents between Alignment : establishing equivalents between
a source text and the corresponding target a source text and the corresponding target text, segment by segment. Resulting DB = TMtext, segment by segment. Resulting DB = TM
Alignment allows creation of TM from legacy Alignment allows creation of TM from legacy translations carried out without translation translations carried out without translation support software. support software.
Alignment requires a considerable amount of Alignment requires a considerable amount of manual intervention. manual intervention.
The decision to align, like the decision to use The decision to align, like the decision to use TM, depends on the parameters of a TM, depends on the parameters of a particular project.particular project.
AlignmentAlignmentLegacy source-
language documentLegacy target –
language document
(establish segmentation rules)
ALIGNMENT
EXPORT: TMX file containing
aligned segments (TUs)
L10n & Terminology L10n & Terminology Mgmt.Mgmt.
Localization is a specialized, Localization is a specialized, “terminologically dense” form of “terminologically dense” form of translation. translation.
Terminology managers are typically used Terminology managers are typically used along with TMalong with TM
SDL MultiTermSDL TermbaseSTAR TermStar
SDL Translator’s Workbench
SDLXSTAR Transit
The Localization The Localization DistributionDistribution
Program files (EXE and DLL) GUI resources (RC and RES) WinHelp (RTF), HTML Help or
WebHelp files Embedded graphics (screen captures,
splash screens, buttons, etc.) Printed documentation (TXT, DOC,
PDF) DTP