213
490 St-Joseph Blvd., Suite 102 Gatineau, Quebec, Canada J8Y 3Y7 [email protected] www.multicorpora.ca

MultiCorpora MultiTrans Pro User Manual v37 English

  • Upload
    leon

  • View
    9

  • Download
    0

Embed Size (px)

DESCRIPTION

MultiCorpora MultiTrans User Manual v37 English

Citation preview

  • 490 St-Joseph Blvd., Suite 102 Gatineau, Quebec, Canada J8Y 3Y7

    [email protected] www.multicorpora.ca

  • i

    Table Of Contents INTRODUCTION ................................................................................................................... 1

    Welcome to MultiTrans ......................................................................................................... 1 System Requirements .......................................................................................................... 2

    GETTING STARTED............................................................................................................. 3 MultiTrans Modules .............................................................................................................. 3 MultiTrans Primer ................................................................................................................. 4 The Initial Screen.................................................................................................................. 5 Getting Help.......................................................................................................................... 7 Contacting Us ....................................................................................................................... 8

    TRANSCORPORA MODULE ................................................................................................ 9 Introducing the TransCorpora Module - Full-text multilingual repository.............................. 9 Working with the TransCorpora Module ............................................................................. 10 Creating a New Corpus ...................................................................................................... 11

    Using TransCorpora Builder ............................................................................................ 11 TransCorpora Builder Statistics....................................................................................... 19 Conversion Errors............................................................................................................ 22 Alignment Considerations................................................................................................ 23 Terminology Extraction.................................................................................................... 27 Concordance List............................................................................................................. 28 Input Files ........................................................................................................................ 29 Abbreviation Files ............................................................................................................ 29

    Searching in TransCorpora................................................................................................. 32 Using TransCorpora Search............................................................................................ 32 TransCorpora Search Screen Display............................................................................. 33 Searching in TransCorpora Files..................................................................................... 36 Searching for an expression in a source or target file ..................................................... 39 Printing Search Results ................................................................................................... 39

    Importing and Exporting...................................................................................................... 40 Exporting Corpus Alignments to a TMX File.................................................................... 40 Exporting Corpus Alignments to a TermBase ................................................................. 41 Exporting Corpus Alignments to a Text File .................................................................... 41 Importing a TMX File into a Corpus................................................................................. 42

    Other TransCorpora Functions........................................................................................... 43 Aligning sentences in a TransCorpora file....................................................................... 43 Adding Expressions to a TermBase from a TransCorpora file ........................................ 44 Switching Between TransCorpora and TermBase Windows........................................... 46 Saving and Closing TransCorpora files........................................................................... 46 Reversing Language Direction ........................................................................................ 47

  • ii

    Appending Document Sets to a Corpus .......................................................................... 48 Removing Document Sets from a Corpus ....................................................................... 49

    Client-Server Considerations.............................................................................................. 51 Corpora Server Console.................................................................................................. 51 Opening TransCorpora files from the Server................................................................... 52 Creating a new TransCorpora file on the Server ............................................................. 52 Specifying an Ignored Words List to Improve Performance and Scalability.................... 53

    Accessing Corpora from the Web....................................................................................... 54 Welcome to TransCorpora Web ...................................................................................... 54 Accessing TransCorpora Web......................................................................................... 55 Selecting a Corpus .......................................................................................................... 56 Performing a Search........................................................................................................ 57

    TERMBASE MODULE......................................................................................................... 63 Introducing the TermBase Module - Terminology management ........................................ 63 TermBase Editor Screen Display........................................................................................ 64 Basic TermBase Operations............................................................................................... 67

    Opening a TermBase ...................................................................................................... 67 To Open Multiple TermBases.......................................................................................... 68 Closing a TermBase ........................................................................................................ 68 Creating a TermBase ...................................................................................................... 69 Deleting a TermBase....................................................................................................... 70 Removing a TermBase.................................................................................................... 70

    Updating TermBase Data ................................................................................................... 71 Adding an Expression to a TermBase ............................................................................. 71 Adding a Translation to an Expression............................................................................ 72 Adding a Synonym to a TermBase.................................................................................. 74 Duplicating Records ........................................................................................................ 74 Deleting an Expression or a Translation.......................................................................... 75 Modifying a Translation ................................................................................................... 76 Modifying/Renaming an Expression................................................................................ 77

    Searching in a TermBase ................................................................................................... 78 Expression Filter Manager............................................................................................... 79

    Importing and Exporting...................................................................................................... 81 Importing from Tagged Lists............................................................................................ 81 Importing Other Text File Formats................................................................................... 85 Importing TMX (Translation Memory Exchange) Data .................................................... 91 Exporting Expressions From a TermBase....................................................................... 93

    Other TermBase Functions................................................................................................. 96 Adding New Fields to a TermBase .................................................................................. 96 Saving Expression Lists ................................................................................................ 100 Switching Between TermBases..................................................................................... 101 Switching Languages in a TermBase............................................................................ 102

  • MultiTrans User Manual - Table Of Contents

    iii

    Viewing TermBase Properties ....................................................................................... 103 Read-Only TermBases .................................................................................................. 103 Copying an Expression From One TermBase to Another ............................................. 104

    Client-Server Considerations............................................................................................ 105 Opening TermBases from the Server............................................................................ 105 Creating new TermBases on the Server ....................................................................... 106

    Accessing TermBases from the Web ............................................................................... 107 Welcome to TermBase Web.......................................................................................... 107 Accessing TermBase Web ............................................................................................ 108 Selecting a TermBase ................................................................................................... 109 Performing a Search...................................................................................................... 110

    TRANSTERM MODULE.................................................................................................... 111 The TransTerm Module - Integrated translation workbench............................................. 111 TransTerm Functions........................................................................................................ 112 Connecting/Disconnecting to/from MultiTrans.................................................................. 114 Using the TermBase Process to Automatically Replace Expressions from a TermBase. 115 Looking for Translations in Reference Documents........................................................... 116 Interactive Translation Process ........................................................................................ 117 Finding Translations in TermBases and/or Reference Documents.................................. 120 Extracting Expressions from Reference Documents ........................................................ 122 Viewing the Contents of a TermBase ............................................................................... 123 Setting TransTerm Options............................................................................................... 124 Resetting Document Colors.............................................................................................. 129 Counting Translated Words in a Document...................................................................... 130

    LISTBUILDER MODULE ................................................................................................... 131 Using ListBuilder............................................................................................................... 131

    DOUBLEVUE .................................................................................................................... 137 The DoubleVue Menu....................................................................................................... 137 Creating a DoubleVue Session......................................................................................... 138 Opening an Existing DoubleVue Session......................................................................... 139 Closing a DoubleVue Session .......................................................................................... 140 Following a Sentence in the Source File .......................................................................... 141 Replacing with an Original Sentence................................................................................ 141 Exact Search and Replace ............................................................................................... 141 Exporting a Session to a Folder........................................................................................ 142 Exporting a Session to an E-mail...................................................................................... 144 Importing a Session from a Zip file ................................................................................... 145

    WORKING WITH PROJECTS........................................................................................... 147 Working with Projects ....................................................................................................... 147 Project Manager - Information about the Project.............................................................. 148

  • iv

    Viewing Project Properties................................................................................................ 149 Creating a Project .............................................................................................................149 Opening a Project ............................................................................................................. 150

    ANALYSIS......................................................................................................................... 151 Document Analysis ........................................................................................................... 151 Analysis Process Options ................................................................................................. 159 Analysis Features ............................................................................................................. 161

    Statistics ........................................................................................................................ 162 Terminology Extraction.................................................................................................. 163 TermBase Extraction ..................................................................................................... 165 Corpus Extraction .......................................................................................................... 166 Making a Package ......................................................................................................... 167

    PREFERENCES................................................................................................................ 169 Setting Your Preferences.................................................................................................. 169 File Locations.................................................................................................................... 170 General ............................................................................................................................. 172 Logon................................................................................................................................ 173 Languages ........................................................................................................................ 174 Network............................................................................................................................. 176 TermBase ......................................................................................................................... 177 TransCorpora.................................................................................................................... 178 TransFinder ...................................................................................................................... 179 TransTerm ........................................................................................................................ 180 Windows NT Network - Defined Access Rights................................................................ 181 MultiTrans CS ................................................................................................................... 182

    FILE TYPES ...................................................................................................................... 183 MultiTrans File Types ....................................................................................................... 183

    MENUS ............................................................................................................................. 185 MultiTrans Menus ............................................................................................................. 185 File .................................................................................................................................... 185 Edit.................................................................................................................................... 186 Process............................................................................................................................. 186 TermBase ......................................................................................................................... 187

    SHORTCUT KEYS ............................................................................................................ 189 Shortcut Keys ................................................................................................................... 189

  • MultiTrans User Manual - Table Of Contents

    v

    FREQUENTLY ASKED QUESTIONS (FAQ)..................................................................... 193 Why are my TransTerm menu functions greyed out?....................................................... 193 How do I assign Word shortcuts for TransTerm Processes? ........................................... 194 How do I keep Word macros when refreshing normal.dot?.............................................. 195 How do I add a synonym to an expression in my TermBase?.......................................... 195 Why can't TransCorpora Search find an expression? ...................................................... 196 How do I Append to TransCorpora files on the server?.................................................... 196

    TROUBLESHOOTING....................................................................................................... 197 TransTerm won't respond................................................................................................. 197 TermBases appear as read-only ...................................................................................... 197 Error message when trying to open a TransTerm session............................................... 198 Cannot open .tcs file ......................................................................................................... 198 Import not found................................................................................................................ 199 Cannot uninstall MultiTrans .............................................................................................. 199 Highlights not revealed after using TransCorpora process .............................................. 199 The .tcs file disappears on Windows XP, SP1.................................................................. 200 TransTerm menu doesn't appear in Word after installation.............................................. 201 Manually Installing the TransTerm Module....................................................................... 202

    INDEX ............................................................................................................................... 203

  • 1

    Introduction Welcome to MultiTrans

    MultiTrans is a translation support and language management solution that is based on a multilingual full-text repository of previously translated content. It has helped global organizations and language industry professionals to improve translation productivity and quality for all types of content. Unlike traditional translation memory tools which are based on a database of isolated whole sentences, MultiTrans makes vast collections of legacy full-text translations searchable for text strings of any length in their full usage context. Its interactive search capabilities provide users with the most relevant information, maximizing language resource reuse. The complete MultiTrans solution consists of:

    1. An indexed full-text multilingual reference corpus and tools to build and search the corpus.

    2. Terminology management tools, including a multilingual terminology repository and terminology extraction tools.

    3. An integrated translation workbench that integrates with popular word processing environments to provide a variety of manual and automated capabilities to search multiple corpora and terminology repositories for examples of previous translations that are candidates for reuse.

    4. A collaborative multi-user infrastructure for sharing language resources in real-time, including web browser access for the entire enterprise.

    Since MultiTrans is Unicode compliant and does not rely on linguistic knowledge, it can manage content in any written language, including bidirectional languages such as Arabic as well as ideographic languages such as Chinese.

    IMPORTANT NOTE:

    Before you start using MultiTrans, please take a moment to read The Initial Startup Screen, for a brief explanation of the application's main visual components.

  • 2

    System Requirements

    Minimum Requirements

    Windows 98, ME, NT (service pack 4 or higher), 2000 or XP

    Internet Explorer 5.0 or higher

    Pentium II 300 MHz

    At least 64 MB of free memory (more may be required depending on data volumes)

    800x600 minimum resolution

    20 MB of disk space for local installation

    CD-ROM drive

    VGA monitor (or higher-resolution) capable of displaying at least 256 colors

    mouse

    Additional items or services required using certain features:

    28,800 bps modem (or faster) and Internet access for online updates

    Microsoft Word 97 or higher

    Recommended Requirements

    To use MultiTrans for maximum performance, we recommend the following configuration: PC with a Pentium III 450 MHz processor or higher

    For Windows 98: 128 MB RAM

    For Windows XP: 256 MB RAM

    SVGA monitor (or higher resolution) capable of displaying at least 256 colors in 1024 x 768 resolution. (We also recommend a 17-inch monitor or higher.)

  • 3

    Getting Started MultiTrans Modules

    MultiTrans contains five modules. They are:

    TransCorpora: Use this module to build and search through corpus files, then use these corpora to create terminology databases and increase your translating speed.

    TermBase: Use this module to build terminology databases (also called "TermBases"); these may then be used when translating documents using TransTerm.

    TransTerm: Use this add-in within your editor (e.g. Microsoft Word) to automate some of your translation tasks, using your previously created corpus files and TermBases.

    Analysis: Use this module to assess the effort of translating a new set of documents and determine the best translation approach. It provides statistics such as word counts, amount of repetition, expressions found in terminology databases, and sentence coverage from existing corpora.

    ListBuilder: Use this module to automatically pair up your previously translated documents in preparation for building a corpus.

  • 4

    MultiTrans Primer

    You can use MultiTrans in a wide variety of ways. It is the perfect translation support and language management solution for all your translation and terminology research needs. The following is a list of tasks you can do with MultiTrans:

    Set up MultiTrans

    Index reference documents into a TransCorpora file. To do this use the TransCorpora Search module.

    Use ListBuilder to add reference documents to your TransCorpora files.

    Store and manage monolingual and multilingual terminology in flexible databases using the TermBase module.

    Translate a document

    Translate a document and build a TermBase all in one. To do this, use the TransTerm module.

    Automatically compare a document to be translated against reference documents. To do this, use the TransTerm module.

    Automatically replace all known terminology in a document using the terminology banks you created. To do this, use the TransTerm module.

    Work with legacy documentation

    Align documents for later use with MultiTrans. To do this, use the TransCorpora Search module.

    Search for expressions in previously translated documents. To do this, use the TransCorpora Search module.

    Work with terminology repositories

    Add, modify, and/or delete terms/expressions to an existing or new terminology bank. To do this, use The TermBase Module.

  • MultiTrans User Manual - Getting Started

    5

    The Initial Screen

    The first time you start MultiTrans, you will be asked to login with a user name. Enter a user name into the dialog window that appears upon launching the application:

    Click Start to launch the main window of MultiTrans. The main screen consists of a Menu, a Toolbar, a Process bar and the Main Working Area.

  • 6

    Menus

    See Menus and Shortcut Keys for more information on the MultiTrans menus.

    Toolbar

    The MultiTrans toolbar provides buttons which allow you to access the application's most frequently used menu-option commands. To activate a command from the toolbar, simply click on the appropriate button. You can hide or show the toolbars from the View menu.

    Process bar

    The Process bar is located on the left side of your screen. This bar allows you to quickly access any MultiTrans module except TransTerm, as well the main options of the program. You can hide/show the process bar from the View menu.

    Main Working Area

  • MultiTrans User Manual - Getting Started

    7

    This is where you will be doing most of your work with MultiTrans. The main working area initially displays a Quick Help sheet which gives you general information on how to use MultiTrans.

    Getting Help

    There are various ways you can get help on MultiTrans:

    Access the Help file by pressing F1 at any moment in the software.

    Go to the Help menu and choose Contents or Index to find your way through the Help files.

    Refer to your User Manual.

    E-mail us your questions at [email protected].

    Fax us a description of your problem at 819-778-0801.

    Call our Customer Support Service at 819-778-7070.

  • 8

    Contacting Us

    If you have any questions and/or concerns about this product or any other MultiCorpora product, please do not hesitate to contact us:

    Web:

    Our URL is: http://www.multicorpora.com

    Email:

    For program help: [email protected]

    To offer suggestions: [email protected]

    For general information: [email protected]

    Regular Mail

    MultiCorpora R&D Inc. 490 St-Joseph Blvd., Suite 102 Gatineau, Quebec, Canada J8Y 3Y7

    Telephone

    (819) 778-7070 (9 A.M. to 5 P.M. EST)

    Fax

    (819) 778-0801

  • 9

    TransCorpora Module Introducing the TransCorpora Module - Full-text multilingual repository

    MultiTrans enables users to easily build and maintain indexed repositories (corpora) of cross-referenced monolingual or multilingual content from legacy documents. The corpus-builder automatically aligns legacy translated documents based on user-defined file nomenclature. It then automatically extracts, indexes and aligns the complete text. The full-text indexing enables text strings of any length (words, sub-sentence expression, whole sentences or paragraphs) to be subsequently searched and retrieved. Statistical alignment algorithms establish links between the equivalent sentences in multiple language versions of the same content. The alignment process allows you to see the whole translation in full context view so that misalignments are easily spotted and modified. Each expression is highlighted and presented in parallel windows that you can scroll through. Because of this context and ability to correct misalignments as they work, translators can begin using a corpus within minutes of its automatic creation. The quality of the corpus actually improves over time with each usage. Using the TransCorpora module in MultiTrans, a translator can build large corpora of previous translations rapidly and efficiently. Besides previous in-house translation projects, additional sources of translated text can also be easily added to a corpus, including published web content. The benefits of such capabilities are enormous; if content is found on the Internet that is relevant to a project, it can be downloaded and used in a translation process immediately.

  • 10

    Working with the TransCorpora Module

    The TransCorpora module allows you to quickly search through your legacy (i.e. previously translated) documents for translations and/or equivalents. It allows you to build your own terminology banks (known as TermBases see The TermBases Module) by searching for corresponding terms in your reference documents. TransCorpora includes three sub-modules (also referred to as processes):

    The TransCorpora Builder process Use this tool to import your reference documents into MultiTrans by creating TransCorpora files. These files may contain one or more multilingual document sets, consisting of a source document with translations in many languages. Once you have created a TransCorpora file, you are then ready to use the TransCorpora Search process.

    The TransCorpora Search process Use this tool to search for a word or an expression in your source documents at the sentence level. Once you find a word/expression, select its equivalent from the suggested translated sentence in the corresponding target document, and add both to your active TermBase for later use. A variety of search forms is available, including exact and fuzzy matching.

    The TransCorpora Imports process Use this tool to import from TMX files.

    NOTE:

    In general, you should build TransCorpora files around a specific subject or field (for example, engineering, marketing, technology, law, etc.).

  • MultiTrans User Manual TransCorpora Module

    11

    Creating a New Corpus Using TransCorpora Builder

    Adding Reference Documents to a Corpus

    Adding reference documents to a corpus is a simple and straightforward process, thanks to our easy-to-use TransCorpora Builder Wizard.

    NOTE:

    Before using the TransCorpora Builder, close any MS Word/WordPerfect programs you may have opened.

    To add reference documents to a corpus

    1. Select TransCorpora Builder, either from the Process menu, the Quick Help Page, or the Process bar. The first dialog box to be displayed is Step 1 - Language Selection dialog box:

  • 12

    2. Select the languages to be used in the corpus. A corpus may contain more than two languages, but only two languages may be viewed at one time in TransCorpora Search. You can also change the languages listed by selecting Modify the language list.

    3. When you have selected the languages to be used, click Next. This displays the Step 2 - Encoding Selection dialog box:

    4. Select the encoding for your source and target documents. Your choices are ISO Latin and Unicode (UTF-16). Click Next. This displays the Step 3 - Input File(s) Selection dialog box:

  • MultiTrans User Manual TransCorpora Module

    13

    5. Select the documents you wish to include in your corpus. The simplest way to do this is to click Open List and select a list of document names. If no such list exists, you may enter the names manually as follows:

    o Click the Add button to insert a row in the displayed list o Click each language column in the row one at a time, and for each

    language select the file from a folder/directory on your computer (or a Zip drive, CD-ROM, etc.)

    o Repeat this process for each set of documents you wish to add. o When you have finished adding documents, click Next. This displays

    the Step 4 - Output Method Selection dialog box:

  • 14

    NOTE:

    MultiTrans can index the following file formats: text, rtf, doc, ppt, xls, xml, html, pdf, wpd, along with any file that MS Word and WordPerfect can open and save as text. Unicode and ISO-Latin are recognized in all these formats. MultiTrans will recognize normal ASCII files (usually *.txt) as well.

    6. You can now either create a new corpus or append to an existing corpus.

    o To create a new corpus: Select the Create new TransCorpora file radio button. Input your new corpus name.

    o To append to an existing corpus: Select the Append to existing TransCorpora file radio button.

  • MultiTrans User Manual TransCorpora Module

    15

    Add the appropriate corpus by clicking the icon (on the right of the blank field). Select the appropriate name in the TransCorpora Search Manager dialog box, then click Open.

    7. Click Next. This displays the Step 5 - Sentence Delimitation dialog box:

    8. To specify one or more abbreviation files, click the Configure Abbreviations button.

    9. If your corpus includes PDF files, specify the appropriate PDF carriage return type setting.

    10. Click Next. This displays the Step 6 - Auto-Alignment dialog box.

  • 16

    11. By default TransCorpora Builder will automatically align your indexed documents. If you do not wish it to do so, simply uncheck the Use Auto-Alignment Process box. See also Customizing the Auto-Alignment Process.

    12. If the corpus contains more than two languages, you may choose to align only selected language pairs. For example, a corpus with 15 languages contains 105 pairs, and it is unlikely that all pairs would be used in translation.

    13. Click Next. This displays the Step 7 - Terminology Extraction dialog box:

  • MultiTrans User Manual TransCorpora Module

    17

    14. Check the Create Terminology Extraction box if you want TransCorpora Builder to extract expressions containing two words (or more) with a frequency of more than 1 in the reference documents. To customize the settings for this option, click the Customize button. This will display the Terminology Extraction Options dialog box. See Terminology Extraction Options for more information on this feature.

    15. Click Next. The Step 8 - Build Process dialog box is displayed.

  • 18

    The window will contain one tab per language, and an Auto-Alignment tab.

    16. Click Build.

    17. Click Open to start a TransCorpora Search session with the newly processed TransCorpora file. This will start the TransCorpora Search wizard (see Using TransCorpora Search).

    NOTES:

    Your documents do not need to be perfectly aligned before use, since MultiTrans looks through the full text. It can also assist you later on in aligning your documents (see Aligning Sentences in a TransCorpora file).

    Once you have built your corpus, you may then use it with TransCorpora Search to search for expressions, or to extract translations and build a TermBase.

  • MultiTrans User Manual TransCorpora Module

    19

    TransCorpora Builder Statistics The TransCorpora Builder Statistics window provides detailed information about TransCorpora Builders processing of input documents. After you have built a corpus, a window such as the following is displayed:

    The window consists of one tab per corpus language, one for Terminology Extraction and one for Auto-Alignment. Each language tab has three separate areas - Statistics, Progress status, and Progress indicator.

    In the Statistics area you can view information such as the number of files, sentences, words and unique words.

    In the Progress status area you can view details of the build progress.

  • 20

    The Progress indicator area shows you in seconds (or milliseconds) the time it took to build your file.

    The Terminology Extraction tab looks like this:

    In this tab are three separate areas - Terminology extraction, Progress status, and Progress indicator.

    In the Terminology extraction area you can view the file characteristics of the source and target files.

    In the Progress status area you can view details of the build progress.

    The Progress indicator area shows you in seconds (or milliseconds) the time it took to build your file.

  • MultiTrans User Manual TransCorpora Module

    21

    Last is the Auto-Alignment tab:

    In this tab are three separate areas - Auto-Alignment, Progress status, and Progress indicator.

    In the Auto-Aligning area you can view the results of the auto-alignment process.

    In the Progress status area you can view details of the build progress.

    The Progress indicator area shows you in seconds (or milliseconds) the time it took to build your file.

  • 22

    Conversion Errors MultiTrans will try to automatically resolve any errors that may occur in the conversion of MS Word, MS PowerPoint, MS Excel, Corel WordPerfect, WordPro, HTML, XML and PDF files. When it is unable to resolve conversion problems, due to a problem such as a corrupted file or password protected file, MultiTrans will skip the document set containing the problem. At the end of the conversion process, before indexing starts, an error report dialog will list any files that contained errors, and briefly explain the problem.

    To Continue with the indexing of your documents

    Click on Continue.

    To Save this report for future reference

    Click on Save.

    To Cancel the corpus building process

    Click on Cancel.

    To Print this report for future references

    Click on Print.

  • MultiTrans User Manual TransCorpora Module

    23

    Alignment Considerations

    Auto-Alignment Process

    The Auto-Alignment Process is used to re-align documents in the current corpus. This may be useful if you did not select auto-alignment when creating the corpus, or you have updated the matching word list since creating the corpus, or you wish to select different auto-alignment options.

    Customizing the Auto-Alignment Process

    The very powerful Auto-Alignment Process feature in MultiTrans allows you to automatically align your source and target documents at a very high accuracy rate. You can easily customize this feature when running the TransCorpora Builder Wizard.

    To customize the auto-alignment process

    1. Click the Customize button in the Step 6 - Auto-Alignment dialog box. This displays the Auto-Alignment Options dialog box.

  • 24

    o Accuracy Tab: Check the appropriate button. Choices are: Align sentences with at least one candidate, Align sentences containing multiple candidates

    NOTE:

    The Auto-Alignment process searches the source document for sentences containing numerical expressions or words from the Matching Word list. These are called "candidates". Sentences containing one or more candidates will produce more accurate alignments than sentences which contain only one candidate. However, if you align sentences containing only one candidate, you may not obtain a satisfactory amount of alignments.

    o Matching Word List Tab: You can instruct the Auto-Alignment process to create a list of matching words, to increase the effectiveness of the process. In this context, matching words are words which have identical spellings in both source and target languages. This includes numerical expressions which your documents may contain. The Use Matching Word List box is checked by default. Clear this box if you do not wish to use the list of matching words.

    2. Click OK in the Auto-Alignment Options dialog box to return to the TransCorpora Builder Wizard.

  • MultiTrans User Manual TransCorpora Module

    25

    Alignment Analysis The alignment analysis function allows you to analyze the quality of the alignment bookmarks in your corpus. The alignment analysis will find all areas in your corpus where there are more than a certain number of consecutive sentences without any alignment bookmarks. These areas where no alignment is present usually represent a problematic area in your text in which the potential of misalignment is greater. As an example, misalignment may be caused by a paragraph missing from the target text. The alignment analysis function is available from the TransCorpora Search menu.

    To use the alignment analysis function

    1. From the TransCorpora Search menu, select Alignment Analysis. A dialog box will open, containing the value of the gap size to look for. We suggest starting with a high value such as 100.

    2. Click OK.

    3. The alignment analysis report will then appear, listing the alignment bookmarks that follow a gap of 100 sentences without bookmarks. By identifying alignment gaps, this report can help you identify potential alignment problems.

    4. Save the report, by clicking Save Report.

    To go to an alignment that follows a large gap

    Click Go to Gap, or double click on any item listed.

  • 26

    Aligned Word List While building a TransCorpora file, or afterwards, the Auto-Alignment Process can be selected, allowing MultiTrans to automatically align your documents. However, MultiTrans also allows you to customize how it aligns your documents, by letting you specify expressions and translations that match each other.

    To customize the alignment process

    1. Create a file called AlignWordList...lst. For example, if the languages in your TransCorpora file are Russian to French, create a file called AlignWordList.Russian.French.lst.

    2. Using a text editor such as Notepad, add words or expressions that MultiTrans will be able to match during the alignment process. Add one pair of words or expressions per line, separated by a comma. For example:

    map, grille

    hard work, travail assidu

  • MultiTrans User Manual TransCorpora Module

    27

    Terminology Extraction TransCorpora Builder's robust Terminology Extraction feature generates a list of expressions of two (or more)consecutive words that occur more than once in your corpus. You can view this "Concordance List" when using the TransCorpora Search process (Terminology tab, Concordance List sub-tab). See Concordance List for more information. Quicker Access to Recurring Expressions With the Terminology Extraction feature, you can get a quicker view of the recurring expressions and terminology contained in your previous translations, or even in documents that have yet to be translated. The Concordance List generated by this feature also displays the frequency of each expression in the document(s). This list can be very useful when building a terminology database from previously translated documents, since it allows you to concentrate on the most frequent expressions, the ones that can best increase your performance when pre-translating documents with TransTerm's TermBase Process. Expressions in TermBase Also included in the Concordance List is information about the Termbase. A paper clip icon ( ) in the first column means that an expression is already included in the active Termbase. Therefore, when adding expressions to a TermBase, you can easily hide all translations with a paper clip, to concentrate only on your new expressions. Using Terminology Extraction to Speed Up Your Translation Tasks Terminology Extraction is also very useful for your documents that have yet to be translated. If, for instance, you want to get an idea of the recurring expressions contained in a document to be translated, simply create a monolingual corpus. This will give you a list of expressions contained in the document, and you will also know which expressions/words are already contained in your active TermBases. You may then take a few minutes to include frequent expressions to ensure a better yield in the TermBase process, since adding expressions once in the TermBase at that stage will allow translate automatically all the occurrences with TransTerm's TermBase Process.

  • 28

    Concordance List The Concordance List sub-tab displays expressions of two words or more (with a frequency of at least 2) that have been identified by the Terminology Extraction feature (see Using TransCorpora Builder). You can view where a specific concordance occurs in documents by double-clicking it in the list.

    Frequency

    The Frequency column (at the right of the Expression column) shows you how many times the expression appears in the document(s) included in the active corpus.

    Expressions Already in TermBase

    Expressions of two words or more from the Concordance List that are identified with the paper clip icon in the first column are already included in the active TermBase.

    If you wish to view only those expressions in the Concordance List that are not included in the active TermBase, simply right-click anywhere in the list and select Current View > Hide expressions found in TermBase.

    Saving the Concordance List to File

    You may wish to save a concordance list to a text file for later use.

    To save the concordance list to file

    1. Right-click anywhere in the Concordance List area.

    2. In the following context menu, select Save list as text.

    3. Type in a filename in the Save as dialog box, and click Save.

    NOTES:

    You can view this list by clicking the Concordance List sub-tab from the TransCorpora tab.

  • MultiTrans User Manual TransCorpora Module

    29

    Input Files Input files are grouped in sets that normally consist of a source document with one or more translations. For instance, if you want to include several documents translated from English to French, you need to match the original (source) English document with each translated (target) French document. You may also create a monolingual corpus. For example, this could be used to identify which expressions in a document are contained in a selected TermBase. Reference documents can be indexed from the following file formats: text, rtf, doc, ppt, xls, xml, html, pdf, wpd, lwp (WordPro) as well as anything Word and WordPerfect can open and save as text. MultiTrans will recognize normal ASCII files (usually *.txt) as well. They can be retrieved from any medium (network drive, CD-ROM, disk, etc), and their equivalent input files are copied in the TransCorpora folder.

    Abbreviation Files Abbreviation files helps TransCorpora Builder separate text into sentences for alignment purposes. TransCorpora Builder contains predefined punctuation rules which define sentence delimiters. However, sentence delimiters such as periods may also appear in abbreviations. To help distinguish between a period at the end of a sentence and a period in an abbreviation, MultiTrans refers to an abbreviation file (one for each language). When TransCorpora Builder finds an expression that is in the abbreviation file, it recognizes that the delimiter does not mark the end of a sentence. For instance: Without the abbreviation file: Mr. John Doe is interpreted as two separate sentences. With the abbreviation file (containing Mr.): Mr. John Doe is interpreted as only one sentence.

    NOTE:

    Some abbreviations will automatically be recognized by the system. For example, R.C.M.P. will be a recognized abbreviation, since the first letter in the expression is directly followed by a period.

  • 30

    Modifying and Creating an Abbreviation File A TransCorpora Builder abbreviation file is an ASCII document containing abbreviations related to a certain language. You need one abbreviation file for each language you will be working with.

    To modify an existing abbreviation file

    1. From the File menu, select Preferences.

    2. Click the TransCorpora tab.

    3. Click the Abbreviation Editor button. This will display the Abbreviations dialog box.

    4. Click the Open file button. In the Open dialog box, select the abbreviation file you wish to modify, and then click Open.

    NOTE:

    Abbreviation files have a *.abv extension. At this point, you may either add new abbreviations to the opened file or modify existing ones.

    To create a new abbreviation file

    1. From the File menu, select Preferences.

    2. Click the TransCorpora tab.

    3. Click the Abbreviation Editor button. This will display the Abbreviations dialog box.

    4. Type in a new abbreviation in the Abbreviation field, then enter a Description, and click the Add button. Add as many abbreviations as you wish.

    5. When you have finished adding abbreviations, click the Save file button.

    6. In the Save as dialog box, enter a file name, then click Save.

    To add a new abbreviation:

    o Type the new abbreviation in the Abbreviation field, then enter a description, and click the Add button. You may add as many abbreviations as you wish.

  • MultiTrans User Manual TransCorpora Module

    31

    To modify an existing abbreviation:

    1. In the list of existing abbreviations, click on the entry you wish to modify. This will display the desired abbreviation and its description in the Abbreviation and Description fields.

    2. Make the necessary changes, and then click Add.

    To remove an abbreviation:

    1. In the list of existing abbreviations, click on the entry you wish to remove, then click the Remove button.

    2. Click the Save file button to save the file as modified.

  • 32

    Searching in TransCorpora Using TransCorpora Search Before you can search for source/target expressions in one or more TransCorpora files, you first need to open a TransCorpora session. Opening a TransCorpora session involves selecting one or more TransCorpora files, as well as one or more TermBase files to link your corpus. Consequently, when searching through TransCorpora files, you can select a source expression and its target equivalent, and add them to a TermBase. Your TermBase files are continually evolving and growing.

    To Open a TransCorpora Session

    Select TransCorpora Search from the Process menu, the Quick Help Page, or the Process Bar. This displays the Step 1 - Select TransCorpora File dialog box.

    Step 1: o Select TransCorpora files by clicking the icon to the right of the TransCorpora

    File field. This displays the TransCorpora Search Manager dialog box.

    o To add TransCorpora files to the list of files to open, select them with your mouse while holding down the Ctrl key, then click the Open button. This will add them to the TransCorpora File field of the Step 1 - Select TransCorpora File dialog box.

    o Click the Add button in the Step 1 - Select TransCorpora File dialog box. o When all the desired files are in the list, click the Next button in the Step 1 -

    Select TransCorpora File dialog box.

    NOTE:

    You can also open a list of TransCorpora files by clicking the Open List button.

    Step 2: o Select the TermBase(s) you wish to open and link to the selected TransCorpora

    file(s). These TermBases are where you will be able to add translated expressions which you will extract from your TransCorpora files.

    Step 3: o Select Start to open all files. o Click OK when you have viewed all statistics. This will open the TransCorpora

    Search window.

  • MultiTrans User Manual TransCorpora Module

    33

    TransCorpora Search Screen Display When TransCorpora Search is opened, the main working area of MultiTrans is divided vertically into two panels.

    TransCorpora Search Window

    The left panel contains two interrelated sections: Search and Information Center. The right panel also contains two interrelated sections: Source and Target.

  • 34

    The Information Center section contains two tabs, respectively titled TransCorpora and Search Results. The TransCorpora tab displays information about each TransCorpora file. If multiple TransCorpora files are opened, a drop-down menu (just under the TransCorpora tab) allows you to select which TransCorpora file to display information about. The TransCorpora tab is comprised of three sub-tabs (in the bottom part of the section):

    The Unique List sub-tab displays all the unique words in the TransCorpora files, with their respective frequency.

    The File List sub-tab displays all the source files that are included in the TransCorpora File.

    The Alignment sub-tab displays the alignments that were performed throughout the TransCorpora File.

    The left panel also contains the Search utility. At the top of the section, you can type an

    expression, select the filter, press Enter or click on the Search icon . TransCorpora Search will then identify all sentences which contain either the exact expression, or all the searched words. The search results will be displayed in the Search Results tab of the Information Center section. The top part of the tab displays the found expressions and their frequency, whereas the bottom part of the tab lists every target sentence.

    Icons

    Below is a table listing the various icons you will see when in the TransCorpora workspace. These icons are located on the Toolbar.

    Icon Name Description

    Search search through opened TransCorpora files

    Associate associates an expression with a translation

    Previous Sentence

    moves the yellow highlight in sentence alignment to the previous sentence

    Follow Sentence at Cursor

    places the yellow highlight wherever the cursor is situated

    Next Sentence moves the yellow highlight in sentence alignment to the next

    sentence

  • MultiTrans User Manual TransCorpora Module

    35

    Align Sentence

    aligns sentences in source and target documents; shows up as yellow highlight

    Delete Alignment

    deletes an alignment

    Insert in TermBase

    allows you to insert the selected source and target expressions in the active TermBase

    Edit fields/Insert in TermBase

    allows you edit a selected field and to insert the selected source and target expressions in the active TermBase

    NOTES:

    Any text that you highlight in the Source and Target sub-sections will be added as source and target expressions in the active TermBase (Ctrl+Ins being the shortcut to use when adding expressions or words in a TermBase).

    Automatically aligned corresponding sentences are color-coded in your source and target documents for faster access. You can adjust these alignments manually, if needed. See Aligning Sentences in a TransCorpora File for more information.

  • 36

    Searching in TransCorpora Files

    To search in all opened TransCorpora files:

    1. Click on the Search tab in the TransCorpora Search window. 2. Type in the expression in the Search field.

    3. Select the desired Search Options:

    o Consecutive Words: The Consecutive Words option will only search for expressions containing the exact expression entered. For instance, if the expression is composed of two words, the search results will only display results containing these exact two words in sequence. If this option is not checked, TransCorpora Search will display every sentence containing the words entered in the Search field, and in no particular order. The only requirement is that all the words must be present in the same sentence.

    o Partial Match(es): The Partial Match(es) option will display partial results. If a sentence contains some of the words entered in the search field, the sentence will be retained and every partial expression found will appear in the search results. If this option is not selected, TransCorpora Search will display only sentences containing all the words in the search field.

    o Fuzzy Search: The Fuzzy Search option will search for a word with a fuzzy factor, which means that found expressions do not have to be an exact match of the searched item. To change the fuzzy search option:

    Click the right mouse button on Fuzzy Search. An ellipsis button will appear. Click on it. You can now change the maximum number of differences allowed (between what you type in the Search field and what MultiTrans will find).

    4. Press Enter or click on the Search expression icon . The search results are displayed in the Search Results section (Figure 1). The top part of the tab displays the found expression with their respective frequency. This is where partial expressions will be listed when the Show All Derivatives filter is selected. The bottom section displays all sentences where the selected expression from the top part is found, listing information such as the source document, the sentence and word number and the TransCorpora file in which the sentence was found.

  • MultiTrans User Manual TransCorpora Module

    37

    Figure 1

    When selecting a sentence from the search results, the source and target documents will scroll to the right location. The source sentence will be highlighted (default highlight color is yellow), with the found words colored in blue. The corresponding sentence in the target document will also be highlighted.

  • 38

    Pop-up menu functions

    If you right-click in the Source or Target windows, the following pop-up menu will appear:

    The options include: Search selection: search the corpus for the currently selected expression;

    Copy: copy the currently selected block of text in the corpus;

    Cut: cut the currently selected block of text in the corpus;

    Paste: paste the currently selected block of text in the corpus;

    Insert in TermBase: insert the currently selected block of text in the expression into a TermBase;

    Insert options: opens a pop-up menu with several options to choose from:

    o Insert Only o Insert and Align o Lower case o Proper case o Upper case

  • MultiTrans User Manual TransCorpora Module

    39

    o No Change o Trim Spaces o Send to TermBase o Edit fields on Insert

    Align Sentence: align currently selected text in corpus;

    Delete Sentence: deletes selected sentence in corpus;

    Follow Sentence at Cursor: highlights sentence where cursor is currently positioned;

    Previous Sentence: goes to the preceding sentence and highlights the selection in yellow (starting at the beginning of the expression);

    Next Sentence: goes to the next sentence in the expression and highlights the selection in yellow;

    Find: searches through the corpus for selected text;

    Font size: allows you to set the font size of the selected expression

    Save TransCorpora Search: saves the current TransCorpora search.

    Searching for an expression in a source or target file You can search for a word/expression in either the source or target files.

    To Search for an expression in a source or target file

    o Press Ctrl+F, or select Find in Source or Find in Target from the context-sensitive pop-up menu (you can access this menu by right-clicking anywhere within the source or target text).

    Printing Search Results There are times when you may want to keep a hard copy of your TransCorpora search results. This is a one-button action that is quick and easy to perform:

    1. Click on the TransCorpora Search menu.

    2. Select Print Search Result from the menu.

  • 40

    Importing and Exporting Exporting Corpus Alignments to a TMX File TMX (Translation Memory eXchange) is a standard format for exchanging translation memory data. A good source of information about the TMX standard is http://www.lisa.org/tmx/. To export MultiTrans corpus data to a TMX file, first open the corpus using TransCorpora Search. Then select Save As TMX File from the TransCorpora Search menu. This will display the following dialog:

    Enter the output file name, and press Save to generate a TMX file. Optional fields are:

  • MultiTrans User Manual TransCorpora Module

    41

    o TMX language list file: This optional text file contains language codes such as EN-US (one per line), which are used to populate the available selections on the General tab.

    o Filter special chars: If this option is checked, special characters will be replaced by representative codes. For example, the letter would be replaced by &eacute. This may or may not be required, depending on the system that will be processing the exported TMX file.

    The General tab displays the language codes selected for each language. For example, if you want English to be represented as EN-US in the TMX file, you would enter or select that in the General tab.

    Exporting Corpus Alignments to a TermBase You can export all the aligned segments from a corpus to a TermBase, by selecting Save to TermBase from the TransCorpora Search menu. The TermBase must be closed for the export to succeed (if you select an open TermBase, you will receive a message indicating that the export cannot proceed).

    Exporting Corpus Alignments to a Text File There are times when you may want to examine in detail how source and target files in the open corpus are aligned. Using the Save Alignment to option allows you to do this easily and effectively. A tab-delimited file will be created for easy browsing in a program of your choice (Microsoft Excel for example). The file will contain six columns:

    Alignment: a reference number for each row, beginning at 1.

    Source sentence: a sentence in the source language.

    Target sentence: the corresponding sentence in the target language, based on the current alignment.

    Corpus: the name of the corpus.

    Source file: the name of the source file.

    Target file: the name of the target file.

    To access the Save Alignment to option

    1. In the TransCorpora Search menu, select Save Alignment to. A dialog box will open.

    2. Choose the location you want to save the file to by selecting the folder icon and then browsing to the appropriate file.

    3. When you have selected the file to save to, select Open.

  • 42

    Importing a TMX File into a Corpus A new corpus can be created by importing data from a TMX file. To do this, proceed as follows:

    1. Close any open corpora or TermBases

    2. Click on the Imports button in the TransCorpora section of the Process bar

    3. Select Import Corpus from TMX and click OK. Then enter a file name in the dialog provided and click Next to import the file.

  • MultiTrans User Manual TransCorpora Module

    43

    Other TransCorpora Functions Aligning sentences in a TransCorpora file Source and target documents do not generally have the exact same number of sentences, and this may result in misalignments. If the highlighted sentence in the target document does not match the highlighted sentence in the source document, the text is misaligned. TransCorpora Search allows you to easily correct these misalignments. The most efficient way of using this function is to align your document as you go (i.e. "on the fly"). You do not need to spend time aligning your entire document, since you cannot know if you will be using every sentence. Therefore, we suggest that you align only when performing searches from the TransCorpora Search or TransTerm modules. By aligning your documents strictly on an as-needed basis, you will avoid spending a considerable amount of time initially creating alignments that may not be needed.

    To align sentences in a TransCorpora file

    1. In the source document, click anywhere in the sentence you want to align, and type Ctlr+F. This will highlight the sentence in yellow. You may also:

    o right-click and choose Follow sentence at cursor, or o choose the Follow sentence at cursor option from the menu.

    2. If the yellow-highlighted sentence in the target document does not match the one in the source document, use the Ctrl+Shift+Down Arrow (next sentence) and Ctrl+ Shift+Up Arrow (previous sentence) key combinations to move the yellow highlight to the corresponding sentence.

    3. When you have found the corresponding sentence and highlighted it in yellow, select Align Sentence, either from the TransCorpora Search menu or from the context-sensitive menu (displayed when right-clicking with your mouse), or simply type Ctrl+A.

    4. This new alignment will affect the rest of the document; you can view all of the new alignments by clicking the Alignment sub-tab of the TransCorpora tab.

    NOTES:

    The source document always controls the target document. That is, if you place the cursor somewhere in the source document and yellow-highlight the previous sentence (Ctrl+Up Arrow), the previous sentence in the target document will also be yellow-highlighted. However, when you use the Ctrl+Shift+Down Arrow and the Ctrl+ Shift+Up Arrow key combinations, only the highlighted sentence in the target document will move.

  • 44

    Adding Expressions to a TermBase from a TransCorpora file The TransCorpora Search process allows you to quickly and easily find the translation corresponding to an expression contained in a source document. You can then add these source and target expressions to an opened TermBase.

    To add expressions to a TermBase from TransCorpora Search

    1. If multiple TermBases are opened, select the destination TermBase from the toolbar.

    2. Highlight the desired source expression with your mouse. (You may also use the Shift+Right Arrow or Ctrl+Shift+Right Arrow key combinations to select the desired expression in the source document.)

    3. Highlight the corresponding target expression using your mouse. The terms to be added to the TermBase will appear in the Selected text fields of the Source and Target sections in the main working area.

    4. If necessary, you may edit the source and target expressions in the Selected text fields.

    5. Select Insert in TermBase either from the TransCorpora Search menu or from the context menu (which shows up when you right-click your mouse), or simply type the Ctrl+Ins shortcut key combination.

    6. Repeat the previous steps for each and every expression you wish to add in the TermBase.

    Insert Options

    You can set the way expressions will be inserted in a TermBase through the Insert Options function in the TransCorpora Search menu or by right-clicking in either Source or Target section and selecting Insert Options. The following window opens:

  • MultiTrans User Manual TransCorpora Module

    45

    Insert Only (Option is checked by default): This option simply inserts the selected source and target expressions in the active TermBase.

    Insert and Align: This option allows you to insert the selected source and target expressions in the active TermBase, while aligning your source and target texts.

    Lower Case (Option is checked by default): This option will insert expressions in the TermBase in lower case only.

    Title Case: This option allows you to insert expressions in the TermBase with initial capital letters for each word in the expression.

    Upper Case: This option allows you to insert expressions in the TermBase in capital letters.

    No Change: This option allows you to insert expressions in the TermBase as they appear in your source and target documents, with uppercase and lowercase letters intact.

    Trim Spaces (Option is checked by default) This option will insert expressions in the TermBase with all extraneous spaces trimmed. De-select this option if you need to keep the extra spaces in an expression.

    Send to Termbase: (Option is checked by default)

    Edit field on Insert: This option allows you to edit field upon their insertion.

  • 46

    Switching Between TransCorpora and TermBase Windows You may want to switch between the TermBase and TransCorpora windows to work in both modules, referring back to one module from time to time.

    To switch from the TransCorpora window to the TermBase window

    o From the Window menu, select TermBase Editor, or press Ctrl + W.

    To switch from the TermBase window to the TransCorpora window

    o From the Window menu, select TransCorpora Search, or press Ctrl + W.

    Saving and Closing TransCorpora files

    To save a TransCorpora file

    o Do one of the following: o Press Ctrl + S; o From the File menu select Save TransCorpora Search.

    To close a TransCorpora file

    o From the File menu select Close TransCorpora Search.

  • MultiTrans User Manual TransCorpora Module

    47

    Reversing Language Direction When searching in a TransCorpora file, MultiTrans searches only in the source language. However, there may be times when you need to search in the target language. To do this, you must reverse the language direction in the TransCorpora file.

    To reverse the language direction

    1. Open a TransCorpora Search. Once opened, a TransCorpora Search menu will be available.

    2. Choose Language Direction. You will see Source to Target selected by default. Select Target to Source, and all open TransCorpora files will now reverse the language direction.

    3. To reverse the language direction for the TransTerm functions, first reverse the language direction of the TransCorpora file(s).

    4. In your word processor, click on the TransTerm menu and choose TransTerm Options. On the Languages & TermBases tab, select your new source language and your new target language. All of the TransTerm functions will now search for your target language.

    NOTE:

    If you reverse the language direction in the TransTerm Options before the TransCorpora files, you will get an error message that says, "Language pair does not match TransCorpora Search language pair. You can still use the selected languages, but TransCorpora Search will not work properly. Continue with selected languages?". In this case, simply cancel the changes to the TransTerm Options and reverse the language direction in the TransCorpora Search first.

  • 48

    Appending Document Sets to a Corpus Instructions for appending document sets to a corpus are given in Using TransCorpora Builder. In summary, you proceed as if you were creating a new corpus, with one exception. Instead of selecting Create New TransCorpora File, select Append to existing TransCorpora file when you get to Step 4:

  • MultiTrans User Manual TransCorpora Module

    49

    Removing Document Sets from a Corpus MultiTrans provides a simple method to delete document sets from a corpus, and automatically update the required indexes.

    To remove document sets from a corpus

    1. Start TransCorpora Search (see To Open a TransCorpora Session). The following window opens:

    2. Click on the TransCorpora File field to display the TransCorpora Search Manager window.

    3. Select a corpus and click Modify to open the following window:

  • 50

    4. Select a document set and click Delete Pairs.

    5. When you are finished, click Done.

  • MultiTrans User Manual TransCorpora Module

    51

    Client-Server Considerations Corpora Server Console The Corpora Server Console allows corpora to be added, removed or modified on the Corpus Server, without stopping and restarting the server application.

    To Access the Corpora Server Console

    1. In the Start menu, select Programs > Corpus CS.

    2. Enter the address of the Corpora Server. If no address is entered, localhost will be the default. Click Connect. You will be prompted for a user name and password. The user name and address must match what is in the registry under HKEY_LOCAL_MACHINE\SOFTWARE\MultiCorpora\Multitrans CS\Corpus\Admin. The Corpora List Path is a read-only field, matching what is in the registry.

    3. Choose Start. A message will appear, "Server successfully started, # corpora were loaded". The Corpus Management tab will list the available corpora. The following options are available:

    Refresh: refreshes the display

    Put Online: allows access to the selected corpus through the server

    Put Off-line: disallows access to the selected corpus through the server. Once it is off-line, documents can be appended to the corpus, removed from the corpus, etc.

    Add Corpus: adds a corpus to the list, making it available for use through the server. Note the full path of the corpus on the server is necessary.

    Remove Corpus: removes a corpus from the list, making it unavailable for use through the server. The corpus itself is not deleted or modified in any way.

  • 52

    Opening TransCorpora files from the Server

    To Open TransCorpora files from the Server

    1. Click TransCorpora Search.

    2. Choose the yellow folder icon beside the TransCorpora File field. 3. In the TransCorpora Search Manager, choose the Remote Corpora tab. 4. Click the Register Remote Corpus button.

    5. Click Add Server. Leave the Connection Type as DCOM unless advised otherwise. Add the name of the server and click OK. At this point, depending on your organization, you may be prompted for a user name and password, which would be supplied to you by your IT department. You will see the list of TransCorpora files available from the server.

    6. Select the TransCorpora file(s) you wish to open, and choose Register. MultiTrans will bring you back to the Remote Corpora tab.

    7. Select the TransCorpora File(s) you wish to open, and choose Open.

    8. Click Add to add the TransCorpora File(s) to the main window, then select Next. (See Searching in TransCorpora Files for more information.)

    Creating a new TransCorpora file on the Server You can create a local TransCorpora file using the TransCorpora Builder.

    To create a new TransCorpora file on the server

    1. Move the entire folder to the server.

    2. Using the Corpora Server Console, choose the Add Corpus button and type in the full path of the TransCorpora file (the .tcs file). Select this TransCorpora file, and select Put Online.

  • MultiTrans User Manual TransCorpora Module

    53

    Specifying an Ignored Words List to Improve Performance and Scalability TransCorpora is based on a full-text search engine, which includes an index of words that appear in the text. This index is kept in memory. In client-server environments, portions of the index are transferred from server to client in response to queries. Therefore, omitting words such as articles from the index improves scalability by reducing memory usage, and improves performance by reducing the amount of data sent from a client to a server. There is one ignored-words list per language, with a file name of .ixl. Files are kept in the General folder under the MultiTrans installation directory. The format is very simple: one word per line.

  • 54

    Accessing Corpora from the Web Welcome to TransCorpora Web TransCorpora Web is a web application that allows you to search through your collection of previously translated reference documents for specific expressions and their translations. TransCorpora Web enables you to access your MultiTrans corpora from anywhere you have Internet capabilities. Using the TransCorpora Web module, you can search for previous translations rapidly and efficiently using the Internet. Besides previous in-house translation projects, additional sources of translated text can also be easily added to a corpus, including published web content.

  • MultiTrans User Manual TransCorpora Module

    55

    Accessing TransCorpora Web

    Log on screen

    After clicking on the appropriate TransCorpora Web link from your web browser, the following screen opens:

    To Log on to the TransCorpora Web

    1. In the Login field, type your user name (E.g., Myname).

    2. In the Password field, type your password (it will show up as ). 3. In the Language field, select your language of choice from the drop-down list.

    4. Click Go. You are now logged in.

  • 56

    Notes:

    The Password field is case-sensitive.

    The user name and password may not be required in your installation.

    Selecting a Corpus At the next screen, you will see a list of available collections of reference documents.

    To open a TransCorpora session

    1. In the Choose Corpora field, click on the file you wish to open. To select more than one file:

    Hold down Ctrl and click the right mouse button for individual selections, or Hold down SHIFT and click the right mouse button to include several files within

    a group. 2. Click Submit when you are finished selecting the file.

  • MultiTrans User Manual TransCorpora Module

    57

    Performing a Search

    The TransCorpora Search Screen

    On the TransCorpora Search screen, you will see the first pair of reference documents. The application will search in all pairs of reference documents simultaneously for any expression you enter. When the TransCorpora Search window opens, the main working area of MultiTrans is divided vertically in two parts: a left panel and a right panel. (see Figure 1)

    Figure 1 - TransCorpora Search window

    The left panel contains two interrelated sections: Search and Search Results. The right panel also contains two interrelated sections: Source and Target..

  • 58

    Search Screen - Left section

    This section contains various work tools (search, lists, etc.) and has three tabs: Search, TransCorpora, and Terminology (see Figure 2). Only the Search tab is available for use with TransCorpora Web.

    Figure 2

    Search Tab The Search tab is displayed by default. In the Search section, you can type an expression, select the options, then hit the Search button. TransCorpora Search will then identify all sentences which contain either the exact expression, or all the searched words. The search results will be displayed in the Search Results section of the Search tab. The top part of the Search Results section displays the found expressions and their frequency, whereas the bottom part lists every target sentence. To narrow the results down, you can select the Consecutive Words checkbox, the Partial Match(es) checkbox, or Fuzzy.

  • MultiTrans User Manual TransCorpora Module

    59

    o Consecutive Words will only find the expression you are search for if the words are actually together. For example, searching for Prime Minister with Consecutive Words selected will find all occurrences of Prime Minister. Searching for this same expression without Consecutive Words selected will find all sentences containing both Prime and Minister, but not necessarily together.

    The expression you are searching for will be highlighted in yellow in the top right window (see Figure 3). The corresponding sentence will be highlighted in yellow in the bottom right window. To view other occurrences of the expression you are searching for, select from the bottom left window. Once you have found a segment you wish to reuse, just highlight the segment, right-click and use the standard copy and paste operations to paste it into your current document.

    o Partial Match(es) will find any portion of the expression you are searching for, as long as two or more words are found in a single sentence. This option can be combined with Consecutive Words to find any portion of the expression you are searching for, as long as two or more consecutive words are found in a single sentence.

    o Fuzzy will find variations of what you are searching for. Using the example of searching for member of parliament, we will also find members of parliament, member of parliaments, as well as various other permutations. This option can be combined with any or both of the other two options to broaden the search.

  • 60

    Search Screen - Right section

    The right section contains two interrelated sub-sections: Source and Target. This is where your source and target documents will be displayed. The expression you are searching for will be highlighted in yellow in the top window the corresponding sentence will be highlighted in yellow in the bottom window.

    Figure 3 - Source and Target window

  • MultiTrans User Manual TransCorpora Module

    61

    To perform a search

    1. Type the expression in the Search field.

    2. Select the desired Search Options - Consecutive Words, Partial Words, or Fuzzy:

    3. Click Search. The search results are displayed in the Search Results section (see Figure 4). The top part of the tab displays the found expression with their respective frequency. The bottom section displays all sentences where the selected expression from the top part is found, listing information such as the source document, the sentence and word number and the TransCorpora file in which the sentence was found.

    Figure 4 - Search Results

  • 63

    TermBase Module Introducing the TermBase Module - Terminology management

    Large corporations face the challenge of maintaining a consistent corporate language in all of their communications, whether monolingual or multilingual. When different sectors of an organization work on communications products, inconsistencies can multiply rapidly. The use of acceptable terms is also critical to communicating a corporate message. MultiTrans supports the terminology management process with a scalable database platform for storing and tracking comprehensive terminology management information, as well as a set of integrated capabilities to easily capture new terminology, automatically extract terminology candidates and their corresponding translations from legacy documents, provide tools and resources to support terminology research activities and manage the approval status and lifecycle of terminology. Terminology extraction capabilities are included in the MultiTrans TransCorpora Builder module, and supported by the TermBase module. The TermBase module allows you to store and manage monolingual and multilingual terminology in flexible databases. These databases (called "TermBases") can store multiple translations for each expression. You can build versatile and easy-to-use multilingual terminology data bases manually or interactively with TransCorpora Search (using previously translated reference documents), and/or TransTerm. Moreover, TermBases are fully customizable: you can add unlimited fields from a variety of types. You may open a TermBase in stand-alone mode (using TermBase Manager), for manual operations, or in conjunction with the TransCorpora or TransTerm modules, for interactive use.

  • 64

    TermBase Editor Screen Display

    The TermBase Editor allows you to view, manage, and maintain your TermBase files, either in stand-alone mode (through TermBase Manager) or interactively, when using the TransCorpora and/or Transterm modules. You can change existing expressions and their equivalents, add new expressions and equivalents, and perform searches. When a TermBase is opened, the main working area of MultiTrans displays the TermBase Editor:

    Left (Expressions) Panel of the TermBase Editor Window

  • MultiTrans User Manual TermBase Module

    65

    The Left panel of the TermBase Editor displays the List of expressions of the active TermBase file. Right (Details) Panel of the TermBase Editor Window The Right panel of the TermBase Editor displays details for the selected expression and its equivalents (Details tab), and is also used to access a search tool (Search tab).

    Tab Function

    Details displays all the related information about each expression contained in the active TermBase.

    Search searches through folders to find specific files

    Expression Sub-section

    This sub-section displays information related to the selected expression, and allows you to add, modify or delete an expression or a word in the TermBase. You can also add new fields by selecting Field Definitions in the TermBase menu.

    To delete an expression, select it from the Expression List, and click the Delete link to the right of it (see Deleting an Expression or a Translation).

    To add a new expression, click the New link (see Adding an Expression to a TermBase).

    To modify/rename an expression, select it from the Expression List and click the Modify link to the right of it (see Modifying/Renaming an Expression ).

    To add a new translation associated to the active expression, click the New link (see Adding a Translation to a TermBase).

    To modify the value of a field, simply click on its name to display the appropriate editing dialog box. Some fields may not be modifiable (for instance, system fields such as Created Date, Created By, etc.).

    Translation Section

    This section displays any translation(s) associated to an expression, and allows you to add, modify or delete a translation. You can add new fields associated to a translation by selecting the Field DefinitionsTermBase_NewField option from the TermBase menu.

    To modify a translation, click the Modify link to the right of it (see Modifying a translation).

  • 66

    To delete a translation, click the Delete link to the right of it (see Deleting an Expression or a Translation).

    To view the translation in the Expression sub-section, and have its position displayed in the Expression List, click the View link.

    The Expression List Section

    The Expression List displays all the expressions in the dictionary for the selected language and TermBase. The list also displays all the information for all the fields associated with each expression.

    TIP:

    You can scroll through the list with the scroll bar, with the keyboard arrows, or by keying in the first letter of