Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
S DL Government (SDLGov) offers a suite of
products that allow users to train, retain, save,
and reuse translations to gain efficiencies and
offer more effective data transfer for actionable
decisions. Our integrated tools combine translation
management, translator productivity, machine language
learning, and translation project workflow into an
enterprise translation environment that effectively and
efficiently supports and satisfies multiple mission
requirements.
With nearly 80% of communications across the Internet
in languages other than English, it is imperative that we
not only collect the data, but that we have the ability to
rapidly understand and interpret that data. “Big Data” is
not just big, it's fast, and it's messy and unstructured in a
socially organic way. SDL Government’s translation
technologies offer the most comprehensive and
integrated approach to delivering turnkey solutions for
mitigating translation gaps.
We bring more than 25 years of translation technology
expertise to our Government customers; our products
are used by 9 of the top 10 global companies; and more
than 80% of worldwide high-tech/software companies
rely on our products for their translation needs.
SDLGov is an industry leader in the advancement of
machine translation in the areas of “Adaptive and Neural
Machine Translation”, social media (i.e. informal)
translation, and language identification. Our innovative
technology is about creating adaptable translation
engines, and our next MT release has incorporated this
capability. At SDLGov, our real-time Language Learning
machine translation engine continues to evolve after it
has been trained.
The rapid proliferation of Social Media throughout the
world has provided new avenues for populations to
communicate and organize.
SDLGov is a pioneer in the translation of informal and
social media languages. We developed the first-ever
Arabizi (informal language that creates Arabic words on
Western keyboards) translation engine to make sense of
the Arab Spring. We offer an integrated translation
technology solution that provides a way to automate and
scale many of the complex tasks needed for processing
informal language sources, applying multiple
methodologies, and performing translations across an
enterprise.
Our translation tools include Trados Studio 2017 – the
newest version of the global leading software application
in translator desktop productivity; and WorldServer 11 –
a powerful translation management and workflow
application sized for any US government or contractor
enterprise environment.
“ We bring more than 25 years of translation technology expertise. ”
About SDL Government
2
SDLGov can implement centralized translation memories
and terminology databases so that existing content can
be used and shared across an organization in a deliberate
and managed way. Our solutions accept content from a
wide variety of sources - from internet websites,
Facebook pages, and Twitter feeds, to audio and imagery
files processed via Automated Speech Recognition (ASR),
to scanned documents using Optical Character
Recognition (OCR).
By automating tasks and reusing already translated text,
SDL Government can dramatically reduce the time to
translate, reduce inaccuracies, improve the quality and
consistency of translations, and reduce the overall cost of
translating new, emerging content at a broader
community level. Our solutions help analysts and
translators to become more efficient and productive in
finding the most relevant bits of information in any
language. Our integrated solutions provide a mechanism
to move and analyze Big Data quickly to create the
actionable intelligence needed by mission operators.*
Contact Us 703.665.6731
[email protected] www.sdlgov.com
3
T he rapid proliferation of Social Media,
throughout the world, has provided new
avenues for populations to communicate and
organize. Informal languages and
communication has been around since the dawning of
man’s ability to speak, however, with this globalization
and interconnectivity, informal languages have spread
and become a standard for communications; not simply
how friends talk with one another. Informal languages
take many forms; they can be abbreviations due to
character limitations on Twitter, or having to Romanize
non Latin words with cell phone keyboards. These
languages are constantly evolving and growing.
Our mission at SDLGov is to enable analysts and
operators to understand what is being said, who is saying
it, and make timely and accurate decisions based on
what they find. SDLGov is a pioneer in the translation of
informal and social media languages. This advanced
translation capability enables analysts to more
thoroughly explore and exploit the internet, from the
dark web to the deep web, by providing clarity to what is
being said and by whom.
Experience in Multi-INT Fusion The vast majority of data collected
throughout the IC and DOD is in a
foreign language. This data can
include complex metadata; GEO
Tags, names of people and places, to
terabytes of SIGINT data from voice
and other communications. The
volume of data collected is not only
massive but also contains details
that are critical to have accurately
and quickly interpreted.
Reliable Translation The SDL Government’s tool suite allows for a
standardized translation of names, places, terms, and
other miscellaneous data; ensuring the reliable end
results every time.
• It enables crypto-linguists to process data, more
quickly, more efficiently, and more accurately
through the integration of advanced speech-
recognition technology and machine translation
allowing for faster and more accurate targeting and
exploitation.
• Analysts focused on Big Data can quickly and
accurately translate informal languages and translate
meta-data from a myriad of sources in numerous
languages to quickly determine what is relevant and
what is not.
• SDL Governments’ fusion of multiple, interacting
technologies provides an effective solution that
integrates within existing infrastructures and
accelerates the intelligence collection and analysis
processes.
Intelligence Capabilities
4
Technical Targeting The F3EAD (Find, Fix, Finish, Exploit, Analyze,
Disseminate) process has greatly evolved over the last
decade of operational employment. The volume of
information available to targeting professionals is
staggering and perpetually growing in nature. The
opportunity to overwhelm operators with unfiltered data
is clear and evident.
With most source data in foreign languages and the
reality of a dramatic shortage of cleared professionals
with the required language skills, the challenge to
delivering accurate, targeted, and useful data to
operators becomes limited.
The use of SDL Governments machine translation
technology enables operators on the ground as well as
Targeting analysts to get an instantaneous “gist” of the
information gathered. This enables the capability to
triage data of interest from background noise.
The Weapons of Mass Destruction, and Conventional
Weapons domains are inherently technical in nature.
Conventional machine translation engines are not well
equipped to deal with highly technical foreign language
data.
The SDL Government suite of products are fully
customizable and trainable that can be directly tailored
to any highly specific technical data set. Our Enterprise
Translation Server-G custom language pairs, developed
upon request or generated with our latest offering of
Language Learning, translate massive amounts of foreign
language source data.
Complex data sets and technical manuals require special
tools to properly process, analyze, and translate. Trados
Studio-G (TS-G) is the number one Computer Assisted
Translator productivity tool in the world with more than
300, 000 licenses in use worldwide. TS-G enables linguists
to build and share a translation memory of all their
translated data, greatly reducing overall translation time
and cost, while greatly increasing accuracy.
With SDL Governments real-time Language Learning, our
translation engines continue to evolve after they have
been trained. SDL Government Trados Studio-G also
enables linguists to create and share term-bases with all
of the relevant terminology for their field. This ensures
that technical terms are translated right the first time,
every time. Our innovative technology creates adaptable
translation engines that receive and process user
feedback and learns from it in real time. With Language
Learning, the user provided feedback or correction goes
back to the engine, is processed in real time, and
influences subsequent translations.*
“ The Weapons of Mass Destruction, and Conventional Weapons domains are inherently technical in nature. ”
5
A gencies are dealing with dozens of languages
and cultural nuances compounding multi-
lingual data challenges. Professional
translators, are in short supply and can only
translate possibly two to three thousand words per day,
per Common Sense Advisory. In a data-driven world,
government organizations potentially need to translate
millions of words per day from a variety of sources.
SDL Government’s Enterprise Translation Server-G (ETS-
G) is built on leading statistical machine translation
technology and its advanced research and development
activities.
Our ETS-G is implemented in multi-level secure networks,
and is accessed via a user-interface for quick translation,
multi-lingual, social media browsing, or via our REST API
for integration with other applications.
SDL Government’s MT Benefits • Secure On-Premise Translation
SDLGov’s Statistical Machine Translation solution installed
on-site enables large volumes of information to be
translated securely at high speed and high accuracy while
being cost-effective.
• Maximized Translation Resources
Government Agencies improve translation resources
effectiveness by translating all information automatically
and escalating only necessary work to human translators.
• Domain Customization
SDLGov MT supports user training for terminology that is
specific to certain missions or applications to deliver
improved translation accuracy.
• Industry Leader in MT Innovation and Advancement
More than 25 years of translation technology expertise.
Language Learning SDL Government’s ETS-G technology offers trainable
engines that can be directly tailored for the unique
domain of data that the government needs to handle.
These engines can be developed upon request, or with
our latest offering of adaptive Language Learning, or be
customized by the user in real time.
We are an industry leader in the advancement of MT in
the areas of “Adaptive and Neural Machine Translation”,
social media (i.e. informal) translation, and language
identification.
“ More than 80% of worldwide high-tech / software companies rely on our products for their translation needs. ”
Enterprise Translation Server-G
6
Our innovative technology is about creating adaptable
translation engines, and our next MT release has
incorporated this capability.
Translation Workflows SDL Government provides Machine Language Translation
support for hundreds of language pairs. Our MT engines
can be further trained on an organization’s specific
vocabulary and terminology to produce a customized
solution. As part of the overall translation infrastructure,
MT engines can also be coupled with human post-editing
to further improve translation quality and reduce time-to
-decision for translated content.
SDLGov’s MT allows for prioritization and maximal use of
translation resources. By providing the first round of
translation, our MT empowers human translators to work
only on more important translations – those that require
human interpretation and analysis.
By using existing translated content, organizations can
also train the SDLGov MT statistical machine translation
technology to produce more fluent, meaningful
translations in important subject areas and domains.
We bring years of translation technology expertise to our
Government customers; our products are used by 9 of
the top 10 global companies; and more than 80% of
worldwide high-tech / software companies rely on our
products for their translation needs. As an example, in a
completely automated environment, our machine
translation technology is translating almost 60 Billion
words annually for a major Internet travel site – across
13 different languages, 24-hours per day/365 days per
year.*
Our machine translation capabilities cover more than 95% of the World’s National Languages.
7
Trados Studio-G
A ccelerate productivity as you empower your
operations to manage translations to the
highest standards. SDL Government Trados
Studio-G offers a complete environment for
language professionals to edit or review translations,
manage translation projects, organize corporate
terminology and connect to machine translation.
Used by more than 300,000 translators worldwide, when
your Agency or organization chooses Trados Studio-G,
you are joining the largest translation user community in
the world.
Everything You Need in One Place • Innovative Features
That increase translators’ productivity such as predictive
typing and quality checking; advanced tools for
cooperating more easily with reviewers; automated
wizards and reports for efficient project management.
• Easy to Learn, Simple to Use
With a customizable workspace to suit every need.
• Opens All Common File Types
Anything from Microsoft Word to sophisticated formats
like Adobe InDesign – so you can work with any project.
• Powerful Tools
To fully leverage previous translation projects.
• Part of an Extensive Solution
Which includes sharing technology and machine
translation.
Details that Make a Difference AutoSuggest for Asian Languages
Now with Studio 2017 AutoSuggest has been enhanced
to support Chinese, Korean and Japanese. You can now
create AutoSuggest dictionaries with these languages
and enjoy better-than-ever productivity gains by typing
less and translating more.
Easier Document Navigation
Navigate through documents even more easily with
custom display filters in Studio 2017. Users can apply
multiple filters and save custom filters for future use so
to easily focus on priorities.
Merge Segments Over Hard Returns
Studio 2017 allows you to merge segments even over
hard returns so you can ensure the high quality of your
translation units and optimize your translation process.
Better File Type Support
Studio 2017 brings an even further enhanced Microsoft
Word filter for better handling of Word files. In addition,
we’ve included support for XLIFF 2.0 as well as 1.2, so
you can work on files that have been created using the
latest version of the XLIFF standard. Users can preview a
custom file filter directly from the File Types window – a
major time saver when preparing and testing custom or
in-built file types.
“ Used by more than 300,000 translators worldwide. ”
8
Instantly Create New Translation Memory
If your organization has been working without using a
translation memory before, you can create a new one by
bringing previously translated documents into Trados
Studio-G. The powerful alignment technology
automatically brings both original and translated content
together. Creating a new translation memory is simple.
Upload the original and corresponding file and Studio will
automatically align them to create a Translation Memory
for your new project.
Smarter Translation Memory
Translation Memories (TMs) are an organizations
greatest assets and getting the maximum possible from
them matters. Studio 2017’s transformative new upLIFT
Fragment Recall technology will automatically give you
high-quality fragment matches so that you can translate
new segments faster, even if you don’t have a full TM
match. Enjoy simpler TM leverage, save time and effort
and be confident that you’re getting high-quality or in-
built matches in both fuzzy and no match scenarios.
Repair “fuzzy” Matches
With increasingly short deadlines, being able to do as
much as possible in as little time is important. Reduce
translation time and cost with upLIFT Fuzzy Repair which
cleverly uses your trusted resources to repair fuzzy
matches so you save even more time when editing and
get the best possible fuzzy match.
Refined User Experience
We continue to make Studio easier to use and with
Studio 2017 you can quickly get started by dragging and
dropping files to create a new project. Project setup is
further enhanced with our ‘Recently used languages’
feature, or the possibility of reversing the language
direction, saving time when setting language pairs. Users
can get started with a simple right click in Windows
Explorer and enjoy working with Studio on high DPI 4K
screens, providing an all-round improved user
experience.*
SDL Government’s Professional Training and Certification Program
The SDL Government’s Professional Training and Certification Program is the
industry’s premier technology based program which provides a recognized
standard of excellence in SDL Government’s software knowledge. It is a
comprehensive professional education program offered to translators and
project managers and is designed to develop and validate expertise in the
use of SDL Government’s translation technology tools.
Contact us at [email protected]
Learn more at www.sdlgov.com/trados-certification
9
WorldServer-G
U S Government Agencies want to manage,
share and collaborate on all source content
from multiple channels, and translate,
quickly and reliably across multiple mission
critical areas.
WorldServer-G (WS-G) is the leading translation
management system in the industry. More than one
hundred of the top global companies rely on WS-G to
optimize their translation programs.
WS-G simplifies and accelerates the translation process
for any content – from social media to websites to
documents. It is designed to centrally manage, automate
and control numerous translation projects across
multiple language service providers to deliver quality
translations rapidly and cost effectively.
By automating many costly and time-consuming
activities, WS-G eliminates delays in translated content
availability, improves translation quality, and reduces
costs. WS-G is built for managers and their teams to
centrally manage, automate and control translation
projects to deliver quality translations on time and within
budget.
Key Benefits • Quality Translations for End Users
SDL Government WorldServer-G ensures every form of
content has the highest quality translation, and all
translations are sent through predefined workflows and
review cycles.
• Greatly Reduced Cost
SDL Government WorldServer-G automatically identifies
what content has changed and what has previously
approved translations. The cost of translations can be
reduced by as much as 40% and the overall cost of the
translation program can be reduced by as much as 80%.
• Decreased Administrative Burden
Processes that formerly required dozens of tasks are
reduced to a few human steps with SDL Government
WorldServer-G managing the routing, storage and
communications between program participants.
• Ability to Scale
Best practices and workflows are enforced by SDL
Government WorldServer-G’s scalable workflow function.
The proven process can be extended to a new language or
content type with little incremental effort.
“ Simplify, automate and accelerate the translation of any content from multiple languages through effective translation management.. ”
10
Capabilities
Translation Memory
WS-G features Translation Memory (TM), a database of
all previously translated content. Any new request for
document translation is passed through the TM database
to segment content and identify that which already has
approved versions in the target language. For content
where there is no exact translation, TM will offer similar
content to make the translator’s job faster and easier.
This will ensure consistency of translations across
organizations, agencies and document versions, while
reducing the volume and costs of translation services.
Terminology Management
Every organization has its own unique vocabulary,
terminology, and usage standards. Slang is constantly
evolving and the use of code words that have dual
meanings can make it extremely difficult for standard
translation systems to convey the true meaning of what
is being said. WS-G enables the creation or importation
of custom dictionaries that allow the user to customize
and standardize how a given word is translated from a
source to target text. Custom terminology databases, or
“termbases”, of user-defined terminology consisting of
words and or phrases can be created, saved and shared
between users to ensure that key terms are translated
how you want them to be translated, every time,
consistently and accurately throughout an entire
enterprise.
Workflow customization
Core to WorldServer-G is a workflow-based process
automation engine that drives all translation project
activities. You can easily configure the workflow so that,
for example the process for translating social media
material is different from that for translating very
technical documentation. Personnel can be designated as
the approved translators or reviewers to ensure subject
matter experts control the final quality of content. In
addition, comments, questions and notifications of
project related issues can be attached to any content to
improve the quality as it moves through the translation
process.*
11
T he term innovation is bandied about frequently.
Every day we are bombarded with “innovative new
approaches” and “technological innovations to …”
At SDL Government, we believe that true innovation
is the “introduction of something new or a new idea, method
or product” – and provides a real benefit to our customers.
We bring 25 years of translation technology expertise –
exceeding initial handcrafted translation knowledge, through
statistical learning, and bridging into contextually adaptive
machine translation offerings. We are leading the way for a
transformational wave of machine translation capabilities with
our Neural Network based language models and Adaptive
Learning Machine Translation engines.
SDL Government, however, does not stop there. We
understand the critical importance of technology integration
and cross-functional performance, and we focus on providing
flawless support to the execution of our customer’s missions.
SDL Government has partnered with leading global technology
providers such as IBM, Nuance Technology, and Carnegie
Mellon University to integrate machine translation technology
into Automated Speech Recognition (ASR) tools, Optical
Character Recognition (OCR) solutions, and multiple content
analytics tools and front-end ingestion engines. These
integrations enable analysts and operators to understand
what is being said, who is saying it, and make timely and
accurate decisions based on what they find.
Our solutions help analysts and translators to become more
efficient and productive in finding the most relevant bits of
information in any language. Our integrated solutions provide
a mechanism to analyze Big Data quickly to create the
actionable intelligence needed by mission operators.
The following pages highlight some of the SDL Government
Innovations that we have developed over the past 9 months.
12
13
I n the world of deployed military, accurate
communications for ground troops can mean the
difference between life and death. If a squad of
soldiers on the ground is searching for an insurgent,
they need to be able to talk with the local populace and
make informed decisions on the fly. Trusted wartime
translators can be costly to train and there are simply not
enough to go around. What can the squad do if none
speak the indigenous language and a human translator is
not available? They must turn to machine translation
(MT); the MT’s job can be more complicated in regions
where numerous dialects exist in a small area.
SDL Government’s innovations in the field of informal
dialect MT technology coupled with our integration
efforts with Automatic Speech Recognition (ASR) and
Optical Character Recognition (OCR) technology will
provide unprecedented access to high quality deployable
translation systems. Our current solution can run on a
tablet or ruggedized laptop allowing a deployed ground
troop to carry it into the battlefield with minimal burden.
The solution contains full statistical model language pairs
for high quality translations that do not limit the user to
just a limited set of phrases or interactions.
SDL Government understands the mission needs of
deployed military personnel. We understand that size,
weight, and power are crucial to success, but also that
those requirements need to be tempered with the
realization that if the technology does not perform its job
correctly, it doesn’t matter how small the device is. It is
SDL Government’s mission to advance MT technology to
allow unprecedented levels of communication for
deployed troops by combining Neural Network MT
(NNMT), ASR, and OCR technology with the latest
advancements in mobile hardware. Commercial research
and development in NNMT at SDL Government is paving
the way to make devices that will help our troops in
make the mission critical decisions they need to achieve
mission success.
In the field, people do not speak perfect English, Arabic,
Russian, Pashto, or any other formal language. They
speak in southern accents, regional dialects, and utilize
slang. SDL Government’s partnerships with the US
Government coupled with our commercial developments
allow us to understand over 100 language combinations
including 11 Arabic dialects as well as informal Russian,
Turkish, French and Farsi to name a few. With our
portfolio continuously growing, we help the warfighter
deploy the correct language translation technology to
support their missions overseas. *
“ Our current solution can run on a tablet or ruggedized laptop allowing a deployed ground troop to carry it into the battlefield. ”
More Than Just Machine Translation Deployed Translation Capabilities
14
O ptical Character Recognition (OCR)
technology has been around for decades. At
its most basic definition, OCR technology
digitally renders text from images.
Providers of OCR technology extract text directly from
pictures, scanned documents or other formats by means
of a series of algorithms and models. OCR technology is
widely used in virtually every segment of the population
– in both public and private sectors. Healthcare, Finance,
Military and Government are all heavy users of OCR
technology.
Providers of OCR applications are many and varied with
language and domain specialization being key
differentiators in today’s market place. The ability to
design, build and deploy a customizable, configurable
and integrated platform combining OCR with machine
translation (MT) that is OCR provider-agnostic, is highly
sought after by government and corporations alike. By
being provider-agnostic, SDLGov can deliver an
integrated platform that will automatically select the
right OCR technology based on the language
requirements and specific use case or from a set of best-
of-breed offerings.
SDL Government (SDLGov) has integrated several OCR
applications into its MT platform to deliver a seamless
user experience for US Government customers. Built on
SDLGov’s market leading MT technology, we have
created a robust platform that can integrate with a
variety of OCR engines/models to provide users with
quality transcriptions and translations in a fraction of the
time it would take a human transcriber/translator
unaided by technology.
There are several use cases that are enhanced, and
therefore made more productive, by integrating OCR
with MT, such as:
• Enabling Law Enforcement with Forensic analysis of
captured documents, cross-border operations, legal
proceedings, criminal investigations and field
operations support.
• Enabling Intelligence analysts more effectively access
and analyze multilingual OSINT, and support
operational HUMINT.
• Using translation technology to support Base/Border
security, force protection, operational kinetic
support, sensitive site exploitation (SSE), forensic
analysis.
There is a growth of foreign language content that is
drowning our analysts and operators. Finding the critical
“needle in the haystack” is made infinitely more complex
when the volume, velocity, and variety of this foreign
language content keeps increasing. SDLGov can
successfully enable US Government agencies to optimally
ingest, process, analyze, and disseminate crucial foreign
language content and the resulting intelligence, with the
right technology. *
Integrated Optical Character Recognition
15
S DL Government (SDLGov) is enabling our
customers to effectively manage and
understand an ever-expanding ocean of
multilingual content. Information such as news
stories, blogs, and webpages are now being effectively
translated using SDLGov machine translation (MT)
solutions. But the real growth of online content is now in
multimedia - audio, video, podcasts, images, and
animation. To address the challenges managing and
translating this multilingual content, we have developed
innovative solutions that enable users to gist and
understand significant quantities of multilingual data
types.
By integrating Automated Speech Recognition (ASR) and
Optical Character Recognition (OCR) technologies with
our industry-leading translation offerings, we have
created a unified platform that will ingest broadcast
media, social media, video, and audio, and other content
types – and output translated and understandable
content.
We have modelled our MT engines to handle content
from social media sources such as Twitter, Instagram,
YouTube and others. These sources are often the first
place that information about an emerging crisis or event
is found. Statistics show that only 28% of the more than 3
Billion Internet users speak English; 72% of Internet
traffic comes from non-English speaking countries; and
only 34% of Social Media messages are in English.
SDLGov translation technology solutions give analysts
real time access to information about real time events –
in the venues where information moves at lightning
speed.
The sheer volume of content is staggering. In 2016, every
minute of every day saw:
• 3.5 Million Text Messages Sent
• Twitter Users Send 350,000 Tweets
• 6.9 Million Users Watch Snapchat Videos
• YouTube Sharing 400 Hours of New Video
• 2.4 Million Instagram Users Responding to Posts
With our MT platform, we enable language agnostic
understanding of video, audio, imagery and other
content – to derive actionable intelligence. Through our
APIs we integrate quickly and effectively with analytics
tools to create solutions that address very specific
problems that then provide an unparalleled capability to
extract meaningful and useful intelligence from a wide
variety of multimedia sources in multiple languages. *
More Than Just Machine Translation Multilingual Multimedia Translation
“ 72% of Internet traffic comes from non-English speaking countries. ”
16
T he proliferation of Intelligence, Surveillance,
and Reconnaissance (ISR) platforms across the
Services has brought with it a new challenge of
how to quickly and effectively we can Process,
Exploit, and Disseminate (PED) multilingual data, in a
form that provides the warfighter or decision maker the
greatest clarity and understanding. ISR platforms,
whether ground-based, airborne, at sea, or space-based,
collect data in a myriad of formats including text, audio,
signals, imagery, and video. These diverse data streams
that come in many languages and dialects present a
challenge to analysts and
operators who need to quickly
understand this content so
they can identify and respond
to threats. With the volume of
collection ever increasing, it is
imperative that innovative
solutions be implemented to
augment existing operational
capabilities. On today’s
battlefield, the warfighter
faces the complex issue of an increasing Volume,
Velocity, and Variety of foreign language sources.
At SDL Government, we have proven experience in
delivering language technology to the warfighter by
enabling their understanding of Big Data in any language.
We have worked to develop a robust platform that
integrates cutting edge Machine Translation (MT),
Automated Speech Recognition (ASR), and Optical
Character Recognition (OCR) capabilities to maximize the
ability of the operator to process video, audio, and
imagery that contains foreign language, render it
understandable, and act on it in a timely manner. We
have deployed this technology on a variety of ISR
platforms, PED platforms, and operational ‘kits’ in use
across the Services, COCOMs as well as the Intelligence
community and Federal Law Enforcement community.
The importance of language technology becomes clear
when examining the human capacity to translate foreign
language data. The average human translator can
translate approximately 2500-3000 words per day, which
is the equivalent of about 5-6 pages. SDLGov’s Machine
Translation engine can translate an average of 4,000
words per minute but is scalable to almost unlimited
potential. This processing, can be done onboard the
sensor platform or at the PED site allowing the warfighter
to focus on the most relevant data streams while gisting
out all the others.
The benefits of integrating MT and Natural Language
Processing tools into ISR platforms includes: increased
force protection for ground personnel through enhanced
situational awareness capabilities, faster response times
for airborne targeting systems that are processing foreign
language audio and text data, and the maximization of
human capacity to process data while using integrated
analytical platforms. As the volume and velocity of data
that we capture grows, SDL Government is ready to meet
mission requirements with cutting edge Machine
Translation, that is integrated with other tools such as
ASR and OCR, enabling our customer’s mission success. *
Automated ISR Data Streams Processing
17
W ith more than 70% of communications
across the Internet in languages other
than English, it is imperative that we
not only be able to collect this content,
but that we have the ability to rapidly understand this
content as well. "Big Data" is not just big; it's fast, it's
messy, it’s unstructured in a socially organic way – and it
comes in a variety of languages and dialects. SDL
Government’s translation technologies offer the most
comprehensive and integrated approach to delivering
turnkey solutions for enabling the understanding of Big
Language Data.
Social media is and extremely rich and fluid source of
intelligence. Social media analytics has been used for
situational awareness during political events,
understanding population sentiment during social
uprisings, used to dissect the causes of regional conflicts,
and enable the surveillance of individuals and
organizations seeking to cause harm and damage. To gain
insights into potential threats, support forensic analysis
after a terrorist attack, or understand the social network
following ‘bad actors’ - it is paramount that this
information is not only analyzed – but also translated
effectively and in a timely manner so it is understood.
With approximately 72% of online traffic coming from
non-English speaking countries. Using SDL Government’s
machine translation (MT) technology, coupled with data
analytics and sentiment analysis (potentially in the native
language) – users can gain insight into real world events
in near real time – and understand what is being
communicated.
SDL Government (SDLGov) offers a scalable Machine
Translation Enterprise (MTE) platform which can be
integrated with 3rd party tools to perform social media
data analysis, such as entity extraction. Social media will
continue to increase in terms of content types, languages
used, dialects used, and numbers of individuals
communicating on a global level. SDLGov’s MT platform
provides the means of understanding the Big Language
data – and enable users to extract meaningful and
actionable information. *
More Than Just Machine Translation Social Media Analytics
“ Social media is and extremely rich and fluid source of intelligence. ”
18
T he quality of automated speech recognition
(ASR) technology has increased exponentially
over the past two decades. From the humble
beginnings of Nuance’s Dragon Naturally
Speaking in 1997 to sophisticated and complex acoustical
models developed by IBM for the Watson Group, to
massive crowd-sourced efforts at Google Translate, to
Amazon’s Alexa project, the state of the art is moving
rapidly. Faster computing processors, coupled with
exploding levels of parallelization, now allows for the
development of new acoustical language models that are
driving a higher level of speech integration into our
personal lives and our corporate cultures.
SDL Government (SDLGov) is at the commercial forefront
in integrating ASR with high quality machine translation
(MT) technology for a seamless user experience in the US
Government space. Currently built on SDLGov’s robust
MT platform, we have created a ubiquitous platform that
can integrate with a variety of ASR models via ReST APIs
to provide users with quality transcriptions and
translations in a fraction of the time it would take a
transcriber/translator to manually transcribe a foreign
language audio file and translate that transcription into
English.
There are many relevant use cases and mission sets that
are enhanced in productivity and effectiveness with our
integrated platform:
• Federal and State Law Enforcement
TITLE III intercepts/monitoring, custodial interviews,
interrogations, forensic analysis, cross-border operations,
monitored communications, legal proceedings,
investigations and operations.
• Intelligence Community
Batch translation and Analysis of large volumes of audio
files, ingestion of streaming content, web monitoring,
OSINT, SIGINT, operational HUMINT support, surveillance/
counter-surveillance.
• US Military
LLVI, ISR missions over foreign areas, border security, force
protection, operational kinetic support.
There is a growing problem of foreign language data that
is drowning our analysts and operators. Finding that
critical “needle in the haystack” is made infinitely more
complex when volume, velocity and variety of the foreign
language data keeps increasing. Technology must be
introduced effectively to gist, prioritize, analyze and
collaborate within this vast amount of content. SDLGov’s
mission is to enable our US Government community to
more effectively digest, process, analyze and understand
crucial foreign language data. *
Integrated Automated Speech Recognition
19