Upload
others
View
13
Download
0
Embed Size (px)
Citation preview
Simultaneous Machine Interpretation –Utopia?
Alex Waibel and the InterACT TeamCarnegie Mellon University
Karlsruhe Institute of TechnologyMobile Technologies, LLC
The Language Challenge
• Dilemma:– Living in the Global Village
• Globalization, Global Markets• Increased Exchange and Communication• European/International Integration
– Cultural Diversity:• Beauty, Identity, Language, Culture, Customs• Pride and Individualism• Language Ability
– Challenge:• Providing Access to Global Markets and OpportunitiesMaintaining Cultural Diversity/Individuality
“Everyone Speaks English”… ???
In Europe:
5
The Magnitude of the Problem
• Today Almost all Translation Done by Human Effort (>99%)• ~ 500,000 translators worldwide. ~150,000 in Europe• ~ $31 Billion dollar market• European Union: 1.3 B€ Spent on Translation/Interpretation
– 506 Language Directions to Translate– Current Effort Insufficient to Keep up with Needs of 27 Member States
• Worldwide 6000 Languages– 36,000,000 language directions !!!
• Actual translation work is currently only about 10% of translatable text.
• Translation needs are growing 25% -35% per year• …. And that’s just for Text…..
Interpretation of Speech• Conferences
– Estimated 300,000 conference per Year in Europe– Compared to Needs Few Professional Interpreters– 1% or Less are Interpreted
• Internet– On You-Tube, Every Minute 13 Hours of New Videos
• Television– Satellite, Cable: Virtually Unlimited Channels
• Lectures– Government, Universities, Corporations
• Meetings• Telephone Conversations• Travel Dialogs/Encounters
Can Technology Provide a Solution??
Technology
To Build a Speech Translator for a New Language– 6 Component-Engines: Automatic Speech Recognition, Machine
Translation, and Text-to-Speech Synthesis– Each is in Principle Language Independent,
but Requires Language Dependent Models– Models are Automatically Trained but Require Large Corpora– Certain Language Dependent Peculiarities Exist
Statistical Translation Approach
• Translation and Speech SystemsLearn Automatically
• Statistics Trained over Lots of Data• Uses Parallel or Speech Data• Data Volume is Growing on Internet
Speech Translation
Progression of Technologies:– Domain Limited, Clear Speaking Style (late 80’s-91)
• Janus (first European&US speech-to-speech system)• ATT, NEC, ATR
– Domain Limited, Spontaneous (‘91-’00)• Janus II/III (work on 20 languages),
Verbmobil, Nespole, Enthusiast,C-STAR, ATR, ETRI, NLPR,…
– Mobile Consecutive Interpretation• Transtac, Babylon, Phraselator, Jibbigo, U-STAR
– Domain Unlimited Simultaneous Interpretation• Parliamentary Speeches (TC-STAR)• Broadcast News (GALE)• Lectures, Seminars (InterACT, STAR-DUST, TC-STAR)
Mobile Consecutive InterpretationTechnologies for Cross-Lingual Dialog
Humanitarian Needs• How it is Done Now:
– Human Interpreters– Charts, Dictionaries
• Limitations/Problems:– Limited Supply!!– Fidelity/Trust/Security– Number of Languages
Jibbigo:
– Real-Time Translation
– 15++ languages
– 40,000 Words
– All on the Phone
– No Server Necessary!
Jibbigo on Apple Commercials
Jibbigo Systems
• iTunes & Android App Stores:– English, Spanish, French, German,
Japanese, Chinese, Korean, Filipino, Iraqi, Thai, Pashto, Dari
– Other Languages
• Cost:– Free Jibbigo Online Translator– Off-Line: Freedom from Network
• Outside of App Store:– Enterprise Versions for Special
Applications
Supported Devices
Humanitarian Deployment
Cobra Gold’11Thailand
Cambodia
San Jose , Honduras
Unlimited Domain SimultaneousSpeech Translation Technologies
Domain Unlimited
Domain Unlimited Translators for:– TV/Radio Broadcast Translation– Translation of Lectures and Speeches– Parliamentary Speeches (UN, EU,..)– Telephone Conversations– Meeting Translation
你们的评估准则是什么
Translation of Speeches
University Lectures
êß*0vúbØi∫BA¬pysUêÍ}hÿ5≈ƒÄ<„y‡ëŒkû¢OFˇØ∏kô#å
¯«Zeû
EU-BRIDGE –Bridges across the Language Divide
End-to-End Speech Translation
Lectures of the Future
Meeting of the Future
Arabic
Spanish
EnglishChinese
Seeing Personal Translations
Technology: Heads-up Display Goggles– Display Translation Text in Translation Glasses
Hearing Personal Translations
Targeted Audio– Array of Ultra-Sound Speakers
– Targeted Beam of Audio
– Can only be Heard in Narrow Area
– Multiple Arrays CouldProvide Multiple Languages
Prof. Alex Waibel
Internet Delivery
Students bring their own DevicesTranscription/Translation Outputis Delivered via Web PageInterpretation Done on ServerUser Can Select Languages
ASR
MT
Lecture 1
Lecture 2
Lecture 3
Components Services Events
Service Infrastructure
Adaptation,Learning
New ImprovedTechnologies
Speech-Services forUsers andDevelopers
Prof. Alex Waibel
Lecture Interpretation ServiceLaunch at KIT: Summer 2012,Support for 4 CoursesEU-BRIDGE, Support
• Translation of Power Point Slides• Presentation by Sub-Titles
Tools for Students
Search for Content
• Transcripts useful to Search for Content– Slides, and Lectures in the Cloud– Multi-Lingual Search and Retrieval in
Lectures and Slides by Way of Search Terms
New Challenges
Simultaneous Translation of Lectures•Continuous Monologue
– Broadcast News, Speeches, Lectures•Speaking-Style
– Fast, spontaneous, fragmentary, and no punctuation!!– Noise, Caughing, Singing (!)
•Vocabulary– Much larger, Special Vocabularies
•Speed, Realtime•Service-Infrastructure
– Many parallel lectures;– Automatic, robust assignment of compute power
The German Lecture Translator
• MT in German Lectures is particularly hard. Why?• Peculiarities of German:
– Wordorder:Ich möchte mich zu der Konferenz über Maschinelle Übersetzung anmelden I want to register to the conference on Machine Translation
– Compounds:WorterkennungsfehlerrateWord Recognition Error Rate
– Inflections and Agreement:Zu der nächsten wichtigen interessanten Vorlesung
Words, Words, Words…
• Technical Termsnormally not in ‘normal’ vocabularies– Cepstral-Koeffizienten– Wälzlagerungen Roller Bearings– Unterraum Subspace
• Technical Termswith special Meanings– Klausur Final Exam (not Retreat)– Vorzeichen Sign (not Omen)
• Formulas:– Eff von Ix f(x)
Words, Words, Words….
• Foreign Words in German Language– Computer Science, English Expressions– Political Speeches, Latin Proverbs
• Accent– “Würfelkalkül” (Asfour)
• Foreign Words in German Language– “Cloud”, “iPhone”, “iPad”, “Laser”
• Inflections & Declinations of these Words– Web-ge-casted, down-ge-loaded
• Formation of Compounds:– Cloudbasierter Webcastzugriff
Solution• Use Power Point Slides and Publications• Search Internet for Similar Topics• Incorporate User Corrections• Adapt Vocabulary to Lecture
The Long Tail of Language• Languages:
– Only a Few Languages are Currently Addressed (<50)– Development of Technology Takes Long & Is Expensive
• Ongoing Research to– Lower Cost per Language and Speed Development
Discussion
• Is Interpretation by Machine Possible?– Yes, Performance will Continue to Improve
and be Made Available over the Internet
• Are we Replacing Human Interpreters?– No! Machine Translation Quality Remains Worse
it Lacks Human Judgement and Intuition– But: Human vs. Machine is Usually not the Choice we have!
What about the Rest of us? The Common Reality is:Poor English or No Communication! A Social Challenge!
• Are we Hindering Human Language Learning?– No! Technology Enables and Empowers Human Interaction
thus Motivates and Supports more Language Learning
Our Mission
• Multi-Lingual Understanding & Integration for All• Maintain & Nurture Language Diversity and Heritage• Europe must Provide for its own Effective Support
(and not Outsource or Ignore the Problem)• We Can only Achieve this if we Embrace and Integrate
Both: Human and Machine Support• Achieve Symbiotic, Scalable Solutions by
Language Services that Complement and MagnifyHuman Effort with Machine Support