Summary Report
Survey on Research and Development of Machine Translation in Asian Countries
Virach Sornlertlamvanich
Information Research and Development Division
National Electronics and Computer Technology Center
Thailand
4-8 August 2002, 2nd AIIS/APT, Brunei
Survey on Research and Development of Machine Translation
in Asian Countries
Merlin Beach Resort, Phuket, Thailand
May 13-14, 2002
Participation
Participants: 11 countries
India, Indonesia, Japan, Korea, Lao PDR, Malaysia, Myanmar, Philippines, Singapore, Thailand, Vietnam
and 1 region: Hong Kong
Total participants: 50 (overseas 24, local 26)
- R&D group (39)
- Supporting, policy and planning group (11)
Objectives:
1) To update the technology status of machine translation in Asia.
2) To exchange research and development experience in the field.
3) To establish collaboration for developing cross-language web navigation in Asia.
4) To establish activities for technology transfer from experienced countries to inexperienced countries.
5) To develop human resources in the field.
Activities
- Keynote speech: 'AAMT Activities and General Trends of MT' by Dr. Hitoshi Iida
- 17 papers by participants
  * MT Research Techniques: spoken language translation, semantic annotation, research on particular cases
  * MT Research Status in the Countries
  * Digitalization Research and Infrastructure Status
  * Problems in R&D
- Roundtable Discussion
  * Collaboration within Asia
  * Standardization for languages within this region
  * Financial support problem
  * Possibility of joining the existing working bodies
MT status in Asia
Basic components:
- Standardization (character code and locale): Lao PDR, Myanmar, Philippines, Vietnam
Working on regional MT:
- Philippines and India (official language - dialects)
- Malaysia, Indonesia and Brunei (share resources for Malay-English MT development)
Experience with English and mother-language MT:
- Hong Kong, Indonesia, Japan, Korea, Malaysia, Singapore, Thailand
MT in Thailand
Computerization (1970-)
Experience (1981-1996)
Development and Service (1998-)
Regional Collaboration (200?)
[Timeline figure: the four phases above on an axis from 1970 to 2005]
ParSit
SMART TRANSLATOR 1.9 TRANSLATOR 1.5
AGENT DICT
PLAETHAI
Available MT in Thailand
ParSit Service Process
1) User sends URL/text to the ParSit server.
2) ParSit server receives the data from the user and sends it to the MT server.
3) MT server (E-T) processes the data.
4) ParSit server receives the result from the MT server and sends it to the user.
5) User receives the Thai web page (ThaiWP) from the server.
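The five-step relay above can be illustrated with a minimal sketch. Everything named here (`Request`, `MTServer`, `ParSitRelay`, `toy_translate`, `toy_fetch`) is hypothetical and not taken from the ParSit implementation. The slides do not say which component fetches the page behind a submitted URL, so the sketch assumes the relay does it, and the translator is a placeholder that only tags its input.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Request:
    """Step 1: what the user submits (hypothetical shape)."""
    kind: str      # "url" or "text"
    payload: str


class MTServer:
    """Stand-in for the MT server (E-T) of step 3."""

    def __init__(self, translate: Callable[[str], str]) -> None:
        self._translate = translate

    def translate(self, source_text: str) -> str:
        return self._translate(source_text)


class ParSitRelay:
    """Stand-in for the ParSit server, which sits between user and MT server."""

    def __init__(self, mt_server: MTServer, fetch_page: Callable[[str], str]) -> None:
        self._mt = mt_server
        self._fetch_page = fetch_page  # assumption: the relay resolves URLs itself

    def handle(self, request: Request) -> str:
        # Step 2: receive data from the user and prepare it for the MT server.
        if request.kind == "url":
            source_text = self._fetch_page(request.payload)
        elif request.kind == "text":
            source_text = request.payload
        else:
            raise ValueError(f"unsupported request kind: {request.kind!r}")
        # Step 3: the MT server produces the translation.
        translated = self._mt.translate(source_text)
        # Step 4: the relay passes the result back; step 5 is the user receiving it.
        return translated


def toy_translate(text: str) -> str:
    """Placeholder translator: tags the text instead of translating it."""
    return "[TH] " + text


def toy_fetch(url: str) -> str:
    """Placeholder page fetcher: returns canned content, no network access."""
    return "Content of " + url


relay = ParSitRelay(MTServer(toy_translate), fetch_page=toy_fetch)
print(relay.handle(Request("text", "Good morning")))
print(relay.handle(Request("url", "http://example.org")))
```

Keeping the relay separate from the MT server mirrors the split shown in the slide: the user only ever talks to the front server, so the translation engine behind it could be replaced without changing the user-facing side.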
Number of unique sessions per month (Translation)

Month    Sessions
Jul-01   69944
Aug-01   65932
Sep-01   80847
Oct-01   59616
Nov-01   94800
Dec-01   101275
Jan-02   128042
Feb-02   105652
Mar-02   85070
Apr-02   46395
May-02   41376
Jun-02   100858
Jul-02   134630

Chart annotation: Move to Science Park
Expected Collaboration in Asia
Establish Asian chapters for Intra-Asian Collaboration.
Construct Help-desk operation for standardization.
Establish a Working group or Liaison Secretariat for Language resources.
Establish a Working group or Liaison Secretariat for MT.
Standardize among Asian languages
Share language resources
Exchange information for research & application
Three Levels of Collaboration
Standardization: font, character code, I/O method, print, locale
Language Resources and Processing Tools: dictionary, corpus, NLP generic tools
Cross-Language R&D and Services: machine translation, search engine, information retrieval/extraction
Resource Sharing
Cross-Language Technology
Open Source Software
Cross Language Technology
[Diagram: Language Processing modules for English, Chinese, Japanese, French, Korean, Myanmar, Vietnam, Indonesia, Thai and further languages, each paired with an e-Content Dictionary and linked by MT components]
Application over the Cross Language Tech.
E-services: Visualization, Presentation, Extraction, Retrieval, Summarization, MT, Mining
Cross Language Technology
[Diagram: the E-services above layered over Language Processing modules for English, Chinese, Japanese, French, Korean, Myanmar, Vietnam, Indonesia, Thai and further languages, each paired with an e-Content Dictionary and linked by MT components]
Bridge the Digital Divide by Overcoming the Language Barrier with a Smart MT Solution
[email protected]
http://www.links.nectec.or.th/virach