13
FossConf 2008 Chennai Development of Indic Language Spell Checking Dictionary for OOo K G Sulochana G Jaganadh C-DAC Thiruvananthapuram

Indian Language Spellchecker Development for OpenOffice.org

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Development of Indic Language Spell Checking Dictionary for OOo

K G SulochanaG Jaganadh C-DAC Thiruvananthapuram

Page 2: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Talk Summery

Introduction

OpenOffice.org

Hunspell

Building Dictionaries

Issues

Tips for Building Dictionaries

Evaluation of Performance

Conclusion

Page 3: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Introduction

Localization

Spell checking

Spell checker Development in Indian Languages

Page 4: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

OpenOffice.org

Free and Open Source Office Suite

Collaborative Development

Available with Interface in Indian Languages

Future Office Suite of India

Page 5: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Hunspell

Spell checking library in OOo

Free and Open Source

Capable of Handling Complex Languages

Unicode Support

Compound Handling

Page 6: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Building Dictionaries

Format of Hunspell Dictionaries

Word list .dic file

Rule Base or Affix list .aff file

How to prepare .dic file ?

How to prepare .aff file ?

Building OOo with your .dic file and .aff file

Page 7: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Issues

Availability of Word list

Rule Generation

Sandhi Handling

Tuning Compound Handling part for Indian Languages

Page 8: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Tips

Tips and tricks for generating word list

Tips and Tricks for rule base development

Page 9: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Testing and Evaluation

Testing of spell checker

Performance evaluation

Quality ensuring

Page 10: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Towards Future

Works to do in Hunspell for Indic Language Spell

Checking in OOo

Setting Up User Groups of Help Support and

Development

Page 11: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Concluding Remarks

Page 12: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Questions ?

Page 13: Indian Language Spellchecker Development for OpenOffice.org

FossConf 2008 Chennai

Thank You