Challenges in Designing Input Method Editors for Indian Languages: The Role of Word-Origin and Context

Back-transliteration based Input Method Editors are very popular for Indian Lan-guages. In this paper we evaluate two such Indic language systems to help un-derstand the challenge of designing a back-transliteration based IME. Through a detailed error-analysis of Hindi, Bang-la and Telugu data, we study the role of phonological features of Indian scripts that are reflected as variations and am-biguity in the transliteration. The impact of word-origin on back-transliteration is discussed in the context of code-switching. We also explore the role of word-level context to help overcome some of these challenges.

In  Proceedings of IJCNLP Workshop on Advances in Text Input Methods

Publisher  Association for Computational Linguistics

Details

TypeInproceedings
Share
Share this page on Facebook
Share this page on Twitter
Share this page on LinkedIn
E-mail this page
RSS feeds
> Publications > Challenges in Designing Input Method Editors for Indian Languages: The Role of Word-Origin and Context