Using Statistical Techniques and Web Search to Correct ESL Errors

In this paper we present a system for automatic correction of errors made by learners of English. The system has two novel aspects. First, machine-learned classifiers trained on large amounts of native data are combined with a very large language model to optimize the precision of suggested corrections. Second, the user can access real-life web examples of both their original formulation and the suggested correction. We discuss technical details of the system, including the choice of classifier, feature sets, and language model. We also present results from an evaluation of the system: an automatic evaluation on native English data and a detailed manual analysis of performance on three corpora of nonnative writing, namely the Chinese Learner English Corpus (CLEC) and two corpora of web and email writing.

In CALICO Journal, Vol. 26, No. 3

Publisher: CALICO Journal
Copyright © 2008 CALICO Journal

Details

Type: Article