Practical Experience with Grammar Sharing in Multilingual NLP

Michael Gamon, Carmen Lozano, Jessie Pinkham, and Tom Reutter

Abstract

In the Microsoft Natural Language Processing System (MSNLP), grammar sharing between English, French, Spanish, and German has been an important means for speeding up the development time for the latter grammars. Despite significant typological differences between these languages, a mature English grammar was taken as the starting point for each of the other three grammars. In each case, through a combination of adding and deleting a modest number of grammar rules, and modifying the conditions on many others, a broad-coverage target grammar emerged. Tests indicate that this approach has been successful in achieving a high degree of coverage in a relatively short period of time.

A shorter version of this Tech Report has been published in the Proceedings of the ACL Workshop "From Research to Commercial Applications: Making NLP Work in Practice", Madrid, Spain, 1997: http://www.aclweb.org/anthology/W/W97/W97-0908.pdf

Details

Publication typeTechReport
NumberMSR-TR-97-16
Pages8
InstitutionMicrosoft Research
> Publications > Practical Experience with Grammar Sharing in Multilingual NLP