Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora

Arul Menezes and Stephen D. Richardson

Abstract

Translation systems that automatically extract transfer mappings (rules or examples) from bilingual corpora have been hampered by the difficulty of achieving accurate alignment and acquiring high quality mappings. We describe an algorithm that uses a best-first strategy and a small alignment grammar to significantly improve the quality of the transfer mappings extracted. For each mapping, frequencies are computed and sufficient context is retained to distinguish competing mappings during translation. Variants of the algorithm are run against a corpus containing 200K sentence pairs and evaluated based on the quality of resulting translations.

Details

Publication typeInproceedings
URLhttp://www.eamt.org
PublisherAssociation for Computational Linguistics
> Publications > A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora