Automatic Validation of Terminology Translation Consistency with Statistical Method

  • Masaki Itagaki ,
  • Takako Aikawa ,
  • Xiaodong He

Published by European Association for Machine Translation

Publication

This paper presents a novel method to automatically validate terminology consistency in localized materials. The goal of the paper is twofold. First, we explore a way to extract phrase pair translations for compound nouns from a bilingual corpus using word alignment data. To validate the quality of the extracted phrase pair translations, we use a Gaussian mixture model (GMM) classifier . Second, we quantify consistency of translation as a measurement of quality. With this approach, a quality assurance process for terminology translation can be fully automated. It can also be used for maintaining bilingual training data quality for machine translation.