Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Using Machine Learning for System-Internal Evaluation of Transferred Linguistic Representations

Michael Gamon, Hisami Suzuki, and Simon Corston-Oliver

Abstract

We present an automated, system-internal evaluation technique for linguistic representations in a large-scale, multilingual MT system. We use machine-learned classifiers to recognize the differences between linguistic representations generated from transfer in an MT context from representations that are produced by "native" analysis of the target language. In the MT scenario, convergence of the two is the desired result. Holding the feature set and the learning algorithm constant, the accuracy of the classifiers provides a measure of the overall difference between the two sets of linguistic representations: classifiers with higher accuracy correspond to more pronounced differences between representations. More importantly, the classifiers yield the basis for error-analysis by providing a ranking of the importance of linguistic features. The more salient a linguistic criterion is in discriminating transferred representations from "native" representations, the more work will be needed in order to get closer to the goal of producing native-like MT. We present results from using this approach on the Microsoft MT system and discuss its advantages and possible extensions.

Details

Publication typeInproceedings
URLhttp://www.eamt.org
PublisherEuropean Association for Machine Translation
> Publications > Using Machine Learning for System-Internal Evaluation of Transferred Linguistic Representations