Asymmetric Features of Human Generated Translation
- Sauleh Eetemadi
- Kristina Toutanova
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing
Published by Association for Computational Linguistics
Distinct properties of translated text have been the subject of research in linguistics for many years (Baker, 1993). In recent years, computational methods have been developed to empirically verify linguistic theories about translated text (Baroni and Bernardini, 2006). While many characteristics of translated text are more apparent in comparison to the original text, most prior research has focused on monolingual features of translated and original text. The contribution of this work is to introduce bilingual features that can explain differences in translation direction using localized linguistic phenomena at the phrase or sentence level, rather than monolingual statistics at the document level. We show that these bilingual features outperform the monolingual features used in prior work (Kurokawa et al., 2009) on the task of classifying translation direction.