Learning to Extract Katakana-English Word Pairs from Non-Aligned Web Queries Using a Noisy-Channel Model of Back-Transliteration
- E. Brill ,
- G. Kacmarcik ,
- C. Brockett ,
- Eric Brill
Proceedings of NLPRS 2001 |
This paper describes a method of extracting katakana words and phrases, along with their English counterparts from non-aligned monolingual web search engine query logs. The method employs a trainable edit distance function to find