I received an MS in Computer Science from Sofia University and a PhD in Computer Science from Stanford University. My dissertation was on machine learning models for syntactic and semantic analysis (my advisor was Christopher Manning).
I have been a researcher in the Natural Language Processing group at Microsoft Research, Redmond, since 2005.
My research interests focus on modeling the structure of natural language, using machine learning. Most recently, I have been working on machine translation, morphological analysis, and part-of-speech tagging. I have also worked on syntactic parsing, semantic role labeling, and summarization.
Here is my CV.
Teaching
Chris Quirk and I are teaching a seminar on statistical machine translation at the University of Washington in Spring 2011. Here is the course web-page.
2012
- Sungchul Kim, Kristina Toutanova, and Hwanjo Yu, Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia, in ACL, Association for Computational Linguistics, 10 July 2012
- Chris Quirk, Pallavi Choudhury, Jianfeng Gao, Hisami Suzuki, Kristina Toutanova, Michael Gamon, Wen-tau Yih, Lucy Vanderwende, and Colin Cherry, MSR SPLAT, a language analysis toolkit, in NAACL-HLT 2012, 2012
2011
- Jianfeng Gao, Kristina Toutanova, and Wen-tau Yih, Clickthrough-Based Latent Semantic Models for Web Search, in Proceedings of the Thirty-Fourth Annual International ACM SIGIR Conference, ACM, 24 July 2011
- Wen-tau Yih, Kristina Toutanova, John Platt, and Chris Meek, Learning Discriminative Projections for Text Similarity Measures, in Proceedings of the Fifteenth Conference on Computational Natural Language Learning , Association for Computational Linguistics, 13 June 2011
- Kristina Toutanova and Michel Galley, Why Initialization Matters for IBM Model 1: Multiple Optima and Non-Strict Convexity, in Proc. of the Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, June 2011
- Jason Naradowsky and Kristina Toutanova, Unsupervised Bilingual Morpheme Segmentation and Alignment with Context-rich Hidden Semi-Markov Models , in ACL, Association for Computational Linguistics, June 2011
2010
- Minwoo Jeong, Kristina Toutanova, Hisami Suzuki, and Chris Quirk, A Discriminative Lexicon Model for Complex Morphology, in The Ninth Conference of the Association for Machine Translation in the Americas, Association for Computational Linguistics, 1 November 2010
- John C. Platt, Kristina Toutanova, and Scott Wen-tau Yih, Translingual Document Representations from Discriminative Projections , in Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 9 October 2010
- John Platt, Kristina Toutanova, and Wen-tau Yih, Translingual Document Representations from Discriminative Projections, in Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Association for Computational Linguistics, 9 October 2010
- Jason R. Smith, Chris Quirk, and Kristina Toutanova, Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment, in Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL, Association for Computational Linguistics, 1 June 2010
2009
- Xiaodong He and Kristina Toutanova, Joint Optimization for Machine Translation System Combination, in In Proceedings of EMNLP, Association for Computational Linguistics, August 2009
- Kristina Toutanova and Colin Cherry, A global model for joint lemmatization and part-of-speech prediction, in Proceedings of ACL, Association for Computational Linguistics, August 2009
- Hoifung Poon, Colin Cherry, and Kristina Toutanova, Unsupervised Morphological Segmentation with Log-Linear Models, in Proceedings of NAACL-HLT, Association for Computational Linguistics, June 2009
2008
- Kristina Toutanova, Hisami Suzuki, and Achim Ruopp, Applying Morphology Generation Models to Machine Translation, in Proceedings of ACL, Association for Computational Linguistics, June 2008
- Xiaodong He, Jianfeng Gao, Chris Quirk, Patrick Nguyen, Arul Menezes, Robert Moore, Kristina Toutanova, Mei Yang, Bill dolan, Mu Li, Chi-Ho Li, Dongdong Zhang, Long Jiang, and Ming Zhou, The MSR-MSRA MT System for NIST Open Machine Translation 2008 Evaluation, in The 2008 NIST Open Machine Translation Evaluation Workshop, 2008
- Kristina Toutanova and Mark Johnson, A Bayesian LDA-based Model for Semi-Supervised Part-of-speech Tagging, in In Proceedings of NIPS, MIT Press, January 2008
- Xiaodong He, Jianfeng Gao, Chris Quirk, Patrick Nguyen, Arul Menezes, Robert Moore, Kristina Toutanova, Mei Yang, Bill dolan, Mu Li, Chi-Ho Li, Dongdong Zhang, Long Jiang, Ming Zhou, George Foster, Roland Kuhn, Jing Zheng, Wen Wang, Necip Fazil Ayan, Dimitra Vergyri, Nicolas Scheffer, and Andreas Stolcke, The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation, in The 2008 NIST Open Machine Translation Evaluation Workshop, 2008
- Kristina Toutanova, Aria Haghighi, and Christopher D. Manning, A global joint model for semantic role labeling, in Computational Linguistics, 2008
- Jia Xu, Jianfeng Gao, Kristina Toutanova, and Hermann Ney, Bayesian semi-supervised Chinese word segmentation for statistical machine translation, in In Proceedings of Coling, 2008
2007
- Wen-tau Yih and Kristina Toutanova, Automatic Semantic Role Labeling (Tutorial Handout for AAAI-07), 22 July 2007
- Einat Minkov, Kristina Toutanova, and Hisami Suzuki, Generating Complex Morphology for Machine Translation, in Proceedings of ACL, Association for Computational Linguistics, June 2007
- Pi-Chuan Chang and Kristina Toutanova, A Discriminative Syntactic Word Order Model for Machine Translation, Association for Computational Linguistics, June 2007
- Kristina Toutanova, Chris Brockett, Michael Gamon, Jagadeesh Jagarlamundi, Hisami Suzuki, and Lucy Vanderwende, The Pythy Summarization System: Microsoft Research at DUC 2007, in Proceedings of DUC-2007, Association for Computational Linguistics, April 2007
- Kristina Toutanova and Hisami Suzuki, Generating Case Markers in Machine Translation, in Proceedings of NAACL, Association for Computational Linguistics, April 2007
- 鈴木久美 and Kristina Toutanova, Generating Japanese case markers for machine translation (機械翻訳における日本語格助詞の生成), in 言語処理学会第13回全国大会論文集, March 2007
- J. Gao, G. Andrew, M. Johnson, and K. Toutanova, A comparative study of parameter estimation methods for statistical natural language processing, in Proceedings of the 45th Annual Meeting of the Association for Computational Lingustics(ACL), January 2007
- Jianfeng Gao, Galen Andrew, Mark Johnson, and Kristina Toutanova, A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing, in Annual Meeting of the Association for Computational Linguistics (ACL), Association for Computational Linguistics, 2007
2006
- Kristina Toutanova, Competitive Generative Models with Structure Learning for NLP Classification Tasks, in In Proceedings of EMNLP, Association for Computational Linguistics, July 2006
- Hisami Suzuki and Kristina Toutanova, Learning to Predict Case Markers in Japanese, in Proceedings of ACL, Association for Computational Linguistics, July 2006
- Wen-tau Yih and Kristina Toutanova, Automatic Semantic Role Labeling (Tutorial Handout for HLT-NAACL-06), 4 June 2006
- 鈴木久美 and Kristina Toutanova, Automatic prediction of Japanese case markers (機械学習による日本語格助詞の予測), in 言語処理学会第12回全国大会論文集, March 2006
- Arul Menezes, Kristina Toutanova, and Chris Quirk, Microsoft research treelet translation system: NAACL 2006 Europarl evaluation, in WMT 2006, 2006
2005
- Kristina Toutanova, Effective statistical models for syntactic and semantic disambiguation, September 2005
- Kristina Toutanova, Aria Haghighi, and Christopher D. Manning, Joint learning improves semantic role labeling, in In Proceedings of ACL, 2005
- Aria Haghighi, Kristina Toutanova, and Christopher D. Manning, A Joint Model for Semantic Role Labeling, in In Proceedings of CoNLL, 2005
- Kristina Toutanova, Christopher D. Manning, Dan Flickinger, and Stephan Oepen, Stochastic HPSG Parse Selection using the Redwoods Corpus, in Journal of Logic and Computation, 2005
2004
- Kristina Toutanova, Christopher D. Manning, and Andrew Y. Ng, Learning random walk models for inducing word dependency distributions, in In Proceedings of ICML, 2004
- Kristina Toutanova, Penka Markova, and Christopher D. Manning, The Leaf Projection Path View of Parse Trees: Exploring String Kernels for HPSG Parse Selection, in In Proceedings of EMNLP, 2004
2003
- Kristina Toutanova, Mark Mitchell, and Christopher D. Manning, Optimizing Local Probability Models for Statistical Parsing, in In Proceedings of ECML, 2003
- Kristina Toutanova, Dan Klein, Christopher D. Manning, and Yoram Singer, Feature-rich part-of-speech tagging with a cyclic dependency network, in In Proceedings of NAACL, 2003
2002
- Kristina Toutanova and Robert C. Moore, Pronunciation Modeling for Improved Spelling Correction, Association for Computational Linguistics, July 2002
- Kristina Toutanova, H. Tolga Ilhan, and Christopher D. Manning, Extensions to HMM-based StatisticalWord Alignment Models, in In Proceedings of EMNLP, 2002
2000
- Kristina Toutanova and Christopher D. Manning, Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger, in In Proceedings of EMNLP, 2000
