Bio
Hisami Suzuki is a Researcher in the Natural Language Processing Group at MSR-Redmond. She joined the group in 1995, and has since worked on various NLP projects, including the development of a Japanese parser (NLPWin), input method for Japanese (IME), machine translation (MT), and more recently, knowledge acquisition from large data sources. Her main research interest is in learning and using linguistic knowledge for solving NLP problems, especially in Asian languages. She received a Ph.D in linguistics from the University of Chicago in 2002.
Selected Projects and Downloads
Language modeling. Microsoft Research IME Corpus is now available! This corpus provides a test data set for the task of Japanese character conversion for text input. Download the corpus from Microsoft Research IME Corpus. For more abour the corpus, see our techreport.
MSR RefRef is a simple tool for viewing coreference annotation at the document level. It has a native support for MUC and Kyoto Corpus formats. The tools is available for download for research purposes with source code from here. See our LREC 2006 paper for a detailed description of the tool.
Japanese NLP. We have a large-scale Japanese parser, currently used in our machine translation system and in building MindNet. Visit our Japanese NLP project page for a more detailed description.
MindNet is an automatically built knowledge base on lexical semantic relations. Some MindNet samples are now available online in English and Japanese! Visit our Mindnet/mnex (MindNet Explorer) page for an online MindNet exploration.
2009
- Hisami Suzuki, Xiao Li, and Jianfeng Gao, Discovery of Term Variation in Japanese Web Search Queries, in EMNLP, August 2009
- Colin Cherry and Hisami Suzuki, Discriminative Substring Decoding for Transliteration, in Proceedings of EMNLP, Association for Computational Linguistics, August 2009
2008
- Kristina Toutanova, Hisami Suzuki, and Achim Ruopp, Applying Morphology Generation Models to Machine Translation, in Proceedings of ACL, Association for Computational Linguistics, June 2008
- Mamoru Komachi and Hisami Suzuki, Minimally Supervised Learning of Semantic Knowledge from Query Logs, in Proceedings of IJCNLP, Hyderabad, India, January 2008
- 小町守 and 鈴木久美, Improving minimally supervised learning of semantic knowledge from query logs (検索ログからの半教師あり意味知識獲得の改善), in 人工知能学会論文誌, vol. 23, no. 3, pp. 217-225, 2008
2007
- Einat Minkov, Kristina Toutanova, and Hisami Suzuki, Generating Complex Morphology for Machine Translation, Association for Computational Linguistics, June 2007
- Kristina Toutanova, Chris Brockett, Michael Gamon, Jagadeesh Jagarlamundi, Hisami Suzuki, and Lucy Vanderwende, The Pythy Summarization System: Microsoft Research at DUC 2007, Association for Computational Linguistics, April 2007
- Kristina Toutanova and Hisami Suzuki, Generating Case Markers in Machine Translation, Association for Computational Linguistics, April 2007
- 関根聡 and 鈴木久美, Enriching extended named entity resources using query logs (検索ログによる拡張固有表現辞書の整備), in 言語処理学会第13回全国大会論文集, March 2007
- 鈴木久美 and Kristina Toutanova, Generating Japanese case markers for machine translation (機械翻訳における日本語格助詞の生成), in 言語処理学会第13回全国大会論文集, March 2007
- W. Yih, J. Goodman, L. Vanderwende, and H. Suzuki, Multi-Document Summarization by Maximizing Informative Content-Words, in Proceedings of IJCAI 2007, 12 January 2007
- Satoshi Sekine and Hisami Suzuki, Acquiring Ontological Knowledge from Query Logs, in Proceedings of WWW, Banff, Alberta, 2007
- Lucy Vanderwende, Hisami Suzuki, Chris Brockett, and Ani Nenkova, Beyond SumBasic: Task-Focused Summarization with Sentence Simplification and Lexical Expansion, in Information Processing and Management, Volume 43 , Issue 6, pages 1606-1618 , 2007
2006
- Jianfeng Gao, Hisami Suzuki, and Bin Yu, Approximation Lasso Methods for Language Modeling, in In Proceedings of ACL, Sydney, Australia, July 2006
- Hisami Suzuki and Kristina Toutanova, Learning to Predict Case Markers in Japanese, Association for Computational Linguistics, July 2006
- Hisami Suzuki and Gary Kacmarcik, RefRef: A Tool for Viewing and Exploring Coreference Space, European Language Resources Association, May 2006
- 鈴木久美 and Kristina Toutanova, Automatic prediction of Japanese case markers (機械学習による日本語格助詞の予測), in 言語処理学会第12回全国大会論文集, March 2006
2005
- Hisami Suzuki and Jianfeng Gao, Microsoft Research IME Corpus, no. MSR-TR-2005-168, December 2005
- Lucy Vanderwende, Gary Kacmarcik, Hisami Suzuki, and Arul Menezes, MindNet: an automatically-created lexical resource, in HLT/EMNLP Interactive Demonstrations Proceedings, October 2005
- Hisami Suzuki and Jianfeng Gao, A Comparative Study on Language Model Adaptation Using New Evaluation Metrics, Association for Computational Linguistics, October 2005
- Wei Yuan, Jianfeng Gao, and Hisami Suzuki, An Empirical Study on Language Model Adaptation Using a Metric of Domain Similarity, Springer-Verlag, October 2005
- 鈴木久美, Gary Kacmarcik, Lucy Vanderwende, and Arul Menezes, Mindnet/mnex: Tools for automatic construction and analysis of semantic relations database (意味関係データベースの自動構築と解析のためのツール), in 言語処理学会第11回全国大会論文集, March 2005
2004
- Hisami Suzuki, Phrase-Based Dependency Evaluation of a Japanese Parser, European Language Resources Association, May 2004
- Eric Ringger, Robert C. Moore, Eugene Charniak, Lucy Vanderwende, and Hisami Suzuki, Using the Penn Treebank to Evaluate Non-Treebank Parsers, European Language Resources Association, May 2004
- Jianfeng Gao and Hisami Suzuki, Capturing long distance dependency for language modeling: an empirical study, March 2004
- Jianfeng Gao and Hisami Suzuki, Capturing Long Distance Dependencies in Language Modeling, Asia Federation of Natural Language Processing, March 2004
2003
- Jianfeng Gao and Hisami Suzuki, Unsupervised Learning of Dependency Structure for Language Modeling, Association for Computational Linguistics, July 2003
2002
- Hisami Suzuki, A Development Enrivonment for Large-scale Multi-lingual Parsing Systems, Association for Computational Linguistics, August 2002
- Jianfeng Gao, Hisami Suzuki, and Yang Wen, Using Headword Dependency and Predictive Clustering for Language Modeling, Association for Computational Linguistics, July 2002
- Richard Campbell and Hisami Suzuki, Language-Neutral Syntax: An Overview, no. MSR-TR-2002-76, July 2002
- Richard Campbell and Hisami Suzuki, Language-Neutral Representation of Syntactic Structure, European Media Laboratory GmbH, May 2002
2001
- Simon Corston-Oliver, Michael Gamon, and Hisami Suzuki, Using Machine Learning for System-Internal Evaluation of Transferred Linguistic Representations, European Association for Machine Translation, January 2001
2000
- Gary Kacmarcik, Chris Brockett, and Hisami Suzuki, Robust Segmentation of Japanese Text into a Lattice for Parsing, DFKI GmbH, August 2000
- Hisami Suzuki, Chris Brockett, and Gary Kacmarcik, Using a Broad-Coverage Parser for Word-Breaking in Japanese, DFKI GmbH, August 2000



