|
|
Hisami Suzuki is a Researcher in the Natural Language Processing (NLP) Group.
Background
I first joined Microsoft Research in August 1995 as an intern, when I was in the graduate school of the linguistics department at the University of Chicago. My work then consisted of building a Japanese morphology and lexical component for the group's natural language understanding system, NLPWin. Since becoming a full-time employee in December 1995, I have been working on various components for NLPWin-Japanese, including morphology, syntax, language-neutral syntax and more recently, anaphora-resolution components.
My current research interests are on the issues of linguistic representation in various NLP tasks (e.g., machine translation, anaphora resolution), language modeling (particularly for the application of Japanese IME (character input method)), multi-document summarization, and processing of morphologically complex languages.
Selected Projects and Downloads
Language modeling. Microsoft Research IME Corpus is now available! This corpus provides a test data set for the task of Japanese character conversion for text input. Download the corpus from Microsoft Research IME Corpus. For more abour the corpus, see our techreport.
MSR RefRef is a simple tool for viewing coreference annotation at the document level. It has a native support for MUC and Kyoto Corpus formats. The tools is available for download for research purposes with source code from here. See our LREC 2006 paper for a detailed description of the tool.
Japanese NLP. We have a large-scale Japanese parser, currently used in our machine translation system and in building MindNet. Visit our Japanese NLP project page for a more detailed description.
MindNet is an automatically built knowledge base on lexical semantic relations. Some MindNet samples are now available online in English and Japanese! Visit our mnex (MindNet Explorer) page for an online MindNet exploration.
Presentation
-
Jianfeng Gao and Hisami Suzuki. 2007.
Foundations of Statistical Natural Language Processing: A Case Study of Text Input System.
A tutorial presented at MSRA HIT Weihai Summer School 2007 at Weihai, China.
[ppt]
Publications
Komachi, Mamoru and Hisami Suzuki. 2008. Minimally Supervised Learning of Semantic Knowledge from Query Logs. In Proceedings of IJCNLP, Hyderabad, India.
Minkov, Einat, Kristina Toutanova and Hisami Suzuki. 2007.
Generating Complex Morphology for Machine Translation. In Proceedings of ACL, Prague, Czech Republic.
Toutanova, Kristina, Chris Brockett, Michael Gamon, Jagadeesh Jagarlamudi, Hisami Suzuki and Lucy Vanderwende. 2007.
The PYTHY Summarization System: Microsoft Research at DUC2007. In Proceedings of Document Understanding Conference, presented at NAACL-HLT 2007, Rochester, New York.
Toutanova, Kristina and Hisami Suzuki. 2007
Generating Case Markers in Machine Translation. In Proceedings of NAACL-HLT, Rochester, New York.
Sekine, Satoshi and Hisami Suzuki. 2007
Acquiring Ontological Knowledge from Query Logs. In Proceedings of WWW, Banff, Alberta.
Sekine, Satoshi and Hisami Suzuki. 2007. 検索ログによる拡張固有表現辞書の整備 (Enhancing the Quality of Named Entity Dictionary Using Web Query Logs). 言語処理学会第13回全国大会論文集 (Proceedings of the 13th Annual Meeting of the Society of Natural Language Processing). In Japanese.
Suzuki, Hisami and Kristina Toutanova. 2007. 機械翻訳における日本語格助詞の生成 (Generating Case Markers for MT). 言語処理学会第13回全国大会論文集 (Proceedings of the 13th Annual Meeting of the Society of Natural Language Processing). In Japanese.
Yih, Wen-tau, Joshua Goodman, Lucy Vanderwende and Hisami Suzuki. 2007.
Multi-document Summarization by Maximizing Informative Content Words. In Proceedings of IJCAI, Hyderabad, India.
Suzuki, Hisami and Kristina Toutanova. 2006.
Learning to Predict Case Markers in Japanese. In Proceedings of COLING-ACL, Sydney, Australia.
Gao, Jianfeng, Hisami Suzuki and Bin Yu. 2006.
Approximation Lasso Methods for Language Modeling. In Proceedings of COLING-ACL, Sydney, Australia.
Vanderwende, Lucy, Hisami Suzuki and Chris Brockett. 2006.
Microsoft Research at DUC2006: Task-Focused Summarization with Sentence Simplification and Lexical Expansion. In Proceedings of Document Understanding Conference, presented at HLT-NAACL 2006, New York, New York.
Suzuki, Hisami and Gary Kacmarcik. 2006.
RefRef: A Tool for Viewing and Exploring Coreference Space. In Proceedings of LREC 2006, Genova, Italy.
Suzuki, Hisami and Kristina Toutanova. 2006. 機械学習による日本語格助詞の予測 (Prediction of Japanese Case Markers Using Machine Learning Methods). 言語処理学会第12回全国大会論文集 (Proceedings of the 12th Annual Meeting of the Society of Natural Language Processing). In Japanese.
Suzuki, Hisami and Jianfeng Gao. 2005b.
Microsoft Research IME Corpus. Microsoft Research Technical Report, TR-2005-168.
Suzuki, Hisami and Jianfeng Gao. 2005a.
A Comparative Study on Language Model Adaptation Using New Evaluation Metrics. In Proceedings of HLT/EMNLP 2005, Vancouver, Canada, pp.265-272.
Yuan, Wei, Jianfeng Gao and Hisami Suzuki. 2005.
An Empirical Study on Language Model Adaptation Using a Metric of Domain Similarity. In Proceedings of the Second International Joint Conference on Natural Language Processing (IJCNLP 05), Jeju Island, Korea, pp.957-968.
Lucy Vanderwende, Gary Kacmarcik, Hisami Suzuki and Arul Menezes. 2005.
MindNet: an automatically-created lexical resource. In HLT/EMNLP Interactive Demonstrations Proceedings, Vancouver, Canada, pp.8-9.
Lucy Vanderwende and Hisami Suzuki. 2005.
Frequency-based Summarizer and a Language Modeling Extension. Available from http://www.isi.edu/~cyl/MTSE2005/MSE2005/papers/index.html.
Suzuki, Hisami. 2004.
Phrase-Based Dependency Evaluation of a Japanese Parser. In Proceedings of LREC 2004, Lisbon, Portugal, pp.863-866.
Ringger, Eric K., Robert C. Moore, Eugene Charniak, Lucy Vanderwende and Hisami Suzuki. 2004.
Using the Penn Treebank to Evaluate Non-Treebank Parsers. In Proceedings of LREC 2004, Lisbon, Portugal, pp.867-870.
Gao, Jianfeng and Hisami Suzuki. 2004.
Capturing Long Distance Dependencies in Language Modeling: An Empirical Study. In Proceedings of First International Joint Conference
on Natural Language Processing, Sanya, Hainan, pp.53-60.
Gao, Jianfeng and Hisami Suzuki. 2003.
Unsupervised Learning of Dependency Structure for Language Modeling. In Proceedings of ACL 2003, Sapporo, pp.521-528.
Suzuki, Hisami. 2002a.
A Development Enrivonment for Large-scale Multi-lingual Parsing Systems. In Proceedings of the Workshop on Grammar Engineering an Evaluation, COLING 2002, Taipei.
Suzuki, Hisami. 2002b. Multi-modularity in Computational Grammar. Ph.D. Thesis, The University of Chicago.
Gao, Jianfeng, Hisami Suzuki and Yang Wen. 2002.
Using Headword Dependency and Predictive Clustering for Language Modeling. In Proceedings of EMNLP 2002, Philadelphia, pp.248-256.
Campbell, Richard and Hisami Suzuki. 2002a.
Language-Neutral Representation of Syntactic Structure. In
Proceedings of the First International Workshop on Scalable Natural Language Understanding (SCANALU 2002), Heidelberg, Germany.
Campbell, Richard and Hisami Suzuki. 2002b.
Language-Neutral Syntax: An Overview.
Microsoft Research Technical Report, MSR-TR-2002-76.
Corston-Oliver, Simon, Michael Gamon and Hisami Suzuki. 2001.
Using Machine Learning for System-Internal Evaluation of Transferred Linguistic Representations. In Proceedings of the MT Summit VIII
, Santiago De Compostela, Spain, pp.109-114.
Suzuki, Hisami, Chris Brockett and Gary Kacmarcik. 2000.
Using a Broad-Coverage Parser for Word-Breaking in Japanese. In Proceedings
of COLING 2000, Saarbrüken, Germany, pp.822-827.
Kacmarcik, Gary, Chris Brockett and Hisami Suzuki. 2000.
Robust Segmentation of Japanese Text into a Lattice for Parsing.
In Proceedings of COLING 2000, Saarbrüken, Germany, pp.390-396.
Natural Language Processing Group's home page
|