*
Quick Links|Home|Worldwide
Microsoft*
Search for





Bill Dolan


Bill Dolan is a Principal Researcher and Manager of the Natural Language Processing Group.


Background Information

I am a Principal Researcher in Microsoft Research, where I manage the Natural Language Processing group .

My undergraduate degree is from UC Berkeley, and my Ph.D. is from UCLA Linguistics . I joined MSR in 1992, and most of my work since then has focused on semantic processing. I am also deeply involved in our group's machine translation effort (see our MT blog and the and Live Translator homepage), as well as the Microsoft Research ESL Assistant project, which helps non-native speakers write better English by showing them targeted usage examples culled from the web. For more details on this project, see our project site and blog .

During the 1990s I was preoccupied with building richly structured semantic networks from text data as part of the MindNet project. This work spurred my interest in "the paraphrase problem": when do superficially dissimilar strings of words convey essentially the same meaning?

  • On its way to an extended mission at Saturn, the Cassini probe on Friday makes its closest rendezvous with Saturn's dark moon Phoebe.
  • The Cassini spacecraft, which is en route to Saturn, is about to make a close pass of the ringed planet's mysterious moon Phoebe.

    Learning to identify and generate such paraphrase alternations is key to developing applications that appear to understand human language, and we've done some interesting work in this area . I have also been active in helping establishing the Recognizing Textual Entailment challenges , which address a closely related problem.


    Selected Publications

  • Xing Yi, Jianfeng Gao and William B. Dolan. 2008. A Web-based English Proofing System for English as a Second Language Users, In Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP), Hyderabad, India, January 7-12, 2008.

  • Michael Gamon, Jianfeng Gao, Chris Brockett, Alexandre Klementiev, William B. Dolan, Dmitriy Belenko, Lucy Vanderwende. 2007. Using Contextual Speller Techniques and Language Modeling for ESL Error Correction. In Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP) 2008, Hyderabad, India.

  • Bar-Haim, R., I. Dagan, B. Dolan, L. Ferro, D. Giampiccolo, B. Magnini and I. Szpektor. 2006. The Second PASCAL Recognising Textual Entailment Challenge. In Proceedings of the Second PASCAL Challenges Workshop on Recognising Textual Entailment, Venice, Italy.

  • Chris Brockett, William B. Dolan and Michael Gamon. 2006. Correcting ESL Errors Using Phrasal SMT Techniques. In Proceedings of COLING-ACL 2006, Sydney, Australia.

  • Vanderwende, L. and W. B. Dolan. 2006. What syntax can contribute in the entailment task. In MLCW 2005, LNAI 3944, pp. 205-216. J. Quinonero-Candela et al. (eds.). Springer-Verlag.

  • Dolan, W. B. and C. Brockett. 2005. Automatically Constructing a Corpus of Sentential Paraphrases. In Proceedings of IWP2005: The Third International Workshop on Paraphrasing, Cheju, Korea.

  • Brockett, C. and W. B. 2005. Support Vector Machines for Paraphrase Identification and Corpus Construction. In Proceedings of IWP2005: The Third International Workshop on Paraphrasing,Cheju, Korea.

  • Brockett, C. and W. B. Dolan. 2005. Echo Chamber: A Game for Eliciting a Colloquial Paraphrase Corpus. In AAAI 2005 Spring Symposium, Knowledge Collection from Volunteer Contributors (KCVC05). Stanford, CA. March 21-23, 2005.

  • Quirk, C., C. Brockett, and W. B. Dolan. 2004. Monolingual Machine Translation for Paraphrase Generation, In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona Spain, pp. 142-149. (Corrected version) (Alternate link)

  • Dolan W. B., C. Quirk, and C. Brockett. 2004. Unsupervised Construction of Large Paraphrase Corpora: Exploiting Massively Parallel News Sources. Proceedings of COLING 2004, Geneva, Switzerland.

  • Dolan, W.B., J. Pinkham, and S. D. Richardson. 2002. MSR-MT: the Microsoft research machine translation system. Machine translation: from research to real users. In Fifth conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 237-239.

  • Richardson, S., W. Dolan, A. Menezes, and J. Pinkham. 2001. Achieving commercial-quality translation with example-based methods. In Proceedings of MT Summit VIII, Santiago De Compostela, Spain, pp. 293-298.

  • Richardson, S., W. Dolan, A. Menezes, and M. Corston-Oliver. 2001. Overcoming the customization bottleneck using example-based MT. In Proceedings, Workshop on Data-driven Machine Translation, 39th Annual Meeting and 10th Conference of the European Chapter, Association for Computational Linguistics Toulouse, France, pp. 9-16.

  • Dolan, William, Lucy Vanderwende, and Stephen Richardson. 2000. Polysemy in a Broad-Coverage Natural Language Processing System. In Polysemy: Theoretical and Computational Approaches. Ravin, Y. and Leacock, C., eds., Oxford University Press.

  • Richardson, Stephen D., Dolan, William B., and Vanderwende, Lucy 1998. MindNet: acquiring and structuring semantic information from text. In Proceedings of COLING '98.

  • Dolan, W. 1995. Metaphor as an Emergent Property of Machine-Readable Dictionaries. In Proceedings of the AAAI 1995 Spring Symposium Series.

  • Dolan, W. 1994. Word Sense Ambiguation: clustering related senses. In Proceedings of COLING94, 712-716.

  • Dolan, William B. 1994. Exploiting Lexical Information for Visual Processing. In Proceedings of AAI-94 Workshop on the Integration of Natural Language and Vision Processing, Seattle, Washington, 185-188.

  • Dolan, William B., L. Vanderwende, and S. Richardson. 1993. Automatically Deriving Structured Knowledge Base from On-line Dictionaries. In Proceedings of the Pacific Association for Computational Linguistics, April 21-24, 1993, Vancouver, British Columbia.

  • Richardson, S., L. Vanderwende, and W. Dolan. 1993. Combining Dictionary-based and Example-based Methods for Natural Language Analysis.. In . Proceedings of the Fifth International Conference on Theoretical and Methodological Issues in Machine Translation, Kyoto, Japan. pp 69-79.



  • ©2008 Microsoft Corporation. All rights reserved. Terms of Use |Trademarks |Privacy Statement