2014

         Jianfeng Gao, Patrick Pantel, Michael Gamon, Xiaodong He, Li Deng, Yelong Shen. 2014. Modeling interestingness with deep neural networks. In EMNLP, Oct, 2014. [PDF]

         Michael Auli, Michel Galley, Jianfeng Gao. 2014. Large-scale expected BLEU training of phrase-based reordering models. In EMNLP, Oct, 2014. [PDF]

         Yelong Shen, Xiaodong He, Jianfeng Gao, Li Deng, Gregoire Mesnil. 2014. A latent semantic model with convolutional-pooling structure for information retrieval. In CIKM, Nov, 2014. [PDF]

         Tianbing Xu, Jianfeng Gao, Lin Xiao and Amelia C. Regan. 2014. Online classification using a voted RDA method. In AAAI, July, 2014. [PDF]

         Jianfeng Gao, Xiaodong He, Wen-tau Yih and Li Deng. 2014. Learning continuous phrase representations for translation modeling. In ACL, June, 2014. [PDF]

         Michael Auli and Jianfeng Gao. 2014. Decoder integration and expected BLEU training for recurrent neural network language models. In ACL (short paper), June, 2014. [PDF]

         Xiaodong He, Jianfeng Gao and Li Deng. 2014. Deep learning for natural language processing and related applications (Tutorial at ICASSP). In ICASSP, May, 2014. [PDF]

         Yuening Hu, Michael Auli, Qin Gao and Jianfeng Gao. 2014. Minimum translation modeling with recurrent neural networks. In EACL, April, 2014. [PDF]

         Yelong Shen, Xiaodong He, Jianfeng Gao, Li Deng and Gregoire Mesnil. 2014. Learning semantic representations using convolutional neural networks for web search. In WWW (short paper), April, 2014. [PDF]

2013

         Jianfeng Gao, Xiaodong He, Wen-tau Yih and Li Deng. 2013. Learning semantic representations for the phrase translation model. Microsoft Research Technical Report MSR-TR-2013-88. September, 2013. [PDF]

         Jianfeng Gao, Tianbing Xu, Lin Xiao and Xiaodong He. 2013. A voted regularized dual averaging method for large scale discriminative training in natural language processing. Microsoft Research Technical Report MSR-TR-2013-89. September, 2013. [PDF]

         Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero and Larry Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In CIKM 2013. [PDF]

         Jianfeng Gao, Gu Xu and Jinxi Xu. 2013. Query expansion using path-constrained random walks. In SIGIR 2013. [PDF]

         Jagadeesh Jagarlamudi and Jianfeng Gao. 2013. Modeling click-through based word-pairs for web search. In SIGIR 2013. [PDF]

         Li Deng, Xiaodong He and Jianfeng Gao. 2013. Deep stacking networks for information retrieval. In ICASSP 2013. [PDF]

         Jennifer Gillenwater, Xiaodong He, Jianfeng Gao and Li Deng. 2013. End-to-end learning of parsing models for information retrieval. In ICASSP 2013. [PDF]

         Jianfeng Gao and Xiaodong He. 2013. Training MRF-based phrase translation models using gradient ascent. In NAACL 2013. [PDF]

         Hui Zhang, Kristina Toutanova, Chris Quirk and Jianfeng Gao. 2013. Beyond left-to-right: multiple decomposition structures for SMT. In NAACL 2013. [PDF]

2012

         Jianfeng Gao and Jian-Yun Nie. 2012. Towards concept-based translation models using search logs for query expansion. In CIKM 2012. [PDF]

         Hisami Suzuki and Jianfeng Gao. 2012. A unified approach to transliteration-based text input with online spelling correction. In EMNLP 2012. [PDF]

         Jianfeng Gao, Shasha Xie, Xiaodong He and Alnur Ali. 2012. Learning lexicon models from search logs for query expansion. In EMNLP 2012. [PDF]

         Jagadeesh Jagarlamudi and Jianfeng Gao. 2012. Modeling click-through based word-pairs for web search. In WWW 2012 (poster). [PDF]

         Chris Quirk, Pallavi Choudhury, Jianfeng Gao, Hisami Suzuki, Kristina Toutanova, Michael Gamon, Wen-tau Yih, Lucy Vanderwende and Colin Cherry. 2012. MSR SPLAT, a language analysis toolkit. In NAACL 2012 (demo). [PDF] (The demo system is accessible here).

         Kristen Parton and Jianfeng Gao. 2012. Combining signals for cross-lingual relevance feedback. In AIRS 2012. [PDF]

2011

         Jianfeng Gao, Kristina Toutanova and Wen-tau Yih. 2011. Clickthrough-based latent semantic models for web search. In SIGIR 2011. [PDF]

         Amittai Axelrod, Xiaodong He and Jianfeng Gao. 2011. Domain adaptation via pseudo in-domain data selection. In EMNLP 2011. [PDF]

         Jianfeng Gao. 2011. Statistical translations and web search ranking. Tutorial at 2011 ACL/SIGIR Summer School. [slides] [note]

2010

         Jianfeng Gao, Xiaodong He and Jian-Yun Nie. 2010. Clickthrough-based translation models for web search: from word models to phrase models. In CIKM 2010. [PDF]

         Jianfeng Gao, Xiaolong Li, Daniel Micol, Chris Quirk, and Xu Sun. 2010. A large scale ranker-based system for search query spelling correction. In COLING 2010. [PDF]

         Alex Cheng, Fei Xia and Jianfeng Gao. 2010. A comparison of unsupervised methods for part-of-speech tagging in Chinese. In COLING 2010. [PDF]

         Xu Sun, Jianfeng Gao, Daniel Micol and Chris Quirk. 2010. Learning phrase-based spelling error models from clickthrough data. In ACL 2010. [PDF]

         Kuansan Wang, Xiaolong Li, and Jianfeng Gao. 2010. Multi-style language model for web scale information retrieval. In SIGIR 2010. [PDF]

         Jianfeng Gao, Patrick Nguyen, Xiaolong Li, Chris Thrasher, Mu Li and Kuansan Wang. 2010. A comparative study of Bing web n-gram language models for web search and natural language processing. In SIGIR 2010 Web N-gram workshop. [PDF]

         Jian Huang, Jianfeng Gao, Jiangbo Miao, Xiaolong Li, Kuansan Wang and Fritz Behr. 2010. Exploring web scale language models for search query processing. In WWW 2010. [PDF]

2009

         Jianfeng Gao, Wei Yuan, Xiao Li, Kefeng Deng and Jian-Yun Nie. 2009. Smoothing clickthrough data for web search ranking. In SIGIR. [PDF]

         Jianfeng Gao, Qiang Wu, Chris Burges, Krysta Svore, Yi Su, Nazan Khan, Shalin Shah and Hongyan Zhou. 2009. Model adaptation via model interpolation and boosting for web search ranking. In EMNLP. [PDF]

         Hisami Suzuki, Xiao Li and Jianfeng Gao. 2009. Discovery of term variation in Japanese web search queries. In EMNLP. [PDF]

         Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen and Robert Moore. 2009. Improving monolingual hypothesis alignment for machine translation system combination. To appear in ACM Trans on Asian Language Information Processing. [draft version]

         Qiang Wu, Christopher J. C. Burges, Krysta M. Svore and Jianfeng Gao. 2009. Adapting boosting for information retrieval. To appear in Information Retrieval. [PDF] (The original publication is available at www.springerlink.com)

 

2008

         Jianfeng Gao and Mark Johnson. 2008. A comparison of Bayesian estimators for unsupervised hidden Markov model POS taggers. In EMNLP. [PDF] (The toolkit which consists of the six Bayesian estimators used in this study is available for download.)

         Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen and Robert Moore. 2008. Indirect-HMM-based hypothesis alignment for combining outputs from machine translation systems. In EMNLP. [PDF]

         Jia Xu, Jianfeng Gao, Kristina Toutanova and Hermann Ney. 2008. Bayesian semi-supervised Chinese word segmentation for statistical machine translation. In Proceedings of the 22nd International Conference on Computational Linguistics (COLING), Manchester, UK. [PDF]

         Guihong Cao, Jian-Yun Nie, Jianfeng Gao and Stephen Robertson. 2008. Selecting good expansion terms for pseudo-relevance feedback. In SIGIR. [PDF]

         Michael Gamon, Jianfeng Gao, Chris Brockett, Alexandre Klementiev, William B. Dolan, Dmitriy Belenko and Lucy Vanderwende. 2008. Using contextual speller techniques and language modeling for ESL error correction. In IJCNLP. [PDF] (We have developed a web service of the contextual speller, click here to try.)

         Xing Yi, Jianfeng Gao and William B. Dolan. 2008. A web-based English proofing system for English as a second language users. In IJCNLP. [PDF, Poster]

 

2007

         Patrick Nguyen, Jianfeng Gao, and Milind Mahajan. 2007. MSRLM: a scalable language modeling toolkit. Microsoft Research Technical Report, MSR-TR-2007-144. [PDF] (The toolkit is used in the MSR statistical machine translation system for NIST evaluation, and is available for download.)

         Jianfeng Gao, Galen Andrew, Mark Johnson and Kristina Toutanova. 2007. A comparative study of parameter estimation methods for statistical natural language processing. In ACL. [PDF]

         Galen Andrew and Jianfeng Gao. 2007. Scalable training of L1-regularized log-linear models. In ICML. [PDF] (The source code is available for download.)

         Ken Church, Ted Hart and Jianfeng Gao. 2007. Compressing trigram language models with Golomb coding. In EMNLP-CoNLL. [PDF]

         Guihong Cao, Jianfeng Gao and Jian-Yun Nie. 2007. A system to mine large-scale bilingual dictionaries from monolingual web pages. In MT Summit XI. [PDF]

         Guihong Cao, Jianfeng Gao, Jian-Yun Nie and Jing Bai. 2007. From query translation to cross-language query expansion with Markov chain models. In CIKM. [PDF]

         Jianfeng Gao and Hisami Suzuki. 2007. Foundations of statistical natural language processing: a case study of text input system. Tutorial at MSR Weihai Summer School. [slides] (A sample set of the IME corpus used in the examples in the tutorial is available for download. For a detailed description of the IME corpus, see our technical report.)

2006

         Jianfeng Gao and Jian-Yun Nie. 2006. Study of Statistical Models for Query Translation: Finding a Good Unit of Translation. In SIGIR. [PDF]

         Jianfeng Gao, Jian-Yun Nie, Ming Zhou. 2006. Statistical Query Translation Models for Cross Language Information Retrieval. ACM Trans on Asian Language Information Processing, 5(4): 323-359. [draft version]

         Jianfeng Gao, Hisami Suzuki, Wei Yuan. 2006. An Empirical Study on Language Model Adaptation. ACM Trans on Asian Language Information Processing, 5(3): 207-227. [draft version]

         Jianfeng Gao, Hisami Suzuki, Bin Yu. 2006. Approximation Lasso Methods for Language Modeling. In COLING-ACL. [PDF]

         Lei Shi, Cheng Nie, Ming Zhou, Jianfeng Gao. 2006. A DOM Tree Alignment Model for Mining Parallel Data from the Web. In COLING-ACL. [PDF]

         Zhengyu Zhou, Jianfeng Gao, Frank K Soong, Helen Meng. 2006. A Comparative Study of Discriminative Methods for Reranking LVCSR N-best Hypotheses in Domain Adaptation and Generalization. In ICASSP. [PS]

         Chin-Yew Lin, Guihong Cao, Jianfeng Gao, Jian-Yun Nie. 2006. An Information-Theoretic Approach to Automatic Evaluation of Summaries. In HLT-NAACL. [PDF]

         Yi Zhang, Ke Wu, Jianfeng Gao, Philip Vines. 2006. Automatic Acquisition of Chinese-English Parallel Corpus from the Web. In ECIR. [PDF]

2005

2004

2003

2002

2001

2000

1999

         Jianfeng Gao. 1999. Case and constraint: research on intelligent CAD systems. (in Chinese) PhD thesis, Shanghai Jiaotong University, 1999.  (zip)