2012
·
Jianfeng
Gao and Jian-Yun Nie. 2012. Towards concept-based translation models using search logs
for query expansion. In CIKM 2012. [PDF]
·
Hisami
Suzuki and Jianfeng Gao. 2012. A unified approach to transliteration-based
text input with online spelling correction. In EMNLP 2012. [PDF]
·
Jianfeng
Gao, Shasha Xie, Xiaodong He and Alnur Ali. 2012. Learning lexicon
models from search logs for query expansion. In EMNLP 2012. [PDF]
·
Jagadeesh
Jagarlamudi and Jianfeng Gao. 2012. Modeling
click-through based word-pairs for web search. In WWW 2012 (poster). [PDF]
·
Chris
Quirk, Pallavi Choudhury, Jianfeng Gao, Hisami Suzuki, Kristina Toutanova,
Michael Gamon, Wen-tau Yih, Lucy Vanderwende and Colin Cherry. 2012. MSR SPLAT,
a language analysis toolkit. 2012. In NAACL
2012 (demo). [PDF]
(The demo system is accessible here).
2011
·
Jianfeng
Gao, Kristina Toutanova and Wen-tau Yih. 2011.
Clickthrough-based latent semantic models for web search. In SIGIR 2011. [PDF]
·
Amittai Axelrod, Xiaodong
He and Jianfeng Gao. 2011. Domain adaptation via
pseudo in-domain data selection. In EMNLP
2011. [PDF]
·
Jianfeng
Gao. 2011. Statistical translations and web search
ranking. Tutorial at 2011 ACL/SIGIR Summer School. [slides] [note]
2010
·
Jianfeng
Gao, Xiaodong He and Jian-Yun
Nie. 2010. Clickthrough-based translation models for web search: from word
models to phrase models. In CIKM 2010.
[PDF]
·
Jianfeng
Gao, Xiaolong Li, Daniel Micol,
Chris Quirk, and Xu Sun. 2010. A large scale ranker-based system for search
query spelling correction. In COLING 2010.
[PDF]
·
Alex
Cheng, Fei Xia and Jianfeng Gao.
2010. A comparison of unsupervised methods for part-of-speech tagging in
Chinese. In COLING 2010. [PDF]
·
Xu
Sun, Jianfeng Gao, Daniel Micol
and Chris Quirk. 2010. Learning phrase-based spelling error models from
clickthrough data. In ACL 2010. [PDF]
·
Kuansan
Wang, Xiaolong Li, and Jianfeng Gao. 2010. Multi-style
language model for web scale information retrieval. In SIGIR 2010. [PDF]
·
Jianfeng
Gao, Patrick Nguyen, Xiaolong Li, Chris Thrasher, Mu
Li and Kuansan Wang. 2010. A comparative study of Bing web n-gram language
models for web search and natural language processing. In SIGIR 2010 Web N-gram workshop. [PDF]
·
Jian Huang, Jianfeng Gao,
Jiangbo Miao, Xiaolong Li, Kuansan Wang and Fritz
Behr. 2010 Exploring web scale language models for search query processing. In WWW
2010. [PDF]
2009
·
Jianfeng
Gao, Wei Yuan, Xiao Li, Kefeng
Deng and Jian-Yun Nie. 2009. Smoothing clickthrough
data for web search ranking. In SIGIR.
[PDF]
·
Jianfeng
Gao, Qiang Wu, Chris Burges, Krysta Svore, Yi Su, Nazan
Khan, Shalin Shah and Hongyan Zhou. 2009. Model adaptation via model
interpolation and boosting for web search ranking. In EMNLP. [PDF]
·
Hisami
Suzuki, Xiao Li and Jianfeng Gao. 2009. Discovery of
term variation in Japanese web search queries. In EMNLP. [PDF]
·
Xiaodong
He, Mei Yang, Jianfeng Gao, Patrick Nguyen and Robert
Moore. 2009. Improving monolingual hypothesis alignment for machine translation
system combination. To appear in ACM Trans on Asian Language Information
Processing. [draft version]
·
Qiang
Wu, Christopher J. C. Burges, Krysta M. Svore and Jianfeng Gao.
2009. Adapting boosting for information retrieval. To appear in Information Retrieval. [PDF] (The original publication is
available at www.springerlink.com)
2008
·
Jianfeng
Gao and Mark Johnson. 2008. A comparison of Bayesian
estimators for unsupervised hidden Markov model POS taggers. In EMNLP. [PDF] (The toolkit which consists of
the six Bayesian estimators used in this study is available
for download.)
·
Xiaodong
He, Mei Yang, Jianfeng Gao, Patrick Nguyen and Robert
Moore. 2008. Indirect-HMM-based hypothesis alignment for combining outputs from
machine translation systems. In EMNLP. [PDF]
·
Jia
Xu, Jianfeng Gao, Kristina Toutanova and Hermann Ney.
2008. Bayesian semi-supervised Chinese word segmentation for statistical machine
translation. In Proceedings of the 22nd International Conference on
Computational Linguistics (COLING), Manchester, UK. [PDF]
·
Guihong
Cao, Jian-Yun Nie, Jianfeng Gao
and Stephen Robertson.2008. Selecting good expansion terms for pseudo-relevance
feedback. In SIGIR. [PDF]
·
Michael
Gamon, Jianfeng Gao, Chris Brockett, Alexandre Klementiev, William B. Dolan, Dmitriy
Belenko and Lucy Vanderwende. 2008. Using contextual
speller techniques and language modeling for ESL error correction. In IJCNLP. [PDF] (We have developed a web
service of the contextual speller, click here
to try.)
·
Xing
Yi, Jianfeng Gao and William B. Dolan. 2008. A
web-based English proofing system for English as a second
language users. In IJCNLP. [PDF, Poster]
2007
·
Patrick Nguyen, Jianfeng Gao, and Milind Mahajan. 2007. MSRLM: a scalable language modeling toolkit.
Microsoft Research Technical Report, MSR-TR-2007-144. [PDF] (The toolkit is used in the MSR
statistical machine translation system for NIST evaluation, and is available
for download.)
·
Jianfeng
Gao, Galen Andrew, Mark Johnson and Kristina
Toutanova. 2007. A comparative study of parameter estimation methods for
statistical natural language processing. In ACL. [PDF]
·
Galen
Andrew and Jianfeng Gao. 2007. Scalable training of L1-regularized log-linear
models. In ICML. [PDF] (The source code is available
for download.)
·
Ken
Church, Ted Hard and Jianfeng Gao. 2007. Compressing
trigram language models with Golomb coding. In EMNLP-CoNLL. [PDF]
·
Guihong
Cao, Jianfeng Gao and Jian-Yun
Nie. 2007. A system to mine large-scale bilingual dictionaries from monolingual
web pages. In MT Summit XI. [PDF]
·
Guihong
Cao, Jianfeng Gao, Jian-Yun
Nie and Jing Bai. 2007. From query translation to cross-language
query expansion with Markov chain models. In CIKM. [PDF]
·
Jianfeng
Gao and Hismai Suzuki. 2007.
Foundations of statistical natural language processing: a case study of text
input system. Tutorial at MSR Weihai Summer School. [slides] (A sample set of the IME
corpus used in the examples in the tutorial is available
for download. For a detailed description of the IME corpus, see our technical report)
2006
·
Jianfeng
Gao and Jian-Yun Nie, 2006.
Study of Statistical Models for Query Translation: Finding a Good Unit of
Translation. In SIGIR. [PDF]
·
Jianfeng
Gao, Jian-Yun Nie, Ming
Zhou. 2006. Statistical Query Translation Models for Cross Language Information
Retrieval. ACM Trans on Asian Language Information Processing, 5(4):
323-359. [draft version]
·
Jianfeng
Gao, Hisami Suzuki, Wei Yuan. 2006. An Empirical Study
on Language Model Adaptation. ACM Trans on Asian Language Information
Processing, 5(3): 207-227. [draft
version]
·
Jianfeng
Gao, Hisami Suzuki, Bin Yu. 2006. Approximation Lasso
Methods for Language Modeling. In COLING-ACL. [PDF]
·
Lei
Shi, Cheng Nie, Ming Zhou, Jianfeng Gao. 2006. A DOM
Tree Alignment Model for Mining Parallel Data from the Web. In COLING-ACL.
[PDF]
·
Zhengyu Zhou, Jianfeng Gao, Frank K Soong, Helen Meng. 2006. A Comparative Study of
Discriminative Methods for Reranking LVCSR N-best
Hypotheses in Domain Adaptation and Generalization. In ICASSP. [PS]
·
Chin-Yew
Lin, Guihong Cao, Jianfeng Gao, Jian-Yun
Nie. An Information-Theoretic Approach to Automatic Evaluation of Summaries. In
HLT-NAACL. [PDF]
·
Yi
Zhang, Ke Wu, Jianfeng Gao,
Philip Vines. 2006. Automatic Acquisition of Chinese-English Parallel Corpus
from the Web. In ECIR. [PDF]
2005