Hang Li (李航)

Senior Researcher and Research Manager in the Information Retrieval and Mining Group, Microsoft Research.

I am a research manager at Microsoft Research. I am also adjunct professor of Peking University, Nanjing University, Xian Jiaotong University, and Nankai University.  I am co-director of the MS joint lab at PKU and member of the machine learning lab at NJU.

I joined Microsoft Research in June 2001. Prior to that, I worked at the Research Laboratories of NEC Corporation.

I obtained a B.S. in Electrical Engineering from Kyoto University in 1988 and a M.S. in Computer Science from Kyoto University in 1990. I earned my Ph.D. in Computer Science from the University of Tokyo in 1998.

I am interested in statistical learning, natural language processing, data mining, and information retrieval.
 

    Contact Information

Microsoft Research Asia

4F, Sigma Center

No. 49 Zhichun Road, Haidian District

Beijing, China 100080

Email: hangli at microsoft dot com

Tel: (86-10)58963177

Fax: (86-10)88097306

 

     Recent Publications (Publication List)

      Learning to Rank

·         Xiubo Geng, Tie-Yan Liu, Tao Qin, Andrew Arnold, Hang Li, Heung-Yeung  Shum, Query Dependent Ranking with K Nearest Neighbor, Proc. of SIGIR 2008, 115-122. (pdf)

·         Jun Xu, Tie-Yan Liu, Min Lu, Hang Li, Wei-Ying Ma, Directly Optimizing Evaluation Measures in Learning to Rank, Proc. of SIGIR 2008, 107-114. (pdf)

·         Fen Xia, Tie-Yan Liu, Jue Wang, Wensheng Zhang, Hang Li, Listwise Approach to Learning to Rank Theory and Algorithm, Proc. of ICML 2008, 1192-1199.  (pdf)

·         Yanyan Lan, Tie-Yan Liu, Tao Qin, Zhiming Ma, Hang Li, Query Level Stability and Generalization in Learning to Rank, Proc. of ICML 2008, 512-519. (pdf).

·         Rong Jin, Hamed Valizadegan, Hang Li, Ranking Refinement and Its Application to Information Retrieval, Proc. of WWW 2008, 397-406. (pdf)

·         Tao Qin, Tie-Yan Liu, Xu-Dong Zhang, De-Sheng Wang, Wen-Ying Xiong, Hang Li, Learning to Rank Relational Objects and Its Application to Web Search, Proc. of WWW 2008, 407-416. (pdf)

·         Tao Qin, Xu-Dong Zhang, Ming-Feng Tsai, De-Sheng Wang, Tie-Yan Liu, Hang Li, Query-level Loss Functions for Information Retrieval,  Information Processing and Management, 44, 838-855, 2008. (pdf)

·         Tie-Yan Liu, Jun Xu, Tao Qin, Wenying Xiong, Hang Li, LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval, Proc. of SIGIR 2007 Workshop on Learning to Rank for Information Retrieval. (pdf)

·         Tao Qin, Tie-Yan Liu, Wei Lai, Xu-Dong Zhang, De-Sheng Wang, Hang Li, Ranking with Multiple Hyperplanes, Proc. of SIGIR 2007, 279-286. (pdf)

·         Jun Xu, Hang Li,  AdaRank: A Boosting Algorithm for Information Retrieval, Proc. of SIGIR 2007, 391-398. (pdf)

·         Xiuobo Geng, Tie-Yan Liu, Tao Qin, Hang Li, Feature Selection for Ranking, Proc. of SIGIR 2007, 407-414. (pdf)

·         Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, Hang Li, Learning to Rank: From Pairwise Approach to Listwise Approach, Proc. of ICML 2007, 129-136. (pdf)

·         Yu-Ting Liu, Tie-Yan Liu, Tao Qin, Zhi-Ming Ma, Hang Li, Supervised Rank Aggregation, Proc. of WWW 007, 481-490. (pdf)

·         Jun Xu, Yunbo Cao, Hang Li, and Yalou Huang, Cost Sensitive Learning of SVM for Ranking, Proc. of  ECML-2006 poster, 833-840. (pdf)

·         Yunbo Cao, Jun Xu, Tie-Yan Liu, Hang Li, Yalou Huang, and Hsiao-Wuen Hon, Adapting Ranking SVM to Document Retrieval, Proc. of SIGIR 2006, 186-193. (pdf)

 

     General Search

·         Huanhuan Cao, Daxin Jiang, Jian Pei, Qi He, Zhen Liao, Enhohng Chen, Hang Li, Context-Aware Query Suggestion by Mining Click-Through and Session Data, Proc. of KDD 2008, 875-883. (pdf) SIGKDD08 Best Application Paper Award

·         Yuting Liu, Bin Gao, Tie-Yan Liu, Ying Zhang, Zhiming Ma, Shuyuan He, Hang Li, BrowseRank: Letting Users Vote for Page Importance, Proc. of SIGIR 2008, 451-458. SIGIR08 Best Student Paper Award (pdf)

·         Jiafeng Guo, Gu Xu, Hang Li, Xueqi Cheng, A Unified and Discriminative Model for Query Refinement, Proc. of SIGIR 2008, 379-386. (pdf)

·         Guoyang Shen, Bin Gao, Tie-Yan Liu, Guang Feng, Shiji Song, and Hang Li, Detecting Link Spam using Temporal Information, Prof. of ICDM-2006, 1049-1053. (pdf)

·         Min Zhao, Hang Li, Adwait Ratnaparkhi, Hsiao-Wuen Hon, and Jue Wang, Adapting Document Ranking to Users Preferences using Click-through Data, Proc. of AIRS 2006, 26-42. (pdf)

 

     Specialized Search

·         Gu Xu, Hang Li, Wei-Ying Ma, Fora: Leveraging the Power of Internet Communities for Question Answering, Proc. of WWW 2008 Workshop on QA Web. (pdf)

·         Jun Xu, Yunbo Cao, Hang Li, Nick Craswell, and Yalou Huang, Searching Documents based on Relevance and Type, Proc. of ECIR-2007, 629-636. (pdf)

·         Guoping Hu, Jingjing Liu, Yunbo Cao, Hang Li, Jian-Yun Nie, and Jianfeng Gao, A Supervised Learning Approach to Entity Search, Proc. of AIRS 2006, 54-66. (pdf)

·         Jun Xu, Yunbo Cao, Hang Li, Min Zhao, and Yalou Huang, A Supervised Learning Approach to Search of Definitions, Journal of Computer Science and Technology, 21(3), 439-449, 2006. (pdf)

·         Yunbo Cao, Jingjing Liu, Shenghua Bao, and Hang Li, Research on Expert Search at Enterprise Track of TREC 2005, Proc. of TREC 2005. (pdf)

·         Hang Li, Yunbo Cao, Jun Xu, Yunhua Hu, Shenjie Li, and Dmitriy Meyerzon, A New Approach to Intranet Search Based on Information Extraction. Proc. of CIKM 2005 industry track, 460-468. (pdf)

·         Jun Xu, Yunbo Cao, Hang Li, and Min Zhao, Ranking Definitions with Supervised Learning Method, Proc. of WWW 2005 industry track, 811-819.  (pdf)

 

     Information Extraction

·         Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shuming Shi, Yunbo Cao, Chin-Yew Lin, and Hang Li, Web Page Title Extraction and Its Application, Information Processing and Management, 43(5), 1332-1347, 2007. (pdf)

·         Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Li Teng, and Qinghua Zheng, Automatic Extraction of Titles from General Documents using Machine Learning, Information Processing and Management, 42, 1276-1293, 2006. (pdf)

·         Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Shuming Shi, Yunbo Cao, and Hang Li, Title Extraction from Bodies of HTML Documents and Its Application to Web Page Retrieval, Proc. of SIGIR 2005, 250-257.  (pdf)

·         Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, and Qinghua Zheng, Automatic Extraction of Titles from General Documents using Machine Learning, Proc. of JCDL 2005, 145-154.  (pdf)

 

Text Mining

·         Congkai Sun, Bin Gao, Zhenfu Cao, and Hang Li, HTM: A Topic Model for Hypertexts, Prof. of EMNLP 2008, to appear. (pdf)

·         Xiaonan Ji, Gu Xu, James Bailey, and Hang Li, Mining, Ranking, and Using Acronym Patterns, Prof. of APWeb-2008, 371-382. (pdf)

·         Conghui Zhu, Jie Tang, Hang Li, Hwee Tou Ng, and Tie-Jun Zhao, A Unified Tagging Approach to Text Normalization, Prof. of ACL 2007, 688-695. (pdf)

·         Shenghua Bao, Yunbo Cao, Bing Liu, Yong Yu, and Hang Li, Mining Latent Associations of Objects Using a Typed Mixture Model - A Case Study on Expert/Expertise Mining, Prof. of ICDM-2006, 803-807. (pdf)

·         Jie Tang, Hang Li, Yunbo Cao, and Zhaohui Tang, Email Data Cleaning, Proc. of SIGKDD 2005 industry track, 489-498. (pdf)

     Selected Publications (Publication List)

·         Hang Li, A Probabilistic Approach to Lexical Semantic Knowledge Acquisition and Structural Disambiguation, PhD Thesis, The University of Tokyo, 1998. (pdf)

·         Hang Li and Naoki Abe, Generalizing Case Frames Using a Thesaurus and the MDL Principle, Computational Linguistics, 24(2), pp.217-244, 1998. (pdf)

·         Hang Li, Word Clustering and Disambiguation based on Co-occurrence Data, Natural Language Engineering, 8(1), pp.25-42, 2002. (pdf)

·         Hang Li and Kenji Yamanishi,  Text Classification Using ESC-based Stochastic Decision Lists, Information Processing & Management, 38(3), pp.343-361, 2002.

·         Kenji Yamanishi and Hang Li, Mining Open Answers in Questionnaire Data, IEEE Intelligent Systems, 17(5), pp.58-63, 2002. (pdf)

·         Hang Li and Cong Li, Word Translation Disambiguation Using Bilingual Bootstrapping, Computational Linguistics, 30(1), pp.1-22, 2004. (pdf)

·         Hang Li, Yunbo Cao, Jun Xu, Yunhua Hu, Shenjie Li, and Dmitriy Meyerzon, A New Approach to Intranet Search Based on Information Extraction. Proc. of CIKM 2005 industry track, 460-468. (pdf)

·         Yunbo Cao, Jingjing Liu, Shenghua Bao, and Hang Li, Research on Expert Search at Enterprise Track of TREC 2005, Proc. of TREC 2005. (pdf)

·         Yunbo Cao, Jun Xu, Tie-Yan Liu, Hang Li, Yalou Huang, and Hsiao-Wuen Hon, Adapting Ranking SVM to Document Retrieval, Proc. of SIGIR 2006, 186-193. (pdf)

·         Jun Xu, Hang Li,  AdaRank: A Boosting Algorithm for Information Retrieval, Proc. of SIGIR 2007, 391-398. (pdf)

·         Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, Hang Li, Learning to Rank: From Pairwise Approach to Listwise Approach, Proc. of ICML 2007, 129-136. (pdf)

·         Fen Xia, Tie-Yan Liu, Jue Wang, Wensheng Zhang, Hang Li, Listwise Approach to Learning to Rank Theory and Algorithm, Proc. of ICML 2008, 1192-1199.  (pdf)

·         Huanhuan Cao, Daxin Jiang, Jian Pei, Qi He, Zhen Liao, Enhohng Chen, Hang Li, Context-Aware Query Suggestion by Mining Click-Through and Session Data, Proc. of KDD 2008, 875-883. (pdf)

·         Yuting Liu, Bin Gao, Tie-Yan Liu, Ying Zhang, Zhiming Ma, Shuyuan He, Hang Li, BrowseRank: Letting Users Vote for Page Importance, Proc. of SIGIR 2008, 451-458. (pdf)

    Recent Academic Activities

·         Associate editor of ACM Transaction on Asian Language Information Processing.

·         Editorial board members of Journal of Computer Science & Technology, Computational Linguistics & Chinese Language Processing, Journal of Chinese Information Processing.

·         Program co-chairs of PAKDD'07 and AIRS08.

·         Poster and demo co-chair of SIGIR08.

·         Area chairs or program committee members of ACL'04, COLING'04, IJCNLP'04, EMNLP'04, CoNLL'04, IJCAI'05, ACL'05, CoNLL'05, AIRS'05, ACL'06, PRICAI'06, AIRS'06, ICCPOL'06, CIKM'06, NIPS'06, SDM'07, WWW'07, NAACL-HLT'07, SIGIR07, EMNLP07, CIKM07, IJCNLP08, PAKDD08, WWW08, ACL08, KDD08, COLING08, CIKM08, EMNLP08, ECML/PKDD08.

    Product Developments

·         NEC TopicScope (previously SurveyAnalyzer)

·         Microsoft TextMiner (internal tool)

·         Microsoft SQL Server 2005 Text Mining

·         Microsoft SharePoint Searver 2007 Search

 

      Last updated date: 9/24/2008, by Hang Li