Hang Li

Current Position

Senior researcher and research manager of Web Search and Mining Group at Microsoft Research Asia.

Microsoft Research Asia
No. 5 Danling Street, Haidian District
Beijing, China 100080
Email: hangli@microsoft.com
Tel: (86-10)59173177
Fax: (86-10)88097306

Education and Work History

1977/9 – 1982/8 Jianshe High School, Xi’an, China

1982/9 – 1983/2 Xi’an Jiaotong University, Xi’an, China

1983/4 – 1984/2 Preparatory School for Chinese Students to Japan, Changchun, China

1984/4 – 1988/3 Kyoto University, Japan, Bachelor in Electrical and Electronics Engineering

1988/4 – 1990/3 Kyoto University, Japan, Master Degree in Electrical and Electronics Engineering, Supervisor: Prof. Makoto Nagao

1994/4 – 1998/7 University of Tokyo, Japan, Ph.D in Computer Science, Supervisor: Prof. Jun'ichi Tsujii

1990/4 – 2001/5 NEC Research Laboratories, Japan

2001/6 – present Microsoft Research Asia

Professional Duties

  • Adjunct professors of Peking University, Nanjing University, Nankai University, Xi’an Jiaotong University.
  • IEEE Senior Member, ACM Member, ACL Member, CCF Member.
  • Associate editor of ACM Transaction on Asian Language Information Processing (2007-2010).
  • Subject area editor of Journal of Computer Science & Technology (2003-present).
  • Editorial board member of Journal of the American Society for Information Science and Technology (2010-present).
  • Editorial board member of Computational Linguistics & Chinese Language Processing (2004-present).
  • Editorial board member of Journal of Chinese Information Processing (2007-present).
  • 2013: senior program committee member of WSDM'13.
  • 2012: track co-chair of the web search track of WWW'12; senior program committee members or area chairs of WSDM'12, KDD'12, CIKM'12, ACML'12, AIRS'12; co-chair of KDD'12 summer school; program committee members of ACL'12, NAACL'12, SIGIR'12, ICDM'12, NIPS'12.
  • 2011: program committee co-chair of WSDM'11; finance chair of SIGIR'11; area chairs of SIGIR'11, AAAI'11, NIPS'11; program committee members of WWW'11, ACL'11, KDD'11, ICDM'11, EMNLP'11, ACML'11; co-organizer of SIGIR'11 workshop on Query Representation and Understanding.
  • 2010: program committee co-chair of EMNLP'10, senior program committee members of WSDM'10, KDD'10 and SIGIR'10; area chairs of ACL'10 and ACML'10; program committee members of WWW'10, ICDM'10, and COLING'10; co-organizer of SIGIR'10 workshop on Query Representation and Understanding.
  • 2009: publicity chair of KDD’09; area chairs of EMNLP'09 and ACML'09; program committee members of WWW’09, ACL’09, NAACL-HLT’09, SIGIR’09, CIKM’09, ICDM’09; co-organizer of SIGIR'09 workshop on Learning to Rank for Information Retrieval; co-editor of special issue on Learning to Rank for Information Retrieval at Information Retrieval Journal; co-editor of special issue on Machine Learning and Applications at Journal of Computer Science & Technology.
  • 2008: program committee co-chair of AIRS’08; poster and demo co-chair of SIGIR’08; senior program committee member of CIKM'08; area chair of EMNLP'08, program committee members of IJCNLP’08, PAKDD’08, WWW’08, ACL’08, KDD’08, COLING’08, ECML/PKDD’08; co-organizer of SIGIR'08 workshop on Learning to Rank for Information Retrieval.
  • 2007: program committee co-chair of PAKDD'07; advisory board member of IJCAI’07; program committee members of SDM'07, WWW'07, NAACL-HLT'07, SIGIR’07, EMNLP’07, CIKM’07; co-organizer of SIGIR'07 workshop on Learning to Rank for Information Retrieval.
  • 2006: program committee members of ACL'06, PRICAI'06, AIRS'06, ICCPOL'06, CIKM'06, NIPS'06.
  • 2005: area chair of ACL'05; program committee members of IJCAI'05, CoNLL'05, AIRS'05.
  • 2004: program committee members of ACL'04, COLING'04, IJCNLP'04, EMNLP'04, CoNLL'04.

Awards

  • AIRS'10 Best Paper Award
  • SIGIR’08 Best Student Paper Award
  • SIGKDD’08 Best Application Paper Award
  • Microsoft Star Award 2005
  • Microsoft Star Award 2007

Invited Talks and Tutorials

  1. Hang Li, Jun Xu, WWW 2012 Tutorial: Enhancing Search Relevance -Machine Learning Techniques for Better Matching of Query and Document, Lyon, 2012.
  2. Hang Li, Jun Xu, WSDM 2012 Tutorial: Machine Learning for Query-Document Matching in Search, Seattle, 2012.
  3. Hang Li, ICONIP 2011 Tutorial: Learning to Rank, Shanghai, 2011.
  4. Hang Li, Invited Talk at the Third Pao-Lu Hsu Statistics Conference: Regularized Semantic Indexing, Zhejing University, 2011.
  5. Daxin Jiang, Jian Pei, Hang Li, SIGIR 2011 Tutorial: Enhancing Web Search by Mining Search and Browse Logs, Beijing, 2010.
  6. Daxin Jiang, Jian Pei, Hang Li,  SIGIR 2010 Tutorial: Enhancing Web Search by Mining Search and Browse Logs, Geneva, 2010.
  7. Daxin Jiang, Jian Pei, Hang Li, WWW 2010 Tutorial: Web Search/Browse Log Mining: Challenges, Methods, and Applications, Raleigh, 2010.
  8. Hang Li,  ACML 2009 Tutorial: Learning to Rank, Nanjing, 2009.
  9. Hang Li, ACL 2009 Tutorial: Learning to Rank, Singapore, 2009.
  10. Hang Li, Invited Talk at the First Pao-Lu Hsu Statistics Conference: AdaRank, A Boosting Algorithm for Information Retrieval, Peking University, 2007.
  11. Hang Li, ISCSLP 2006 Tutorial: Text Information Extraction and Retrieval, Singapore, 2006.

Conference Papers

  1. Yunhua Hu, Yanan Qian, Hang Li, Daxin Jiang, Jian Pei, Qinghua Zheng, Mining Query Subtopics from Search Log Data, In Proceedings of the 35th Annual International ACM SIGIR Conference (SIGIR’12), to appear, 2012.
  2. Quan Wang, Zheng Cao, Jun Xu, Hang Li, Group Matrix Factorization for Scalable Topic Modeling, In Proceedings of the 35th Annual International ACM SIGIR Conference (SIGIR’12), to appear, 2012.
  3. Xiaobing Xue, Yu Tao, Daxin Jiang and Hang Li, Automatically Mining Question Reformulation Patterns from Search Log Data, In Proceedings of the 50th Annual Meeting of Association for Computational Linguistics (ACL’12), to appear, 2012.
  4. Fan Bu, Hang Li, Xiaoyan Zhu, String Re-Writing Kernel, In Proceedings of the 50th Annual Meeting of Association for Computational Linguistics (ACL’12), to appear, 2012.
  5. Hang Li, Gu Xu, W. Bruce Croft, Michael Bendersky, Ziqi Wang, Evelyne Viegas, QRU-1: A Public Dataset for Promoting Query Representation and Understanding Research, In Proceedings of the Workshop on Web Search Click Data, (WSCD'12), 2012.
  6. Chen Wang, Keping Bi, Yunhua Hu, Hang Li, and Guihong Cao. Extracting Search-Focused Key N-Grams for Relevance Ranking in Web Search. In Proceedings of the 3rd ACM International Conference on Web Search and Data Mining (WSDM'12), 343-352, 2012.
  7. Bin Gao, Tie-Yan Liu, Wei Wei, Taifeng Wang, and Hang Li, Semi-Supervised Ranking on Very Large Graph with Rich Metadata, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), 96-104, 2011.
  8. Wu Wei, Hang Li, Yunhua Hu, and Rong Jin, Multi-task Learning in Square Integrable Space, Proceedings of the 25th Conference on Artificial Intelligence (AAAI'11), 2011.
  9. Quan Wang, Jun Xu, Hang Li, Nick Craswell, Regularized Latent Semantic Indexing, In Proceedings of the 34th Annual International ACM SIGIR Conference (SIGIR’11), 685-694, 2011.
  10. Ziqi Wang, Gu Xu, Hang Li and Ming Zhang, A Fast and Accurate Method for Approximate String Search, In Proceedings of the 49th Annual Meeting of Association for Computational Linguistics: Human Language Technologies (ACL-HLT’11), 52-61, 2011.
  11. Jun Xu, Wei Wu, Hang Li, Gu Xu, A Kernel Approach to Addressing Term Mismatch, In Proceedings of the 20th International World Wide Web Conference (WWW’11), poster, 153-154, 2011.
  12. Jun Xu, Hang Li, Chaoliang Zhong, Relevance Ranking Using Kernels, In Proceedings of the 6th Asian Information Retrieval Societies Symposium (AIRS'10), Best Paper Award, 1-12, 2010.
  13. Biao Xiang, Daxin Jiang, Jian Pei, Xiaohui Sun, Enhong Chen, Hang Li, Context-Aware Ranking in Web Search. In Proceedings of the 33rd Annual International ACM SIGIR Conference (SIGIR’10), 451-458, 2010.
  14. Jingfang Xu, Chuanliang Chen, Gu Xu, Hang Li, Elbio Abib, Improving Quality of Training Data for Learning to Rank Using Click-Through Data. Proceedings of the 3rd ACM International Conference on Web Search and Data Mining (WSDM'10), 171-180, 2010.
  15. Wei Chen, Tie-Yan Liu, Yanyan Lan, Zhiming Ma, Hang Li, Ranking Measures and Loss Functions in Learning to Rank. In Advances in Neural Information Processing Systems 22 (NIPS’09), to appear.
  16. Fen Xia, Tie-Yan Liu, Hang Li, Statistical Consistency of Top-k Ranking. In Advances in Neural Information Processing Systems 22 (NIPS’09), to appear.
  17. Bin Gao, Tie-Yan Liu, Zhiming Ma, Taifeng Wang, Hang Li, A General Markov Framework for Page Importance Computation. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM’09, short paper), 1835-1838, 2009.
  18. Jiafeng Guo, Gu Xu, Xueqi Cheng, Hang Li, Named Entity Recognition in Query. In Proceedings of the 32nd Annual International ACM SIGIR Conference (SIGIR’09), 267-274.
  19. Xin Jiang, Yunhua Hu, Hang Li, A Ranking Approach to Keyphrase Extraction. In Proceedings of the 32nd Annual International ACM SIGIR Conference (SIGIR’09, poster), 267-274.
  20. Gu Xu, Shuanghong Yang, Hang Li, Named Entity Mining from Click-Through Log Using Weakly Supervised Latent Dirichlet Allocation. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), 1365-1374.
  21. Bin Zhou, Daxin Jiang, Jian Pei, Hang Li, OLAP on Search Logs: An Infrastructure Supporting Data-Driven Applications in Search Engines. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), 1395-1404.
  22. Yanyan Lan, Tie-Yan Liu, Zhi-Ming Ma, Hang Li. Generalization Analysis of Listwise Learning to Rank Algorithms. In Proceedings of the 26th International Conference on Machine Learning (ICML’09), 577-584.
  23. Huanhuan Cao, Daxin Jiang, Jian Pei, Enhong Chen, Hang Li, Towards Context-aware Search by Learning a Very Large Variable Length Hidden Markov Model from Search Logs. In Proceedings of the 18th World Wide Web Conference (WWW'09), 191-200, 2009.
  24. Qi He, Daxin Jiang, Zhen Liao, Steven C. H. Hoi, Kuiyu Chang, Ee-Peng Lim, Hang Li. Web Query Recommendation via Sequential Query Prediction. In Proceedings of the 25th International Conference on Data Engineering (ICDE'09), pages 1443-1454, 2009.
  25. Tao Qin, Tie-Yan Liu, Xu-Dong Zhang, De-Sheng Wang, Hang Li. Global Ranking Using Continuous Conditional Random Fields. In Advances in Neural Information Processing Systems 21 (NIPS’09), 1281-1288, 2009.
  26. Yuting Liu, Bin Gao, Tie-Yan Liu, Ying Zhang, Zhiming Ma, Shuyuan He, Hang Li. BrowseRank: Letting Users Vote for Page Importance, In Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR’08), pages 451-458, 2008. (SIGIR’08 Best Student Paper Award)
  27. Huanhuan Cao, Daxin Jiang, Jian Pei, Qi He, Zhen Liao, Enhohng Chen, Hang Li. Context-Aware Query Suggestion by Mining Click-Through and Session Data, In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), 875-883, 2008. (SIGKDD’08 Best Application Paper Award)
  28. Jun Xu, Tie-Yan Liu, Min Lu, Hang Li, Wei-Ying Ma. Directly Optimizing Evaluation Measures in Learning to Rank. In Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR’08), 107-114, 2008.
  29. Xiubo Geng, Tie-Yan Liu, Tao Qin, Andrew Arnold, Hang Li, Heung-Yeung Shum. Query Dependent Ranking with K Nearest Neighbor. In Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR’08), 115-122, 2008.
  30. Jiafeng Guo, Gu Xu, Hang Li, Xueqi Cheng. A Unified and Discriminative Model for Query Refinement. In Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR’08), 379-386, 2008.
  31. Tao Qin, Tie-Yan Liu, Xu-Dong Zhang, De-Sheng Wang, Wen-Ying Xiong, and Hang Li. Learning to Rank Relational Objects and Its Application to Web Search. In Proceedings of the 17th International World Wide Web Conference (WWW’08), 407-416, 2008.
  32. Rong Jin, Hamed Valizadegan, and Hang Li. Ranking Refinement and Its Application to Information Retrieval. In Proceedings of the 17th International World Wide Web Conference (WWW’08), 397-406, 2008.
  33. Gu Xu, Hang Li, and Wei-Ying Ma. Fora: Leveraging the Power of Internet Communities for Question Answering. In Proceedings of the 1st International Workshop on Question Answering on the Web (QAWeb’08), 2008.
  34. Xiaonan Ji, Gu Xu, James Bailey, and Hang Li. Mining, Ranking, and Using Acronym Patterns. In Proceedings of the 10th Asia Pacific Web Conference (APWeb’08), 371-382, 2008.
  35. Congkai Sun, Bin Gao, Zhenfu Cao, and Hang Li, HTM: A Topic Model for Hypertexts, In Proceedings of the 2008 conference on Empirical Methods in Natural Language Processing (EMNLP’08), 514-522, 2008.
  36. Fen Xia, Tie-Yan Liu, Jue Wang, Wensheng Zhang, Hang Li. Listwise Approach to Learning to Rank –Theory and Algorithm, In Proceedings of the 25th International Conference on Machine Learning (ICML’08), 1192-1199, 2008.
  37. Yanyan Lan, Tie-Yan Liu, Tao Qin, Zhiming Ma, Hang Li. Query Level Stability and Generalization in Learning to Rank, In Proceedings of the 25th International Conference on Machine Learning (ICML’08), 512-519, 2008.
  38. Jun Xu, Yunbo Cao, Hang Li, Nick Craswell, and Yalou Huang. Searching Documents based on Relevance and Type. In Proceedings of the 29th European Conference on Information Retrieval (ECIR’07), 629-636, 2007.
  39. Yu-Ting Liu, Tie-Yan Liu, Tao Qin, Zhi-Ming Ma, and Hang Li. Supervised Rank Aggregation. In Proceedings of the 16th International World Wide Web Conference (WWW’07), 481-490, 2007.
  40. Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li. Learning to Rank: From Pairwise Approach to Listwise Approach. In Proceedings of the 24th International Conference on Machine Learning (ICML’07), 129-136, 2007.
  41. Tie-Yan Liu, Jun Xu, Tao Qin, Wenying Xiong, and Hang Li, LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval. In Proceedings of First SIGIR Workshop on Learning to Rank for Information Retrieval, 2007.
  42. Tao Qin, Tie-Yan Liu, Wei Lai, Xu-Dong Zhang, De-Sheng Wang, and Hang Li. Ranking with Multiple Hyperplanes. In Proceedings of the 30th Annual International ACM SIGIR Conference (SIGIR’07), 279-286, 2007.
  43. Jun Xu and Hang Li. AdaRank: A Boosting Algorithm for Information Retrieval. In Proceedings of the 30th Annual International ACM SIGIR Conference (SIGIR’07), 391-398, 2007.
  44. Xiuobo Geng, Tie-Yan Liu, Tao Qin, and Hang Li. Feature Selection for Ranking. In Proceedings of the 30th Annual International ACM SIGIR Conference (SIGIR’07), 407-414, 2007.
  45. Tie-Yan Liu, Jun Xu, Tao Qin, Wenying Xiong, and Hang Li. LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval. In Proceedings of SIGIR 2007 Workshop on Learning to Rank for Information Retrieval, 2007.
  46. Yunbo Cao, Jun Xu, Tie-Yan Liu, Hang Li, Yalou Huang, Hsiao-Wuen Hon. Adapting Ranking SVM to Document Retrieval. In Proceedings of the 29th Annual International ACM SIGIR Conference (SIGIR’06), 186-193, 2006.
  47. Min Zhao, Hang Li, Adwait Ratnaparkhi, Hsiao-Wuen Hon, and Jue Wang. Adapting Document Ranking to Users Preferences using Click-through Data. In Proceedings of Asian Information Retrieval Symposium 2006, 26-42, 2006.
  48. Guoping Hu, Jingjing Liu, Yunbo Cao, Hang Li, Jian-Yun Nie, and Jianfeng Gao. A Supervised Learning Approach to Entity Search. In Proceedings of Asian Information Retrieval Symposium 2006, 54-66, 2006.
  49. Jun Xu, Yunbo Cao, Hang Li, and Yalou Huang. Cost Sensitive Learning of SVM for Ranking. In Proceedings of the 17th European Conference on Machine Learning (ECML’06), 833-840, 2006.
  50. Shenghua Bao, Yunbo Cao, Bing Liu, Yong Yu, and Hang Li. Mining Latent Associations of Objects Using a Typed Mixture Model - A Case Study on Expert/Expertise Mining. In Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM’06), 803-807, 2006.
  51. Guoyang Shen, Bin Gao, Tie-Yan Liu, Guang Feng, Shiji Song, and Hang Li. Detecting Link Spam using Temporal Information. In Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM’2006, short paper), 1049-1053, 2006.
  52. Jun Xu, Yunbo Cao, Hang Li, and Min Zhao. Ranking Definitions with Supervised Learning Method. In Proceedings of the 14th World Wide Web Conference (WWW’05), 811-819, 2005.
  53. Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, and Qinghua Zheng. Automatic Extraction of Titles from General Documents using Machine Learning. In Proceedings of the 14th Joint Conference on Digital Library 2005 (JCDL’05), 145-154, 2005.
  54. Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Shuming Shi, Yunbo Cao, and Hang Li. Title Extraction from Bodies of HTML Documents and Its Application to Web Page Retrieval. In Proceedings of the 28th Annual International ACM SIGIR Conference (SIGIR’05), 250-257, 2005.
  55. Jie Tang, Hang Li, Yunbo Cao, and Zhaohui Tang. Email Data Cleaning. In Proceedings of the 11th ACM KDD International Conference on Knowledge Discovery and Data Mining 2005 (KDD’05), 489-498, 2005.
  56. Hang Li, Yunbo Cao, Jun Xu, Yunhua Hu, Shenjie Li, and Dmitriy Meyerzon. A New Approach to Intranet Search Based on Information Extraction. In Proceedings of the 14th ACM Conference on Information and Knowledge Management (CIKM’05), industry track, 460-468, 2005.
  57. Yunbo Cao, Jingjing Liu, Shenghua Bao, and Hang Li. Research on Expert Search at Enterprise Track of TREC 2005. In Proceedings of the 14th Text Retrieval Conference (TREC'05), 2005.
  58. Cong Li, J-Rong Wen, and Hang Li. Text Classification Using Stochastic Keyword Generation. In Proceedings of the 20th International Conference on Machine Learning (ICML’03), 464-471, 2003.
  59. Yunbo Cao, Hang Li, and Li Lian. Uncertainty Reduction in Collaborative Bootstrapping: Measure and Algorithm. In Proceedings of the 41st Annual Meeting of Association for Computational Linguistics (ACL’03), 327-334, 2003.
  60. Jianfeng Gao, Joshua T. Goodman, Guihong Cao, and Hang Li. Exploring Asymmetric Clustering for Statistical Language Modeling. In Proceedings of the 40th Annual Meeting of Association for Computational Linguistics (ACL’02), 183-190, 2002.
  61. Cong Li and Hang Li. Word Translation Disambiguation Using Bilingual Bootstrapping. In Proceedings of the 40th Annual Meeting of Association for Computational Linguistics (ACL’02), 343-351, 2002.
  62. Yunbo Cao and Hang Li. Base Noun Phrase Translation Using Web Data and the EM Algorithm. In Proceedings of the 19th International Conference on Computational Linguistics (COLING’02), 127-133, 2002.
  63. Hang Li and Kenji Yamanishi. Mining from Open Answers in Questionnaire Data. In Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’01), 443-449, 2001.
  64. Hang Li and Kenji Yamanishi. Topic Analysis Using a Finite Mixture Model. Proceedings of Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large (EMNLP-VLC’00), 35-44, 2000.
  65. Hang Li and Kenji Yamanishi. Text Classification Using ESC-based Stochastic Decision Lists. Proceedings of the 8th ACM International Conference on Information and Knowledge Management (CIKM’99), 122-130, 1999.
  66. Hang Li and Naoki Abe. Word Clustering and Disambiguation based on Co-occurrence Data. In Proceedings of the 18th International Conference on Computational Linguistics and the 36th Annual Meeting of Association for Computational Linguistics (COLING-ACL’98), 749-755, 1998.
  67. Hang Li and Kenji Yamanishi. Document Classification Using a Finite Mixture Model. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL’97), 39-47, 1997.
  68. Naoki Abe and Hang Li. Learning Word Association Norms Using Tree Cut Pair Models. In Proceedings of the 13th International Conference on Machine Learning (ICML’96), 3-11, 1996.
  69. Hang Li and Naoki Abe. Clustering Words with the MDL Principle. In Proceedings of the 16th International Conference on Computational Linguistics (COLING’96), 5-9, 1996.
  70. Hang Li and Naoki Abe. Learning Dependencies between Case Frame Slots. In Proceedings of the 16th International Conference on Computational Linguistics (COLING’96), 10-15, 1996.
  71. Hang Li. A Probabilistic Disambiguation Method based on Psycholinguistic Principles. In Proceedings of the 4th Workshop on Very Large Corpora (VLC’96), 141-154, 1996.
  72. Naoki Abe, Hang Li, and Atsuyoshi Nakamura. On-line Learning of Binary Lexical Relations Using Two-dimensional Weighted Majority Algorithms. In Proceedings of the 12th International Conference on Machine Learning (ICML’95), 3-11, 1995.
  73. Hang Li and Naoki Abe. Generalizing Case Frames Using a Thesaurus and the MDL Principle. In Proceedings of Recent Advances in Natural Language Processing (RANLP’95), 230-248, 1995.
  74. John A. Bateman and Hang Li. The Application of Systemic-functional Grammar to Japanese and Chinese for Use in Text Generation. In Proceedings of the 1988 International Conference on Computer Processing of Chinese and Oriental Languages, 443-447, 1988.

Journal Papers

  1. Bin Gao, Tie-Yan Liu, Yuting Liu, Taifeng Wang, Zhiming Ma, Hang Li: Page Importance Computation based on Markov Processes, Information Retrieval Journal, 14(5): 488-514, 2011.
  2. Hang Li, A Short Introduction to Learning to Rank, IEICE Transactions on Information and Systems, E94-D(10), 2011.
  3. Zhen Liao, Daxin Jiang, Enhong Chen, Jian Pei, Huanhuan Cao, Hang Li, Mining Concept Sequences from Large-Scale Search Logs for Context-Aware Query Suggestion, ACM Transactions on Intelligent Systems and Technology, 3(1), 2011.
  4. Wei Wu, Jun Xu, Hang Li, and Satoshi Oyama, Learning A Robust Relevance Model for Search Using Kernel Methods, Journal of Machine Learning Research, 12, 1429-1458. 2011.
  5. Tao Qin, Tie-Yan Liu, Jun Xu, and Hang Li. LETOR: A Benchmark Collection for Research on Learning to Rank for Information Retrieval, Information Retrieval Journal, 13(4):346-374, Springer, 2010.
  6. Tao Qin, Tie-Yan Liu, and Hang Li. A General Approximation Framework for Direct Optimization of Information Retrieval Measures, Information Retrieval Journal, 13(4):375-397, Springer, 2010.
  7. Yuting Liu, Tie-Yan Liu, Zhiming Ma, and Hang Li. A Framework to Compute Page Importance based on User Behaviors, Information Retrieval Journal, to appear.
  8. Ming Li, Hang Li, Zhi-Hua Zhou. Semi-Supervised Document Retrieval. Information Processing and Management. 45:341-355, 2009.
  9. Tao Qin, Xu-Dong Zhang, Ming-Feng Tsai, De-Sheng Wang, Tie-Yan Liu, and Hang Li. Query-level Loss Functions for Information Retrieval. Information Processing and Management, 44:838-855, Elsevier Science, 2008.
  10. Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shuming Shi, Yunbo Cao, Chin-Yew Lin, and Hang Li. Web Page Title Extraction and Its Application. Information Processing & Management, 43(5):1332-1347, Elsevier Science, 2007.
  11. Jun Xu, Yunbo Cao, Hang Li, Min Zhao, and Yalou Huang. A Supervised Learning Approach to Search of Definitions. Journal of Computer Science and Technology, 21(3):439-449, Springer, 2006.
  12. Yunhua Hu, Hang Li, Yunbo Cao, Li Teng, Dmitriy Meyerzon, and Qinghua Zheng. Automatic Extraction of Titles from General Documents using Machine Learning. Information Processing & Management, 42(5):1276-1293, Elsevier Science, 2006.
  13. Hang Li and Cong Li. Word Translation Disambiguation Using Bilingual Bootstrapping. Computational Linguistics, 30(1):1-22, MIT Press, 2004.
  14. Hang Li and Kenji Yamanishi. Topic Analysis Using a Finite Mixture Model. Information Processing & Management, 39(4):521-541, Elsevier Science, 2003.
  15. Hang Li, Yunbo Cao, and Cong Li. Using Bilingual Web Data to Mine and Rank Translations. IEEE Intelligent Systems, 39(4):54-59, 2003.
  16. Hang Li and Kenji Yamanishi. Text Classification Using ESC-based Stochastic Decision Lists. Information Processing & Management, 38(3):343-361, Elsevier Science, 2002.
  17. Hang Li. Word Clustering and Disambiguation based on Co-occurrence Data. Natural Language Engineering, 8(1):25-42, Cambridge University Press, 2002.
  18. Kenji Yamanishi and Hang Li. Mining Open Answers in Questionnaire Data. IEEE Intelligent Systems, 17(5):58-63, 2002.
  19. Hang Li and Naoki Abe. Learning Dependencies between Case Frame Slots. Computational Linguistics, 25(3):283-291, MIT Press, 1999.
  20. Hang Li and Naoki Abe. Generalizing Case Frames Using a Thesaurus and the MDL Principle. Computational Linguistics, 24(2):217-244, MIT Press, 1998.
  21. Hang Li and Naoki Abe. Clustering Words with the MDL Principle. Journal of Natural Language Processing, 4(2):71-88, The Natural Language Processing Society of Japan, 1997.
  22. Hang Li. A Probabilistic Disambiguation Method based on Psycholinguistic Principles . Computer Software, 13(6):53-65, Iwanami Shoten, 1996. (in Japanese)

Articles

  1. Irwin King, Hang Li: A Report on the Fourth ACM International Conference on Web Search and Data Mining (WSDM 2011). SIGKDD Explorations 13(1): 52-53, 2011.
  2. Hang Li. ACM International Conference on Web Search and Data Mining (WSDM 2011), Newsletter of China Computer Federation, 7(5):69-71, 2011. (in Chinese)
  3. W. Bruce Croft, Michael Bendersky, Hang Li, Gu Xu. Query Understanding and Representation. SIGIR Forum 44(2): 48-53, 2010.
  4. Hang Li, Tie-Yan Liu, ChengXiang Zhai: Learning to rank for information retrieval (LR4IR 2009). SIGIR Forum 43(2): 41-45, 2009.
  5. Hang Li, Tie-Yan Liu, ChengXiang Zhai: Learning to rank for information retrieval (LR4IR 2008). SIGIR Forum 42(2): 76-79, 2008.
  6. Thorsten Joachims, Hang Li, Tie-Yan Liu, ChengXiang Zhai: Learning to rank for information retrieval (LR4IR 2007). SIGIR Forum 41(2):58-62, 2007.
  7. Hang Li. Text Mining. Machine Learning and Its Applications, Tsinghua University Press, 2005. (in Chinese)
  8. Hang Li. Machine Learning and Natural Language Processing. Issues in Chinese Information Processing, Science Press 2003. (in Chinese)
  9. Hang Li. Introduction to Model Selection - Using Natural Language Processing Problems as Examples. IPSJ Magazine, 42(1), The Information Processing Society of Japan, 2001. (in Japanese)
  10. Hang Li. Text Classification Using Machine Learning Techniques. Journal of SICE, 38(7):456-460, The Society of Instrument and Control Engineers, 1999. (in Japanese)
  11. I. Dan Melamed and Hang Li. Review of Ambiguity Resolution in Language Learning: Computational and Cognitive Models. by Hinrich Schütze, Computational Linguistics, 25(3):436-439, MIT Press, 1998.

Books

  1. Hang Li, Learning to Rank for Information Retrieval and Natural Language Processing, Synthesis Lectures on Human Language Technology, Lecture 12, Morgan & Claypool Publishers, 2011.

  2. Hang Li, Statistical Learning Methods (in Chinese), Tsinghua University Press, 2012, ISBN 978-7-302-27595-4.

Product Developments

  • Microsoft SharePoint Search 2010 metadata extraction and relevance ranking
  • Microsoft Bing 2009 data mining and ranking
  • Microsoft Live Search 2008 data mining and ranking
  • Microsoft SharePoint Server 2007 metadata extraction
  • Microsoft SQL Server 2005 text mining
  • Microsoft TextMiner (internal tool)
  • NEC TopicScope (previously SurveyAnalyzer)

Granted US Patents

  1. Hang Li, Tie-Yan, Tao Qin, Multi-Ranker for Search, No. 8122015, February 21, 2012.
  2. Hang Li, Bin Gao, Tie-Yan Liu, Yuting Liu, Calculating Web Page Importance Based on Web Behavior Model, No. 8103599, Janueary 24, 2012.
  3. Hang Li, jointly with Qing Yu andd Jun Xu, Topics in Relevance Ranking Model for Web Search, No. 8065310, November 22, 2011.
  4. Hang Li, jointly with Tie-Yan Liu, Lei Qi and Bin Gao, Calculating Global Importance of Documents based on Global Hitting Times, No. 7930303, April 19, 2011.
  5. Hang Li, jointly with Yunbo Cao and Jun Xu, Ranking and Accessing Definitions of Terms, No. 7877383, January 25, 2011.
  6. Hang Li, jointly with Bin Gao, Tie-Yan Liu, and Lei Yang, Anti-Spam Tool for Brower, No. 7860971, December 28, 2010.
  7. Hang Li, jointly with Tie-Yan Liu, Xiubo Geng, and Tao Qin,  Feature Selection for Ranking, No. 7853599, December 14, 2010.
  8. Hang Li, jointly with Yunbo Cao, Mining Latent Association of Objects Using a Typed Mixture Model, No. 7849097, December 7, 2010.
  9. Hang Li, jointly with Tie-Yan Liu and Yu-Ting Liu, Supervised Rank Aggregation based on Rankings, No. 7840522, November 23, 2010.
  10. Hang Li, jointly with Jianfeng Gao and Yunbo Cao, Training a Ranking Component, No. 7783629, August 24, 2010.
  11. Hang Li, jointly with Tie-Yan Liu, Tao Qin and Zhe Cao, Listwise Ranking. No. 7734633, June 8, 2010.
  12. Hang Li, jointly with Dmitriy Meyerzon, Ranking Search Results Using Feature Extraction. No. 7716198, May 11, 2010.
  13. Hang Li, jointly with Jianfeng Gao and Yunbo Cao, Factoid based Searching. No. 7707204, April 27, 2010.
  14. Hang Li, jointly with Tie-Yan Liu. Active Spam Testing System. No. 7680851, March 16, 2010.
  15. Hang Li, jointly with Tie-Yan Liu, Lei Qi, Bin Gao, and Lei Yang, Calculating Importance of Documents Factoring Historical Importance. No. 7676520, March 9, 2010.
  16. Hang Li, jointly with Yunbo Cao and Jun Xu, Search by Document Type and Relevance. No. 7644074, January 5, 2010.
  17. Hang Li, jointly with Jun Xu, Yunbo Cao, and Tie-Yan Liu, Learning A Document Ranking Using A Loss Function with A Rank Pair or A Query Parameter. No. 7593934, September 22, 2009.
  18. Hang Li, jointly with Yunbo Cao and Zhaohui Tang, Electronic Mail Data Cleaning. No. 7590608, September 15, 2009.
  19. Hang Li, jointly with Yunbo Cao, Uncertainty Reduction in Collaborative Bootstrapping. No. 7512582, March 31, 2009.
  20. Hang Li, jointly with Ruihua Song, Yunbo Cao, and Dmitriy Meyerzon, Extraction of Information from Documents. No. 7469251, December 23, 2008.
  21. Hang Li, jointly with Yunbo Cao, Olivier Ribet and Benjamin Martin, Text Mining Apparatus and Associated Methods. No. 7461056, December 2, 2008.
  22. Hang Li, Method and Apparatus for Identifying Translations. No. 7346487, March 18, 2008.
  23. Hang Li, Method and Apparatus for Training a Translation Disambiguation Classifier. No. 7318022, January 8, 2008.
  24. Hang Li, jointly with Yunbo Cao, Learning and Using Generalized String Patterns for Information Extraction. No. 7299228, November 20, 2007.
  25. Hang Li, jointly with Yunbo Cao, Method and Apparatus for Browsing Document Content. No. 7284006, October 16, 2007.
  26. Hang Li, jointly with Cong Li and Ji-Rong Wen. Data Classification Using Stochastic Key Feature Generation, No. 7209908, April 24, 2007.