Chao Liu
I am a researcher of the Search Quality & Cyber-Intelligence Lab of Microsoft Research. I used to work on statistical methods for software reliability, which earned me a PhD from the University of Illinois at Urbana-Champaign (UIUC). Since joining MSR, my research has shifted to, and focused on, Web search. Specifically, I have been building statistical models to interpret web users' behaviors, and leveraging them to uplift users' online experience.
Recent and Upcoming Events
- Keynote speech "When Machine Learning Meets the Web" delievered at Symposium on Search Engine and Web Mining (SEWM'10), Chengdu, China, May 15, 2010. [pptx][ppt][pdf]
- Tutorial on "Statistical Models for Web Search Clicks Log Analysis" with Fan Guo, to be presented at CIKM'09, Hong Kong, China, Nov 2., 2009. [pptx][ppt][pdf]
- "BBM: Bayesian Browsing Model from Petabyte-scale Data" was selected as one of the six “Best of KDD’09” and nominated for the best research paper award.
|
|
The invited article "Data Mining for Software Engineering" was published as a cover feature in the IEEE Computer Magazine, Aug 2009 |
Recent Professional Services
- Conference PC Member: WSDM'2011, WSDM'2010, VLDB'2010, WWW'2010, SIGIR'2010, SIGKDD'2010, AAAI'2010, etc
- Journal Reviewer: IEEE TKDE, DMKD, IEEE TSE, IEEE Software, etc
Journal Publications
- Chao Liu, Fan Guo, and Christos Faloutsos, Bayesian Browsing Model: Exact Inference of Document Relevance from Petabyte-Scale Data, in ACM Transaction on Knowledge Discovery from Data, vol. 4, pp. 19:1–19:26, Association for Computing Machinery, Inc., New York, NY, USA [Invited Submission], October 2010
- Tao Xie, Suresh Thummalapenta, David Lo, and Chao Liu, Data Mining for Software Engineering, in Computer, vol. 42, no. 8, pp. 55-62, IEEE Computer Society, Los Alamitos, CA, USA, 2009
- Chao Liu, Xiangyu Zhang, and Jiawei Han, A Systematic Study of Failure Proximity, in IEEE Transactions on Software Engineering, vol. 34, no. 6, IEEE Computer Society, Los Alamitos, CA, USA, November 2008
- David Lo, Siau-Cheng Khoo, and Chao Liu, Mining temporal rules for software maintenance, in Journal of Software Maintenance and Evolution: Research and Practice, vol. 20, no. 4, pp. 227–247, John Wiley & Sons, Inc., New York, NY, USA, July 2008
- Chao Liu, Long Fei, Xifeng Yan, Jiawei Han, and Samuel P. Midkiff, Statistical Debugging: A Hypothesis Testing-Based Approach, in IEEE Transactions on Software Engineering, vol. 32, no. 10, pp. 831-848, IEEE Computer Society, Los Alamitos, CA, USA, 2006
Conference Publications
2011
- Nathan N. Liu, Xiangrui Meng, Chao Liu, and Qiang Yang, Wisdom of the Better Few: Cold Start Recommendation via Representative based Rating Elicitation, in Proceedings of the 5th ACM Recommender Systems Conference, ACM, October 2011
- Hao Ma, Chao Liu, Irwin King, and Michael R. Lyu, Probabilistic Factor Models for Web Site Recommendation, in the 34the Annual ACM SIGIR Conference , ACM, July 2011
- Xiaoxin Yin, Wenzhao Tan, and Chao Liu, FACTO: A Fact Lookup Engine Based on Web Tables, in 20th Int’l. World Wide Web Conf. (WWW’11), March 2011
- Hao Ma, Dengyong Zhou, Chao Liu, Michael R. Lyu, and Irwin King, Recommender systems with social regularization, in Proceedings of the fourth ACM international conference on Web search and data mining, Association for Computing Machinery, Inc., New York, NY, USA, January 2011
2010
- Chao Liu, Ryen W. White, and Susan Dumais, Understanding web browsing behaviors through Weibull analysis of dwell time, in SIGIR '10: Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval, Association for Computing Machinery, Inc., New York, NY, USA, October 2010
- Chao Liu, Hung-chih Yang, Jinliang Fan, Li-Wei He, and Yi-Min Wang, Distributed Nonnegative Matrix Factorization for Web-Scale Dyadic Data Analysis on MapReduce, in WWW'10: Proceedings of the 19th International World Wide Web Conference, Association for Computing Machinery, Inc., April 2010
2009
- Chao Liu, Mei Li, and Yi-Min Wang, Post-Rank Reordering: Resolving Preference Misalignments between Search Engines and End Users, in CIKM '09: Proceedings of The 18th ACM Conference on Information and Knowledge Management, Association for Computing Machinery, Inc., November 2009
- Chao Liu, Fan Guo, and Christos Faloutsos, BBM: Bayesian Browsing Model from Petabyte-scale Data, in KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, Association for Computing Machinery, Inc., June 2009
- Fan Guo, Chao Liu, Anitha Kannan, Tom Minka, Michael Taylor, Yi-Min Wang, and Christos Faloutsos, Click Chain Model in Web Search, in WWW'09: Proceedings of the 18th International World Wide Web Conference, Association for Computing Machinery, Inc., April 2009
- Fan Guo, Chao Liu, and Yi-Min Wang, Efficient multiple-click models in web search, in WSDM'09: Proceedings of the Second ACM International Conference on Web Search and Data Mining, Association for Computing Machinery, Inc., February 2009
2008
- David Lo, Siau-Cheng Khoo, and Chao Liu, Mining past-time temporal rules from execution traces, in WODA '08: Proceedings of the 2008 international workshop on dynamic analysis, Association for Computing Machinery, Inc., New York, NY, USA, July 2008
- David Lo, Siau-Cheng Khoo, and Chao Liu, Efficient Mining of Recurrent Rules from a Sequence Database, in DASFAA'08: Proceedings of the 13th International Conference on Database Systems for Advanced Applications, Philadelphia, PA, 2008
2007
- Chao Liu, Xiangyu Zhang, Jiawei Han, Yu Zhang, and Bharat K. Bhargava, Failure Indexing: A Dynamic Slicing Based Approach, in Proceeding of the 23rd IEEE International Conference on Software Maintenance (ICSM'07), 2007
- David Lo, Siau-Cheng Khoo, and Chao Liu, Efficient mining of iterative patterns for software specification discovery, in KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, New York, NY, USA, 2007
2006
- Chao Liu, Xifeng Yan, and Jiawei Han, Mining Control Flow Abnormality for Logic Error Isolation, in Proceedings of 2006 SIAM International Conference on Data Mining (SDM'06), Bethesda, 2006
- Chao Liu and Jiawei Han, Fault-aware Fingerprinting: Towards Mutualism between Failure Investigation and Statistical Debugging, in SIGSOFT '06/FSE-14: Doctoral Symposium at the 14th ACM SIGSOFT international symposium on Foundations of software engineering, ACM Press, New York, NY, USA, 2006
- Bo Zhao and Chao Liu, Efficient SIP-Specific Event Notification, in ICNICONSMCL '06: Proceedings of the International Conference on Networking, International Conference on Systems and International Conference on Mobile Communications and Learning Technologies, IEEE Computer Society, Washington, DC, USA, 2006
- Qiaozhu Mei, Chao Liu, Hang Su, and ChengXiang Zhai, A probabilistic approach to spatiotemporal theme pattern mining on weblogs, in Proceedings of the 15th international conference on World Wide Web (WWW'06), Association for Computing Machinery, Inc., Edinburgh, Scotland, 2006
- Chao Liu, Zeng Lian, and Jiawei Han, How Bayesians Debug, in Proceedings of the 6th IEEE International Conference on Data Mining (ICDM'06), Hong Kong, China, 2006
- Chao Liu and Jiawei Han, Failure Proximity: A Fault Localization-Based Approach, in Proceedings of 14th ACM SIGSOFT International Symposium on Foundations of Software Engineering(FSE'06), Portland, OR, 2006
- Chao Liu, Chen Chen, Jiawei Han, and Philip S. Yu, GPLAG: detection of software plagiarism by program dependence graph analysis, in KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, Association for Computing Machinery, Inc., Philadelphia, PA, 2006
2005
- William Yurcik and Chao Liu, A First Step Toward Detecting SSH Identity Theft in HPC Cluster Environments: Discriminating, in Masqueraders Based on Command Behavior,” in Cluster Security (Cluster-Sec) Workshop at the 5th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid, 2005
- Chao Liu, Xifeng Yan, Hwanjo Yu, Jiawei Han, and Philip S. Yu, Mining Behavior Graphs for "Backtrace" of Noncrashing Bugs, in Proceedings of 2005 SIAM International Conference on Data Mining (SDM'05), Society for Industrial and Applied Mathematics, Newport Beach, CA, 2005
- Chao Liu, Xifeng Yan, Long Fei, Jiawei Han, and Samuel P. Midkiff, SOBER: statistical model-based bug localization, in Proceedings of the 10th European Software Engineering Conference held jointly with 13th ACM SIGSOFT International Symposium on Foundations of Software Engineering (ESEC/FSE'05), Lisbon, Portugal, 2005
2003
- Chao Liu, Ming Zhang, Minrui Zheng, and Yixin Chen, Step-by-Step Regression: A More Efficient Alternative for Polynomial Multiple Linear Regression in Stream Cube., in PAKDD, Springer, 2003
Mailing Address:
Chao Liu (99-2417)
One Microsoft Way
Redmond, WA 98052
Email: FirstnameLastname at microsoft dot com
Fax: +1-425-936-7329




