Bin Gao

Bin Gao is a Lead Researcher in the Internet Economics and Computational Advertising Group (IECA) at Microsoft Research Asia (MSRA). His research interests include machine learning, data mining, information retrieval, and computational advertising. He has published more than 30 papers in refereed international conferences and journals, including KDD, WWW, SIGIR, WSDM, CIKM, ECML, ICDM, ACM MM, IEEE TKDE, and IRJ. He has 20 granted or pending US and international patents. Prior to joining Microsoft, he received his Ph.D. from the School of Mathematical Sciences, Peking University, where his research focused on pattern recognition and machine learning. As a Ph.D. candidate, he also worked as an intern in the Web Search and Mining (WSM) Group at MSRA for two and a half years. Before studying at Peking University, he received his bachelor's degree from the School of Mathematical and System Sciences, Shandong University. He is a member of the IEEE and the ACM.

Latest

  • I am currently focused on deep learning for text mining. Qualified BS/MS/Ph.D. students who are interested in my research are welcome to apply to the MSRA internship program by sending me their resumes.

  • I am co-organizing a WSDM workshop with Jiang Bian on Deep Learning for Web Search and Data Mining (DL-WSDM 2015). [Workshop Website]

Awards

  • MSRA FY12 Research Breakthrough Award (IECA Group)
  • SIGIR 2008 Best Student Paper Award
  • SIGKDD 2005 Student Travel Award

Tutorials

  • Bin Gao, Taifeng Wang, and Tie-Yan Liu. Large-Scale Graph Mining and Learning for Information Retrieval. A half-day tutorial in the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2012). [PDF]
  • Bin Gao, Taifeng Wang, and Tie-Yan Liu. Ranking on Large-Scale Graphs with Rich Metadata. A half-day tutorial in the 20th International World Wide Web Conference (WWW 2011). [PDF]

Workshops

  • Bin Gao and Jiang Bian. The WSDM 2015 Workshop on Deep Learning for Web Search and Data Mining (DL-WSDM 2015), in conjunction with the Eighth ACM International WSDM Conference (WSDM 2015). [Workshop Website]
  • Bin Gao, Jiang Bian, Richard Socher, and Scott Wen-tau Yih. The ICML 2014 Workshop on Knowledge-Powered Deep Learning for Text Mining (KPDLTM 2014), in conjunction with the 31st International Conference on Machine Learning (ICML 2014). [Workshop Website]
  • Bin Gao, Jun Yan, Dou Shen, and Tie-Yan Liu. The SIGIR 2013 Workshop on Internet Advertising: Theory and Practice (IATP 2013), in conjunction with the 36th Annual ACM SIGIR Conference (SIGIR 2013). [Workshop Website] [Proceedings]
  • Esin Saka, Dou Shen, Bin Gao, Jun Yan, and Ying Li. The 7th International Workshop on Data Mining for Online Advertising (ADKDD 2013), in conjunction with the 19th International Conference on Knowledge Discovery and Data Mining (SIGKDD 2013). [Workshop Website]
  • Jun Yan, Bin Gao, Zheng Chen, and Dou Shen. The 1st International Workshop on Web Entity Modeling and Applications (WEMA 2012), in conjunction with the 12th IEEE International Conference on Data Mining (ICDM 2012). [Workshop Website]

Publications

(Authors marked with * are or were interns I supervised at MSRA.)

Journal Papers

  • Ying Zhang*, Weinan Zhang*, Bin Gao, Xiaojie Yuan, and Tie-Yan Liu. Bid Keyword Suggestion in Sponsored Search based on Competitiveness and Relevance. Information Processing & Management (IPM), Volume 50, Issue 4, Pages 508–523, July 2014. [PDF]
  • Bin Gao, Tie-Yan Liu, Yuting Liu, Taifeng Wang, Zhiming Ma, and Hang Li. Page Importance Computation based on Markov Processes, Information Retrieval Journal (IRJ), DOI: 10.1007/s10791-011-9164-x, 2011. [PDF]
  • Yuting Liu, Tie-Yan Liu, Bin Gao, Zhiming Ma, and Hang Li. A Framework to Compute Page Importance based on User Behaviors, Information Retrieval Journal (IRJ), Volume 13, Number 1, pp. 22-45, DOI: 10.1007/s10791-009-9098-8, 2010. [PDF]
  • Bin Gao, Tie-Yan Liu, Qian-Sheng Cheng, Guang Feng, Tao Qin, and Wei-Ying Ma. Hierarchical Taxonomy Preparation for Text Categorization Using Consistent Bipartite Spectral Graph Copartitioning, IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 17, no. 9, pp. 1263-1273, September 2005. [PDF]
  • Jinwen Ma, Bin Gao, Yang Wang, and Qiansheng Cheng. Conjugate and Natural Gradient Rules for BYY Harmony Learning on Gaussian Mixture with Automated Model Selection, International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI), vol. 19, no. 5, pp. 701-713, 2005. [PDF]

Conference Papers

Deep Learning

  • Chang Xu, Yalong Bai, Jiang Bian, Bin Gao, Gang Wang, Xiaoguang Liu, and Tie-Yan Liu. RC-NET: A General Framework for Incorporating Knowledge into Word Representations, in the Proceedings of the 23rd ACM International Conference on Information and Knowledge Management (CIKM 2014), 2014. [PDF]
  • Jiang Bian, Bin Gao, and Tie-Yan Liu. Knowledge-Powered Deep Learning for Word Embedding, in the Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD 2014), 2014. [PDF]
  • Fei Tian, Bin Gao, and Tie-Yan Liu. Learning Deep Representations for Graph Clustering, in the Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI 2014), 2014. [PDF]
  • Fei Tian, Hanjun Dai, Jiang Bian, Bin Gao, Rui Zhang, and Tie-Yan Liu. A Scalable Probabilistic Model for Learning Multi-Prototype Word Embeddings, in the Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), 2014. [PDF]
  • Siyu Qiu*, Qing Cui*, Jiang Bian, Bin Gao, and Tie-Yan Liu. Co-learning of Word Representations and Morpheme Representations, in the Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), 2014. [PDF]

Computational Advertising

  • Haifeng Xu*, Bin Gao, Diyi Yang*, and Tie-Yan Liu. Predicting Advertiser Bidding Behaviors in Sponsored Search by Rationality Modeling, in the Proceedings of the 22nd International World Wide Web Conference (WWW 2013), 2013. [PDF]
  • Kai Hui*, Bin Gao, Ben He, and Tiejian Luo. Sponsored Search Ad Selection by Keyword Structure Analysis, in the Proceedings of the 35th European Conference on Information Retrieval (ECIR 2013), pp. 230-241, 2013. [PDF]
  • Changhao Jiang*, Min Zhang, Bin Gao, and Tie-Yan Liu. A Study on Potential Head Advertisers in Sponsored Search, in the Proceedings of the 8th Asia Information Retrieval Societies Conference (AIRS 2012), pp. 174-186, Tianjin, China, December 17-19, 2012. [PDF]
  • Weinan Zhang*, Ying Zhang*, Bin Gao, Yong Yu, Xiaojie Yuan, and Tie-Yan Liu. Joint Optimization of Bid and Budget Allocation in Sponsored Search, in the Proceedings of the 18th International Conference on Knowledge Discovery and Data Mining (KDD 2012), pp. 1177-1185, 2012. [PDF]
  • Jian Tang, Ning Liu, Jun Yan, Yelong Shen, Shaodan Guo, Bin Gao, Shuicheng Yan, and Ming Zhang. Learning to Rank Audience for Behavioral Targeting in Display Ads, in the Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM 2011), pp. 605-610, 2011. [PDF]

Data Mining

  • Zhicong Cheng*, Bin Gao, Congkai Sun*, Yanbing Jiang, and Tie-Yan Liu. Let Web Spammers Expose Themselves, in the Proceedings of the Fourth ACM International Conference on Web Search and Data Mining (WSDM 2011), pp. 525-534, 2011. [PDF]
  • Zhicong Cheng*, Bin Gao, and Tie-Yan Liu. Actively Predicting Diverse Search Intent from User Browsing Behaviors, in the Proceedings of the 19th International World Wide Web Conference (WWW 2010), pp. 221-230, 2010. [PDF]
  • Bin Gao, Tie-Yan Liu, and Wei-Ying Ma. Star-Structured High-Order Heterogeneous Data Co-clustering based on Consistent Information Theory, in the Proceedings of the Sixth International Conference on Data Mining (ICDM 2006), pp. 880-884, 2006. [PDF]
  • Bo Chen, Bin Gao, Tie-Yan Liu, Yu-Fu Chen, and Wei-Ying Ma. Fast Spectral Clustering of Data with Sequential Matrix Compression, in the Proceedings of ECML 2006, pp. 590-597, 2006. [PDF]
  • Bin Gao, Tie-Yan Liu, Xin Zheng, Qian-Sheng Cheng, and Wei-Ying Ma. Consistent Bipartite Graph Co-Partitioning for Star-Structured High-Order Heterogeneous Data Co-Clustering, in Proceedings of the 11th International Conference on Knowledge Discovery and Data Mining (KDD 2005), pp. 41-50, 2005. [PDF]
  • Bin Gao, Tie-Yan Liu, Tao Qin, Xin Zheng, Qian-Sheng Cheng, and Wei-Ying Ma. Web Image Clustering by Consistent Utilization of Visual Features and Surrounding Texts, in Proceedings of the 13th Annual ACM International Conference on Multimedia (ACM Multimedia 2005), pp. 112-121, 2005. [PDF]

Large-Scale Graph Ranking and Mining

  • Bin Gao, Tie-Yan Liu, Wei Wei*, Taifeng Wang, and Hang Li. Semi-Supervised Ranking on Very Large Graph with Rich Metadata, in the Proceedings of the 17th International Conference on Knowledge Discovery and Data Mining (KDD 2011), pp. 96-104, 2011. [PDF]
  • Bin Gao, Tie-Yan Liu, Zhiming Ma, Taifeng Wang, and Hang Li. A General Markov Framework for Page Importance Computation, in the Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM 2009), pp. 1835-1838, 2009. [PDF]
  • [SIGIR Best Student Paper Award] Yuting Liu, Bin Gao, Tie-Yan Liu, Ying Zhang*, Zhiming Ma, Shuyuan He, and Hang Li. BrowseRank: Letting Users Vote for Page Importance, in the Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR 2008), pp. 451-458, 2008. [PDF]
  • Lei Yang*, Lei Qi*, Yan-Ping Zhao, Bin Gao, and Tie-Yan Liu. Link Analysis using Time Series of Web Graphs, in the Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management (CIKM 2007), pp. 1011-1014, 2007. [PDF]
  • Guoyang Shen, Bin Gao, Tie-Yan Liu, Shiji Song, and Hang Li. Detecting Link Spam from Temporal Statistics of Websites, in the Proceedings of the Sixth International Conference on Data Mining (ICDM 2006), pp. 1049-1053, 2006. [PDF]
  • Guang Feng, Tie-Yan Liu, Xu-Dong Zhang, Tao Qin, Bin Gao, and Wei-Ying Ma. Level-Based Link Analysis, Web Technologies Research and Development, Lecture Notes in Computer Science (APWeb 2005): 7th Asia-Pacific Web Conference, Shanghai, China, March 29 - April 1, 2005. Proceedings, pp. 183-194. [PDF]

Machine Learning

  • Congkai Sun*, Bin Gao, Zhenfu Cao, and Hang Li. HTM: A Topic Model for Hypertexts, in the Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP 2008), pp. 514-522, 2008. [PDF]
  • Tie-Yan Liu, Yiming Yang, Hao Wan, Qian Zhou, Bin Gao, Hua-Jun Zeng, Zheng Chen, and Wei-Ying Ma. An Experimental Study on Large-Scale Web Categorization, in the Proceedings of the 14th International Conference on World Wide Web (WWW 2005), pp. 1106-1107, 2005. [PDF]
  • Bin Gao, Tie-Yan Liu, Qian-Sheng Cheng, and Wei-Ying Ma. A Linear Approximation Based Method for Noise-Robust and Illumination-Invariant Image Change Detection, Lecture Notes in Computer Science - Advances in Multimedia Information Processing (PCM 2004): 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30 - December 3, 2004. Proceedings, Part III, pp. 95-102. [PDF]
  • Jinwen Ma, Bin Gao, Yang Wang, and Qiansheng Cheng. Two Further Gradient BYY Learning Rules for Gaussian Mixture with Automated Model Selection, Lecture Notes in Computer Science - Intelligent Data Engineering and Automated Learning (IDEAL 2004): 5th International Conference, Exeter, UK. August 25-27, 2004. Proceedings, pp. 690-695. [PDF]

Book Chapters

  • Bin Gao and Tie-Yan Liu. Ranking on Large-scale Graphs with Rich Metadata. Machine Learning and Its Applications, Tsinghua University Press, Beijing, 2011. (ISBN 978-7-302-206853-6)
  • Tie-Yan Liu and Bin Gao. High-Order Heterogeneous Data Mining. Machine Learning and Its Applications, Tsinghua University Press, Beijing, 2007. (ISBN 978-7-302-16076-2)

Professional Activities

  • Session Chair, Search Log Analysis, SIGIR 2012
  • Program Committee, CHI 2013
  • Senior Program Committee, CIKM 2011
  • Program Committee, WWW 2014 tutorial
  • Program Committee, WWW 2011, 2012, 2013
  • Program Committee, SIGIR 2009, 2010, 2011, 2012, 2013, 2014
  • Program Committee, WSDM 2015
  • Program Committee, CIKM 2012, 2013
  • Program Committee, ACML 2014
  • Program Committee, IEEE Big Data 2013
  • Program Committee, ACML 2012, Asian Conference on Machine Learning
  • Program Committee, NIPS 2010 Workshop on Machine Learning in Online Advertising
  • Program Committee, NIPS 2010 Workshop on Machine Learning for Social Computing
  • Program Committee, KDD 2008 Workshop on Data Mining for Business Applications
  • Program Committee, ICDM 2009 Workshop on Optimization Based Methods for Emerging Data Mining Problems
  • Reviewer, IEEE Transactions on Knowledge and Data Engineering
  • Reviewer, IEEE Transactions on Multimedia
  • Reviewer, ACM Transactions on Intelligent Systems and Technology
  • Reviewer, Pattern Recognition Letters
  • Reviewer, Information Retrieval
  • Reviewer, Knowledge and Information Systems
  • Reviewer, Journal of Computational Science
  • Reviewer, Journal of Computer Science and Technology
  • Reviewer, IJCNLP 2008 (International Joint Conference on Natural Language Processing)
  • Reviewer, DEXA 2008, 2009 (International Conference on Database and Expert Systems Applications)
  • Reviewer, AIRS 2009 (Asia Information Retrieval Symposium)

Bin Gao (高斌)

Lead Researcher

Microsoft Research Asia (MSRA)

Building 2, No. 5 Dan Ling Street,

Haidian District,

Beijing, P. R. China, 100080

Tel: +86-10-5917-3461

Fax: +86-10-8286-8529

Email: bingao AT microsoft DOT com

Current Interns

  • Hongfei Xue
  • Huazheng Wang
  • Xiaoying Zhang
  • Chonglin Sun

Previous Interns

  • Runwei Qiang
  • Siyu Qiu
  • Qing Cui
  • Xiang Li
  • Donghyun Kim
  • Xuan Hu
  • Lunbo Xu
  • Diyi Yang
  • Kai Hui
  • Pingguang Yuan
  • Haifeng Xu
  • Yangwei Wu
  • Wenlin Chen
  • Changhao Jiang
  • Weinan Zhang
  • Dixin Zhang
  • Wei Wei
  • Zhicong Cheng
  • Ruoshi Yuan
  • Congkai Sun
  • Kang Ji
  • Ying Zhang
  • Lei Wu
  • Lei Yang
  • Lei Qi
Last update: 25-Sept-2014