Rui Cai

Dr. Rui Cai is a Lead Researcher at Microsoft Research Asia. He received the B.E. and Ph.D. degrees in computer science from Tsinghua University, Beijing, China, in 2001 and 2006, respectively. His research interests include web search and data mining, machine learning, pattern recognition, computer vision, multimedia content analysis, and signal processing. He is a member of the Association for Computing Machinery (ACM) and the Institute of Electrical and Electronics Engineers (IEEE).

 

Projects

Website Structure Understanding and its Applications

Website structure understanding can be treated as a form of reverse engineering: automatically discovering the layout templates and URL patterns of a website, and understanding how these templates and patterns are combined to organize the site. The study of this problem has had a great impact on many applications that can leverage such site-level knowledge to help web search and data mining. (Check out more details here)

Picto: A Large-Scale Visual Indexing and Recognition System

Object image recognition is a challenging but important problem. To address it, we initiated the Picto project. Our research in this project covers three fundamental aspects of the problem: low-level image features, mid-level image representations, and indexing and recognition algorithms. We place particular emphasis on scalability and applicability in our research. (Check out more details here)

3D Object Reconstruction and Recognition

We study the problem of 3D object reconstruction and recognition. For reconstruction, we aim to develop algorithms and systems that lower the barrier to 3D reconstruction for common users. In this way, we can collect a world-class 3D object repository by leveraging crowdsourcing. For recognition, we aim to handle large-scale tasks (e.g., identifying thousands of objects) while providing real-time performance. (Check out more details here)

 

Selected Publications

  1. Chi Zhang, Zhiwei Li, Rui Cai, Hongyang Chao, and Yong Rui. "As-Rigid-As-Possible Stereo under Second Order Smoothness Priors". in Proc. of the 13th European Conference on Computer Vision (ECCV 2014), Part II, pp.112-126, Zurich, Switzerland. September 6-12, 2014.
  2. Qiang Hao, Rui Cai, Zhiwei Li, Lei Zhang, Yanwei Pang, Feng Wu, and Yong Rui. "Efficient 2D-to-3D Correspondence Filtering for Scalable 3D Object Recognition". in Proc. of the 26th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2013), pp.899-906, Portland, Oregon, USA. June 23-28, 2013. [dataset]
  3. Qiang Hao, Rui Cai, Zhiwei Li, Lei Zhang, Yanwei Pang, and Feng Wu. "3D Visual Phrases for Landmark Recognition". in Proc. of the 25th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2012), pp.3594-3601, Providence, Rhode Island, USA. June 18-20, 2012. [dataset]
  4. Wenbin Tang, Rui Cai, Zhiwei Li, and Lei Zhang. "Contextual Synonym Dictionary for Visual Object Retrieval". in Proc. of the 19th ACM International Conference on Multimedia (MM 2011), pp.503-512, Scottsdale, Arizona, USA. November 28-December 1, 2011.
  5. Qiang Hao, Rui Cai, Yanwei Pang, and Lei Zhang. "From One Tree to a Forest: a Unified Solution for Structured Web Data Extraction". in Proc. of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011), pp.775-784, Beijing, China. July 24-28, 2011. [dataset]
  6. Linkai Weng, Zhiwei Li, Rui Cai, Yaoxue Zhang, Yuezhi Zhou, Laurence T. Yang, and Lei Zhang. "Query by Document via a Decomposition-Based Two-Level Retrieval Approach". in Proc. of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011), pp.505-514, Beijing, China. July 24-28, 2011.
  7. Bing Li, Rong Xiao, Zhiwei Li, Rui Cai, Bao-Liang Lu, and Lei Zhang. "Rank-SIFT: Learning to Rank Repeatable Local Interest Points". in Proc. of the 24th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011), pp.1737-1744, Colorado Springs, USA. June 20-25, 2011. 
  8. Qiang Hao, Rui Cai, Changhu Wang, Rong Xiao, Jiang-Ming Yang, Yanwei Pang, and Lei Zhang. "Equip Tourists with Knowledge Mined from Travelogues". in Proc. of the 19th International World Wide Web Conference (WWW 2010), pp.401-410, Raleigh, NC, USA, April 26-30, 2010.
  9. Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodong Fan, and Lei Zhang. "A Pattern Tree-based Approach to Learning URL Normalization Rules". in Proc. of the 19th International World Wide Web Conference (WWW 2010), pp.611-620, Raleigh, NC, USA, April 26-30, 2010.
  10. Chen Lin, Jiang-Ming Yang, Rui Cai, Xin-Jing Wang, Wei Wang, and Lei Zhang. "Simultaneously Modeling Semantics and Structure of Threaded Discussions: A Sparse Coding Approach and Its Applications". in Proc. of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2009), pp.131-138, Boston, US, July 19-23, 2009.
  11. Jiang-Ming Yang, Rui Cai, Chunsong Wang, Hua Huang, Lei Zhang, and Wei-Ying Ma. "Incorporating Site-Level Knowledge for Incremental Crawling of Web Forums: A List-wise Strategy". in Proc. of the 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2009), pp.1375-1384, Paris, France, June 28-July 1, 2009.
  12. Xiaolin Shi, Jun Zhu, Rui Cai, and Lei Zhang. "User Grouping Behavior in Online Forums". in Proc. of the 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2009), pp.777-786, Paris, France, June 28-July 1, 2009.
  13. Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei Zhang, and Wei-Ying Ma. "Incorporating Site-Level Knowledge to Extract Structured Data from Web Forums". in Proc. of the 18th International World Wide Web Conference (WWW 2009), pp.181-190, Madrid, Spain, April 20-24, 2009.
  14. Yida Wang, Jiang-Ming Yang, Wei Lai, Rui Cai, Lei Zhang, and Wei-Ying Ma. "Exploring Traversal Strategy for Efficient Web Forum Crawling". in Proc. of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp.459-466, Singapore, July 20-24, 2008.
  15. Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, and Lei Zhang. "iRobot: An Intelligent Crawler for Web Forums". in Proc. of the 17th International World Wide Web Conference (WWW 2008), pp.447-456, Beijing, P.R. China, April 21-25, 2008.
  16. Rui Cai, Lie Lu, and Alan Hanjalic. "Co-clustering for Auditory Scene Categorization". IEEE Transactions on Multimedia (TMM), Vol. 10, No. 4, pp.596-606, June 2008.
  17. Rui Cai, Chao Zhang, Lei Zhang, and Wei-Ying Ma. "Scalable Music Recommendation by Search". in Proc. of the 15th ACM International Conference on Multimedia (MM 2007), pp.1065-1074, Augsburg, Germany, September 24-29, 2007.
  18. Rui Cai, Lie Lu, Alan Hanjalic, Hong-Jiang Zhang, and Lian-Hong Cai. "A Flexible Framework for Key Audio Effects Detection and Auditory Context Inference". IEEE Trans. Audio, Speech, and Language Processing (TASLP), Vol. 14, No. 3, pp.1026-1039, May 2006.
  19. Rui Cai, Lie Lu, and Alan Hanjalic. "Unsupervised Content Discovery in Composite Audio". in Proc. of the 13th ACM International Conference on Multimedia (MM 2005), pp.628-637, Singapore, November 6-11, 2005.

 

Granted Patents

  1. United States Patent 8051083. Forum Web Page Clustering based on Repetitive Regions. November 1, 2011.
  2. United States Patent 8099408. Web Forum Crawling Using Skeletal Links. January 17, 2012.
  3. United States Patent 8185482. Modeling Semantic and Structure of Threaded Discussions. May 22, 2012.
  4. United States Patent 8326820. Long-Query Retrieval. December 4, 2012.
  5. United States Patent 8344233. Scalable Music Recommendation by Search. January 1, 2013.
  6. United States Patent 8370119. Website Design Pattern Modeling. February 5, 2013.
  7. United States Patent 8429110. Pattern Tree-based Rule Learning. April 23, 2013.
  8. United States Patent 8438168. Scalable Music Recommendation by Search. May 7, 2013.
  9. United States Patent 8458115. Mining Topic-Related Aspects From User Generated Content. June 4, 2013.
  10. United States Patent 8473574. Automatic Online Video Discovery and Indexing. June 25, 2013.
  11. United States Patent 8645353. Anchor Image Identification for Vertical Video Search. February 4, 2014.
  12. United States Patent 8645354. Scalable Metadata Extraction for Video Search. February 4, 2014.
  13. United States Patent 8650094. Music Recommendation using Emotional Allocation Modeling. February 11, 2014.
  14. United States Patent 8700600. Web Forum Crawling Using Skeletal Links. April 15, 2014.
  15. United States Patent 8856129. Flexible and Scalable Structured Web Data Extraction. October 7, 2014. 

 

Professional Activities

  • PC member. Research Track. The 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2015), Sydney, Australia, August 2015.
  • PC member. Web Mining Track. The 24th International World Wide Web Conference (WWW 2015), Florence, Italy, May 2015.
  • PC member. The 8th ACM International Conference on Web Search and Data Mining (WSDM 2015), Shanghai, China, February 2015.
  • PC member. Research Track, Industry and Government Track. The 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2014), New York, USA, August 2014.
  • PC member. Short Paper Committee. The 37th ACM International Conference on Research and Development in Information Retrieval (SIGIR 2014), Gold Coast, Queensland, Australia, July 2014
  • PC member. The 8th International AAAI Conference on Weblogs and Social Media (ICWSM 2014), Ann Arbor, MI, USA, June 2014
  • PC member. The 15th IEEE International Conference on Multimedia and Expo (ICME 2014), Chengdu, China, July 2014.
  • PC member. Search Systems and Applications Track and Web Mining Track. The 23rd International World Wide Web Conference (WWW 2014), Seoul, Korea, April 2014
  • PC member. The 4th International Workshop on Data Extraction and Object Search (DEOS 2014), attached to the 23rd International World Wide Web Conference (WWW 2014), Seoul, Korea, April 2014
  • PC member. The 9th Asia Information Retrieval Societies Conference (AIRS 2013), Singapore, December 2013
  • PC member. The 21st ACM International Conference on Multimedia (MM 2013), Barcelona, Catalunya, Spain, October 2013
  • PC member. Short Paper Committee. The 36th ACM International Conference on Research and Development in Information Retrieval (SIGIR 2013), Dublin, Ireland, July 2013
  • PC member. The 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2013), Chicago, USA, August 2013.
  • PC member. The 14th IEEE International Conference on Multimedia and Expo (ICME 2013), San Jose, California, USA, July 2013.
  • PC member. The International Symposium on Circuits and Systems (ISCAS 2013), Beijing, China, May 2013.
  • PC member. Content Analysis Track and Search Systems and Applications Track. The 22nd International World Wide Web Conference (WWW 2013), Rio de Janeiro, Brazil, May 2013
  • PC member. Poster Committee. The 35th ACM International Conference on Research and Development in Information Retrieval (SIGIR 2012), Portland, Oregon, USA, August 2012
  • PC member. The 13th IEEE International Conference on Multimedia and Expo (ICME 2012), Melbourne, Australia, July 2012
  • PC member. Web Mining Track. The 21st International World Wide Web Conference (WWW 2012), Lyon, France, April 2012
  • PC member. The 7th Asia Information Retrieval Societies Conference (AIRS 2011), Dubai, United Arab Emirates, Dec. 2011
  • PC member. The 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2011), San Diego, CA, Aug. 2011
  • PC member. The 34th ACM International Conference on Research and Development in Information Retrieval (SIGIR 2011), Beijing, China, July 2011
  • PC member. The 4th ACM International Conference on Web Search and Data Mining (WSDM 2011), Hong Kong, Feb. 2011
  • PC member. The 12th IEEE International Conference on Multimedia and Expo (ICME 2011), Barcelona, Spain, July 2011.
  • PC member. The 6th Asia Information Retrieval Societies Conference (AIRS 2010), Taipei, Taiwan, Dec. 2010
  • PC member. The 5th Asia Information Retrieval Symposium (AIRS 2009), Sapporo, Hokkaido, Japan, Oct. 2009
  • PC member. ACM Workshop on Improving Non English Web searching (ACM iNEWS 2008), in conjunction with the 17th ACM Conference on Information and Knowledge Management (CIKM 2008), Napa Valley, California, Oct. 2008
  • PC member. The 28th International Conference on Distributed Computing Systems (ICDCS 2008), Beijing, China, Jun. 2008
  • PC member. The 4th Asia Information Retrieval Symposium (AIRS 2008), Harbin, China, Jan. 2008
  • Reviewer. Neurocomputing. 2012, 2013
  • Reviewer. IEEE Transactions on Image Processing. 2011, 2012, 2013
  • Reviewer. IEEE Multimedia. 2010
  • Reviewer. IEEE Transactions on Multimedia. 2008, 2010, 2011, 2012, 2013, 2014

Rui Cai, Ph.D.

Lead Researcher

 

Multimedia Search and Mining Group

Microsoft Research Asia  

 

Email: ruicai AT microsoft DOT com

Mail Address: Microsoft Research Asia.

Bldg#2, No. 5 Dan Ling Street, Haidian District

Beijing, PRC, 100080

 

Tel: +86-10-59173029

Fax: +86-10-88099681
