Assistant Managing Director
Microsoft Research Asia
Microsoft Corp.
Dr. Wei-Ying Ma is an Assistant Managing Director at Microsoft Research Asia, where he oversees multiple research groups including Web Search and Mining, Natural Language Computing, Data Management and Analytics, and Internet Economics and Computational Advertising. He and his team of researchers have developed many key technologies that have been transferred to Microsoft’s Online Services Division, including the Bing search engine and Microsoft Advertising. He has published more than 250 papers in international conferences and journals. He is a Fellow of the IEEE and a Distinguished Scientist of the ACM. He currently serves on the editorial boards of ACM Transactions on Information Systems (TOIS) and the ACM/Springer Multimedia Systems Journal. He is a member of the International World Wide Web (WWW) Conferences Steering Committee. In recent years, he served as program co-chair of WWW 2008, program co-chair of the Pacific Rim Conference on Multimedia (PCM) 2007, general co-chair of the Asia Information Retrieval Symposium (AIRS) 2008, and general co-chair of ACM SIGIR 2011.
Before joining Microsoft in 2001, Wei-Ying was with Hewlett-Packard Labs in Palo Alto, California, where he worked in the fields of multimedia content analysis and adaptation. From 1994 to 1997, he was engaged in the Alexandria Digital Library project at the University of California, Santa Barbara. He received a Bachelor of Science degree in electrical engineering from National Tsing Hua University in Taiwan in 1990. He earned a Master of Science degree and a doctorate in electrical and computer engineering from the University of California at Santa Barbara in 1994 and 1997, respectively.
Summary of Research Works
Wei-Ying Ma developed several technologies during his Ph.D. research at the University of California at Santa Barbara, including one of the first content-based image retrieval systems on the Web (called Netra), the widely used Gabor texture features for image retrieval, and one of the first practical image segmentation solutions for processing a large number and variety of natural scene images (which enables image retrieval systems to provide region-based search capabilities). He was also one of the first researchers to identify the problem of similarity measures in content-based image retrieval, and he developed a machine learning approach to learn the similarity measure for image retrieval. In recent years, he has been leading a team at Microsoft Research Asia to develop a system that analyzes large-scale multimedia data for automatic annotation.
Since 2003, Wei-Ying has expanded his research into general Web search and has applied many innovative ideas from image analysis and segmentation to Web page analysis and information extraction. In particular, he developed the first technique to analyze Web pages using visual cues and to use that information to model the Web and extract structured data from Web pages. With these advanced Web-analysis techniques, Wei-Ying has led his team to develop a next-generation search engine that goes beyond traditional page-level relevance ranking. By extracting and integrating information about real-world entities such as people, places, and things (e.g. products) from billions of public Web pages, his system creates a paradigm shift in Web search by enabling search queries, relevance ranking, and browsing and navigation of search results at the level of entities and objects. The resulting entity-level search engine, the first on the Web that provides automatic summaries of entities and allows users to navigate and explore their relationships, can be found at http://entitycube.research.microsoft.com (a Chinese version of the search engine, called Renlifang, is also available at http://renlifang.msra.cn). He and his team also built the Microsoft academic search engine based on entity-level search technologies, which is available at http://academic.research.microsoft.com/. It provides many innovative ways to retrieve, rank, and explore scientific papers, conferences, journals, and authors based on their importance and relationships.
Wei-Ying and his team also initiated an effort at Microsoft to develop a web-scale data mining infrastructure for search. Unlike traditional Internet services, search involves myriad offline computations that analyze data at a very large scale, and an infrastructure for “scale” experiments is often required to evaluate the effectiveness of newly invented algorithms in a semi-real environment. Such an infrastructure is also critical for supporting massive web mining, knowledge discovery, and asynchronous metadata exchange in a search engine pipeline, so that the cycle of idea formulation, experimentation, and deployment can be iterated quickly.
Wei-Ying is an inventor or co-inventor of over 80 patents in the area of web search and multimedia information retrieval.
The following are some of the systems that Wei-Ying and his team have developed at Microsoft Research and released to the public.
EntityCube (http://entitycube.research.microsoft.com)
EntityCube is a research prototype for exploring object-level search technologies, which automatically summarizes the Web for entities with a modest web presence. Key technologies include web-scale entity extraction, name disambiguation, entity ranking, and relationship extraction and exploration.
Renlifang (http://renlifang.msra.cn/)
Renlifang is the Chinese version of EntityCube (the name EntityCube is the English translation of Renlifang). It currently receives millions of daily page views during peak times and has received wide press coverage and publicity in China.
Microsoft Academic Search Engine (http://academic.research.microsoft.com/)
Using similar technologies, Wei-Ying and his team created this academic search service to facilitate the exchange of ideas and communication among academic communities. A user can retrieve relevant information on academic papers, scientists, conferences, and journals, and thus obtain more accurate, relevant, and efficient results than with document-level ranking. Features of this search service include the ability to find top scientists, conferences, and journals in a specific field, locate top research papers, and identify rising stars or hot topics in a specific field.
Microsoft TravelGuide (http://travel.msra.cn/)
TravelGuide is a vertical search engine for the travel domain that uses deep web crawling and forum site structure analysis technologies developed by Wei-Ying and his team. The engine aggregates travel-related information from across the web and presents relevant knowledge to users, helping them learn more about travel destinations, such as popular places, themes, and short trips.
Selected Publications
- Object-Level Ranking: Bringing Order to Web Objects
Zaiqing Nie, Yuanzhi Zhang, Ji-Rong Wen, and Wei-Ying Ma
International World Wide Web conference (WWW 2005)
- 2D Conditional Random Fields for Web Information Extraction
Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Ying Ma
The 22nd International Conference on Machine Learning (ICML 2005)
- Extracting Objects from the Web
Zaiqing Nie, Fei Wu, Ji-Rong Wen, Wei-Ying Ma
The 22nd International Conference on Data Engineering (ICDE 2006)
- Simultaneous Record Detection and Attribute Labeling in Web Data Extraction
Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Ying Ma
International Conference on Knowledge Discovery and Data Mining (SIGKDD 2006)
- Object-Level Vertical Search
Zaiqing Nie, Yuanzhi Zhang, Ji-Rong Wen, and Wei-Ying Ma
The Third Biennial Conference on Innovative Data Systems Research (CIDR 2007)
- Webstudio: Building Infrastructure for Web Data Management
Ji-Rong Wen and Wei-Ying Ma
Invited talk at International Conference on Management of Data (SIGMOD 2007)
- Web Object Retrieval
Zaiqing Nie, Yunxiao Ma, Shuming Shi, Ji-Rong Wen, Wei-Ying Ma
International World Wide Web conference (WWW 2007)
- Exploring traversal strategy for web forum crawling
Yida Wang, Jiang-Ming Yang, Wei Lai, Rui Cai, Lei Zhang, Wei-Ying Ma
The 32nd Annual International ACM SIGIR Conference (SIGIR 2008)
- Mining interesting locations and travel sequences from GPS trajectories
Yu Zheng, Lizhu Zhang, Xing Xie, Wei-Ying Ma
International World Wide Web conference (WWW 2009)
- Incorporating site-level knowledge to extract structured data from web forums
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei Zhang, Wei-Ying Ma
International World Wide Web conference (WWW 2009)
- Learning to cluster web search results
Hua-Jun Zeng, Qi-Cai He, Zheng Chen, Wei-Ying Ma, Jinwen Ma
ACM SIGIR Conference 2004: 210-217 (SIGIR 2004)
- Learning block importance models for web pages
Ruihua Song, Haifeng Liu, Ji-Rong Wen, Wei-Ying Ma
International World Wide Web Conference 2004: 203-211 (WWW 2004)
- Learning a Semantic Space from User's Relevance Feedback for Image Retrieval
Xiaofei He, Oliver King, Wei-Ying Ma, Mingjing Li, Hongjiang Zhang
IEEE Transactions on Circuits and Systems for Video Technology, Jan 2003
- Detecting web page structure for adaptive viewing on small form factor devices
Yu Chen, Wei-Ying Ma, Hong-Jiang Zhang
International World Wide Web Conference 2003: 225-233 (WWW 2003)
- VIPS: a vision-based page segmentation algorithm
D. Cai, S. Yu, J.R. Wen, Wei-Ying Ma
Microsoft Research Technical Report, MSR-TR-2003-79, 2003
- Improving pseudo-relevance feedback in web information retrieval using web page segmentation
Shipeng Yu, Deng Cai, Ji-Rong Wen, Wei-Ying Ma
International World Wide Web Conference 2003: 11-18 (WWW 2003)
- Probabilistic query expansion using query logs
Hang Cui, Ji-Rong Wen, Jian-Yun Nie, Wei-Ying Ma
International World Wide Web 2002: 325-332 (WWW 2002)
- Edge flow: a technique for boundary detection and image segmentation
Wei-Ying Ma and B. S. Manjunath
IEEE Transactions on Image Processing, Vol. 9, No. 8, pp. 1375-1388, August 2000
- Netra: a toolbox for navigating large image databases
Wei-Ying Ma and B. S. Manjunath
IEEE Multimedia Systems. Vol. 7, pp. 184-198, May 1999
- Texture features for browsing and retrieval of image data
B. S. Manjunath and Wei-Ying Ma
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, pp. 837-842. August 1996
Keynote Speeches
Wei-Ying Ma has been invited to give keynote speeches at the following academic conferences and industrial forums on web search, multimedia computing, and cloud computing.
- Empowering People with Knowledge: the Next Frontier for Web Search
The 14th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Hyderabad, India, 2010.
- Rethinking Multimedia Search in the new “Cloud + Clients” Era
The Workshop on Large-scale Multimedia Mining and Retrieval at ACM Multimedia Conference 2009
- Cloud Computing and the Future of Internet Services
The 10th International Mobile Data Management (MDM) Conference 2009
- Building Web-scale Data Mining Infrastructure for Search
The 10th Asia-Pacific Web Conference, APWeb 2008
- The Challenges and Opportunities of Mining Billions of Web Images for Search and Online Applications
The Multimedia Retrieval Workshop at SIGIR 2007
- The Challenges and Opportunities of Mining Billions of Web Images for Search and Advertising
The 9th International Conference on Visual Information Systems, VISUAL2007
- Building Infrastructure to Support Web-scale Data Mining for Search
DBWeb in Kyoto, Japan, 2006
- Object-level Vertical Search
Workshop on Web Information Retrieval and Integration at ICDE 2006
- From Relevance to Intelligence: Toward Next Generation Web Search
Multimedia Information Retrieval (MIR) Workshop at ACM Multimedia Conference 2005
- Adaptive Content Delivery on Mobile Internet across Multiple Form Factors
International Multimedia Modeling (MMM) Conference 2004
- Towards Next Generation Web Search
International Conference on Web Information Systems (WISE) 2004
