Yunbo Cao (曹云波)

Associate Researcher

Microsoft Research Asia

5F, Sigma Center,

No. 49 Zhichun Road, Haidian District,

Beijing, China 100190

yunbo dot cao at microsoft dot com

 

    Education

2006 ~ present

PhD candidate in computer science, Shanghai Jiao Tong University, Shanghai

1998 ~ 2001

M.S. in computer science, Peking University, Beijing

1994 ~ 1998

B.S. in probability & statistics, Peking University, Beijing

    Professional Activities

 

PC Member

ACL-HLT 2008, SIGIR 2008, AIRS 2008, NLP-KE 2008, WICOW-2, SIGIR 2007, NAACL-HLT 2007, SIGIR 2006 (poster), AIRS 2006

 

Reviewer

IEEE Transactions on Knowledge and Data Engineering, Pattern Recognition Letters, Journal of Computer Science and Technology

    Publications

29.

Yunbo Cao, Huizhong Duan, Chin-Yew Lin, Yong Yu, and Hsiao-Wuen Hon. Recommending Questions Using the MDL-Based Tree Cut Model. WWW 2008

28.

Huizhong Duan, Yunbo Cao, Chin-Yew Lin, and Yong Yu. Searching Questions by Identifying Question Topic and Question Focus. ACL 2008.

27.

Shenghua Bao, Huizhong Duan, Qi Zhou, Miao Xiong, Yunbo Cao, and Yong Yu. A Probabilistic Model for Fine-Grained Expert Search. ACL 2008.

26.

Yuanjie Liu, Shasha Li, Yunbo Cao, Chin-Yew Lin, Dingyi Han, and Yong Yu. Understanding and Summarizing Answers in Community-based Question Answering Services. COLING 2008.

25.

Young-In Song, Chin-Yew Lin, Yunbo Cao, and Hae-Chang Rim. Question Utility: A Novel Static Ranking of Question Search. AAAI 2008.

24.

Jingjing Liu, Yunbo Cao, and Yalou Huang. Effective Entity Resolution in Product Review Domain. ICMLC 2007.

23.

Jingjing Liu, Yunbo Cao, Chin-Yew Lin, and Yalou Huang. Low-Quality Product Review Detection in Opinion Summarization. EMNLP-CoNLL 2007.

22.

Jun Xu, Yunbo Cao, Hang Li, Nick Craswell, and Yalou Huang. Searching Documents Based on Relevance and Type. ECIR 2007

21.

Shengliang Xu, Shenghua Bao, Yunbo Cao, and Yong Yu. Using Social Annotations to Improve Language Model for Information Retrieval. CIKM 2007

20.

Shengliang Xu, Shenghua Bao, Yong Yu, and Yunbo Cao. Using Social Annotations to Smooth the Language Model for IR. PAKDD 2007

19.

Huizhong Duan, Qi Zhou, Zhen Lu, Ou Jin, Shenghua Bao, Yunbo Cao, and Yong Yu. Research on Enterprise Track of TREC 2007 at SJTU APEX Lab. TREC 2007.

18.

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shuming Shi, Yunbo Cao, Chin-Yew Lin, and Hang Li. Web Page Title Extraction and Its Application. Information Processing and Management, 2007

17.

Yunhua Hu, Hang Li, Yunbo Cao, Li Teng, Dmitriy Meyerzon, and Qinghua Zheng. Automatic Extraction of Titles from General Documents Using Machine Learning. Information Processing and Management, 2007

16.

Yunbo Cao, Jun Xu, Tie-Yan Liu, Hang Li, Yalou Huang, and Hsiao-Wuen Hon. Adapting Ranking SVM to Document Retrieval. SIGIR 2006.

15.

Shenghua Bao, Huizhong Duan, Qi Zhou, Miao Xiong, Yunbo Cao, and Yong Yu. Research on Expert Search at Enterprise Track of TREC 2006. TREC 2006

14.

Jun Xu, Yunbo Cao, Hang Li and Yalou Huang. Cost-Sensitive Learning of SVM for Ranking. ECML 2006.

13.

Shenghua Bao, Yunbo Cao, Hang Li, Bing Liu, and Yong Yu. Mining Latent Associations of Objects Using a Typed Mixture Model: A Case Study on Expert/Expertise Mining. ICDM 2006 (short paper).

12.

Rui Li, Shenghua Bao, Jin Wang, Yong Yu and Yunbo Cao. An Effective Algorithm for Mining Competitors from the Web. ICDM 2006 (short paper).

11.

Guo-Ping Hu, Jingjing Liu, Hang Li, Yunbo Cao, Jian-Yun Nie, and Jianfeng Gao. A Supervised Learning Approach to Entity Search. AIRS 2006

10.

Jun Xu, Yunbo Cao, Hang Li, Min Zhao, and Yalou Huang. A Supervised Learning Approach to Search of Definitions. Journal of Computer Science and Technology.

9.

Yunbo Cao, Jingjing Liu, Shenghua Bao and Hang Li. Research on Expert Search at Enterprise Track of TREC 2005. TREC 2005

8.

Jun Xu, Yunbo Cao, Hang Li and Min Zhao. Ranking Definitions with Supervised Learning Methods. WWW 2005

7.

Hang Li, Yunbo Cao, Jun Xu, Yunhua Hu, Shenjie Li, Dmitriy Meyerzon. A New Approach to Intranet Search Based on Information Extraction. CIKM 2005

6.

Jie Tang, Hang Li, Yunbo Cao and Zhaohui Tang. Email Data Cleaning. KDD 2005.

5.

Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, and Qinghua Zheng. Automatic Extraction of Titles from General Documents Using Machine Learning. JCDL 2005

4.

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Shuming Shi, Yunbo Cao, Hang Li. Title Extraction from Bodies of HTML Documents and Its Application to Web Page Retrieval. SIGIR 2005

3.

Yunbo Cao, Hang Li, Li Lian. Uncertainty Reduction in Collaborative Bootstrapping: Measure and Algorithm. ACL 2003

2.

Hang Li, Yunbo Cao and Cong Li. Using Bilingual Web Data to Mine and Rank Translations. IEEE Intelligent Systems 18 (4), pp. 54-59, 2003

1.

Yunbo Cao and Hang Li. Base Noun Phrase Translation Using Web Data and the EM Algorithm. COLING 2002

    Patents

Yunbo Cao and Hang Li. Method and Apparatus for Browsing Document Content. US Patent No. 7,284,006.

Yunbo Cao and Hang Li. Learning and Using Generalized String Patterns for Information Extraction. US Patent No. 7,299,228

    Product Developments

Microsoft TextMiner (internal tool)

Microsoft SQL Server 2005 Text Mining

Microsoft SharePoint Server 2007 Search

     Last updated: 6/1/2008, by Yunbo Cao