Xiaoxin Yin
One Microsoft Way
Redmond, WA 98052
xyin (at)1microsoft1(dot)1com
I am a researcher
in the
Internet Services Research Center of
Microsoft Research. My current work is focused on applied research on web
search.
I
received my Ph.D. degree in May 2007 from the
Department of Computer Science at
University of Illinois at Urbana Champaign. I was a member of
Data Mining Research Group, and my Ph.D. advisor is
Prof. Jiawei Han. I graduated from
Tsinghua University in 2001 with a B.E. degree in Computer Science, and I
received M.S. degree from UIUC in 2003.
Research
My
research interests include
-
Web Search Quality
-
Link Analysis and Its Application on
Web Search
-
Multi-relational Data Mining
Publications
Theses:
Journal Papers and Book Chapters:
- Xiaoxin Yin, Jiawei Han,
Philip S. Yu. "Truth
Discovery with Multiple Conflicting Information Providers on the Web. IEEE
Trans. Knowledge and Data Engineering". 20(6): 796-808 (2008)
- Xiaoxin Yin, Jiawei Han,
Philip S. Yu. "CrossClus:
User-Guided Multi-Relational Clustering", in the journal of Data Mining
and Knowledge Discovery (DAMI), Springer.
-
Xiaoxin Yin, Jiawei Han, Jiong Yang,
Philip S. Yu. "Efficient
Classification across Multiple Database Relations: A CrossMine Approach",
in IEEE Transactions on Knowledge and Data Engineering (TKDE), 18(6):
770-783, June 2006.
-
Xiaoxin Yin, Jiawei Han, Jiong Yang,
and Philip S. Yu. "CrossMine: Efficient Classification across Multiple
Database Relations", in Constraint-based mining and inductive databases,
Jean-Francois Boulicaut, Luc de Raedt, Heikki Mannila (eds.), Springer
2005.
Conference Papers:
-
Yizhou Sun, Tianyi Wu,
Zhijun Yin, Hong Cheng, Jiawei Han, Xiaoxin Yin, Peixiang Zhao. "BibNetMiner:
mining bibliographic information networks". In 2008 ACM SIGMOD Conference,
June 2008. (Demo)
-
Xiaoxin Yin, Jiawei Han.
"Exploring
the Power of Heuristics and Links in Multi-relational Data Mining". In
17th Int’l. Symp. Methodologies for Intelligent Systems (ISMIS), May 2008.
(Invited paper)
-
Xiaoxin Yin, Jiawei Han, Philip S.
Yu. "Truth
Discovery with Multiple Conflicting Information Providers on the Web."
in 13th Int’l. Conf. on Knowledge Discovery and Data Mining (KDD’07),
San Jose, CA, Aug 2007. (short paper) (Slides)
-
Xiaoxin Yin, Jiawei Han, Philip S.
Yu. "Object
Distinction: Distinguishing Objects with Identical Names by Link Analysis",
in 23rd Int. Conf. on Data Engineering (ICDE'07), Istanbul, Turkey,
April 2007. (short paper) (Slides)
-
Xiaoxin Yin, Jiawei Han, Philip S.
Yu. "LinkClus:
Efficient Clustering via Heterogeneous Semantic Links", in 32nd
Int'l. Conf. on Very Large Data Bases (VLDB'06), Seoul, Korea, Aug 2006.
(Slides)
- Xiaoxin Yin, William Yurcik,
Adam, Slagell, “VisFlowCluster-IP:
Connectivity-Based Visual Clustering of Network Hosts”, in 21st IFIP
International Information Security Conference (SEC’06), Karlstad,
Sweden, May 2006.
-
Xiaoxin Yin, Jiawei Han. "Efficient
Classification from Multiple Heterogeneous Databases", in 9th
European Conf. on Principles and Practice of Knowledge Discovery in
Databases (PKDD'05), Porto, Portugal, Oct 2005.
-
Xiaoxin Yin, Jiawei Han, Philip S.
Yu. "Cross-Relational
Clustering with User's Guidance", in 11th Int'l. Conf. on Knowledge
Discovery and Data Mining (KDD'05), Chicago, IL, Aug 2005. (Slides)
-
Xiaoxin Yin, Jiawei Han, Jiong Yang.
"Searching
for Related Objects in Relational Databases", in 17th Int'l.
Scientific and Statistical Database Management Conf. (SSDBM'05), Santa
Barbara, CA, June 2005.
-
Sandeep Uttamchandani, Xiaoxin Yin,
John Palmer, Gul Agha. "MonitorMining:
A Gray-box Approach for Creating Domain Knowledge in Automation of the
Observe-Analyze-Act Loop", in 9th IFIP/IEEE International Symposium
on Integrated Network Management (IM'2005), Nice, France, May 2005.
-
Xiaolei Li, Jiawei Han, Xiaoxin Yin,
and Dong Xin. "Mining
Evolving Customer-Product Relationships in Multi-Dimensional Space", in
21st Int. Conf. on Data Engineering (ICDE'05), Tokyo, Japan, April
2005. (poster paper)
-
Xiaoxin Yin, Jiawei Han, Jiong Yang,
and Philip S. Yu, "CrossMine:
Efficient Classification across Multiple Database Relations", in 20th
Int'l. Conf. on Data Engineering (ICDE'04), Boston, MA, March 2004. (Slides)
-
Xiaoxin Yin, Jiawei Han. "CPAR:
Classification based on Predictive Association Rules" in 3rd SIAM
International Conference on Data Mining (SDM'03), San Francisco, CA, May
2003. (poster paper)
-
Xiaoyan Zhu, Xiaoxin Yin. "A
New Textual/Non-textual Classifier for Document Skew Correction", in
16th Int. Conf. Pattern Recognition (ICPR'02), Quebec, Canada, Aug,
2002.
Conference Tutorials:
-
Jiawei Han, Xiaoxin Yin
and Philip S. Yu, "Exploring
the Power of Links in Scalable Data Analysis", ICDE'08
conference tutorial, Cancun, Mexico, April 2008.
-
Jiawei Han, Xiaoxin Yin,
and Philip S. Yu, "Exploring
the Power of Links in Data Mining", Tutorial for Proc. 2007 Int.
Conf. on Principles and Practice of Knowledge Discovery in Databases
(PKDD'07), Warsaw, Poland, Sept. 2007.
Full list of publications
(out of date)
Download Research Tools
TruthFinder:
Truth Discovery with Multiple Conflicting Information Providers on the Web
LinkClus: LinkClus:
Efficient Clustering via Heterogeneous Semantic Links
CrossMine: CrossMine:
Efficient Classification across Multiple Database Relations
CPAR: CPAR:
Classification based on Predictive Association Rules
Miscellaneous
I play badminton
in my spare time, although I am bad at it.
I enjoy taking
photos to memorize the precious moments of life. Here is
my gallery.