Brief Bio
I am a researcher in the Data Management, Exploration, and Mining (DMX) group at Microsoft Research. I have a PhD in computer science from Stanford University and a B.Tech. from Indian Institute of Technology, Madras. I am originally from Bangalore, India.
Research
My research interests are broadly in the area of information management. I am currently working on the data cleaning project in the DMX group. Broadly, the goal of the project is to develop scalable techniques and tools to identify and fix inconsistencies and errors in data. I am also very interested in continuous queries and data streams, which was the topic of my PhD thesis.
- Arvind Arasu, Surajit Chaudhuri, and Raghav Kaushik, Learning String Transformations from Examples, in VLDB, Very Large Data Bases Endowment Inc., August 2009
- Arvind Arasu and Raghav Kaushik, A Declarative Entity Representation Framework, in Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), June 2009
- Arvind Arasu, Christopher Re, and Dan Suciu, Large-Scale Deduplication with Constraints using Dedupalog, in Proceedings of the 25th International Conference on Data Engineering, ICDE 2009, IEEE Computer Society, 29 March 2009
- Arvind Arasu, Surajit Chaudhuri, and Raghav Kaushik, Transformation-based Framework for Record Matching, in Proceedings of the 24th International Conference on Data Engineering, ICDE 2008, IEEE Computer Society, June 2008
- Arvind Arasu, Venkatesh Ganti, and Raghav Kaushik, Efficient Exact Set-Similarity Joins, in Proceedings of the 32nd International Conference on Very Large Data Bases, VLDB 2006, Very Large Data Bases Endowment Inc., August 2006
Professional Activities
- Program Committees: ICDE 2007, VLDB 2007 (Demo track), SSPS 2007, DASFAA 2008, MMSDE 2008, SSPS 2008
- Reviewer: ACM TODS, VLDB Journal, IEEE TKDE, and others.
Contact Information
Arvind Arasu (99/3737),
One Microsoft Way,
Redmond, WA 98052
Phone: +1 (425) 706 0543
Email: arvinda@microsoft.com



