Kaushik Chakrabarti
Data Management, Exploration and Mining Group
Microsoft Research
One Microsoft Way
Redmond, WA 98052
Fax: (425)936-7329
Phone: (425)703-5137
Email: kaushik@microsoft.com
Click here for a short bio
Recent News
(7/2012) I have been invited to give a keynote talk at the Workshop on Entity-oriented and Semantic Search (JIWES) at SIGIR 2012. Here is the title and abstract.
(8/2011) I have been invited to serve as a workshop co-chair for VLDB 2014 to be held at the beautiful Hangzhou, China.
(7/2011) The Distributed and Parallel Databases Journal (Springer’s international journal on database management and information retrieval) is planning a special issue on ranking in databases. I am the editor of this special issue. If you are working on ranking in databases, please consider submitting your work to this special issue. The deadline for the paper submission is October 7, 2011. The call for papers can be found on the journal web site: http://www.springer.com/journal/10619
(4/2011) The fifth international workshop on ranking in databases (DBRank) 2011 will be held in conjunction with VLDB 2011 in Seattle, WA, USA. Davide Martinenghi and I are the program co-chairs. If you are working on ranking, please consider submitting a paper to DBRank 2011. The deadline is June 7, 2011.
Research Interests
Kaushik's interests spans many aspects of data management, information retrieval and data mining. He is specifically interested in the following topics:
- Bridging structured and unstructured data: how can structured and unstructured information (text) be used in conjunction (rather than separately) for better search and analysis?
- Information Extraction and Text Analytics: how can unstructured data inside the enterprise as well as that on the web be used for business intelligence?
- Web Mining: how can we mine the web to extract knowledge about structured entities like products and people?
- Context-aware Search: how can a user's context be used to improve search?
- Analysis over big data: how can computational paradigms like MapReduce be used to analyse massive amounts of data (like query logs, web documents)?
- Information Retrieval: how can IR engines efficiently support new kinds of queries like top-k and proximity queries?
- Machine learning: how can machine learning techniques be used to solve real-world problems in search, information extraction and data analysis?
Education
- Ph.D., Computer Science, University of Illinois at Urbana Champaign, 2001.
- M.S., Computer Science, University of Illinois at Urbana Champaign, 1999.
- B. Tech. , Computer Science and Engineering, Indian Institute of Technology, 1996.
Recent Professional Activities
- Workshop co-chair, VLDB 2014
- PC Member, ICDE 2013
- Member of Best Paper Award committee, ICDM 2011
- Area PC Vice-Chair, "Query Processing and Optimization" track, ICDE 2012
- Associate editor, TKDE
- Program co-chair, DBRank 2011 (at VLDB 2011)
- Member of Editorial Board, Distributed and Parallel Databases Journal
- PC Member, WWW 2011
- PC Member, ICDE 2011 (Demo track)
- PC Member, ICDM 2011
- PC Member, SIGMOD 2010
- PC Member, ICDE 2010 (Both Research and Demo tracks)
- PC Member, DBRank 2010
- PC Member, ICDM 2009
- PC Member, ICME (IEEE International Conference on Multimedia and Expo) 2009
- PC Member, DBRank 2009
- PC Member, ICME 2008
- PC Member, ICDM 2007
- PC Member, DBRank 2007
- PC Member, ECML/PKDD 2007
- PC Member, ICDM 2006
- PC Member, ICDM 2005
Best Paper Awards
-
Our paper Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases awarded Best Paper of SIGMOD 2001
-
Our paper Approximate Query Processing Using Wavelets awarded "Best of VLDB 2000" (along with 4 other papers) and invited to "Best of VLDB 2000" issue of VLDB Journal
2013
- Meihui Zhang and Kaushik Chakrabarti, Semantic Matching and Annotation of Numeric and Time-Varying Attributes in Web Tables, ACM SIGMOD, June 2013
- Bilyana Taneva, Tao Cheng, Kaushik Chakrabarti, and Yeye He, Mining Acronym Expansions and Their Meanings Using Query Click Log, WWW Conference 2013, May 2013
- Tao Cheng, Kaushik Chakrabarti, Surajit Chaudhuri, Vivek Narasayya, and Manoj Syamala, Data Services for E-tailers Leveraging Web Search Engine Assets, in ICDE Conference, April 2013
2012
- Kaushik Chakrabarti, Surajit Chaudhuri, Tao Cheng, and Dong Xin, A Framework for Robust Discovery of Entity Synonyms, in SIGKDD, 2012
- Mohamed Yakout, Kris Ganjam, Kaushik Chakrabarti, and Surajit Chaudhuri, InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables, in ACM SIGMOD Conference, 2012
- Chi Wang, Kaushik Chakrabarti, Tao Cheng, and Surajit Chaudhuri, Targeted Disambiguation of Ad-hoc, Homogeneous Sets of Named Entities, in World Wide Web Conference, 2012
2011
- Senjuti Basu Roy and Kaushik Chakrabarti, Location-Aware Type Ahead Search on Spatial Databases: Semantics and Efficiency, in ACM SIGMOD Conference, June 2011
- Bahman Bahmani, Kaushik Chakrabarti, and Dong Xin, Fast Personalized PageRank on MapReduce, in ACM SIGMOD Conference, June 2011
- Kaushik Chakrabarti, Surajit Chaudhuri, and Venkatesh Ganti, Interval-Based Pruning for Top-k Processing over Compressed Lists, in ICDE Conference, IEEE, April 2011
- Kaushik Chakrabarti, Surajit Chaudhuri, Tao Cheng, and Dong Xin, Automatically Tagging Entities with Descriptive Phrases, in WWW (Poster paper), 2011
2010
- Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaudhuri, Venkatesh Ganti, Arnd Christian König, and Dong Xin, Query Portals: Dynamically Generating Portals for Entity-Oriented Web Queries, in International Conference on Management of Data (SIGMOD 2010) , Association for Computing Machinery, Inc., 6 June 2010
2009
- Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaudhuri, Venkatesh Ganti, Arnd Christian König, and Dong Xin, Exploiting Web Search Engines to Search Structured Information , in 18th International World Wide Web Conference (WWW 2009), Association for Computing Machinery, Inc., April 2009
- Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaudhuri, Venkatesh Ganti, Arnd Christian König, and Dong Xin, Query Portals: Dynamically Generating Portals for Web, in 18th International World Wide Web Conference (WWW 2009), Association for Computing Machinery, Inc., April 2009
2008
- Kaushik Chakrabarti, Surajit Chaudhuri, Venkatesh Ganti, and Dong Xin, An efficient filter for approximate membership checking, in SIGMOD Conference, Association for Computing Machinery, Inc., 2008
- Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaudhuri, and Venkatesh Ganti, Scalable Adhoc Entity Extraction from Text Collections , in VLDB Conference, 2008
2006
- Kaushik Chakrabarti, Venkatesh Ganti, Jiawei Han, and Dong Xin, Ranking Objects by Exploiting Relationships: Computing Top-K over Aggregation, in SIGMOD Conference, 2006
2004
- Kaushik Chakrabarti, Michael Ortega-Binderberger, Sharad Mehrotra, and Kriengkrai Porkaew, Evaluating Refined Queries in Top-k Retrieval Systems, in IEEE Trans. Knowl. Data Eng. (TKDE) , 2004
- Kaushik Chakrabarti, Surajit Chaudhuri, and Seung-won Hwang, Automatic Categorization of Query Results , in ACM SIGMOD Conference, 2004
2003
- Michael Ortega-Binderberger, Kaushik Chakrabarti, and Sharad Mehrotra, Efficient Evaluation of Relevance Feedback for Multidimensional All-pairs Retrieval, in ACM symposium on Applied computing , 2003
2002
- Michael Ortega-Binderberger, Kaushik Chakrabarti, and Sharad Mehrotra, An Approach to Integrating Query Refinement in SQL, in EDBT Conference, 2002
- Kaushik Chakrabarti, Eamonn Keogh, Sharad Mehrotra, and Michael Pazzani, Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases, in ACM Transactions on Database Systems (TODS), 2002
2001
- Kaushik Chakrabarti, Michael Ortega, Kriengkrai Porkaew, and Sharad Mehrotra, Query Refinement in Similarity Retrieval Systems, in IEEE Data Engineering Bulletin, 2001
- Eamonn Keogh, Kaushik Chakrabarti, Sharad Mehrotra, and Michael Pazzani, Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases, in ACM SIGMOD Conference, 2001
- Kaushik Chakrabarti, Minos Garofalakis, Rajeev Rastogi, and Kyuseok Shim, Approximate Query Processing Using Wavelets, in VLDB Journal, 2001
- Eamonn Keogh, Kaushik Chakrabarti, Michael Pazzani, and Sharad Mehrotra, Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases, in Knowledge and Information Systems Journal (KAIS), 2001
- Kaushik Chakrabarti, Large Multidimensional Datasets inside a Database System (Ph.D. Thesis), 2001
- Michael Ortega-Binderberger, Kaushik Chakrabarti, and Sharad Mehrotra, Database Support for Multimedia Applications, in Image Databases: Search and Retrieval of Digital Imagery , John Wiley & Sons, 2001
2000
- Kaushik Chakrabarti and Sharad Mehrotra, Local Dimensionality Reduction: A New Approach to Indexing High Dimensional Spaces, in VLDB Conference, 2000
- Michael Ortega-Binderberger, Sharad Mehrotra, Kaushik Chakrabarti, and Kriengkrai Porkaew, WebMARS: A Multimedia Search Engine, in IS&T/SPIE's Annual Symposium of Electronic Imaging, 2000
- Kaushik Chakrabarti, Michael Ortega-Binderberger, Kriengkrai Porkaew, and Sharad Mehrotra, Similar Shape Retrieval in MARS, in IEEE International Conference on Multimedia, 2000
- Kaushik Chakrabarti, Minos Garofalakis, Rajeev Rastogi, and Kyuseok Shim, Approximate Query Processing Using Wavelets, in VLDB Conference, 2000
- Kaushik Chakrabarti, Kriengkrai Porkaew, and Sharad Mehrotra, Efficient Query Refinement in Multimedia Databases, in ICDE Conference, 2000
1999
- Kriengkrai Porkaew, Sharad Mehrotra, Michael Ortega, and Kaushik Chakrabarti, Similarity Search Using Multiple Examples in MARS, in International Conference on Visual Information and Information Systems , 1999
- Kaushik Chakrabarti and Sharad Mehrotra, The Hybrid Tree: An Index Structure for High Dimensional Feature Spaces, in ICDE Conference, 1999
- Kriengkrai Porkaew, Kaushik Chakrabarti, and Sharad Mehrotra, Query Refinement for Multimedia Similarity Retrieval in MARS, in ACM Multimedia Conference, 1999
- Kaushik Chakrabarti and Sharad Mehrotra, Efficient Concurrency Control in Multidimensional Access Methods, in ACM SIGMOD Conference, 1999
1998
- Kaushik Chakrabarti, Sharad Mehrotra, Michael Ortega, Kriengkrai Porkaew, and Robert Winkler, Processing Uncertainty Queries in Database Management Systems, in 1998 Annual FedLab Symposium, Advanced Displays & Interactive Displays Consortium, 1998
- Kaushik Chakrabarti and Sharad Mehrotra, Dynamic Granular Locking Approach to Phantom Protection in R-trees, in ICDE Conference, 1998
- Michael Ortega, Yong Rui, Kaushik Chakrabarti, Kriengkrai Porkaew, Sharad Mehrotra, and Thomas S. Huang, Supporting Ranked Boolean Similarity Queries in MARS, in IEEE Trans. Knowl. Data Eng. (TKDE), 1998
1997
- Michael Ortega, Yong Rui, Kaushik Chakrabarti, Sharad Mehrotra, and Thomas S. Huang, Supporting Similarity Queries in MARS, in ACM Multimedia Conference, 1997
