John Shafer is a member of Microsoft Search Labs in Mountain View, California, where he is working on using data mining to improve search quality.
He received a B.S. in Computer Science and Engineering from Cornell University in 1992, and a Ph.D. in Computer Science from the University of Wisconsin - Madison in 1998. His thesis was on the design and implementation of parallel algorithms for large-scale data-mining, most of this work being done while a member of the Quest Data Mining group at IBM Almaden Research Center (1994-2000). The implementations of these algorithms were subsequently incorprated into IBM's Intelligent Miner for Data product.
Prior to joining Search Labs, John worked in industry for 6 years, first at Propel Systems (2000-2001), developing a scalable and distributed platform for eCommerce, and more recently at BEA Systems, developing core infrastructure for their Business Process Modeling and Enterprise Service Bus products, shipping several versions of each over the last 4.5 years.
- Ekaterina Gonina, Anitha Kannan, John Shafer, and Mihai Budiu, Parallelizing large-scale data processing applications with data skew: a case study in product-offer matching, in International Workshop on MapReduce and its Applications (MapReduce) 2011, June 2011
- John Shafer, Rakesh Agrawal, and Hady Lauw, Symphony: A Platform for Search-Driven Applications, in Proc. ICDE Conf (ICDE), IEEE, March 2010
- Hady Lauw, John Shafer, Rakesh Agrawal, and Alex Ntoulas, Homophily in the Digital World: A LiveJournal Case Study, in IEEE Internet Computing: Special Issue on Social Computing in Blogosphere, IEEE Computer Society, March 2010
- John Shafer, Rakesh Agrawal, and Hady Lauw, Symphony: Enabling Search-Driven Applications, in VLDB 2009 Workshop on Using Search Engine Technology for Information Management (USETIM), August 2009
- Stelios Paparizos, Alexandros Ntoulas, John Shafer, and Rakesh Agrawal, Answering web queries using structured data sources, in Proc. SIGMOD Conf., June 2009
- Ariel Fuxman, Anitha Kannan, Andrew B Goldberg, Rakesh Agrawal, Panayiotis Tsaparas, and John Shafer, Improving Classification Accuracy Using Automatically Extracted Training Data, in International Conference on Knowledge Discovery and Data Mining, June 2009