Dennis Fetterly
RESEARCHER
.
Dennis is a Researcher in Microsoft Research's Silicon Valley lab, which he joined in May, 2003. His research interests include a wide variety of topics including web crawling, the evolution and similarity of pages on the web, identifying spam web pages, and large scale distributed systems. He is currently working on DryadLINQ, TidyFS, and a project evaluating policies for corpus selection. Interesting past projects include the MSRBot web crawler, Dryad, the Your Desktop on Your Keychain project, which utilizes flash memory devices to enable users to carry their desktop PC state with them from machine to machine, and PageTurner, a large scale study of the evolution of web-pages.
Publications
- Omar Alonso, Dennis Fetterly, and Mark Manasse, Duplicate News Story Detection Revisited, no. MSR-TR-2013-60, May 2013
- Nick Craswell, Bodo Billerbeck, Dennis Fetterly, and Marc Najork, Robust Query Rewriting using Anchor Data, in 6th ACM International Conference on Web Search and Data Mining (WSDM), ACM, February 2013
- Marc Najork, Dennis Fetterly, Alan Halverson, Krishnaram Kenthapadi, and Sreenivas Gollapudi, Of Hammers and Nails: An Empirical Comparison of Three Paradigms for Processing Large Graphs, in 5th ACM International Conference on Web Search and Data Mining (WSDM), ACM, February 2012
- Bodo Billerbeck, Nick Craswell, Dennis Fetterly, and Marc Najork, Microsoft Research at TREC 2011 Web Track, in Proc. of the 20th Text Retrieval Conference (TREC), National Institute of Standards and Technology , November 2011
- Dennis Fetterly, Maya Haridasan, Michael Isard, and Swaminathan Sundararaman, TidyFS: A Simple and Small Distributed File System, in Proceedings of the USENIX Annual Technical Conference (USENIX'11), USENIX, 15 June 2011
- Nick Craswell, Dennis Fetterly, and Marc Najork, The Power of Peers, in 33rd European Conference on IR Research (ECIR), Springer Verlag, April 2011
- Nick Craswell, Dennis Fetterly, and Marc Najork, Microsoft Research at TREC 2010 Web Track, in Proc. of the 19th Text Retrieval Conference (TREC), National Institute of Standards and Technology , November 2010
- Dennis Fetterly, Maya Haridasan, Michael Isard, and Swaminathan Sundararaman, TidyFS: A Simple and Small Distributed Filesystem, no. MSR-TR-2010-124, 1 October 2010
- Dennis Fetterly and Frank McSherry, A Data-Parallel Toolkit for Information Retrieval, in Proceedings of SIGIR, Association for Computing Machinery, Inc., 19 July 2010
- Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Ulfar Erlingsson, Pradeep Kumar Gunda, Jon Currey, Frank McSherry, and Kannan Achan, Some sample programs written in DryadLINQ, no. MSR-TR-2009-182, December 2009
- Nick Craswell, Dennis Fetterly, Marc Najork, Stephen Robertson, and Emine Yilmaz, Microsoft Research at TREC 2009: Web and Relevance Feedback Tracks, in Proc. of the 18th Text Retrieval Conference (TREC), National Institute of Standards and Technology , November 2009
- Dennis Fetterly, Nick Craswell, and Vishwa Vinay, The Impact of Crawl Policy on Web Search Effectiveness, in Proceedings of the 32nd annual international ACM SIGIR conference on Research and development in information retrieval, Association for Computing Machinery, Inc., 19 July 2009
- Dennis Fetterly, Nick Craswell, and Vishwa Vinay, Measuring the Search Effectiveness of a Breadth-First Crawl , in Proceedings of the 31st European Conference on Information Retrieval (ECIR) , Springer Verlag, 9 April 2009
- Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep Kumar Gunda, and Jon Currey, DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language, in OSDI'08: Eighth Symposium on Operating System Design and Implementation, USENIX, December 2008
- Dennis Fetterly, Nick Craswell, and Vishwa Vinay, Search Effectiveness with a Breadth-First Crawl, in Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, 2008
- Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly, Dryad: Distributed Data-parallel Programs from Sequential Building Blocks, in Proceedings of the 2007 Eurosys Conference, Association for Computing Machinery, Inc., Lisbon, Portugal, March 2007
- Muthukarrupan Annamalai, Andrew Birrell, Dennis Fetterly, and Ted Wobber, Implementing Portable Desktops: a New Option and Comparisons, no. TR-2006-151, October 2006
- Alexandros Ntoulas, Marc Najork, Mark Manasse, and Dennis Fetterly, Detecting Spam Web Pages Through Content Analysis, in 15th International World Wide Web Conference (WWW), Association for Computing Machinery, Inc., Edinburgh, Scotland, May 2006
- Dennis Fetterly, Mark Manasse, and Marc Najork, Detecting Phrase-Level Duplication on the World Wide Web, in 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Association for Computing Machinery, Inc., Salvador, Brazil, August 2005
- Dennis Fetterly, Mark Manasse, and Marc Najork, On the Evolution of Clusters of Near-Duplicate Web Pages, in Journal of Web Engineering, vol. 2, no. 4, pp. 228-246, Institute of Electrical and Electronics Engineers, Inc., October 2004
- Dennis Fetterly, Mark Manasse, and Marc Najork, Spam, Damn Spam, and Statistics: Using statistical analysis to locate spam web pages, in 7th International Workshop on the Web and Databases (WebDB), Association for Computing Machinery, Inc., June 2004
- Dennis Fetterly, Mark Manasse, Marc Najork, and Janet Wiener, A Large-Scale Study of the Evolution of Web Pages, in Software: Practice & Experience, vol. 34, no. 2, pp. 213-237, Wiley, February 2004
- Dennis Fetterly, Mark Manasse, and Marc Najork, On the Evolution of Clusters of Near-Duplicate Web Pages, in Proceedings of the 1st Latin American Web Congress (LA-WEB), IEEE Computer Society, Washington, DC, USA, November 2003
- Dennis Fetterly, Mark Manasse, Marc Najork, and Janet Wiener, A large-scale study of the evolution of web pages, in Proceedings of the 12th International World Wide Web Conference (WWW), ACM, New York, NY, USA, May 2003
