Dennis Fetterly
RSDE
Dennis is a Research Software Development Engineer in Microsoft Research's Silicon Valley lab, which he joined in May 2003. His research interests include web crawling, the evolution and similarity of pages on the web, identifying spam web pages, and large-scale distributed systems. He is currently working on DryadLINQ, TidyFS, and a project evaluating policies for corpus selection. Interesting past projects include the MSRBot web crawler; Dryad; the Your Desktop on Your Keychain project, which uses flash memory devices to let users carry their desktop PC state from machine to machine; and PageTurner, a large-scale study of the evolution of web pages.
Publications
- Dennis Fetterly, Nick Craswell, and Vishwa Vinay, The Impact of Crawl Policy on Web Search Effectiveness, in Proceedings of the 32nd annual international ACM SIGIR conference on Research and development in information retrieval, Association for Computing Machinery, Inc., 19 July 2009
- Dennis Fetterly, Nick Craswell, and Vishwa Vinay, Measuring the Search Effectiveness of a Breadth-First Crawl, in Proceedings of the 31st European Conference on Information Retrieval (ECIR), Springer Verlag, 9 April 2009
- Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep Kumar Gunda, and Jon Currey, DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language, in OSDI'08: Eighth Symposium on Operating System Design and Implementation, USENIX, December 2008
- Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Ulfar Erlingsson, Pradeep Kumar Gunda, Jon Currey, Frank McSherry, and Kannan Achan, Some sample programs written in DryadLINQ, no. MSR-TR-2008-74, May 2008
- Dennis Fetterly, Nick Craswell, and Vishwa Vinay, Search Effectiveness with a Breadth-First Crawl, in Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, 2008
- Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly, Dryad: Distributed Data-parallel Programs from Sequential Building Blocks, in Proceedings of the 2007 Eurosys Conference, Association for Computing Machinery, Inc., Lisbon, Portugal, March 2007
- Muthukarrupan Annamalai, Andrew Birrell, Dennis Fetterly, and Ted Wobber, Implementing Portable Desktops: a New Option and Comparisons, no. TR-2006-151, October 2006
- Alexandros Ntoulas, Marc Najork, Mark Manasse, and Dennis Fetterly, Detecting Spam Web Pages Through Content Analysis, in 15th International World Wide Web Conference (WWW), Association for Computing Machinery, Inc., Edinburgh, Scotland, May 2006
- Dennis Fetterly, Mark Manasse, and Marc Najork, Detecting Phrase-Level Duplication on the World Wide Web, in 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Association for Computing Machinery, Inc., Salvador, Brazil, August 2005
- Dennis Fetterly, Mark Manasse, and Marc Najork, On the Evolution of Clusters of Near-Duplicate Web Pages, in Journal of Web Engineering, vol. 2, no. 4, pp. 228-246, Institute of Electrical and Electronics Engineers, Inc., October 2004
- Dennis Fetterly, Mark Manasse, and Marc Najork, Spam, Damn Spam, and Statistics: Using statistical analysis to locate spam web pages, in 7th International Workshop on the Web and Databases (WebDB), Association for Computing Machinery, Inc., June 2004
- Dennis Fetterly, Mark Manasse, Marc Najork, and Janet Wiener, A Large-Scale Study of the Evolution of Web Pages, in Software: Practice & Experience, vol. 34, no. 2, pp. 213-237, Wiley, February 2004
- Dennis Fetterly, Mark Manasse, and Marc Najork, On the Evolution of Clusters of Near-Duplicate Web Pages, in Proceedings of the 1st Latin American Web Congress (LA-WEB), IEEE Computer Society, Washington, DC, USA, November 2003
- Dennis Fetterly, Mark Manasse, Marc Najork, and Janet Wiener, A large-scale study of the evolution of web pages, in Proceedings of the 12th International World Wide Web Conference (WWW), ACM, New York, NY, USA, May 2003