Web Spam Detection

We investigated heuristics for automatically identifying "spam" web pages, i.e., pages created to enrich the publisher rather than to provide utility to the consumer.

Publications

Issued Patents

  • Marc A. Najork, Dennis C. Fetterly, Mark S. Manasse, Alexandros Ntoulas. Using content analysis to detect spam web pages. US patent 7,962,510, issued 6/14/2011.