Learning Query Intent from Regularized Click Graphs

Xiao Li, Ye-Yi Wang, and Alex Acero

Abstract

This work presents the use of click graphs in improving query intent classifiers, which are critical if vertical search and general-purpose search services are to be offered in a unified user interface. Previous works on query classification have primarily focused on improving feature representation of queries, e.g., by augmenting queries with search engine results. In this work, we investigate a completely orthogonal approach — instead of enriching feature representation, we aim at drastically increasing the amounts of training data by semi-supervised learning with click graphs. Specifically, we infer class memberships of unlabeled queries from those of labeled ones according to their proximities in a click graph. Moreover, we regularize the learning with click graphs by content-based classification to avoid propagating erroneous labels. We demonstrate the effectiveness of our algorithms in two different applications, product intent and job intent classification. In both cases, we expand the training data with automatically labeled queries by over two orders of magnitude, leading to significant improvements in classification performance. An additional finding is that with a large amount of training data obtained in this fashion, classifiers using only query words/phrases as features can work remarkably well.

Details

Publication typeInproceedings
Published inSIGIR'08: the 31st Annual ACM SIGIR conference on Research and Development in Information Retrieval
Pages339-346
AddressSingapore, Singapore
PublisherAssociation for Computing Machinery, Inc.
> Publications > Learning Query Intent from Regularized Click Graphs