3D Visual Phrases for Landmark Recognition

Qiang Hao; Rui Cai; Zhiwei Li (李志伟); Lei Zhang; Yanwei Pang; Feng Wu

3D Visual Phrases for Landmark Recognition

Qiang Hao ,
Rui Cai ,
Zhiwei Li (李志伟) ,
Lei Zhang ,
Yanwei Pang ,
Feng Wu

Proc. of the 25th IEEE Conference on Computer Vision and Pattern Recognition | June 2012

Published by Institute of Electrical and Electronics Engineers, Inc.

Publication

Download BibTex

In this paper, we study the problem of landmark recognition and propose to leverage 3D visual phrases to improve the performance. A 3D visual phrase is a triangular facet on the surface of a reconstructed 3D landmark model. In contrast to existing 2D visual phrases which are mainly based on co-occurrence statistics in 2D image planes, such 3D visual phrases explicitly characterize the spatial structure of a 3D object (landmark), and are highly robust to projective transformations due to viewpoint changes. We present an effective solution to discover, describe, and detect 3D visual phrases. The experiments on 10 landmarks have achieved promising results, which demonstrate that our approach provides a good balance between precision and recall of landmark recognition while reducing the dependence on post-verification to reject false positives.

© 2012 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.