Share this page
Share this page E-mail this page Print this page RSS feeds
Home > Publications > Indexing High-Dimensional Rectangles for Fast Multimedia Identification
Indexing High-Dimensional Rectangles for Fast Multimedia Identification

This paper addresses the problem of quickly performing point queries against high-dimensional regions. Such queries are useful in the increasingly important problems of multimedia identification and retrieval, where different database entries have different metrics for similarity. While the database literature has focused on indexing for high-dimensional nearest neighbor and epsilon range queries, indexing for point queries against high-dimensional regions has not been addressed. We present an efficient indexing method for these queries, which relies on the combination of redundancy and bit vector indexing to achieve significant performance gains. We have implemented our approach in a real-world audio fingerprinting system, and have obtained a factor of 56 speed-up over linear scan. Furthermore, the well-known Hilbert bulk-loaded R-Trees, a technique capable of searching low-dimensional regions, are shown to be ineffective in our audio fingerprinting system, because of the inherently high-dimensional properties of the problem.

bitvectortr.pdf
PDF file

Details

Type: Inproceedings
Pages: 10
Number: MSR-TR-2003-38
Institution: Microsoft Research