Using Audio Fingerprinting for Duplicate Detection and Thumbnail Generation
- Chris J.C. Burges ,
- Dan Plastina ,
- John Platt ,
- Erin Renshaw ,
- Henrique S. Malvar
International Conference of Acoustic, Speech and Signal Processing |
Published by Institute of Electrical and Electronics Engineers, Inc.
Audio fingerprinting is a powerful tool for identifying file-based or streaming audio, using a database of fingerprints. This paper presents two new applications of audio fingerprinting: duplicate detection, whose goal is to identify duplicate audio clips in a set, even if they differ in compression quality or duration, and thumbnail generation, which aims to provide a representative short clip of a music track. Neither application requires an external database of fingerprints. Thanks to the robustness of the fingerprinting engine, both applications perform well; the duplicate detector has a false positive rate that is conservatively bounded above by 1% on a very large data set, and the thumbnail generator significantly outperforms using a fixed window.
© 2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.