Improve Audio Representation by Using Feature Structure Patterns
- Rui Cai ,
- Lie Lu ,
- Hong-Jiang Zhang ,
- Lian-Hong Cai
Published by Institute of Electrical and Electronics Engineers, Inc.
Although statistical characteristics of audio features are widely used for audio representation in most of current audio analysis systems and have been proved to be effective, they only utilized the average feature variations over time, and thus lead to ambiguities in some cases. Structure patterns, which describe the representative structure characteristics of both temporal and spectral features, are proposed to improve audio representation. In this paper, three kind structure patterns, including energy envelope pattern, sub-band spectral shape pattern and harmonicity prominence pattern, are proposed or refined, as successive development of our previous work [1]. Evaluations on a content-based audio retrieval system with more than 1500 clips showed very encouraging results.
© 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.