Improve Audio Representation by Using Feature Structure Patterns

  • Rui Cai,
  • Lie Lu,
  • Hong-Jiang Zhang,
  • Lian-Hong Cai

Published by Institute of Electrical and Electronics Engineers, Inc.

Publication

Although statistical characteristics of audio features are widely used for audio representation in most current audio analysis systems and have proved effective, they capture only the average feature variations over time and thus lead to ambiguities in some cases. Structure patterns, which describe the representative structural characteristics of both temporal and spectral features, are proposed to improve audio representation. In this paper, three kinds of structure patterns, namely the energy envelope pattern, the sub-band spectral shape pattern, and the harmonicity prominence pattern, are proposed or refined as a continuation of our previous work [1]. Evaluations on a content-based audio retrieval system with more than 1500 clips showed very encouraging results.