Academia.eduAcademia.edu

Outline

Audio Matching via Chroma-Based Statistical Features

2005, International Symposium/Conference on Music Information Retrieval

Abstract

In this paper, we describe an efficient method for audio matching which performs effectively for a wide range of classical music. The basic goal of audio matching can be described as follows: consider an audio database containing several CD recordings for one and the same piece of music interpreted by various musicians. Then, given a short query audio clip of one interpretation, the goal is to automatically retrieve the corresponding excerpts from the other interpretations. To solve this problem, we introduce a new type of chroma-based audio feature that strongly correlates to the harmonic progression of the audio signal. Our feature shows a high degree of robustness to variations in parameters such as dynamics, timbre, articulation, and local tempo deviations. As another contribution, we describe a robust matching procedure, which allows to handle global tempo variations. Finally, we give a detailed account on our experiments, which have been carried out on a database of more than 110 hours of audio comprising a wide range of classical music.

References (8)

  1. E. Allamanche, J. Herre, B. Fröba, and M. Cremer. AudioID: Towards Content-Based Identification of Audio Material. In Proc. 110th AES Convention, Amsterdam, NL, 2001.
  2. M. A. Bartsch and G. H. Wakefield. Audio thumbnailing of pop- ular music using chroma-based representations. IEEE Trans. on Multimedia, 7(1):96-104, Feb. 2005.
  3. N. Hu, R. Dannenberg, and G. Tzanetakis. Polyphonic audio matching and alignment for music retrieval. In Proc. IEEE WASPAA, New Paltz, NY, October 2003.
  4. F. Kurth, M. Clausen, and A. Ribbrock. Identification of highly distorted audio material for querying large scale data bases, 2002.
  5. M. Müller, F. Kurth, and T. Röder. Towards an efficient algo- rithm for automatic score-to-audio synchronization. In Proc. ISMIR, Barcelona, Spain, 2004.
  6. R. J. Turetsky and D. P. Ellis. Force-Aligning MIDI Syntheses for Polyphonic Music Transcription Generation. In Proc. IS- MIR, Baltimore, USA, 2003.
  7. G. Tzanetakis, A. Ermolinskyi, and P. Cook. Pitch histograms in audio and symbolic music information retrieval. In Proc. ISMIR, Paris, France, 2002.
  8. A. Wang. An Industrial Strength Audio Search Algorithm. In Proc. ISMIR, Baltimore, USA, 2003.