Audio Matching via Chroma-Based Statistical Features
2005, International Symposium/Conference on Music Information Retrieval
Abstract
In this paper, we describe an efficient method for audio matching which performs effectively for a wide range of classical music. The basic goal of audio matching can be described as follows: consider an audio database containing several CD recordings for one and the same piece of music interpreted by various musicians. Then, given a short query audio clip of one interpretation, the goal is to automatically retrieve the corresponding excerpts from the other interpretations. To solve this problem, we introduce a new type of chroma-based audio feature that strongly correlates to the harmonic progression of the audio signal. Our feature shows a high degree of robustness to variations in parameters such as dynamics, timbre, articulation, and local tempo deviations. As another contribution, we describe a robust matching procedure, which allows to handle global tempo variations. Finally, we give a detailed account on our experiments, which have been carried out on a database of more than 110 hours of audio comprising a wide range of classical music.
References (8)
- E. Allamanche, J. Herre, B. Fröba, and M. Cremer. AudioID: Towards Content-Based Identification of Audio Material. In Proc. 110th AES Convention, Amsterdam, NL, 2001.
- M. A. Bartsch and G. H. Wakefield. Audio thumbnailing of pop- ular music using chroma-based representations. IEEE Trans. on Multimedia, 7(1):96-104, Feb. 2005.
- N. Hu, R. Dannenberg, and G. Tzanetakis. Polyphonic audio matching and alignment for music retrieval. In Proc. IEEE WASPAA, New Paltz, NY, October 2003.
- F. Kurth, M. Clausen, and A. Ribbrock. Identification of highly distorted audio material for querying large scale data bases, 2002.
- M. Müller, F. Kurth, and T. Röder. Towards an efficient algo- rithm for automatic score-to-audio synchronization. In Proc. ISMIR, Barcelona, Spain, 2004.
- R. J. Turetsky and D. P. Ellis. Force-Aligning MIDI Syntheses for Polyphonic Music Transcription Generation. In Proc. IS- MIR, Baltimore, USA, 2003.
- G. Tzanetakis, A. Ermolinskyi, and P. Cook. Pitch histograms in audio and symbolic music information retrieval. In Proc. ISMIR, Paris, France, 2002.
- A. Wang. An Industrial Strength Audio Search Algorithm. In Proc. ISMIR, Baltimore, USA, 2003.