Academia.eduAcademia.edu

Outline

Towards Cross-Version Singing Voice Detection

2015

Abstract

In the field of Music Information Retrieval (MIR), the automated detection of the singing voice within a given music recording constitutes a challenging and important research problem. The goal of this task is to find those segments within a given recording where one or several singers are active. In this study, we investigate the performance of state-of-the-art approaches by considering various music scenarios. First, we validate our singing voice detection system, which incorporates well-known techniques from audio signal processing and machine learning, against a public benchmark. Second, we consider a controlled yet instructive scenario using multiple versions (interpretations by different musicians) of the 24 songs of the cycle “Winterreise” by Franz Schubert. Within this cross-version scenario, which comprises various singers and pianists as well as different recording conditions, we systematically address the following research questions: Is bootstrapping a viable approach fo...

References (14)

  1. Berenzweig, A. L. & Ellis, D. P. W.: Locating Singing Voice Segments within Music Signals. In: Proceedings of the IEEE Workshop on the Applica- tions of Signal Processing to Audio and Acoustics (WASPAA), 119-122. New Paltz, NY, USA (2001).
  2. Breiman, L.: Random forests. In: Machine learning (2001), 45, 1: 5-32.
  3. Ewert, S., Müller, M. & Grosche, P.: High Res- olution Audio Synchronization Using Chroma On- set Features. In: Proceedings of IEEE Interna- tional Conference on Acoustics, Speech, and Sig- nal Processing (ICASSP), 1869-1872. Taipei, Tai- wan (2009).
  4. Gärtner, D. & Dittmar, C.: Vocal characteristics classification of audio segments: An investigation of the influence of accompaniment music on low- level features. In: Proceedings of the International Conference on Machine Learning and Applications (ICMLA). Miami, Florida, USA (2009).
  5. Konz, V., Müller, M. & Kleinertz, R.: A Cross- Version Chord Labelling Approach for Exploring Harmonic Structures-A Case Study on Beethoven's Appassionata. In: Journal of New Music Research (2013), 1-17.
  6. Lehner, B., Sonnleitner, R. & Widmer, G.: Towards Lightweight, Real-time-capable Singing Voice De- tection. In: Proceedings of the International Con- ference on Music Information Retrieval (ISMIR). Porto, Portugal (2013).
  7. Lehner, B., Widmer, G. & Sonnleitner, R.: On the reduction of false positives in singing voice detec- tion. In: Proceedings of IEEE International Con- ference on Acoustics, Speech, and Signal Processing (ICASSP), 7480-7484. Florence, Italy (2014).
  8. Lukashevich, H. & Dittmar, C.: Effective singing voice detection in popular music using ARMA filter- ing. In: Proceedings of the International Conference on Digital Audio Effects (DAFx). Bordeaux, France (2007).
  9. Mauch, M., Fujihara, H., Yoshii, K. & Goto, M.: Timbre and Melody Features for the Recognition of Vocal Activity and Instrumental Solos in Polyphonic Music. In: Proceedings of the International Confer- ence on Music Information Retrieval (ISMIR), 233- 238. Miami, Florida, USA (2011).
  10. Nwe, T. L. & Wang, Y.: Automatic Detection of Vocal Segments in Popular Songs. In: Proceedings of the International Conference on Music Information Retrieval (ISMIR). Barcelona, Spain (2004).
  11. Prätzlich, T. & Müller, M.: Frame-Level Audio Seg- mentation for Abridged Musical Works. In: Pro- ceedings of the International Conference on Mu- sic Information Retrieval (ISMIR). Taipei, Taiwan (2014).
  12. Ramona, M., Richard, G. & David, B.: Vocal de- tection in music with support vector machines. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas, Nevada, USA (2008).
  13. Tzanetakis, G.: Song-specific Bootstrapping of Singing Voice Structure. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), 2027-2030. Taipei, Taiwan (2004).
  14. Vembu, S. & Baumann, S.: Separation of vocals from polyphonic audio recordings. In: Proceedings of the International Conference on Music Informa- tion Retrieval (ISMIR). London, UK (2005).