Perceptual sensitivity to a model of the source spectrum

2013, The Journal of the Acoustical Society of America

https://doi.org/10.1121/1.4806316

Abstract

A psychoacoustic model of the source spectrum has been proposed in which four spectral slope parameters describe perception of overall voice quality: H1-H2 (the difference in amplitude between the first and second harmonics), H2-H4, H4-2000 Hz (where "2000 Hz" denotes the harmonic nearest 2000 Hz), and 2000-5000 Hz. The goals of this study are to evaluate perceptual sensitivity in the mid-to-high frequency range of the model and to determine how sensitivity to one parameter varies as a function of another. To determine listener sensitivity to slope changes for each parameter, just-noticeable differences were obtained for series of stimuli based on synthetic copies of one male and one female voice. Twenty listeners completed an adaptive up-down task. To provide a baseline of listener sensitivity to each spectral slope parameter, the synthetic voices were manipulated so that spectral slope varied in 0.5 dB increments for one parameter at a time while the other parameters remained constant. We then assessed how listener sensitivity to a given harmonic slope parameter changes when the others covary. These results will help assess the validity of the model and determine which sources of cross-voice variability in spectral configuration are perceptible.
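The adaptive up-down procedure mentioned above can be illustrated with a minimal sketch. The abstract does not state which tracking rule, starting value, or stopping criterion was used, so the 2-down/1-up rule, the 8 dB starting difference, the eight-reversal stopping rule, and the simulated listener below are all assumptions made for illustration; only the 0.5 dB step size comes from the text.

```python
def run_staircase(listener, start_db=8.0, step_db=0.5,
                  n_reversals=8, max_trials=500):
    """Hypothetical 2-down/1-up staircase (not necessarily the study's rule).

    `listener(delta_db)` returns True when the slope difference `delta_db`
    is detected. The JND estimate is the mean difference at the reversals.
    """
    delta = start_db
    correct_streak = 0
    last_direction = 0          # -1 = difference shrinking, +1 = growing
    reversals = []

    for _ in range(max_trials):
        if len(reversals) >= n_reversals:
            break
        if listener(delta):
            correct_streak += 1
            if correct_streak < 2:
                continue        # need two correct responses before stepping down
            correct_streak = 0
            direction = -1
        else:
            correct_streak = 0
            direction = +1      # one miss makes the next trial easier

        # A reversal is a change in the direction of the track.
        if last_direction != 0 and direction != last_direction:
            reversals.append(delta)
        last_direction = direction

        # Never go below one step, so the difference stays positive.
        delta = max(step_db, delta + direction * step_db)

    if not reversals:
        raise ValueError("no reversals recorded; increase max_trials")
    return sum(reversals) / len(reversals)


def make_ideal_listener(threshold_db):
    """Assumed noise-free listener: detects any difference >= threshold."""
    return lambda delta_db: delta_db >= threshold_db


jnd_estimate = run_staircase(make_ideal_listener(2.0))
print(f"Estimated JND: {jnd_estimate:.2f} dB")
```

With a noise-free listener the track simply oscillates between the last detectable and first undetectable step, so the estimate falls within one step of the true threshold. A real listener responds probabilistically, in which case a 2-down/1-up rule converges on roughly the 70.7% correct point of the psychometric function.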
