Academia.eduAcademia.edu

Outline

Kernel techniques for generalized audio crossfades

2015, Cogent Mathematics

https://doi.org/10.1080/23311835.2015.1102116

Abstract

This paper explores a variety of density and kernel-based techniques that can smoothly connect (crossfade or "morph" between) two functions. When the functions represent audio spectra, this provides a concrete way of adjusting the partials of a sound while smoothly interpolating between existing sounds. The approach can be applied to both interpolation-crossfades (where the crossfade connects two different sounds over a specified duration) and to repetitive-crossfades (where a series of sounds are generated, each containing progressively more features of one sound and fewer of the other). The interpolation surface can be thought of as the two dimensions (time and frequency) of a spectrogram, and the kernels can be chosen so as to constrain the surface in a number of desirable ways. When successful, the timbre of the sounds is changed dynamically in a plausible way. A series of sound examples demonstrate the strengths and weaknesses of the approach.

References (17)

  1. F. Boccardi and C. Drioli, "Sound Morphing With Gaussian Mixture Models," Proc. 4th COST G-6 Workshop on Digital Audio Effects, Limerick, Ireland, Dec. 2001.
  2. M Dolson, "The phase vocoder: a tutorial," Computer Music Journal, Spring, Vol. 10, No. 4, 14-27, 1986.
  3. T. Erbe, Soundhack Manual, Frog Peak Music, Lebanon, NH, 1994 (pp. 7- 40).
  4. E. Farnetani and D. Recasens, "Coarticulation and Connected Speech Pro- cesses," in Handbook of Phonetic Sciences, 2cnd Edition, Ed. W. J. Hardcas- tle, J. Laver, F. E. Gibbon, Blackwell Pubs. 2010 (pp. 316-352).
  5. K. Fitz, L. Haken, S. Lefvert, and M. O'Donnell, "Sound morphing using Loris and the reassigned bandwidth-enhanced additive sound model: Practice and applications," in International Computer Music Conference, Gotenborg, Sweden, 2002.
  6. K. E. Gustafson, Introduction to Partial Differential Equations and Hilbert Space Methods, John Wiley and Sons, Hoboken, NJ, 1980 (pp. 1-35).
  7. W. Hatch, High-Level Audio Morphing Strategies, MS Thesis, McGill Uni- versity, Aug. 2004.
  8. J. Laroche and M. Dolson, "Improved phase vocoder time-scale modification of audio," IEEE Trans. on Audio and Speech Processing, Vol. 7, No. 3, May 1999.
  9. R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a si- nusoidal representation," IEEE Trans. on Acoustics, Speech, and Signal Pro- cessing ASSP-34(4), 744-754, 1986.
  10. F. Oberhettinger, Fourier Transforms of Distributions and Their Inverses Academic Press, New York, 1973 (pp. 15-17).
  11. A. V. Oppenheim and R. W. Schafer, Discrete-Time Signal Processing, 3rd Edition, Prentice-Hall, New Jersey, 2009 (pp. 730-742).
  12. L. Polansky, and M. McKinney, "Morphological mutation functions: ap- plications to motivic transformations and to a new class of cross-synthesis techniques," Proc. of the ICMC, Montreal, 1991.
  13. X. Serra, "Sound hybridization based on a deterministic plus stochastic de- composition model," in Proc. of the 1994 International Computer Music Con- ference, Aarhus, Denmark, 348351, 1994.
  14. W. A. Sethares, Rhythm and Transforms, Springer-Verlag, London, UK 2007 (pp. 111-145)
  15. W. A. Sethares, A. Milne, S. Tiedje , A. Prechtl and J. Plamondon, "Spectral tools for dynamic tonality and audio morphing," Computer Music Journal, Vol. 33, No. 2, Pages 71-84, Summer 2009.
  16. M. Slaney, M. Covell, and B. Lassiter, "Automatic audio morphing," Proc- ceedings of the 1996 International Conference on Acoustics, Speech, and Sig- nal Processing, Atlanta, GA, May 1996.
  17. E. Tellman, L. Haken, B. Holloway, "Timbre morphing of sounds with un- equal numbers of features" Journal of the Audio Engineering Society, Vol. 43, No. 9, 678-689, Sept. 1995.