On object-based audio with reverberation

Abstract

Object-based audio is gaining momentum as a means for future audio productions to be format-agnostic and interactive. Recent standardization developments make recommendations for object formats; however, the capture, production and reproduction of reverberation remain an open issue. In this paper, we review approaches for recording, transmitting and rendering reverberation over a 3D spatial audio system. Techniques include channel-based approaches, where room signals intended for a specific reproduction layout are transmitted, and synthetic reverberators, where the room effect is constructed at the renderer. We consider how each approach translates into an object-based context, taking into account the end-to-end production chain of capture, representation, editing, and rendering. We discuss some application examples to highlight the implications of the various approaches.

AES 60th International Conference, Leuven, Belgium, 2016 February 3-5