Academia.eduAcademia.edu

Outline

A Meta-Analysis of High Resolution Audio Perceptual Evaluation

https://doi.org/10.17743/JAES.2016.0015

Abstract

There is considerable debate over the benefits of recording and rendering high resolution audio, i.e., systems and formats that are capable of rendering beyond CD quality audio. We undertook a systematic review and meta-analysis to assess the ability of test subjects to perceive a difference between high resolution and standard, 16 bit, 44.1 or 48 kHz audio. All 18 published experiments for which sufficient data could be obtained were included, providing a meta-analysis involving over 400 participants in over 12,500 trials. Results showed a small but statistically significant ability of test subjects to discriminate high resolution content, and this effect increased dramatically when test subjects received extensive training. This result was verified by a sensitivity analysis exploring different choices for the chosen studies and different analysis approaches. Potential biases in studies, effect of test methodology, experimental design, and choice of stimuli were also investigated. The overall conclusion is that the perceived fidelity of an audio recording and playback chain can be affected by operating beyond conventional levels.

Key takeaways
sparkles

AI

  1. A meta-analysis of 18 studies with over 400 participants shows a small but significant ability to discriminate high resolution audio.
  2. Training significantly enhances discrimination ability, with trained subjects showing a strong and consistent effect compared to untrained subjects.
  3. Study design factors such as stimulus duration and methodology impact discrimination results, suggesting a need for improved experimental protocols.
  4. Potential biases in existing studies may lean towards Type II errors, indicating the true discrimination ability could be underestimated.
  5. The analysis emphasizes that perceived audio fidelity increases when operating beyond conventional standards, warranting further investigation into high resolution audio.

References (87)

  1. A. Flexer, et al., "A MIREX Meta-Analysis of Hub- ness in Audio Music Similarity," in Int. Soc. Music Inf. Retrieval Conf. (ISMIR), Porto, Portugal (2012).
  2. J. B. L. Smith and E. Chew, "A Meta-Analysis of the MIREX Structure Segmentation Task," in Int. Soc. Music Inf. Retrieval Conf. (ISMIR), Curitiba, Brazil (2013).
  3. M. McVicar, et al., "Automatic Chord Estima- tion from Audio: A Review of the State of the Art, " IEEE/ACM Trans. Audio, Speech Lang. Proc., vol. 22 (2014). doi:10.1109/TASLP.2013.2294580
  4. E. Hemery and J.-J. Aucouturier, "One Hundred Ways to Process Time, Frequency, Rate and Scale in the Central Auditory System: A Pattern-Recognition Meta- Analysis," Frontiers in Computational Neuroscience, 03 July 2015. doi:10.3389/fncom.2015.00080
  5. R. J. Wilson, "Special Issue Introduction: High- Resolution Audio," J. Audio Eng. Soc., vol. 52, p. 116 (2004 Mar.).
  6. J. R. Stuart and P. G. Craven, "A Hierarchical Ap- proach to Archiving and Distribution," presented at the 137th Convention of the Audio Engineering Society (2014 Oct.), convention paper 9178.
  7. H. van Maanen, "Requirements for Loudspeakers and Headphones in the 'High Resolution Audio' Era," pre- sented at the AES 51st International Conference: Loud- speakers and Headphones (2013 Aug.), conference paper 1-3
  8. J. R. Stuart, "Coding for High-Resolution Audio Sys- tems," J. Audio Eng. Soc., vol. 52, pp. 117-144 (2004 Mar.)
  9. J. R. Stuart, "High-Resolution Audio: A Perspec- tive," J. Audio Eng. Soc., vol. 63, pp. 831-832 (2015 Oct.).
  10. W. Woszczyk, "Physical and Perceptual Consider- ations for High-Resolution Audio," presented at the 115th Convention of the Audio Engineering Society (2003 Oct.), convention paper 5931.
  11. H. M. Jackson, et al., "The Audibility of Typical Digital Audio Filters in a High-Fidelity Playback System," presented at the 137th Convention of the Audio Engineering Society (2014 Oct.), convention paper 9174.
  12. J. Vanderkooy, "A Digital-Domain Listening Test for High-Resolution," presented at the 129th Convention of the Audio Engineering Society (2010 Nov.), convention paper 8203.
  13. B. Smagowska and M. Pawlaczyk-Łuszczyńska, "Effects of Ultrasonic Noise on the Human Body- A Bibliographic Review," Int. J. Occupational Safety and Ergonomics (JOSE), vol. 19, pp. 195-202 (2013). doi:10.1080/10803548.2013.11076978
  14. W. B. Snow, "Audible Frequency Ranges of Music, Speech, and Noise," J. Acoust. Soc. Am., vol. 3, pp. 155-166 (1931). doi:10.1121/1.1915552
  15. D. Gannett and I. Kerney, "The Discernibility of Changes in Program Band Width," Bell Systems Tech- nical J., vol. 23, pp. 1-10 (1944). doi:10.1002/j.1538- 7305.1944.tb03144.x
  16. H. Fletcher, Speech and Hearing in Communication (Princeton, New Jersey: Van Nostrand, 1953).
  17. T. Zislis and J. L. Fletcher, "Relation of High Fre- quency Thresholds to Age and Sex," J. Aud. Res., vol. 6, pp. 189-198 (1966)
  18. J. D. Harris and C. K. Meyers, "Tentative Audio- metric Threshold Level Standards from 8 to 18 kHz," J. Acoust. Soc. Am., vol. 49, pp. 600-608 (1971). doi:10.1121/1.1912392
  19. J. L. Northern, et al., "Recommended High- Frequency Audiometric Threshold Levels (8000- 18000
  20. Hz)," J. Acoust. Soc. Am., vol. 52, pp. 585-595 (1972). doi:10.1121/1.1913149
  21. D. R. Cunningham and C. P. Goetzinger, "Extra-High Frequency Hearing Loss and Hyperlipi- demia," Audiology, vol. 13, pp. 470-484 (1974). doi:10.3109/00206097409071710
  22. S. A. Fausti, et al., "A System for Evaluating Audi- tory Function from 8000-20000 Hz," J. Acoust. Soc. Am., vol. 66, pp. 1713-(1979).
  23. T. Oohashi, et al., "Multidisciplinary Study on the Hypersonic Effect," in Interareal Coupling of Human Brain Function,Int. Congress Series vol. 1226: (Elsevier, 2002), pp. 27-42. doi:10.1016/s0531-5131(01)00494-0
  24. B. Theiss and M. O. J. Hawksford, "Phantom Source Perception in 24 Bit @ 96 kHz Digital Audio," presented at the 103rd Convention of the Audio Engineer- ing Society (1997 Sep.), convention paper 4561.
  25. N. Kanetada, et al., "Evaluation of Sound Qual- ity of High Resolution Audio," in 1st IEEE/IIAE Int. Conf. Intelligent Systems and Image Processing (2013). doi:10.12792/icisip2013.014
  26. K. Ashihara and S. Kiryu, "Audibility of Com- ponents above 22 kHz in a Harmonic Complex Tones," Acustica -Acta Acustica, vol. 89, pp. 540-546 (2003)
  27. W. Bulla, "Detection of High-Frequency Harmon- ics in a Complex Tone," presented at the 139th Convention of the Audio Engineering Society (2015 Oct.), convention paper 4561.
  28. J.-E. Mortberg, "Is Dithered Truncation Preferred over Pure Truncation at a Bit Depth of 16 Bits when a Digital Requantization has Been Performed on a 24 Bit Sound File?," Bachelor, Lulea University of Technology (2007).
  29. M. L. Lenhardt, et al., "Human Ultrasonic Speech Perception," Science, vol. 253, pp. 82-85 (1991). doi:10.1126/science.2063208
  30. S. Nakagawa and S. Kawamura, "Temporary Threshold Shift in Audition Induced by Exposure to Ul- trasound via Bone Conduction," in 27th Annual Meeting Int. Soc. Psychophysics, Herzliya, Israel (2011).
  31. T. Hotehama and S. Nakagawaa, "Modulation De- tection for Amplitude-Modulated Bone-Conducted Sounds with Sinusoidal Carriers in the High-and Ultrasonic- Frequency Range," J. Acoust. Soc. Am., vol. 128, pp. 3011 (2010 Nov.). doi:10.1121/1.3493421
  32. K. Krumbholz, et al., "Microsecond Temporal Res- olution in Monaural Hearing without Spectral Cues?" J. Acoust. Soc. Am., vol. 113, pp. 2790-2800 (2003). doi:10.1121/1.1547438
  33. M. N. Kunchur, "Audibility of Temporal Smearing and Time Misalignment of Acoustic Signals," Technical Acoustics, vol. 17 (2007)
  34. M. N. Kunchur, "Temporal Resolution of Hear- ing Probed by Bandwidth Restriction," Acta Acustica united with Acustica, vol. 94, pp. 594-603 (2008). doi:10.3813/AAA.918069
  35. M. Kunchur, "Probing the Temporal Resolution and Bandwidth of Human Hearing," in Proceedings of Meetings on Acoustics, New Orleans (2007). doi:10.1121/1.2998548
  36. T. Muraoka, et al., "Examination of Audio Bandwidth Requirements for Optimum Sound Signal Transmission," J. Audio Eng. Soc., vol. 29, pp. 2-9 (1981 Jan./Feb.).
  37. K. Ashihara, et al., "Hearing Thresholds in Free- Field for Pure Tone above 20 kHz," in Int. Cong. Acoustics (ICA) (2004).
  38. K. Ashihara, et al., "Hearing Threshold for Pure Tones above 20 kHz," Acoust. Sci. & Technology, vol. 27, pp. 12-19 (2006 Jan.).
  39. K. Ashihara, "Hearing Thresholds for Pure Tones above 16 kHz," JASA Express Letters (2007 Aug.).
  40. M. Omata, et al., "A Psychoacoustic Measurement and ABR for the Sound Signals in the Frequency Range between 10 kHz and 24 kHz," presented at the 125th Con- vention of the Audio Engineering Society (2008 Oct.), con- vention paper 7566.
  41. M. Koubori, et al., "Psychoacoustic Measurement and Auditory Brainstem Response in the Frequency Range between 10 kHz and 30 kHz," presented at the 129th Con- vention of the Audio Engineering Society (2010 Nov.), con- vention paper 8294.
  42. J. N. Oppenheim and M. O. Magnasco, "Human Time-Frequency Acuity Beats the Fourier Uncertainty Prin- ciple," Physical Review Letters, vol. 110 (Jan. 25 2013). doi:10.1103/PhysRevLett.110.044301
  43. M. Majka, et al., "Hearing Overcome Uncer- tainty Relation and Measure Duration of Ultrashort Pulses," Europhysics News, vol. 46, pp. 27-31 (2015). doi:10.1051/epn/2015105
  44. T. Oohashi, et al., "High-Frequency Sound Above the Audible Range Affects Brain Electric Activity and Sound Perception," presented at the 91st Convention of the Audio Engineering Society (1991 Oct.), convention paper 3207.
  45. T. Oohashi, et al., "Inaudible High-Frequency Sounds Affect Brain Activity: Hypersonic Effect," J. Neu- rophysiology, vol. 83, pp. 3548-3558 (2000).
  46. R. Yagi, et al., "Auditory Display for Deep Brain Activation: Hypersonic Effect," in Int. Conf. Auditory Dis- play, Kyoto, Japan (2002)
  47. R. Yagi, et al., "Modulatory Effect of Inaudi- ble High-Frequency Sounds on Human Acoustic Percep- tion," Neuroscience Letters, vol. 351, pp. 191-195 (2003). doi:10.1016/j.neulet.2003.07.020
  48. A. Fukushima, et al., "Frequencies of Inaudible High-Frequency Sounds Differentially Affect Brain Ac- tivity: Positive and Negative Hypersonic Effects," PLOS One(2014). doi:10.1371/journal.pone.0095464
  49. R. Kuribayashi, et al., "High-Resolution Music with Inaudible High-Frequency Components Produces a Lagged Effect on Human Electroencephalographic Activ- ities," Clinical Neuroscience, vol. 25, pp. 651-655 (2014 Jun.). doi:10.1097/wnr.0000000000000151
  50. S. Han-Moi, et al., "Inaudible High-Frequency Sound Affects Frontlobe Brain Activity," Contempo- rary Engineering Sciences, vol. 23, pp. 1189-1196 (2014).
  51. M. Honda, et al., "Functional Neuronal Network Subserving the Hypersonic Effect," in Int. Cong. Acoustics (ICA), Kyoto (2004).
  52. T. Oohashi, et al., "The Role of Biological System other than Auditory Air-Conduction in the Emergence of the Hypersonic Effect," Brain Research, vol. 1073-1074, pp. 339-347 (2006). doi:10.1016/j.brainres.2005.12.096
  53. M. Higuchi, et al., "Ultrasound Influence on Im- pression Evaluation of Music," IEEE Pacific Rim Conf. Comm., Comp. and Sig. Proc. (PacRim) (2009 Aug.).
  54. M. Kamada and K. Toraichi, "Effects of Ultra- sonic Components on Perceived Tone Quality," in IEEE Int. Conf. Acoustics, Speech, and Signal Processing (1989). doi:10.1109/ICASSP.1989.266850
  55. S. Yoshikawa, et al., "Does High Sampling Fre- quency Improve Perceptual Time-Axis Resolution of Digi- tal Audio Signal?" presented at the 103rd Convention of the Audio Engineering Society (1997 Sep.), convention paper 4562.
  56. R. Yagi, et al., "A Method for Behavioral Evaluation of the 'Hypersonic Effect,' " Acoust. Sci. & Tech., vol. 24, pp. 197-200 (2003).
  57. D. Blech and M. Yang, "DVD-Audio versus SACD: Perceptual Discrimination of Digital Audio Coding For- mats," presented at the 116th Convention of the Audio En- gineering Society (2004 May), convention paper 6086.
  58. A. Marui, et al., "Subjective Evaluation of High Resolution Recordings in PCM and DSD Audio Formats," presented at the 136th Convention of the Audio Engineering Society (2014 Apr.), convention paper 9019.
  59. T. Nishiguchi and K. Hamasaki, "Differences of Hearing Impressions among Several High Sampling Digital Recording Formats," presented at the 118th Convention of the Audio Engineering Society (2005 May), convention paper 6469.
  60. G. Plenge, et al., "Which Bandwidth Is Necessary for Optimal Sound Transmission?" J. Audio Eng. Soc., vol. 28, pp. 114-119 (1980 Mar.).
  61. K. Ashihara, "Audibility of Complex Tones above 20 kHz," 29th Int. Cong. and Exhibition on Noise Control Engineering (InterNoise) (2000)
  62. K. Ashihara and S. Kiryu, "Detection Threshold for Tones above 22 kHz," presented at the 110th Convention of the Audio Engineering Society (2001 May), convention paper 5401.
  63. K. Hamasaki, et al., "Perceptual Discrimination of Very High Frequency Components in Musical Sound Recorded with a Newly Developed Wide Frequency Range Microphone," presented at the 117th Convention of the Audio Engineering Society (2004 Oct.), convention paper 6298.
  64. E. B. Meyer and D. R. Moran, "Audibility of a CD- Standard A/D/A Loop Inserted into High-Resolution Audio Playback," J. Audio Eng. Soc., vol. 55, pp. 775-779 (2007 Sep.).
  65. T. Nishiguchi, et al., "Perceptual Discrimination of Very High Frequency Components in Wide Frequency Range Musical Sound," Applied Acoustics, vol. 70, pp. 921-934 (2009). doi:10.1016/j.apacoust.2009.01.002
  66. H. M. Jackson, et al., "Further Investigations of the Audibility of Digital Audio Filters in a High-Fidelity Playback System," J. Audio Eng. Soc. (under review), 2016
  67. A. Pras and C. Guastavino, "Sampling Rate Dis- crimination: 44.1 kHz vs. 88.2 kHz," presented at the 128th Convention of the Audio Engineering Society (2010 May), convention paper 8101.
  68. S. Yoshikawa, et al., "Sound Quality Evaluation of 96 kHz Sampling Digital Audio," presented at the 99th Convention of the Audio Engineering Society (1995 Oct.), convention paper 4112.
  69. T. Nishiguchi, et al., "Perceptual Discrimination be- tween Musical Sounds with and without Very High Fre- quency Components," presented at the 115th Convention of the Audio Engineering Society (2003 Oct.), convention paper 5876.
  70. W. Woszczyk, et al., "Which of the Two Digital Audio Systems Best Matches the Quality of the Analog System?" presented at the AES 31st Int. Conference: New Directions in High Resolution Audio London, (2007 June), conference paper 30.
  71. M. Mizumachi, et al., "Subjective Evaluation of High Resolution Audio Under In-car Listening Environ- ments," presented at the 138th Convention of the Audio Engineering Society (2015 May), convention paper 9226.
  72. R. Repp, "Recording Quality Ratings by Music Pro- fessionals," in Int. Comp. Music Conf. (ICMC), New Or- leans (2006).
  73. R. King, et al., "How Can Sample Rates Be Properly Compared in Terms of Audio Quality?" presented at the 133rd Convention of the Audio Engineering Society (2012 Oct.), eBrief 77.
  74. B. C. J. Moore, "The Role of Temporal Fine Struc- ture Processing in Pitch Perception, Masking, and Speech Perception for Normal-Hearing and Hearing-Impaired Peo- ple," J. Assoc. Res. Otolaryngology, vol. 9, pp. 399-406 (2008). doi:10.1007/s10162-008-0143-x
  75. P. G. Craven, "Antialias Filters and System Tran- sient Response at High Sample Rates," J. Audio Eng. Soc., vol. 52, pp. 216-242 (2004 Mar.).
  76. G. S. Thekkadath and M. Spanner, "Comment on 'Human Time-Frequency Acuity Beats the Fourier Uncer- tainty Principle,' " Physical Review Letters, vol. 114 (2015).
  77. D. Dranove, "Comments on 'Audibility of a CD- standard A/D/A Loop Inserted into High-Resolution Audio Playback,' " J. Audio Eng. Soc., vol. 58, p. 173-174 (2010 Mar.).
  78. J. P. T. Higgins and S. Green, Eds., Cochrane Hand- book for Systematic Reviews of Interventions, Version 5.1.0 (The Cochrane Collaboration, 2011).
  79. ITU, "Subjective assessment of sound quality, Rec- ommendation BS.562-3" (1990).
  80. F. A. A. Kingdom and N. Prins, Psy- chophysics: A Practical Introduction (Academic Press, 2009).
  81. L. R. Rabiner and J. A. Johnson, "Perceptual Eval- uation of the Effects of Dither on Low Bit Rate PCM Sys- tems," The Bell System Technical J., vol. 51, (1972 Sep.). doi:10.1002/j.1538-7305.1972.tb02665.x
  82. P. Kvist, et al., "A Listening Test of Dither in Au- dio Systems," presented at the 118th Convention of the Audio Engineering Society (2005 May), convention paper 6328.
  83. M. Egger, et al., "Bias in Meta-Analysis Detected by a Simple, Graphical Test," BMJ vol. 315, pp. 629-634 (1997). doi:10.1136/bmj.315.7109.629
  84. J. Lau, et al., "The Case of the Misleading Funnel Plot," BMJ, vol. 333, pp. 597-600 (2006). doi:10.1136/bmj.333.7568.597
  85. J. J. Deeks and J. P. T. Higgins, "Statistical Algo- rithms in Review Manager 5," Statistical Methods Group of the Cochrane Collaboration, August 2010
  86. N. Mantel and W. Haenszel, "Statistical Aspects of the Analysis of Data from Retrospective Studies of Dis- ease," J. National Cancer Institute, vol. 22, pp. 719-748 (1959).
  87. R. DerSimonian and N. Laird, "Meta-Analysis in Clinical Trials," Controlled Clinical Trials, vol. 7, pp. 177- 188 (1986). doi:10.1016/0197-2456(86)90046-2