Academia.edu

Perceptual audio quality assessment

8 papers
2 followers
About this topic
Perceptual audio quality assessment is the evaluation of audio signals based on human perception, focusing on how listeners experience sound quality. This field employs subjective testing methods and objective algorithms to quantify audio fidelity, often considering factors such as clarity, distortion, and overall listener satisfaction.

Key research themes

1. How can listener variability and task design influence the reliability of perceptual voice and audio quality assessments?

This research theme focuses on the sources of variability in human perceptual ratings of voice and audio quality, and how experimental design choices (including rating scales, tasks, and listener backgrounds) affect reliability and agreement among listeners. Understanding these factors is crucial for developing standardized and valid clinical and perceptual evaluation protocols that yield consistent and interpretable results, which underpin both subjective assessments and the validation of objective quality metrics.

Key finding: This paper presents a detailed theoretical framework attributing variability in clinical voice quality ratings to multiple sources, including listener backgrounds and biases, the nature of the rating task, and random error....
Key finding: This paper introduces a web-based tool (WAET) implemented via the Web Audio API to conduct perceptual listening tests with flexible test types and interfaces, remotely deployable without programming knowledge. The framework...
Key finding: This psychoacoustic study designed a representative listening test paradigm mimicking distracted (everyday) listening rather than analytical listening to measure perceived degradation due to microphone handling noise. By...
Key finding: This study demonstrated that individual cognitive differences, specifically working memory capacity and selective attention, significantly influence subjective sound quality ratings in older listeners with near-normal...

2. What objective and subjective methods effectively quantify perceptual audio quality across diverse applications, including speech and spatial audio?

This research theme investigates computational models, objective metrics, and subjective testing methodologies used for evaluating audio quality perception in various contexts, ranging from speech communication systems (VoIP) and digital audio broadcasting to spatial audio and ambisonics. The focus is on comparing and validating algorithmic metrics against listener ratings, improving real-time assessment techniques, and extending quality evaluation to emerging audio formats while incorporating perceptual and spatial localization components.

by Peter Pocta and 1 more
Key finding: The paper combines subjective listening tests and objective quality models (PEAQ, POLQA Music) to assess the audio quality impact of typical lossy codecs in digital audio broadcasting and web-casting. Results show that low...
Key finding: This work proposes AMBIQUAL, a novel full-reference metric for assessing spatial audio quality of Ambisonic B-format signals, evaluating both Listening Quality and Localization Accuracy. The metric extends ViSQOLAudio by...
Key finding: This paper proposes CAQoE, a no-reference, context-aware speech quality metric designed for real-time VoIP applications under varying noise conditions. Unlike traditional metrics requiring reference signals, CAQoE initially...
Key finding: By experimentally comparing Perceptual Evaluation of Speech Quality (PESQ) and the E-Model under various network conditions and codecs in VoIP systems, this paper found discrepancies between off-line (PESQ) and real-time...
Key finding: This paper develops a non-intrusive speech quality evaluator (NI-SQE) using natural scene statistics on mean-subtracted contrast normalized spectrogram features. By avoiding the need for a pristine reference, their method...

3. How can computational modeling and pilot data aid the selection of stimuli and parameters to optimize perceptual audio and video quality studies?

This theme focuses on methodologies for selecting experimental stimuli, parameters, and degradation levels to maximize the perceptual discriminability and representativeness of audio and video quality assessments. It incorporates techniques such as perceptual similarity distances, multidimensional scaling, and statistical modeling to ensure even coverage of perceived quality ranges in subjective tests, thereby improving the robustness and interpretability of results. These design strategies impact both subjective test efficacy and objective metric validation.

Key finding: This paper proposes a paired-comparison based parameter selection methodology where observers judge similarity in quality between parameter-modulated video stimuli. The approach uses classical multidimensional scaling on...
Key finding: Combining subjective flicker tests with objective image quality metrics, this study proposes a methodology to determine objective metric thresholds guaranteeing visually lossless compression levels. Human observers performed...
Key finding: This study compares two fundamental computational models of image quality assessment, the Visible Differences Predictor (error sensitivity based) and the Structural Similarity Index (structural similarity based), against...
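The first finding in this theme mentions classical multidimensional scaling applied to pairwise similarity judgments. As a rough illustration only, and not the cited paper's actual procedure, classical MDS can be sketched with NumPy as below. The function name `classical_mds` and the four-stimulus dissimilarity matrix are hypothetical and chosen purely for the example.

```python
import numpy as np


def classical_mds(distances, n_components=2):
    """Embed items in n_components dimensions from a symmetric distance matrix."""
    d = np.asarray(distances, dtype=float)
    n = d.shape[0]
    # Double-centre the squared distances to obtain a Gram (inner-product) matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (d ** 2) @ centering
    # eigh returns eigenvalues in ascending order, so pick the largest ones.
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_components]
    # Clip small negative eigenvalues that arise from non-Euclidean dissimilarities.
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


# Hypothetical dissimilarities: four stimuli lying on a line at 0, 1, 2 and 4.
positions = np.array([0.0, 1.0, 2.0, 4.0])
dissimilarity = np.abs(positions[:, None] - positions[None, :])
coords = classical_mds(dissimilarity, n_components=1)
```

In this toy case the one-dimensional embedding reproduces the original pairwise distances, up to a shift and a possible sign flip. With real judgment data the dissimilarities are noisy, so the recovered configuration would only approximate them.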

All papers in Perceptual audio quality assessment

We present a system for content-based retrieval of perceptually similar sound events in audio documents ('sound spotting'), using a query by example. The system consists of three discrete stages: a front-end for feature extraction, a...
To facilitate better consistency between programs and stations, ITU, EBU and ARIB have investigated the standardization of broadcast loudness. This paper examines some consequences of a global loudness standard with regard to metering and...
The ITU-R BS.1770 multichannel loudness algorithm performs a sum of channel energies with weighting coefficients based on azimuth and elevation angles of arrival of the audio signal. In its current version, these coefficients were...
A systematic review of typical biases encountered in modern audio quality listening tests is presented. The following three types of bias are discussed in more detail: bias due to affective judgments, response mapping bias, and interface...
This paper provides complementary data to the review of biases in audio quality listening tests by Zieliński et al. (2008) [1]. The paper presents selected illustrations of range equalizing bias, centering bias, stimulus spacing bias,...
Basic perceptual quality of coded audio material is commonly evaluated using ITU-R BS.1534 MUSHRA (Multi Stimulus with Hidden Reference and Anchors) listening tests. MUSHRA guidelines call for experienced listeners. However, the majority...
There is a range of different methods for comparing or measuring the similarity between environmental sound effects. These methods can be used as objective evaluation techniques, to evaluate the effectiveness of a sound synthesis method...
(This material was originally intended as part of the article "The Loudness War: Background, Speculation and Recommendations" [1] but was removed for reasons of scope and to keep that article to a manageable length.) In this paper, I...
The current ITU standard for objective assessment of audio quality, Perceptual Evaluation of Audio Quality (PEAQ), has some shortcomings that prevent its reliable use for a number of coding conditions and some kinds of signals. The...
Over the last decades, the simulation of musical instruments by digital means has become an important part of modern music production and live performance. Since the first release of the Kemper Profiling Amplifier (KPA) in 2011,...
This paper presents a study undertaken to evaluate user ratings on auditory feedback of sound source selection within a multi-track auditory environment where sound placement is controlled by a gesture control system. Selection...
Perceptual listening tests are commonplace in audio research and a vital form of evaluation. While a large number of tools exist to run such tests, many feature just one test type, are platform dependent, run on proprietary software, or...
This paper presents a dual-sensor-based management system for television viewing in the home environment, in compliance with the BS.1770 loudness measurement standard. The aim of the research was mainly to address the issue of sudden increases in loudness...
While the LUFS standard was originally developed for broadcast applications, it offers a convenient means of calibrating program material stimuli to an equal loudness level, while remaining in a multichannel format. However, this...
In this age of DTV systems which allow wide dynamic range, we easily find media content which is not within the comfort zone of most listeners. With the advent of various object loudness measurement techniques and compliance...
Evaluating audio source separation algorithms means rating the quality or intelligibility of separated source signals. While objective criteria fail to account for all auditory phenomena so far, precise subjective ratings can be...
In the last 20 years there has been an increasing need for an objective method for evaluating audio from a perceptual point of view. Perceptual encoding and its prevalence in popular audio distribution models highlight the demand for...
There is a consensus among many in the audio industry that recorded music has grown increasingly compressed over the past few decades. Some industry professionals are concerned that this compression often results in poor audio quality...
The current ITU standard for objective assessment of audio quality, Perceptual Evaluation of Audio Quality (PEAQ), has some shortcomings that prevent its reliable use for a number of coding conditions and some kinds of signals. The...
Modern audio mastering procedures include selective equalisation of specific frequency bands for the tonal enhancement of the unmastered material. This process is mostly based on music scores or listening information, like the musical...
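
Several of the papers listed above refer to the ITU-R BS.1770 loudness measure, which sums per-channel energies using channel-dependent weights. The following is a minimal sketch of that summation step for a 5.1 layout, under stated assumptions: the input samples are taken to be already K-weighted, gating is omitted entirely, and the signal is assumed to be non-silent. The function name `weighted_loudness` and the dictionary layout are hypothetical and come from none of the listed papers.

```python
import math

# Channel weights for the five main channels of a 5.1 layout (the LFE channel
# is excluded from the measurement). The surround channels are weighted by 1.41.
CHANNEL_WEIGHTS = {"L": 1.0, "R": 1.0, "C": 1.0, "Ls": 1.41, "Rs": 1.41}


def weighted_loudness(channels):
    """Return ungated loudness in LKFS from a dict of channel name -> samples.

    Assumption: the samples have already been K-weighted, and at least one
    channel is non-silent (otherwise log10 would receive zero).
    """
    total = 0.0
    for name, samples in channels.items():
        # Mean-square energy of this channel over the measurement interval.
        mean_square = sum(s * s for s in samples) / len(samples)
        total += CHANNEL_WEIGHTS[name] * mean_square
    return -0.691 + 10.0 * math.log10(total)


# A single front channel with unit mean-square energy.
front_only = weighted_loudness({"L": [1.0, -1.0]})
# The same signal in a surround channel measures louder because of its weight.
surround_only = weighted_loudness({"Ls": [1.0, -1.0]})
```

A full measurement would also need the K-weighting pre-filter and the gating stages, which this sketch leaves out.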