Key research themes
1. How can parametric and sinusoidal models improve low-bitrate audio coding efficiency and perceptual quality?
This research area focuses on advancing parametric audio coding techniques using sinusoidal and exponentially damped sinusoidal (EDS) models to efficiently represent audio signals at very low bitrates, while preserving perceptual quality. It addresses challenges in modeling transient signals, optimizing parameter quantization, and integrating psychoacoustic considerations to enhance sparse signal representation and subjective audio coding performance.
2. What are the advancements in hybrid and scalable audio coding techniques for universal speech and music compression?
This theme investigates methods integrating multiple coding paradigms—such as linear predictive coding (LPC), algebraic code-excited linear prediction (ACELP), transform coding (TCX), and scalable coding frameworks—to achieve efficient, universal audio coding that supports both speech and music signals. It includes mode switching, variable frame lengths, entropy coding improvements, and scalable-to-lossless transitions, aiming for robust quality across diverse content at multiple bitrates.
3. How can psychoacoustically-informed perceptual models and lossless coding techniques enhance audio compression fidelity and objective quality assessment?
This research direction integrates auditory perception models into audio codec design and lossless coding methods to optimize compression while preserving subjective audio quality. It also explores objective metrics aligned with human perception, enabling reliable prediction of coding artifacts impact. Lossless and perceptual lossless techniques using predictive filtering and entropy coding aim to maximize coding efficiency without compromising signal integrity.