Key research themes
1. How can feature extraction and dimensionality reduction improve accuracy in music genre and audio type classification?
This theme investigates the development and application of advanced feature extraction methods combined with dimensionality reduction techniques to enhance audio classification accuracy, particularly in music genre classification and speech/music discrimination. The focus lies on capturing relevant audio characteristics through timbral, spectral, and rhythmic features and optimizing their representation in reduced dimension spaces that preserve class-distinguishing information, facilitating more effective classification algorithms.
2. What roles do binaural and spatial features play in classifying complex acoustic scenes and spatial audio recordings?
This research theme focuses on the extraction and utilization of binaural spatial cues and spectro-temporal features for the classification of spatial audio scenes recorded with binaural setups. It addresses the classification of complex environments and sound distributions around a listener, which is essential for applications in virtual reality, audio indexing, and scene analysis. The studies explore feature selection, classifier performance, and challenges related to reverberation and source ambiguity in acoustically rich settings.
3. How are deep learning and neuromorphic approaches advancing audio event classification and bioacoustic signal recognition?
This theme examines the shift towards deep learning architectures, particularly convolutional neural networks (CNNs), and emerging neuromorphic computing techniques including spiking neural networks (SNNs) in audio event detection, environmental sound classification, and bioacoustic signal analysis. The focus lies in leveraging biologically inspired models and data-driven feature representations for improved robustness, scalability, and real-time processing capabilities across diverse audio classification tasks.