Key research themes
1. How can spatial-temporal feature extraction and data fusion improve accuracy and robustness in human action recognition?
This research theme investigates the development and integration of spatial and temporal features for human action recognition (HAR), focusing on methods that combine multiple feature types or modalities to capture intricate motion and appearance cues. The goal is to enhance recognition accuracy and robustness across varied environmental conditions and datasets by effectively modeling both static pose and dynamic movement patterns.
2. What are the effective dimensionality reduction strategies for handling high-dimensional features in large-scale human action recognition datasets?
This area focuses on addressing the computational and storage challenges posed by increasingly high-dimensional feature vectors, especially those derived from Fisher vectors and Bag-of-Words models on large-scale datasets. The studies explore how dimensionality reduction techniques such as principal component analysis (PCA) or learned projections can unearth latent structures in feature spaces, reduce redundancy, and facilitate efficient and accurate classification in expansive HAR datasets comprising numerous action classes and real-world variability.
3. How can skeletal data and body part representations be leveraged for efficient and interpretable human action recognition?
This theme explores techniques that utilize human skeleton-based features and body part models to improve interpretability, reduce feature size, and increase recognition accuracy. Approaches include representing body dimensions variations, part-based graph models, and compact skeleton descriptors to capture meaningful and discriminative motion patterns. Such methods offer the advantage of robustness to occlusion and viewpoint changes and facilitate lightweight, explainable HAR systems.