Key research themes
1. How can Zernike moments be effectively utilized for rotation-, scale-, and translation-invariant feature extraction in image-based pattern recognition tasks?
This research theme focuses on the application of Zernike moments and related orthogonal moment variants to extract robust features invariant under geometric transformations such as rotation, scaling, and translation for image analysis, particularly in face recognition, head pose estimation, and cosmic ray detection. The invariant and orthogonal properties of Zernike moments make them powerful tools for constructing compact, discriminative feature representations that improve classification and recognition performance while handling noise and varied image conditions.
2. What are the computational challenges and solutions for accurately and efficiently calculating Zernike and pseudo Zernike moments in practical applications?
Substantial research investigates algorithmic advances to reduce the high computational cost and numerical instability inherent in calculating traditional Zernike and pseudo Zernike moments, especially for high order moments or large images. Solutions include recursive algorithms, linear combinations with geometric moments, and integral image-based methods to accelerate computations while preserving numerical precision and orthogonality. Addressing computational efficiency and numerical errors is key for enabling real-time and large-scale applications of these moments in imaging.
3. How do Zernike moments and their variants integrate with machine learning methods to enhance classification and recognition tasks?
Integrating Zernike moment-based feature extraction with machine learning classifiers, including neural networks and discriminant analysis, has been shown effective for improving pattern recognition in images. Research explores novel hybrid descriptors combining Zernike moments with other statistical features, recursive and discriminant feature selection methods, and uses of orthogonal moments to produce compact, discriminative representations facilitating improved predictive accuracy in domains like face recognition and biomedical image classification.