Key research themes
1. What acoustic features and modeling approaches improve vowel and phoneme recognition accuracy in Arabic speech recognition?
This theme explores acoustic feature extraction methods and modeling techniques targeting the unique phonetic characteristics of Arabic vowels and phonemes, including their length, dialectal variations, and diacritic ambiguity. Improving the representation and classification of these units is crucial to enhancing overall Arabic ASR system accuracy.
2. How can acoustic and language model integration, dialect variability, and corpus development improve multi-dialect Arabic ASR performance?
This research area focuses on addressing challenges posed by Arabic's multiple dialects, dialectal phonetic and orthographic variations, the scarcity of large annotated corpora, and morphological richness. It investigates corpus gathering, normalization of dialectal variants, deep learning architectures, and language modeling strategies to build robust multi-dialect Arabic ASR systems.
3. How can phoneme duration modeling and visual speech features enhance recognition of Quranic Arabic and improve robustness in noisy or challenging environments?
This area investigates specialized phonetic phenomena such as phoneme lengthening (Medd) in Quranic recitation and the use of visual lip movement features to aid recognition, especially where audio input is noisy or limited. These methodological advances aim at improving phoneme classification accuracy in religious Arabic recitations and general speech recognition robustness leveraging visual cues.