Speech Recognition Arabic

description7 papers

group41 followers

lightbulbAbout this topic

Speech Recognition Arabic is a subfield of computational linguistics and artificial intelligence focused on the automatic identification and processing of spoken Arabic language. It involves the development of algorithms and models that enable machines to convert spoken Arabic into text, facilitating human-computer interaction and enhancing accessibility in various applications.

lightbulbAbout this topic

Key research themes

1. What acoustic features and modeling approaches improve vowel and phoneme recognition accuracy in Arabic speech recognition?

This theme explores acoustic feature extraction methods and modeling techniques targeting the unique phonetic characteristics of Arabic vowels and phonemes, including their length, dialectal variations, and diacritic ambiguity. Improving the representation and classification of these units is crucial to enhancing overall Arabic ASR system accuracy.

Comparative Analysis of Arabic Vowels using Formants and an Automatic Speech Recognition System

by amir hussain

2021

Key finding: The study demonstrates that analyzing first and second formants alongside a Hidden Markov Model (HMM)-based recognizer for Modern Standard Arabic vowels facilitates understanding vowel similarities and differences. It reveals... Read more

articleView Paper downloadDownload

Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System

by Professor (Dr.) Ratnadeep R . Deshmukh

2022, Engineering, Technology & Applied Science Research

Key finding: This paper shows that Power-Normalized Cepstral Coefficients (PNCC) and Modified Group Delay Function (ModGDF) outperform the widely used Mel-Frequency Cepstral Coefficients (MFCC) in Arabic speech recognition tasks. Using... Read more

articleView Paper downloadDownload

Constructing accurate and robust HMM/GMM models for an Arabic speech recognition system

by Mohamed ELHADJ

2024, International Journal of Speech Technology

Key finding: Experimental results show that integrating complementary acoustic features such as voiced formants and pitch with conventional MFCCs in HMM/GMM systems significantly reduces error rates for Arabic ASR. The study emphasizes... Read more

articleView Paper downloadDownload

On Developing an Automatic Speech Recognition System for Standard Arabic

by Fadoua Drira and

2015

Key finding: Through systematic experiments varying frame windowing, acoustic parameter numbers from MFCC and PLP, acoustic modeling units, Gaussian mixtures, and Baum-Welch reestimations, this study achieves 94.02% phoneme recognition... Read more

articleView Paper downloadDownload

Study of acoustic parameters and models for Automatic Standard Arabic Speech Recognition

by Adel M. Alimi

2016

Key finding: The research empirically validates the suitability of MFCC and PLP for feature extraction in Arabic ASR, and employs statistical HMM-based modeling with appropriate acoustic units and grammar for Standard Arabic. It... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. How can acoustic and language model integration, dialect variability, and corpus development improve multi-dialect Arabic ASR performance?

This research area focuses on addressing challenges posed by Arabic's multiple dialects, dialectal phonetic and orthographic variations, the scarcity of large annotated corpora, and morphological richness. It investigates corpus gathering, normalization of dialectal variants, deep learning architectures, and language modeling strategies to build robust multi-dialect Arabic ASR systems.

Multi-Dialect Arabic Speech Recognition

by Abbas Ali

2024, 2020 International Joint Conference on Neural Networks (IJCNN)

Key finding: Developed a large multi-dialect annotated speech corpus and used a combined convolutional and recurrent deep neural network architecture trained end-to-end with a beam search decoder coupled with a tetra-gram language model.... Read more

articleView Paper downloadDownload

Investigation Amazigh speech recognition using CMU tools

by Hassan Satori

2015

Key finding: This study shows that the CMU Sphinx HMM-based toolkit can be effectively adapted to resource-poor languages similar to Arabic in phonetic complexity, such as Amazigh, achieving 92.89% recognition accuracy on digits speech.... Read more

articleView Paper downloadDownload

Arabic Speech Recognition: Advancement and Challenges

by haifa Alhasson

2024, IEEE access

Key finding: The review highlights that multi-dialect Arabic ASR development requires addressing language dependency and complex morphology through tailored architectures and extensive datasets. It analyzes recent advances in ML and deep... Read more

articleView Paper downloadDownload

Arabic Dialect Processing

by Mona Diab

2025, Similar Languages, Varieties, and Dialects

Key finding: This work underscores that dialectal Arabic is the primary form used in spontaneous speech and new media, necessitating dialect-specific resources and lexicons for effective ASR. It emphasizes the significant phonological,... Read more

articleView Paper downloadDownload

Speech Recognition System of Arabic Digits Based on A Telephony Arabic Corpus

by Yousef Alotaibi

2016, The 2008 International …

Key finding: By utilizing a telephony Arabic corpus of digits and applying HMM-based recognition techniques, this study demonstrates the importance of handling Arabic phoneme classes, syllable structures, and phoneme variations across... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How can phoneme duration modeling and visual speech features enhance recognition of Quranic Arabic and improve robustness in noisy or challenging environments?

This area investigates specialized phonetic phenomena such as phoneme lengthening (Medd) in Quranic recitation and the use of visual lip movement features to aid recognition, especially where audio input is noisy or limited. These methodological advances aim at improving phoneme classification accuracy in religious Arabic recitations and general speech recognition robustness leveraging visual cues.

Rule-Based Embedded HMMs Phoneme Classification to Improve Qur’anic Recitation Recognition

by Mohammed A. H. Ali

2024, Electronics

Key finding: Introduces a Rule-Based Phoneme Duration Algorithm integrated with HMMs that models the phoneme lengthening (Medd) specific to Quranic recitations, capturing phoneme duration features governed by Tajweed rules. This approach... Read more

articleView Paper downloadDownload

Lips Reading Spoken Arabic Word Based on The Geometric Shape Features of The Lip

by International Journal of Scientific Research in Science and Technology IJSRST

2023, International Journal of Scientific Research in Science and Technology

Key finding: Proposes a visual speech recognition method utilizing geometric features extracted from 20 lip landmarks to classify spoken Arabic words based on lip shape and movement without audio input. Experimental methodology emphasizes... Read more

articleView Paper downloadDownload

Arabic language learning assistance based on automatic speech recognition system

by Ayoub Maatallaoui and

2012, world-comp.org

Key finding: Develops a speaker-independent Arabic ASR system using CMU Sphinx with specific focus on pronunciation error detection for language learning applications, particularly in assessing phonetically challenging features such as... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Speech Recognition Arabic

Arabic Speech Recognition System Based on CMUSphinx

by Hassan Satori

In this paper we present the creation of an Arabic version of Automated Speech Recognition System (ASR). This system is based on the open source Sphinx-4, from the Carnegie Mellon University. Which is a speech recognition system based on... more