Papers by Mohammad Faidzul Nasrudin

Journal of Information and Communication Technology
This paper describes the design and creation of a monolingual parallel corpus for the Malay language written in Jawi. It proposes a new corpus called the National University of Malaysia Word Tokenization (NUWT) corpora. To the best of our knowledge, there is currently no sufficiently comprehensive, well-designed standard corpus for the Jawi script that is annotated and publicly available. The corpus contains the Jawi-specific Buckwalter character code and can be used to evaluate the performance of word tokenization tasks, as well as further language processing. The objective of this work is to conform and standardize the corpora between similar characters in Jawi. The corpus consists of three subcorpora with documents from different genres. The gathering and processing steps, as well as the definition of several evaluation tasks regarding the use of these corpora, are included in this paper. Tokenization, one of the fundamental tasks the corpus supports, is also presented. The development of the Malay language tokenizer is based on the syntactic data compatibility of Malay words written in Jawi. A series of experiments was performed to validate the corpus and to fulfill the requirement of the Jawi script tokenizer, with an average error rate of 0.020255. Based on this promising result, the tokens will be used for disambiguation and unknown word resolution, such as the out-of-vocabulary (OOV) problem in the tagging process.

Symmetry, 2022
An exploration task can be performed by a team of mobile robots more efficiently than by human counterparts. Robots can access and give live updates from hard-to-reach areas such as a disaster site or a sewer. However, the symmetrical shape of some environments hinders optimal path planning. Multiple robots are expected to explore more area in less time while solving robot localization and collision-avoidance issues. When deploying a multi-robot system, it must be ensured that the hardware parts do not collide with each other or with the surroundings, especially in symmetric environments. Two types of methods are used for collision avoidance: centralized and decentralized. The decentralized approach has mainly been used in recent times, as it is computationally less expensive. This article aims to conduct a systematic literature review of different collision-avoidance strategies and analyze the performance of innovative collision-avoidance techniques. Differ...

Journal of theoretical and applied information technology, 2016
There are many causes of deformation in an image, one of which is its acquisition as a digital image. The deformation takes different forms or causes different effects in the acquired image compared with the original, including poor resolution, shear, noise, and variation in intensity. A paper scanned by a scanner is a good example of possible deformation in images. Consequently, paper texture identification, or fingerprinting, is one of the research fields of pattern recognition that is exposed to the deformation problem. Applications such as document authentication are constrained by it. One of the well-known methods for image texture extraction is the Local Binary Pattern (LBP) method. However, the LBP method shows a number of drawbacks in paper image texture extraction, two of which are that it neglects some texture information in the images and that it copes poorly with some image deformations because of its local view. In this paper ...

Automatic License Plate Recognition (ALPR) has a wide range of commercial applications, such as finding stolen cars, controlling access to car parks, and gathering traffic flow statistics. Existing Libyan License Plate Recognition (LLPR) methods have not produced promising results because the features they use for the extracted characters and numbers are inefficient. In this work, an improved LLPR method is presented. The method is composed of five stages: pre-processing, license plate extraction, character and number segmentation, feature extraction, and license plate recognition. In pre-processing, undesired data such as background noise are removed. Then, the license plate is extracted using a few mathematical morphology operations, Connected Component Analysis (CCA), and Region of Interest (ROI) extraction. After that, characters and numbers are extracted from the image regions of the license plate. A combination of geometrical features and Gabor features is used to represent each character and word in the plates. Recognition is then done using template matching and a Probabilistic Neural Network (PNN) classifier. The performance of the proposed method is evaluated and tested using 100 self-collected images of Libyan national license plates. The experimental results show that the proposed method produces promising results and is superior to other existing methods.

Excessive amounts of image spam cause many problems for e-mail users. Since image spam is difficult to detect using conventional text-based spam approaches, various image processing techniques have been proposed. In this paper, we present an ensemble method using frequent itemset mining (FIM) for filtering image spam. Although FIM techniques are well established in data mining, they are not commonly used in ensemble methods. To obtain good filtering performance, a SIFT descriptor is used, since it is widely known as an effective image descriptor. K-means clustering is applied to the SIFT keypoints to produce a visual codebook. The bag-of-words (BOW) feature vector for each image is generated using a hard bag-of-features (HBOF) approach. FIM descriptors are obtained from the frequent itemsets of the BOW feature vectors. We combine BOW and FIM with three different feature selection methods, namely Information Gain (IG), Symmetrical Uncertainty (SU) and Chi Square (CS...
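
The hard bag-of-features step mentioned in this abstract can be illustrated with a minimal sketch. It assumes the visual codebook has already been produced (for example by k-means) and that descriptors are plain lists of numbers. The function names `nearest_codeword` and `hard_bow_histogram` are hypothetical and are not taken from the paper.

```python
def nearest_codeword(descriptor, codebook):
    """Return the index of the codeword closest to the descriptor
    (squared Euclidean distance). Hypothetical helper for illustration."""
    best_index, best_dist = 0, float("inf")
    for i, word in enumerate(codebook):
        dist = sum((x - y) ** 2 for x, y in zip(descriptor, word))
        if dist < best_dist:
            best_index, best_dist = i, dist
    return best_index


def hard_bow_histogram(descriptors, codebook):
    """Hard assignment: each descriptor votes for exactly one codeword."""
    hist = [0] * len(codebook)
    for descriptor in descriptors:
        hist[nearest_codeword(descriptor, codebook)] += 1
    return hist


# Toy 2-D "descriptors"; real SIFT descriptors have 128 dimensions.
codebook = [[0, 0], [10, 10]]
descriptors = [[1, 1], [9, 9], [0, 2]]
histogram = hard_bow_histogram(descriptors, codebook)
```

With hard assignment every descriptor contributes to a single bin, so the histogram counts sum to the number of descriptors in the image.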

Indonesian Journal of Electrical Engineering and Computer Science, 2020
With the substantial expansion of image information, image processing and computer vision have significant roles in several applications, including image classification, image segmentation, pattern recognition, and image retrieval. An important feature that has been applied in many image applications is texture. Texture is the characteristic of a set of pixels that form an image. Therefore, analyzing texture has a significant impact on segmenting an image or detecting important portions of an image. This paper provides a review on LBP and its modifications. The aim of this review is to show the current trends for using, modifying and adapting LBP in the domain of image processing.
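
For readers unfamiliar with LBP, the following is a minimal sketch of the basic 3x3 operator, not of any specific variant covered by the review. It assumes a grey-level image stored as a list of rows. The neighbour ordering and the `>=` comparison are common conventions chosen here as assumptions, and the names `lbp_code` and `lbp_histogram` are hypothetical.

```python
def lbp_code(img, r, c):
    """8-neighbour LBP code of the interior pixel (r, c).

    Each neighbour contributes one bit: 1 if it is at least as bright
    as the centre pixel, 0 otherwise.
    """
    center = img[r][c]
    # Neighbours visited clockwise from the top-left pixel
    # (an arbitrary but fixed order; other orders are also used).
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for bit, (dr, dc) in enumerate(offsets):
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(img):
    """256-bin histogram of LBP codes over all interior pixels."""
    hist = [0] * 256
    for r in range(1, len(img) - 1):
        for c in range(1, len(img[0]) - 1):
            hist[lbp_code(img, r, c)] += 1
    return hist


patch = [[1, 2, 3],
         [8, 5, 4],
         [7, 6, 9]]
code = lbp_code(patch, 1, 1)
```

The histogram of codes, rather than the codes themselves, is what is typically used as a texture descriptor.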

International Journal on Advanced Science, Engineering and Information Technology, 2019
Document image binarization is the first essential step in digitizing images and is considered an essential technique in both document image analysis applications and optical character recognition. The binarization process is used to obtain a binary image from the original image. A binary image is the proper representation for image segmentation, recognition, and restoration, as underlined by several studies showing that the next step of document image analysis depends on the binarization result. However, old and historical document images mainly suffer from several types of degradation, such as bleed-through, blur, and uneven illumination, which make binarization a difficult task. Extracting the foreground from a degraded background therefore depends on the degradation, as well as on the type of paper used and the age of the document. Improved binarization methods are necessary to decrease the impact of degradation in the document background. To resolve this difficulty, this paper proposes an effective, enhanced binarization technique for degraded and historical document images. The proposed method is based on enhancing an existing binarization method by modifying parameters and adding a post-processing stage, thus improving the resulting binary images. The proposed technique is also robust, as there is no need for parameter tuning. After using Document Image Binarization Contest (DIBCO) datasets to evaluate the proposed technique, our findings show that its efficiency is promising, producing better results than those obtained by some of the DIBCO winners.
International Journal of Engineering & Technology, 2018
The Malay language is written in both Jawi and Roman scripts. In the past, Jawi writing was widely used by the Malay community and by foreigners, as can be seen in old documents. Old documents face the risk of background damage. To preserve this valuable information, there is a significant need to automate the processing of Jawi materials. Based on previous literature, POS tagging is known as the first phase in automated text analysis, and the development of language technologies can barely begin without this phase. We highlight the existing POS tagging approaches and suggest the development of a Malay Jawi POS tagger using an extended ME-based approach on the NUWT Corpus. Results show that the proposed model yields higher accuracy than the state-of-the-art model.

Journal of Imaging, 2019
In this era of digitization, most hardcopy documents are being transformed into digital formats. In the process of transformation, large quantities of documents are stored and preserved through electronic scanning. These documents are available from various sources such as ancient documentation, old legal records, medical reports, music scores, palm leaf, and reports on security-related issues. In particular, ancient and historical documents are hard to read due to their degradation in terms of low contrast and existence of corrupted artefacts. In recent times, degraded document binarization has been studied widely and several approaches were developed to deal with issues and challenges in document binarization. In this paper, a comprehensive review is conducted on the issues and challenges faced during the image binarization process, followed by insights on various methods used for image binarization. This paper also discusses the advanced methods used for the enhancement of degrad...

International Journal on Advanced Science, Engineering and Information Technology, 2019
A scene in a photo that is occluded by a uniform distribution of particles suffers an unintended loss of image quality. State-of-the-art pre-processing methods are able to enhance visibility by applying local and global filters to the image scene. Even when the air light and transmission map are estimated correctly, those methods unfortunately produce artefacts and halo effects because the windows of the global and local filters are uncorrelated. In addition, previous approaches might abruptly eliminate primary scene structure such as texture and colour. Therefore, this study aims not only to improve scene image quality via a recovery method but also to overcome image content issues such as artefacts and halo effects, and finally to reduce light disturbance in the scene image. We introduce a visibility enhancement method that uses a joint ambience distribution to improve the colour cast in the image. Furthermore, the method is able to balance the atmospheric light in correspondence with the depth map. Consequently, our method maintains the structural texture information of the image by calculating the lighting estimation while maintaining a range of colours. The method is tested on images from the Benchmarking Single Image Dehazing research by assessing their clear edge ratio, gradient, range of saturated pixels, and structural similarity index. The restoration assessment results show that our proposed method outperformed results from the Tan, Tarel and He methods, gaining the highest score in the structural similarity index and colourfulness measurement. Our proposed method also achieved an acceptable gradient ratio and percentage of saturated pixels. The proposed approach enhances visibility in the images without affecting them structurally.

International Journal on Advanced Science, Engineering and Information Technology, 2018
The aim of this paper is to assess students' perceptions of their competency and interests in Science, Technology, Engineering and Mathematics (STEM) throughout Malaysia. These perceptions were obtained during and after the students used a STEM module and built a robotic prototype in line with the STEM teachers' specification, in an activity conducted at the National Science Centre, Malaysia. This activity was undertaken because the target ratio for the number of students enrolling in STEM programs is not being met. The developed STEM module is based on the four stages of the learning cycle in Kolb's experiential learning theory: Concrete Experience, Reflective Observation, Abstract Conceptualization, and Active Experimentation. These stages have five key educational activities: watching videos, reading modules, assembling robotic components, drag-and-drop programming using Blockly software, and playing a robotic game. The key element of the activities is the use of a robotic prototype as the main component in increasing the students' interest in STEM via games. The module was evaluated in both qualitative and quantitative case studies of students to inform teachers' perceptions of the developed modules and robotic prototypes. Data were collected through two training events at a science exhibition at the National Science Centre from two distinct groups, namely primary and secondary school students aged 11 to 15. The evaluation comprised five areas: interaction, engagement, challenge, competency, and interest. The results show that the developed module and robotic prototype, based on teachers' perceptions, received a positive response from the respondents. The module can efficiently raise students' interest in STEM, in line with the Malaysia Education Blueprint 2013-2025.

International Journal on Advanced Science, Engineering and Information Technology, 2017
Particle Swarm Optimization (PSO) is one of the well-known algorithms inspired by the natural behavior of a swarm (particles). It is used to solve n-dimensional problems in a search space. One of its modified versions, the Polar Particle Swarm Optimizer, operates in polar coordinates by using an appropriate mapping function. The modified algorithm faced some problems, such as generating a distorted search space, which may have been caused by the method of randomization. This paper introduces an initialization technique that operates entirely in polar coordinates. The first part of the investigation tests the proposed technique on standard PSO. The second part uses the new initialization technique to enhance polar PSO performance. The proposed technique produces evenly distributed points in the polar search space. Experimental results were obtained using various benchmark test functions with different dimension settings. While the technique shows a small enhancement on some benchmark test functions in both PSO and polar PSO, an analysis of variance (ANOVA) shows no statistically significant differences.
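
To illustrate why the randomization method can distort a polar search space, here is a minimal two-dimensional sketch. It is not the initialization technique proposed in the paper, whose details are not given in this abstract. It only contrasts drawing the radius uniformly, which crowds points near the origin, with the standard square-root correction, which spreads points evenly over a disc. All function names are hypothetical.

```python
import math
import random


def sample_naive(rng, radius):
    """Radius drawn uniformly: points cluster near the origin."""
    r = radius * rng.random()
    theta = 2.0 * math.pi * rng.random()
    return r, theta


def sample_area_uniform(rng, radius):
    """Radius drawn as R * sqrt(u): points are uniform over the disc area."""
    r = radius * math.sqrt(rng.random())
    theta = 2.0 * math.pi * rng.random()
    return r, theta


def fraction_inside(points, inner_radius):
    """Share of (r, theta) points whose radius is at most inner_radius."""
    return sum(1 for r, _ in points if r <= inner_radius) / len(points)


rng = random.Random(0)  # fixed seed so the comparison is repeatable
naive = [sample_naive(rng, 1.0) for _ in range(10000)]
even = [sample_area_uniform(rng, 1.0) for _ in range(10000)]

# The inner disc of half the radius covers one quarter of the area, so an
# even spread should put roughly 25% of the points there; the naive sampler
# puts roughly 50% there.
naive_share = fraction_inside(naive, 0.5)
even_share = fraction_inside(even, 0.5)
```

The same concern applies in higher dimensions, where the correction depends on the dimension; this sketch covers the 2-D case only.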

International Journal on Advanced Science, Engineering and Information Technology, 2016
According to the International Commission on Illumination, the maximum limit of human visibility without the assistance of equipment is 1000 m. Using a camera outdoors for navigation, monitoring, remote sensing, and robotic movement may sometimes yield images that are disturbed by haze, fog, smoke, steam, and water drops. Fog is the random movement of water drops in the air that normally exists in the early morning. This disturbance causes the observed image to have low contrast and obscured, hard-to-identify targets. Analysis of the degraded image can restore an image damaged by atmospheric particles or water drops during image observation. Generally, images with atmospheric particles contain a homogeneous texture, such as brightness, and a heterogeneous texture, which is the object that exists in the atmosphere. A pre-processing method based on the dark channel prior, a statistical measure of contrast vision and prior knowledge, still produces good image quality but is less effective at overcoming the halo (ring light) problem and strong lighting. This study aims to advance the machine vision industry for navigation and monitoring in ground, air, and sea transportation.
Journal of Computer Science, 2016
Active contours, also known as snakes, have become familiar and widely used in image segmentation and the restoration of historical documents in the last few decades. The Gradient Vector Flow (GVF) snake succeeds in converging to boundary concavities, overcoming a drawback of traditional snakes. However, deep concavities remain an obstacle for the GVF snake when restoring broken characters in historical documents. In this study, we propose an algorithm that combines a genetic algorithm with the GVF snake algorithm to optimize the snake points so that they reach the right positions in deep concavity boundaries, and that adds a divergence factor as a third force to enhance the restoration and recognition results. The experimental results show that our proposed algorithm achieves more capture than the GVF snake alone.

Jawi is a script with Arabic influence. In the past, it was widely used by the Malay community as well as by foreigners with diplomatic, business, missionary, and similar relations. At that time, the Malay language was the lingua franca of the region. There are therefore many Malay heritage materials in the Jawi script, such as manuscripts, religious books, letters, documents, and agreements. There is a significant need to transliterate the Jawi text in these materials to Roman Malay, and research on machine transliteration will help this effort. Much research on machine transliteration has been done for major languages, such as English and other European languages, and Asian languages such as Chinese, Japanese, Korean, and Arabic. However, research in the context of the Malay language is still lacking, especially on the Romanized transliteration of Jawi. Jawi writing is quite different from Urdu and Arabic although they share the same chara...
Journal of Theoretical and Applied Information Technology
Robot Path Planning (RPP) in dynamic environments is a search problem based on the examination of collision-free paths in the presence of dynamic and static obstacles. Many techniques have been developed to solve this problem. Becoming trapped in a local minimum and maintaining real-time performance are known as the two most important challenges these techniques face. This study presents a comprehensive survey of the various techniques that have been proposed in this domain. As part of this survey, we include a classification of the approaches and identify their methods.

This paper proposes a method based on Multi-Swarm Particle Swarm Optimization (PSO) with Local Search for a multi-robot search system that must find a given target in a complex environment containing static obstacles. By applying Multi-Swarm with Multi-Best particles to the multi-robot system, the method can overcome the premature convergence problem, which is one of the main problems of basic PSO. As time progresses, the global searching ability of the algorithm decreases, so the robots tend to group together in a small explored region and cannot reach the target; this is called premature convergence. By combining the Local Search method with Multi-Swarm, we can guarantee the global convergence of the proposed algorithm, and the robots can reach the target. The experimental results obtained in a simulated environment show that biological and sociological inspiration could be useful in meeting the challenges of robotic applications that can be described as optimization problems.

The style of writing, or calligraphy, applied in ancient manuscripts gives useful information to paleographers. The information helps paleographers identify the date, writer, number of writers, place of origin, and originality of manuscripts. This information is known as features. Features from characters, tangent value, dominant background, and the Grey-Level Co-occurrence Matrix (GLCM) have been used in this field of research. A novel technique was proposed for digital Jawi paleography. Jawi is the original Malay writing based on Arabic characters. The proposed technique models triangles on images and extracts features from them. The features are used for classification. In this paper, new features for the Triangle Model are proposed, and the implementation of four zones is reported. The number of features has been extended from 12 to 45. For validation of the proposed algorithm, 60,000:20,000 training and testing data from the Hoda digit dataset were prepared, selected randomly for 10 rounds of testing. For further verification, experiments were run with two Supervised Machine Learning (SML) and three Unsupervised Machine Learning (UML) algorithms. These experiments were conducted using a new Arabic calligraphy dataset built from 1,225 Arabic letters taken from calligraphy books. The SML experiments were conducted with a ratio of 807:408 for training and testing, whereas for UML, three classifiers were tested on 30 images from the Arabic calligraphy dataset. Results from the tests show that the Triangle Model technique can be used successfully in digital paleography of Jawi characters.

2013 International Conference on Control, Decision and Information Technologies (CoDIT), 2013
The triangle is a basic geometric shape. There are six types of triangle, but the scalene triangle was chosen for this research, based on the corner coordinates generated by our proposed algorithm. In this paper, nine features are proposed. Six of the features are derived from the coordinates and sides of the triangle, whereas the other three are the angles at its corners. After the features are identified, the image is divided into 25 zones. The zoning processes are based on the Cartesian plane and on vertical and horizontal zones. Through the zoning process, the nine features are expanded to 225 features. The proposed features are used on the HODA, MNIST, IFHCDB, and BANGLA datasets. Experiments are conducted using two supervised learning methods, Support Vector Machine (SVM) and Multi-layer Perceptron (MLP). Results from the experiments are evaluated with different Cost (c) values for the SVM and Learning Rate (LR) values for the MLP. The results are then compared with state-of-the-art research in this area.
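
As a rough illustration of geometric features that can be read off a triangle, the sketch below computes the three side lengths and the three corner angles from corner coordinates. The abstract does not list the exact nine features or how the corners are generated, so the choice of features, the function name `triangle_features`, and the example coordinates are all assumptions.

```python
import math


def triangle_features(a, b, c):
    """Side lengths and corner angles (radians) of the triangle a-b-c.

    Assumes a non-degenerate triangle (no zero-length side); the corner
    points are (x, y) tuples. Hypothetical helper for illustration only.
    """
    side_a = math.dist(b, c)  # side opposite corner a
    side_b = math.dist(c, a)  # side opposite corner b
    side_c = math.dist(a, b)  # side opposite corner c

    def angle(opposite, s1, s2):
        # Law of cosines; clamp to [-1, 1] to guard against rounding error.
        cos_value = (s1 * s1 + s2 * s2 - opposite * opposite) / (2.0 * s1 * s2)
        return math.acos(max(-1.0, min(1.0, cos_value)))

    angles = (angle(side_a, side_b, side_c),
              angle(side_b, side_c, side_a),
              angle(side_c, side_a, side_b))
    return {"sides": (side_a, side_b, side_c), "angles": angles}


# A 3-4-5 right triangle with the right angle at the first corner.
features = triangle_features((0.0, 0.0), (3.0, 0.0), (0.0, 4.0))
```

If a fixed set of nine features is computed in each of 25 zones, concatenating them gives the 9 x 25 = 225 features mentioned in the abstract.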