Academia.eduAcademia.edu

Outline

Automatic Arabic Hand Written Text Recognition System

2007

Abstract

Despite of the decent development of the pattern recognition science applications in the last decade of the twentieth century and this century, text recognition remains one of the most important problems in pattern recognition. To the best of our knowledge, little work has been done in the area of Arabic text recognition compared with those for Latin, Chins and Japanese text. The main difficulty encountered when dealing with Arabic text is the cursive nature of Arabic writing in both printed and handwritten forms. An Automatic Arabic Hand-Written Text Recognition (AHTR) System is proposed. An efficient segmentation stage is required in order to divide a cursive word or sub-word into its constituting characters. After a word has been extracted from the scanned image, it is thinned and its base line is calculated by analysis of horizontal density histogram. The pattern is then followed through the base line and the segmentation points are detected. Thus after the segmentation stage, the cursive word is represented by a sequence of isolated characters. The recognition problem thus reduces to that of classifying each character. A set of features extracted from each individual characters. A minimum distance classifier is used. Some approaches are used for processing the characters and post processing added to enhance the results. Recognized characters will be appended directly to a word file which is editable form.

References (16)

  1. Amin, A., A. Kaced, J.P. Haton and Moher, 1980. Handwritten Arabic character recognition by I.R.A.C. system. Proc. Fifth. Intl. Conf. Pattern Recognition, pp: 721-731.
  2. El-Sheikh, T.S. and S. Taweel, 1990. Real-time Arabic handwritten character recognition. Patt. Recog., 23: 1323-1332.
  3. Amin, A. and G. Masini, 1986. Machine recognition of multi-font printed Arabic text. Proc. 8th Conf. Patt. Recog. (Paris, France), pp: 392-395.
  4. Almuallim, H. and S. Yamagushi, 1987. A method of recognition of Arabic cursive handwritten. IEEE Trans. PAMI, 9: 5.
  5. Bozinovic, R.M. and S.N. Srihari, 1989. Off-line cursive script word recognition. IEEE Trans. PAMI, 11: 1.
  6. Al-Yousefi, H. and S. Udpa, 1992. Recognition of Arabic characters. IEEE Trans. PAMI, 14: 8.
  7. Chen, M., A. Kundu and J. Zhou, 1994. Off-line handwritten word recognition using a hidden Markov model type stochastic network. IEEE Trans. PAMI, 16: 5.
  8. Abuhabiba, S.I., S.A. Mahmoud and R.J. Green, 1994. Recognition of handwritten cursive Arabic characters. TEEE Trans. PAMI, 16: 6.
  9. Emam, A.M., 1995. Designing a reader machine for the blind. Ph.D. Thesis. University of Alexandria.
  10. Mustapha, E. and A. Lazrek, 2003. Arabic scientific document composition. ICITNS 2003, Amman, Jordan.
  11. Altuwaijri, M. and M. Bayoumi, 1994. Recognition of Arabic characters using neural networks. ICECS, pp: 720-725. Dec 19-22, Cairo, Egypt.
  12. Rafael, C.G. and R.E. Woods, 1992. Digital Image Processing. New York.
  13. Kapogiannopoulos, G. and M. Papadakis, 1994. Character recognition using a biorthogonal discrete wavelet transform. SPIE, Optical Pattern Recogn., 2825: 384-393.
  14. Rashkovskiy, O., L. sadovnik and N. Caviris, 1994. Scale, rotation and shift invariant wavelet transform. SPIE, Optical Pattern Recogn., 2237: 390-401.
  15. Devijver, P. and J. Kittler, 1982. Pattern Recognition: A Statistical Approach. Prentice Hall, Englewood Cliffs, N.J.
  16. Robert, J.S., 1992. Pattern Recognition: Statistical, Structural and Neural Approaches. New York.