Structure in on-line documents
2001, Proceedings of Sixth International Conference on Document Analysis and Recognition
https://doi.org/10.1109/ICDAR.2001.953906Abstract
text segment is then classified as drawing, ruled table or underlined keyword using stroke properties. The individual regions are processed and the results are assembled to identify the structure of the on-line document.
References (15)
- E. Bruzzone and M. Coffetti. An algorithm for extract- ing cursive text lines. In Proceedings of the 5 th Interna- tional Conference on Document Analysis and Recognition (ICDAR'99), pages 749-752, Bangalore, India, September 1999.
- S. D. Connell and A. Jain. Learning prototypes for on-line handwritten digits. In Proceedings of the 14 th International Conference on Pattern Recognition, pages 182-184, Bris- bane, Australia, August 1998.
- Y. Hirayama. A method for table structure analysis using DP matching. In Proceedings of the 3 rd International Confer- ence on Document Analysis and Recognition (ICDAR'95), pages 583-586, Montreal, Canada, August 1995.
- J. Hu, R. Kashi, D. Lopresti, and G. Wilfong. Table de- tection across multiple media. In Proceedings of the Work- shop on Document Layout Interpretation and its Applica- tions, Bangalore, India, September 1999.
- A. K. Jain and R. C. Dubes. Algorithms for Clustering Data. Prentice Hall, 1988.
- A. K. Jain and B. Yu. Document representation and its appli- cation to page decomposition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(3):294-308, March 1998.
- T. G. Keininger and A. Dengel. The T-RECS approach for table structure recognition and table border determination. In Proceedings of the Workshop on Document Layout Inter- pretation and its Applications, Bangalore, India, September 1999.
- W. Kornfeld and J. Wattecamps. Automatically locating, ex- tracting and analyzing tabular data. In Proceedings of 21st Annual International ACM SIGIR Conference, pages 347- 349, Melbourne, Australia, August 1998.
- G. Nagy. Twenty years of document image analysis in PAMI. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(1):38-62, January 2000.
- Y. Pu and Z. Shi. A natural learning algorithm based on Hough transform for text lines extraction in handwritten documents. In Proceedings of the 6 th International Work- shop on Frontiers in Handwriting Recognition, pages 637- 646, Taejon, Korea, August 1998.
- E. H. Ratzlaff. Inter-line distance estimation and text line extraction for unconstrained online handwriting. In Pro- ceedings of the 7 th International Workshop on Frontiers in Handwriting Recognition, Nijmegen, Netherlands, Septem- ber 2000.
- J. Subrahmonia. Pen computing: Challenges and applica- tions. In Proceedings of the 15 th International Conference on Pattern Recognition, pages 60-66, Barcelona, Spain, September 2000.
- T. A. Tokuyasu and P. A. Chou. An iterative decoding ap- proach to document image analysis. In Proceedings of the Workshop on Document Layout Interpretation and its Appli- cations, Bangalore, India, September 1999.
- B. Yu and A. K. Jain. A robust and fast skew detec- tion algorithm for generic documents. Pattern Recognition, 29(10):1599-1630, October 1996.
- Y. Zhong, K. Karu, and A. K. Jain. Locating text in com- plex color images. Pattern Recognition, 28(10):1523-1535, 1995.