http://repository.iitr.ac.in/handle/123456789/15862
Title: | Multi-oriented English text line extraction using background and foreground information |
Authors: | Roy, Partha Pratim; Pal, U.; Lladós, J.; Kimura, F. |
Published in: | Proceedings of DAS 2008 of the 8th IAPR International Workshop on Document Analysis Systems |
Abstract: | In graphical documents (maps, engineering drawings), artistic documents, and similar printed materials, text lines are often not parallel to each other; they are multi-oriented and curved in nature. For OCR of such documents, individual text lines must first be extracted. Extracting individual text lines from multi-oriented and/or curved text documents is a difficult problem. In this paper, we propose a novel method to extract individual text lines from such document pages, based on the foreground and background information of the characters of the text. To capture background information, the water reservoir concept is used. In the proposed scheme, individual components are first detected and grouped into 3-character clusters using their inter-component distance, size, and positional information. Using a graph-based approach, the initial 3-character clusters are merged into larger cluster groups. Using inter-character background information, the orientations of the extreme characters of a larger cluster are determined and, based on these orientations, two candidate regions are formed from the cluster. Finally, with the help of these candidate regions, individual lines are extracted. The experiments yielded encouraging results. © 2008 IEEE. |
Citation: | Proceedings of DAS 2008 of the 8th IAPR International Workshop on Document Analysis Systems, (2008), 315-322. Nara |
URI: | https://doi.org/10.1109/DAS.2008.83 http://repository.iitr.ac.in/handle/123456789/15862 |
Issue Date: | 2008 |
Keywords: | Electronic warfare; Image segmentation; Intelligent vehicle highway systems; Ionizing radiation; Military electronic countermeasures; Technical presentations; Background informations; Engineering drawings; Foreground informations; Individual components; Novel methods; Of graphs; Positional informations; Printed materials; Text documents; Text lines; Water reservoirs; Reservoirs (water) |
ISBN: | 9.78077E+12 |
Author Scopus IDs: | 56880478500 57200742116 23392941400 55247906100 |
Author Affiliations: | Roy, P.P., Computer Vision Center, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain; Pal, U., Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata-108, India; Lladós, J., Computer Vision Center, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain; Kimura, F., Graduate School of Engineering, Mie University, 1577 Kurimamachiya, Mie 514-8504, Japan |
Corresponding Author: | Roy, P. P.; Computer Vision Center, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain |
Appears in Collections: | Conference Publications [CS] |