Please use this identifier to cite or link to this item: http://repository.iitr.ac.in/handle/123456789/15862
Title: Multi-oriented English text line extraction using background and foreground information
Authors: Pratim Roy, Partha
Pal U.
Lladós J.
Kimura F.
Published in: Proceedings of DAS 2008 of the 8th IAPR International Workshop on Document Analysis Systems
Abstract: In graphical documents (maps, engineering drawings), artistic documents, and similar printed materials, text lines are often not parallel to each other; they are multi-oriented and curved in nature. For OCR of such documents, individual text lines must first be extracted. Extracting individual text lines from multi-oriented and/or curved text documents is a difficult problem. In this paper, we propose a novel method to extract individual text lines from such document pages; the method is based on the foreground and background information of the characters of the text. To handle background information, the water reservoir concept is used. In the proposed scheme, individual components are first detected and grouped into 3-character clusters using their inter-component distance, size, and positional information. Applying graph concepts, the initial 3-character clusters are merged into larger cluster groups. Using inter-character background information, the orientations of the extreme characters of a larger cluster are determined and, based on these orientations, two candidate regions are formed from the cluster. Finally, with the help of these candidate regions, individual lines are extracted. The experiments gave encouraging results. © 2008 IEEE.
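Illustration: the grouping and merging steps named in the abstract can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation. The abstract gives no thresholds or exact criteria, so the function names (triplets, merge_triplets), the parameters (dist_factor, size_ratio, min_angle_deg), the collinearity test, and the "share two components" merge rule are all hypothetical. The water reservoir step and the candidate-region step are not covered.

```python
import math
from itertools import combinations

def triplets(components, dist_factor=2.0, size_ratio=2.0, min_angle_deg=135.0):
    """Group components into 3-character clusters.

    components: list of (cx, cy, height) tuples (assumed representation).
    Returns a set of index triples (end, middle, end).
    All thresholds are illustrative assumptions, not values from the paper.
    """
    def close(i, j):
        (xi, yi, hi), (xj, yj, hj) = components[i], components[j]
        similar = max(hi, hj) <= size_ratio * min(hi, hj)          # size check
        near = math.hypot(xi - xj, yi - yj) <= dist_factor * max(hi, hj)
        return similar and near

    found = set()
    n = len(components)
    for b in range(n):
        neighbours = [j for j in range(n) if j != b and close(b, j)]
        for a, c in combinations(neighbours, 2):
            ax, ay = components[a][0] - components[b][0], components[a][1] - components[b][1]
            cx, cy = components[c][0] - components[b][0], components[c][1] - components[b][1]
            norm = math.hypot(ax, ay) * math.hypot(cx, cy)
            if norm == 0:
                continue  # coincident centroids: no usable direction
            cos_angle = max(-1.0, min(1.0, (ax * cx + ay * cy) / norm))
            # Positional check: a and c should lie on roughly opposite sides of b.
            if math.degrees(math.acos(cos_angle)) >= min_angle_deg:
                found.add((min(a, c), b, max(a, c)))
    return found

def merge_triplets(triples):
    """Merge triplets that share at least two components (assumed rule).

    Triplets are graph nodes; connected components of the graph, found
    with union-find, become the larger cluster groups.
    """
    triples = sorted(triples)
    parent = list(range(len(triples)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(triples)), 2):
        if len(set(triples[i]) & set(triples[j])) >= 2:
            parent[find(i)] = find(j)

    groups = {}
    for i, t in enumerate(triples):
        groups.setdefault(find(i), set()).update(t)
    return sorted(sorted(g) for g in groups.values())

# Usage: four characters on a horizontal line, three on a distant vertical line.
comps = [(0, 0, 10), (12, 0, 10), (24, 0, 10), (36, 0, 10),
         (200, 200, 10), (200, 212, 10), (200, 224, 10)]
clusters = merge_triplets(triplets(comps))
```

Requiring two shared components before merging, rather than one, is a design choice made here so that two lines crossing at a single character are not fused; the paper may use a different rule.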
Citation: Proceedings of DAS 2008 of the 8th IAPR International Workshop on Document Analysis Systems, (2008), 315-322. Nara
URI: https://doi.org/10.1109/DAS.2008.83
http://repository.iitr.ac.in/handle/123456789/15862
Issue Date: 2008
Keywords: Electronic warfare
Image segmentation
Intelligent vehicle highway systems
Ionizing radiation
Military electronic countermeasures
Technical presentations
Background informations
Engineering drawings
Foreground informations
Individual components
Novel methods
Of graphs
Positional informations
Printed materials
Text documents
Text lines
Water reservoirs
Reservoirs (water)
ISBN: 9.78077E+12
Author Scopus IDs: 56880478500
57200742116
23392941400
55247906100
Author Affiliations: Roy, P.P., Computer Vision Center, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
Pal, U., Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata -108, India
Lladós, J., Computer Vision Center, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
Kimura, F., Graduate School of Engineering, Mie University, 1577 Kurimamachiya, Mie 514-8504, Japan
Corresponding Author: Roy, P. P.; Computer Vision Center, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
Appears in Collections: Conference Publications [CS]

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.