Skip navigation
Please use this identifier to cite or link to this item: http://repository.iitr.ac.in/handle/123456789/15713
Title: Bangla date field extraction in offline handwritten documents
Authors: Mandal R.
Pratim Roy, Partha
Pal U.
Published in: Proceedings of ACM International Conference Proceeding Series
Abstract: Date is a useful information for various application (e.g. date wise document indexing) and automatic extraction of date information involves difficult challenges due to writing styles of different individuals, touching characters and confusion among identification of numerals, punctuation and texts. In this paper, we present a framework for indexing/retrieval of Bangla date patterns from handwritten documents. The method first classifies word components of each text line into month and non-month class using word level feature. Next, non-month words are segmented into individual components and classified into one of text, digit or punctuation. Using this information of word and character level components, the date patterns are searched. First using voting approach and then using regular expression we detect the candidate lines for numeric and semi-numeric date. Dynamic Time Warping (DTW) matching of profile based features is used for classification of month/non-month words. Numerals and punctuations are classified using gradient based feature and SVM classifier. The experiment is performed on Bangla handwritten dataset and the results demonstrate the effectiveness of the proposed system. © 2012 ACM.
Citation: Proceedings of ACM International Conference Proceeding Series, (2012), 37- 41. Mumbai
URI: https://doi.org/10.1145/2432553.2432561
http://repository.iitr.ac.in/handle/123456789/15713
Issue Date: 2012
Keywords: Automatic extraction
Document indexing
Dynamic time warping
Gradient based feature
Handwritten dataset
Handwritten document
Individual components
Off-line handwritten
Regular expressions
SVM classifiers
Text lines
Touching character
Voting approach
Word and characters
Word components
Word level
Writing style
Extraction
Indexing (of information)
Pattern matching
Information use
ISBN: 9.78145E+12
Author Scopus IDs: 54410932900
56880478500
57200742116
Author Affiliations: Mandal, R., Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India
Roy, P.P., Laboratoire d'Informatique, Université François Rabelais, Tours, France
Pal, U., Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India
Corresponding Author: Mandal, R.; Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; email: ranjumandal@gmail.com
Appears in Collections:Conference Publications [CS]

Files in This Item:
There are no files associated with this item.
Show full item record


Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.