Skip navigation
Please use this identifier to cite or link to this item:
Title: HMM-based Indic handwritten word recognition using zone segmentation
Authors: Pratim Roy, Partha
Bhunia A.K.
Das A.
Dey P.
Pal U.
Published in: Pattern Recognition
Abstract: This paper presents a novel approach towards Indic handwritten word recognition using zone-wise information. Because of complex nature due to compound characters, modifiers, overlapping and touching, etc., character segmentation and recognition is a tedious job in Indic scripts (e.g. Devanagari, Bangla, Gurumukhi, and other similar scripts). To avoid character segmentation in such scripts, HMM-based sequence modeling has been used earlier in holistic way. This paper proposes an efficient word recognition framework by segmenting the handwritten word images horizontally into three zones (upper, middle and lower) and then recognize the corresponding zones. The main aim of this zone segmentation approach is to reduce the number of distinct component classes compared to the total number of classes in Indic scripts. As a result, use of this zone segmentation approach enhances the recognition performance of the system. The components in middle zone, where characters are mostly touching, are recognized using HMM. After the recognition of middle zone, HMM based Viterbi forced alignment is applied to mark the left and right boundaries of the characters in the middle zone. Next, the residue components, if any, in upper and lower zones are obtained in a character boundary then the components are combined with the character to achieve the final word level recognition. Water reservoir-based properties have been integrated in this framework to improve the zone segmentation and character boundary detection defects while segmentation. A novel sliding window-based feature, called Pyramid Histogram of Oriented Gradient (PHOG) is proposed for middle zone recognition. PHOG features have been compared with other existing features and found robust for Indic script recognition. An exhaustive experiment is performed on two Indic scripts namely, Bangla and Devanagari for the performance evaluation. From the experiment, it has been noted that proposed zone-wise recognition improves accuracy with respect to the traditional way of Indic word recognition. © 2016 Elsevier Ltd
Citation: Pattern Recognition (2016), 60(): 1057-1075
Issue Date: 2016
Publisher: Elsevier Ltd
Keywords: Handwritten word recognition
Hidden Markov Model
Indian script recognition
ISSN: 313203
Author Scopus IDs: 56880478500
Author Affiliations: Roy, P.P., Department of CSE, Indian Institute of Technology Roorkee, India
Bhunia, A.K., Department of ECE, Institute of Engineering & Management, Kolkata, India
Das, A., Department of ECE, Institute of Engineering & Management, Kolkata, India
Dey, P., Department of ECE, Institute of Engineering & Management, Kolkata, India
Pal, U., CVPR Unit, Indian Statistical Institute, Kolkata, India
Corresponding Author: Roy, P.P.; Department of CSE, Indian Institute of Technology RoorkeeIndia; email:
Appears in Collections:Journal Publications [CS]

Files in This Item:
There are no files associated with this item.
Show full item record

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.