Skip navigation
Please use this identifier to cite or link to this item: http://repository.iitr.ac.in/handle/123456789/5637
Title: Frame selection for OCR from video stream of book flipping
Authors: Chakraborty D.
Pratim Roy, Partha
Saini R.
Alvarez J.M.
Pal U.
Published in: Multimedia Tools and Applications
Abstract: Optical Character Recognition (OCR) in video stream of flipping pages is a challenging task because flipping at random speed causes difficulties in identifying the frames that contain the open page image (OPI). Also, low resolution, blurring effect, shadow, etc., add significant noise in selection of proper frames for OCR. In this paper, we focus on identifying a set of representative frames from the video stream of flipping pages without using any explicit hardware and then perform OCR on these frames for recognition. Thus, an end-to-end solution is proposed for video stream of flipping pages. To select an OPI, we present an efficient algorithm that exploits cues from edge information during flipping event. These cues, extracted from the region of interest (ROI) of the frame, determine the flipping or open state of a page. The open state classification is performed by an SVM classifier following training of the edge cue information. After selecting a set of frames for each OPI, a representative frame from OPI set is chosen for OCR. Experiments are performed on videos captured using standard resolution camera. We have obtained 88.81 % accuracy on representative frame selection from the proposed method whereas when compared with GIST (Oliva and Torralba, Int J Comput Vis 42(3):145–175 (2001)), the accuracy was only 51.28 %. To the best of our knowledge this is the first work in this area. After frame selection, we have achieved 83.31 % character recognition accuracy and 78.11 % word recognition accuracy with traditional OCR in our dataset of flipping book. © 2017, Springer Science+Business Media New York.
Citation: Multimedia Tools and Applications (2018), 77(1): 985-1008
URI: https://doi.org/10.1007/s11042-016-4292-3
http://repository.iitr.ac.in/handle/123456789/5637
Issue Date: 2018
Publisher: Springer New York LLC
Keywords: OCR of flipping book
Video document image
Video OCR
ISSN: 13807501
Author Scopus IDs: 56118751400
56880478500
57190288840
23391957400
57200742116
Author Affiliations: Chakraborty, D., Computer Vision and Pattern Recognition Unit, ISI Kolkata, Kolkata, India
Roy, P.P., Department of Computer Science and Engineering, IIT Roorkee, Roorkee, India
Saini, R., Department of Computer Science and Engineering, IIT Roorkee, Roorkee, India
Alvarez, J.M., Canberra Research Lab, ACTACT, Australia
Pal, U., Computer Vision and Pattern Recognition Unit, ISI Kolkata, Kolkata, India
Corresponding Author: Roy, P.P.; Department of Computer Science and Engineering, IIT RoorkeeIndia; email: proy.fcs@iitr.ac.in
Appears in Collections:Journal Publications [CS]

Files in This Item:
There are no files associated with this item.
Show full item record


Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.