http://repository.iitr.ac.in/handle/123456789/5637
Title: | Frame selection for OCR from video stream of book flipping |
Authors: | Chakraborty D. Pratim Roy, Partha Saini R. Alvarez J.M. Pal U. |
Published in: | Multimedia Tools and Applications |
Abstract: | Optical Character Recognition (OCR) in video stream of flipping pages is a challenging task because flipping at random speed causes difficulties in identifying the frames that contain the open page image (OPI). Also, low resolution, blurring effect, shadow, etc., add significant noise in selection of proper frames for OCR. In this paper, we focus on identifying a set of representative frames from the video stream of flipping pages without using any explicit hardware and then perform OCR on these frames for recognition. Thus, an end-to-end solution is proposed for video stream of flipping pages. To select an OPI, we present an efficient algorithm that exploits cues from edge information during flipping event. These cues, extracted from the region of interest (ROI) of the frame, determine the flipping or open state of a page. The open state classification is performed by an SVM classifier following training of the edge cue information. After selecting a set of frames for each OPI, a representative frame from OPI set is chosen for OCR. Experiments are performed on videos captured using standard resolution camera. We have obtained 88.81 % accuracy on representative frame selection from the proposed method whereas when compared with GIST (Oliva and Torralba, Int J Comput Vis 42(3):145–175 (2001)), the accuracy was only 51.28 %. To the best of our knowledge this is the first work in this area. After frame selection, we have achieved 83.31 % character recognition accuracy and 78.11 % word recognition accuracy with traditional OCR in our dataset of flipping book. © 2017, Springer Science+Business Media New York. |
Citation: | Multimedia Tools and Applications (2018), 77(1): 985-1008 |
URI: | https://doi.org/10.1007/s11042-016-4292-3 http://repository.iitr.ac.in/handle/123456789/5637 |
Issue Date: | 2018 |
Publisher: | Springer New York LLC |
Keywords: | OCR of flipping book Video document image Video OCR |
ISSN: | 13807501 |
Author Scopus IDs: | 56118751400 56880478500 57190288840 23391957400 57200742116 |
Author Affiliations: | Chakraborty, D., Computer Vision and Pattern Recognition Unit, ISI Kolkata, Kolkata, India Roy, P.P., Department of Computer Science and Engineering, IIT Roorkee, Roorkee, India Saini, R., Department of Computer Science and Engineering, IIT Roorkee, Roorkee, India Alvarez, J.M., Canberra Research Lab, ACTACT, Australia Pal, U., Computer Vision and Pattern Recognition Unit, ISI Kolkata, Kolkata, India |
Corresponding Author: | Roy, P.P.; Department of Computer Science and Engineering, IIT RoorkeeIndia; email: proy.fcs@iitr.ac.in |
Appears in Collections: | Journal Publications [CS] |
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.