Please use this identifier to cite or link to this item: http://repository.iitr.ac.in/handle/123456789/15983
Full metadata record
DC Field | Value | Language
dc.contributor.author | Pratim Roy, Partha | -
dc.contributor.author | Ramel J.-Y. | -
dc.contributor.author | Ragot N. | -
dc.date.accessioned | 2020-12-02T11:42:10Z | -
dc.date.available | 2020-12-02T11:42:10Z | -
dc.date.issued | 2011 | -
dc.identifier.citation | Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, (2011), 678-682. Beijing | -
dc.identifier.isbn | 9.78077E+12 | -
dc.identifier.issn | 15205363 | -
dc.identifier.uri | https://doi.org/10.1109/ICDAR.2011.142 | -
dc.identifier.uri | http://repository.iitr.ac.in/handle/123456789/15983 | -
dc.description.abstract | Word searching and indexing in historical document collections is a challenging problem because characters in these documents are often touching or broken due to degradation and ageing effects. For efficient searching in such historical documents, this paper presents a novel approach to word spotting using string matching of character primitives. We describe a text string as a sequence of primitives, each of which consists of a single character or a part of a character. Primitive segmentation is performed by analyzing text background information obtained with the water reservoir technique. Next, the primitives are clustered using template matching, and a codebook of representative primitives is built. Using this primitive codebook, the text information in the document images is encoded and stored. A query word is segmented into primitives and encoded as a string of representative primitives from the codebook. Finally, approximate string matching is applied to find similar words, and the matching similarity is used to rank the retrieved words. The proposed method is tested on historical books written in the French alphabet, and we have obtained encouraging results from the experiment. © 2011 IEEE. | -
dc.description.sponsorship | TC10 (Graph. Recogn.) TC11 (Read. Syst.) (IAPR); Chinese Academy of Sciences; NSFC; FUJITSU; Hanvon Technology | -
dc.language.iso | en_US | -
dc.relation.ispartof | Proceedings of the International Conference on Document Analysis and Recognition, ICDAR | -
dc.subject | Ageing effects | -
dc.subject | Approximate string matching | -
dc.subject | Background information | -
dc.subject | Codebooks | -
dc.subject | Document images | -
dc.subject | Historical documents | -
dc.subject | Query words | -
dc.subject | String matching | -
dc.subject | Text information | -
dc.subject | Text string | -
dc.subject | Water reservoir | -
dc.subject | Word retrieval | -
dc.subject | Word Spotting | -
dc.subject | History | -
dc.subject | Reservoirs (water) | -
dc.subject | Template matching | -
dc.subject | Image matching | -
dc.title | Word retrieval in historical document using character-primitives | -
dc.type | Conference Paper | -
dc.scopusid | 56880478500 | -
dc.scopusid | 8293131700 | -
dc.scopusid | 16053389600 | -
dc.affiliation | Roy, P.P., Laboratoire d'Informatique, Université François Rabelais, Tours, France | -
dc.affiliation | Ramel, J.-Y., Laboratoire d'Informatique, Université François Rabelais, Tours, France | -
dc.affiliation | Ragot, N., Laboratoire d'Informatique, Université François Rabelais, Tours, France | -
dc.description.correspondingauthor | Roy, P.P.; Laboratoire d'Informatique, Université François Rabelais, Tours, France; email: partha.roy@univ-tours.fr | -
dc.identifier.conferencedetails | 11th International Conference on Document Analysis and Recognition, ICDAR 2011, Beijing, 18-21 September 2011 | -
Appears in Collections: Conference Publications [CS]
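
The abstract's final stage (encoding each word as a string of codebook primitives, then ranking candidates by approximate string matching) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that segmentation and clustering have already produced integer codebook labels, and it substitutes plain Levenshtein distance for whatever matching cost the paper uses. The names `edit_distance`, `similarity`, `rank_words`, and the sample labels are hypothetical.

```python
from typing import List, Sequence, Tuple


def edit_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Levenshtein distance between two sequences of codebook labels."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # drop a primitive from a
                            curr[j - 1] + 1,      # insert a primitive from b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def similarity(a: Sequence[int], b: Sequence[int]) -> float:
    """Normalised similarity in [0, 1]; 1.0 means identical encodings."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


def rank_words(query: Sequence[int],
               index: List[Tuple[str, Sequence[int]]],
               top_k: int = 5) -> List[Tuple[str, float]]:
    """Score every indexed word against the query and return the best matches."""
    scored = [(word_id, similarity(query, codes)) for word_id, codes in index]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical index: word identifiers mapped to codebook label strings.
index = [
    ("page1_word4", [3, 7, 7, 2]),
    ("page2_word9", [3, 7, 2]),      # one primitive missing (e.g. touching characters)
    ("page5_word1", [9, 9, 1]),
]
ranked = rank_words([3, 7, 7, 2], index)
```

Normalising by the longer sequence keeps scores comparable across words of different lengths, which matters when broken or touching characters change the number of primitives in a word.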

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.