Development of a Two-Stage Segmentation-Based Word Searching Method for Handwritten Document Images

11Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

Word searching or keyword spotting is an important research problem in the domain of document image processing. The solution to the said problem for handwritten documents is more challenging than for printed ones. In this work, a two-stage word searching schema is introduced. In the first stage, all the irrelevant words with respect to a search word are filtered out from the document page image. This is carried out using a zonal feature vector, called pre-selection feature vector, along with a rule-based binary classification method. In the next step, a holistic word recognition paradigm is used to confirm a pre-selected word as search word. To accomplish this, a modified histogram of oriented gradients-based feature descriptor is combined with a topological feature vector. This method is experimented on a QUWI English database, which is freely available through the International Conference on Document Analysis and Recognition 2015 competition entitled "Writer Identification and Gender Classification." This technique not only provides good retrieval performance in terms of recall, precision, and F-measure scores, but it also outperforms some state-of-the-art methods.

References Powered by Scopus

Histograms of oriented gradients for human detection

30478Citations
N/AReaders
Get full text

Lexicon-free handwritten word spotting using character HMMs

257Citations
N/AReaders
Get full text

PHOCNet: A deep convolutional neural network for word spotting in handwritten documents

208Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A GA based hierarchical feature selection approach for handwritten word recognition

163Citations
N/AReaders
Get full text

Handwritten English word recognition using a deep learning based object detection architecture

32Citations
N/AReaders
Get full text

A two-stage CNN-based hand-drawn electrical and electronic circuit component recognition system

24Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Malakar, S., Ghosh, M., Sarkar, R., & Nasipuri, M. (2020). Development of a Two-Stage Segmentation-Based Word Searching Method for Handwritten Document Images. Journal of Intelligent Systems, 29(1), 719–735. https://doi.org/10.1515/jisys-2017-0384

Readers' Seniority

Tooltip

Professor / Associate Prof. 2

33%

Researcher 2

33%

Lecturer / Post doc 1

17%

PhD / Post grad / Masters / Doc 1

17%

Readers' Discipline

Tooltip

Computer Science 6

75%

Engineering 1

13%

Materials Science 1

13%

Save time finding and organizing research with Mendeley

Sign up for free