The impact of the image processing in the indexation system

Youssef Elfakir, Ghizlane Khaissidi, Mostafa Mrabti, Driss Chenouni

Abstract


This paper presents an efficient word spotting system for handwritten Arabic documents. Images are represented with bag-of-visual-SIFT descriptors, and a sliding-window approach locates the regions most similar to the query, following the query-by-example paradigm. First, a pre-processing step produces a better representation of the most informative features. Second, a region-based framework represents each local region by a bag of visual SIFT descriptors. Experiments are then carried out to show how the codebook size influences the efficiency of the system, by analyzing the curse-of-dimensionality curve. Finally, the similarity score is measured with a floating distance based on the number of descriptors in each query. The experimental results demonstrate the efficiency of the proposed processing steps in the word spotting system.
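
The region-based matching described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, the descriptors are toy 2-D vectors standing in for 128-dimensional SIFT descriptors, the codebook is assumed to be given (for example from k-means), and a plain L1 histogram distance stands in for the floating distance, whose definition the abstract does not give.

```python
import numpy as np

def quantize(descriptors, codebook):
    """Assign each descriptor to its nearest codeword (Euclidean distance)."""
    # Squared distances, shape (n_descriptors, n_codewords).
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def bovw_histogram(words, codebook_size):
    """L1-normalised histogram of visual words; all zeros if no words."""
    hist = np.bincount(words, minlength=codebook_size).astype(float)
    total = hist.sum()
    return hist / total if total > 0 else hist

def sliding_window_scores(xs, words, codebook_size, query_hist,
                          width, step, page_width):
    """Score each horizontal window by its L1 distance to the query.

    xs holds the x-coordinate of each keypoint on the page and words
    holds the visual word assigned to it. A lower score means the
    window is more similar to the query (assumed stand-in metric).
    """
    results = []
    for left in range(0, max(page_width - width, 0) + 1, step):
        inside = (xs >= left) & (xs < left + width)
        hist = bovw_histogram(words[inside], codebook_size)
        results.append((left, float(np.abs(hist - query_hist).sum())))
    return results
```

In this sketch the best candidate region is the window with the smallest score; the codebook size corresponds to the number of rows in `codebook`, which is the parameter whose influence the paper studies.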

Keywords


bag-of-visual-words; floating similarity distance; handwritten Arabic documents; scale-invariant feature transform; word spotting


DOI: http://doi.org/10.11591/ijece.v9i5.pp4311-4320

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578