Customized mask region based convolutional neural networks for un-uniformed shape text detection and text recognition

Ravikumar Hodikehosahally Channegowda; Palani Karthik; Raghavendra Srinivasaiah; Mahadev Shivaraj

doi:10.11591/ijece.v13i1.pp413-424

Customized mask region based convolutional neural networks for un-uniformed shape text detection and text recognition

Ravikumar Hodikehosahally Channegowda, Palani Karthik, Raghavendra Srinivasaiah, Mahadev Shivaraj

Abstract

In image scene, text contains high-level of important information that helps to analyze and consider the particular environment. In this paper, we adapt image mask and original identification of the mask region based convolutional neural networks (R-CNN) to allow recognition at 3 levels such as sequence, holistic and pixel-level semantics. Particularly, pixel and holistic level semantics can be utilized to recognize the texts and define the text shapes, respectively. Precisely, in mask and detection, we segment and recognize both character and word instances. Furthermore, we implement text detection through the outcome of instance segmentation on 2-D feature-space. Also, to tackle and identify the text issues of smaller and blurry texts, we consider text recognition by attention-based of optical character recognition (OCR) model with the mask R-CNN at sequential level. The OCR module is used to estimate character sequence through feature maps of the word instances in sequence to sequence. Finally, we proposed a fine-grained learning technique that trains a more accurate and robust model by learning models from the annotated datasets at the word level. Our proposed approach is evaluated on popular benchmark dataset ICDAR 2013 and ICDAR 2015.

Keywords

Deep neural networks; Optical character-recognition; Region based convolutional neural networks; Region of interest; Text detection; Text recognition

Full Text:

PDF

DOI: http://doi.org/10.11591/ijece.v13i1.pp413-424

Copyright (c) 2022 Ravikumar Hodikehosahally Channegowda, Palani Karthik, Raghavendra Srinivasaiah, Mahadev Shivaraj

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES).

Username
Password
Remember me