Recognition of Devanagari Scene Text Using Autoencoder CNN

Shiravale, Sankirti S.; Jayadevan, R.; Sannakki, Sanjeev S.

doi:10.5565/rev/elcvia.1344

Bibliographic citation -- Permanent link: https://ddd.uab.cat/record/237126

Scopus: 4 citations, Google Scholar: citations

Recognition of Devanagari Scene Text Using Autoencoder CNN
Shiravale, Sankirti S. (Marathwada Mitra Mandal's College of Engineering (Índia). Department of Computer Engineering)
Jayadevan, R. (Army Institute of Technology (Pune, Índia). Department of Computer Engineering)
Sannakki, Sanjeev S. (Gogte Institute of Technology (Belagavi, Índia). Department of Computer Science and Engineering)

Date:	2021
Abstract:	Scene text recognition is a well-rooted research domain covering a diverse application area. Recognition of scene text is challenging due to the complex nature of scene images. Various structural characteristics of the script also influence the recognition process. Text and background segmentation is a mandatory step in the scene text recognition process. A text recognition system produces the most accurate results if the structural and contextual information is preserved by the segmentation technique. Therefore, an attempt is made here to develop a robust foreground/background segmentation(separation) technique that produces the highest recognition results. A ground-truth dataset containing Devanagari scene text images is prepared for the experimentation. An encoder-decoder convolutional neural network model is used for text/background segmentation. The model is trained with Devanagari scene text images for pixel-wise classification of text and background. The segmented text is then recognized using an existing OCR engine (Tesseract). The word and character level recognition rates are computed and compared with other existing segmentation techniques to establish the effectiveness of the proposed technique.
Rights:	Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades.
Language:	Anglès
Document:	Article ; recerca ; Versió publicada
Subject:	Character and text recognition ; Scene text recognition ; Devanagari script ; OCR ; Segmentation technique ; Encoder-decoder CNN ; Computer vision ; Pattern recognition ; Image analysis and processing
Published in:	ELCVIA : Electronic Letters on Computer Vision and Image Analysis, Vol. 20 Núm. 1 (2021) , p. 55-69 (Regular Issue) , ISSN 1577-5097

Adreça original: https://elcvia.cvc.uab.es/article/view/v20-n1-Sannakki
DOI: 10.5565/rev/elcvia.1344

15 p, 1.4 MB

The record appears in these collections:
Articles > Published articles > ELCVIA
Articles > Research articles

Record created 2021-02-25, last modified 2022-02-05

Similar records

Add to personal basket
Export as Citation, BibTeX, MARC, MARCXML, DC, EDM OpenAire4