A systematic framework for Sanskrit character recognition using deep learning

Kore, Vrinda; G, Dhruva; Rao, Sahana; M, Vijitha; Preethi, P.

doi:10.5565/rev/elcvia.1850

Permanent link: https://ddd.uab.cat/record/311374


Kore, Vrinda (PES University (India))
G, Dhruva (PES University (India))
Rao, Sahana (PES University (India))
M, Vijitha (PES University (India))
Preethi, P. (PES University (India))

Date:	2025
Abstract:	Sanskrit is widely acknowledged to be among the world's oldest surviving classical languages, and yet its usage has continued to decline unabated in the present milieu. This erosion of popularity is directly attributable to the absence of native speakers of the language and the perceived inaccessibility of Sanskrit to contemporary audiences. Nevertheless, the language remains historically and culturally inseparable from the subcontinent, with numerous religious manuscripts, epigraphical inscriptions, edicts and scientific literature written in the Sanskrit script. Attempts to resuscitate the language have been largely unsuccessful because they have relied extensively on laborious human transcription and translation. Such manual endeavors can be superseded by computational techniques that facilitate the efficient transcription of voluminous manuscripts written in the Sanskrit script. The emergence of deep learning frameworks has enabled researchers to overcome the drawbacks of conventional machine learning algorithms in developing efficient and extensible character recognition systems. However, the advancement of character recognition frameworks varies across different Indic scripts. In this context, this paper introduces an extensible framework for the transcription of handwritten Sanskrit manuscripts. In the absence of a benchmark dataset of handwritten Sanskrit characters, the authors introduce a comprehensive dataset to facilitate further downstream segmentation. The dataset, on augmentation, comprises over a hundred thousand samples and has been collected from over a hundred individuals. The paper explores an integrated approach to segmentation and accordingly delineates a systematic methodology for effectively segmenting Sanskrit words, incorporating techniques such as thresholding, zone-based classification, median bisection and projection profiles. The proposed technique accommodates a diverse array of characters and modifiers present in the Sanskrit script. Subsequently, a concurrent deep learning architecture parallelizes transcription using neural networks (CNNs and residual networks). The deep learning models show accuracies exceeding 90%. This paper attempts to benchmark the significance of systematic approaches to machine transcription of low-resource languages.
Rights:	This document is subject to a Creative Commons license. Reproduction in whole or in part, distribution, and public communication of the work are permitted, provided that it is not for commercial purposes and that the authorship of the original work is acknowledged. The creation of derivative works is not permitted.
Language:	English
Document:	Article ; research ; Published version
Subject:	Sanskrit ; Devanagari ; Low-resource languages ; Transcription ; Optical character recognition ; Segmentation ; Deep learning ; Neural networks
Published in:	ELCVIA. Electronic letters on computer vision and image analysis, Vol. 24 No. 2 (2025), p. 81-103 (Regular Issue), ISSN 1577-5097

Original address: https://elcvia.cvc.uab.cat/article/view/1850
DOI: 10.5565/rev/elcvia.1850

23 p, 1.8 MB


Record created 2025-05-09, last modified 2025-11-23
