Off-line identifying Script Writers by Swin Transformers and ResNeSt-50
Kacem Echi, Afef 
(University of Tunis (Tunísia))
Ben Aïcha, Takwa 
(University of Tunis (Tunísia))
| Fecha: |
2024 |
| Resumen: |
In this work, we present two advanced models for identifying script writers, leveraging the power of deep learning. The proposed systems utilize the new vision Swin Transformer and ResNeSt-50. Swin Transformer is known for its robustness to variations and ability to model long-range dependencies, which helps capture context and make robust predictions. Through extensive training on large datasets of handwritten text samples, the Swin Transformer operates on sequences of image patches and learns to establish a robust representation of each writer's unique style. On the other hand, ResNeSt-50 (Residual Neural Network with Squeeze-and-Excitation (SE) and Next Stage modules), with its multiple layers, helps in learning complex representations of a writer's unique style and distinguishing between different writing styles with high precision. The SE module within ResNeSt helps the model focus on distinctive handwriting characteristics and reduce noise. The experimental results demonstrate exceptional performance, achieving an accuracy of 98. 50% (at patch level) by the Swin Transformer on the CVL database, which consists of images with cursively handwritten German and English texts, and an accuracy of 96. 61% (at page level) by ResNeSt-50 on the same database. This research advances writer identification by showcasing the effectiveness of the Swin Transformer and ResNeSt-50. The achieved accuracy underscores the potential of these models to process and understand complex handwriting effectively. |
| Derechos: |
Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades.  |
| Lengua: |
Anglès |
| Documento: |
Article ; recerca ; Versió publicada |
| Materia: |
Writer identification ;
Deep learning ;
Swin transformer ;
Resnest-50 ;
Handwriting analysis |
| Publicado en: |
ELCVIA. Electronic letters on computer vision and image analysis, Vol. 23 Núm. 1 (2024) , p. 15-31 (Regular Issue) , ISSN 1577-5097 |
Adreça original: https://elcvia.cvc.uab.cat/article/view/1787
Adreça alternativa: https://raco.cat/index.php/ELCVIA/article/view/980000001024
DOI: 10.5565/rev/elcvia.1787
El registro aparece en las colecciones:
Artículos >
Artículos publicados >
ELCVIAArtículos >
Artículos de investigación
Registro creado el 2024-06-08, última modificación el 2025-11-14