| Home > Articles > Published articles > Synthetic dataset of ID and Travel Documents |
| Date: | 2024 |
| Abstract: | This paper presents a new synthetic dataset of ID and travel documents, called SIDTD. The SIDTD dataset is created to help training and evaluating forged ID documents detection systems. Such a dataset has become a necessity as ID documents contain personal information and a public dataset of real documents can not be released. Moreover, forged documents are scarce, compared to legit ones, and the way they are generated varies from one fraudster to another resulting in a class of high intra-variability. In this paper we introduce a dataset, synthetically generated, that simulates the most common, and easiest, forgeries to be made by common users of ID documents and travel documents. The creation of this dataset will help to document image analysis community to progress in the task of automatic ID document verification in online onboarding systems. |
| Grants: | Agència de Gestió d'Ajuts Universitaris i de Recerca 2021/SGR-01499 Agencia Estatal de Investigación PID2021-126808OB-I00 European Commission 101018342 |
| Rights: | Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades. |
| Language: | Anglès |
| Document: | Article ; recerca ; Versió publicada |
| Subject: | Databases ; Engineering |
| Published in: | Scientific data, Vol. 11 (December 2024) , art. 1356, ISSN 2052-4463 |
10 p, 2.5 MB |