Multiple imputation using the average code from autoencoders

Macias Toro, Edwar Hernando; Serrano García, Javier; López Vicario, José; Morell Pérez, Antoni

doi:10.1016/j.cmpbup.2022.100053

Cita bibliogràfica -- Enllaç permanent: https://ddd.uab.cat/record/273547

Scopus: 0 cites, Google Scholar: cites

Multiple imputation using the average code from autoencoders
Macias Toro, Edwar Hernando

(Universitat Autònoma de Barcelona. Departament de Telecomunicació i Enginyeria de Sistemes)
Serrano García, Javier 1964-

(Universitat Autònoma de Barcelona. Departament de Telecomunicació i Enginyeria de Sistemes)
López Vicario, José

(Universitat Autònoma de Barcelona. Departament de Telecomunicació i Enginyeria de Sistemes)
Morell Pérez, Antoni

(Universitat Autònoma de Barcelona. Departament de Telecomunicació i Enginyeria de Sistemes)

Data:	2022
Resum:	Background: Missing information is a constant issue in the clinical setting. The presence of missing values (MV) is triggered by the wrong acquisition of data or sudden events in the patient's health condition. Imputation arises to replace the non-existent information with the twofold purpose of benefiting from existing information and reducing bias in clinical settings. Mechanisms based on deep learning and multiple imputation (MI) are leading alternatives to impute MVs because of their capacity to extract complex relationships and the consideration of uncertainty that MI adds. Objective: This study aims to improve the reconstruction of missing information through a novel imputation alternative that integrates a MI paradigm into deep learning models. Methods: The proposed method integrates the MI paradigm into the latent representations of an autoencoder, the so-called codes. The average code is then computed, boosting a better latent representation of data. Finally, the average code is decoded to reconstruct MVs. Results: The proposed method is tested in 6 datasets with different patters of MVs. It is compared with solutions based on autoencoders and generative adversarial networks. For the random appearance of MVs, the proposed method outperforms 97% of the scenarios with a reconstruction gain that ranges 1. 04-1. 45. For the other MVs mechanisms, the proposed method improves the reconstruction in at least 69% of the experiments, with a gain of 1. 13-1. 91. Conclusion: The findings of the proposed approach showed that the reconstructive capacity of the average code outperforms in most of the scenarios its competitors and close to the best solution in the rest of the scenarios. The integration of the MI paradigm into latent representations of data and the computation of average codes allow a more robust representation of the data and enables the enhancement of current state-of-the-art methods for high MVs rates.
Ajuts:	Agencia Estatal de Investigación TEC2017-84321-C4-4-R Agència de Gestió d'Ajuts Universitaris i de Recerca 2017/SGR-1670
Drets:	Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades.
Llengua:	Anglès
Document:	Article ; recerca ; Versió publicada
Matèria:	Average code ; Deep learning ; Autoencoder ; Multiple imputation
Publicat a:	Computer Methods and Programs in Biomedicine Update, Vol. 2 (February 2022) , art. 100053, ISSN 2666-9900

DOI: 10.1016/j.cmpbup.2022.100053

10 p, 6.1 MB

El registre apareix a les col·leccions:
Articles > Articles de recerca
Articles > Articles publicats

Registre creat el 2023-04-12, darrera modificació el 2023-04-17

Registres semblants

Afegeix-lo al cistell personal
Anomena i desa Citation, BibTeX, MARC, MARCXML, DC, EDM OpenAire4