Web of Science: 68 citations, Scopus: 79 citations, Google Scholar: citations,
The reliability of a deep learning model in clinical out-of-distribution MRI data : A multicohort study
Mårtensson, Gustav (Karolinska Institutet (Estocolm, Suècia))
Ferreira, Daniel (Karolinska Institutet (Estocolm, Suècia))
Granberg, Tobias (Karolinska University Hospital and Karolinska Institutet (Suecia))
Cavallin, Lena (Karolinska University Hospital and Karolinska Institutet (Suecia))
Oppedal, Ketil (University of Stavanger)
Padovani, Alessandra (University of Brescia)
Rektorova, Irena (Masaryk University)
Bonanni, Laura (University G d'Annunzio of Chieti-Pescara)
Pardini, Matteo (University of Genoa and Neurology Clinics)
Kramberger, Milica G. (Univerza V Ljubljani)
Taylor, John-Paul (Newcastle University)
Hort, Jakub (Charles University)
Snædal, Jón (Landspitali University Hospital (Reykjavík, Islàndia))
Kulisevsky, Jaime (Institut d'Investigació Biomèdica Sant Pau)
Blanc, Frederic (University of Strasbourg and French National Centre for Scientific Research (CNRS))
Antonini, Angelo (University of Padua)
Mecocci, Patrizia (University of Perugia)
Vellas, Bruno (University of Toulouse)
Tsolaki, Magda (3rd Aristotle University of Thessaloniki)
Kłoszewska, Iwona (Medical University of Lodz)
Soininen, H. (Kuopio University Hospital ( Finlàndia))
Lovestone, S. (University of Oxford)
Simmons, A. (King's College London)
Aarsland, Dag (King's College London)
Westman, Eric (King's College London)
Universitat Autònoma de Barcelona

Date: 2020
Abstract: Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical datasets-collected with different scanners, protocols and disease populations-and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, that had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact of including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to datasets acquired with similar protocols as the training data, but substantially worse in clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, by including data from a wider range of scanners and protocols the performance improved in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data prior to being certified for deployment.
Rights: Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, la comunicació pública de l'obra i la creació d'obres derivades, fins i tot amb finalitats comercials, sempre i quan es reconegui l'autoria de l'obra original. Creative Commons
Language: Anglès
Document: Article ; recerca ; Versió publicada
Published in: Medical Image Analysis, Vol. 66 (december 2020) , p. 101714, ISSN 1361-8423

DOI: 10.1016/j.media.2020.101714
PMID: 33007638


15 p, 294.1 KB

The record appears in these collections:
Research literature > UAB research groups literature > Research Centres and Groups (research output) > Health sciences and biosciences > Institut de Recerca Sant Pau
Articles > Research articles
Articles > Published articles

 Record created 2023-10-25, last modified 2024-05-01



   Favorit i Compartir