Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorValsson, Steinar
dc.contributor.authorArandjelović, Ognjen
dc.date.accessioned2022-01-18T13:30:01Z
dc.date.available2022-01-18T13:30:01Z
dc.date.issued2022-01-16
dc.identifier277506427
dc.identifierc139f940-02c6-4fd1-97b5-3595bc0eb77d
dc.identifier85129555506
dc.identifier.citationValsson , S & Arandjelović , O 2022 , ' Nuances of interpreting X-ray analysis by deep learning and lessons for reporting experimental findings ' , Sci , vol. 4 , no. 1 , 3 . https://doi.org/10.3390/sci4010003en
dc.identifier.issn2413-4155
dc.identifier.otherJisc: 2be91bf11ce84d66b39499fee0c24fd5
dc.identifier.urihttps://hdl.handle.net/10023/24693
dc.description.abstractWith the increase in the availability of annotated X-ray image data, there has been an accompanying and consequent increase in research on machine-learning-based, and ion particular deep-learning-based, X-ray image analysis. A major problem with this body of work lies in how newly proposed algorithms are evaluated. Usually, comparative analysis is reduced to the presentation of a single metric, often the area under the receiver operating characteristic curve (AUROC), which does not provide much clinical value or insight and thus fails to communicate the applicability of proposed models. In the present paper, we address this limitation of previous work by presenting a thorough analysis of a state-of-the-art learning approach and hence illuminate various weaknesses of similar algorithms in the literature, which have not yet been fully acknowledged and appreciated. Our analysis was performed on the ChestX-ray14 dataset, which has 14 lung disease labels and metainfo such as patient age, gender, and the relative X-ray direction. We examined the diagnostic significance of different metrics used in the literature including those proposed by the International Medical Device Regulators Forum, and present the qualitative assessment of the spatial information learned by the model. We show that models that have very similar AUROCs can exhibit widely differing clinical applicability. As a result, our work demonstrates the importance of detailed reporting and analysis of the performance of machine-learning approaches in this field, which is crucial both for progress in the field and the adoption of such models in practice.
dc.format.extent13
dc.format.extent3083931
dc.language.isoeng
dc.relation.ispartofScien
dc.subjectRoentgenen
dc.subjectChesten
dc.subjectDiseaseen
dc.subjectThoraxen
dc.subjectErroren
dc.subjectLabelen
dc.subjectQA76 Computer softwareen
dc.subjectRC Internal medicineen
dc.subjectDASen
dc.subjectMCCen
dc.subject.lccQA76en
dc.subject.lccRCen
dc.titleNuances of interpreting X-ray analysis by deep learning and lessons for reporting experimental findingsen
dc.typeJournal articleen
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen
dc.identifier.doi10.3390/sci4010003
dc.description.statusPeer revieweden


This item appears in the following Collection(s)

Show simple item record