Leveraging foundation models for enhanced detection of colorectal cancer biomarkers in small datasets

Myles, Craig; Um, In Hwa; Harrison, David James; Harris-Birtill, David Cameron Christopher

Files in this item

Name: Myles-2024-Leveraging-foundation-models-MIUA2024-AAM-CCBY.pdf
Size: 13.76Mb
Format: PDF

Item metadata

dc.contributor.author	Myles, Craig
dc.contributor.author	Um, In Hwa
dc.contributor.author	Harrison, David James
dc.contributor.author	Harris-Birtill, David Cameron Christopher
dc.contributor.editor	Yap, Moi Hoon
dc.contributor.editor	Kendrick, Connah
dc.contributor.editor	Behera, Ardhendu
dc.contributor.editor	Cootes, Timothy
dc.contributor.editor	Zwiggelaar, Reyer
dc.date.accessioned	2024-07-24T16:30:13Z
dc.date.available	2024-07-24T16:30:13Z
dc.date.issued	2024-07-24
dc.identifier	305637543
dc.identifier	5dda1d39-6620-4bc5-b2d9-c70a5809fd5b
dc.identifier.citation	Myles, C, Um, I H, Harrison, D J & Harris-Birtill, D C C 2024, Leveraging foundation models for enhanced detection of colorectal cancer biomarkers in small datasets. in M H Yap, C Kendrick, A Behera, T Cootes & R Zwiggelaar (eds), Medical image understanding and analysis: 28th annual conference, MIUA 2024, Manchester, UK, July 24–26, 2024, proceedings, part I. Lecture notes in computer science, vol. 14859, Springer, Cham, pp. 329-343, Medical Image Understanding and Analysis: 28th Annual Event, Manchester, United Kingdom, 24/07/24. https://doi.org/10.1007/978-3-031-66955-2_23	en
dc.identifier.citation	conference	en
dc.identifier.isbn	9783031669545
dc.identifier.isbn	9783031669552
dc.identifier.issn	0302-9743
dc.identifier.uri	https://hdl.handle.net/10023/30262
dc.description	Funding: This work is supported in part by the Industrial Centre for AI Research in Digital Diagnostics (iCAIRD) which is funded by Innovate UK on behalf of UK Research and Innovation (UKRI) (project number 104690)	en
dc.description.abstract	Colorectal cancer is the second leading cause of cancer death worldwide. Its high incidence and mortality rate highlight the critical role of advanced diagnostics and early detection methods. Advancements in computational pathology can significantly enhance diagnostic precision and treatment personalisation, ultimately improving patient outcomes. Hospitals and labs globally are transitioning toward routine whole slide image (WSI) digitisation. This digitisation process generates large volumes of data, offering an opportunity to enhance diagnostic capabilities through the use of machine learning techniques such as weakly supervised learning and self-supervised learning (SSL). This study evaluates the performance of state-of-the-art SSL feature extractor foundation models (CTransPath, Phikon, and UNI) against a pretrained ResNet-50, which serves as a benchmark. Our Transformer network analyses these feature vectors, focusing on their efficacy in predicting key colorectal cancer biomarkers within a small dataset containing 423 WSIs with only 8% of cases exhibiting mismatch repair (MMR) deficiency. The CTransPath model achieved the highest validation AUROC of 0.9466 for MMR classification but exhibited a test AUROC of 0.6880, demonstrating significant variability. In contrast, the UNI model demonstrated greater consistency and robustness, achieving a test AUROC of 0.7136, which additionally represents a 6.3% improvement over ResNet-50’s test AUROC of 0.6709. The results highlight the feasibility of using advanced machine learning models with smaller, sparsely annotated datasets, though the variability noted in some models underscores the challenges at the edge of data scarcity. Code and the experimental framework are available at https://github.com/CraigMyles/SurGen-CRC-Arena.
dc.format.extent	14431088
dc.language.iso	eng
dc.publisher	Springer
dc.relation.ispartof	Medical image understanding and analysis	en
dc.relation.ispartofseries	Lecture notes in computer science	en
dc.subject	Digital pathology	en
dc.subject	Machine learning	en
dc.subject	Transformer	en
dc.subject	Deep learning	en
dc.subject	Slide-level classification	en
dc.subject	Mismatch repair (MMR)	en
dc.subject	BRAF mutation	en
dc.subject	RAS mutation	en
dc.subject	Survival prediction	en
dc.subject	RB Pathology	en
dc.subject	E	en
dc.subject.lcc	RB	en
dc.title	Leveraging foundation models for enhanced detection of colorectal cancer biomarkers in small datasets	en
dc.type	Conference item	en
dc.contributor.sponsor	Innovate UK	en
dc.contributor.institution	University of St Andrews. School of Computer Science	en
dc.contributor.institution	University of St Andrews. School of Medicine	en
dc.identifier.doi	https://doi.org/10.1007/978-3-031-66955-2_23
dc.identifier.grantnumber	TS/S013121/1	en

This item appears in the following Collection(s)

University of St Andrews Research