Statistical underpinning of mutational signature analyses of cancer sequencing data

Velasco Pardo, Víctor

Show simple item record

Files in this item

Name:: Thesis-Victor-Velasco-Pardo-complete-version.pdf
Size:: 17.96Mb
Format:: PDF
Description:: Complete version

View/Open

Name:: Thesis-Victor-Velasco-Pardo-complete-version.zip
Size:: 287.0Mb
Format:: application/zip
Description:: Complete version (Preservation copy)

View/Open

Item metadata

dc.contributor.advisor	Lynch, Andy G.
dc.contributor.advisor	Papathomas, Michail
dc.contributor.author	Velasco Pardo, Víctor
dc.coverage.spatial	162	en_US
dc.date.accessioned	2024-05-13T13:40:48Z
dc.date.available	2024-05-13T13:40:48Z
dc.date.issued	2024-06-11
dc.identifier.uri	https://hdl.handle.net/10023/29876
dc.description.abstract	Cancer is a disease driven and characterised by mutations in the DNA. Thanks to massively parallel sequencing technologies, it is now possible to obtain the sequence of a cancer genome. The advent of modern sequencing technologies has allowed researchers to study the mutations involved in tumour development. More recently, attention has been drawn to the `passenger' mutations that are not involved in tumour development but bear fingerprints of the mutational processes that have been operative over a patient's lifetime. Those fingerprints, termed mutational signatures, appear consistently across cancer genomes that have been exposed to the underlying mutational processes. Computational analyses have identified over a hundred such signatures, and it is now possible to estimate the relative prevalence of mutational signatures in a cancer genome. Both types of analyses are perhaps unique in the medical literature, in that no confidence intervals or other representations of uncertainty are demanded when reporting the results. In this thesis, we address the problem of quantifying uncertainty around the reported mutational signatures and their relative prevalence in individual tumours. First, in Chapter 2, we review the available computational methods for mutational signature analyses, assessing the potential of existing approaches to characterise uncertainty. Then, in Chapter 3, we annotate ten statistical challenges. The remainder of the thesis is built on the aim of addressing some of those challenges. To estimate the relative prevalence of mutational signatures in individual tumours, a method that quantifies the uncertainty around the estimated solution is lacking. Moreover, those analyses assume that the true values for the signatures are `known' as they are propagated from previous analyses. In Chapter 4, we suggest a setting where the signatures are `partially known'. We propose a novel approach for this problem, in a Bayesian setting, providing credible intervals around the estimated solution, propagating prior uncertainty regarding `partially known' signatures, and updating prior beliefs about them. Estimation of mutational signatures is often performed in a matrix factorisation setting that is not fully probabilistic. While an alternative fully probabilistic approach is available, a post-processing method is needed to characterise the uncertainty around the reported solution. In Chapter 5, we introduce a novel post-processing approach to quantify uncertainty around the mutational signatures estimated in a cohort of cancer patients, along with software that allows investigators to use the proposed method and visualise results.	en_US
dc.language.iso	en	en_US
dc.subject	Cancer genomics	en_US
dc.subject	Mutational signatures	en_US
dc.subject	Bioinformatics	en_US
dc.subject	Biostatistics	en_US
dc.subject	Bayesian statistics	en_US
dc.title	Statistical underpinning of mutational signature analyses of cancer sequencing data	en_US
dc.type	Thesis	en_US
dc.contributor.sponsor	Melville Trust	en_US
dc.type.qualificationlevel	Doctoral	en_US
dc.type.qualificationname	PhD Doctor of Philosophy	en_US
dc.publisher.institution	The University of St Andrews	en_US
dc.rights.embargodate	2025-05-08
dc.rights.embargoreason	Thesis restricted in accordance with University regulations. Restricted until 8 May 2025	en
dc.identifier.doi	https://doi.org/10.17630/sta/912

This item appears in the following Collection(s)

Show simple item record