Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.advisorKosiol, Carolin
dc.contributor.authorMikula, Lynette Caitlin
dc.coverage.spatial224en_US
dc.date.accessioned2023-06-29T10:12:12Z
dc.date.available2023-06-29T10:12:12Z
dc.date.issued2023-11-28
dc.identifier.urihttps://hdl.handle.net/10023/27841
dc.description.abstractSingle nucleotide polymorphisms in samples of DNA sequences from one or multiple populations can be summarised as site frequency spectra. Since polymorphic sites are known to be predominantly biallelic, models for the evolution of allele frequencies that assume low scaled mutation rates are justified. The biallelic boundary-mutation Moran model with reversible mutations (BMM) arises as an approximation to the classic Moran model under this consideration, and it underpins this PhD thesis. In the introduction, the BMM is presented as a mathematically tractable model that is e cient in its use of site frequency data for inferring mutation and selection parameters. Chapter 2 of this thesis extends the BMM to include balancing selection, in addition to biased mutations and a directional component (e.g., directional selection or biased gene conversion). In Chapter 3, discrete and stochastic demographic changes are incorporated into the spectral representation of the neutral BMM. A Hidden Markov Model inspired approach is used to simulate sample spectra under di↵erent scenarios, and propose a new inference method. A novel class of Hidden Markov Models with ordered hidden states and emission densities (oHMMed) is introduced in Chapter 4 alongside the source code of a corresponding R-package. In Chapter 5, oHMMed is used to annotate the genome of orangutans according to average levels of GC content and recombination rates. Site frequency spectra of similar regions are subjected to Markov Chain Monte Carlo analyses based on the BMM, and to demographic inference per Chapter 3. They are further characterised by structural genomic features. Overall, this provides a quantification of how biased gene conversion and recombination shape the background variation in hominid site frequency data. Utilised conjointly, the methods developed in this thesis could help inform an extended null model of evolution, and improve genome scans.en_US
dc.language.isoenen_US
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectProbabilistic models for genetic evolutionen_US
dc.subjectDiffusion model for population demography (orthogonal polynomial diffusion)en_US
dc.subjectMoran model with low-scaled mutation rates and selection (directional and balancing)en_US
dc.subjectPopulation genetic landscape of orangutans (GC content and recombination)en_US
dc.subjectGenome segmentation algorithm (ordered Hidden Markov Model with emission densities)en_US
dc.titleModelling and inferring neutral and non-neutral forces shaping genomic site frequenciesen_US
dc.typeThesisen_US
dc.contributor.sponsorUniversity of St Andrews. School of Biologyen_US
dc.contributor.sponsorVienna Science and Technology Fund (WWTF)en_US
dc.type.qualificationlevelDoctoralen_US
dc.type.qualificationnamePhD Doctor of Philosophyen_US
dc.publisher.institutionThe University of St Andrewsen_US
dc.rights.embargodate2025-06-22
dc.rights.embargoreasonThesis restricted in accordance with University regulations. Restricted until 22nd June 2025en
dc.identifier.doihttps://doi.org/10.17630/sta/522
dc.identifier.grantnumber10.47379/MA16061 (WWTF)en_US


The following licence files are associated with this item:

    This item appears in the following Collection(s)

    Show simple item record

    Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International
    Except where otherwise noted within the work, this item's licence for re-use is described as Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International