Show simple item record

Files in this item


Item metadata

dc.contributor.authorQin, Xinghu
dc.contributor.authorGaggiotti, Oscar E.
dc.identifier.citationQin , X & Gaggiotti , O E 2022 , ' Information-based summary statistics for spatial genetic structure inference ' , Molecular Ecology Resources , vol. 22 , no. 6 , 13606 , pp. 2183-2195 .
dc.identifier.otherPURE: 278225874
dc.identifier.otherPURE UUID: 0da928e7-4a67-4dc9-b06e-be13c593a3f3
dc.identifier.otherRIS: urn:22F134811AD2B9E3851C1A2285D9F704
dc.identifier.otherORCID: /0000-0003-1827-1493/work/110423273
dc.identifier.otherORCID: /0000-0003-2351-3610/work/110423323
dc.identifier.otherScopus: 85126948222
dc.identifier.otherWOS: 000772351300001
dc.descriptionXHQ was supported by China Scholarship Council.en
dc.description.abstractThe measurement of biodiversity at all levels of organization is an essential first step to understand the ecological and evolutionary processes that drive spatial patterns of biodiversity. Ecologists have explored the use of a large range of different summary statistics and have come to the view that information-based summary statistics, and in particular so-called Hill numbers, are a useful tool to measure biodiversity. Population geneticists, on the other hand, have focused largely on summary statistics based on heterozygosity and measures of allelic richness. However, recent studies proposed the adoption of information-based summary statistics in population genetics studies. Here, we performed a comprehensive assessment of the power of this family of summary statistics to inform regarding spatial patterns of genetic diversity and we compared it with that of traditional population genetics approaches, namely measures based on allelic richness and heterozygosity. To give an unbiased evaluation, we used three machine learning methods to test the performance of different sets of summary statistics to discriminate between spatial scenarios. We defined three distinct sets, (i) one based on allelic richness measures which included the Jaccard index, (ii) a set based on heterozygosity that included FST and (iii) a set based on Hill numbers derived from Shannon entropy, which included the recently proposed Shannon differentiation, ΔD. The results showed that the last of these performed as well or, under some specific spatial scenarios, even better than the traditional population genetics measures. Interestingly, we found that a rarely or never used genetic differentiation measure based on allelic richness, Jaccard dissimilarity (J), showed the highest discriminatory power to discriminate among spatial scenarios, followed by Shannon differentiation ΔD. We concluded, therefore, that information-based measures as well as Jaccard dissimilarity represent excellent additions to the population genetics toolkit.
dc.relation.ispartofMolecular Ecology Resourcesen
dc.rightsCopyright © 2022 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.en
dc.subjectInformation-based statisticsen
dc.subjectPopulation geneticsen
dc.subjectSpatial structureen
dc.subjectQA Mathematicsen
dc.subjectQH301 Biologyen
dc.subjectQH426 Geneticsen
dc.titleInformation-based summary statistics for spatial genetic structure inferenceen
dc.typeJournal articleen
dc.description.versionPublisher PDFen
dc.contributor.institutionUniversity of St Andrews. School of Biologyen
dc.contributor.institutionUniversity of St Andrews. St Andrews Bioinformatics Uniten
dc.contributor.institutionUniversity of St Andrews. Scottish Oceans Instituteen
dc.contributor.institutionUniversity of St Andrews. Marine Alliance for Science & Technology Scotlanden
dc.description.statusPeer revieweden

This item appears in the following Collection(s)

Show simple item record