Deciphering signatures of natural selection via deep learning
Abstract
Identifying genomic regions influenced by natural selection provides fundamental insights into the genetic basis of local adaptation. However, it remains challenging to detect loci under complex spatially varying selection. We propose a deep learning-based framework, DeepGenomeScan, which can detect signatures of spatially varying selection. We demonstrate that DeepGenomeScan outperformed principal component analysis- and redundancy analysis-based genome scans in identifying loci underlying quantitative traits subject to complex spatial patterns of selection. Noticeably, DeepGenomeScan increases statistical power by up to 47.25% under nonlinear environmental selection patterns. We applied DeepGenomeScan to a European human genetic dataset and identified some well-known genes under selection and a substantial number of clinically important genes that were not identified by SPA, iHS, Fst and Bayenv when applied to the same dataset.
Citation
Qin , X , Chiang , C & Gaggiotti , O E 2022 , ' Deciphering signatures of natural selection via deep learning ' , Briefings in Bioinformatics , vol. 23 , no. 5 , bbac354 . https://doi.org/10.1093/bib/bbac354
Publication
Briefings in Bioinformatics
Status
Peer reviewed
ISSN
1467-5463Type
Journal article
Rights
Copyright © The Author(s) 2022. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
Description
XQ was supported by a PhD scholarship from the China Scholarship Council and now is supported by International Postdoctoral Exchange Fellowship Program (Talent-Introduction Program) from China Postdoc Council. CWKC is supported in part by National Institute of General Medical Sciences (NIGMS) of the National Institute of Health (award number R35GM142783). Computation for this work is supported in part by USC’s Center for Advanced Research Computing (https://www.carc.usc.edu/).Collections
Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.