Fast and accurate deep learning scans for signatures of natural selection in genomes using FASTER-NN

Abstract Deep learning classification models based on Convolutional Neural Networks (CNNs) are increasingly used in population genetic inference for detecting signatures of natural selection. Prevailing detection methods treat the design of the classifier as a discrete phase, assuming that high clas...

Full description

Saved in:
Bibliographic Details
Main Authors: Sjoerd van den Belt, Nikolaos Alachiotis
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Communications Biology
Online Access:https://doi.org/10.1038/s42003-025-07480-7
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Deep learning classification models based on Convolutional Neural Networks (CNNs) are increasingly used in population genetic inference for detecting signatures of natural selection. Prevailing detection methods treat the design of the classifier as a discrete phase, assuming that high classification accuracy is the sole prerequisite for precise detection. This frequently steers method development toward classification-driven optimizations that can inadvertently impede detection. We present FASTER-NN, a CNN classifier designed specifically for the precise detection of natural selection. It has higher sensitivity than state-of-the-art CNN classifiers while only processing allele frequencies and genomic positions through dilated convolutions to maximize data reuse. As a result, execution time is invariant to the sample size and the chromosome length, creating a highly suitable solution for large-scale, whole-genome scans. Furthermore, FASTER-NN can accurately identify selective sweeps in recombination hotspots, which is a highly challenging detection problem with very limited theoretical treatment to date.
ISSN:2399-3642