Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.

Polygenic prediction of complex trait phenotypes has become important in human genetics, especially in the context of precision medicine. Recently, mr.mash, a flexible and computationally efficient method that models multiple phenotypes jointly and leverages sharing of effects across such phenotypes...

Full description

Saved in:
Bibliographic Details
Main Authors: Deborah Kunkel, Peter Sørensen, Vijay Shankar, Fabio Morgante
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS Genetics
Online Access:https://doi.org/10.1371/journal.pgen.1011519
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832540333495287808
author Deborah Kunkel
Peter Sørensen
Vijay Shankar
Fabio Morgante
author_facet Deborah Kunkel
Peter Sørensen
Vijay Shankar
Fabio Morgante
author_sort Deborah Kunkel
collection DOAJ
description Polygenic prediction of complex trait phenotypes has become important in human genetics, especially in the context of precision medicine. Recently, mr.mash, a flexible and computationally efficient method that models multiple phenotypes jointly and leverages sharing of effects across such phenotypes to improve prediction accuracy, was introduced. However, a drawback of mr.mash is that it requires individual-level data, which are often not publicly available. In this work, we introduce mr.mash-rss, an extension of the mr.mash model that requires only summary statistics from Genome-Wide Association Studies (GWAS) and linkage disequilibrium (LD) estimates from a reference panel. By using summary data, we achieve the twin goal of increasing the applicability of the mr.mash model to data sets that are not publicly available and making it scalable to biobank-size data. Through simulations, we show that mr.mash-rss is competitive with, and often outperforms, current state-of-the-art methods for single- and multi-phenotype polygenic prediction in a variety of scenarios that differ in the pattern of effect sharing across phenotypes, the number of phenotypes, the number of causal variants, and the genomic heritability. We also present a real data analysis of 16 blood cell phenotypes in the UK Biobank, showing that mr.mash-rss achieves higher prediction accuracy than competing methods for the majority of traits, especially when the data set has smaller sample size.
format Article
id doaj-art-8bd454d6499c44b7af1ef19f3beccf7a
institution Kabale University
issn 1553-7390
1553-7404
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj-art-8bd454d6499c44b7af1ef19f3beccf7a2025-02-05T05:31:01ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042025-01-01211e101151910.1371/journal.pgen.1011519Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.Deborah KunkelPeter SørensenVijay ShankarFabio MorgantePolygenic prediction of complex trait phenotypes has become important in human genetics, especially in the context of precision medicine. Recently, mr.mash, a flexible and computationally efficient method that models multiple phenotypes jointly and leverages sharing of effects across such phenotypes to improve prediction accuracy, was introduced. However, a drawback of mr.mash is that it requires individual-level data, which are often not publicly available. In this work, we introduce mr.mash-rss, an extension of the mr.mash model that requires only summary statistics from Genome-Wide Association Studies (GWAS) and linkage disequilibrium (LD) estimates from a reference panel. By using summary data, we achieve the twin goal of increasing the applicability of the mr.mash model to data sets that are not publicly available and making it scalable to biobank-size data. Through simulations, we show that mr.mash-rss is competitive with, and often outperforms, current state-of-the-art methods for single- and multi-phenotype polygenic prediction in a variety of scenarios that differ in the pattern of effect sharing across phenotypes, the number of phenotypes, the number of causal variants, and the genomic heritability. We also present a real data analysis of 16 blood cell phenotypes in the UK Biobank, showing that mr.mash-rss achieves higher prediction accuracy than competing methods for the majority of traits, especially when the data set has smaller sample size.https://doi.org/10.1371/journal.pgen.1011519
spellingShingle Deborah Kunkel
Peter Sørensen
Vijay Shankar
Fabio Morgante
Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
PLoS Genetics
title Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_full Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_fullStr Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_full_unstemmed Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_short Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_sort improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes
url https://doi.org/10.1371/journal.pgen.1011519
work_keys_str_mv AT deborahkunkel improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes
AT petersørensen improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes
AT vijayshankar improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes
AT fabiomorgante improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes