Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.

Polygenic prediction of complex trait phenotypes has become important in human genetics, especially in the context of precision medicine. Recently, mr.mash, a flexible and computationally efficient method that models multiple phenotypes jointly and leverages sharing of effects across such phenotypes...

Full description

Saved in:

Bibliographic Details
Main Authors:	Deborah Kunkel, Peter Sørensen, Vijay Shankar, Fabio Morgante
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2025-01-01
Series:	PLoS Genetics
Online Access:	https://doi.org/10.1371/journal.pgen.1011519
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832540333495287808
author	Deborah Kunkel Peter Sørensen Vijay Shankar Fabio Morgante
author_facet	Deborah Kunkel Peter Sørensen Vijay Shankar Fabio Morgante
author_sort	Deborah Kunkel
collection	DOAJ
description	Polygenic prediction of complex trait phenotypes has become important in human genetics, especially in the context of precision medicine. Recently, mr.mash, a flexible and computationally efficient method that models multiple phenotypes jointly and leverages sharing of effects across such phenotypes to improve prediction accuracy, was introduced. However, a drawback of mr.mash is that it requires individual-level data, which are often not publicly available. In this work, we introduce mr.mash-rss, an extension of the mr.mash model that requires only summary statistics from Genome-Wide Association Studies (GWAS) and linkage disequilibrium (LD) estimates from a reference panel. By using summary data, we achieve the twin goal of increasing the applicability of the mr.mash model to data sets that are not publicly available and making it scalable to biobank-size data. Through simulations, we show that mr.mash-rss is competitive with, and often outperforms, current state-of-the-art methods for single- and multi-phenotype polygenic prediction in a variety of scenarios that differ in the pattern of effect sharing across phenotypes, the number of phenotypes, the number of causal variants, and the genomic heritability. We also present a real data analysis of 16 blood cell phenotypes in the UK Biobank, showing that mr.mash-rss achieves higher prediction accuracy than competing methods for the majority of traits, especially when the data set has smaller sample size.
format	Article
id	doaj-art-8bd454d6499c44b7af1ef19f3beccf7a
institution	Kabale University
issn	1553-7390 1553-7404
language	English
publishDate	2025-01-01
publisher	Public Library of Science (PLoS)
record_format	Article
series	PLoS Genetics
spelling	doaj-art-8bd454d6499c44b7af1ef19f3beccf7a2025-02-05T05:31:01ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042025-01-01211e101151910.1371/journal.pgen.1011519Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.Deborah KunkelPeter SørensenVijay ShankarFabio MorgantePolygenic prediction of complex trait phenotypes has become important in human genetics, especially in the context of precision medicine. Recently, mr.mash, a flexible and computationally efficient method that models multiple phenotypes jointly and leverages sharing of effects across such phenotypes to improve prediction accuracy, was introduced. However, a drawback of mr.mash is that it requires individual-level data, which are often not publicly available. In this work, we introduce mr.mash-rss, an extension of the mr.mash model that requires only summary statistics from Genome-Wide Association Studies (GWAS) and linkage disequilibrium (LD) estimates from a reference panel. By using summary data, we achieve the twin goal of increasing the applicability of the mr.mash model to data sets that are not publicly available and making it scalable to biobank-size data. Through simulations, we show that mr.mash-rss is competitive with, and often outperforms, current state-of-the-art methods for single- and multi-phenotype polygenic prediction in a variety of scenarios that differ in the pattern of effect sharing across phenotypes, the number of phenotypes, the number of causal variants, and the genomic heritability. We also present a real data analysis of 16 blood cell phenotypes in the UK Biobank, showing that mr.mash-rss achieves higher prediction accuracy than competing methods for the majority of traits, especially when the data set has smaller sample size.https://doi.org/10.1371/journal.pgen.1011519
spellingShingle	Deborah Kunkel Peter Sørensen Vijay Shankar Fabio Morgante Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes. PLoS Genetics
title	Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_full	Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_fullStr	Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_full_unstemmed	Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_short	Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.
title_sort	improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes
url	https://doi.org/10.1371/journal.pgen.1011519
work_keys_str_mv	AT deborahkunkel improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes AT petersørensen improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes AT vijayshankar improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes AT fabiomorgante improvingpolygenicpredictionfromsummarydatabylearningpatternsofeffectsharingacrossmultiplephenotypes

Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.

Similar Items