Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.

Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships...

Full description

Saved in:
Bibliographic Details
Main Authors: Ming-Huei Chen, Jie Huang, Wei-Min Chen, Martin G Larson, Caroline S Fox, Ramachandran S Vasan, Sudha Seshadri, Christopher J O'Donnell, Qiong Yang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0051589&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849727627807424512
author Ming-Huei Chen
Jie Huang
Wei-Min Chen
Martin G Larson
Caroline S Fox
Ramachandran S Vasan
Sudha Seshadri
Christopher J O'Donnell
Qiong Yang
author_facet Ming-Huei Chen
Jie Huang
Wei-Min Chen
Martin G Larson
Caroline S Fox
Ramachandran S Vasan
Sudha Seshadri
Christopher J O'Donnell
Qiong Yang
author_sort Ming-Huei Chen
collection DOAJ
description Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships to infer genotypes of un-genotyped individuals. Using 8998 Framingham Heart Study (FHS) participants genotyped with Affymetrix 550K SNPs, we imputed genotypes of same set of SNPs for additional 3121 participants, most of whom were never genotyped due to lack of DNA sample. Prior to imputation, 122 pedigrees were too large to be handled by the imputation software Merlin. Therefore, we developed a novel pedigree splitting algorithm that can maximize the number of genotyped relatives for imputing each un-genotyped individual, while keeping new sub-pedigrees under a pre-specified size. In GWAS of four phenotypes available in FHS (Alzheimer disease, circulating levels of fibrinogen, high-density lipoprotein cholesterol, and uric acid), we compared results using genotyped individuals only with results using both genotyped and imputed individuals. We studied the impact of applying different imputation quality filtering thresholds on the association results and did not found a universal threshold that always resulted in a more significant p-value for previously identified loci. However most of these loci had a lower p-value when we only included imputed genotypes with with ≥60% SNP- and ≥50% person-specific imputation certainty. In summary, we developed a novel algorithm for splitting large pedigrees for imputation and found a plausible imputation quality filtering threshold based on FHS. Further examination may be required to generalize this threshold to other studies.
format Article
id doaj-art-bda47f7fafd24ae3b52884aa85d91a54
institution DOAJ
issn 1932-6203
language English
publishDate 2012-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-bda47f7fafd24ae3b52884aa85d91a542025-08-20T03:09:47ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-01712e5158910.1371/journal.pone.0051589Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.Ming-Huei ChenJie HuangWei-Min ChenMartin G LarsonCaroline S FoxRamachandran S VasanSudha SeshadriChristopher J O'DonnellQiong YangImputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships to infer genotypes of un-genotyped individuals. Using 8998 Framingham Heart Study (FHS) participants genotyped with Affymetrix 550K SNPs, we imputed genotypes of same set of SNPs for additional 3121 participants, most of whom were never genotyped due to lack of DNA sample. Prior to imputation, 122 pedigrees were too large to be handled by the imputation software Merlin. Therefore, we developed a novel pedigree splitting algorithm that can maximize the number of genotyped relatives for imputing each un-genotyped individual, while keeping new sub-pedigrees under a pre-specified size. In GWAS of four phenotypes available in FHS (Alzheimer disease, circulating levels of fibrinogen, high-density lipoprotein cholesterol, and uric acid), we compared results using genotyped individuals only with results using both genotyped and imputed individuals. We studied the impact of applying different imputation quality filtering thresholds on the association results and did not found a universal threshold that always resulted in a more significant p-value for previously identified loci. However most of these loci had a lower p-value when we only included imputed genotypes with with ≥60% SNP- and ≥50% person-specific imputation certainty. In summary, we developed a novel algorithm for splitting large pedigrees for imputation and found a plausible imputation quality filtering threshold based on FHS. Further examination may be required to generalize this threshold to other studies.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0051589&type=printable
spellingShingle Ming-Huei Chen
Jie Huang
Wei-Min Chen
Martin G Larson
Caroline S Fox
Ramachandran S Vasan
Sudha Seshadri
Christopher J O'Donnell
Qiong Yang
Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.
PLoS ONE
title Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.
title_full Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.
title_fullStr Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.
title_full_unstemmed Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.
title_short Using family-based imputation in genome-wide association studies with large complex pedigrees: the Framingham Heart Study.
title_sort using family based imputation in genome wide association studies with large complex pedigrees the framingham heart study
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0051589&type=printable
work_keys_str_mv AT minghueichen usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT jiehuang usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT weiminchen usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT martinglarson usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT carolinesfox usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT ramachandransvasan usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT sudhaseshadri usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT christopherjodonnell usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT qiongyang usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy