Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory system

Abstract Congenital heart disease (CHD) is a prevalent condition characterized by defective heart development, causing premature death and stillbirths among infants. Genome-wide association studies (GWASs) have provided insights into the role of genetic variants in CHD pathogenesis through the ident...

Full description

Saved in:
Bibliographic Details
Main Authors: Shikha Vashisht, Costantino Parisi, Cecilia L. Winata
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-025-11232-6
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832585978690142208
author Shikha Vashisht
Costantino Parisi
Cecilia L. Winata
author_facet Shikha Vashisht
Costantino Parisi
Cecilia L. Winata
author_sort Shikha Vashisht
collection DOAJ
description Abstract Congenital heart disease (CHD) is a prevalent condition characterized by defective heart development, causing premature death and stillbirths among infants. Genome-wide association studies (GWASs) have provided insights into the role of genetic variants in CHD pathogenesis through the identification of a comprehensive set of single-nucleotide polymorphisms (SNPs). Notably, 90–95% of these variants reside in the noncoding genome, complicating the understanding of their underlying mechanisms. Here, we developed a systematic computational pipeline for the identification and analysis of CHD-associated SNPs spanning both coding and noncoding regions of the genome. Initially, we curated a thorough dataset of SNPs from GWAS-catalog and ClinVar database and filtered them based on CHD-related traits. Subsequently, these CHD-SNPs were annotated and categorized into noncoding and coding regions based on their location. To study the functional implications of noncoding CHD-SNPs, we cross-validated them with enhancer-specific histone modification marks from developing human heart across 9 Carnegie stages and identified potential cardiac enhancers. This approach led to the identification of 2,056 CHD-associated putative enhancers (CHD-enhancers), 38.9% of them overlapping with known enhancers catalogued in human enhancer disease database. We identified heart-related transcription factor binding sites within these CHD-enhancers, offering insights into the impact of SNPs on TF binding. Conservation analysis further revealed that many of these CHD-enhancers were highly conserved across vertebrates, suggesting their evolutionary significance. Utilizing heart-specific expression quantitative trait loci data, we further identified a subset of 63 CHD-SNPs with regulatory potential distributed across various cardiac tissues. Concurrently, coding CHD-SNPs were represented as a protein interaction network and its subsequent binding energy analysis focused on a pair of proteins within this network, pinpointed a deleterious coding CHD-SNP, rs770030288, located in C2 domain of MYBPC3 protein. Overall, our findings demonstrate that SNPs have the potential to disrupt gene regulatory systems, either by affecting enhancer sequences or modulating protein-protein interactions, which can lead to abnormal developmental processes contributing to CHD pathogenesis.
format Article
id doaj-art-37166f3175274b8990d6c061c5252edf
institution Kabale University
issn 1471-2164
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj-art-37166f3175274b8990d6c061c5252edf2025-01-26T12:16:34ZengBMCBMC Genomics1471-21642025-01-0126111710.1186/s12864-025-11232-6Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory systemShikha Vashisht0Costantino Parisi1Cecilia L. Winata2International Institute of Molecular and Cell Biology in Warsaw, Laboratory of Zebrafish Developmental GenomicsInternational Institute of Molecular and Cell Biology in Warsaw, Laboratory of Zebrafish Developmental GenomicsInternational Institute of Molecular and Cell Biology in Warsaw, Laboratory of Zebrafish Developmental GenomicsAbstract Congenital heart disease (CHD) is a prevalent condition characterized by defective heart development, causing premature death and stillbirths among infants. Genome-wide association studies (GWASs) have provided insights into the role of genetic variants in CHD pathogenesis through the identification of a comprehensive set of single-nucleotide polymorphisms (SNPs). Notably, 90–95% of these variants reside in the noncoding genome, complicating the understanding of their underlying mechanisms. Here, we developed a systematic computational pipeline for the identification and analysis of CHD-associated SNPs spanning both coding and noncoding regions of the genome. Initially, we curated a thorough dataset of SNPs from GWAS-catalog and ClinVar database and filtered them based on CHD-related traits. Subsequently, these CHD-SNPs were annotated and categorized into noncoding and coding regions based on their location. To study the functional implications of noncoding CHD-SNPs, we cross-validated them with enhancer-specific histone modification marks from developing human heart across 9 Carnegie stages and identified potential cardiac enhancers. This approach led to the identification of 2,056 CHD-associated putative enhancers (CHD-enhancers), 38.9% of them overlapping with known enhancers catalogued in human enhancer disease database. We identified heart-related transcription factor binding sites within these CHD-enhancers, offering insights into the impact of SNPs on TF binding. Conservation analysis further revealed that many of these CHD-enhancers were highly conserved across vertebrates, suggesting their evolutionary significance. Utilizing heart-specific expression quantitative trait loci data, we further identified a subset of 63 CHD-SNPs with regulatory potential distributed across various cardiac tissues. Concurrently, coding CHD-SNPs were represented as a protein interaction network and its subsequent binding energy analysis focused on a pair of proteins within this network, pinpointed a deleterious coding CHD-SNP, rs770030288, located in C2 domain of MYBPC3 protein. Overall, our findings demonstrate that SNPs have the potential to disrupt gene regulatory systems, either by affecting enhancer sequences or modulating protein-protein interactions, which can lead to abnormal developmental processes contributing to CHD pathogenesis.https://doi.org/10.1186/s12864-025-11232-6Congenital heart diseaseSingle-nucleotide polymorphismEnhancerProtein-protein interaction networkGene regulatory system
spellingShingle Shikha Vashisht
Costantino Parisi
Cecilia L. Winata
Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory system
BMC Genomics
Congenital heart disease
Single-nucleotide polymorphism
Enhancer
Protein-protein interaction network
Gene regulatory system
title Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory system
title_full Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory system
title_fullStr Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory system
title_full_unstemmed Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory system
title_short Computational analysis of congenital heart disease associated SNPs: unveiling their impact on the gene regulatory system
title_sort computational analysis of congenital heart disease associated snps unveiling their impact on the gene regulatory system
topic Congenital heart disease
Single-nucleotide polymorphism
Enhancer
Protein-protein interaction network
Gene regulatory system
url https://doi.org/10.1186/s12864-025-11232-6
work_keys_str_mv AT shikhavashisht computationalanalysisofcongenitalheartdiseaseassociatedsnpsunveilingtheirimpactonthegeneregulatorysystem
AT costantinoparisi computationalanalysisofcongenitalheartdiseaseassociatedsnpsunveilingtheirimpactonthegeneregulatorysystem
AT cecilialwinata computationalanalysisofcongenitalheartdiseaseassociatedsnpsunveilingtheirimpactonthegeneregulatorysystem