Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disorders

Abstract Tandem nucleotide repeat (TNR) expansions, particularly the CNG nucleotide configuration, are associated with a variety of neurodegenerative disorders. In this study, we aimed to identify novel unstable CNG repeat loci associated with the neurogenetic disorder spinocerebellar ataxia (SCA)....

Full description

Saved in:
Bibliographic Details
Main Authors: Varun Suroliya, Bharathram Uppili, Manish Kumar, Vineet Jha, Achal K. Srivastava, Mohammed Faruq
Format: Article
Language:English
Published: Nature Publishing Group 2024-06-01
Series:Human Genome Variation
Online Access:https://doi.org/10.1038/s41439-024-00281-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832594913790787584
author Varun Suroliya
Bharathram Uppili
Manish Kumar
Vineet Jha
Achal K. Srivastava
Mohammed Faruq
author_facet Varun Suroliya
Bharathram Uppili
Manish Kumar
Vineet Jha
Achal K. Srivastava
Mohammed Faruq
author_sort Varun Suroliya
collection DOAJ
description Abstract Tandem nucleotide repeat (TNR) expansions, particularly the CNG nucleotide configuration, are associated with a variety of neurodegenerative disorders. In this study, we aimed to identify novel unstable CNG repeat loci associated with the neurogenetic disorder spinocerebellar ataxia (SCA). Using a computational approach, 15,069 CNG repeat loci in the coding and noncoding regions of the human genome were identified. Based on the feature selection criteria (repeat length >10 and functional location of repeats), we selected 52 repeats for further analysis and evaluated the repeat length variability in 100 control subjects. A subset of 19 CNG loci observed to be highly variable in control subjects was selected for subsequent analysis in 100 individuals with SCA. The genes with these highly variable repeats also exhibited higher gene expression levels in the brain according to the tissue expression dataset (GTEx). No pathogenic expansion events were identified in patient samples, which is a limitation given the size of the patient group examined; however, these loci contain potential risk alleles for expandability. Recent studies have implicated GLS, RAI1, GIPC1, MED15, EP400, MEF2A, and CNKSR2 in neurological diseases, with GLS, GIPC1, MED15, RAI1, and MEF2A sharing the same repeat loci reported in this study. This finding validates the approach of evaluating repeat loci in different populations and their possible implications for human pathologies.
format Article
id doaj-art-85352b10f56c4571aca6372d342666bb
institution Kabale University
issn 2054-345X
language English
publishDate 2024-06-01
publisher Nature Publishing Group
record_format Article
series Human Genome Variation
spelling doaj-art-85352b10f56c4571aca6372d342666bb2025-01-19T12:15:51ZengNature Publishing GroupHuman Genome Variation2054-345X2024-06-011111810.1038/s41439-024-00281-0Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disordersVarun Suroliya0Bharathram Uppili1Manish Kumar2Vineet Jha3Achal K. Srivastava4Mohammed Faruq5Department of Neurology, All India Institute of Medical SciencesGenomics and Molecular Medicine, CSIR-Institute of Genomics and Integrative BiologyGenomics and Molecular Medicine, CSIR-Institute of Genomics and Integrative BiologyPersistent LABS, Persistent Systems Ltd.Department of Neurology, All India Institute of Medical SciencesGenomics and Molecular Medicine, CSIR-Institute of Genomics and Integrative BiologyAbstract Tandem nucleotide repeat (TNR) expansions, particularly the CNG nucleotide configuration, are associated with a variety of neurodegenerative disorders. In this study, we aimed to identify novel unstable CNG repeat loci associated with the neurogenetic disorder spinocerebellar ataxia (SCA). Using a computational approach, 15,069 CNG repeat loci in the coding and noncoding regions of the human genome were identified. Based on the feature selection criteria (repeat length >10 and functional location of repeats), we selected 52 repeats for further analysis and evaluated the repeat length variability in 100 control subjects. A subset of 19 CNG loci observed to be highly variable in control subjects was selected for subsequent analysis in 100 individuals with SCA. The genes with these highly variable repeats also exhibited higher gene expression levels in the brain according to the tissue expression dataset (GTEx). No pathogenic expansion events were identified in patient samples, which is a limitation given the size of the patient group examined; however, these loci contain potential risk alleles for expandability. Recent studies have implicated GLS, RAI1, GIPC1, MED15, EP400, MEF2A, and CNKSR2 in neurological diseases, with GLS, GIPC1, MED15, RAI1, and MEF2A sharing the same repeat loci reported in this study. This finding validates the approach of evaluating repeat loci in different populations and their possible implications for human pathologies.https://doi.org/10.1038/s41439-024-00281-0
spellingShingle Varun Suroliya
Bharathram Uppili
Manish Kumar
Vineet Jha
Achal K. Srivastava
Mohammed Faruq
Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disorders
Human Genome Variation
title Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disorders
title_full Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disorders
title_fullStr Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disorders
title_full_unstemmed Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disorders
title_short Identifying unstable CNG repeat loci in the human genome: a heuristic approach and implications for neurological disorders
title_sort identifying unstable cng repeat loci in the human genome a heuristic approach and implications for neurological disorders
url https://doi.org/10.1038/s41439-024-00281-0
work_keys_str_mv AT varunsuroliya identifyingunstablecngrepeatlociinthehumangenomeaheuristicapproachandimplicationsforneurologicaldisorders
AT bharathramuppili identifyingunstablecngrepeatlociinthehumangenomeaheuristicapproachandimplicationsforneurologicaldisorders
AT manishkumar identifyingunstablecngrepeatlociinthehumangenomeaheuristicapproachandimplicationsforneurologicaldisorders
AT vineetjha identifyingunstablecngrepeatlociinthehumangenomeaheuristicapproachandimplicationsforneurologicaldisorders
AT achalksrivastava identifyingunstablecngrepeatlociinthehumangenomeaheuristicapproachandimplicationsforneurologicaldisorders
AT mohammedfaruq identifyingunstablecngrepeatlociinthehumangenomeaheuristicapproachandimplicationsforneurologicaldisorders