Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 Infection

ABSTRACT Background Long COVID, a heterogeneous condition characterized by a range of physical and neuropsychiatric presentations, can be presented with a proportion of COVID‐19‐infected individuals. Methods Transcriptomic data sets of those within gene expression profiles of COVID‐19, long COVID, a...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhiyong Hou, Yu Ming, Jun Liu, Zhong Wang
Format: Article
Language:English
Published: Wiley 2025-01-01
Series:Immunity, Inflammation and Disease
Subjects:
Online Access:https://doi.org/10.1002/iid3.70137
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832087006805491712
author Zhiyong Hou
Yu Ming
Jun Liu
Zhong Wang
author_facet Zhiyong Hou
Yu Ming
Jun Liu
Zhong Wang
author_sort Zhiyong Hou
collection DOAJ
description ABSTRACT Background Long COVID, a heterogeneous condition characterized by a range of physical and neuropsychiatric presentations, can be presented with a proportion of COVID‐19‐infected individuals. Methods Transcriptomic data sets of those within gene expression profiles of COVID‐19, long COVID, and healthy controls were retrieved from the GEO database. Differentially expressed genes (DEGs) falling under COVID‐19 and long COVID were identified with R packages, and contemporaneously conducted module detection was performed with the Modular Pharmacology Platform (http://112.86.129.72:48081/). The integration of both DEGs and differentially expressed module‐genes (DEMGs) regarding long COVID and COVID‐19 was intersected by following Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Set Enrichment Analysis (GSEA). Results There were 11 and 62 differentially expressed modules, 1837 and 179 DEGs, as well as 103 and 508 DEMGs acquiring identified for both COVID‐19 and long COVID, notably enriched in the immune‐correlated signaling pathways. The immune infiltrating cells of long COVID and COVID‐19 were comparatively and respectively assessed via CIBERSORT, ssGSEA, and xCell algorithms. Subsequently, the screening of hub genes involved employing the SVM‐RFE, RF, XGBoost algorithms, and logistic regression analysis. Among the 67 candidate genes were processed with machine learning algorithms and logistic regression, a subgroup consisting of CEP55, CDCA2, MELK, and DEPDC1B, was at last identified as potential biomarkers for predicting the risk of the progression into long COVID after COVID‐19 infections. The predicting performance of the potential biomarkers was quantified with a ROC value of 0.8762542, which proved the combination of potential biomarkers provided the highest performance. Conclusions In summary, we identified a subgroup of potential biomarkers for predicting the risk of the progression into long COVID after COVID‐19 infection, which could be partly elucidation of the associated molecular mechanisms for long COVID.
format Article
id doaj-art-2c4d16785aa14d79ba66c50a3d73067b
institution Kabale University
issn 2050-4527
language English
publishDate 2025-01-01
publisher Wiley
record_format Article
series Immunity, Inflammation and Disease
spelling doaj-art-2c4d16785aa14d79ba66c50a3d73067b2025-02-06T07:50:38ZengWileyImmunity, Inflammation and Disease2050-45272025-01-01131n/an/a10.1002/iid3.70137Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 InfectionZhiyong Hou0Yu Ming1Jun Liu2Zhong Wang3Institute of Basic Research in Clinical Medicine China Academy of Chinese Medical Sciences Beijing ChinaInstitute of Basic Research in Clinical Medicine China Academy of Chinese Medical Sciences Beijing ChinaInstitute of Basic Research in Clinical Medicine China Academy of Chinese Medical Sciences Beijing ChinaInstitute of Basic Research in Clinical Medicine China Academy of Chinese Medical Sciences Beijing ChinaABSTRACT Background Long COVID, a heterogeneous condition characterized by a range of physical and neuropsychiatric presentations, can be presented with a proportion of COVID‐19‐infected individuals. Methods Transcriptomic data sets of those within gene expression profiles of COVID‐19, long COVID, and healthy controls were retrieved from the GEO database. Differentially expressed genes (DEGs) falling under COVID‐19 and long COVID were identified with R packages, and contemporaneously conducted module detection was performed with the Modular Pharmacology Platform (http://112.86.129.72:48081/). The integration of both DEGs and differentially expressed module‐genes (DEMGs) regarding long COVID and COVID‐19 was intersected by following Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Set Enrichment Analysis (GSEA). Results There were 11 and 62 differentially expressed modules, 1837 and 179 DEGs, as well as 103 and 508 DEMGs acquiring identified for both COVID‐19 and long COVID, notably enriched in the immune‐correlated signaling pathways. The immune infiltrating cells of long COVID and COVID‐19 were comparatively and respectively assessed via CIBERSORT, ssGSEA, and xCell algorithms. Subsequently, the screening of hub genes involved employing the SVM‐RFE, RF, XGBoost algorithms, and logistic regression analysis. Among the 67 candidate genes were processed with machine learning algorithms and logistic regression, a subgroup consisting of CEP55, CDCA2, MELK, and DEPDC1B, was at last identified as potential biomarkers for predicting the risk of the progression into long COVID after COVID‐19 infections. The predicting performance of the potential biomarkers was quantified with a ROC value of 0.8762542, which proved the combination of potential biomarkers provided the highest performance. Conclusions In summary, we identified a subgroup of potential biomarkers for predicting the risk of the progression into long COVID after COVID‐19 infection, which could be partly elucidation of the associated molecular mechanisms for long COVID.https://doi.org/10.1002/iid3.70137COVID‐19immune cell infiltrationlong COVIDmachine learning algorithmsmodular pharmacology platform
spellingShingle Zhiyong Hou
Yu Ming
Jun Liu
Zhong Wang
Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 Infection
Immunity, Inflammation and Disease
COVID‐19
immune cell infiltration
long COVID
machine learning algorithms
modular pharmacology platform
title Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 Infection
title_full Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 Infection
title_fullStr Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 Infection
title_full_unstemmed Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 Infection
title_short Potential Biomarkers for Predicting the Risk of Developing Into Long COVID After COVID‐19 Infection
title_sort potential biomarkers for predicting the risk of developing into long covid after covid 19 infection
topic COVID‐19
immune cell infiltration
long COVID
machine learning algorithms
modular pharmacology platform
url https://doi.org/10.1002/iid3.70137
work_keys_str_mv AT zhiyonghou potentialbiomarkersforpredictingtheriskofdevelopingintolongcovidaftercovid19infection
AT yuming potentialbiomarkersforpredictingtheriskofdevelopingintolongcovidaftercovid19infection
AT junliu potentialbiomarkersforpredictingtheriskofdevelopingintolongcovidaftercovid19infection
AT zhongwang potentialbiomarkersforpredictingtheriskofdevelopingintolongcovidaftercovid19infection