Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate Biomarkers

Objective: This study aimed to classify open-access gene expression data of patients with hepatitis B virus-related hepatocellular carcinoma (HBV + HCC) and chronic HBV without HCC (HBV alone) using the XGBoost method, one of the machine learning methods, and reveal important genes that may cause HC...

Full description

Saved in:
Bibliographic Details
Main Authors: Zeynep KUCUKAKCALI, Sami AKBULUT, Cemil COLAK
Format: Article
Language:English
Published: Galenos Publishing House 2022-09-01
Series:Medeniyet Medical Journal
Subjects:
Online Access:https://jag.journalagent.com/z4/download_fulltext.asp?pdir=medeniyet&un=MEDJ-39049
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832581941408301056
author Zeynep KUCUKAKCALI
Sami AKBULUT
Cemil COLAK
author_facet Zeynep KUCUKAKCALI
Sami AKBULUT
Cemil COLAK
author_sort Zeynep KUCUKAKCALI
collection DOAJ
description Objective: This study aimed to classify open-access gene expression data of patients with hepatitis B virus-related hepatocellular carcinoma (HBV + HCC) and chronic HBV without HCC (HBV alone) using the XGBoost method, one of the machine learning methods, and reveal important genes that may cause HCC. Methods: This case-control study used the open-access gene expression data of patients with HBV + HCC and HBV alone. Data from 17 patients with HBV + HCC and 36 patients with HBV were included. XGBoost was constructed for the classification via 10-fold cross-validation. Accuracy, balanced accuracy, sensitivity, selectivity, positive-predictive value, and negative-predictive value performance metrics were evaluated for model performance. Results: According to the feature-selection method, 18 genes were selected, and modeling was performed with these input variables. Accuracy, balanced accuracy, sensitivity, specificity, positive-predictive value, negative-predictive value, and F1 score obtained from XGBoost model were 98.1%, 98.6%, 100%, 97.2%, 94.4%, 100%, and 97.1%, respectively. Based on the predictor importance findings acquired from XGBoost, the RNF26, FLJ10233, ACBD6, RBM12, PFAS, H3C11, and GKP5 can be employed as potential biomarkers of HBV-related HCC. Conclusions: In this study, genes that may be possible biomarkers of HBV-related HCC were determined using a machine learning-based prediction approach. After the reliability of the obtained genes are clinically verified in subsequent research, therapeutic procedures can be established based on these genes, and their usefulness in clinical practice may be documented.
format Article
id doaj-art-2eaa45b46653474a87b92976b1bd86df
institution Kabale University
issn 2149-2042
2149-4606
language English
publishDate 2022-09-01
publisher Galenos Publishing House
record_format Article
series Medeniyet Medical Journal
spelling doaj-art-2eaa45b46653474a87b92976b1bd86df2025-01-30T07:10:56ZengGalenos Publishing HouseMedeniyet Medical Journal2149-20422149-46062022-09-0137325526310.4274/MMJ.galenos.2022.39049MEDJ-39049Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate BiomarkersZeynep KUCUKAKCALI0Sami AKBULUT1Cemil COLAK2Inonu University Faculty of Medicine, Department of Biostatistics and Medical Informatics, Malatya, TurkeyInonu University Faculty of Medicine, Department of Biostatistics and Medical Informatics, Malatya, Turkey, Inonu University Faculty of Medicine, Department of General Surgery, Malatya, Turkey, Inonu University Faculty of Medicine, Department of Public Health, Malatya, TurkeyInonu University Faculty of Medicine, Department of Biostatistics and Medical Informatics, Malatya, TurkeyObjective: This study aimed to classify open-access gene expression data of patients with hepatitis B virus-related hepatocellular carcinoma (HBV + HCC) and chronic HBV without HCC (HBV alone) using the XGBoost method, one of the machine learning methods, and reveal important genes that may cause HCC. Methods: This case-control study used the open-access gene expression data of patients with HBV + HCC and HBV alone. Data from 17 patients with HBV + HCC and 36 patients with HBV were included. XGBoost was constructed for the classification via 10-fold cross-validation. Accuracy, balanced accuracy, sensitivity, selectivity, positive-predictive value, and negative-predictive value performance metrics were evaluated for model performance. Results: According to the feature-selection method, 18 genes were selected, and modeling was performed with these input variables. Accuracy, balanced accuracy, sensitivity, specificity, positive-predictive value, negative-predictive value, and F1 score obtained from XGBoost model were 98.1%, 98.6%, 100%, 97.2%, 94.4%, 100%, and 97.1%, respectively. Based on the predictor importance findings acquired from XGBoost, the RNF26, FLJ10233, ACBD6, RBM12, PFAS, H3C11, and GKP5 can be employed as potential biomarkers of HBV-related HCC. Conclusions: In this study, genes that may be possible biomarkers of HBV-related HCC were determined using a machine learning-based prediction approach. After the reliability of the obtained genes are clinically verified in subsequent research, therapeutic procedures can be established based on these genes, and their usefulness in clinical practice may be documented.https://jag.journalagent.com/z4/download_fulltext.asp?pdir=medeniyet&un=MEDJ-39049hepatocellular carcinomahepatitis b infectionchronic liver diseasegene expression
spellingShingle Zeynep KUCUKAKCALI
Sami AKBULUT
Cemil COLAK
Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate Biomarkers
Medeniyet Medical Journal
hepatocellular carcinoma
hepatitis b infection
chronic liver disease
gene expression
title Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate Biomarkers
title_full Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate Biomarkers
title_fullStr Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate Biomarkers
title_full_unstemmed Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate Biomarkers
title_short Machine Learning-based Prediction of HBV-related Hepatocellular Carcinoma and Detection of Key Candidate Biomarkers
title_sort machine learning based prediction of hbv related hepatocellular carcinoma and detection of key candidate biomarkers
topic hepatocellular carcinoma
hepatitis b infection
chronic liver disease
gene expression
url https://jag.journalagent.com/z4/download_fulltext.asp?pdir=medeniyet&un=MEDJ-39049
work_keys_str_mv AT zeynepkucukakcali machinelearningbasedpredictionofhbvrelatedhepatocellularcarcinomaanddetectionofkeycandidatebiomarkers
AT samiakbulut machinelearningbasedpredictionofhbvrelatedhepatocellularcarcinomaanddetectionofkeycandidatebiomarkers
AT cemilcolak machinelearningbasedpredictionofhbvrelatedhepatocellularcarcinomaanddetectionofkeycandidatebiomarkers