Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER Database

Objectives. We aimed to establish an effective machine learning (ML) model for predicting the risk of distant metastasis (DM) in medullary thyroid carcinoma (MTC). Methods. Demographic data of MTC patients were extracted from the Surveillance, Epidemiology, and End Results (SEER) database of the Nat...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhen-Tian Guo, Kun Tian, Xi-Yuan Xie, Yu-Hang Zhang, De-Bao Fang
Format: Article
Language:English
Published: Wiley 2023-01-01
Series:International Journal of Endocrinology
Online Access:http://dx.doi.org/10.1155/2023/9965578
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832547245173506048
author Zhen-Tian Guo
Kun Tian
Xi-Yuan Xie
Yu-Hang Zhang
De-Bao Fang
author_facet Zhen-Tian Guo
Kun Tian
Xi-Yuan Xie
Yu-Hang Zhang
De-Bao Fang
author_sort Zhen-Tian Guo
collection DOAJ
description Objectives. We aimed to establish an effective machine learning (ML) model for predicting the risk of distant metastasis (DM) in medullary thyroid carcinoma (MTC). Methods. Demographic data of MTC patients were extracted from the Surveillance, Epidemiology, and End Results (SEER) database of the National Institutes of Health between 2004 and 2015 to develop six ML algorithm models. Models were evaluated based on accuracy, precision, recall rate, F1-score, and area under the receiver operating characteristic curve (AUC). The association between clinicopathological characteristics and target variables was interpreted. Analyses were performed using traditional logistic regression (LR). Results. In total, 2049 patients were included and 138 developed DM. Multivariable LR showed that age, sex, tumor size, extrathyroidal extension, and lymph node metastasis were predictive features for DM in MTC. Among the six ML models, the random forest (RF) had the best predictability in assessing the risk of DM in MTC, with an accuracy, precision, recall rate, F1-score, and AUC higher than those of the traditional binary LR model. Conclusion. RF was superior to traditional LR in predicting the risk of DM in MTC and can provide a valuable reference for clinicians in decision-making.
format Article
id doaj-art-a808a78939334b789a8ffa2cf60a7320
institution Kabale University
issn 1687-8345
language English
publishDate 2023-01-01
publisher Wiley
record_format Article
series International Journal of Endocrinology
spelling doaj-art-a808a78939334b789a8ffa2cf60a73202025-02-03T06:45:38ZengWileyInternational Journal of Endocrinology1687-83452023-01-01202310.1155/2023/9965578Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER DatabaseZhen-Tian Guo0Kun Tian1Xi-Yuan Xie2Yu-Hang Zhang3De-Bao Fang4Department of General SurgeryDepartment of General SurgeryFujian Provincial HospitalMudanjiang Medical UniversityHefei National Laboratory for Physical Sciences at MicroscaleObjectives. We aimed to establish an effective machine learning (ML) model for predicting the risk of distant metastasis (DM) in medullary thyroid carcinoma (MTC). Methods. Demographic data of MTC patients were extracted from the Surveillance, Epidemiology, and End Results (SEER) database of the National Institutes of Health between 2004 and 2015 to develop six ML algorithm models. Models were evaluated based on accuracy, precision, recall rate, F1-score, and area under the receiver operating characteristic curve (AUC). The association between clinicopathological characteristics and target variables was interpreted. Analyses were performed using traditional logistic regression (LR). Results. In total, 2049 patients were included and 138 developed DM. Multivariable LR showed that age, sex, tumor size, extrathyroidal extension, and lymph node metastasis were predictive features for DM in MTC. Among the six ML models, the random forest (RF) had the best predictability in assessing the risk of DM in MTC, with an accuracy, precision, recall rate, F1-score, and AUC higher than those of the traditional binary LR model. Conclusion. RF was superior to traditional LR in predicting the risk of DM in MTC and can provide a valuable reference for clinicians in decision-making.http://dx.doi.org/10.1155/2023/9965578
spellingShingle Zhen-Tian Guo
Kun Tian
Xi-Yuan Xie
Yu-Hang Zhang
De-Bao Fang
Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER Database
International Journal of Endocrinology
title Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER Database
title_full Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER Database
title_fullStr Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER Database
title_full_unstemmed Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER Database
title_short Machine Learning for Predicting Distant Metastasis of Medullary Thyroid Carcinoma Using the SEER Database
title_sort machine learning for predicting distant metastasis of medullary thyroid carcinoma using the seer database
url http://dx.doi.org/10.1155/2023/9965578
work_keys_str_mv AT zhentianguo machinelearningforpredictingdistantmetastasisofmedullarythyroidcarcinomausingtheseerdatabase
AT kuntian machinelearningforpredictingdistantmetastasisofmedullarythyroidcarcinomausingtheseerdatabase
AT xiyuanxie machinelearningforpredictingdistantmetastasisofmedullarythyroidcarcinomausingtheseerdatabase
AT yuhangzhang machinelearningforpredictingdistantmetastasisofmedullarythyroidcarcinomausingtheseerdatabase
AT debaofang machinelearningforpredictingdistantmetastasisofmedullarythyroidcarcinomausingtheseerdatabase