A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning Perspective

The application of Machine Learning for predictive analysis in healthcare, particularly for diseases like diabetes, has proven highly beneficial. This study introduces an optimized Light Gradient-Boosting Machine (Light GBM) and K-Nearest Neighbour (KNN) based ensemble algorithm for predicting diabe...

Full description

Saved in:
Bibliographic Details
Main Authors: V. K. Daliya, T. K. Ramesh
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10836739/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590362943684608
author V. K. Daliya
T. K. Ramesh
author_facet V. K. Daliya
T. K. Ramesh
author_sort V. K. Daliya
collection DOAJ
description The application of Machine Learning for predictive analysis in healthcare, particularly for diseases like diabetes, has proven highly beneficial. This study introduces an optimized Light Gradient-Boosting Machine (Light GBM) and K-Nearest Neighbour (KNN) based ensemble algorithm for predicting diabetic progression of Type 2 Diabetes, classifying it as high or low risk, using patient health parameters and serum measurements. Our model uses LightGBM, a rapid and efficient gradient boosting framework, coupled with KNN, which uses proximity to classify data points. The proposed model uses various optimization techniques, such as 10 fold cross validation, grid search method etc. to get the best results out of the ensemble model. As the model combines optimized version of LightGBM and KNN through a voting classifier which uses soft voting technique to find the final class, it utilizes the predictive capabilities of both the methods in an effective manner. The experiment is performed and implemented in Microsoft’s Azure cloud, using Azure Machine Learning service, that leverages the advantages of cloud computing with respect to scalability, security and its potential integration possibilities into IoT-based smart healthcare systems.This aspect highlights its versatility and impact with respect to remote monitoring of patients as well. The ensemble achieves an 83.2% Area Under the Curve (AUC) of Receiver Operating Characteristics (ROC) score, indicating good classification efficiency. It produced 75% accuracy as well. The proposed model is compared with other classification and ensemble models, showcasing its superiority against other models.The ensemble is also tested with some meta heuristic optimization methods, which produced comparable scores. The method’s effectiveness is validated against another risk prediction dataset, proving its reliability. The model’s accurate predictions can aid individuals in understanding disease progression risks and guide medical professionals in intervention strategies.
format Article
id doaj-art-df87accb7a394748aa2dfde822937cf7
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-df87accb7a394748aa2dfde822937cf72025-01-24T00:01:35ZengIEEEIEEE Access2169-35362025-01-0113115601157510.1109/ACCESS.2025.352803310836739A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning PerspectiveV. K. Daliya0https://orcid.org/0000-0002-4508-2922T. K. Ramesh1https://orcid.org/0000-0002-6259-5172Department of Electronics and Communication Engineering, Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Bengaluru, IndiaDepartment of Electronics and Communication Engineering, Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Bengaluru, IndiaThe application of Machine Learning for predictive analysis in healthcare, particularly for diseases like diabetes, has proven highly beneficial. This study introduces an optimized Light Gradient-Boosting Machine (Light GBM) and K-Nearest Neighbour (KNN) based ensemble algorithm for predicting diabetic progression of Type 2 Diabetes, classifying it as high or low risk, using patient health parameters and serum measurements. Our model uses LightGBM, a rapid and efficient gradient boosting framework, coupled with KNN, which uses proximity to classify data points. The proposed model uses various optimization techniques, such as 10 fold cross validation, grid search method etc. to get the best results out of the ensemble model. As the model combines optimized version of LightGBM and KNN through a voting classifier which uses soft voting technique to find the final class, it utilizes the predictive capabilities of both the methods in an effective manner. The experiment is performed and implemented in Microsoft’s Azure cloud, using Azure Machine Learning service, that leverages the advantages of cloud computing with respect to scalability, security and its potential integration possibilities into IoT-based smart healthcare systems.This aspect highlights its versatility and impact with respect to remote monitoring of patients as well. The ensemble achieves an 83.2% Area Under the Curve (AUC) of Receiver Operating Characteristics (ROC) score, indicating good classification efficiency. It produced 75% accuracy as well. The proposed model is compared with other classification and ensemble models, showcasing its superiority against other models.The ensemble is also tested with some meta heuristic optimization methods, which produced comparable scores. The method’s effectiveness is validated against another risk prediction dataset, proving its reliability. The model’s accurate predictions can aid individuals in understanding disease progression risks and guide medical professionals in intervention strategies.https://ieeexplore.ieee.org/document/10836739/Diabetic predictionensemble learningKNNLightGBMMachine learningvoting classifier
spellingShingle V. K. Daliya
T. K. Ramesh
A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning Perspective
IEEE Access
Diabetic prediction
ensemble learning
KNN
LightGBM
Machine learning
voting classifier
title A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning Perspective
title_full A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning Perspective
title_fullStr A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning Perspective
title_full_unstemmed A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning Perspective
title_short A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression—An Azure Machine Learning Perspective
title_sort cloud based optimized ensemble model for risk prediction of diabetic progression x2014 an azure machine learning perspective
topic Diabetic prediction
ensemble learning
KNN
LightGBM
Machine learning
voting classifier
url https://ieeexplore.ieee.org/document/10836739/
work_keys_str_mv AT vkdaliya acloudbasedoptimizedensemblemodelforriskpredictionofdiabeticprogressionx2014anazuremachinelearningperspective
AT tkramesh acloudbasedoptimizedensemblemodelforriskpredictionofdiabeticprogressionx2014anazuremachinelearningperspective
AT vkdaliya cloudbasedoptimizedensemblemodelforriskpredictionofdiabeticprogressionx2014anazuremachinelearningperspective
AT tkramesh cloudbasedoptimizedensemblemodelforriskpredictionofdiabeticprogressionx2014anazuremachinelearningperspective