AI Machine Learning–Based Diabetes Prediction in Older Adults in South Korea: Cross-Sectional Analysis

Bibliographic Details
Main Authors: Hocheol Lee, Myung-Bae Park, Young-Joo Won
Format: Article
Language: English
Published: JMIR Publications 2025-01-01
Series: JMIR Formative Research
Online Access: https://formative.jmir.org/2025/1/e57874
Description
Summary: Abstract
Background: Diabetes is prevalent in older adults, and machine learning algorithms could help predict diabetes in this population.
Objective: This study determined diabetes risk factors among older adults aged ≥60 years using machine learning algorithms and selected an optimized prediction model.
Methods: This cross-sectional study was conducted on 3084 older adults aged ≥60 years in Seoul from January to November 2023. Data were collected using a mobile app (Gosufit) that measured depression, stress, anxiety, basal metabolic rate, oxygen saturation, heart rate, and average daily step count. Health coordinators recorded data on diabetes, hypertension, hyperlipidemia, chronic obstructive pulmonary disease, percent body fat, and percent muscle. The presence of diabetes was the target variable, with various health indicators as predictors. Machine learning algorithms, including random forest, gradient boosting model, light gradient boosting model, extreme gradient boosting model, and k-nearest neighbors, were employed for analysis. The dataset was split into 70% training and 30% testing sets. Model performance was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). Shapley additive explanations (SHAP) were used for model interpretability.
Results: Significant predictors of diabetes included hypertension (χ²₁ […], P […]) […].
Conclusions: This study focused on modifiable risk factors, providing crucial data for establishing a system for the automated collection of health information and lifelog data from older adults using digital devices at service facilities.
ISSN: 2561-326X
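
The evaluation step named in the summary (a 70%/30% split, then accuracy, precision, recall, F1 score, and AUC) can be illustrated with a minimal sketch. This is not the authors' code: the helper names (`split_70_30`, `classification_metrics`, `auc`), the rounding rule for the split, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration, and the sketch uses only the Python standard library rather than the machine learning libraries the study would have used.

```python
import random


def split_70_30(rows, seed=0):
    """Shuffle a copy of rows and return (train, test) in a 70/30 ratio.

    Assumption: the cut point is rounded to the nearest integer; the
    record does not say how the study handled rounding or stratification.
    """
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    cut = int(round(0.7 * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F1 for binary labels (1 = diabetes)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


def auc(y_true, scores):
    """Area under the ROC curve, computed as the fraction of
    (positive, negative) pairs in which the positive case receives the
    higher score; ties count as half a pair."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (hypothetical data, not taken from the study).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1, 0.6]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

metrics = classification_metrics(y_true, y_pred)
metrics["auc"] = auc(y_true, scores)

# Split placeholder row indices for a sample the size of the study's (3084).
train, test = split_70_30(range(3084))
```

Accuracy, precision, recall, and F1 depend on the chosen threshold, whereas AUC is computed from the raw scores, which is one reason studies of this kind usually report it alongside the threshold-based metrics.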