A Multi-Modal Machine Learning Framework for Type 2 Diabetes Risk Stratification and Progression Time Prediction
DOI:
https://doi.org/10.62051/8yj4ng97Keywords:
Warwick Evans; Publishing; These keywords will also be used by the publisher to produce a keyword index.Abstract
The growing global burden of Type 2 Diabetes (T2D) urgently requires advanced predictive tools that go beyond traditional binary risk classification. This study presents an innovative machine learning framework that integrates multimodal data, including clinical parameters, lifestyle factors, Nuclear Magnetic Resonance (NMR) metabolomics data, and 12-lead Electrocardiogram (ECG) signals, to simultaneously predict T2D risk and disease progression time. This study employs a two-stage prediction framework: an AdaBoost classifier optimized by Optuna is used for risk stratification (AUROC=0.9468, F1=0.8857), combined with an Elastic Net regression model for progression time estimation (R²=0.7649, MSE=15.1535). This framework achieves superior predictive performance compared to single-source models by synergistically integrating complementary data modalities, guides clinical intervention timing through quantitative estimation of individualized progression time, and enhances model interpretability through rigorous feature importance analysis, demonstrating key advantages. This method, which combines high accuracy with clinical utility, fills a critical gap in current T2D prediction methodologies and provides a practical tool for personalized prevention strategies and optimized healthcare resource allocation.
Downloads
References
[1] Ahmad E, Lim S, Lamptey R, et al. Type 2 diabetes [J]. The Lancet, 2022, 400 (10365): 1803 - 1820.
[2] Deberneh H M, Kim I. Prediction of type 2 diabetes based on machine learning algorithm [J]. International journal of environmental research and public health, 2021, 18 (6): 3317.
[3] Joshi R D, Dhakal C K. Predicting type 2 diabetes using logistic regression and machine learning approaches [J]. International journal of environmental research and public health, 2021, 18 (14): 7346.
[4] Fazakis N, Kocsis O, Dritsas E, et al. Machine learning tools for long-term type 2 diabetes risk prediction [J]. ieee Access, 2021, 9: 103737 - 103757.
[5] Kautzky-Willer A, Leutner M, Harreiter J. Sex differences in type 2 diabetes [J]. Diabetology, 2023, 66 (6): 986 - 1002.
[6] Fregoso-Aparicio L, Noguez J, Montesinos L, et al. Machine learning and deep learning predictive models for type 2 diabetes: a systematic review [J]. Diabetology & metabolic syndrome, 2021, 13 (1): 148.
[7] Kopitar L, Kocbek P, Cilar L, et al. Early detection of type 2 diabetes mellitus using machine learning-based prediction models [J]. Scientific reports, 2020, 10 (1): 11981.
[8] Hasan M K, Alam M A, Das D, et al. Diabetes prediction using ensembling of different machine learning classifiers [J]. IEEE Access, 2020, 8: 76516 - 76531.
[9] Rastogi R, Bansal M. Diabetes prediction model using data mining techniques [J]. Measurement: Sensors, 2023, 25: 100605.
[10] Tasin I, Nabil T U, Islam S, et al. Diabetes prediction using machine learning and explainable AI techniques [J]. Healthcare technology letters, 2023, 10 (1-2): 1 - 10.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Transactions on Environment, Energy and Earth Sciences

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.









