Enhancing diabetes prediction through probability-based correction: a methodological approach

Aitouhanni Imane, Berqia Amine

Abstract


Predictive healthcare analytics demands accurate predictions from interpretable models for early diagnosis and intervention on diabetes prognosis, which remains a well-established challenge. This study presents a new probability-based correction method to enhance the performance of a model in diabetes prediction. Initial model comparisons are performed using the PyCaret framework to identify the baseline model. Logistic regression was selected due to its simplicity, interpretability, and its higher accuracy, which outperformed other models. To further facilitate future research in this field, this study was conducted using a noisy dataset without any changes or preprocessing steps other than those available in the dataset from the producer. This intentional decision meant that the new probability-based method could be evaluated in isolation without any additional modifications being applied. The proposed correction method adjusts predictions into borderline probability intervals to obtain more accurate classifications. This approach increased the model accuracy by 6% from 75% to 81%, thus proving successful in resolving the misclassification problem with higher risk. This approach outperforms state-of-the-art methods and demonstrates its generalizability in enhancing the certainty of downstream clinical decisions.

Keywords


Diabetes prediction; Enhancement; Healthcare; Machine learning; Probability correction

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v15i5.pp4933-4941

Copyright (c) 2025 Aitouhanni Imane, Berqia Amine

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by theĀ Institute of Advanced Engineering and Science (IAES).