Comparing hyperparameter optimized support vector machine, multi-layer perceptron and bagging classifiers for diabetes mellitus prediction

Nuzhat Ahmad Yatoo, Ishok Sathik Ali, Imran Mirza

Abstract


Diabetes Mellitus (DM) is a chronic metabolic disorder that affects the way body processes blood glucose levels. Within the medical field, Machine Learning (ML) has significant potential for accurately forecasting and diagnosing a range of chronic conditions. If an accurate prognosis is achieved early, the risk to health and intensity of DM can be significantly mitigated. In this study, a robust methodology for DM prognosis was proposed, which included anomaly replacement, data normalization, feature extraction, and K-fold cross-validation. Three machine learning methods, Support Vector Machine, Multilayer Perceptron and Bagging, were employed to predict Diabetes Mellitus using the National Health and Nutritional Examination Survey (NHANES) 2011-2012 dataset. Accuracy, AUC and Recall were chosen as the evaluation metrics and subsequently optimized during hyperparameter tweaking. From all the comprehensive tests, Bagging outperformed the other two models with an Accuracy of 96.67, AUC score of 99.2 and Recall of 97.0. The proposed methodology surpasses other approaches for forecasting DM.

Keywords


Bagging; Classification; Diabetes mellitus; Machine learning; Multi-layer perceptron; Support vector machine; XGBoost

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v14i5.pp5834-5847

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).