TY - JOUR
T1 - A novel intelligent system for detection of type 2 diabetes with modified loss function and regularization
AU - Mallika, G. C.
AU - Alsadoon, Abeer
AU - Pham, D. T. H.
AU - Mai, H. T.
AU - Abdullah, S.
AU - Prasad, P. W. C.
AU - Nguyen, T. Q. V.
PY - 2021
Y1 - 2021
N2 - Abstract: Type 2 Diabetes (T2DM) makes up about 90% of diabetes cases, as well as tough restriction on continuous monitoring and detecting become one of key aspects in T2DM. This research aims to develop an ensemble of several machine learning and deep learning models for early detection of T2DM with high accuracy. With high diversity of models, the ensemble provides more excessive performance than single models. Methodology. The proposed system is modified enhanced ensemble of machine learning models for T2DM prediction. It is composed of Logistic Regression, Random Forest, SVM, and Deep Neural Network models to generate a modified ensemble model. Results. The output of each model in the modified ensemble is used to figure out the final output of the system. The datasets being used for these models include Practice Fusion HER, Pima Indians diabetic’s data, UCI AIM94 Dataset and CA Diabetes Prevalence 2014. In comparison to the previous solutions, the proposed ensemble model solution exposes the effectiveness of accuracy, sensitivity, and specificity. It provides an accuracy of 87.5 from 83.51% in average, sensitivity of 35.8 from 29.59% as well as specificity of 98.9 from 96.27%. The processing time of the proposed model solution with 96.6 ms is faster than the state-of-the-art with 97.5 ms. Conclusion. The proposed modified enhanced system in this work improves the overall prediction capability of T2DM using an ensemble of several machine learning and deep learning models. A majority voting scheme utilizes the output from several models to make the final accurate prediction. Regularization function in this work is modified in order to include the regularization of all the models in ensemble, that helps prevent the overfitting and encourages the generalization capacity of the proposed system.
AB - Abstract: Type 2 Diabetes (T2DM) makes up about 90% of diabetes cases, as well as tough restriction on continuous monitoring and detecting become one of key aspects in T2DM. This research aims to develop an ensemble of several machine learning and deep learning models for early detection of T2DM with high accuracy. With high diversity of models, the ensemble provides more excessive performance than single models. Methodology. The proposed system is modified enhanced ensemble of machine learning models for T2DM prediction. It is composed of Logistic Regression, Random Forest, SVM, and Deep Neural Network models to generate a modified ensemble model. Results. The output of each model in the modified ensemble is used to figure out the final output of the system. The datasets being used for these models include Practice Fusion HER, Pima Indians diabetic’s data, UCI AIM94 Dataset and CA Diabetes Prevalence 2014. In comparison to the previous solutions, the proposed ensemble model solution exposes the effectiveness of accuracy, sensitivity, and specificity. It provides an accuracy of 87.5 from 83.51% in average, sensitivity of 35.8 from 29.59% as well as specificity of 98.9 from 96.27%. The processing time of the proposed model solution with 96.6 ms is faster than the state-of-the-art with 97.5 ms. Conclusion. The proposed modified enhanced system in this work improves the overall prediction capability of T2DM using an ensemble of several machine learning and deep learning models. A majority voting scheme utilizes the output from several models to make the final accurate prediction. Regularization function in this work is modified in order to include the regularization of all the models in ensemble, that helps prevent the overfitting and encourages the generalization capacity of the proposed system.
UR - https://hdl.handle.net/1959.7/uws:77679
U2 - 10.1134/S0361768821050054
DO - 10.1134/S0361768821050054
M3 - Article
SN - 0361-7688
VL - 47
SP - 402
EP - 414
JO - Programming and Computer Software
JF - Programming and Computer Software
IS - 5
ER -