TY - JOUR
T1 - Supervised machine learning for early predicting the sepsis patient : modified mean imputation and modified chi-square feature selection
AU - Shrestha, Ujjwol
AU - Alsadoon, Abeer
AU - Prasad, P. W. C.
AU - Al Aloussi, Sarmad
AU - Alsadoon, Omar Hisham
PY - 2021
Y1 - 2021
N2 - Sepsis is a common and serious medical emergency in hospitals worldwide. A novel and practical tool for identifying sepsis remains elusive. Supervised models can identify potential clinical factors and give more accurate predictions than the existing rule-based benchmark tools. This research aims to increase the sensitivity of sepsis prediction. The proposed system uses mean imputation to replace missing feature values and the chi-square technique for feature selection. All datasets are fed into the chi-square technique, which selects features by measuring how expected values compare with the observed data. The essential missing data are then replaced using mean imputation, that is, with the mean of the available values. Finally, the selected features are used as input to a supervised machine learning model that classifies sepsis patients. Accuracy and processing time are measured on different datasets. The results show that the proposed solution achieves better classification performance in different data scenarios and different review types. The proposed solution provides an average classification accuracy of 97.67%, against 91.12% for the current solution. It also provides an average processing time of 29.1 milliseconds, against 32.8 milliseconds for the current solution. The proposed system focuses on the feature selection process in the machine learning model. Finally, this study solves the issue of model overfitting with supervised machine learning.
AB - Sepsis is a common and serious medical emergency in hospitals worldwide. A novel and practical tool for identifying sepsis remains elusive. Supervised models can identify potential clinical factors and give more accurate predictions than the existing rule-based benchmark tools. This research aims to increase the sensitivity of sepsis prediction. The proposed system uses mean imputation to replace missing feature values and the chi-square technique for feature selection. All datasets are fed into the chi-square technique, which selects features by measuring how expected values compare with the observed data. The essential missing data are then replaced using mean imputation, that is, with the mean of the available values. Finally, the selected features are used as input to a supervised machine learning model that classifies sepsis patients. Accuracy and processing time are measured on different datasets. The results show that the proposed solution achieves better classification performance in different data scenarios and different review types. The proposed solution provides an average classification accuracy of 97.67%, against 91.12% for the current solution. It also provides an average processing time of 29.1 milliseconds, against 32.8 milliseconds for the current solution. The proposed system focuses on the feature selection process in the machine learning model. Finally, this study solves the issue of model overfitting with supervised machine learning.
UR - https://hdl.handle.net/1959.7/uws:63127
U2 - 10.1007/s11042-021-10725-2
DO - 10.1007/s11042-021-10725-2
M3 - Article
SN - 1380-7501
VL - 80
SP - 20477
EP - 20500
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 13
ER -