Feasibility study of constructing a screening tool for adolescent diabetes detection applying machine learning methods

Hansel Hu, Tin Lai, Farnaz Farid

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

Prediabetes and diabetes are becoming alarmingly prevalent among adolescents over the past decade. However, an effective screening tool that can assess diabetes risks smoothly is still in its infancy. In order to contribute to such significant gaps, this research proposes a machine learning-based predictive model to detect adolescent diabetes. The model applies supervised machine learning and a novel feature selection method to the National Health and Nutritional Examination Survey datasets after an exhaustive search to select reliable and accurate data. The best model achieved an area under the curve (AUC) score of 71%. This research proves that a screening tool based on supervised machine learning models can assist in the automated detection of youth diabetes. It also identifies some critical predictors to such detection using Lasso Regression, Random Forest Importance and Gradient Boosted Tree Importance feature selection methods. The most contributing features to Youth diabetes detection are physical characteristics (e.g., waist, leg length, gender), dietary information (e.g., water, protein, sodium) and demographics. These predictors can be further utilised in other areas of medical research, such as electronic medical history.
Original languageEnglish
Article number6155
Number of pages13
JournalSensors
Volume22
Issue number16
DOIs
Publication statusPublished - 2022

Open Access - Access Right Statement

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/).

Fingerprint

Dive into the research topics of 'Feasibility study of constructing a screening tool for adolescent diabetes detection applying machine learning methods'. Together they form a unique fingerprint.

Cite this