Early-Stage Diabetes mellitus Risk Prediction And Symptom Association: A Comparative Analysis Using Feature Importance

Darsheel Sanghavi; Daxay Sanghavi; Nilesh Patil

doi:10.53555/kuey.v30i1.6939

Published: Jan 9, 2024

Keywords:

Diabetes prediction, machine learning, feature importance, data mining, association rule mining, Random Forest, Logistic Regression, Naive Bayes, KNN, Decision Tree

Abstract

Early-stage diabetes risk prediction is a critical component of preventive healthcare, with the goal of identifying patients who are at risk of developing diabetes before they have symptoms. This study evaluates multiple machine learning (ML) methods for predicting diabetes risk, including logistic regression, Naive Bayes, random forest, K-Nearest Neighbours (KNN), and decision trees. To train and evaluate these models, we used an upgraded version of the Sylhet Diabetes Hospital Dataset, which has 521 instances and 18 attributes. Our analysis covers each algorithm's predictive accuracy, feature importance rankings across models, association rule mining to identify connections between key diabetes markers, the underlying mathematical foundations, and pseudocode. The results show that the Random Forest algorithm outperforms the other approaches, with an accuracy of 97.1153%. Our findings indicate that polyuria, polydipsia, and gender are significant predictors across multiple algorithms. Association rule mining reveals strong associations among these symptoms, particularly in female patients. This multidimensional approach not only provides a robust foundation for early diabetes detection but also sheds light on the interplay of risk factors. The findings have the potential to enhance preventive care practices and lead to more targeted screening regimens.
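The association rule mining step mentioned in the abstract can be illustrated with the three standard rule metrics: support, confidence, and lift. The sketch below is not the authors' implementation. The records are a hypothetical toy sample invented for illustration (not drawn from the Sylhet dataset), and the helper names `support` and `rule_metrics` are assumptions introduced here.

```python
# Hypothetical toy records: each set lists the markers present for one patient.
# These values are made up for illustration and are NOT from the Sylhet dataset.
records = [
    {"polyuria", "polydipsia", "female", "positive"},
    {"polyuria", "polydipsia", "positive"},
    {"polyuria", "female", "positive"},
    {"polydipsia", "female", "positive"},
    {"female"},
    {"polyuria", "polydipsia", "female", "positive"},
    set(),
    {"polyuria"},
]


def support(itemset, records):
    """Fraction of records that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= record for record in records) / len(records)


def rule_metrics(antecedent, consequent, records):
    """Support, confidence, and lift for the rule antecedent -> consequent."""
    s_a = support(antecedent, records)
    s_c = support(consequent, records)
    s_ac = support(set(antecedent) | set(consequent), records)
    # confidence = P(consequent | antecedent); guard against an unseen antecedent
    confidence = s_ac / s_a if s_a else 0.0
    # lift > 1 means the antecedent makes the consequent more likely than its base rate
    lift = confidence / s_c if s_c else 0.0
    return {"support": s_ac, "confidence": confidence, "lift": lift}


metrics = rule_metrics({"polyuria", "polydipsia"}, {"positive"}, records)
print(metrics)
```

A lift above 1 for a rule such as {polyuria, polydipsia} -> {positive} would indicate that the two symptoms co-occur with a positive label more often than the label's base rate alone would suggest. On real data, candidate rules are usually enumerated with an algorithm such as Apriori rather than checked one at a time.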

How to Cite

Darsheel Sanghavi, Daxay Sanghavi, & Nilesh Patil. (2024). Early-Stage Diabetes mellitus Risk Prediction And Symptom Association: A Comparative Analysis Using Feature Importance. Educational Administration: Theory and Practice, 30(1), 2997–3006. https://doi.org/10.53555/kuey.v30i1.6939

Issue

Vol. 30 No. 1 (2024)

Section

Articles

Author Biographies

Darsheel Sanghavi

Dwarkadas J. Sanghvi College of Engineering, Vile Parle (W), Mumbai – 400 056, India [0009-0005-7484-6660]

Daxay Sanghavi

Dwarkadas J. Sanghvi College of Engineering, Vile Parle (W), Mumbai – 400 056, India [0009-0000-4509-4192]

Nilesh Patil

Dwarkadas J. Sanghvi College of Engineering, Vile Parle (W), Mumbai – 400 056, India [0000-0001-8335-4426]
