Chronic Disease Prediction Model Using Integration of DBSCAN, SMOTE-ENN, and Random Forest

Norma Latif Fitriyani, Muhammad Syafrudin, Ganjar Alfian, Chuan Kai Yang, Jongtae Rhee, Siti Maghfirotul Ulyah

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Heart disease (HD) is number one chronic disease and becomes a major cause of worldwide disability and death. Aside of HD, type 2 diabetes (T2D) is also as the most deathful diseases that causes serious issues if untreated and undetected. HD and T2D predictions are the most effective measures to control the HD and T2D. Thus, early HD and T2D predictions are important to help individuals in preventing the occurrence of the worst cases. This study proposes a chronic disease prediction model for HD and T2D prediction. The proposed study utilized random forest combined with DBSCAN as outlier detection method and SMOTE-ENN as data balancing method. Two HD datasets (Statlog and Cleveland) and one T2D dataset (NHIS Korea) were used for building the model and comparing the results with other existing machine learning (ML) algorithms, including GNB, LR, MLP, DT, and SVM. To measure the performance of the model, k-fold (10) cross-validation and several performance metrics including accuracy, precision, f-measure, and recall are applied in this study. The results show the model that we proposed outperforms other classification models, as well as previous studies, with accuracy rates 97.63%, 97.69%, and 94.85% for Statlog HD dataset, Cleveland HD dataset and NHIS T2D dataset, respectively. By utilizing the proposed model, it could increase the expectation in preventing the occurrence of the worst case and helping individuals in taking fast and precise actions when status of HD and T2D are detected.

Original languageEnglish
Title of host publication2022 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems, ICETSIS 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages289-294
Number of pages6
ISBN (Electronic)9781665469197
DOIs
Publication statusPublished - 2022
Event2022 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems, ICETSIS 2022 - Virtual, Online, Bahrain
Duration: 22 Jun 202223 Jun 2022

Publication series

Name2022 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems, ICETSIS 2022

Conference

Conference2022 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems, ICETSIS 2022
Country/TerritoryBahrain
CityVirtual, Online
Period22/06/2223/06/22

Keywords

  • heart disease
  • machine learning
  • outlier
  • type 2 diabetes
  • unbalanced data

Fingerprint

Dive into the research topics of 'Chronic Disease Prediction Model Using Integration of DBSCAN, SMOTE-ENN, and Random Forest'. Together they form a unique fingerprint.

Cite this