Modeling naïve bayes imputation classification for missing data

B. K. Khotimah, Miswanto, H. Suprajitno

Research output: Contribution to journalConference articlepeer-review

5 Citations (Scopus)

Abstract

Naïve Bayes Imputation (NBI) is used to fill in missing values by replacing the attribute information according to the probability estimate. The NBI process divides the whole data into two sub-sets is the complete data and data containing missing data. Complete data is used for the imputation process at the lost value. The process is repeated for each missing attribute to generate complete data for classification. This research applies NBI for imputation and preprocessing as preparation of classification process. The trial of this study used NBI for imputation compared to using the mean and mode to predict the missing data. The data used for imputation is full train of complete data as a whole to predict the missing value so as to represent the entire data. The results of this study prove that imputation with NBI produces the right imputation with higher accuracy than other imputations. NBI with single imputation and multiple imputation results in better performance because of the right features. This study aims to calculate the effect of missing values on Naïve Bayes Imputation Algorithm is based on a probalistic model using mixed data. Empirically shows that the interaction between several methods of imputation and supervised classification results in differences in the performance of classification for the same imputation method.

Original languageEnglish
Article number012111
JournalIOP Conference Series: Earth and Environmental Science
Volume243
Issue number1
DOIs
Publication statusPublished - 9 Apr 2019
Event1st International Conference on Environmental Geography and Geography Education, ICEGE 2018 - Jember, East Java, Indonesia
Duration: 17 Nov 201818 Nov 2018

Fingerprint

Dive into the research topics of 'Modeling naïve bayes imputation classification for missing data'. Together they form a unique fingerprint.

Cite this