TY - JOUR
T1 - Investigations on Impact of Feature Normalization Techniques for Prediction of Hydro-Climatology Data Using Neural Network Backpropagation with Three Layer Hidden
AU - Syaharuddin,
AU - Fatmawati,
AU - Suprajitno, Herry
N1 - Publisher Copyright:
© 2022 WITPress. All rights reserved.
PY - 2022/11
Y1 - 2022/11
N2 - Data normalization techniques are a very important initial stage to be carried out in order to obtain a good predictive data approach. Many researchers get different prediction and error results in each use of these data normalization techniques. Thereby, in this article discusses the accuracy rate of the seven normalization techniques at the preprocessing stage in the Neural Network Backpropagation (NNBP) architecture including decimal scaling, Z-score, min-max (there are 6 types), sigmoid, tanh estimators, mean-MAD, and median-MAD. We used two data patterns: seasonal data (rainfall) and stationary data (air humidity) that taken over the past 10 years (at 10-day intervals). We use accuracy rate parameters including number of epochs, MAE, and MSE when conducting training, testing, and predictions. The results showed that the Z-score technique was very good for the normalization of rainfall data with epochs of 10, MAE of 0.051, and MSE of 0.004. In the case of air humidity data, mean-MAD and Z-score techniques can be recommended with the number of mean-MAD technique epochs of 8, MAE of 0.013, MSE of 0.0004, while the number of epochs of Z-score techniques of 7, MAE of 0.018, and MSE of 0.0006. Thus, we conclude that when other researchers predict seasonal data or stationary data can use the Z-score technique for data normalization.
AB - Data normalization techniques are a very important initial stage to be carried out in order to obtain a good predictive data approach. Many researchers get different prediction and error results in each use of these data normalization techniques. Thereby, in this article discusses the accuracy rate of the seven normalization techniques at the preprocessing stage in the Neural Network Backpropagation (NNBP) architecture including decimal scaling, Z-score, min-max (there are 6 types), sigmoid, tanh estimators, mean-MAD, and median-MAD. We used two data patterns: seasonal data (rainfall) and stationary data (air humidity) that taken over the past 10 years (at 10-day intervals). We use accuracy rate parameters including number of epochs, MAE, and MSE when conducting training, testing, and predictions. The results showed that the Z-score technique was very good for the normalization of rainfall data with epochs of 10, MAE of 0.051, and MSE of 0.004. In the case of air humidity data, mean-MAD and Z-score techniques can be recommended with the number of mean-MAD technique epochs of 8, MAE of 0.013, MSE of 0.0004, while the number of epochs of Z-score techniques of 7, MAE of 0.018, and MSE of 0.0006. Thus, we conclude that when other researchers predict seasonal data or stationary data can use the Z-score technique for data normalization.
KW - backpropagation algorithms
KW - data preprocessing
KW - normalization technique
KW - rainfall data
KW - temperature data
UR - http://www.scopus.com/inward/record.url?scp=85143797544&partnerID=8YFLogxK
U2 - 10.18280/ijsdp.170707
DO - 10.18280/ijsdp.170707
M3 - Article
AN - SCOPUS:85143797544
SN - 1743-7601
VL - 17
SP - 2069
EP - 2074
JO - International Journal of Sustainable Development and Planning
JF - International Journal of Sustainable Development and Planning
IS - 7
ER -