TY - JOUR
T1 - Using structure-based data transformation method to improve prediction accuracies for small data sets
AU - Li, Der Chiang
AU - Chang, Chih Chieh
AU - Liu, Chiao Wen
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2012/2
Y1 - 2012/2
N2 - Small data set problems have been widely considered in many fields, where increasing the prediction ability is the most important goal. This study considers the data structure to identify new data points in a more precise manner, and is thus able to achieve improved prediction capability. The proposed method, named structure-based data transformation, consists of two steps. The first step is using the density-based spatial clustering of applications with noise (DBSCAN) algorithm to separate data sets into clusters, which generates the number of clusters dynamically. The second step is to build up the data transformation function, in which the new attributes are computed using fuzzy membership functions obtained by the corresponding membership grades in each cluster. Three real cases are selected to compare the proposed forecasting model with the linear regression (LR), backpropagation neural network (BPNN), and support vector machine for regression (SVR) methods. The result show that the structure-based data transformation method has better performance than when using the raw data with regard to the error improving rate, mean square error (MSE), and standard deviation (STD).
AB - Small data set problems have been widely considered in many fields, where increasing the prediction ability is the most important goal. This study considers the data structure to identify new data points in a more precise manner, and is thus able to achieve improved prediction capability. The proposed method, named structure-based data transformation, consists of two steps. The first step is using the density-based spatial clustering of applications with noise (DBSCAN) algorithm to separate data sets into clusters, which generates the number of clusters dynamically. The second step is to build up the data transformation function, in which the new attributes are computed using fuzzy membership functions obtained by the corresponding membership grades in each cluster. Three real cases are selected to compare the proposed forecasting model with the linear regression (LR), backpropagation neural network (BPNN), and support vector machine for regression (SVR) methods. The result show that the structure-based data transformation method has better performance than when using the raw data with regard to the error improving rate, mean square error (MSE), and standard deviation (STD).
UR - http://www.scopus.com/inward/record.url?scp=84856016410&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84856016410&partnerID=8YFLogxK
U2 - 10.1016/j.dss.2011.11.021
DO - 10.1016/j.dss.2011.11.021
M3 - Article
AN - SCOPUS:84856016410
SN - 0167-9236
VL - 52
SP - 748
EP - 756
JO - Decision Support Systems
JF - Decision Support Systems
IS - 3
ER -