TY - GEN
T1 - Generating multi-modality virtual samples with soft DBSCAN for small data set learning
AU - Lin, Liang Sian
AU - Li, Der Chiang
AU - Yu, Wei Hao
AU - Hsueh, Yu Mei
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/11/23
Y1 - 2015/11/23
N2 - Owing to the factors of cost and time limit, the number of samples is usually small in the early stages of manufacturing systems. When the number of available data is small, traditional statistic techniques have difficulty to obtain robust analyses. Therefore, based on a uni-modality distribution assumption, many researchers have proposed virtual sample generation methods to expand the training sample size to enhance the performance of small data set learning. In practice, small data may be following a multi-modality distribution. Therefore, in order to solve multi-modal small data sets, this study proposes a new approach to create multi-modality Weibull virtual samples, where we use the maximal p-value to estimate parameters of the Weibull distribution. In addition, the soft DBSCAN method is used to identify a suitable number of modalities. One data set is employed to check the performance of the proposed method, and comparisons are made by the prediction on root mean square error. The results using a paired t-test show that the proposed method has a superior prediction performance than that of the mega-trend-diffusion method using a uni-modality triangular membership function.
AB - Owing to the factors of cost and time limit, the number of samples is usually small in the early stages of manufacturing systems. When the number of available data is small, traditional statistic techniques have difficulty to obtain robust analyses. Therefore, based on a uni-modality distribution assumption, many researchers have proposed virtual sample generation methods to expand the training sample size to enhance the performance of small data set learning. In practice, small data may be following a multi-modality distribution. Therefore, in order to solve multi-modal small data sets, this study proposes a new approach to create multi-modality Weibull virtual samples, where we use the maximal p-value to estimate parameters of the Weibull distribution. In addition, the soft DBSCAN method is used to identify a suitable number of modalities. One data set is employed to check the performance of the proposed method, and comparisons are made by the prediction on root mean square error. The results using a paired t-test show that the proposed method has a superior prediction performance than that of the mega-trend-diffusion method using a uni-modality triangular membership function.
UR - http://www.scopus.com/inward/record.url?scp=84962688534&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84962688534&partnerID=8YFLogxK
U2 - 10.1109/ACIT-CSI.2015.69
DO - 10.1109/ACIT-CSI.2015.69
M3 - Conference contribution
AN - SCOPUS:84962688534
T3 - Proceedings - 3rd International Conference on Applied Computing and Information Technology and 2nd International Conference on Computational Science and Intelligence, ACIT-CSI 2015
SP - 363
EP - 368
BT - Proceedings - 3rd International Conference on Applied Computing and Information Technology and 2nd International Conference on Computational Science and Intelligence, ACIT-CSI 2015
A2 - Tsuchida, Kensei
A2 - Ishii, Naohiro
A2 - Goto, Takaaki
A2 - Takahashi, Satoshi
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd International Conference on Applied Computing and Information Technology and 2nd International Conference on Computational Science and Intelligence, ACIT-CSI 2015
Y2 - 12 July 2015 through 16 July 2015
ER -