TY - GEN
T1 - Improving virtual sample generation for small sample learning with dependent attributes
AU - Lin, Liang Sian
AU - Li, Der Chiang
AU - Pan, Chih Wei
PY - 2016/8/31
Y1 - 2016/8/31
N2 - Since the product life cycles are getting shorter and shorter, the issue of small data set learning has drawn more and more attentions in both academics and enterprises. Many methods have been proposed to improve the learning performance of small data set. In these methods, the virtual sample generation approach is the most popular technique for improving small data learning. In the process of virtual sample generation, the attribute independence in small data is the key part to determine the learning performance, because it is the necessary assumption before generating virtual samples. However, in the real world, attributes in the data set usually are not mutual independent. Therefore, this paper proposes a new process to generate independent virtual samples based on the box-and-whisker plot domain estimation. In order to validate the effectiveness of the proposed method, one data set is used to calculate the classification accuracy average and standard deviation based on the support vector machine. The results of the experiment show that the presented method has a superior classification performance than other methods.
AB - Since the product life cycles are getting shorter and shorter, the issue of small data set learning has drawn more and more attentions in both academics and enterprises. Many methods have been proposed to improve the learning performance of small data set. In these methods, the virtual sample generation approach is the most popular technique for improving small data learning. In the process of virtual sample generation, the attribute independence in small data is the key part to determine the learning performance, because it is the necessary assumption before generating virtual samples. However, in the real world, attributes in the data set usually are not mutual independent. Therefore, this paper proposes a new process to generate independent virtual samples based on the box-and-whisker plot domain estimation. In order to validate the effectiveness of the proposed method, one data set is used to calculate the classification accuracy average and standard deviation based on the support vector machine. The results of the experiment show that the presented method has a superior classification performance than other methods.
UR - http://www.scopus.com/inward/record.url?scp=84988905045&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84988905045&partnerID=8YFLogxK
U2 - 10.1109/IIAI-AAI.2016.18
DO - 10.1109/IIAI-AAI.2016.18
M3 - Conference contribution
AN - SCOPUS:84988905045
T3 - Proceedings - 2016 5th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2016
SP - 715
EP - 718
BT - Proceedings - 2016 5th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2016
A2 - Hiramatsu, Ayako
A2 - Matsuo, Tokuro
A2 - Kanzaki, Akimitsu
A2 - Komoda, Norihisa
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2016
Y2 - 10 July 2016 through 14 July 2016
ER -