TY - JOUR
T1 - Incorporating monotonic domain knowledge in support vector learning for data mining regression problems
AU - Chuang, Hui Chi
AU - Chen, Chih Chuan
AU - Li, Sheng Tun
N1 - Funding Information:
This study was supported in part by the Ministry of Science and Technology, Taiwan, under contract NSC 102-2410-H-006-080-MY3, MOST 105-2410-H-006-038-MY3 and MOST 107-2410-H-143-005. The authors also thank Mr. Chi Chou and Mr. Yu-Di, Chen for their help with the experimentation.
Publisher Copyright:
© 2019, Springer-Verlag London Ltd., part of Springer Nature.
PY - 2020/8/1
Y1 - 2020/8/1
N2 - A common problem of data-driven data mining methods is that they might lack considering domain knowledge, despite possibly having high accuracy with respect to the data. As such, prior knowledge plays an important role in many data mining applications. Incorporating prior knowledge into data mining techniques is not trivial and remains a partially open issue drawing much attention. In this paper, we propose a new support vector regression (SVR) model that takes into account the prior knowledge of domain experts in the form of inequalities, which reflect the monotonic relationship between the output and some of the attributes of the input. A dual quadratic programming problem corresponding to the SVR model is derived, along with algorithms for solving it and creating constraints, respectively. The experiment results, which were conducted on two artificial and two practical datasets, show that the proposed model, which considers the monotonicity defined by domain experts, performs better than the original SVR. Moreover, the proposed method is also suitable for prior domain knowledge of piecewise monotonicity.
AB - A common problem of data-driven data mining methods is that they might lack considering domain knowledge, despite possibly having high accuracy with respect to the data. As such, prior knowledge plays an important role in many data mining applications. Incorporating prior knowledge into data mining techniques is not trivial and remains a partially open issue drawing much attention. In this paper, we propose a new support vector regression (SVR) model that takes into account the prior knowledge of domain experts in the form of inequalities, which reflect the monotonic relationship between the output and some of the attributes of the input. A dual quadratic programming problem corresponding to the SVR model is derived, along with algorithms for solving it and creating constraints, respectively. The experiment results, which were conducted on two artificial and two practical datasets, show that the proposed model, which considers the monotonicity defined by domain experts, performs better than the original SVR. Moreover, the proposed method is also suitable for prior domain knowledge of piecewise monotonicity.
UR - http://www.scopus.com/inward/record.url?scp=85077068977&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85077068977&partnerID=8YFLogxK
U2 - 10.1007/s00521-019-04661-4
DO - 10.1007/s00521-019-04661-4
M3 - Article
AN - SCOPUS:85077068977
SN - 0941-0643
VL - 32
SP - 11791
EP - 11805
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 15
ER -