TY - JOUR
T1 - A fuzzy conceptualization model for text mining with application in opinion polarity classification
AU - Li, Sheng Tun
AU - Tsai, Fu Ching
N1 - Funding Information:
This study was supported in part by National Science Council, Taiwan under Contract NSC99-2410-H-006-054-MY3 and NSC101-2410-H-006-012.
PY - 2013/2
Y1 - 2013/2
N2 - Automatic text classification in text mining is a critical technique to manage huge collections of documents. However, most existing document classification algorithms are easily affected by ambiguous terms. The ability to disambiguate for a classifier is thus as important as the ability to classify accurately. In this paper, we propose a novel classification framework based on fuzzy formal concept analysis to conceptualize documents into a more abstract form of concepts, and use these as the training examples to alleviate the arbitrary outcomes caused by ambiguous terms. The proposed model is evaluated on a benchmark testbed and two opinion polarity datasets. The experimental results indicate superior performance in all datasets. Applying concept analysis to opinion polarity classification is a leading endeavor in the disambiguation of Web 2.0 contents, and the approach presented in this paper offers significant improvements on current methods. The results of the proposed model reveal its ability to decrease the sensitivity to noise, as well as its adaptability in cross domain applications.
AB - Automatic text classification in text mining is a critical technique to manage huge collections of documents. However, most existing document classification algorithms are easily affected by ambiguous terms. The ability to disambiguate for a classifier is thus as important as the ability to classify accurately. In this paper, we propose a novel classification framework based on fuzzy formal concept analysis to conceptualize documents into a more abstract form of concepts, and use these as the training examples to alleviate the arbitrary outcomes caused by ambiguous terms. The proposed model is evaluated on a benchmark testbed and two opinion polarity datasets. The experimental results indicate superior performance in all datasets. Applying concept analysis to opinion polarity classification is a leading endeavor in the disambiguation of Web 2.0 contents, and the approach presented in this paper offers significant improvements on current methods. The results of the proposed model reveal its ability to decrease the sensitivity to noise, as well as its adaptability in cross domain applications.
UR - http://www.scopus.com/inward/record.url?scp=84871924091&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84871924091&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2012.10.005
DO - 10.1016/j.knosys.2012.10.005
M3 - Article
AN - SCOPUS:84871924091
SN - 0950-7051
VL - 39
SP - 23
EP - 33
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
ER -