TY - JOUR
T1 - An opinion feature extraction approach based on a multidimensional sentence analysis model
AU - Guo, Jiunn Liang
AU - Peng, Jhih En
AU - Wang, Hei Chia
N1 - Funding Information:
The research is based on work supported (in part) by the NSC 101-2410-H-006-011-MY2 project from National Science Council, Taiwan, HUA100-3-9-315 project from National Cheng Kung University, Taiwan, and 101-EC-17-A-03-S1-214 project from the Ministry of Economic Affairs (MOEA) of Taiwan.
PY - 2013/7/4
Y1 - 2013/7/4
N2 - With Web 2.0 applications being widely used, social networking services, including web blogs, forums, and other online communities, have become informative tools that help individuals to easily gauge the pulse of the electronic consuming market. As a substitute for traditional public media, the related site provides unique mechanisms to instantly reveal the degree of public product acceptance by either statistically aggregating the rating results or archiving opinions shared by experienced customers. However, the growth of user-generated information and its scattered unstructured contents is overwhelming to users, thereby triggering the demand for a more efficient system that can offer concise information. Most existing efforts dedicated to these issues may neglect vital aspects of the sentence-level context. This article aims to explore the critical features hidden in the sentential structure of opinion articles and expects that the detected patterns may contribute to the enhancement of related applications. Accordingly, a multidimensional sentence modeling algorithm (MSMA) is designed to evaluate various sentential characteristics and adopt a genetic algorithm to optimize the weighting scheme while determining feature importance. The study also makes use of the public knowledge resource Wikipedia as a global reference to fine-tune the feature set's effectiveness and enhance the overall performance of this framework. The results of experiments on an electronic product data set demonstrate that the proposed method is promising and provides significant improvement over previous studies.
AB - With Web 2.0 applications being widely used, social networking services, including web blogs, forums, and other online communities, have become informative tools that help individuals to easily gauge the pulse of the electronic consuming market. As a substitute for traditional public media, the related site provides unique mechanisms to instantly reveal the degree of public product acceptance by either statistically aggregating the rating results or archiving opinions shared by experienced customers. However, the growth of user-generated information and its scattered unstructured contents is overwhelming to users, thereby triggering the demand for a more efficient system that can offer concise information. Most existing efforts dedicated to these issues may neglect vital aspects of the sentence-level context. This article aims to explore the critical features hidden in the sentential structure of opinion articles and expects that the detected patterns may contribute to the enhancement of related applications. Accordingly, a multidimensional sentence modeling algorithm (MSMA) is designed to evaluate various sentential characteristics and adopt a genetic algorithm to optimize the weighting scheme while determining feature importance. The study also makes use of the public knowledge resource Wikipedia as a global reference to fine-tune the feature set's effectiveness and enhance the overall performance of this framework. The results of experiments on an electronic product data set demonstrate that the proposed method is promising and provides significant improvement over previous studies.
UR - http://www.scopus.com/inward/record.url?scp=84880422474&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84880422474&partnerID=8YFLogxK
U2 - 10.1080/01969722.2013.789649
DO - 10.1080/01969722.2013.789649
M3 - Article
AN - SCOPUS:84880422474
SN - 0196-9722
VL - 44
SP - 379
EP - 401
JO - Cybernetics and Systems
JF - Cybernetics and Systems
IS - 5
ER -