Automatic domain-specific sentiment lexicon generation with label propagation

Yen Jen Tai, Hung Yu Kao

Research output: Conference contribution

18 Citations (Scopus)

Abstract

The rise of social media has led to explosive growth in opinion data, and sentiment analysis has attracted considerable attention as a result. Current sentiment analysis applications follow two main approaches: the lexicon-based approach and the machine-learning approach. Both face the challenge of obtaining large amounts of human-labeled training data and corpora. The lexicon-based approach requires a sentiment lexicon to determine opinion polarity. Many benchmark sentiment lexicons exist, but they cannot cover all domain-specific word meanings, so automatic generation of a domain-specific sentiment lexicon becomes an important task. We propose a framework to automatically generate a sentiment lexicon. First, we determine the semantic similarity between pairs of words across the entire unlabeled corpus. We then treat the words as nodes and the similarities as weighted edges to construct word graphs. Finally, a graph-based semi-supervised label propagation method assigns polarity to the unlabeled words through the proposed propagation process. Experiments on Twitter microblog data show that our approach performs better than baseline approaches and general-purpose sentiment dictionaries.
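
The graph construction and propagation steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not state the similarity measure or the propagation rule, so the sketch assumes a generic clamped label propagation in which each unlabeled word repeatedly takes the similarity-weighted average of its neighbors' scores. The function name `propagate_polarity`, the seed words, and all edge weights are hypothetical toy values chosen for illustration.

```python
from typing import Dict, Tuple


def propagate_polarity(
    edges: Dict[Tuple[str, str], float],
    seeds: Dict[str, float],
    iterations: int = 50,
) -> Dict[str, float]:
    """Spread seed polarity scores over an undirected weighted word graph.

    Assumed rule (not taken from the paper): seed words stay clamped to
    their labels, and every other word takes the weighted average of its
    neighbors' current scores on each iteration.
    """
    # Build a symmetric adjacency map: word -> {neighbor: similarity}.
    neighbors: Dict[str, Dict[str, float]] = {}
    for (a, b), weight in edges.items():
        neighbors.setdefault(a, {})[b] = weight
        neighbors.setdefault(b, {})[a] = weight
    for word in seeds:
        neighbors.setdefault(word, {})

    # Seeds start at their labeled polarity; other words start neutral.
    scores = {word: seeds.get(word, 0.0) for word in neighbors}

    for _ in range(iterations):
        updated: Dict[str, float] = {}
        for word, nbrs in neighbors.items():
            total = sum(nbrs.values())
            if word in seeds:
                updated[word] = seeds[word]  # clamp labeled words
            elif total == 0:
                updated[word] = scores[word]  # isolated word: no change
            else:
                updated[word] = (
                    sum(w * scores[n] for n, w in nbrs.items()) / total
                )
        scores = updated
    return scores


# Hypothetical toy graph: similarities are invented, not measured.
edges = {
    ("good", "awesome"): 0.9,
    ("awesome", "sick"): 0.6,
    ("sick", "bad"): 0.2,
    ("bad", "lame"): 0.8,
}
seeds = {"good": 1.0, "bad": -1.0}  # +1 positive, -1 negative

lexicon = propagate_polarity(edges, seeds)
for word, score in sorted(lexicon.items(), key=lambda kv: -kv[1]):
    print(f"{word:8s} {score:+.3f}")
```

In this toy graph the slang word "sick" ends up with a positive score because its strongest edge links it to "awesome", which is the kind of domain-specific polarity a general-purpose dictionary would likely miss.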

Original language: English
Title of host publication: Proceedings - 15th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2013
Pages: 53-62
Number of pages: 10
DOIs
Publication status: Published - 1 Dec 2013
Event: 15th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2013 - Vienna, Austria
Duration: 2 Dec 2013 to 4 Dec 2013

Publication series

Name: ACM International Conference Proceeding Series

Other

Other: 15th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2013
Country/Territory: Austria
City: Vienna
Period: 2 Dec 2013 to 4 Dec 2013

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications
