A similarity measure for text processing

Jung Yi Jiang, Wen Hao Cheng, Yu Shu Chiou, Shie Jue Lee

研究成果: Conference contribution

10 引文 斯高帕斯(Scopus)

摘要

In this paper, we propose a novel similarity measure for document data processing. For two document vectors, the proposed measure takes three cases into account: a) The feature considered appears in both documents, b) the feature considered appears in only one document, and c) the feature considered appears in none of the documents. For the first case, we give a lower bound and decrease the similarity according to the difference between the feature values of the two documents. For the second case, we give a fixed value disregarding the magnitude of the feature value. For the last case, we treat it as an identity, Experimental results show that our proposed method can work more effectively than others.

原文English
主出版物標題Proceedings of 2011 International Conference on Machine Learning and Cybernetics, ICMLC 2011
頁面1460-1465
頁數6
DOIs
出版狀態Published - 2011 十一月 7
事件2011 International Conference on Machine Learning and Cybernetics, ICMLC 2011 - Guilin, Guangxi, China
持續時間: 2011 七月 102011 七月 13

出版系列

名字Proceedings - International Conference on Machine Learning and Cybernetics
4
ISSN(列印)2160-133X
ISSN(電子)2160-1348

Other

Other2011 International Conference on Machine Learning and Cybernetics, ICMLC 2011
國家China
城市Guilin, Guangxi
期間11-07-1011-07-13

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computational Theory and Mathematics
  • Computer Networks and Communications
  • Human-Computer Interaction

指紋 深入研究「A similarity measure for text processing」主題。共同形成了獨特的指紋。

引用此