Multilabel text categorization based on fuzzy relevance clustering

Shie Jue Lee, Jung Yi Jiang

Research output: Article (peer-reviewed)

36 Citations (Scopus)

Abstract

We propose a fuzzy-based method for multilabel text classification, in which a document can belong to one or more categories. In text categorization, the number of features involved is usually huge, which leads to the curse of dimensionality. In addition, a category can be a nonconvex region formed by the union of several overlapping or disjoint subregions. An automatic classification system may therefore suffer from large memory requirements or poor performance. By incorporating fuzzy techniques, the proposed method addresses these issues. A fuzzy relevance measure transforms high-dimensional documents into low-dimensional fuzzy relevance vectors, which avoids the curse of dimensionality. A clustering technique divides the relevance space into a collection of subregions, which are then combined to form individual categories. This allows complex, nonconvex regions to be created. Experiments are presented that show the effectiveness of the proposed method in both classification performance and speed.
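The abstract does not state the formulas, so the following is only an illustrative sketch of the two stages it describes, not the authors' algorithm. It assumes a simple relevance measure (the share of a term's occurrences that fall in documents of each category), and it swaps in plain k-means with a 0.5 label threshold for the paper's clustering step. All function names, the toy data, and the threshold are hypothetical.

```python
import numpy as np

def term_category_relevance(X, Y):
    """X: (n_docs, n_terms) term counts; Y: (n_docs, n_cats) 0/1 labels.
    Returns R (n_terms, n_cats): fraction of each term's occurrences that
    appear in documents labelled with each category (assumed measure)."""
    X, Y = np.asarray(X, dtype=float), np.asarray(Y, dtype=float)
    counts = X.T @ Y
    totals = X.sum(axis=0)[:, None]
    return np.divide(counts, totals, out=np.zeros_like(counts), where=totals > 0)

def relevance_vectors(X, R):
    """Map each document to a low-dimensional vector in [0, 1]^n_cats:
    the count-weighted average relevance of its terms."""
    X = np.asarray(X, dtype=float)
    Z = X @ R
    lengths = X.sum(axis=1, keepdims=True)
    return np.divide(Z, lengths, out=np.zeros_like(Z), where=lengths > 0)

def nearest(Z, centers):
    """Index of the closest center for each row of Z (squared Euclidean)."""
    d = ((Z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def kmeans(Z, k, n_iter=20):
    """Plain k-means, initialised from the first k points so the sketch is
    deterministic. A stand-in for the paper's clustering technique."""
    centers = Z[:k].copy()
    for _ in range(n_iter):
        assign = nearest(Z, centers)
        for j in range(k):
            if np.any(assign == j):
                centers[j] = Z[assign == j].mean(axis=0)
    return centers, nearest(Z, centers)

def cluster_label_memberships(Y, assign, k):
    """For each cluster (subregion), the fraction of its members carrying
    each label; empty clusters get zeros."""
    Y = np.asarray(Y, dtype=float)
    M = np.zeros((k, Y.shape[1]))
    for j in range(k):
        if np.any(assign == j):
            M[j] = Y[assign == j].mean(axis=0)
    return M

def predict(x, R, centers, M, threshold=0.5):
    """Assign every label whose membership in the nearest subregion
    reaches the (assumed) threshold."""
    z = relevance_vectors(np.atleast_2d(x), R)
    return M[nearest(z, centers)[0]] >= threshold

# Toy corpus: 5 documents, 4 terms, 2 categories (made-up data).
X = np.array([[3, 1, 0, 0],
              [2, 2, 0, 0],
              [0, 0, 3, 1],
              [0, 0, 1, 3],
              [2, 0, 2, 0]])
Y = np.array([[1, 0], [1, 0], [0, 1], [0, 1], [1, 1]])

R = term_category_relevance(X, Y)
Z = relevance_vectors(X, R)          # 4-dimensional docs become 2-dimensional
centers, assign = kmeans(Z, k=3)
M = cluster_label_memberships(Y, assign, k=3)
print(predict([4, 1, 0, 0], R, centers, M))
```

Because a category is the union of every subregion that carries its label, the region for one category can be disjoint or nonconvex even though each cluster is a simple convex cell.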

Original language: English
Article number: 6679223
Pages (from-to): 1457-1471
Number of pages: 15
Journal: IEEE Transactions on Fuzzy Systems
Volume: 22
Issue number: 6
Publication status: Published - 1 Dec 2014

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Computational Theory and Mathematics
  • Artificial Intelligence
  • Applied Mathematics
