Represented indicator measurement and corpus distillation on focus species detection

Chih Hsuan Wei, Hung Yu Kao

研究成果: Conference contribution

2 引文 斯高帕斯(Scopus)

摘要

In extraction of information from the biomedical literature, name disambiguation of domain-specific entities, such as proteins, is one of the most important issues. The entity ambiguity with the highest dimension is the species to which an entity is associated with. Furthermore, one of the bottlenecks in inter-species gene name normalization is species disambiguation. To enhance the performance of species disambiguation, the detection of focus species detection remains a substantial challenge. This study presents a method addressing this issue. The results present evaluations of all articles from the BioCreaTive I&II GN task. Our method is robust for all types of articles, particularly those without explicit species entity information. Since our method requires a training corpus to be the indicator vector, we developed an iterative corpus distillation method to extend the corpus. In the conducted experiments, the proposed method achieved a high accuracy of 85.64% and 84.32% without species entity information.

原文English
主出版物標題Proceedings - 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010
頁面657-662
頁數6
DOIs
出版狀態Published - 2010 十二月 1
事件2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010 - Hong Kong, China
持續時間: 2010 十二月 182010 十二月 21

出版系列

名字Proceedings - 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010

Other

Other2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010
國家China
城市Hong Kong
期間10-12-1810-12-21

All Science Journal Classification (ASJC) codes

  • Biomedical Engineering
  • Health Informatics

指紋 深入研究「Represented indicator measurement and corpus distillation on focus species detection」主題。共同形成了獨特的指紋。

引用此