Curatable Named-Entity Recognition Using Semantic Relations

Yi Yu Hsu, Hung Yu Kao

研究成果: Article同行評審

11 引文 斯高帕斯(Scopus)

摘要

Named-entity recognition (NER) plays an important role in the development of biomedical databases. However, the existing NER tools produce multifarious named-entities which may result in both curatable and non-curatable markers. To facilitate biocuration with a straightforward approach, classifying curatable named-entities is helpful with regard to accelerating the biocuration workflow. Co-occurrence Interaction Nexus with Named-entity Recognition (CoINNER) is a web-based tool that allows users to identify genes, chemicals, diseases, and action term mentions in the Comparative Toxicogenomic Database (CTD). To further discover interactions, CoINNER uses multiple advanced algorithms to recognize the mentions in the BioCreative IV CTD Track. CoINNER is developed based on a prototype system that annotated gene, chemical, and disease mentions in PubMed abstracts at BioCreative 2012 Track I (literature triage). We extended our previous system in developing CoINNER. The pre-tagging results of CoINNER were developed based on the state-of-the-art named entity recognition tools in BioCreative III. Next, a method based on conditional random fields (CRFs) is proposed to predict chemical and disease mentions in the articles. Finally, action term mentions were collected by latent Dirichlet allocation (LDA). At the BioCreative IV CTD Track, the best F-measures reached for gene/protein, chemical/drug and disease NER were 54 percent while CoINNER achieved a 61.5 percent F-measure. System URL: http://ikmbio.csie.ncku.edu.tw/coinner/introduction.htm.

原文English
文章編號6945344
頁(從 - 到)785-792
頁數8
期刊IEEE/ACM Transactions on Computational Biology and Bioinformatics
12
發行號4
DOIs
出版狀態Published - 2015 7月 1

All Science Journal Classification (ASJC) codes

  • 生物技術
  • 遺傳學
  • 應用數學

指紋

深入研究「Curatable Named-Entity Recognition Using Semantic Relations」主題。共同形成了獨特的指紋。

引用此