Literature extraction of protein functions using sentence pattern mining

Jung-Hsien Chiang, Hsu Chun Yu

Research output: Contribution to journalArticle

15 Citations (Scopus)


With the rapid growth of articles of genomics research, it has become a challenge for biomedical researchers to access this ever-increasing quantity of information to understand the newest discovery of functions of proteins they are studying. To facilitate functional annotation of proteins by utilizing the huge amounts of biomedical literature and transforming the knowledge into easily accessible database formats, the text mining technique thus becomes essential. In this paper, we propose the method of sentence pattern mining to extract protein functions from biomedical literature. To recognize variants of function terms correctly, we identify morphological, syntactic, and semantic variation forms. The proposed methods can be used to aid database curators in annotating protein functions and to assist biologists and medical researchers in searching protein functions from biomedical literature.

Original languageEnglish
Pages (from-to)1088-1098
Number of pages11
JournalIEEE Transactions on Knowledge and Data Engineering
Issue number8
Publication statusPublished - 2005 Aug 1

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Fingerprint Dive into the research topics of 'Literature extraction of protein functions using sentence pattern mining'. Together they form a unique fingerprint.

Cite this