4.6 Article

BioPPISVMExtractor: A protein-protein interaction extractor for biomedical literature using SVM and rich feature sets

期刊

JOURNAL OF BIOMEDICAL INFORMATICS
卷 43, 期 1, 页码 88-96

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2009.08.013

关键词

Text mining; Information extraction; Protein-protein interaction; Conditional random fields; Support vector machines

资金

  1. Natural Science Foundation of China [60373095, 60673039]
  2. National High Tech Research and Development Plan of China [2006AA01Z151]

向作者/读者索取更多资源

Protein-protein interactions play a key role in various aspects of the structural and functional organization of the cell. Knowledge about them unveils the molecular mechanisms of biological processes. However, the amount of biomedical literature regarding protein interactions is increasing rapidly and it is difficult for interaction database curators to detect and curate protein interaction information manually. This paper presents a SVM-based system, named BioPPISVMExtractor, to identify protein-protein interactions in biomedical literature. This system uses rich feature sets including word features, keyword feature, protein names distance feature and Link path feature for SVM classification. In addition, the Link Grammar extraction result feature is introduced to improve the precision rate. Experimental evaluations with other state-of-the-art PPI extraction systems tested on the DIP corpus indicate that BioPPISVMExtractor can substantially improve recall at the cost of a moderate decline in precision. (C) 2009 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据