4.7 Article

High-performance gene name normalization with GENO

期刊

BIOINFORMATICS
卷 25, 期 6, 页码 815-821

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btp071

关键词

-

资金

  1. German Ministry of Science and Education (BMBF) [01DS001B]
  2. European Commission [28099]

向作者/读者索取更多资源

Motivation: The recognition and normalization of textual mentions of gene and protein names is both particularly important and challenging. Its importance lies in the fact that they constitute the crucial conceptual entities in biomedicine. Their recognition and normalization remains a challenging task because of widespread gene name ambiguities within species, across species, with common English words and with medical sublanguage terms. Results: We present GENO, a highly competitive system for gene name normalization, which obtains an F-measure performance of 86.4% (precision: 87.8%, recall: 85.0%) on the BIOCREATIVE-II test set, thus being on a par with the best system on that task. Our system tackles the complex gene normalization problem by employing a carefully crafted suite of symbolic and statistical methods, and by fully relying on publicly available software and data resources, including extensive background knowledge based on semantic pro. ling. A major goal of our work is to present GENO's architecture in a lucid and perspicuous way to pave the way to full reproducibility of our results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据