☆ 3.8 Article

Extracting and characterizing gene-drug relationships from the literature

PHARMACOGENETICS (2004)

期刊

PHARMACOGENETICS

卷 14, 期 9, 页码 577-586

出版社

LIPPINCOTT WILLIAMS & WILKINS

DOI: 10.1097/00008571-200409000-00002

关键词

algorithms; databases; machine learning; natural language processing; pharmacogenetics

类别

Biotechnology & Applied Microbiology Genetics & Heredity Pharmacology & Pharmacy

资金

NHGRI NIH HHS [5 T32 HG00044] Funding Source: Medline
NIGMS NIH HHS [GM61374] Funding Source: Medline
NLM NIH HHS [LM06244] Funding Source: Medline

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

A fundamental task of pharmacogenetics is to collect and classify relationships between genes and drugs. Currently, this useful information has not been comprehensively aggregated in any database and remains scattered throughout the published literature. Although there are efforts to collect this information manually, they are limited by the size of the published literature on gene-drug relationships. Therefore, we investigated computational methods to extract and characterize pharmacogenetic relationships between genes and drugs from the literature. We first evaluated the effectiveness of the co-occurrence method in identifying related genes and drugs. We then used supervised machine learning algorithms to classify the relationships between genes and drugs from the Pharmacogenetics and Pharmacogenomics Knowledge Base (PharmGKB) into five categories that have been defined by active pharmacogenetic researchers as relevant to their work. The final co-occurrence algorithm was able to extract 78% of the related genes and drugs that were published in a review article from the literature. Our algorithm subsequently classified the relationships between genes and drugs from the PharmGKB into five categories with 74% accuracy. We have made the data available on a supplementary website at http://bionlp.stanford.edu/genedrug/ Gene-drug relationships can be accurately extracted from text and classified into categories. Although the relationships that we have identified do not capture the details and fine distinctions often made in the literature, these methods will help scientists to track the ever-growing literature and create information resources to support future discoveries. (C) 2004 Lippincott Williams Wilkins.

Extracting and characterizing gene-drug relationships from the literature

期刊

PHARMACOGENETICS

出版社

LIPPINCOTT WILLIAMS & WILKINS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Extracting and characterizing gene-drug relationships from the literature

期刊

PHARMACOGENETICS

出版社

LIPPINCOTT WILLIAMS & WILKINS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文