Article

Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports

Journal

JOURNAL OF BIOMEDICAL INFORMATICS
Volume 45, Issue 5, Pages 885-892

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jbi.2012.04.008

Keywords

Adverse drug effect; Benchmark corpus; Annotation; Harmonization; Sentence classification

Funding

  1. B-IT Research School scholarship grant from the state of North Rhine-Westphalia

A significant amount of information about drug-related safety issues, such as adverse effects, is published in medical case reports that, because of their unstructured nature, can only be explored by human readers. The work presented here aims to generate a systematically annotated corpus that can support the development and validation of methods for the automatic extraction of drug-related adverse effects from medical case reports. The documents were systematically double annotated over several rounds to ensure consistent annotations. The annotated documents were then harmonized to generate representative consensus annotations. To demonstrate an example use case, the corpus was employed to train and validate models that classify sentences as informative or non-informative. A Maximum Entropy classifier trained with simple features and evaluated by 10-fold cross-validation achieved an F1 score of 0.70, indicating a potentially useful application of the corpus. (C) 2012 Elsevier Inc. All rights reserved.
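
The evaluation setup described in the abstract (a Maximum Entropy sentence classifier with simple features, scored by 10-fold cross-validation and F1) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the bag-of-words features, the gradient-descent training loop, the hyperparameters, the pooling of predictions across folds, and the toy sentences are all assumptions introduced here, and none of the sentences come from the corpus. For two classes, a Maximum Entropy model is equivalent to logistic regression, which is what the code fits.

```python
import math
import random
from collections import defaultdict


def features(sentence):
    """Assumed 'simple features': lowercased bag of words plus a bias term."""
    feats = {"<bias>": 1.0}
    for tok in sentence.lower().split():
        feats[tok] = 1.0
    return feats


def train_maxent(data, epochs=50, lr=0.5, l2=1e-3):
    """Binary MaxEnt (logistic regression) fitted by stochastic gradient ascent.

    epochs, lr and l2 are illustrative values, not taken from the paper.
    """
    w = defaultdict(float)
    data = list(data)
    rng = random.Random(0)
    for _ in range(epochs):
        rng.shuffle(data)
        for sent, label in data:
            f = features(sent)
            z = sum(w[k] * v for k, v in f.items())
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well behaved
            p = 1.0 / (1.0 + math.exp(-z))
            for k, v in f.items():
                # gradient of the L2-regularised log-likelihood
                w[k] += lr * ((label - p) * v - l2 * w[k])
    return w


def predict(w, sentence):
    """Return 1 (informative) if the linear score is positive, else 0."""
    z = sum(w.get(k, 0.0) * v for k, v in features(sentence).items())
    return 1 if z > 0 else 0


def f1_score(gold, pred):
    """F1 for the positive (informative) class."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def cross_validate(data, k=10, seed=0):
    """k-fold cross-validation; F1 is computed over predictions pooled from all folds."""
    data = list(data)
    random.Random(seed).shuffle(data)
    folds = [data[i::k] for i in range(k)]
    gold, pred = [], []
    for i in range(k):
        train = [ex for j, fold in enumerate(folds) if j != i for ex in fold]
        w = train_maxent(train)
        for sent, label in folds[i]:
            gold.append(label)
            pred.append(predict(w, sent))
    return f1_score(gold, pred)


# Hypothetical toy sentences (invented placeholders, NOT corpus data).
# Label 1 = informative (mentions a drug-related adverse effect), 0 = not.
TOY_DATA = [
    ("the patient developed a rash after starting drug a", 1),
    ("severe nausea occurred following treatment with drug b", 1),
    ("drug c induced liver injury in this patient", 1),
    ("hypotension developed after administration of drug d", 1),
    ("drug e was associated with acute renal failure", 1),
    ("the patient developed seizures after drug f overdose", 1),
    ("drug g induced thrombocytopenia was diagnosed", 1),
    ("headache occurred following treatment with drug h", 1),
    ("drug i was associated with severe skin reaction", 1),
    ("anemia developed after starting drug j", 1),
    ("a 45 year old woman was admitted to hospital", 0),
    ("the patient had a history of hypertension", 0),
    ("laboratory values were within normal limits", 0),
    ("we review the literature on this condition", 0),
    ("physical examination was unremarkable", 0),
    ("the patient was discharged on day five", 0),
    ("informed consent was obtained from the family", 0),
    ("follow up was scheduled at three months", 0),
    ("a chest radiograph was obtained on admission", 0),
    ("the family history was noncontributory", 0),
]

score = cross_validate(TOY_DATA, k=10)
print(f"pooled 10-fold F1 on toy data: {score:.2f}")
```

The score printed for the toy data says nothing about the corpus; the 0.70 reported in the abstract refers to the authors' own experiment. Whether the original evaluation pooled predictions across folds or averaged per-fold scores is not stated in the abstract, so the pooling used here is one possible choice.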
