☆ 4.5 Article

Using binary classification to prioritize and curate articles for the Comparative Toxicogenomics Database

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2012)

Journal

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION

Volume -, Issue -, Pages -

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/database/bas050

Keywords

Funding

DebugIT project
European Community [FP7-217139]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

We report on the original integration of an automatic text categorization pipeline, soq-called ToxiCat (Toxicogenomic Categorizer), that we developed to perform biomedical documents classification and prioritization in order to speed up the curation of the Comparative Toxicogenomics Database (CTD). The task can be basically described as a binary classification task, where a scoring function is used to rank a selected set of articles. Then components of a questionq-answering system are used to extract CTDq-specific annotations from the ranked list of articles. The ranking function is generated using a Support Vector Machine, which combines three main modules: an information retrieval engine for MEDLINE (EAGLi), a gene normalization service (NormaGene) developed for a previous BioCreative campaign and finally, a set of answering components and entity recognizer for diseases and chemicals. The main components of the pipeline are publicly available both as web application and web services. The specific integration performed for the BioCreative competition is available via a web user interface at http://pingu.unige.ch:8080/Toxicat.

Using binary classification to prioritize and curate articles for the Comparative Toxicogenomics Database

Journal

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Using binary classification to prioritize and curate articles for the Comparative Toxicogenomics Database

Journal

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper