☆ 4.6 Article

Automatically annotating documents with normalized gene lists

BMC BIOINFORMATICS (2005)

Journal

BMC BIOINFORMATICS

Volume 6, Issue -, Pages -

Publisher

BMC

DOI: 10.1186/1471-2105-6-S1-S13

Keywords

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Background: Document gene normalization is the problem of creating a list of unique identifiers for genes that are mentioned within a document. Automating this process has many potential applications in both information extraction and database curation systems. Here we present two separate solutions to this problem. The first is primarily based on standard pattern matching and information extraction techniques. The second and more novel solution uses a statistical classifier to recognize valid gene matches from a list of known gene synonyms. Results: We compare the results of the two systems, analyze their merits and argue that the classification based system is preferable for many reasons including performance, simplicity and robustness. Our best systems attain a balanced precision and recall in the range of 74%-92%, depending on the organism.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6

Not enough ratings

Automatically annotating documents with normalized gene lists

Journal

BMC BIOINFORMATICS

Publisher

BMC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Automatically annotating documents with normalized gene lists

Journal

BMC BIOINFORMATICS

Publisher

BMC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper