GENIA corpus-a semantically annotated corpus for bio-textmining

BIOINFORMATICS (2003)

Journal

BIOINFORMATICS

Volume 19, Issue -, Pages i180-i182

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/bioinformatics/btg1023

Keywords

Text Mining; Information Extraction; Corpus; Natural Language Processing; Computational Molecular Biology

Abstract

Motivation: Natural language processing (NLP) methods are regarded as being useful to raise the potential of text mining from biological literature. The lack of an extensively annotated corpus of this literature, however, causes a major bottleneck for applying NLP techniques. GENIA corpus is being developed to provide reference materials to let NLP techniques work for bio-textmining.

Results: GENIA corpus version 3.0 consisting of 2000 MEDLINE abstracts has been released with more than 400 000 words and almost 100 000 annotations for biological terms.
