期刊
PLOS ONE
卷 8, 期 2, 页码 -出版社
PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0055674
关键词
-
资金
- NSF [BDI-0446224, EF-0337220, DEB-0328922, DBI-0850223]
- [TAMOP-4.2/B-09/1/KONV-2010-0005]
- [KTIA-OTKA CNK 80140]
- [HURO/0901/205/2.2.2]
- Div Of Biological Infrastructure
- Direct For Biological Sciences [1321620] Funding Source: National Science Foundation
- Emerging Frontiers
- Direct For Biological Sciences [1039530] Funding Source: National Science Foundation
Hymenoptera, the insect order that includes sawflies, bees, wasps, and ants, exhibits an incredible diversity of phenotypes, with over 145,000 species described in a corpus of textual knowledge since Carolus Linnaeus. In the absence of specialized training, often spanning decades, however, these articles can be challenging to decipher. Much of the vocabulary is domain-specific (e.g., Hymenoptera biology), historically without a comprehensive glossary, and contains much homonymous and synonymous terminology. The Hymenoptera Anatomy Ontology was developed to surmount this challenge and to aid future communication related to hymenopteran anatomy, as well as provide support for domain experts so they may actively benefit from the anatomy ontology development. As part of HAO development, an active learning, dictionary-based, natural language recognition tool was implemented to facilitate Hymenoptera anatomy term discovery in literature. We present this tool, referred to as the 'Proofer', as part of an iterative approach to growing phenotype-relevant ontologies, regardless of domain. The process of ontology development results in a critical mass of terms that is applied as a filter to the source collection of articles in order to reveal term occurrence and biases in natural language species descriptions. Our results indicate that taxonomists use domain-specific terminology that follows taxonomic specialization, particularly at superfamily and family level groupings and that the developed Proofer tool is effective for term discovery, facilitating ontology construction.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据