4.7 Article

A New Method for Counting Reproductive Structures in Digitized Herbarium Specimens Using Mask R-CNN

期刊

FRONTIERS IN PLANT SCIENCE
卷 11, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fpls.2020.01129

关键词

automated regional segmentation; deep learning; digitized herbarium specimen; plant phenology; regional convolutional neural network; reproductive structures; visual data classification

资金

  1. New England Vascular Plant Project [DBI: EF1208835]
  2. NSF-DEB [1754584]
  3. Climate Change Solutions Fund
  4. Harvard University
  5. Harvard Forest
  6. NSF-DBI [EF1208835]
  7. NSF Postdoctoral Research Fellowship in Biology [NSF-DBI-1711936]
  8. French Agence Nationale de la Recherche (ANR) [ANR-17-ROSE-0003]
  9. Agence Nationale de la Recherche (ANR) [ANR-17-ROSE-0003] Funding Source: Agence Nationale de la Recherche (ANR)
  10. Direct For Biological Sciences
  11. Division Of Environmental Biology [1754584] Funding Source: National Science Foundation

向作者/读者索取更多资源

Phenology-the timing of life-history events-is a key trait for understanding responses of organisms to climate. The digitization and online mobilization of herbarium specimens is rapidly advancing our understanding of plant phenological response to climate and climatic change. The current practice of manually harvesting data from individual specimens, however, greatly restricts our ability to scale-up data collection. Recent investigations have demonstrated that machine-learning approaches can facilitate this effort. However, present attempts have focused largely on simplistic binary coding of reproductive phenology (e.g., presence/absence of flowers). Here, we use crowd-sourced phenological data of buds, flowers, and fruits from >3,000 specimens of six common wildflower species of the eastern United States (Anemone canadensisL.,A. hepaticaL.,A. quinquefoliaL.,Trillium erectumL.,T. grandiflorum(Michx.) Salisb., andT. undulatumWild.) to train models using Mask R-CNN to segment and count phenological features. A single global model was able to automate the binary coding of each of the three reproductive stages with >87% accuracy. We also successfully estimated the relative abundance of each reproductive structure on a specimen with >= 90% accuracy. Precise counting of features was also successful, but accuracy varied with phenological stage and taxon. Specifically, counting flowers was significantly less accurate than buds or fruits likely due to their morphological variability on pressed specimens. Moreover, our Mask R-CNN model provided more reliable data than non-expert crowd-sourcers but not botanical experts, highlighting the importance of high-quality human training data. Finally, we also demonstrated the transferability of our model to automated phenophase detection and counting of the threeTrilliumspecies, which have large and conspicuously-shaped reproductive organs. These results highlight the promise of our two-phase crowd-sourcing and machine-learning pipeline to segment and count reproductive features of herbarium specimens, thus providing high-quality data with which to investigate plant responses to ongoing climatic change.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据