☆ 4.7 Article

Passive acoustic monitoring of animal populations with transfer learning

ECOLOGICAL INFORMATICS (2022)

期刊

ECOLOGICAL INFORMATICS

卷 70, 期 -, 页码 -

出版社

ELSEVIER

DOI: 10.1016/j.ecoinf.2022.101688

关键词

Transfer learning; Convolutional neural networks; Deep learning; Vocalisation classification; Bioacoustics

类别

Ecology

资金

African Institute for Mathematical Sciences South Africa
International Development Research Centre, Ottawa, Canada
Government of Canada through Global Affairs Canada (GAC)
Microsoft's AI for Earth program
Centre ValBio research station & Madagascar National Parks

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Progress in deep learning, specifically in using convolutional neural networks for classification models, has been significant. This study investigates the use of transfer learning in passive acoustic monitoring, showing that it can improve F1 score up to 82% while simplifying implementation and design decisions.

Progress in deep learning, more specifically in using convolutional neural networks (CNNs) for the creation of classification models, has been tremendous in recent years. Within bioacoustics research, there has been a large number of recent studies that use CNNs. Designing CNN architectures from scratch is non-trivial and requires knowledge of machine learning. Furthermore, hyper-parameter tuning associated with CNNs is extremely time consuming and requires expensive hardware. In this paper we assess whether it is possible to build good bioacoustic classifiers by adapting and re-using existing CNNs pre-trained on the ImageNet dataset - instead of designing them from scratch, a strategy known as transfer learning that has proved highly successful in other domains. This study is a first attempt to conduct a large-scale investigation on how transfer learning can be used for passive acoustic monitoring (PAM), to simplify the implementation of CNNs and the design decisions when creating them, and to remove time consuming hyper-parameter tuning phases. We compare 12 modern CNN architectures across 4 passive acoustic datasets that target calls of the Hainan gibbon Nomascus hainanus, the critically endangered black-and-white ruffed lemur Varecia variegata, the vulnerable Thyolo alethe Chamaetylas choloensis, and the Pin-tailed whydah Vidua macroura. We focus our work on data scarcity issues by training PAM binary classification models very small datasets, with as few as 25 verified examples. Our findings reveal that transfer learning can result in up to 82% F1 score while keeping CNN implementation details to a minimum, thus rendering this approach accessible, easier to design, and speeding up further vocalisation annotations to create PAM robust models.

Passive acoustic monitoring of animal populations with transfer learning

期刊

ECOLOGICAL INFORMATICS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Passive acoustic monitoring of animal populations with transfer learning

期刊

ECOLOGICAL INFORMATICS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文