4.7 Article

Automatic bat call classification using transformer networks

期刊

ECOLOGICAL INFORMATICS
卷 78, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.ecoinf.2023.102288

关键词

Computational bioacoustics; Attention; Transformer; Echolocation; Species identification; Acoustic monitoring; Bat calls

类别

向作者/读者索取更多资源

Automatically identifying bat species from their echolocation calls is a challenging task. Existing models trained on single call datasets often perform poorly on real-life data and are too slow for real-time classification. This study proposes a Transformer architecture trained on synthetic data and achieves high accuracy and F1-score on the test set, outperforming other tools on an independent dataset.
Automatically identifying bat species from their echolocation calls is a difficult but important task for monitoring bats and the ecosystem they live in. Major challenges in automatic bat call identification are high call variability, similarities between species, interfering calls and lack of annotated data. Many currently available models suffer from relatively poor performance on real-life data due to being trained on single call datasets and, moreover, are often too slow for real-time classification. Here, we propose a Transformer architecture for multi-label classification with potential applications in real-time classification scenarios. We train our model on synthetically generated multispecies recordings by merging multiple bats calls into a single recording with multiple simultaneous calls. Our approach achieves a single species accuracy of 88.92% (F1-score of 84.23%) and a multi species macro F1-score of 74.40% on our test set. In comparison to three other tools on the independent and publicly available dataset ChiroVox, our model achieves at least 25.82% better accuracy for single species classification and at least 6.9% better macro F1-score for multi species classification.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据