4.6 Article

A practical guide for generating unsupervised, spectrogram-based latent space representations of animal vocalizations

期刊

JOURNAL OF ANIMAL ECOLOGY
卷 91, 期 8, 页码 1567-1581

出版社

WILEY
DOI: 10.1111/1365-2656.13754

关键词

animal sounds; animal vocalizations; bioacoustics; call classification; dimensionality reduction; spectrogram; UMAP; unsupervised learning

资金

  1. Alexander von Humboldt-Stiftung
  2. Deutsche Forschungsgemeinschaft [EXC 2117 -422037984]
  3. Gips-Schule-Stiftung
  4. Human Frontiers Science Program [RGP0051/2019]
  5. Minerva Foundation
  6. University Konstanz Zukunftskolleg

向作者/读者索取更多资源

This study explores the feature classification and similarity calculation of animal vocalization data using neighborhood-based dimensionality reduction of spectrograms. The results demonstrate that this method can generate meaningful latent space representations and can be applied to classify ambiguous calls and detect mislabeled calls.
Background: The manual detection, analysis and classification of animal vocalizations in acoustic recordings is laborious and requires expert knowledge. Hence, there is a need for objective, generalizable methods that detect underlying patterns in these data, categorize sounds into distinct groups and quantify similarities between them. Among all computational methods that have been proposed to accomplish this, neighbourhood-based dimensionality reduction of spectrograms to produce a latent space representation of calls stands out for its conceptual simplicity and effectiveness. Goal of the study/what was done: Using a dataset of manually annotated meerkat Suricata suricatta vocalizations, we demonstrate how this method can be used to obtain meaningful latent space representations that reflect the established taxonomy of call types. We analyse strengths and weaknesses of the proposed approach, give recommendations for its usage and show application examples, such as the classification of ambiguous calls and the detection of mislabelled calls. What this means: All analyses are accompanied by example code to help researchers realize the potential of this method for the study of animal vocalizations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据