☆ 4.5 Article

Tensorial dynamic time warping with articulation index representation for efficient audio-template learning

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2018)

期刊

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA

卷 143, 期 3, 页码 1548-1558

出版社

ACOUSTICAL SOC AMER AMER INST PHYSICS

DOI: 10.1121/1.5027245

关键词

类别

Acoustics Audiology & Speech-Language Pathology

资金

TerraSwarm Research Center
STARnet phase of the Focus Center Research Program (FCRP), a Semiconductor Research Corporation program - MARCO
STARnet phase of the Focus Center Research Program (FCRP), a Semiconductor Research Corporation program - DARPA

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Audio classification techniques often depend on the availability of a large labeled training dataset for successful performance. However, in many application domains of audio classification (e.g., wildlife monitoring), obtaining labeled data is still a costly and laborious process. Motivated by this observation, a technique is proposed to efficiently learn a clean template from a few labeled, but likely corrupted (by noise and interferences), data samples. This learning can be done efficiently via tensorial dynamic time warping on the articulation index-based time-frequency representations of audio data. The learned template can then be used in audio classification following the standard template-based approach. Experimental results show that the proposed approach outperforms both (1) the recurrent neural network approach and (2) the state-of-the-art in the template-based approach on a wildlife detection application with few training samples. (C) 2018 Acoustical Society of America.

Tensorial dynamic time warping with articulation index representation for efficient audio-template learning

期刊

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA

出版社

ACOUSTICAL SOC AMER AMER INST PHYSICS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Tensorial dynamic time warping with articulation index representation for efficient audio-template learning

期刊

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA

出版社

ACOUSTICAL SOC AMER AMER INST PHYSICS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文