☆ 3.8 Proceedings Paper

A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection

INTERSPEECH 2021 (2021)

期刊

INTERSPEECH 2021

卷 -, 期 -, 页码 4259-4263

出版社

ISCA-INT SPEECH COMMUNICATION ASSOC

DOI: 10.21437/Interspeech.2021-702

关键词

anti-spoofing; countermeasure; ASVspoof 2019; logical access; deep learning; significance test

类别

Audiology & Speech-Language Pathology Computer Science, Artificial Intelligence Computer Science, Software Engineering

资金

JST CREST Grants, Japan [JPMJCR18A6, JPMJCR20D3]
MEXT KAKENHI Grants, Japan [16H06302, 18H04120, 18H04112, 18KT0051]
Google AI for Japan program

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Recent research has focused on back-end neural networks and training criteria for speech spoofing countermeasures. This study offers a comparative perspective on various models and recognizes the potential impact of random initial seed on model performance. Promising techniques, including average pooling and a new hyper-parameter-free loss function, led to the best single model with significantly different statistical performance compared to others.

A great deal of recent research effort on speech spoofing countermeasures has been invested into back-end neural networks and training criteria. We contribute to this effort with a comparative perspective in this study. Our comparison of countermeasure models on the ASVspoof 2019 logical access scenario takes into account common strategies to deal with input trials of varied length, recently proposed margin-based training criteria, and widely used front ends. We also measured intra-model differences through multiple training-evaluation rounds with random initialization. Our statistical analysis demonstrates that the performance of the same model may be statistically significantly different when just changing the random initial seed. We thus recommend similar statistical analysis or reporting results of multiple runs for further research on the database. Despite the intra-model differences, we observed a few promising techniques, including average pooling, to efficiently process varied-length inputs and a new hyper-parameter-free loss function. The two techniques led to the best single model in our experiment, which achieved an equal error rate of 1.92% and was significantly different in statistical sense from most of the other experimental models.

A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection

期刊

INTERSPEECH 2021

出版社

ISCA-INT SPEECH COMMUNICATION ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection

期刊

INTERSPEECH 2021

出版社

ISCA-INT SPEECH COMMUNICATION ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文