4.6 Article

End-to-End Dual-Branch Network Towards Synthetic Speech Detection

期刊

IEEE SIGNAL PROCESSING LETTERS
卷 30, 期 -, 页码 359-363

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LSP.2023.3262419

关键词

Forgery; Feature extraction; Finite element analysis; Training; Speech synthesis; Task analysis; Multitasking; ASVspoof 2019 LA; attention mechanism; generalization ability; multi-task learning; synthetic speech detection

向作者/读者索取更多资源

Synthetic speech attacks pose a great threat to ASV systems. A Dual-Branch Network is proposed, using LFCC and CQT as inputs, to enhance the generalization ability for attacks generated by unknown synthesis algorithms. The system outperforms existing state-of-the-art systems and shows good generalization for unknown forgery types.
Synthetic speech attacks bring more threats to Automatic Speaker Verification (ASV) systems, thus many synthetic speech detection (SSD) systems have been proposed to help the ASV system resist synthetic speech attacks. However, existing SSD systems still lack the generalization ability for the attacks generated by unknown synthesis algorithms. This letter proposes an end-to-end ensemble system, namely Dual-Branch Network, in which linear frequency cepstral coefficients (LFCC) and constant Q transform (CQT) are used as the input of two branches respectively. In addition, four fusion strategies are compared for the fusion of two branches to obtain an optimal one; multi-task learning and convolutional block attention module (CBAM) are introduced into the Dual-Branch Network to help the network learn the common forgery features from different forgery types of speech and enhance the representation power of learned features. Experimental results on the ASVspoof 2019 logical access (LA) dataset demonstrate that the proposed system outperforms existing state-of-the-art systems on both t-DCF and EER scores and has good generalization for unknown forgery types of synthetic speech.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据