4.6 Article

Anti-forensics of fake stereo audio using generative adversarial network

期刊

MULTIMEDIA TOOLS AND APPLICATIONS
卷 81, 期 12, 页码 17155-17167

出版社

SPRINGER
DOI: 10.1007/s11042-022-12448-4

关键词

Generative adversarial network; Anti-forensics; Stereo faking

资金

  1. National Natural Science Foundation of China [61300055]
  2. Zhejiang Natural Science Foundation [LY20F020010, LY17F020010]
  3. Ningbo Natural Science Foundation [202003N4089]
  4. Ningbo Science and Technology Innovation 2025 Major Project [2018B10010, 2019B10075]
  5. K.C. Wong Magna Fund in Ningbo University

向作者/读者索取更多资源

Fake-quality audio detection, specifically in the context of stereo-faked audio, is an important field in digital audio forensics. This study proposes an anti-forensic framework based on generative adversarial network to expose the weaknesses of stereo-faking detectors. By generating fake stereo audio using a mono audio, the researchers demonstrate that detection accuracy can significantly decrease while the false acceptance rate increases.
Fake-quality audio detection is an important branch in the field of digital audio forensics. Resampling and recompression are the two typical operations to achieve fake audio quality, in which an audio with low sampling/bit rate can be converted to one with higher sampling/bit rate pretending to be in high quality. Stereo-faking is another fake-quality operation, with which a mono audio can be converted into a stereo one. To detect the stereo-faking, a few forensic methods have been proposed. Little consideration, however, has been given to the security of these methods themselves. To expose the weakness of these stereo-faking detectors, an anti-forensic framework based on generative adversarial network is proposed. The fake stereo audio is created by generating a new channel audio based on a mono audio. Skip connection is adopted to ensure the quality of the generated audio. Considering that stereo application scenarios are mostly music and film recording, a large number of music and film recordings are downloaded from the Internet as our datasets. Use these datasets to train our model. The anti-forensic samples generated by the model are used to attack the most effective fake stereo audio detectors. Experimental results show that the generated fake stereo audio of music can significantly reduce its detection accuracy from about 99-30%, and the false acceptance rate can increase from 0.08% to about 69%. The fake stereo audio generated from the film recording can significantly reduce its detection accuracy from about 99-1.7%, and the false acceptance rate can increase from 0.02% to about 98%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据