4.7 Article

Real-time steganalysis for streaming media based on multi-channel convolutional sliding windows

期刊

KNOWLEDGE-BASED SYSTEMS
卷 237, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2021.107561

关键词

Voice over IP (VoIP); Real-time steganalysis; Convolutional sliding window; Multi-channel feature extraction

资金

  1. National Key Research and Development Program of China [2018YFC1604000/2018YFC1604002]

向作者/读者索取更多资源

This paper proposes a real-time VoIP steganalysis model to improve the accuracy and efficiency of detecting covert communications. The model utilizes multi-channel convolutional sliding windows and statistical feature analysis to achieve high detection performance, especially for low embedding rates.
In recent years, covert communication technologies based on Voice over Internet Protocol (VoIP) have received more and more attention, which meanwhile poses a significant threat to the security of cyberspace. In this paper, we are chiefly concerned with improving the accuracy and efficiency of detection of covert communications, and we propose a real-time VoIP steganalysis model to tackle these issues. Multi-channel convolutional sliding windows (CSW) are developed to analyze the correlations between a given frame and its neighboring frames in a VoIP signal. Within each sliding window, we employ two feature extraction channels to extract correlation features from the input signal. Each channel is constructed of multiple convolutional layers having a large number of convolution kernels. The extracted features are then fed to a forward fully connected network for feature fusion. By analyzing the statistical distribution of these features, the discriminator will determine whether the input speech signal contains covert information or not. We designed several experiments to test the proposed model's detection performance under various conditions, including different embedding rates, different speech lengths, etc. Experimental results show that the proposed model can efficiently and accurately detect steganographic voice streams, especially in the case of low embedding rates. In addition, further experiments demonstrate that the proposed model can attain nearly real-time detection of VoIP speech signals and achieve state-of-the-art performance. (c) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据