☆ 4.7 Article

Separable Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech Steganalysis

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY (2023)

Journal

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY

Volume 18, Issue -, Pages 2737-2750

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIFS.2023.3269640

Keywords

Feature extraction; Calibration; Convolution; Steganography; Speech coding; Coherence; Neural networks; Steganalysis; separable convolution; pulse position; dual-stream network; calibration

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper proposes a novel steganalysis method based on separable convolution network (SepSteNet) with dual-stream pyramid enhanced strategy (DPES) to improve the detection performance of speech steganography. Experimental results show that the presented method significantly outperforms the existing ones, and DPES can effectively enhance the performance of the existing deep neural network for speech steganalysis.

Steganography based on fixed codebook has become one of the most important branches of speech steganography due to its high imperceptibility and having the largest available carrier space. As its countermeasure technique, this paper presents a novel steganalysis method based on separable convolution network (SepSteNet) with dual-stream pyramid enhanced strategy (DPES). Specifically, to better acquire discriminative representations, we design the pulse-aware separable block to capture the pulse correspondence along independent levels of pulse positions, where the pulse-aware excitation module is plugged to avoid noisy clue accumulation by adaptively emphasizing the salient part. Moreover, the global attending block is introduced to enhance correspondence features through calculating global responses at distinct subframes. In addition, to eliminate the negative impact of sample content, DPES is leveraged to incorporate cross-domain coherence features by the inverted connected dual-stream branches. With the original and calibration speech samples, two branches enable the correspondence of two detection feature domains to interact with each other to generate coherence features independent of sample content, thereby improving the detection performance. The performance of the presented method is comprehensively evaluated and compared with the state of the arts. The experimental results demonstrate that the presented method significantly outperforms the existing ones. Furthermore, DPES is shown to be a general enhancement strategy that can effectively improve the performance of the existing deep neural network for speech steganalysis. The source code for this work is publicly available on https://github.com/BarryxxZ/SepSteNetwithDPES.

Separable Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech Steganalysis

Journal

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Separable Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech Steganalysis

Journal

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper