4.6 Article

A Word-Embedding-Based Steganalysis Method for Linguistic Steganography via Synonym Substitution

期刊

IEEE ACCESS
卷 6, 期 -, 页码 64131-64141

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2018.2878273

关键词

Steganalysis; steganography; word embedding; Skip-gram language model; TF-IDF

资金

  1. National Natural Science Foundation of China [61202439, 61302159, 61602059, 61872448]
  2. Scientific Research Foundation of Hunan Provincial Education Department of China [16A008]
  3. Hunan Key Laboratory of Smart Roadway and Cooperative Vehicle-Infrastructure Systems [2017TP1016]

向作者/读者索取更多资源

The development of steganography technology threatens the security of privacy information in smart campus. To prevent privacy disclosure, a linguistic steganalysis method based on word embedding is proposed to detect the privacy information hidden in synonyms in the texts. With the continuous Skip-gram language model, each synonym and words in its context are represented as word embeddings, which aims to encode semantic meanings of words into low-dimensional dense vectors. The context fitness, which characterizes the suitability of a synonym by its semantic correlations with context words, is effectively estimated by their corresponding word embeddings and weighted by TF-IDF values of context words. By analyzing the differences of context fitness values of synonyms in the same synonym set and the differences of those in the cover and stego text, three features are extracted and fed into a support vector machine classifier for steganalysis task. The experimental results show that the proposed steganalysis improves the average F-value at least 4.8% over two baselines. In addition, the detection performance can be further improved by learning better word embeddings.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据