A Generation-based Text Steganography by Maintaining Consistency of Probability Distribution

Abstract

Text steganography based on natural language generation has become increasingly popular. Existing methods usually embed secret information in each generated word by controlling the sampling step of text generation. The candidate pool is typically built with a greedy strategy, so only high-probability words are encoded; this distorts the statistical properties of the text and seriously weakens the security of the steganography. To reduce the influence of the candidate pool on statistical imperceptibility, we propose a steganography method based on a new sampling strategy. Instead of filling the candidate pool only with high-probability words, we select words that differ only slightly in probability from the word actually sampled from the language model, which keeps the pool consistent with the language model's probability distribution. In addition, we encode the candidate words according to their probability similarity to the target word, which further preserves the probability distribution. Experimental results show that the proposed method outperforms state-of-the-art steganographic methods in terms of security.
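The abstract does not spell out the algorithm, so the following is only a minimal sketch of the idea it describes, under stated assumptions. It assumes a toy next-word distribution (`TOY_PROBS`) in place of a real language model, a fixed number of bits per word, and a seeded random number generator shared by sender and receiver so that both can reproduce the same sampled target word. The function names `sample_token`, `build_pool`, `embed`, and `extract` are hypothetical and are not taken from the paper.

```python
import random

# Toy next-word distribution standing in for a language model (assumption).
TOY_PROBS = {
    "the": 0.30, "a": 0.22, "this": 0.15, "one": 0.12,
    "my": 0.09, "our": 0.07, "some": 0.05,
}


def sample_token(probs, rng):
    """Draw one word from the model distribution (the 'actual sample')."""
    tokens = sorted(probs)  # fixed order so both parties sample identically
    weights = [probs[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]


def build_pool(probs, target, n_bits):
    """Pick the 2**n_bits words whose probability is closest to the target's.

    The result is ordered by similarity, so a word's position in the pool
    doubles as its code (position 0 is the target itself, barring exact ties).
    """
    target_p = probs[target]
    ranked = sorted(probs, key=lambda tok: (abs(probs[tok] - target_p), tok))
    return ranked[: 2 ** n_bits]


def embed(probs, bits, rng):
    """Hide the bit string `bits` in the choice of one word."""
    target = sample_token(probs, rng)
    pool = build_pool(probs, target, len(bits))
    return pool[int(bits, 2)]


def extract(probs, token, n_bits, rng):
    """Recover the hidden bits by rebuilding the same pool."""
    target = sample_token(probs, rng)
    pool = build_pool(probs, target, n_bits)
    return format(pool.index(token), f"0{n_bits}b")


# Sender and receiver use the same seed (assumed shared secret).
stego_word = embed(TOY_PROBS, "10", random.Random(7))
recovered = extract(TOY_PROBS, stego_word, 2, random.Random(7))
```

Because the pool is centred on a word drawn from the full distribution rather than on the top-ranked words, low-probability words can still appear in the output, which is the property the abstract credits for better statistical imperceptibility. How the paper itself synchronizes the target word between sender and receiver is not stated here; the shared seed is purely an assumption of this sketch.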

Cite

Yang, B., Peng, W., Xue, Y., & Zhong, P. (2021). A Generation-based Text Steganography by Maintaining Consistency of Probability Distribution. KSII Transactions on Internet and Information Systems, 15(11), 4184–4202. https://doi.org/10.3837/TIIS.2021.11.017
