Abstract
Text steganography combined with natural language generation has become increasingly popular. Existing methods usually embed secret information in the generated words by controlling the sampling step of text generation. They construct a candidate pool with a greedy strategy and encode only the words with high probability, which distorts the statistical distribution of the text and seriously weakens the security of the steganography. To reduce the influence of the candidate pool on statistical imperceptibility, we propose a steganography method based on a new sampling strategy. Instead of building the candidate pool only from high-probability words, we select words whose probability differs relatively little from that of the word actually sampled from the language model, thus keeping the pool consistent with the model's probability distribution. In addition, we encode the candidate words according to how similar their probability is to that of the target word, which further preserves the probability distribution. Experimental results show that the proposed method outperforms state-of-the-art steganographic methods in terms of security.
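The pool construction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names `build_pool` and `embed`, the absolute probability difference as the similarity measure, the pool size of 2^bits, and the tie-breaking rule are all assumptions made for illustration, and the abstract does not specify how the receiver recovers the bits, so extraction is left out.

```python
import random

def build_pool(probs, target, bits):
    """Return the 2**bits words whose probability is closest to the target's.

    Assumption: similarity is the absolute probability difference, and ties
    are broken alphabetically so the ordering is deterministic.
    """
    size = 2 ** bits
    p_target = probs[target]
    ranked = sorted(probs, key=lambda w: (abs(probs[w] - p_target), w))
    return ranked[:size]

def embed(probs, secret_bits, bits, rng):
    """Pick the next word so that it carries the first `bits` secret bits.

    Hypothetical embedding step: sample a target word from the model's
    distribution, build a pool around it, and use the secret bits as an
    index into the similarity-ordered pool.
    """
    words = list(probs)
    weights = [probs[w] for w in words]
    # The "actual sample" of the language model.
    target = rng.choices(words, weights=weights, k=1)[0]
    pool = build_pool(probs, target, bits)
    index = int(secret_bits[:bits], 2)
    return pool[index], target

if __name__ == "__main__":
    # Toy next-word distribution standing in for a language model's output.
    probs = {"a": 0.4, "b": 0.3, "c": 0.15, "d": 0.1, "e": 0.05}
    word, target = embed(probs, "10", bits=1, rng=random.Random(0))
    print("sampled target:", target, "| emitted word:", word)
```

Because the pool is centred on a word drawn from the full distribution, low-probability words can still be emitted, whereas a top-k pool would exclude them at every step.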
Cite
Yang, B., Peng, W., Xue, Y., & Zhong, P. (2021). A Generation-based Text Steganography by Maintaining Consistency of Probability Distribution. KSII Transactions on Internet and Information Systems, 15(11), 4184–4202. https://doi.org/10.3837/TIIS.2021.11.017