Improving Password Guessing Using Byte Pair Encoding

Xingxing Wang; Dakui Wang; Xiaojun Chen; Rui Xu; Jinqiao Shi; Li Guo

Conference Proceedings

Improving Password Guessing Using Byte Pair Encoding

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10599 LNCS 254-268

DOI: 10.1007/978-3-319-69659-1_14

0Citations

12Readers

Get full text

Abstract

Recent many password guessing algorithms based on the Probabilistic Context-Free Grammars (PCFGs) model brought significant improvements in password cracking. These algorithms analyzed common semantic patterns (letter semantic patterns, date patterns, keyboard patterns etc.) from passwords and modeled the construction process of passwords by using PCFGs. However, there still left a large fraction of integral segments in passwords which seem no semantics. Can those segments be deeply analyzed and help to make further improvements on password cracking? Motivated by this challenge, this paper employs Byte Pair Encoding (BPE) algorithm for password segmentation, extracting those non-semantical patterns which are frequently used in passwords subconsciously by people. Based on the segmentation, we propose a BPE-PCFGs model to generate password guesses. Furthermore, we also utilize the existing common semantic patterns and BPE patterns to construct a new Rich-BPE-PCFGs password generator. Experimental results on large-scale password sets show that our Rich-BPE-PCFGs model can obtain a 2.36%–37.56% improvement over the original PCFGs model, which is a good complement to existing password guessing algorithms.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, X., Wang, D., Chen, X., Xu, R., Shi, J., & Guo, L. (2017). Improving Password Guessing Using Byte Pair Encoding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10599 LNCS, pp. 254–268). Springer Verlag. https://doi.org/10.1007/978-3-319-69659-1_14

Improving Password Guessing Using Byte Pair Encoding

Abstract

Author supplied keywords

Cite

Register to see more suggestions