Statistical debugging using latent topic models

David Andrzejewski; Anne Mulhern; Ben Liblit; Xiaojin Zhu

Conference ProceedingsOPEN ACCESS

Statistical debugging using latent topic models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4701 LNAI 6-17

DOI: 10.1007/978-3-540-74958-5_5

31Citations

76Readers

Abstract

Statistical debugging uses machine learning to model program failures and help identify root causes of bugs. We approach this task using a novel Delta-Latent-Dirichlet-Allocation model. We model execution traces attributed to failed runs of a program as being generated by two types of latent topics: normal usage topics and bug topics. Execution traces attributed to successful runs of the same program, however, are modeled by usage topics only. Joint modeling of both kinds of traces allows us to identify weak bug topics that would otherwise remain undetected. We perform model inference with collapsed Gibbs sampling. In quantitative evaluations on four real programs, our model produces bug topics highly correlated to the true bugs, as measured by the Rand index. Qualitative evaluation by domain experts suggests that our model outperforms existing statistical methods for bug cause identification, and may help support other software tasks not addressed by earlier models. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Andrzejewski, D., Mulhern, A., Liblit, B., & Zhu, X. (2007). Statistical debugging using latent topic models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4701 LNAI, pp. 6–17). Springer Verlag. https://doi.org/10.1007/978-3-540-74958-5_5

Statistical debugging using latent topic models

Abstract

Cite

Register to see more suggestions