Analysis of Naive Bayes' assumptions on software fault data: An empirical study

  • Turhan B
  • Bener A

Software defect prediction is important for reducing test times by allocating testing resources effectively. In predicting software defects, Naive Bayes outperforms a wide range of other methods. However, Naive Bayes assumes the 'independence' and 'equal importance' of attributes. In this work, we analyze these assumptions of Naive Bayes using public software defect data from NASA. Our analysis shows that the independence assumption is not harmful for software defect data with PCA pre-processing. Our results also indicate that assigning weights to static code attributes may increase the prediction performance significantly, while removing the need for feature subset selection. © 2008 Elsevier B.V. All rights reserved.
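
To make the 'equal importance' assumption concrete, the following is a minimal sketch of a Gaussian Naive Bayes classifier in which each attribute's log-likelihood is scaled by a weight. It is an illustration only and not the authors' implementation: the abstract does not say how weights are derived, so the weights, the toy data, and the function names (`fit_gaussian_nb`, `predict`) are all hypothetical. Setting every weight to 1 gives standard Naive Bayes, and a weight of 0 drops an attribute entirely, which hints at why weighting can stand in for feature subset selection.

```python
import math

def fit_gaussian_nb(X, y):
    """Estimate a prior and per-attribute (mean, variance) for each class."""
    model = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        stats = []
        for col in zip(*rows):
            mean = sum(col) / len(col)
            # Small constant keeps the variance strictly positive.
            var = sum((v - mean) ** 2 for v in col) / len(col) + 1e-9
            stats.append((mean, var))
        model[label] = (len(rows) / len(y), stats)
    return model

def log_gaussian(v, mean, var):
    """Log density of a normal distribution at v."""
    return -0.5 * math.log(2 * math.pi * var) - (v - mean) ** 2 / (2 * var)

def predict(model, x, weights=None):
    """Return the class maximizing log prior + sum_i w_i * log p(x_i | class)."""
    if weights is None:
        weights = [1.0] * len(x)  # equal importance: standard Naive Bayes
    scores = {}
    for label, (prior, stats) in model.items():
        scores[label] = math.log(prior) + sum(
            w * log_gaussian(v, m, s)
            for w, v, (m, s) in zip(weights, x, stats)
        )
    return max(scores, key=scores.get)

# Hypothetical toy data: two made-up static code attributes per module,
# with label 1 for defective and 0 for defect-free.
X = [[10, 1], [12, 2], [11, 1], [50, 9], [55, 8], [52, 10]]
y = [0, 0, 0, 1, 1, 1]
model = fit_gaussian_nb(X, y)
```

Calling `predict(model, [53, 1], weights=[0.0, 1.0])` scores the module on the second attribute alone, while `weights=[1.0, 0.0]` uses only the first; with the default weights both attributes contribute equally.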

Author-supplied keywords

  • Empirical study
  • Naive Bayes
  • Software defect prediction
