Learning tree augmented naive Bayes for ranking

34Citations
Citations of this article
38Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Naive Bayes has been widely used in data mining as a simple and effective classification algorithm. Since its conditional independence assumption is rarely true, numerous algorithms have been proposed to improve naive Bayes, among which tree augmented naive Bayes (TAN) achieves a significant improvement in term of classification accuracy, while maintaining efficiency and model simplicity. In many real-world data mining applications, however, an accurate ranking is more desirable than a classification. Thus it is interesting whether TAN also achieves significant improvement in term of ranking, measured by AUC(the area under the Receiver Operating Characteristics curve). Unfortunately, our experiments show that TAN performs even worse than naive Bayes in ranking. Responding to this fact, we present a novel learning algorithm, called forest augmented naive Bayes (FAN), by modifying the traditional TAN learning algorithm. We experimentally test our algorithm on all the 36 data sets recommended by Weka, and compare it to naive Bayes, SBC, TAN, and C4.4, in terms of AUC. The experimental results show that our algorithm outperforms all the other algorithms significantly in yielding accurate rankings. Our work provides an effective and efficient data mining algorithm for applications in which an accurate ranking is required. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Jiang, L., Zhang, H., Cai, Z., & Su, J. (2005). Learning tree augmented naive Bayes for ranking. In Lecture Notes in Computer Science (Vol. 3453, pp. 688–698). Springer Verlag. https://doi.org/10.1007/11408079_63

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free