Consistent model selection of discrete Bayesian networks from incomplete data

Nikolay Balov

Journal Article, Open Access

Electronic Journal of Statistics (2013) 7(1) 1047-1077

DOI: 10.1214/13-EJS802

9 Citations

16 Readers

Abstract

A maximum likelihood based model selection of discrete Bayesian networks is considered. The structure learning is performed by employing a scoring function $S$, which, for a given network $G$ and $n$-sample $D_n$, is defined as the maximum marginal log-likelihood $l$ minus a penalization term $\lambda_n h$ proportional to network complexity $h(G)$: $S(G|D_n) = l(G|D_n) - \lambda_n h(G)$. An available case analysis is developed with the standard log-likelihood replaced by the sum of sample average node log-likelihoods. The approach utilizes partially missing data records and allows for comparison of models fitted to different samples. In missing completely at random settings the estimation is shown to be consistent if and only if the sequence $\lambda_n$ converges to zero at a rate slower than $n^{-1/2}$. In particular, the BIC model selection ($\lambda_n = 0.5 \log(n)/n$) applied to the node-average log-likelihood is shown to be inconsistent in general. This is in contrast to the complete data case, where BIC is known to be consistent. The conclusions are confirmed by numerical experiments.
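The scoring idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes that missing values are encoded as `None`, that each node's log-likelihood is averaged over the records in which the node and all of its parents are observed, and that the complexity h(G) is the number of free parameters of the network. The function names `node_avg_loglik`, `complexity`, `score`, and `bic_penalty` are hypothetical.

```python
import math
from collections import Counter


def node_avg_loglik(data, node, parents):
    """Sample-average maximized log-likelihood of `node` given `parents`.

    Available case analysis (assumed reading): only records in which the
    node and all of its parents are observed (not None) are used.
    """
    rows = [r for r in data
            if r[node] is not None and all(r[p] is not None for p in parents)]
    if not rows:
        return 0.0
    # Counts of (parent configuration, node value) and of parent configurations.
    joint = Counter((tuple(r[p] for p in parents), r[node]) for r in rows)
    marg = Counter(tuple(r[p] for p in parents) for r in rows)
    # Plug in the maximum likelihood estimates count / parent-config count.
    total = sum(c * math.log(c / marg[pa]) for (pa, _), c in joint.items())
    return total / len(rows)


def complexity(graph, arity):
    """Assumed h(G): number of free parameters of the discrete network."""
    h = 0
    for node, parents in graph.items():
        q = 1
        for p in parents:
            q *= arity[p]
        h += (arity[node] - 1) * q
    return h


def score(graph, data, arity, lam):
    """Penalized score: sum of node-average log-likelihoods minus lam * h(G)."""
    loglik = sum(node_avg_loglik(data, v, pa) for v, pa in graph.items())
    return loglik - lam * complexity(graph, arity)


def bic_penalty(n):
    """BIC penalty rate from the abstract: 0.5 * log(n) / n."""
    return 0.5 * math.log(n) / n


# Toy data: two binary variables with some values missing.
data = [
    {"X": 0, "Y": 0},
    {"X": 1, "Y": 1},
    {"X": 0, "Y": None},
    {"X": None, "Y": 1},
]
arity = {"X": 2, "Y": 2}
empty_graph = {"X": [], "Y": []}   # no edges
edge_graph = {"X": [], "Y": ["X"]}  # X -> Y

n = len(data)
s_empty = score(empty_graph, data, arity, bic_penalty(n))
s_edge = score(edge_graph, data, arity, bic_penalty(n))
```

Note that the two graphs are scored on different subsets of records: the node Y uses three records in `empty_graph` but only the two fully observed records in `edge_graph`. Reading "slower than n^{-1/2}" as sqrt(n) * lambda_n growing without bound, the BIC rate does not qualify, because sqrt(n) * `bic_penalty(n)` equals 0.5 * log(n) / sqrt(n), which tends to zero.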

Cite

Balov, N. (2013). Consistent model selection of discrete Bayesian networks from incomplete data. Electronic Journal of Statistics, 7(1), 1047–1077. https://doi.org/10.1214/13-EJS802
