Identifiability and estimation of probabilities from multiple databases with incomplete data and sampling selection

Jinzhu Jia; Zhi Geng; Mingfeng Wang

Conference ProceedingsOPEN ACCESS

Identifiability and estimation of probabilities from multiple databases with incomplete data and sampling selection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4109 LNCS 792-798

DOI: 10.1007/11815921_87

1Citations

3Readers

Abstract

For an application problem, there may be multiple databases, and each database may not contain complete variables or attributes, that is, some variables are observed but some others are missing. Further, data of a database may be collected conditionally on some designed variables. In this paper, we discuss problems related to data mining from such multiple databases. We propose an approach for detecting identifiability of a joint distribution from multiple databases. For an identifiable joint distribution, we further present the expectation-maximization (EM) algorithm for calculating the maximum likelihood estimates (MLEs) of the joint distribution. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Jia, J., Geng, Z., & Wang, M. (2006). Identifiability and estimation of probabilities from multiple databases with incomplete data and sampling selection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4109 LNCS, pp. 792–798). Springer Verlag. https://doi.org/10.1007/11815921_87

Identifiability and estimation of probabilities from multiple databases with incomplete data and sampling selection

Abstract

Cite

Register to see more suggestions