Data generated from microarray experiments often suffer from missing values. As most downstream analyses need full matrices as input, these missing values have to be estimated. Bayesian principal component analysis (BPCA) is a well-known microarray missing value estimation method, but its performance is not satisfactory on datasets with strong local similarity structure. A bicluster-based BPCA (bi-BPCA) method is proposed in this paper to fully exploit local structure of the matrix. In a bicluster, the most correlated genes and experimental conditions with the missing entry are identified, and BPCA is conducted on these biclusters to estimate the missing values. An automatic parameter learning scheme is also developed to obtain optimal parameters. Experimental results on four real microarray matrices indicate that bi-BPCA obtains the lowest normalized root-mean-square error on 82.14% of all missing rates. © 2013 IEEE.
CITATION STYLE
Meng, F., Cai, C., & Yan, H. (2014). A bicluster-based bayesian principal component analysis method for microarray missing value estimation. IEEE Journal of Biomedical and Health Informatics, 18(3), 863–871. https://doi.org/10.1109/JBHI.2013.2284795
Mendeley helps you to discover research relevant for your work.