Several publications have focused on fitting a specific distribution to overall microarray data. Due to a number of biological features the distribution of overall spot intensities can take various shapes. It appears to be impossible to find a specific distribution fitting all experiments even if they are carried out perfectly. Therefore, a probabilistic representation that models a mixture of various effects would be suitable. We use a Gaussian mixture model to represent signal intensity profiles. The advantage of this approach is the derivation of a probabilistic criterion for expressed and non-expressed genes. Furthermore our approach does not involve any prior decision on the number of model parameters. We properly fit microarray data of various shapes by a mixture of Gaussians using the EM algorithm and determine the complexity of the mixture model by the Bayesian Information Criterion (BIG). Finally, we apply our method to simulated data and to biological data. © Springer-Verlag Berlin Heidelberg 2003.
CITATION STYLE
Steinhoff, C., Müller, T., Nuber, U. A., & Vingron, M. (2003). Gaussian mixture density estimation applied to microarray data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2810, 418–429. https://doi.org/10.1007/978-3-540-45231-7_39
Mendeley helps you to discover research relevant for your work.