Many statistical methods have been proposed to estimate causal models in classical situations with fewer variables than observations. However, modern datasets including gene expression data increase the needs of high-dimensional causal modeling in challenging situations with orders of magnitude more variables than observations. In this paper, we propose a method to find exogenous variables in a linear non-Gaussian causal model, which requires much smaller sample sizes than conventional methods and works even when orders of magnitude more variables than observations. Exogenous variables work as triggers that activate causal chains in the model, and their identification leads to more efficient experimental designs and better understanding of the causal mechanism. We present experiments with artificial data and real-world gene expression data to evaluate the method. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Sogawa, Y., Shimizu, S., Hyvärinen, A., Washio, T., Shimamura, T., & Imoto, S. (2010). Discovery of exogenous variables in data with more variables than observations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6352 LNCS, pp. 67–76). https://doi.org/10.1007/978-3-642-15819-3_10
Mendeley helps you to discover research relevant for your work.