A Bayesian analysis strategy for cross-study translation of gene expression biomarkers

  • Lucas J
  • Carvalho C
  • West M
  • 15


    Mendeley users who have this article in their library.
  • 20


    Citations of this article.


We describe a strategy for the analysis of experimentally derived gene expression signatures and their translation to human observational data. Sparse multivariate regression models are used to identify expression signature gene sets representing downstream biological pathway events following interventions in designed experiments. When translated into in vivo human observational data, analysis using sparse latent factor models can yield multiple quantitative factors characterizing expression patterns that are often more complex than in the controlled, in vitro setting. The estimation of common patterns in expression that reflect all aspects of covariation evident in vivo offers an enhanced, modular view of the complexity of biological associations of signature genes. This can identify substructure in the biological process under experimental investigation and improved biomarkers of clinical outcomes. We illustrate the approach in a detailed study from an oncogene intervention experiment where in vivo factor profiling of an in vitro signature generates biological insights related to underlying pathway activities and chromosomal structure, and leads to refinements of cancer recurrence risk stratification across several cancer studies.

Author-supplied keywords

  • Breast cancer
  • Factor
  • Gene expression
  • Latent factor models
  • Module
  • Pathway
  • Signature
  • Sparse regression

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document


  • Joseph Lucas

  • Carlos Carvalho

  • Mike West

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free