Searching in microbial genomes for encoded small proteins

  • Boekhorst J
  • Wilson G
  • Siezen R
  • 25


    Mendeley users who have this article in their library.
  • 9


    Citations of this article.


The Gaussian process latent variable model (GP-LVM) is a generative approach to non-linear low dimensional embedding, that pro-vides a smooth probabilistic mapping from latent to data space. It is also a non-linear generalization of probabilistic PCA (PPCA) (Tipping & Bishop, 1999). While most ap-proaches to non-linear dimensionality meth-ods focus on preserving local distances in data space, the GP-LVM focusses on exactly the opposite. Being a smooth mapping from latent to data space, it focusses on keep-ing things apart in latent space that are far apart in data space. In this paper we first provide an overview of dimensionality reduc-tion techniques, placing the emphasis on the kind of distance relation preserved. We then show how the GP-LVM can be generalized, through back constraints, to additionally pre-serve local distances. We give illustrative ex-periments on common data sets.

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document


  • Jos Boekhorst

  • Greer Wilson

  • Roland J. Siezen

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free