Powerful generative models, particularly in natural language modelling, are commonly trained by maximizing a variational lower bound on the data log likelihood. These models often suffer from poor use of their latent variable, with ad-hoc annealing factors used to encourage retention of information in the latent variable. We discuss an alternative and general approach to latent variable modelling, based on an objective that encourages a perfect reconstruction by tying a stochastic autoencoder with a variational autoencoder (VAE). This ensures by design that the latent variable captures information about the observations, whilst retaining the ability to generate well. Interestingly, although our model is fundamentally different to a VAE, the lower bound attained is identical to the standard VAE bound but with the addition of a simple pre-factor; thus, providing a formal interpretation of the commonly used, ad-hoc pre-factors in training VAEs.
CITATION STYLE
Mansbridge, A., Fierimonte, R., Feige, I., & Barber, D. (2019). Improving latent variable descriptiveness by modelling rather than ad-hoc factors. Machine Learning, 108(8–9), 1601–1611. https://doi.org/10.1007/s10994-019-05830-1
Mendeley helps you to discover research relevant for your work.