Big data Bayesian linear regression and variable selection by normal-inverse-gamma summation

6Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We introduce the normal-inverse-gamma summation operator, which combines Bayesian regression results from different data sources and leads to a simple split-and-merge algorithm for big data regressions. The summation operator is also useful for computing the marginal likelihood and facilitates Bayesian model selection methods, including Bayesian LASSO, stochastic search variable selection, Markov chain Monte Carlo model composition, etc. Observations are scanned in one pass and then the sampler iteratively combines normal-inversegamma distributions without reloading the data. Simulation studies demonstrate that our algorithms can efficiently handle highly correlated big data. A real-world data set on employment and wage is also analyzed.

Cite

CITATION STYLE

APA

Qian, H. (2018). Big data Bayesian linear regression and variable selection by normal-inverse-gamma summation. Bayesian Analysis, 13(4), 1007–1031. https://doi.org/10.1214/17-BA1083

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free