Big data Bayesian linear regression and variable selection by normal-inverse-gamma summation

Hang Qian

Journal Article, Open Access

Bayesian Analysis (2018) 13(4) 1007-1031

DOI: 10.1214/17-BA1083

Abstract

We introduce the normal-inverse-gamma summation operator, which combines Bayesian regression results from different data sources and leads to a simple split-and-merge algorithm for big data regressions. The summation operator is also useful for computing the marginal likelihood and facilitates Bayesian model selection methods such as Bayesian LASSO, stochastic search variable selection, and Markov chain Monte Carlo model composition. Observations are scanned in one pass, and the sampler then iteratively combines normal-inverse-gamma distributions without reloading the data. Simulation studies demonstrate that our algorithms can efficiently handle highly correlated big data. A real-world data set on employment and wages is also analyzed.
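
The split-and-merge idea can be illustrated with the textbook conjugate update for a normal-inverse-gamma prior. The sketch below is not the paper's summation operator, whose definition is not given in this abstract. It assumes the standard parameterization beta | sigma^2 ~ N(mu, sigma^2 Lambda^{-1}), sigma^2 ~ IG(a, b), and relies on the fact that the posterior depends on the data only through X'X, X'y, y'y, and n, all of which add across chunks. The function names (`chunk_stats`, `merge_stats`, `nig_posterior`), the simulated data, and the prior values are hypothetical, chosen for illustration only.

```python
import numpy as np

def chunk_stats(X, y):
    """Sufficient statistics of one data chunk, computed in a single pass."""
    return X.T @ X, X.T @ y, float(y @ y), X.shape[0]

def merge_stats(s1, s2):
    """Merge two chunks by adding their sufficient statistics elementwise."""
    return tuple(a + b for a, b in zip(s1, s2))

def nig_posterior(stats, mu0, Lambda0, a0, b0):
    """Standard conjugate normal-inverse-gamma update (assumed parameterization)."""
    XtX, Xty, yty, n = stats
    Lambda_n = Lambda0 + XtX
    mu_n = np.linalg.solve(Lambda_n, Lambda0 @ mu0 + Xty)
    a_n = a0 + n / 2.0
    b_n = b0 + 0.5 * (yty + mu0 @ Lambda0 @ mu0 - mu_n @ Lambda_n @ mu_n)
    return mu_n, Lambda_n, a_n, b_n

# Simulated data and prior values, purely illustrative.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(size=1000)
mu0, Lambda0, a0, b0 = np.zeros(3), np.eye(3), 2.0, 1.0

# Split: scan each chunk once. Merge: add the statistics, never reload the data.
merged = None
for Xc, yc in zip(np.array_split(X, 4), np.array_split(y, 4)):
    s = chunk_stats(Xc, yc)
    merged = s if merged is None else merge_stats(merged, s)

post_split = nig_posterior(merged, mu0, Lambda0, a0, b0)
post_full = nig_posterior(chunk_stats(X, y), mu0, Lambda0, a0, b0)
```

Under these assumptions, `post_split` and `post_full` agree up to floating-point error, so the chunks could be processed on separate machines or streamed from disk, with only small k-by-k summaries retained.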

Citation (APA)

Qian, H. (2018). Big data Bayesian linear regression and variable selection by normal-inverse-gamma summation. Bayesian Analysis, 13(4), 1007–1031. https://doi.org/10.1214/17-BA1083
