🌙

一种用于稀疏贝叶斯推断的快速异步马尔可夫链蒙特卡罗采样器

A fast asynchronous Markov chain Monte Carlo sampler for sparse Bayesian inference

Journal of the Royal Statistical Society. Series B: Statistical Methodology · 2023
被引 0
ABS 4

中文导读

提出一种快速近似MCMC采样框架,适用于稀疏贝叶斯推断问题,每次迭代计算成本为O(n(s+J)),在高维线性回归中能高概率恢复主信号,混合时间最多与回归变量数成线性关系。

Abstract

Abstract We propose a very fast approximate Markov chain Monte Carlo sampling framework that is applicable to a large class of sparse Bayesian inference problems. The computational cost per iteration in several regression models is of order O(n(s+J)), where n is the sample size, s is the underlying sparsity of the model, and J is the size of a randomly selected subset of regressors. This cost can be further reduced by data sub-sampling when stochastic gradient Langevin dynamics are employed. The algorithm is an extension of the asynchronous Gibbs sampler of Johnson et al. [(2013). Analyzing Hogwild parallel Gaussian Gibbs sampling. In Proceedings of the 26th International Conference on Neural Information Processing Systems (NIPS’13) (Vol. 2, pp. 2715–2723)], but can be viewed from a statistical perspective as a form of Bayesian iterated sure independent screening [Fan, J., Samworth, R., & Wu, Y. (2009). Ultrahigh dimensional feature selection: Beyond the linear model. Journal of Machine Learning Research, 10, 2013–2038]. We show that in high-dimensional linear regression problems, the Markov chain generated by the proposed algorithm admits an invariant distribution that recovers correctly the main signal with high probability under some statistical assumptions. Furthermore, we show that its mixing time is at most linear in the number of regressors. We illustrate the algorithm with several models.

贝叶斯推断稀疏模型马尔可夫链蒙特卡罗高维回归