大规模数据集下常见微观计量模型的贝叶斯推断:双边缘化子抽样方法

Bayesian Inference in Common Microeconometric Models With Massive Datasets by Double Marginalized Subsampling

Journal of Business & Economic Statistics · 2021
被引 0
人大 AABS 4

中文导读

针对大规模数据下贝叶斯推断计算量大的问题,提出双边缘化子抽样方法,适用于Tobit、Probit、异方差、随机波动等多种微观计量模型,能精确逼近后验分布并提升计算效率。

Abstract

Bayesian inference with a large dataset is computationally intensive, as Markov chain Monte Carlo simulation requires a complete scan of the dataset for each proposed parameter update. To reduce the number of data points evaluated at each iteration of posterior simulation, we develop a double marginalized subsampling method, which is applicable to a wide array of microeconometric models including Tobit, Probit, regressions with non-Gaussian errors, heteroscedasticity and stochastic volatility, hierarchical longitudinal models, time-varying-parameter regressions, Gaussian mixtures, etc. We also provide an extension to double pseudo-marginalized subsampling, which has more applications beyond conditionally conjugate models. With rank-one update of the cumulative statistics, both methods target the exact posterior distribution, from which a parameter draw can be obtained with every single observation. Simulation studies demonstrate the statistical and computational efficiency of the marginalized sampler. The methods are also applied to a real-world massive dataset on the incidentally truncated mortgage rates.

贝叶斯推断微观计量模型大规模数据集双重边缘化子抽样