最大熵先验不确定性与统计经济数据的相关性

Maximum-Entropy Prior Uncertainty and Correlation of Statistical Economic Data

Journal of Business & Economic Statistics · 2015
被引 17
人大 AABS 4

中文导读

利用贝叶斯推断和最大熵原理,在缺失不确定性信息时估计经济源数据的先验分布、不确定性和相关性,并以美国县商业模式数据库为例展示相对不确定性范围1%-80%及20%的数据对相关性绝对值超过0.9。

Abstract

Empirical estimates of source statistical economic data such as trade flows, greenhouse gas emissions, or employment figures are always subject to uncertainty (stemming from measurement errors or confidentiality) but information concerning that uncertainty is often missing. This article uses concepts from Bayesian inference and the maximum entropy principle to estimate the prior probability distribution, uncertainty, and correlations of source data when such information is not explicitly provided. In the absence of additional information, an isolated datum is described by a truncated Gaussian distribution, and if an uncertainty estimate is missing, its prior equals the best guess. When the sum of a set of disaggregate data is constrained to match an aggregate datum, it is possible to determine the prior correlations among disaggregate data. If aggregate uncertainty is missing, all prior correlations are positive. If aggregate uncertainty is available, prior correlations can be either all positive, all negative, or a mix of both. An empirical example is presented, which reports relative uncertainties and correlation priors for the County Business Patterns database. In this example, relative uncertainties range from 1% to 80% and 20% of data pairs exhibit correlations below −0.9 or above 0.9. Supplementary materials for this article are available online.

最大熵先验数据不确定性相关性估计贝叶斯推断