面向高维协变量空间观测的双重正则化广义线性模型

Doubly regularized generalized linear models for spatial observations with high-dimensional covariates

Journal of the Royal Statistical Society. Series C: Applied Statistics · 2025
被引 0
ABS 3

中文导读

针对空间格点数据,提出一种双重正则化回归框架,同时利用空间网络和特征相似性网络,提升高维协变量下的预测精度和变量选择能力,并给出渐近有效的置信区间和假设检验。

Abstract

Abstract A discrete spatial lattice can be cast as a network structure over which spatially correlated outcomes are observed. A second network structure may also capture similarities among measured features, when such information is available. Incorporating the network structures when analysing such doubly structured data can improve predictive power, and lead to better identification of important features in the data-generating process. Motivated by applications in spatial disease mapping, we develop a new doubly regularized regression framework to incorporate these network structures for analysing high-dimensional datasets. Our estimators can be easily implemented with standard convex optimization algorithms. In addition, we describe a procedure to obtain asymptotically valid confidence intervals and hypothesis tests for our model parameters. We show empirically that our framework provides improved predictive accuracy and inferential power compared with existing high-dimensional spatial methods. These advantages hold given fully accurate network information, and also with networks which are partially misspecified or uninformative. The application of the proposed method to modelling COVID-19 mortality data suggests that it can improve the prediction of deaths beyond standard spatial models, and that it selects relevant covariates more often.

空间计量经济学高维统计广义线性模型网络结构疾病制图