存在实时高频数据时通过随机子抽样进行复发事件分析

Recurrent Event Analysis in the Presence of Real-Time High Frequency Data via Random Subsampling

Journal of Computational and Graphical Statistics · 2023
被引 1
ABS 3

中文导读

针对数字监测研究中实时高频数据导致复发事件分析计算量过大的问题,提出随机子抽样框架,实现高效近似似然估计,并允许使用标准软件拟合模型。

Abstract

Digital monitoring studies collect real-time high frequency data via mobile sensors in the subjects' natural environment. This data can be used to model the impact of changes in physiology on recurrent event outcomes such as smoking, drug use, alcohol use, or self-identified moments of suicide ideation. Likelihood calculations for the recurrent event analysis, however, become computationally prohibitive in this setting. Motivated by this, a random subsampling framework is proposed for computationally efficient, approximate likelihood-based estimation. A subsampling-unbiased estimator for the derivative of the cumulative hazard enters into an approximation of log-likelihood. The estimator has two sources of variation: the first due to the recurrent event model and the second due to subsampling. The latter can be reduced by increasing the sampling rate; however, this leads to increased computational costs. The approximate score equations are equivalent to logistic regression score equations, allowing for standard, "off-the-shelf" software to be used in fitting these models. Simulations demonstrate the method and efficiency-computation trade-off. We end by illustrating our approach using data from a digital monitoring study of suicidal ideation.

数字监测复发事件分析高频数据随机子抽样