Reinforcement Learning for Jump‐Diffusions, With Financial Applications

Mathematical Finance · 2026

Cited by 0

Renmin University B · ABS 3

Xuefeng Gao · The Chinese University of Hong Kong
Lingfei Li · The Chinese University of Hong Kong
Xun Yu Zhou · Columbia University

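Summary

The paper studies continuous-time reinforcement learning under jump-diffusion processes. It finds that existing algorithms can be applied directly, without first determining whether the data come from a pure diffusion or a jump-diffusion, and verifies their effectiveness in mean-variance portfolio selection and option hedging.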

Abstract

We study continuous‐time reinforcement learning (RL) for stochastic control in which system dynamics are governed by jump‐diffusion processes. We formulate an entropy‐regularized exploratory control problem with stochastic policies to capture the exploration–exploitation balance essential for RL. Unlike the pure diffusion case initially studied by Wang et al., the derivation of the exploratory dynamics under jump‐diffusions calls for a careful formulation of the jump part. Through a theoretical analysis, we find that one can simply use the same policy evaluation and q‐learning algorithms in Jia and Zhou, originally developed for controlled diffusions, without needing to check a priori whether the underlying data come from a pure diffusion or a jump‐diffusion. We investigate as an application the mean–variance portfolio selection problem with stock price modelled as a jump‐diffusion, and show that both RL algorithms and parameterizations are invariant with respect to jumps. Finally, we present a detailed study on applying the general theory to option hedging.
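
As a rough illustration of the kind of data the abstract refers to, the sketch below simulates a single price path from a generic Merton-style jump-diffusion (geometric Brownian motion plus normally distributed log-jumps arriving at a Poisson rate). This is an assumption made for illustration only, not the paper's model specification: the function name `simulate_jump_diffusion` and all of its parameters are hypothetical, and the jump arrival uses a simple at-most-one-jump-per-step approximation that is reasonable only for small time steps.

```python
import math
import random


def simulate_jump_diffusion(s0, mu, sigma, jump_rate, jump_mean, jump_std,
                            horizon, n_steps, seed=0):
    """Simulate one price path of a Merton-style jump-diffusion (illustrative).

    All names and parameters are hypothetical; they are not taken from the paper.
    """
    rng = random.Random(seed)
    dt = horizon / n_steps
    log_s = math.log(s0)
    path = [s0]
    for _ in range(n_steps):
        # Diffusion part: exact log-increment of geometric Brownian motion.
        log_s += (mu - 0.5 * sigma ** 2) * dt
        log_s += sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        # Jump part: at most one jump per step, with probability jump_rate * dt
        # (a small-dt approximation of Poisson arrivals).
        if rng.random() < jump_rate * dt:
            log_s += rng.gauss(jump_mean, jump_std)
        path.append(math.exp(log_s))
    return path


# Setting jump_rate=0.0 gives a pure diffusion path from the same function.
jump_path = simulate_jump_diffusion(100.0, 0.05, 0.2, 1.0, -0.1, 0.15, 1.0, 252)
diffusion_path = simulate_jump_diffusion(100.0, 0.05, 0.2, 0.0, 0.0, 0.0, 1.0, 252)
```

A learner that only sees `jump_path` or `diffusion_path` as a sequence of observed prices has no label telling it which regime generated the data; the abstract's claim is that the policy evaluation and q-learning algorithms it cites do not need that label.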

Reinforcement learning · Stochastic control · Financial engineering · Portfolio selection · Option hedging
