基于深度强化学习的远程状态估计最优混合传输策略

Optimal Hybrid Transmission Strategy for Remote State Estimation With Deep Reinforcement Learning

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2025

被引 0

ABS 3

Hongxia Rao
Zitian Li
Yong Xu
Tingwen Huang
Lixin Yang
Leszek Rutkowski

中文导读

针对远程状态估计中高频链路带宽高但可靠性低、低频链路可靠性高但能耗低的特点，提出一种混合传输策略，并用深度强化学习算法求解最优调度方案，实现高可靠低能耗传输。

Abstract

This work delves into an optimal hybrid transmission strategy for remote state estimation (RSE), where some smart sensors observe various systems and transmit their local state estimates to a remote estimator. Inspired by the high bandwidth of a high-frequency (HF) link alongside the robust reliability and low energy consumption of a low-frequency (LF) link, a novel hybrid transmission strategy is proposed. This strategy allows smart sensors to utilize either the HF link or the LF link dynamically, each with distinct channel characteristics and energy consumption. To achieve highly reliable and energy-efficient transmission, it is imperative to devise an optimal transmission scheduling strategy that dictates the selection of the link and channel allocation. Formulating this challenge as a Markov decision process (MDP), a sufficient condition ensuring the existence of an optimal deterministic and stationary (ODS) policy is established. The structural characteristics of the optimal policy are derived. Furthermore, a deep reinforcement learning (DRL) approach, specifically the dueling double deep <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Q</i>-network (D3QN) algorithm, is employed to approximate the optimal policy. Finally, the structural results and the effectiveness of the DRL algorithm are verified by simulation examples.

远程状态估计深度强化学习传输调度马尔可夫决策过程

阅读原文 ↗