🌙

面向长时程灵巧机器人细胞微操作的技能信息表征模仿学习

Skill Information Representation Imitation Learning for Long-Horizon Dexterous Robot Micromanipulation of Deformable Cell

IEEE Transactions on Cybernetics · 2026
被引 0
ABS 3

中文导读

提出SIRIL算法,通过提取专家视频帧的离散潜在编码并建模其分布,量化技能信息以生成安全动作约束,在斑马鱼胚胎细胞膜剥离等任务中平均准确率86.7%,最终成功率64.7%。

Abstract

Robots performing collaborative long-horizon dexterity cell micromanipulation tasks are challenging and practically significant, such as peeling cell membranes, which is considered one of the most technically demanding procedures. The imitation learning (IL) approach is expected to address the challenges of multitask coupling and object modeling difficulties in long-horizon tasks. Existing IL algorithms suffer from compounding error as they perform only a simple mapping of the task environment space to the action space. In this article, we propose a skill information representation IL (SIRIL) algorithm for long-horizon dexterous robot micromanipulation tasks. First, SIRIL extracts the discrete latent codes of the expert video frames by the VQ-GAN encoder, and the distribution of the latent codes is modeled by an autoregressive transformer. SIRIL quantifies the representation of the expert's skill information by computing the log-likelihood of the latent discrete codes, which allows for the extraction of safe action constraints. Finally, SIRIL predicts actions by integrating actions from previous time steps, and actions that satisfy the safety constraints are executed, which effectively suppresses compounding error. Real experiments show that the SIRIL algorithm can complete the deformable zebrafish embryonic cells dexterous membrane stripping surgery. Ablation studies further confirmed SIRIL's high efficiency in various subtasks, including PushCell, GraspCell, and PeelCell, which achieves an average accuracy of 86.7% and a high final success rate of 64.7%, significantly outperforming existing algorithms. Code is available at https://github.com/zycrobot/SIRIL.

机器人模仿学习细胞微操作技能表征