Stealthy data poisoning attack method on offline reinforcement learning in unmanned systems

Aiming at the limitations in effectiveness and stealth of existing offline reinforcement learning(RL) data poisoning attacks, a critical time-step dynamic poisoning attack was proposed, perturbing important samples to achieve efficient and covert attacks. Temporal difference errors, identified throu...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHOU Xue, MAN Dapeng, XU Chen, LYU Jiguang, ZENG Fanyi, GAO Chaoyang, YANG Wu
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2024-12-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024264/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Aiming at the limitations in effectiveness and stealth of existing offline reinforcement learning(RL) data poisoning attacks, a critical time-step dynamic poisoning attack was proposed, perturbing important samples to achieve efficient and covert attacks. Temporal difference errors, identified through theoretical analysis as crucial for model learning, were used to guide poisoning target selection. A bi-objective optimization approach was introduced to minimize perturbation magnitude while maximizing the negative impact on performance. Experimental results show that with only a 1% poisoning rate, the method reduces agent performance by 84%, revealing the sensitivity and vulnerability of offline RL models in unmanned systems.
ISSN:1000-436X