Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model
Focusing on the issues of low efficiency and effectiveness in decision-making as well as the instability of traditional reinforcement learning model-based multi-function radar (MFR) jamming decision algorithms, a prior knowledge embedded long short-term memory (LSTM) network-proximal policy optimiza...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2024-12-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024270/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832595460706009088 |
---|---|
author | ZHANG Jingke YANG Kai LI Chao WANG Hongyan |
author_facet | ZHANG Jingke YANG Kai LI Chao WANG Hongyan |
author_sort | ZHANG Jingke |
collection | DOAJ |
description | Focusing on the issues of low efficiency and effectiveness in decision-making as well as the instability of traditional reinforcement learning model-based multi-function radar (MFR) jamming decision algorithms, a prior knowledge embedded long short-term memory (LSTM) network-proximal policy optimization (PPO) model based intelligent interference decision algorithm was developed. Firstly, the MFR interference decision problem was regarded as a Markov decision process (MDP). Furthermore, by incorporating prior knowledge associated with the interference domain into the reward function of the PPO model using revenue shaping theory, a reshaped reward function was obtained to guide agent converge quickly so as to improve decision-making efficiency. Besides, leveraging LSTM’s excellent temporal feature extraction ability enables capturing dynamic characteristics of echo data effectively to describe radar working states. Finally, these extracted dynamic features were inputted into the PPO model. With guidance from embedded prior knowledge, an effective interference decision can be achieved rapidly. Simulation results demonstrate that compared to traditional reinforcement learning model based interference decision algorithms, higher efficiency and effectiveness in decision-making can be attained via the proposed algorithms and the MFR interference decision can be efficiently and robustly achieved. |
format | Article |
id | doaj-art-3cb6a1783fdf496ab5c57931ec66f84a |
institution | Kabale University |
issn | 1000-436X |
language | zho |
publishDate | 2024-12-01 |
publisher | Editorial Department of Journal on Communications |
record_format | Article |
series | Tongxin xuebao |
spelling | doaj-art-3cb6a1783fdf496ab5c57931ec66f84a2025-01-18T19:00:08ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2024-12-014522723980268840Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO modelZHANG JingkeYANG KaiLI ChaoWANG HongyanFocusing on the issues of low efficiency and effectiveness in decision-making as well as the instability of traditional reinforcement learning model-based multi-function radar (MFR) jamming decision algorithms, a prior knowledge embedded long short-term memory (LSTM) network-proximal policy optimization (PPO) model based intelligent interference decision algorithm was developed. Firstly, the MFR interference decision problem was regarded as a Markov decision process (MDP). Furthermore, by incorporating prior knowledge associated with the interference domain into the reward function of the PPO model using revenue shaping theory, a reshaped reward function was obtained to guide agent converge quickly so as to improve decision-making efficiency. Besides, leveraging LSTM’s excellent temporal feature extraction ability enables capturing dynamic characteristics of echo data effectively to describe radar working states. Finally, these extracted dynamic features were inputted into the PPO model. With guidance from embedded prior knowledge, an effective interference decision can be achieved rapidly. Simulation results demonstrate that compared to traditional reinforcement learning model based interference decision algorithms, higher efficiency and effectiveness in decision-making can be attained via the proposed algorithms and the MFR interference decision can be efficiently and robustly achieved.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024270/interference decisionMFRPPOLSTM networknetwork prior knowledge |
spellingShingle | ZHANG Jingke YANG Kai LI Chao WANG Hongyan Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model Tongxin xuebao interference decision MFR PPO LSTM network network prior knowledge |
title | Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model |
title_full | Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model |
title_fullStr | Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model |
title_full_unstemmed | Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model |
title_short | Intelligent interference decision algorithm with prior knowledge embedded LSTM-PPO model |
title_sort | intelligent interference decision algorithm with prior knowledge embedded lstm ppo model |
topic | interference decision MFR PPO LSTM network network prior knowledge |
url | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024270/ |
work_keys_str_mv | AT zhangjingke intelligentinterferencedecisionalgorithmwithpriorknowledgeembeddedlstmppomodel AT yangkai intelligentinterferencedecisionalgorithmwithpriorknowledgeembeddedlstmppomodel AT lichao intelligentinterferencedecisionalgorithmwithpriorknowledgeembeddedlstmppomodel AT wanghongyan intelligentinterferencedecisionalgorithmwithpriorknowledgeembeddedlstmppomodel |