IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles
In a multi-beam communication scenario where Infrastructure-to-Vehicle (I2V) and Vehicle-to-Vehicle (V2V) communications coexist, the limited spectrum of resources force V2V users to reuse the orthogonal frequency bands allocated to I2V, inevitably introducing cross-layer interference between I2V an...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-05-01
|
| Series: | Applied Sciences |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/15/11/6069 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849721924259676160 |
|---|---|
| author | Jinxiang Lai Deqing Wang Yifeng Zhao |
| author_facet | Jinxiang Lai Deqing Wang Yifeng Zhao |
| author_sort | Jinxiang Lai |
| collection | DOAJ |
| description | In a multi-beam communication scenario where Infrastructure-to-Vehicle (I2V) and Vehicle-to-Vehicle (V2V) communications coexist, the limited spectrum of resources force V2V users to reuse the orthogonal frequency bands allocated to I2V, inevitably introducing cross-layer interference between I2V and V2V. Furthermore, the adoption of a multi-beam communication architecture exacerbates beam interference, significantly degrading the overall network’s communication and sensing performance. To address these challenges, this paper proposes an integrated sensing and communication (ISAC) beam allocation algorithm, termed IMSBA, which jointly optimizes beam direction, transmission power, and spectrum resource allocation to effectively mitigate the interference between I2V and V2V while maximizing the overall network performance. Specifically, IMSBA employs a joint optimization framework combining Multi-Agent Proximal Policy Optimization (MAPPO) with a Stackelberg game. Within this framework, MAPPO leverages vehicle perception data to dynamically optimize V2V beam steering and frequency selection, while the Stackelberg game reduces computational complexity through hierarchical decision-making and optimizes the joint power allocation among V2V users. Additionally, the proposed scheme incorporates a V2V cooperative sensing domain-sharing mechanism to enhance system robustness under adverse conditions. The experimental results demonstrated that, compared with existing baseline schemes, IMSBA achieved a 92.5% improvement in V2V energy efficiency while significantly enhancing both communication and sensing performance. This study provides an efficient and practical solution for spectrum-constrained scenarios in millimeter-wave Internet-of-Things (IoT), offering substantial theoretical insights and practical value for the efficient operation of intelligent transportation system (ITSs). |
| format | Article |
| id | doaj-art-f23b4af84a5741039e065ce4142f4684 |
| institution | DOAJ |
| issn | 2076-3417 |
| language | English |
| publishDate | 2025-05-01 |
| publisher | MDPI AG |
| record_format | Article |
| series | Applied Sciences |
| spelling | doaj-art-f23b4af84a5741039e065ce4142f46842025-08-20T03:11:30ZengMDPI AGApplied Sciences2076-34172025-05-011511606910.3390/app15116069IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of VehiclesJinxiang Lai0Deqing Wang1Yifeng Zhao2College of General Education, Fujian Polytechnic of Water Conservancy and Electric Power, Yongan 366000, ChinaDepartment of Information and Communication Engineering, Xiamen University, Xiamen 361005, ChinaDepartment of Information and Communication Engineering, Xiamen University, Xiamen 361005, ChinaIn a multi-beam communication scenario where Infrastructure-to-Vehicle (I2V) and Vehicle-to-Vehicle (V2V) communications coexist, the limited spectrum of resources force V2V users to reuse the orthogonal frequency bands allocated to I2V, inevitably introducing cross-layer interference between I2V and V2V. Furthermore, the adoption of a multi-beam communication architecture exacerbates beam interference, significantly degrading the overall network’s communication and sensing performance. To address these challenges, this paper proposes an integrated sensing and communication (ISAC) beam allocation algorithm, termed IMSBA, which jointly optimizes beam direction, transmission power, and spectrum resource allocation to effectively mitigate the interference between I2V and V2V while maximizing the overall network performance. Specifically, IMSBA employs a joint optimization framework combining Multi-Agent Proximal Policy Optimization (MAPPO) with a Stackelberg game. Within this framework, MAPPO leverages vehicle perception data to dynamically optimize V2V beam steering and frequency selection, while the Stackelberg game reduces computational complexity through hierarchical decision-making and optimizes the joint power allocation among V2V users. Additionally, the proposed scheme incorporates a V2V cooperative sensing domain-sharing mechanism to enhance system robustness under adverse conditions. The experimental results demonstrated that, compared with existing baseline schemes, IMSBA achieved a 92.5% improvement in V2V energy efficiency while significantly enhancing both communication and sensing performance. This study provides an efficient and practical solution for spectrum-constrained scenarios in millimeter-wave Internet-of-Things (IoT), offering substantial theoretical insights and practical value for the efficient operation of intelligent transportation system (ITSs).https://www.mdpi.com/2076-3417/15/11/6069internet of vehicleintegrated sensing and communicationbeam allocationmulti-agent reinforcement learninggame theory |
| spellingShingle | Jinxiang Lai Deqing Wang Yifeng Zhao IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles Applied Sciences internet of vehicle integrated sensing and communication beam allocation multi-agent reinforcement learning game theory |
| title | IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles |
| title_full | IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles |
| title_fullStr | IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles |
| title_full_unstemmed | IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles |
| title_short | IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles |
| title_sort | imsba a novel integrated sensing and communication beam allocation based on multi agent reinforcement learning for mmwave internet of vehicles |
| topic | internet of vehicle integrated sensing and communication beam allocation multi-agent reinforcement learning game theory |
| url | https://www.mdpi.com/2076-3417/15/11/6069 |
| work_keys_str_mv | AT jinxianglai imsbaanovelintegratedsensingandcommunicationbeamallocationbasedonmultiagentreinforcementlearningformmwaveinternetofvehicles AT deqingwang imsbaanovelintegratedsensingandcommunicationbeamallocationbasedonmultiagentreinforcementlearningformmwaveinternetofvehicles AT yifengzhao imsbaanovelintegratedsensingandcommunicationbeamallocationbasedonmultiagentreinforcementlearningformmwaveinternetofvehicles |