IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles

In a multi-beam communication scenario where Infrastructure-to-Vehicle (I2V) and Vehicle-to-Vehicle (V2V) communications coexist, the limited spectrum of resources force V2V users to reuse the orthogonal frequency bands allocated to I2V, inevitably introducing cross-layer interference between I2V an...

Full description

Saved in:
Bibliographic Details
Main Authors: Jinxiang Lai, Deqing Wang, Yifeng Zhao
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/11/6069
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849721924259676160
author Jinxiang Lai
Deqing Wang
Yifeng Zhao
author_facet Jinxiang Lai
Deqing Wang
Yifeng Zhao
author_sort Jinxiang Lai
collection DOAJ
description In a multi-beam communication scenario where Infrastructure-to-Vehicle (I2V) and Vehicle-to-Vehicle (V2V) communications coexist, the limited spectrum of resources force V2V users to reuse the orthogonal frequency bands allocated to I2V, inevitably introducing cross-layer interference between I2V and V2V. Furthermore, the adoption of a multi-beam communication architecture exacerbates beam interference, significantly degrading the overall network’s communication and sensing performance. To address these challenges, this paper proposes an integrated sensing and communication (ISAC) beam allocation algorithm, termed IMSBA, which jointly optimizes beam direction, transmission power, and spectrum resource allocation to effectively mitigate the interference between I2V and V2V while maximizing the overall network performance. Specifically, IMSBA employs a joint optimization framework combining Multi-Agent Proximal Policy Optimization (MAPPO) with a Stackelberg game. Within this framework, MAPPO leverages vehicle perception data to dynamically optimize V2V beam steering and frequency selection, while the Stackelberg game reduces computational complexity through hierarchical decision-making and optimizes the joint power allocation among V2V users. Additionally, the proposed scheme incorporates a V2V cooperative sensing domain-sharing mechanism to enhance system robustness under adverse conditions. The experimental results demonstrated that, compared with existing baseline schemes, IMSBA achieved a 92.5% improvement in V2V energy efficiency while significantly enhancing both communication and sensing performance. This study provides an efficient and practical solution for spectrum-constrained scenarios in millimeter-wave Internet-of-Things (IoT), offering substantial theoretical insights and practical value for the efficient operation of intelligent transportation system (ITSs).
format Article
id doaj-art-f23b4af84a5741039e065ce4142f4684
institution DOAJ
issn 2076-3417
language English
publishDate 2025-05-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-f23b4af84a5741039e065ce4142f46842025-08-20T03:11:30ZengMDPI AGApplied Sciences2076-34172025-05-011511606910.3390/app15116069IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of VehiclesJinxiang Lai0Deqing Wang1Yifeng Zhao2College of General Education, Fujian Polytechnic of Water Conservancy and Electric Power, Yongan 366000, ChinaDepartment of Information and Communication Engineering, Xiamen University, Xiamen 361005, ChinaDepartment of Information and Communication Engineering, Xiamen University, Xiamen 361005, ChinaIn a multi-beam communication scenario where Infrastructure-to-Vehicle (I2V) and Vehicle-to-Vehicle (V2V) communications coexist, the limited spectrum of resources force V2V users to reuse the orthogonal frequency bands allocated to I2V, inevitably introducing cross-layer interference between I2V and V2V. Furthermore, the adoption of a multi-beam communication architecture exacerbates beam interference, significantly degrading the overall network’s communication and sensing performance. To address these challenges, this paper proposes an integrated sensing and communication (ISAC) beam allocation algorithm, termed IMSBA, which jointly optimizes beam direction, transmission power, and spectrum resource allocation to effectively mitigate the interference between I2V and V2V while maximizing the overall network performance. Specifically, IMSBA employs a joint optimization framework combining Multi-Agent Proximal Policy Optimization (MAPPO) with a Stackelberg game. Within this framework, MAPPO leverages vehicle perception data to dynamically optimize V2V beam steering and frequency selection, while the Stackelberg game reduces computational complexity through hierarchical decision-making and optimizes the joint power allocation among V2V users. Additionally, the proposed scheme incorporates a V2V cooperative sensing domain-sharing mechanism to enhance system robustness under adverse conditions. The experimental results demonstrated that, compared with existing baseline schemes, IMSBA achieved a 92.5% improvement in V2V energy efficiency while significantly enhancing both communication and sensing performance. This study provides an efficient and practical solution for spectrum-constrained scenarios in millimeter-wave Internet-of-Things (IoT), offering substantial theoretical insights and practical value for the efficient operation of intelligent transportation system (ITSs).https://www.mdpi.com/2076-3417/15/11/6069internet of vehicleintegrated sensing and communicationbeam allocationmulti-agent reinforcement learninggame theory
spellingShingle Jinxiang Lai
Deqing Wang
Yifeng Zhao
IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles
Applied Sciences
internet of vehicle
integrated sensing and communication
beam allocation
multi-agent reinforcement learning
game theory
title IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles
title_full IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles
title_fullStr IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles
title_full_unstemmed IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles
title_short IMSBA: A Novel Integrated Sensing and Communication Beam Allocation Based on Multi-Agent Reinforcement Learning for mmWave Internet of Vehicles
title_sort imsba a novel integrated sensing and communication beam allocation based on multi agent reinforcement learning for mmwave internet of vehicles
topic internet of vehicle
integrated sensing and communication
beam allocation
multi-agent reinforcement learning
game theory
url https://www.mdpi.com/2076-3417/15/11/6069
work_keys_str_mv AT jinxianglai imsbaanovelintegratedsensingandcommunicationbeamallocationbasedonmultiagentreinforcementlearningformmwaveinternetofvehicles
AT deqingwang imsbaanovelintegratedsensingandcommunicationbeamallocationbasedonmultiagentreinforcementlearningformmwaveinternetofvehicles
AT yifengzhao imsbaanovelintegratedsensingandcommunicationbeamallocationbasedonmultiagentreinforcementlearningformmwaveinternetofvehicles