Communication resource allocation method in vehicular networks based on federated multi-agent deep reinforcement learning

Bibliographic Details
Main Authors: Qingli Liu, Yongjie Ma
Format: Article
Language: English
Published: Nature Portfolio 2025-08-01
Series: Scientific Reports
Online Access: https://doi.org/10.1038/s41598-025-15982-x
Description
Summary: In highly dynamic vehicular networking scenarios, when Vehicle-to-Infrastructure (V2I) links and Vehicle-to-Vehicle (V2V) links share spectrum resources, traditional distributed resource allocation methods lack global optimization and fail to respond to environmental changes in a timely manner, which leads to low system spectral efficiency. A resource allocation method based on federated multi-agent deep reinforcement learning is proposed for vehicular network communication. It fuses Asynchronous Federated Learning (AFL) with Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to optimize resource allocation synergistically. First, vehicles acting as agents dynamically optimize spectrum access, power control, and bandwidth allocation based on local channel states through the collaborative policy of MADDPG, reducing cross-link interference. Second, an asynchronous federated architecture is designed in which vehicles independently upload local model parameters to the global server, and the aggregation weights are dynamically adjusted according to real-time channel quality to optimize the update of the global model parameters. Finally, the global model parameters are fed back to the vehicles to further optimize the local resource allocation strategy, thus improving system spectral efficiency. Simulation results in the vehicular networking scenario show that, compared with the centralized DDPG, MADDPG, MAPPO, and FL-DuelingDQN algorithms, system spectral efficiency improves by 19.1% on average, the transmission success rate of the V2V link improves by 9.3% on average, and the total capacity of the V2I link increases by 16.1% on average.
ISSN: 2045-2322
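
Illustrative note: the summary describes an asynchronous aggregation step in which each vehicle uploads its local parameters on its own schedule and the aggregation weight depends on real-time channel quality. The record does not give the weighting rule, so the sketch below is only one possible reading and not the authors' method. The function names, the use of SINR in dB as the channel quality measure, and the `base_rate` and `ref_sinr_db` parameters are all assumptions made for illustration.

```python
from typing import List


def mixing_weight(sinr_db: float, base_rate: float = 0.5,
                  ref_sinr_db: float = 10.0) -> float:
    """Hypothetical rule mapping channel quality to an aggregation weight.

    The weight grows with the uploader's SINR and stays below base_rate,
    so uploads over poor channels move the global model less.
    """
    sinr = 10.0 ** (sinr_db / 10.0)       # dB to linear scale
    ref = 10.0 ** (ref_sinr_db / 10.0)    # assumed reference quality
    return base_rate * sinr / (sinr + ref)


def async_aggregate(global_params: List[float], local_params: List[float],
                    sinr_db: float) -> List[float]:
    """Blend one vehicle's upload into the global model as soon as it arrives.

    No synchronization barrier is used: each upload is applied on its own,
    which is the asynchronous part of the assumed scheme.
    """
    alpha = mixing_weight(sinr_db)
    return [(1.0 - alpha) * g + alpha * l
            for g, l in zip(global_params, local_params)]


if __name__ == "__main__":
    # Toy example: three uploads arriving at different times and qualities.
    global_model = [0.0, 0.0]
    uploads = [([1.0, 2.0], 20.0), ([0.5, 1.5], 0.0), ([1.2, 1.8], 10.0)]
    for local_model, sinr_db in uploads:
        global_model = async_aggregate(global_model, local_model, sinr_db)
    print(global_model)
```

In a full system the parameter lists would be the actor and critic weights of each MADDPG agent, and the updated global parameters would be sent back to the vehicles; both steps are omitted here.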