MACRPO: Multi-agent cooperative recurrent policy optimization

This work considers the problem of learning cooperative policies in multi-agent settings with partially observable and non-stationary environments without a communication channel. We focus on improving information sharing between agents and propose a new multi-agent actor-critic method called Multi-...

Full description

Saved in:
Bibliographic Details
Main Authors: Eshagh Kargar, Ville Kyrki
Format: Article
Language:English
Published: Frontiers Media S.A. 2024-12-01
Series:Frontiers in Robotics and AI
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frobt.2024.1394209/full
Tags: Add Tag
No Tags, Be the first to tag this record!