A cooperative multi-agent reinforcement learning algorithm based on dynamic self-selection parameters sharing

In multi-agent reinforcement learning, parameter sharing can effectively alleviate the inefficiency of learning caused by non-stationarity.However, maintaining the same policy forall agents during learning may have detrimental effects.To solve this problem, a new approach was introduced to give agen...

Full description

Saved in:
Bibliographic Details
Main Authors: Han WANG, Yang YU, Yuan JIANG
Format: Article
Language:zho
Published: POSTS&TELECOM PRESS Co., LTD 2022-03-01
Series:智能科学与技术学报
Subjects:
Online Access:http://www.cjist.com.cn/thesisDetails#10.11959/j.issn.2096-6652.202214
Tags: Add Tag
No Tags, Be the first to tag this record!