Heuristic Sarsa algorithm based on value function transfer

With the problem of slow convergence for traditional Sarsa algorithm,an improved heuristic Sarsa algorithm based on value function transfer was proposed.The algorithm combined traditional Sarsa algorithm and value function transfer method,and the algorithm introduced bisimulation metric and used it...

Full description

Saved in:
Bibliographic Details
Main Authors: Jianping CHEN, Zhengxia YANG, Quan LIU, Hongjie WU, Yang XU, Qiming FU
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2018-08-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2018133/
Tags: Add Tag
No Tags, Be the first to tag this record!