Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism

In the reinforcement leaning tasks with continuous state spaces,the algorithms are usually facing the problems of ill initial performance and low convergence speed.In order to solve these problems,the potential function shaping reward mechanism was proposed to improve the reinforcement learning algo...

Full description

Saved in:
Bibliographic Details
Main Authors: Fei XIAO, Quan LIU, Qi-ming FU, Hong-kun SUN, Long GAO
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2013-01-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/1000-436X(2013)01-0077-12/
Tags: Add Tag
No Tags, Be the first to tag this record!