Adaptive temporal-difference learning via deep neural network function approximation: a non-asymptotic analysis
Abstract Although deep reinforcement learning has achieved notable practical achievements, its theoretical foundations have been scarcely explored until recent times. Nonetheless, the rate of convergence for current neural temporal-difference (TD) learning algorithms is constrained, largely due to t...
Saved in:
Main Authors: | Guoyong Wang, Tiange Fu, Ruijuan Zheng, Xuhui Zhao, Junlong Zhu, Mingchuan Zhang |
---|---|
Format: | Article |
Language: | English |
Published: |
Springer
2025-01-01
|
Series: | Complex & Intelligent Systems |
Subjects: | |
Online Access: | https://doi.org/10.1007/s40747-024-01757-w |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Boundedness and asymptotic behavior of solutions of a forced difference equation
by: John R. Graef, et al.
Published: (1994-01-01) -
Enhancing navigation performance in unknown environments using spiking neural networks and reinforcement learning with asymptotic gradient method
by: Xiaode Liu, et al.
Published: (2025-01-01) -
A structured model for the spread of Mycobacterium marinum: Foundations for a numerical approximation scheme
by: Azmy S. Ackleh, et al.
Published: (2014-02-01) -
Finite difference approximations for measure-valued solutions of a hierarchicallysize-structured population model
by: Azmy S. Ackleh, et al.
Published: (2014-11-01) -
An embedding of Schwartz distributions in the algebra of asymptotic functions
by: Michael Oberguggenberger, et al.
Published: (1998-01-01)