Adaptive temporal-difference learning via deep neural network function approximation: a non-asymptotic analysis

Abstract: Although deep reinforcement learning has achieved notable practical success, its theoretical foundations were scarcely explored until recently. Nonetheless, the convergence rate of current neural temporal-difference (TD) learning algorithms is constrained, largely due to t...

Bibliographic Details
Main Authors:	Guoyong Wang, Tiange Fu, Ruijuan Zheng, Xuhui Zhao, Junlong Zhu, Mingchuan Zhang
Format:	Article
Language:	English
Published:	Springer 2025-01-01
Series:	Complex & Intelligent Systems
Subjects:	Adaptive methods; Non-asymptotic convergence; Nonlinear function approximation; Reinforcement learning; Temporal-difference learning
Online Access:	https://doi.org/10.1007/s40747-024-01757-w

Similar Items

Boundedness and asymptotic behavior of solutions of a forced difference equation
by: John R. Graef, et al.
Published: (1994-01-01)

Enhancing navigation performance in unknown environments using spiking neural networks and reinforcement learning with asymptotic gradient method
by: Xiaode Liu, et al.
Published: (2025-01-01)

A structured model for the spread of Mycobacterium marinum: Foundations for a numerical approximation scheme
by: Azmy S. Ackleh, et al.
Published: (2014-02-01)

Finite difference approximations for measure-valued solutions of a hierarchically size-structured population model
by: Azmy S. Ackleh, et al.
Published: (2014-11-01)

An embedding of Schwartz distributions in the algebra of asymptotic functions
by: Michael Oberguggenberger, et al.
Published: (1998-01-01)

On rational approximation in a ball in $\mathbb{C}^N$
by: P. W. Darko, et al.
Published: (2000-01-01)

Application of the method of stationary phase to weakly nonlinear hyperbolic systems asymptotic solving
by: Aleksandras Krylovas
Published: (2004-12-01)

Asymptotic equivalence of sequences and summability
by: Jinlu Li
Published: (1997-01-01)

On convergence of (μ,ν)-sequences of unisolvent rational approximants to meromorphic functions in $\mathbb{C}^n$
by: C. H. Lutterodt
Published: (1985-01-01)

On the system of two nonlinear difference equations $x_{n+1}=A+x_{n-1}/y_n$, $y_{n+1}=A+y_{n-1}/x_n$
by: G. Papaschinopoulos, et al.
Published: (2000-01-01)

On a problem of Nathanson related to minimal asymptotic bases of order $h$
by: Chen, Shi-Qiang, et al.
Published: (2024-02-01)

An efficient modified HS conjugate gradient algorithm in machine learning
by: Gonglin Yuan, et al.
Published: (2024-11-01)

Asymptotic equivalence and summability
by: Mousa S. Marouf
Published: (1993-01-01)

Remarks on the existence and decay of the nonlinear beam equation
by: Jaime E. Muñoz Rivera
Published: (1994-01-01)

Fine-scale forest classification with multi-temporal sentinel-1/2 imagery using a temporal convolutional neural network
by: Rongfei Duan, et al.
Published: (2025-12-01)

Intentionally-underestimated value function at terminal state for temporal-difference learning with mis-designed reward
by: Taisuke Kobayashi
Published: (2025-03-01)

A toxin-mediated size-structured population model: Finite difference approximation and well-posedness
by: Qihua Huang, et al.
Published: (2016-04-01)

An asymptotic expansion for a ratio of products of gamma functions
by: Wolfgang Bühring
Published: (2000-01-01)

Asymptotical expansions in the Kubilius theorem of large deviations
by: Rimantas Skrabutėnas
Published: (2005-12-01)

On the Convergence Rate for the Longest at Most T-Contaminated Runs of Heads
by: István Fazekas, et al.
Published: (2025-01-01)

Recognition of biosignals with nonlinear properties by approximate entropy parameters
by: L.A. Manilo, et al.
Published: (2023-10-01)

On a modified Bernstein operators approximation method for computational solution of Volterra integral equation
by: Khursheed J. Ansari, et al.
Published: (2025-01-01)

Parameter Estimation in a Coupled System of Nonlinear Size-Structured Populations
by: Azmy S. Ackleh, et al.
Published: (2005-02-01)

Federated Digital Twins: A Scheduling Approach Based on Temporal Graph Neural Network and Deep Reinforcement Learning
by: Young-Jin Kim, et al.
Published: (2025-01-01)

Bi-Fuzzy S-Approximation Spaces
by: Ronghai Wang, et al.
Published: (2025-01-01)

Habitat and Spatio-Temporal Interaction Between Green Peafowl with Cattle and Megaherbivores in Baluran National Park
by: Satyawan Pudyatmoko
Published: (2019-05-01)

Model-Based Graph Reinforcement Learning for Inductive Traffic Signal Control
by: Francois-Xavier Devailly, et al.
Published: (2024-01-01)

A Large-Scale Spatio-Temporal Multimodal Fusion Framework for Traffic Prediction
by: Bodong Zhou, et al.
Published: (2024-09-01)

S-asymptotic expansion of distributions
by: Bogoljub Stankovic
Published: (1988-01-01)

Review of the matched asymptotic approach of the coupled criterion
by: Jiménez-Alfaro, Sara, et al.
Published: (2025-02-01)

Optimal feedback control of dynamical systems via value-function approximation
by: Kunisch, Karl, et al.
Published: (2023-07-01)

A proper subclass of Maclane's class 𝒜
by: May Hamdan
Published: (1999-01-01)

A temporal knowledge graph reasoning model based on recurrent encoding and contrastive learning
by: Weitong Liu, et al.
Published: (2025-01-01)

On some constants in simultaneous approximation
by: K. Balázs, et al.
Published: (1995-01-01)

A weak invariance principle and asymptotic stability for evolution equations with bounded generators
by: E. N. Chukwu, et al.
Published: (1995-01-01)

The Solution and Dynamic Behaviour of Difference Equations of Twenty-First Order
by: Ibrahim Tarek Fawzi Abdelhamid, et al.
Published: (2023-07-01)

Location of approximations of a Markoff theorem
by: K. C. Prasad, et al.
Published: (1990-01-01)

Spatial-Temporal Fusion Graph Neural Networks With Mixed Adjacency for Weather Forecasting
by: Ang Guo, et al.
Published: (2025-01-01)

On approximation of the solutions of delay differential equations by using piecewise constant arguments
by: Istevan Györi
Published: (1991-01-01)

On the normal approximation for weakly dependent random variables
by: Jonas Sunklodas
Published: (2005-12-01)