A study of value iteration and policy iteration for Markov decision processes in Deterministic systems

In the context of deterministic discrete-time control systems, we examined the implementation of value iteration (VI) and policy iteration (PI) algorithms in Markov decision processes (MDPs) situated within Borel spaces. The deterministic nature of the system's transfer function plays a pivotal role, as...

Bibliographic Details
Main Authors:	Haifeng Zheng, Dan Wang
Format:	Article
Language:	English
Published:	AIMS Press 2024-11-01
Series:	AIMS Mathematics
Subjects:	Markov decision processes; deterministic system; value iteration; policy iteration; average cost criterion
Online Access:	https://www.aimspress.com/article/doi/10.3934/math.20241613

Similar Items

Some fixed point iteration procedures
by: B. E. Rhoades
Published: (1991-01-01)

On the Mann and Ishikawa iteration processes
by: Jia Yuting, et al.
Published: (1996-01-01)

The modification of the generalized gauss-seidel iteration techniques for absolute value equations
by: Rashid Ali, et al.
Published: (2022-12-01)

On Feller's criterion for the law of the iterated logarithm
by: Deli Li, et al.
Published: (1994-01-01)

The law of the iterated logarithm for exchangeable random variables
by: Hu-Ming Zhang, et al.
Published: (1995-01-01)

Bandlimited Frequency-Constrained Iterative Methods
by: Harrison Garrett, et al.
Published: (2025-01-01)

Spatio-Temporal Joint Trajectory Planning for Autonomous Vehicles Based on Improved Constrained Iterative LQR
by: Qin Li, et al.
Published: (2025-01-01)

Access and sustainment of ELMy H-mode operation for ITER pre-fusion power operation plasmas using JINTRAC
by: E. Tholerus, et al.
Published: (2025-01-01)

A generalization of some fixed point theorems of K. M. Ghosh
by: B. E. Rhoades
Published: (1982-01-01)

A generalization of contraction principle
by: K. M. Ghosh
Published: (1981-01-01)

ACCELERATED ITERATIVE RECONSTRUCTION OF PHANTOM «ROZI» BY OS-SART METHOD USING ORDERED SUBSET PROJECTIONS
by: S. A. Zolotarev, et al.
Published: (2017-08-01)

Chebyshev iteration for the problem with nonlocal boundary condition
by: Mifodijus Sapagovas, et al.
Published: (2004-12-01)

A Survey on High-Order Internal Model Based Iterative Learning Control
by: Miao Yu, et al.
Published: (2019-01-01)

Four-Step T-Stable Generalized Iterative Technique with Improved Convergence and Various Applications
by: Quanita Kiran, et al.
Published: (2025-01-01)

Improving the Quality of Single-Phase Grid-Connected Solar Systems Using Iterative Control Method
by: Mazharul Islam, et al.
Published: (2024-12-01)

Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
by: Chang-Teng Shi, et al.
Published: (2024-12-01)

An iterative method based on the average quadrature formula
by: Tusar Singh, et al.
Published: (2025-03-01)

ITER NBI operational window and power availability constraints due to shine-through losses
by: P. Vincenzi, et al.
Published: (2025-01-01)

Combining Laplace transform and Variational iteration method for solving singular IVPs and BVPs of Lane–Emden type equation
by: Mohamed H. Jassim, et al.
Published: (2024-06-01)

Damped Iterative Explicit Guidance for Multistage Rockets with Thrust Drop Faults
by: Zongzhan Ma, et al.
Published: (2025-01-01)

Label iteration-based clustering ensemble algorithm
by: HE Yulin, et al.
Published: (2024-12-01)

Iterated Stieltjes transform of generalized functions
by: L. S. Dube
Published: (1985-01-01)

TUBING SYSTEM PERFORMANCE PROFILING OF DRY GAS WELLS USING NEWTON RAPHSON ITERATION METHOD
by: CHINEDU WILFRED OKOLOGUME, et al.
Published: (2021-10-01)

A computationally efficient non‐iterative four‐parameter sine fitting method
by: Balázs Renczes, et al.
Published: (2021-10-01)

On efficient iterative schemes for finding all solutions of non-linear engineering problems
by: Mudassir Shams, et al.
Published: (2024-09-01)

Investigating the influence of divertor baffles on nitrogen-seeded detachment in TCV with SOLPS-ITER simulations and TCV experiments
by: G. Sun, et al.
Published: (2025-01-01)

Nonlinear iterative approximation of steady incompressible chemically reacting flows
by: Gazca-Orozco, Pablo Alexei, et al.
Published: (2022-09-01)

A cutting-plane method with internal iteration points for the general convex programming problem
by: I. Ya. Zabotin, et al.
Published: (2024-01-01)

Nonlinear stochastic Markov processes and modeling uncertainty in populations
by: H.Thomas Banks, et al.
Published: (2011-11-01)

A novel class of fourth-order derivative-free iterative methods to obtain multiple zeros and their basins of attraction
by: Munish Kansal, et al.
Published: (2024-12-01)

A Robust Hermitian and Skew-Hermitian Based Multiplicative Splitting Iterative Method for the Continuous Sylvester Equation
by: Mohammad Khorsand Zak, et al.
Published: (2025-01-01)

An experimental comparison of two preconditioned iterative methods to solve the elliptic partial differential equations
by: Seyyed Ahmad Edalatpanah
Published: (2022-03-01)

Automatic Design of Robot Swarms under Concurrent Design Criteria: A Study Based on Iterated F‐Race
by: David Garzón Ramos, et al.
Published: (2025-01-01)

Feeding System’s Sensitivity and Reliability Analysis through Markov Decision Process
by: Sujata Jadhav, et al.
Published: (2025-04-01)

Two accelerated gradient-based iteration methods for solving the Sylvester matrix equation AX + XB = C
by: Huiling Wang, et al.
Published: (2024-12-01)

Application of the Canon 320 Row Variable Helical Pitch CTA System Combined with Iterative Reconstruction Technology for Lower Limb Vascular Imaging
by: Wenchao JI, et al.
Published: (2025-01-01)

Making virtual learning environment more intelligent: application of Markov decision process
by: Dalia Baziukaitė
Published: (2004-12-01)

Indifferentiable hash functions in the standard model
by: Juha Partala
Published: (2021-07-01)

A semi-supervised deep neuro-fuzzy iterative learning system for automatic segmentation of hippocampus brain MRI
by: M Nisha, et al.
Published: (2024-12-01)

Integrating fast iterative filtering and ensemble neural network structure with attention mechanism for carbon price forecasting
by: Wang Zhong, et al.
Published: (2024-11-01)