A study of value iteration and policy iteration for Markov decision processes in Deterministic systems
In the context of deterministic discrete-time control systems, we examined the implementation of value iteration (VI) and policy iteration (PI) algorithms in Markov decision processes (MDPs) situated within Borel spaces. The deterministic nature of the system's transfer function plays a pivotal role, as...
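As a rough illustration of the two algorithms named in the abstract, the following sketch runs VI and PI on a tiny finite deterministic MDP. It is not the authors' method: the article works in Borel spaces, whereas this example assumes a finite state and action set, a discounted cost-minimization criterion, and made-up data (`next_state`, `cost`, `GAMMA` are all hypothetical). Because each transition is deterministic, the Bellman update needs no expectation, only a lookup of the single successor state.

```python
# Illustrative sketch only: a tiny finite, deterministic MDP with made-up data.
# Assumptions (not from the article): finite spaces, discounted cost, minimization.
GAMMA = 0.9  # assumed discount factor

# next_state[s][a] is the unique successor of state s under action a.
next_state = [[0, 1], [0, 2], [2, 2]]
# cost[s][a] is the one-step cost of taking action a in state s.
cost = [[1.0, 2.0], [1.0, 3.0], [0.0, 0.0]]
N_STATES = len(cost)


def q_value(v, s, a, gamma=GAMMA):
    # Deterministic transition: one successor state, so no expectation is needed.
    return cost[s][a] + gamma * v[next_state[s][a]]


def greedy_policy(v, gamma=GAMMA):
    # Pick a cost-minimizing action in each state (first index wins ties).
    return [
        min(range(len(cost[s])), key=lambda a: q_value(v, s, a, gamma))
        for s in range(N_STATES)
    ]


def value_iteration(gamma=GAMMA, tol=1e-10, max_iter=10_000):
    # Repeatedly apply the Bellman optimality operator until successive
    # iterates differ by less than tol in the sup norm.
    v = [0.0] * N_STATES
    for _ in range(max_iter):
        new_v = [
            min(q_value(v, s, a, gamma) for a in range(len(cost[s])))
            for s in range(N_STATES)
        ]
        if max(abs(x - y) for x, y in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


def evaluate_policy(policy, gamma=GAMMA, tol=1e-10, max_iter=10_000):
    # Iterative policy evaluation: fixed point of the policy's Bellman operator.
    v = [0.0] * N_STATES
    for _ in range(max_iter):
        new_v = [q_value(v, s, policy[s], gamma) for s in range(N_STATES)]
        if max(abs(x - y) for x, y in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


def policy_iteration(gamma=GAMMA, max_rounds=100):
    # Alternate evaluation and greedy improvement until the policy is stable.
    policy = [0] * N_STATES
    v = evaluate_policy(policy, gamma)
    for _ in range(max_rounds):
        improved = greedy_policy(v, gamma)
        if improved == policy:
            break
        policy = improved
        v = evaluate_policy(policy, gamma)
    return policy, v


if __name__ == "__main__":
    print("VI values:", value_iteration())
    print("PI policy and values:", policy_iteration())
```

On this toy problem both routines should agree on the same value function, which is the usual sanity check when comparing VI and PI.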
Main Authors: | Haifeng Zheng, Dan Wang |
---|---|
Format: | Article |
Language: | English |
Published: | AIMS Press 2024-11-01 |
Series: | AIMS Mathematics |
Subjects: | |
Online Access: | https://www.aimspress.com/article/doi/10.3934/math.20241613 |
Similar Items
- Some fixed point iteration procedures
  by: B. E. Rhoades
  Published: (1991-01-01)
- On the Mann and Ishikawa iteration processes
  by: Jia Yuting, et al.
  Published: (1996-01-01)
- The modification of the generalized gauss-seidel iteration techniques for absolute value equations
  by: Rashid Ali, et al.
  Published: (2022-12-01)
- On Feller's criterion for the law of the iterated logarithm
  by: Deli Li, et al.
  Published: (1994-01-01)
- The law of the iterated logarithm for exchangeable random variables
  by: Hu-Ming Zhang, et al.
  Published: (1995-01-01)