A study of value iteration and policy iteration for Markov decision processes in Deterministic systems

In the context of deterministic discrete-time control systems, we examined the implementation of value iteration (VI) and policy (PI) algorithms in Markov decision processes (MDPs) situated within Borel spaces. The deterministic nature of the system's transfer function plays a pivotal role, as...

Full description

Saved in:

Bibliographic Details
Main Authors:	Haifeng Zheng, Dan Wang
Format:	Article
Language:	English
Published:	AIMS Press 2024-11-01
Series:	AIMS Mathematics
Subjects:	markov decision processes deterministic system value iteration policy iteration average cost criterion
Online Access:	https://www.aimspress.com/article/doi/10.3934/math.20241613
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.aimspress.com/article/doi/10.3934/math.20241613

A study of value iteration and policy iteration for Markov decision processes in Deterministic systems

Internet

Similar Items