A study of value iteration and policy iteration for Markov decision processes in Deterministic systems

In the context of deterministic discrete-time control systems, we examined the implementation of value iteration (VI) and policy iteration (PI) algorithms in Markov decision processes (MDPs) situated within Borel spaces. The deterministic nature of the system's transfer function plays a pivotal role, as...

Bibliographic Details
Main Authors:	Haifeng Zheng, Dan Wang
Format:	Article
Language:	English
Published:	AIMS Press 2024-11-01
Series:	AIMS Mathematics
Subjects:	Markov decision processes; deterministic system; value iteration; policy iteration; average cost criterion
Online Access:	https://www.aimspress.com/article/doi/10.3934/math.20241613

Similar Items

Newton iteration and SOR iteration based WWSE precoding algorithm in massive MIMO systems
by: Wensheng SUN, et al.
Published: (2019-11-01)

Joint source-channel iterative decoding based on hidden markov model
by: YIN Wei-wei, et al.
Published: (2006-01-01)

Optimized algorithm for value iteration based on topological sequence backups
by: Wei HUANG, et al.
Published: (2014-08-01)

Some fixed point iteration procedures
by: B. E. Rhoades
Published: (1991-01-01)

On the Mann and Ishikawa iteration processes
by: Jia Yuting, et al.
Published: (1996-01-01)

The modification of the generalized gauss-seidel iteration techniques for absolute value equations
by: Rashid Ali, et al.
Published: (2022-12-01)

Low-complexity FTN receivers based on frequency domain iterative decision feedback equalization
by: Juan ZENG, et al.
Published: (2017-04-01)

On Feller's criterion for the law of the iterated logarithm
by: Deli Li, et al.
Published: (1994-01-01)

Iterative multistage adaptive Rake receiver for CDMA wireless system
by: YE Li-bing, et al.
Published: (2005-01-01)

The law of the iterated logarithm for exchangeable random variables
by: Hu-Ming Zhang, et al.
Published: (1995-01-01)

A low complexity precoding based on weighted two diagonal iteration in massive MIMO system
by: Haiyan CAO, et al.
Published: (2018-09-01)

Bandlimited Frequency-Constrained Iterative Methods
by: Harrison Garrett, et al.
Published: (2025-01-01)

Spatio-Temporal Joint Trajectory Planning for Autonomous Vehicles Based on Improved Constrained Iterative LQR
by: Qin Li, et al.
Published: (2025-01-01)

Impulsive noise mitigation based on iteration adaptive approach in underwater acoustic communication
by: Guili ZHOU, et al.
Published: (2017-11-01)

Iterative channel estimation based on the Turbo principle for UWB system
by: TENG Peng-wei, et al.
Published: (2008-01-01)

SL-n iterative localization algorithm in wireless sensor networks
by: LUO Xu, et al.
Published: (2011-01-01)

Access and sustainment of ELMy H-mode operation for ITER pre-fusion power operation plasmas using JINTRAC
by: E. Tholerus, et al.
Published: (2025-01-01)

Zero-correlation zone complementary sets based on iteration
by: LI Yu-bo, et al.
Published: (2011-01-01)

Chebyshev iteration for the problem with nonlocal boundary condition
by: Mifodijus Sapagovas, et al.
Published: (2004-12-01)

Study on the multistage iterative rank-reduced method
by: GUO Yi, et al.
Published: (2007-01-01)

ACCELERATED ITERATIVE RECONSTRUCTION OF PHANTOM «ROZI» BY OS-SART METHOD USING ORDERED SUBSET PROJECTIONS
by: S. A. Zolotarev, et al.
Published: (2017-08-01)

Joint iterative decision feedback channel estimation and detection for Turbo coded V-BLAST MIMO-OFDM systems
by: QIAO Xiao-qiang, et al.
Published: (2006-01-01)

A Survey on High-Order Internal Model Based Iterative Learning Control
by: Miao Yu, et al.
Published: (2019-01-01)

Regularized threshold iteration method for impulsive noise suppression in underwater acoustic communication
by: Nana LI, et al.
Published: (2019-03-01)

A generalization of contraction principle
by: K. M. Ghosh
Published: (1981-01-01)

A generalization of some fixed point theorems of K. M. Ghosh
by: B. E. Rhoades
Published: (1982-01-01)

Four-Step T-Stable Generalized Iterative Technique with Improved Convergence and Various Applications
by: Quanita Kiran, et al.
Published: (2025-01-01)

An iterative method based on the average quadrature formula
by: Tusar Singh, et al.
Published: (2025-03-01)

Improving the Quality of Single-Phase Grid-Connected Solar Systems Using Iterative Control Method
by: Mazharul Islam, et al.
Published: (2024-12-01)

Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
by: Chang-Teng Shi, et al.
Published: (2024-12-01)

SSCANL decoder based joint iterative detection and decoding algorithm
by: Chongyang LIU, et al.
Published: (2022-10-01)

Adaptive iterative bit allocation algorithm for MIMO-OFDM system
by: FAN Ling-yan, et al.
Published: (2007-01-01)

Intercarrier interference cancellation based on iterative detection algorithm for OFDM systems over time-varying channels
by: WANG Xia, et al.
Published: (2006-01-01)

Label iteration-based clustering ensemble algorithm
by: HE Yulin, et al.
Published: (2024-12-01)

Iterated Stieltjes transform of generalized functions
by: L. S. Dube
Published: (1985-01-01)

Damped Iterative Explicit Guidance for Multistage Rockets with Thrust Drop Faults
by: Zongzhan Ma, et al.
Published: (2025-01-01)

ITER NBI operational window and power availability constraints due to shine-through losses
by: P. Vincenzi, et al.
Published: (2025-01-01)

Combining Laplace transform and Variational iteration method for solving singular IVPs and BVPs of Lane–Emden type equation
by: Mohamed H. Jassim, et al.
Published: (2024-06-01)

TUBING SYSTEM PERFORMANCE PROFILING OF DRY GAS WELLS USING NEWTON RAPHSON ITERATION METHOD
by: CHINEDU WILFRED OKOLOGUME, et al.
Published: (2021-10-01)

Iterative hard thresholding based impulsive noise suppression method in power line communication system
by: Mingchen YU, et al.
Published: (2016-11-01)