Optimized algorithm for value iteration based on topological sequence backups
In order to improve the convergence performance, an optimized value iteration based on topological sequence backups, VI-TS, is proposed. The key idea of VI-TS is to circumvent the problem of unnecessary backups by dividing an MDP into strongly-connected components and solving these components in top...
Saved in:
Main Authors: | Wei HUANG, Quan LIU, Hong-kun SUN, Qi-ming FU, HOUXiao-ke Z |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2014-08-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2014.08.008/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
A study of value iteration and policy iteration for Markov decision processes in Deterministic systems
by: Haifeng Zheng, et al.
Published: (2024-11-01) -
The law of the iterated logarithm for exchangeable random variables
by: Hu-Ming Zhang, et al.
Published: (1995-01-01) -
The modification of the generalized gauss-seidel iteration techniques for absolute value equations
by: Rashid Ali, et al.
Published: (2022-12-01) -
On Cluster Points of Sequences via Fuzzy Topology
by: Jyothis K Mohan, et al.
Published: (2024-10-01) -
Newton iteration and SOR iteration based WWSE precoding algorithm in massive MIMO systems
by: Wensheng SUN, et al.
Published: (2019-11-01)