Optimized algorithm for value iteration based on topological sequence backups

In order to improve the convergence performance, an optimized value iteration based on topological sequence backups, VI-TS, is proposed. The key idea of VI-TS is to circumvent the problem of unnecessary backups by dividing an MDP into strongly-connected components and solving these components in top...

Full description

Saved in:
Bibliographic Details
Main Authors: Wei HUANG, Quan LIU, Hong-kun SUN, Qi-ming FU, HOUXiao-ke Z
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2014-08-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2014.08.008/
Tags: Add Tag
No Tags, Be the first to tag this record!