Bayesian Q learning method with Dyna architecture and prioritized sweeping
In order to balance this trade-off, a probability distribution was used in Bayesian Q learning method to de-scribe the uncertainty of the Q value and choose actions with this distribution. But the slow convergence is a big problem for Bayesian Q-Learning. In allusion to the above problems, a novel B...
Saved in:
Main Authors: | Jun YU, Quan LIU, Qi-ming FU, Hong-kun SUN, Gui-xing CHEN |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2013-11-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.11.015/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Making virtual learning environment more intelligent: application of Markov decision process
by: Dalia Baziukaitė
Published: (2004-12-01) -
Numerical study of sweep angle effects of the control fin in supercavitating flow
by: Van-Duyen Pham, et al.
Published: (2025-12-01) -
Transmission scheduling scheme based on deep Q learning in wireless network
by: Jiang ZHU, et al.
Published: (2018-04-01) -
Design and Application Research of an Adaptive Obstacle Climbing Mechanism for Floor Sweeping Robot
by: Feng Shupeng, et al.
Published: (2018-01-01) -
Q-learning global path planning for UAV navigation with pondered priorities
by: Kevin B. de Carvalho, et al.
Published: (2025-03-01)