Bayesian Q learning method with Dyna architecture and prioritized sweeping

Bayesian Q learning method with Dyna architecture and prioritized sweeping

In order to balance this trade-off, a probability distribution was used in Bayesian Q learning method to de-scribe the uncertainty of the Q value and choose actions with this distribution. But the slow convergence is a big problem for Bayesian Q-Learning. In allusion to the above problems, a novel B...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jun YU, Quan LIU, Qi-ming FU, Hong-kun SUN, Gui-xing CHEN
Format:	Article
Language:	zho
Published:	Editorial Department of Journal on Communications 2013-11-01
Series:	Tongxin xuebao
Subjects:	reinforcement learning Markov decision process prioritized sweeping Dyna architecture Bayesian Q learning
Online Access:	http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.11.015/
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Making virtual learning environment more intelligent: application of Markov decision process
by: Dalia Baziukaitė
Published: (2004-12-01)

Numerical study of sweep angle effects of the control fin in supercavitating flow
by: Van-Duyen Pham, et al.
Published: (2025-12-01)

Transmission scheduling scheme based on deep Q learning in wireless network
by: Jiang ZHU, et al.
Published: (2018-04-01)

Design and Application Research of an Adaptive Obstacle Climbing Mechanism for Floor Sweeping Robot
by: Feng Shupeng, et al.
Published: (2018-01-01)

Q-learning global path planning for UAV navigation with pondered priorities
by: Kevin B. de Carvalho, et al.
Published: (2025-03-01)

Research of the Fast Modeling for Spiral Bevel Gear with Spherical Involute based on the SWEEP
by: Li Tongzhong, et al.
Published: (2015-01-01)

Research of the Characteristic of the Working Mode of Hydraulic Machinery Compound Transmission Sweeping Vehicle
by: Fuyi Cao, et al.
Published: (2019-02-01)

A survey of neural architecture search
by: Mingjie HE, et al.
Published: (2019-05-01)

Analysis of anomalous behaviour in network systems using deep reinforcement learning with convolutional neural network architecture
by: Mohammad Hossein Modirrousta, et al.
Published: (2024-12-01)

Preliminary study on the connotation of flexibility in dynamically reconfigurable networks
by: Dong-nian CHENG, et al.
Published: (2012-08-01)

A Bayesian Markov Framework for Modeling Breast Cancer Progression
by: Tong Wu
Published: (2024-12-01)

5G millimeter wave cell search algorithm with beam sweeping
by: Min SHEN, et al.
Published: (2020-06-01)

Stakeholder Prioritization in Brazilian Companies Disclosing GRI Reports
by: Keysa Manuela Cunha de Mascena, et al.
Published: (2018-01-01)

Matrix Ranking: A Tool for Decision-Making and Prioritization
by: Keegan Gay, et al.
Published: (2016-10-01)

Matrix Ranking: A Tool for Decision-Making and Prioritization
by: Keegan Gay, et al.
Published: (2016-10-01)

Autonomous security analysis and penetration testing model based on attack graph and deep Q-learning network
by: Cheng FAN, et al.
Published: (2023-12-01)

Nonhomogeneous Markov chains for degeneration behaviour of RC members’ durability and its Bayesian updating
by: Ruiqi Guo, et al.
Published: (2025-07-01)

Identifying and Prioritizing Suitable Financing Methods in Mosques of the Islamic Republic of Iran
by: Mahdi Karimi, et al.
Published: (2024-11-01)

Defense decision-making method based on incomplete information stochastic game and Q-learning
by: Hongqi ZHANG, et al.
Published: (2018-08-01)

Reinforcement Learning-Based Autonomous Soccer Agents: A Study in Multi-Agent Coordination and Strategy Development
by: Biplov Paneru, et al.
Published: (2025-01-01)

Speed of urban public transport as criterion of the justification of spatial and time-based prioritization
by: Yurii Yevchuk
Published: (2024-12-01)

New Report: Prioritization in Museum Collections – A Part of War and Disaster Preparedness
by: Susanna Carlsten
Published: (2025-02-01)

A health security-based framework for prioritizing regions for digital learning in complex health emergencies
by: Shawn M. D’Andrea, et al.
Published: (2025-01-01)

BAYESIAN FINITE ELEMENT MODEL UPDATING BASED ON MARKOV CHAIN POPULATION COMPETITION
by: YE Ling, et al.
Published: (2024-01-01)

Prioritizing Bridges for Seismic Resilience Enhancement: A Case Study of Algeria
by: mohammed abdellaoui, et al.
Published: (2024-05-01)

An MCMC Approach to Bayesian Image Analysis in Fourier Space
by: Konstantinos Bakas, et al.
Published: (2025-12-01)

Soil erosion assessment and watershed prioritization using GIS-based RUSLE in lower Shivaliks of Northwest India
by: Abrar Yousuf, et al.
Published: (2025-01-01)

Prioritizing DevOps Implementation Guidelines for Sustainable Software Projects
by: Muhammad Zohaib, et al.
Published: (2024-01-01)

Memory consolidation from a reinforcement learning perspective
by: Jong Won Lee, et al.
Published: (2025-01-01)

Identifying and Prioritizing Tourism Attractions in Ravansar Based on Tourist Attracting Potentials
by: Yousef Ghanbari, et al.
Published: (2012-07-01)

Prioritize the factors affecting on development of ecotourism villages of Barzok district- kashan by Combining DEMATEL and ANP Methods
by: Mohsen Shaterian, et al.
Published: (2017-03-01)

Adaptive pilot design for OFDM based on deep reinforcement learning
by: Qiaoshou LIU, et al.
Published: (2023-09-01)

A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents
by: Zhen Zhang, et al.
Published: (2018-01-01)

Comparative analysis of Q-learning, SARSA, and deep Q-network for microgrid energy management
by: Sreyas Ramesh, et al.
Published: (2025-01-01)

Research on resource allocation algorithm of centralized and distributed Q-learning in machine communication
by: Yunhe YU, et al.
Published: (2021-11-01)

Reinforcement learning-based vehicle travel path reconstruction from sparse automatic licence plate recognition data
by: Qiuping Li, et al.
Published: (2025-01-01)

Internet intelligent routing architecture and algorithm
by: Fei GUI, et al.
Published: (2020-10-01)

Seismic Prediction of Porosity in the Norne Field: Utilizing Support Vector Regression and Empirical Models Driven by Bayesian Linearized Inversion
by: Jorge A. Teruya Monroe, et al.
Published: (2025-01-01)

Lightweight and Efficient CSI-Based Human Activity Recognition via Bayesian Optimization-Guided Architecture Search and Structured Pruning
by: Sungkwan Youm, et al.
Published: (2025-01-01)

Efficiency-based prioritization of stakeholders using data envelopment analysis
by: Soheil Azizi, et al.
Published: (2023-12-01)