Online hierarchical reinforcement learning based on interrupting Option
Aiming at dealing with volume of big data,an on-line updating algorithm,named by Macro-Q with in-place updating (MQIU),which was based on Macro-Q algorithm and takes advantage of in-place updating approach,was proposed.The MQIU algorithm updates both the value function of abstract action and the val...
Saved in:
Main Authors: | Fei ZHU, Zhi-peng XU, Quan LIU, Yu-chen FU, Hui WANG |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2016-06-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2016117/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Multi-Agent Hierarchical Graph Attention Actor–Critic Reinforcement Learning
by: Tongyue Li, et al.
Published: (2024-12-01) -
Graph Pointer Network Based Hierarchical Curriculum Reinforcement Learning Method Solving Shuttle Tankers Scheduling Problem
by: Xiaoyong Gao, et al.
Published: (2024-12-01) -
Scheduling framework based on reinforcement learning in online-offline colocated cloud environment
by: Ling MA, et al.
Published: (2023-06-01) -
GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
by: Biao JIN, et al.
Published: (2023-06-01) -
Reinforcement learning-based vehicle travel path reconstruction from sparse automatic licence plate recognition data
by: Qiuping Li, et al.
Published: (2025-01-01)