Actor-critic algorithm with incremental dual natural policy gradient
The existed algorithms for continuous action space failed to consider the way of selecting optimal action and utilizing the knowledge of the action space,so an efficient actor-critic algorithm was proposed by improving the natural gradient.The objective of the proposed algorithm was to maximize the...
Saved in:
Main Authors: | Peng ZHANG, Quan LIU, Shan ZHONG, Jian-wei ZHAI, Wei-sheng QIAN |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2017-04-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2017089/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
On Quantum Natural Policy Gradients
by: Andre Sequeira, et al.
Published: (2024-01-01) -
Perspectives on Soft Actor–Critic (SAC)-Aided Operational Control Strategies for Modern Power Systems with Growing Stochastics and Dynamics
by: Jinbo Liu, et al.
Published: (2025-01-01) -
Mixture of Experts Framework Based on Soft Actor-Critic Algorithm for Highway Decision-Making of Connected and Automated Vehicles
by: Fuxing Yao, et al.
Published: (2025-01-01) -
Function approximation method based on weights gradient descent in reinforcement learning
by: Xiaoyan QIN, et al.
Published: (2023-08-01) -
Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism
by: Fei XIAO, et al.
Published: (2013-01-01)