TD algorithm based on double-layer fuzzy partitioning

When dealing with continuous-space problems, traditional Q-iteration algorithms based on lookup tables or function approximation converge slowly and have difficulty producing a continuous policy. To overcome these weaknesses, an on-policy TD algorithm named DFP-OPTD was proposed based on double-la...

Bibliographic Details
Main Authors:	Xiang MU, Quan LIU, Qi-ming FU, Hong-kun SUN, Xin ZHOU
Format:	Article
Language:	zho
Published:	Editorial Department of Journal on Communications 2013-10-01
Series:	Tongxin xuebao
Subjects:	reinforcement learning; on-policy; gradient descent; double-layer fuzzy partitioning; continuous action policy
Online Access:	http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.10.011/

Similar Items

Function approximation method based on weights gradient descent in reinforcement learning
by: Xiaoyan QIN, et al.
Published: (2023-08-01)

Gradient descent Sarsa(λ) algorithm based on the adaptive potential function shaping reward mechanism
by: Fei XIAO, et al.
Published: (2013-01-01)

Comparison of the efficiency of zero and first order minimization methods in neural networks
by: E. A. Gubareva, et al.
Published: (2022-12-01)

Research of the Adaptive Fuzzy PID Control System of New Type Double Conical Continuously Variable Transmission
by: Li Jingkui, et al.
Published: (2016-01-01)

Deep Reinforcement Learning-Based Task Partitioning Ratio Decision Mechanism in High-Speed Rail Environments with Mobile Edge Computing Server
by: Seolwon Koo, et al.
Published: (2025-01-01)

On Quantum Natural Policy Gradients
by: Andre Sequeira, et al.
Published: (2024-01-01)

Recursive formulae for the multiplicative partition function
by: Jun Kyo Kim, et al.
Published: (1999-01-01)

Variable-Parameter Impedance Control of Manipulator Based on RBFNN and Gradient Descent
by: Linshen Li, et al.
Published: (2024-12-01)

Congruences modulo $4$ for the number of $3$-regular partitions
by: Ballantine, Cristina, et al.
Published: (2023-11-01)

Tight analyses for subgradient descent I: Lower bounds
by: Harvey, Nicholas J. A., et al.
Published: (2024-07-01)

Actor-critic algorithm with incremental dual natural policy gradient
by: Peng ZHANG, et al.
Published: (2017-04-01)

SGD-TripleQNet: An Integrated Deep Reinforcement Learning Model for Vehicle Lane-Change Decision
by: Yang Liu, et al.
Published: (2025-01-01)

Congruences involving generalized Frobenius partitions
by: James Sellers
Published: (1993-01-01)

Congruences involving F-partition functions
by: James Sellers
Published: (1994-01-01)

Forest fire risk assessment model optimized by stochastic average gradient descent
by: Zexin Fu, et al.
Published: (2025-01-01)

Solving Spatial Optimization Problems via Lagrangian Relaxation and Automatic Gradient Computation
by: Zhen Lei, et al.
Published: (2025-01-01)

Design and implement of integrative access layer in campus network
by: Zhong-nan FU, et al.
Published: (2014-10-01)

Traceability permissioned chain consensus mechanism based on double-layer architecture
by: Qingyang DING, et al.
Published: (2019-04-01)

A proximal policy optimization based deep reinforcement learning framework for tracking control of a flexible robotic manipulator
by: Joshi Kumar V, et al.
Published: (2025-03-01)

Smoothing gradient descent algorithm for the composite sparse optimization
by: Wei Yang, et al.
Published: (2024-11-01)

A Low-Resistance Hybrid Nanomaterial Composed of Double-Charged Ionic Liquid and a Layered Double Hydroxide (Type Al/Mg-Dodecyl Sulfate) for Electrochemical Determination of Methyldopa
by: Mir Mahdi Abolghasemi, et al.
Published: (2024-01-01)

On partitions with difference conditions
by: José Plínio O. Santos, et al.
Published: (2000-01-01)

Digital Twin-Empowered Robotic Arm Control: An Integrated PPO and Fuzzy PID Approach
by: Yuhao Cen, et al.
Published: (2025-01-01)

Review on the contribution of ultrasounds in layered double hydroxides synthesis and in their performances
by: Kalawoun, Hamed, et al.
Published: (2023-11-01)

Double-Network-Layer, a New Solution for IPng
by: Shi Li
Published: (2014-08-01)

Efficient authentication scheme for double-layer satellite network
by: Hui ZHU, et al.
Published: (2019-03-01)

A core-shell magnetic layered double hydroxide composite material for the effective decolorization of phenol red
by: Şakir Yılmaz
Published: (2022-06-01)

Hybrid spin coating and evaporation techniques for fabricating double-layer stacked dual-color perovskite LEDs
by: Ching-Ho Tien, et al.
Published: (2025-01-01)

Convergence Rates of Gradient Methods for Convex Optimization in the Space of Measures
by: Chizat, Lénaïc
Published: (2023-01-01)

Bandlimited Frequency-Constrained Iterative Methods
by: Harrison Garrett, et al.
Published: (2025-01-01)

Design of hybridly-connected hybrid precoding in millimeter-wave massive MIMO system
by: Hongyu ZHAO, et al.
Published: (2020-03-01)

Magnesium layered double hydroxide nanofiller in PMMA exposed to gamma irradiation
by: Dandara Luz Souza de Oliveira, et al.
Published: (2025-02-01)

Fusion of satellite-ground and inter-satellite AKA protocols for double-layer satellite networks
by: Jin CAO, et al.
Published: (2023-02-01)

Cooperative inference analysis based on DNN convolutional kernel partitioning
by: Jialin ZHI, et al.
Published: (2022-12-01)

Optimal memory time Cache partitioning in chip-multiprocessors
by: Hao LI, et al.
Published: (2012-04-01)

Understanding the sorbent properties of layered double hydroxide for the removal of pharmaceuticals from aqueous Solutions: A comprehensive review
by: Zahraa Ali Hammood, et al.
Published: (2025-01-01)

Zero-sum partition theorems for graphs
by: Y. Caro, et al.
Published: (1994-01-01)

Reinforced Cost-Sensitive Graph Network for Detecting Fraud Leaders in Telecom Fraud
by: Peiwen Gao, et al.
Published: (2024-01-01)

Exploring the Photocatalytic Efficiency of Gold Nanoparticles Deposited on Ni-Al-Zr-Layered Double Hydroxides for Selective Glucose Oxidation
by: Nihel Dib, et al.
Published: (2024-12-01)

High-order fuzzy time series self-adaption prediction method based on spectral clustering
by: Chun-nan ZHOU, et al.
Published: (2016-02-01)