Z-Score Experience Replay in Off-Policy Deep Reinforcement Learning

Reinforcement learning, as a machine learning method that does not require pre-training data, seeks the optimal policy through the continuous interaction between an agent and its environment. It is an important approach to solving sequential decision-making problems. By combining it with deep learni...

Full description

Saved in:
Bibliographic Details
Main Authors: Yana Yang, Meng Xi, Huiao Dai, Jiabao Wen, Jiachen Yang
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/24/23/7746
Tags: Add Tag
No Tags, Be the first to tag this record!