Z-Score Experience Replay in Off-Policy Deep Reinforcement Learning

Reinforcement learning, as a machine learning method that does not require pre-training data, seeks the optimal policy through the continuous interaction between an agent and its environment. It is an important approach to solving sequential decision-making problems. By combining it with deep learni...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yana Yang, Meng Xi, Huiao Dai, Jiabao Wen, Jiachen Yang
Format:	Article
Language:	English
Published:	MDPI AG 2024-12-01
Series:	Sensors
Subjects:	deep reinforcement learning off policy priority experience replay z-score
Online Access:	https://www.mdpi.com/1424-8220/24/23/7746
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.mdpi.com/1424-8220/24/23/7746

Z-Score Experience Replay in Off-Policy Deep Reinforcement Learning

Internet

Similar Items