Selective imitation for efficient online reinforcement learning with pre-collected data

Deep reinforcement learning (RL) has emerged as a promising solution for autonomous devices requiring sequential decision-making. In the online RL framework, the agent must interact with the environment to collect data, making sample efficiency the most challenging aspect. While the off-policy metho...

Bibliographic Details
Main Authors: Chanin Eom, Dongsu Lee, Minhae Kwon
Format: Article
Language: English
Published: Elsevier 2024-12-01
Series: ICT Express
Online Access: http://www.sciencedirect.com/science/article/pii/S2405959524001048