Selective imitation for efficient online reinforcement learning with pre-collected data
Deep reinforcement learning (RL) has emerged as a promising solution for autonomous devices requiring sequential decision-making. In the online RL framework, the agent must interact with the environment to collect data, making sample efficiency the most challenging aspect. While the off-policy metho...
Saved in:
Main Authors: | Chanin Eom, Dongsu Lee, Minhae Kwon |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2024-12-01
|
Series: | ICT Express |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2405959524001048 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Optimization of Jamming Type Selection for Countering Multifunction Radar Based on Generative Adversarial Imitation Learning
by: Tianjian Yang, et al.
Published: (2025-01-01) -
Reducing risk for assistive reinforcement learning policies with diffusion models
by: Андрій Титаренко
Published: (2024-09-01) -
Improving Reinforcement Learning with Expert Demonstrations and Vision Transformers for Autonomous Vehicle Control
by: Badr Ben Elallid, et al.
Published: (2024-12-01) -
Joint Learning of Volume Scheduling and Order Placement Policies for Optimal Order Execution
by: Siyuan Li, et al.
Published: (2024-11-01) -
Online hierarchical reinforcement learning based on interrupting Option
by: Fei ZHU, et al.
Published: (2016-06-01)