Offline prompt reinforcement learning method based on feature extraction
Recent studies have shown that combining Transformer and conditional strategies to deal with offline reinforcement learning can bring better results. However, in a conventional reinforcement learning scenario, the agent can receive a single frame of observations one by one according to its natural c...
Saved in:
Main Authors: | Tianlei Yao, Xiliang Chen, Yi Yao, Weiye Huang, Zhaoyang Chen |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2025-01-01
|
Series: | PeerJ Computer Science |
Subjects: | |
Online Access: | https://peerj.com/articles/cs-2490.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Moor: Model-based offline policy optimization with a risk dynamics model
by: Xiaolong Su, et al.
Published: (2024-11-01) -
Research on named entity recognition method in cybersecurity based on soft prompt tuning and reinforcement learning
by: TIAN Zeshu, et al.
Published: (2024-10-01) -
Stealthy data poisoning attack method on offline reinforcement learning in unmanned systems
by: ZHOU Xue, et al.
Published: (2024-12-01) -
A guide to prompt design: foundations and applications for healthcare simulationists
by: Sara Maaz, et al.
Published: (2025-01-01) -
ANALYSIS OF THE FACTORS INFLUENCING THE OFFLINE LEARNING READINESS DURING THE COVID-19 PANDEMIC
by: Arlina Dhian Sulistyowati, et al.
Published: (2023-05-01)