Offline prompt reinforcement learning method based on feature extraction

Recent studies have shown that combining Transformer and conditional strategies to deal with offline reinforcement learning can bring better results. However, in a conventional reinforcement learning scenario, the agent can receive a single frame of observations one by one according to its natural c...

Bibliographic Details
Main Authors:	Tianlei Yao, Xiliang Chen, Yi Yao, Weiye Huang, Zhaoyang Chen
Format:	Article
Language:	English
Published:	PeerJ Inc. 2025-01-01
Series:	PeerJ Computer Science
Subjects:	Offline reinforcement learning; Large language model; Sequence modeling; Prompt learning; Representation learning
Online Access:	https://peerj.com/articles/cs-2490.pdf

Similar Items

Moor: Model-based offline policy optimization with a risk dynamics model
by: Xiaolong Su, et al.
Published: (2024-11-01)

Research on named entity recognition method in cybersecurity based on soft prompt tuning and reinforcement learning
by: TIAN Zeshu, et al.
Published: (2024-10-01)

Stealthy data poisoning attack method on offline reinforcement learning in unmanned systems
by: ZHOU Xue, et al.
Published: (2024-12-01)

A guide to prompt design: foundations and applications for healthcare simulationists
by: Sara Maaz, et al.
Published: (2025-01-01)

ANALYSIS OF THE FACTORS INFLUENCING THE OFFLINE LEARNING READINESS DURING THE COVID-19 PANDEMIC
by: Arlina Dhian Sulistyowati, et al.
Published: (2023-05-01)

Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
by: Yunqiao Fei, et al.
Published: (2025-01-01)

On the legal implications of Large Language Model answers: A prompt engineering approach and a view beyond by exploiting Knowledge Graphs
by: George Hannah, et al.
Published: (2025-01-01)

Evaluation of LLMs accuracy and consistency in the registered dietitian exam through prompt engineering and knowledge retrieval
by: Iman Azimi, et al.
Published: (2025-01-01)

A Survey of Offline Handwriting Signature Verification
by: Jihad Majeed Nori, et al.
Published: (2025-01-01)

Research and application of defense mechanism for prompt injection attack of large language model in financial industry
by: MOU Daen, et al.
Published: (2024-10-01)

LingoTrip: Spatiotemporal context prompt driven large language model for individual trip prediction
by: Zhenlin Qin, et al.
Published: (2025-01-01)

KeyEE: Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt
by: Junwen Duan, et al.
Published: (2024-06-01)

Benefits of Online and Offline Resources for Teaching English and Norwegian as Foreign Languages in a Post-Pandemic Context
by: Diana Lățug, et al.
Published: (2024-10-01)

CuTCP: Custom Text Generation-based Class-aware Prompt Tuning for visual-language models
by: Min Huang, et al.
Published: (2025-01-01)

Survey on large language models alignment research
by: LIU Kunlin, et al.
Published: (2024-06-01)

End-to-End Architecture for English Reading and Writing Content Assessment Based on Prompt Learning
by: Su-Qin Wu, et al.
Published: (2025-01-01)

Ontology-based prompt tuning for news article summarization
by: A. R. S. Silva, et al.
Published: (2025-02-01)

PURE: a Prompt-based framework with dynamic Update mechanism for educational Relation Extraction
by: Xiaohui Cui, et al.
Published: (2024-12-01)

Implementation of home visit method in offline learning during the Covid-19 pandemic
by: Benjamin Metekohy, et al.
Published: (2023-03-01)

Temporal-Spatial Traffic Flow Prediction Model Based on Prompt Learning
by: Siteng Cai, et al.
Published: (2024-12-01)

MDAPT: Multi-Modal Depth Adversarial Prompt Tuning to Enhance the Adversarial Robustness of Visual Language Models
by: Chao Li, et al.
Published: (2025-01-01)

Memory consolidation from a reinforcement learning perspective
by: Jong Won Lee, et al.
Published: (2025-01-01)

Cypress Copilot: Development of an AI Assistant for Boosting Productivity and Transforming Web Application Testing
by: Suresh Babu Nettur, et al.
Published: (2025-01-01)

Measurance of Pitch Plane and Offline Detection of Accuracy on Hirth Coupling
by: Guangjun Shao, et al.
Published: (2019-03-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
by: Biao JIN, et al.
Published: (2023-06-01)

The Time Machine: Future Scenario Generation Through Generative AI Tools
by: Jan Ferrer i Picó, et al.
Published: (2025-01-01)

Automated speech therapy through personalized pronunciation correction using reinforcement learning and large language models
by: Ritika Lakshminarayanan, et al.
Published: (2025-03-01)

Scheduling framework based on reinforcement learning in online-offline colocated cloud environment
by: Ling MA, et al.
Published: (2023-06-01)

Evaluating the Performance of Large Language Models in Predicting Diagnostics for Spanish Clinical Cases in Cardiology
by: Julien Delaunay, et al.
Published: (2024-12-01)

Efficient anomaly detection in tabular cybersecurity data using large language models
by: Xiaoyong Zhao, et al.
Published: (2025-01-01)

Advancements in CNN Architectures for Offline Handwritten Arabic Character Recognition
by: El Ibrahimi Aissam, et al.
Published: (2025-01-01)

Node selection method in federated learning based on deep reinforcement learning
by: Wenchen HE, et al.
Published: (2021-06-01)

An empirical study of LLaMA3 quantization: from LLMs to MLLMs
by: Wei Huang, et al.
Published: (2024-12-01)

Online hierarchical reinforcement learning based on interrupting Option
by: Fei ZHU, et al.
Published: (2016-06-01)

Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research
by: Jan Benad, et al.
Published: (2025-02-01)

A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images
by: Junlong Qiu, et al.
Published: (2025-01-01)

Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification
by: Jiachen Li, et al.
Published: (2025-01-01)

Hate Speech Detection Using Large Language Models: A Comprehensive Review
by: Aish Albladi, et al.
Published: (2025-01-01)

An instruction dataset for extracting quantum cascade laser properties from scientific text
by: Deperias Kerre, et al.
Published: (2025-02-01)

Model-Based Graph Reinforcement Learning for Inductive Traffic Signal Control
by: Francois-Xavier Devailly, et al.
Published: (2024-01-01)