Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization

Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization

Reinforcement learning has recently made great progress in various challenging domains such as board game of Go and MOBA game of StarCraft II. Policy gradient based reinforcement learning method has become the mainstream due to its effectiveness and simplicity both in discrete and continuous scenari...

Full description

Saved in:

Bibliographic Details
Main Authors:	Shengxiang Li, Ou Li, Guangyi Liu, Siyuan Ding, Yijie Bai
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Reinforcement learning policy gradient replay buffer distributed RL
Online Access:	https://ieeexplore.ieee.org/document/9486881/
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Domain Adaptation Using the Replay Buffer: Adaptive Sampling Using Domain-Specific Classifier
by: Seokmin Kim, et al.
Published: (2024-01-01)

Pri-DDQN: learning adaptive traffic signal control strategy through a hybrid agent
by: Yanliu Zheng, et al.
Published: (2024-11-01)

BUFFERING FUNCTION: A GENERAL APPROACH FOR BUFFER BEHAVIOR
by: André Fernando de Oliveira
Published: (2020-09-01)

Push based buffer setting strategy for high density linecard of small buffer size
by: LI Yu-feng1, et al.
Published: (2008-01-01)

Concurrent Learning of Control Policy and Unknown Safety Specifications in Reinforcement Learning
by: Lunet Yifru, et al.
Published: (2024-01-01)

Alternative buffer systems in biopharmaceutical formulations and their effect on protein stability
by: Lebar Blaž, et al.
Published: (2024-09-01)

Assessing Buffer Gradient Synergies: Comparing Objective and Subjective Evaluations of Urban Park Ecosystem Services in Century Park, Shanghai
by: Weixuan Wei, et al.
Published: (2024-11-01)

A Reinforcement Learning Approach for Graph Rule Learning
by: Zhenzhen Mai, et al.
Published: (2025-02-01)

Robot Dynamic Path Planning Based on Prioritized Experience Replay and LSTM Network
by: Hongqi Li, et al.
Published: (2025-01-01)

BUFFER PROTECTION CHARACTERISTICS STUDY OF TRAJECTORY CORRECTION FUZE ROTATION ISOLATION PLATFORM
by: LI XinPeng, et al.
Published: (2019-01-01)

Reconfigurable buffer mechanism in switch fabric design
by: Hui-qun HUANG, et al.
Published: (2012-10-01)

An Adaptive Buffer Management Algoritbm Based on Queue Prediction for Wireless Video Communication
by: Chao Qian, et al.
Published: (2014-07-01)

Ebbinghaus forgetting curve and attention mechanism based recommendation algorithm
by: Nan JIN, et al.
Published: (2022-10-01)

Application of Protocol Buffer in Enterprise Cloud Contacts Abstract This paper introduces the method to use Protocol Buffer to update data in enterprise cloud contacts, and compares the time and size of XML（extensible markup language）, JSON（JavaScript object notation）and Protocol Buffer, through the test data.
by: Hao Yin, et al.
Published: (2012-09-01)

Economic analysis of buffer preparation strategy for single-use bioprocessing of monoclonal antibodies
by: Takao ITO, et al.
Published: (2024-11-01)

Autonomous Maneuvering Decision-Making Algorithm for Unmanned Aerial Vehicles Based on Node Clustering and Deep Deterministic Policy Gradient
by: Xianyong Jing, et al.
Published: (2024-12-01)

Avian response to conservation buffers in agricultural landscapes during winter
by: Kristine O. Evans, et al.
Published: (2014-06-01)

An Extension to the stress-buffering model: Timing of support across the lifecourse
by: Phoebe H. Lam
Published: (2024-12-01)

Clustering-based Failed goal Aware Hindsight Experience Replay
by: Taeyoung Kim, et al.
Published: (2024-12-01)

Determination of Adjacent Visual Buffer Zones for the Temple Town of Chiang Mai City
by: Janjira Sukwai, et al.
Published: (2024-10-01)

On Kronecker Product Of Two RL-graphs And Some Related Results
by: Elham Raisi Sarbizhan, et al.
Published: (2022-11-01)

Reinforcement Learning-Based Approach to Reduce Velocity Error in Car-Following for Autonomous Connected Vehicles
by: Abu Tayab, et al.
Published: (2024-11-01)

Finding an Optimal Geometric Configuration for TDOA Location Systems With Reinforcement Learning
by: Shengxiang Li, et al.
Published: (2021-01-01)

Reinforcement Learning-Based Adaptive Control of a Piezo-Driven Nanopositioning System
by: Liheng Chen, et al.
Published: (2024-01-01)

The Cyclical Behavior of Capital Buffer of Indonesian Islamic and Conventional Banks
by: Nur Atikah, et al.
Published: (2023-01-01)

Buffered 4% Articaine Reduces Pain and Enhances Anesthesia in Maxillary Third Molar Extractions: A Randomized, Double-Blind Split-Mouth Study
by: Henning Staedt, et al.
Published: (2024-11-01)

Developing an Investment Method for Securities With Reinforcement Learning
by: Weiwei Song, et al.
Published: (2024-01-01)

Effect of custom-designed transfection buffer on delivery of genome modification components into primary cells of buffalo, cattle, goats, and sheep
by: Shreya Malhotra, et al.
Published: (2023-11-01)

APER-DDQN: UAV Precise Airdrop Method Based on Deep Reinforcement Learning
by: Yan Ouyang, et al.
Published: (2022-01-01)

BUFFER ZONE IN INDONESIAN TERRITORY TO ANTICIPATE IMPACTS OF NUCLEAR WAR ON KOREAN PENINSULA
by: Riko FENDIAWAN, et al.
Published: (2024-10-01)

Buffer-aided cooperative NOMA with power transfer
by: Long YANG, et al.
Published: (2023-06-01)

An Analysis of Buffer Solution Topics in Chemistry Textbooks: Application of the 4 Step Teaching Material Development Criteria
by: Lisya Asmiati, et al.
Published: (2024-12-01)

Effect of bicarbonate buffer on artificial membrane permeation of drugs
by: Shiori Ishida, et al.
Published: (2025-01-01)

A Recursive Optimization Approach for Buffer Allocation in Large Production Lines
by: Lei Li, et al.
Published: (2024-10-01)

Measurement of video initial buffer size for mobile network
by: Guang CHENG, et al.
Published: (2019-10-01)

Accelerated traffic replay method based on time compression
by: Shuo WANG, et al.
Published: (2021-10-01)

A Security Design for the Detecting of Buffer Overflow Attacks in IoT Device
by: Bin Xu, et al.
Published: (2018-01-01)

On Quantum Natural Policy Gradients
by: Andre Sequeira, et al.
Published: (2024-01-01)

Thermal buffering-controlled temperature variation between Mg–Al-rich rocks and migmatites
by: Samantha March, et al.
Published: (2025-01-01)

People Attitude Toward Promotion of Agroforestry Practices in Buffer Zone Area of Mt. Elgon National Park, Uganda
by: Buyinza Mukadasi, et al.
Published: (2007-01-01)