UAV Swarm Rounding Strategy Based on Deep Reinforcement Learning Goal Consistency with Multi-Head Soft Attention Algorithm

Aiming at the problem of target rounding by UAV swarms in complex environments, this paper proposes a goal consistency reinforcement learning approach based on multi-head soft attention (GCMSA). Firstly, in order to make the model closer to reality, the reward function when the target is at differen...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhaotian Wei, Ruixuan Wei
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Drones
Subjects:
Online Access:https://www.mdpi.com/2504-446X/8/12/731
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Aiming at the problem of target rounding by UAV swarms in complex environments, this paper proposes a goal consistency reinforcement learning approach based on multi-head soft attention (GCMSA). Firstly, in order to make the model closer to reality, the reward function when the target is at different positions and the target escape strategy are set, respectively. Then, the Multi-head soft attention module is used to promote the information cognition of the target among the UAVs, so that the UAVs can complete the target roundup more smoothly. Finally, in the training phase, this paper introduces cognitive dissonance loss to improve the sample utilization. Simulation experiments show that GCMSA is able to obtain a higher task success rate and is significantly better than MADDPG in terms of algorithm performance.
ISSN:2504-446X