Q-Learning-Based Power Allocation for Secure Wireless Communication in UAV-Aided Relay Network

Unmanned aerial vehicle (UAV)-aided wireless relay networks are at risk of eavesdropping activities due to their open nature. In this paper, we study the security of a UAV-aided selective relaying wireless network in which <inline-formula> <tex-math notation="LaTeX">$N$ </te...

Full description

Saved in:
Bibliographic Details
Main Authors: Sidqy I. Alnagar, Anas M. Salhab, Salam A. Zummo
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9360739/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Unmanned aerial vehicle (UAV)-aided wireless relay networks are at risk of eavesdropping activities due to their open nature. In this paper, we study the security of a UAV-aided selective relaying wireless network in which <inline-formula> <tex-math notation="LaTeX">$N$ </tex-math></inline-formula> UAVs are employed as decode-and-forward (DF) relays linking a ground base station (BS) with <inline-formula> <tex-math notation="LaTeX">$L$ </tex-math></inline-formula> legitimate users on the ground in the presence of a passive eavesdropper (<inline-formula> <tex-math notation="LaTeX">$Eave$ </tex-math></inline-formula>). Direct links between the ground BS and both the ground users and the eavesdropper are assumed to be blocked. The ground-to-air and air-to-ground channels are assumed to follow Rician fading model with opportunistic scheduling scheme for UAVs and users selection. In order to secure data transmissions against such an interception action, the UAV of the worst UAV-selected user link transmits a jamming artificial noise (AN) signal to degrade <inline-formula> <tex-math notation="LaTeX">$Eave$ </tex-math></inline-formula> ability in decoding the confidential information successfully. The transmission outage probability, intercept probability, and hybrid outage probability are derived and analyzed. Due to the heavy computation burden raised by increasing the number of UAVs and users as well as the difficulty in estimating the instantaneous channel state information (CSI), existing traditional optimization methods are not highly efficient in solving the considered power allocation problem. Therefore, we propose a dynamic power control scheme based on Q-learning algorithm combined with statistical CSI where the hybrid outage probability is minimized. Simulation results show that the proposed algorithm efficiently reduces the hybrid outage probability with a noticeable reduction in the computational time.
ISSN:2169-3536