DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems

For the intelligent reflecting surface (IRS)-assisted multiple input single output (MISO) simultaneous wireless information and power transfer (SWIPT) system, the beam forming vector at the base station and the reflected beam forming vector of the IRS were jointly optimized, by considering the maxim...

Full description

Saved in:
Bibliographic Details
Main Authors: LUO Liping, PAN Weimin
Format: Article
Language:zho
Published: China InfoCom Media Group 2024-06-01
Series:物联网学报
Subjects:
Online Access:http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2024.00389/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841533798976585728
author LUO Liping
PAN Weimin
author_facet LUO Liping
PAN Weimin
author_sort LUO Liping
collection DOAJ
description For the intelligent reflecting surface (IRS)-assisted multiple input single output (MISO) simultaneous wireless information and power transfer (SWIPT) system, the beam forming vector at the base station and the reflected beam forming vector of the IRS were jointly optimized, by considering the maximum transmit power of the base station, the unit modulus constraint of the IRS reflection phase shift matrix, and the minimum energy constraint of the energy receiver. The object was to maximize the spectrum efficiency. To solve the non-convex optimization problem, a deep deterministic policy gradient (DDPG) algorithm based on deep reinforcement learning was proposed. Simulation results show that the average reward of the DDPG algorithm is related to the learning rate. Under the condition of selecting the appropriate learning rate, the DDPG algorithm can obtain an average mutual information similar to that of the traditional optimization algorithm, but the running time is significantly lower than that of the traditional non-convex optimization algorithm. Even if the number of antennas and the number of reflective units are increased, the DDPG algorithm can still converge in a short period of time. This indicates that the DDPG algorithm can effectively improve the computational efficiency and is suitable for communication services with high real-time requirements.
format Article
id doaj-art-38a1a07f347340f893b95dac3e68d9a7
institution Kabale University
issn 2096-3750
language zho
publishDate 2024-06-01
publisher China InfoCom Media Group
record_format Article
series 物联网学报
spelling doaj-art-38a1a07f347340f893b95dac3e68d9a72025-01-15T02:54:08ZzhoChina InfoCom Media Group物联网学报2096-37502024-06-018465567576801DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systemsLUO LipingPAN WeiminFor the intelligent reflecting surface (IRS)-assisted multiple input single output (MISO) simultaneous wireless information and power transfer (SWIPT) system, the beam forming vector at the base station and the reflected beam forming vector of the IRS were jointly optimized, by considering the maximum transmit power of the base station, the unit modulus constraint of the IRS reflection phase shift matrix, and the minimum energy constraint of the energy receiver. The object was to maximize the spectrum efficiency. To solve the non-convex optimization problem, a deep deterministic policy gradient (DDPG) algorithm based on deep reinforcement learning was proposed. Simulation results show that the average reward of the DDPG algorithm is related to the learning rate. Under the condition of selecting the appropriate learning rate, the DDPG algorithm can obtain an average mutual information similar to that of the traditional optimization algorithm, but the running time is significantly lower than that of the traditional non-convex optimization algorithm. Even if the number of antennas and the number of reflective units are increased, the DDPG algorithm can still converge in a short period of time. This indicates that the DDPG algorithm can effectively improve the computational efficiency and is suitable for communication services with high real-time requirements.http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2024.00389/multiple input single outputsimultaneous wireless information and power transferintelligent reflecting surfacebeam formingdeep deterministic policy gradient
spellingShingle LUO Liping
PAN Weimin
DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
物联网学报
multiple input single output
simultaneous wireless information and power transfer
intelligent reflecting surface
beam forming
deep deterministic policy gradient
title DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_full DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_fullStr DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_full_unstemmed DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_short DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_sort ddpg based performance optimization algorithm for irs assisted simultaneous wireless information and power transfer systems
topic multiple input single output
simultaneous wireless information and power transfer
intelligent reflecting surface
beam forming
deep deterministic policy gradient
url http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2024.00389/
work_keys_str_mv AT luoliping ddpgbasedperformanceoptimizationalgorithmforirsassistedsimultaneouswirelessinformationandpowertransfersystems
AT panweimin ddpgbasedperformanceoptimizationalgorithmforirsassistedsimultaneouswirelessinformationandpowertransfersystems