DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems

For the intelligent reflecting surface (IRS)-assisted multiple input single output (MISO) simultaneous wireless information and power transfer (SWIPT) system, the beam forming vector at the base station and the reflected beam forming vector of the IRS were jointly optimized, by considering the maxim...

Full description

Saved in:

Bibliographic Details
Main Authors:	LUO Liping, PAN Weimin
Format:	Article
Language:	zho
Published:	China InfoCom Media Group 2024-06-01
Series:	物联网学报
Subjects:	multiple input single output simultaneous wireless information and power transfer intelligent reflecting surface beam forming deep deterministic policy gradient
Online Access:	http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2024.00389/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841533798976585728
author	LUO Liping PAN Weimin
author_facet	LUO Liping PAN Weimin
author_sort	LUO Liping
collection	DOAJ
description	For the intelligent reflecting surface (IRS)-assisted multiple input single output (MISO) simultaneous wireless information and power transfer (SWIPT) system, the beam forming vector at the base station and the reflected beam forming vector of the IRS were jointly optimized, by considering the maximum transmit power of the base station, the unit modulus constraint of the IRS reflection phase shift matrix, and the minimum energy constraint of the energy receiver. The object was to maximize the spectrum efficiency. To solve the non-convex optimization problem, a deep deterministic policy gradient (DDPG) algorithm based on deep reinforcement learning was proposed. Simulation results show that the average reward of the DDPG algorithm is related to the learning rate. Under the condition of selecting the appropriate learning rate, the DDPG algorithm can obtain an average mutual information similar to that of the traditional optimization algorithm, but the running time is significantly lower than that of the traditional non-convex optimization algorithm. Even if the number of antennas and the number of reflective units are increased, the DDPG algorithm can still converge in a short period of time. This indicates that the DDPG algorithm can effectively improve the computational efficiency and is suitable for communication services with high real-time requirements.
format	Article
id	doaj-art-38a1a07f347340f893b95dac3e68d9a7
institution	Kabale University
issn	2096-3750
language	zho
publishDate	2024-06-01
publisher	China InfoCom Media Group
record_format	Article
series	物联网学报
spelling	doaj-art-38a1a07f347340f893b95dac3e68d9a72025-01-15T02:54:08ZzhoChina InfoCom Media Group物联网学报2096-37502024-06-018465567576801DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systemsLUO LipingPAN WeiminFor the intelligent reflecting surface (IRS)-assisted multiple input single output (MISO) simultaneous wireless information and power transfer (SWIPT) system, the beam forming vector at the base station and the reflected beam forming vector of the IRS were jointly optimized, by considering the maximum transmit power of the base station, the unit modulus constraint of the IRS reflection phase shift matrix, and the minimum energy constraint of the energy receiver. The object was to maximize the spectrum efficiency. To solve the non-convex optimization problem, a deep deterministic policy gradient (DDPG) algorithm based on deep reinforcement learning was proposed. Simulation results show that the average reward of the DDPG algorithm is related to the learning rate. Under the condition of selecting the appropriate learning rate, the DDPG algorithm can obtain an average mutual information similar to that of the traditional optimization algorithm, but the running time is significantly lower than that of the traditional non-convex optimization algorithm. Even if the number of antennas and the number of reflective units are increased, the DDPG algorithm can still converge in a short period of time. This indicates that the DDPG algorithm can effectively improve the computational efficiency and is suitable for communication services with high real-time requirements.http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2024.00389/multiple input single outputsimultaneous wireless information and power transferintelligent reflecting surfacebeam formingdeep deterministic policy gradient
spellingShingle	LUO Liping PAN Weimin DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems 物联网学报 multiple input single output simultaneous wireless information and power transfer intelligent reflecting surface beam forming deep deterministic policy gradient
title	DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_full	DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_fullStr	DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_full_unstemmed	DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_short	DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems
title_sort	ddpg based performance optimization algorithm for irs assisted simultaneous wireless information and power transfer systems
topic	multiple input single output simultaneous wireless information and power transfer intelligent reflecting surface beam forming deep deterministic policy gradient
url	http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2024.00389/
work_keys_str_mv	AT luoliping ddpgbasedperformanceoptimizationalgorithmforirsassistedsimultaneouswirelessinformationandpowertransfersystems AT panweimin ddpgbasedperformanceoptimizationalgorithmforirsassistedsimultaneouswirelessinformationandpowertransfersystems

DDPG-based performance optimization algorithm for IRS-assisted simultaneous wireless information and power transfer systems

Similar Items