Intelligent anti-jamming decision algorithm of bivariate frequency hopping pattern based on ET-PPO

To further improve the anti-interference ability of frequency hopping communication in complex electromagnetic environments, a PPO algorithm based on weighted importance sampling and eligibility traces (ET-PPO) was proposed. Building on the traditional frequency hopping pattern, time-varying parameters were introduced, and...
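
The summary names weighted importance sampling as the variance-reduction device. The record does not give the paper's exact formulation inside the PPO objective, so the following is only a minimal sketch of the textbook estimator, contrasted with ordinary importance sampling. The function names and the sample numbers are hypothetical, chosen for illustration.

```python
def ordinary_is(ratios, returns):
    """Ordinary importance sampling: divide the weighted sum by the sample count.
    Unbiased, but a single large ratio can dominate the estimate."""
    return sum(r * g for r, g in zip(ratios, returns)) / len(ratios)


def weighted_is(ratios, returns):
    """Weighted importance sampling: divide by the sum of the ratios instead.
    Biased for finite samples, but typically much lower variance."""
    total = sum(ratios)
    if total == 0.0:
        return 0.0  # no usable samples; assumption: fall back to zero
    return sum(r * g for r, g in zip(ratios, returns)) / total


# Hypothetical data: ratios stand for pi_new(a|s) / pi_old(a|s),
# returns stand for the sampled returns (or advantages).
ratios = [0.5, 1.0, 4.0]
returns = [1.0, 2.0, 3.0]

print("ordinary:", ordinary_is(ratios, returns))
print("weighted:", weighted_is(ratios, returns))
```

With non-negative ratios the weighted estimate is a convex combination of the sampled returns, so it cannot leave their range, whereas the ordinary estimate can overshoot when one ratio is large. That boundedness is the usual intuition for why the weighted form has lower variance.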

Bibliographic Details
Main Authors: Yibo CHEN, Zhijin ZHAO
Format: Article
Language: Chinese (zho)
Published: Beijing Xintong Media Co., Ltd 2022-11-01
Series: Dianxin kexue
Subjects:
Online Access: http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022264/
author Yibo CHEN
Zhijin ZHAO
collection DOAJ
description To further improve the anti-interference ability of frequency hopping communication in complex electromagnetic environments, a PPO algorithm based on weighted importance sampling and eligibility traces (ET-PPO) was proposed. Building on the traditional frequency hopping pattern, time-varying parameters were introduced, and the bivariate frequency hopping pattern decision problem was modeled as a Markov decision process by constructing the state-action-reward triple. To address the high variance of the sample-based update of the actor network in PPO, weighted importance sampling was introduced to reduce the variance, and an action selection strategy based on the Beta distribution was used to stabilize the learning stage. To address the slow convergence of the critic (evaluator) network, the eligibility trace method was introduced, which better balanced convergence speed against reaching the global optimal solution. Comparative simulations in different electromagnetic interference environments show that ET-PPO has better adaptability and stability, and performs better against obstruction interference and sweep-frequency interference.
format Article
id doaj-art-cda06251d9f74112a12a7df9119c9991
institution Kabale University
issn 1000-0801
language zho
publishDate 2022-11-01
publisher Beijing Xintong Media Co., Ltd
record_format Article
series Dianxin kexue
topic complex electromagnetic environment
bivariate frequency hopping pattern
proximal policy optimization
eligibility trace
url http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022264/
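
The description in this record says eligibility traces were used to speed up convergence of the critic. The paper's network-based update is not reproduced here; the following is only a minimal tabular TD(lambda) sketch with accumulating traces, to show how a trace spreads one reward back over recently visited states. The function name, the three-state episode, and the hyperparameter values are hypothetical.

```python
def td_lambda_episode(values, episode, alpha=0.1, gamma=0.9, lam=0.8):
    """Run tabular TD(lambda) over one episode and return the updated values.

    values  : list of state-value estimates, updated in place
    episode : list of (state, reward, next_state); next_state is None at the end
    lam     : trace-decay rate; lam=0 reduces to one-step TD
    """
    traces = [0.0] * len(values)
    for state, reward, next_state in episode:
        next_value = 0.0 if next_state is None else values[next_state]
        td_error = reward + gamma * next_value - values[state]
        traces[state] += 1.0  # accumulating trace for the visited state
        for s in range(len(values)):
            # Every state is credited in proportion to its current trace.
            values[s] += alpha * td_error * traces[s]
            traces[s] *= gamma * lam  # traces fade as time passes
    return values


# Hypothetical episode: states 0 -> 1 -> 2, reward only on the last step.
episode = [(0, 0.0, 1), (1, 0.0, 2), (2, 1.0, None)]

with_traces = td_lambda_episode([0.0, 0.0, 0.0], episode, lam=0.8)
one_step = td_lambda_episode([0.0, 0.0, 0.0], episode, lam=0.0)
print("lambda=0.8:", with_traces)
print("lambda=0.0:", one_step)
```

With `lam=0.0` only the final state learns from the terminal reward in this episode, while with `lam=0.8` the earlier states are updated in the same pass. This is the mechanism usually credited for faster value propagation; how the paper balances it against solution quality is not specified in this record.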