Intelligent anti-jamming decision algorithm of bivariate frequency hopping pattern based on ET-PPO
To further improve the anti-interference ability of frequency hopping in complex electromagnetic environments, a PPO algorithm based on weighted importance sampling and eligibility traces (ET-PPO) is proposed. Time-varying parameters are introduced on top of the traditional frequency hopping pattern, and...
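As a rough illustration of why weighted importance sampling lowers variance, the sketch below compares the ordinary estimator with the weighted (self-normalized) one on per-sample probability ratios. It is a minimal sketch under stated assumptions, not the paper's implementation: the function names and the toy numbers are hypothetical, and the record does not say how ET-PPO combines the weighting with the PPO objective.

```python
import math


def importance_ratios(new_logp, old_logp):
    """Per-sample ratios pi_new(a|s) / pi_old(a|s), computed from log-probabilities."""
    return [math.exp(n - o) for n, o in zip(new_logp, old_logp)]


def ordinary_is(ratios, advantages):
    """Ordinary importance sampling: sum of ratio * advantage divided by the sample count."""
    return sum(r * a for r, a in zip(ratios, advantages)) / len(ratios)


def weighted_is(ratios, advantages):
    """Weighted importance sampling: the same sum divided by the sum of the ratios.

    Normalizing by the ratios keeps the estimate inside the range of the
    observed advantages, which is what tames the variance (at the cost of
    a small bias).
    """
    return sum(r * a for r, a in zip(ratios, advantages)) / sum(ratios)


if __name__ == "__main__":
    # Hypothetical toy batch: every sample has advantage 1.0.
    ratios = [2.0, 0.5]
    advantages = [1.0, 1.0]
    print("ordinary:", ordinary_is(ratios, advantages))
    print("weighted:", weighted_is(ratios, advantages))
```

With a constant advantage of 1.0, the weighted estimator returns exactly 1.0 whatever the ratios are, while the ordinary estimator returns 1.25 for this batch because it is pulled around by the ratio values themselves.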
Main Authors: | Yibo CHEN, Zhijin ZHAO |
---|---|
Format: | Article |
Language: | zho |
Published: | Beijing Xintong Media Co., Ltd, 2022-11-01 |
Series: | Dianxin kexue |
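Subjects: | complex electromagnetic environment; bivariate frequency hopping pattern; proximal policy optimization; eligibility trace |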
Online Access: | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022264/ |
_version_ | 1841530784557563904 |
---|---|
author | Yibo CHEN Zhijin ZHAO |
collection | DOAJ |
description | To further improve the anti-interference ability of frequency hopping in complex electromagnetic environments, a PPO algorithm based on weighted importance sampling and eligibility traces (ET-PPO) is proposed. Time-varying parameters are introduced on top of the traditional frequency hopping pattern, and the bivariate frequency hopping pattern decision problem is modeled as a Markov decision problem by constructing a state-action-reward triple. To address the high variance of the sample update method in the actor network of the PPO algorithm, weighted importance sampling is introduced to reduce the variance, and an action selection strategy based on the Beta distribution is used to stabilize the learning stage. To address the slow convergence of the evaluator (critic) network, the eligibility trace method is introduced, which better balances convergence speed against reaching the global optimal solution. Comparative simulations in different electromagnetic interference environments show that ET-PPO has better adaptability and stability, and performs better against obstruction interference and sweep frequency interference. |
format | Article |
id | doaj-art-cda06251d9f74112a12a7df9119c9991 |
institution | Kabale University |
issn | 1000-0801 |
language | zho |
publishDate | 2022-11-01 |
publisher | Beijing Xintong Media Co., Ltd |
record_format | Article |
series | Dianxin kexue |
title | Intelligent anti-jamming decision algorithm of bivariate frequency hopping pattern based on ET-PPO |
topic | complex electromagnetic environment bivariate frequency hopping pattern proximal policy optimization eligibility trace |
url | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022264/ |
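The description credits eligibility traces with speeding up the convergence of the evaluator network (read here as the critic). The sketch below shows the general mechanism with textbook TD(λ) and accumulating traces on a linear value function. It is an illustrative assumption, not the ET-PPO update itself: the function name, the one-hot features, and the hyperparameters are all hypothetical.

```python
def td_lambda_episode(features, rewards, weights, alpha=0.1, gamma=0.9, lam=0.8):
    """Run one episode of TD(lambda) with accumulating traces; return the new weights.

    features: feature vectors of the visited non-terminal states, in order
    rewards:  reward received after leaving each of those states
    weights:  initial weights of the linear value function v(s) = w . x(s)
    The state following the last entry is treated as terminal (value 0).
    """
    w = list(weights)
    trace = [0.0] * len(w)
    for t, reward in enumerate(rewards):
        x = features[t]
        v = sum(wi * xi for wi, xi in zip(w, x))
        if t + 1 < len(features):
            v_next = sum(wi * xi for wi, xi in zip(w, features[t + 1]))
        else:
            v_next = 0.0  # terminal state
        delta = reward + gamma * v_next - v  # one-step TD error
        # Decay the old trace, then add the gradient of v(s_t), which is x.
        trace = [gamma * lam * e + xi for e, xi in zip(trace, x)]
        # Every recently visited state receives a share of the TD error.
        w = [wi + alpha * delta * e for wi, e in zip(w, trace)]
    return w


if __name__ == "__main__":
    # Hypothetical two-state chain with one-hot features; reward only at the end.
    feats = [[1.0, 0.0], [0.0, 1.0]]
    rews = [0.0, 1.0]
    print(td_lambda_episode(feats, rews, [0.0, 0.0], alpha=0.5, gamma=1.0, lam=1.0))
    print(td_lambda_episode(feats, rews, [0.0, 0.0], alpha=0.5, gamma=1.0, lam=0.0))
```

With `lam=1.0` the final reward updates the values of both states in a single pass (`[0.5, 0.5]`), whereas with `lam=0.0` only the last state is updated (`[0.0, 0.5]`). That faster credit propagation is the usual argument for traces improving convergence speed.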