Value-difference learning based mMTC devices access algorithm in multi-cell network
In the massive machine type communication scenario of 5G, the access congestion problem of massive machine type communication devices (mMTCD) in multi-cell network is very important.A double deep Q network with value-difference based exploration (VDBE-DDQN) algorithm was proposed.The algorithm focus...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Beijing Xintong Media Co., Ltd
2022-06-01
|
Series: | Dianxin kexue |
Subjects: | |
Online Access: | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022152/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841528999723925504 |
---|---|
author | Xin LI Jun SUN |
author_facet | Xin LI Jun SUN |
author_sort | Xin LI |
collection | DOAJ |
description | In the massive machine type communication scenario of 5G, the access congestion problem of massive machine type communication devices (mMTCD) in multi-cell network is very important.A double deep Q network with value-difference based exploration (VDBE-DDQN) algorithm was proposed.The algorithm focused on the solution that could reduce the collision when a number of mMTCDs accessed to eNB in multi-cell network.The state transition process of the deep reinforcement learning algorithm was modeled as Markov decision process.Furthermore, the algorithm used a double deep Q network to fit the target state-action value function, and it employed an exploration strategy based on value-difference to adapt the change of the environment, which could take advantage of both current conditions and expected future needs.Moreover, each mMTCD updated the probability of exploration according to the difference between the current value function and the next value function estimated by the network, rather than using the same standard to select the best base eNB for the mMTCD.Simulation results show that the proposed algorithm can effectively improve the access success rate of the system. |
format | Article |
id | doaj-art-2cc7ab62d83d4cc7a3e76ef876a23474 |
institution | Kabale University |
issn | 1000-0801 |
language | zho |
publishDate | 2022-06-01 |
publisher | Beijing Xintong Media Co., Ltd |
record_format | Article |
series | Dianxin kexue |
spelling | doaj-art-2cc7ab62d83d4cc7a3e76ef876a234742025-01-15T03:27:15ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012022-06-0138829059811270Value-difference learning based mMTC devices access algorithm in multi-cell networkXin LIJun SUNIn the massive machine type communication scenario of 5G, the access congestion problem of massive machine type communication devices (mMTCD) in multi-cell network is very important.A double deep Q network with value-difference based exploration (VDBE-DDQN) algorithm was proposed.The algorithm focused on the solution that could reduce the collision when a number of mMTCDs accessed to eNB in multi-cell network.The state transition process of the deep reinforcement learning algorithm was modeled as Markov decision process.Furthermore, the algorithm used a double deep Q network to fit the target state-action value function, and it employed an exploration strategy based on value-difference to adapt the change of the environment, which could take advantage of both current conditions and expected future needs.Moreover, each mMTCD updated the probability of exploration according to the difference between the current value function and the next value function estimated by the network, rather than using the same standard to select the best base eNB for the mMTCD.Simulation results show that the proposed algorithm can effectively improve the access success rate of the system.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022152/mMTCRAreinforcement learningeNB selection |
spellingShingle | Xin LI Jun SUN Value-difference learning based mMTC devices access algorithm in multi-cell network Dianxin kexue mMTC RA reinforcement learning eNB selection |
title | Value-difference learning based mMTC devices access algorithm in multi-cell network |
title_full | Value-difference learning based mMTC devices access algorithm in multi-cell network |
title_fullStr | Value-difference learning based mMTC devices access algorithm in multi-cell network |
title_full_unstemmed | Value-difference learning based mMTC devices access algorithm in multi-cell network |
title_short | Value-difference learning based mMTC devices access algorithm in multi-cell network |
title_sort | value difference learning based mmtc devices access algorithm in multi cell network |
topic | mMTC RA reinforcement learning eNB selection |
url | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022152/ |
work_keys_str_mv | AT xinli valuedifferencelearningbasedmmtcdevicesaccessalgorithminmulticellnetwork AT junsun valuedifferencelearningbasedmmtcdevicesaccessalgorithminmulticellnetwork |