Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning
The Internet of things (IoT) era needs to realize the wide coverage and connections for the IoT nodes.However,the IoT communication technology cannot collect data timely in the remote area.UAV has been widely used in the IoT wireless sensor network for the data collection due to its flexibility and...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
China InfoCom Media Group
2020-09-01
|
Series: | 物联网学报 |
Subjects: | |
Online Access: | http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2020.00177/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841531202355331072 |
---|---|
author | Zhiyu MOU Yu ZHANG Dian FAN Jun LIU Feifei GAO |
author_facet | Zhiyu MOU Yu ZHANG Dian FAN Jun LIU Feifei GAO |
author_sort | Zhiyu MOU |
collection | DOAJ |
description | The Internet of things (IoT) era needs to realize the wide coverage and connections for the IoT nodes.However,the IoT communication technology cannot collect data timely in the remote area.UAV has been widely used in the IoT wireless sensor network for the data collection due to its flexibility and mobility.The trajectory design of the UAV assisted sensor network data acquisition was discussed in the proposed scheme,as well as the UAV charging demand in the data collection process was met.Specifically,based on the hierarchical reinforcement learning with the temporal abstraction,a novel option-DQN (option-deep Q-learning) algorithm targeted for the discrete action was proposed to improve the performance of the data collection and trajectory design,and control the UAV to recharge in time to ensure its normal flight.The simulation results show that the training rewards and speed of the proposed method are much better than the conventional DQN (deep Q-learning) algorithm.Besides,the proposed algorithm can guarantee the sufficient power supply of UAV by controlling it to recharge timely. |
format | Article |
id | doaj-art-cdc99fa8e253466ab77196474e904064 |
institution | Kabale University |
issn | 2096-3750 |
language | zho |
publishDate | 2020-09-01 |
publisher | China InfoCom Media Group |
record_format | Article |
series | 物联网学报 |
spelling | doaj-art-cdc99fa8e253466ab77196474e9040642025-01-15T02:52:54ZzhoChina InfoCom Media Group物联网学报2096-37502020-09-014425159645901Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learningZhiyu MOUYu ZHANGDian FANJun LIUFeifei GAOThe Internet of things (IoT) era needs to realize the wide coverage and connections for the IoT nodes.However,the IoT communication technology cannot collect data timely in the remote area.UAV has been widely used in the IoT wireless sensor network for the data collection due to its flexibility and mobility.The trajectory design of the UAV assisted sensor network data acquisition was discussed in the proposed scheme,as well as the UAV charging demand in the data collection process was met.Specifically,based on the hierarchical reinforcement learning with the temporal abstraction,a novel option-DQN (option-deep Q-learning) algorithm targeted for the discrete action was proposed to improve the performance of the data collection and trajectory design,and control the UAV to recharge in time to ensure its normal flight.The simulation results show that the training rewards and speed of the proposed method are much better than the conventional DQN (deep Q-learning) algorithm.Besides,the proposed algorithm can guarantee the sufficient power supply of UAV by controlling it to recharge timely.http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2020.00177/UAVtrajectory designdata collectioncharging |
spellingShingle | Zhiyu MOU Yu ZHANG Dian FAN Jun LIU Feifei GAO Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning 物联网学报 UAV trajectory design data collection charging |
title | Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning |
title_full | Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning |
title_fullStr | Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning |
title_full_unstemmed | Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning |
title_short | Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning |
title_sort | research on the uav aided data collection and trajectory design based on the deep reinforcement learning |
topic | UAV trajectory design data collection charging |
url | http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2020.00177/ |
work_keys_str_mv | AT zhiyumou researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning AT yuzhang researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning AT dianfan researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning AT junliu researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning AT feifeigao researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning |