Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning

The Internet of things (IoT) era needs to realize the wide coverage and connections for the IoT nodes.However,the IoT communication technology cannot collect data timely in the remote area.UAV has been widely used in the IoT wireless sensor network for the data collection due to its flexibility and...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhiyu MOU, Yu ZHANG, Dian FAN, Jun LIU, Feifei GAO
Format: Article
Language:zho
Published: China InfoCom Media Group 2020-09-01
Series:物联网学报
Subjects:
Online Access:http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2020.00177/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841531202355331072
author Zhiyu MOU
Yu ZHANG
Dian FAN
Jun LIU
Feifei GAO
author_facet Zhiyu MOU
Yu ZHANG
Dian FAN
Jun LIU
Feifei GAO
author_sort Zhiyu MOU
collection DOAJ
description The Internet of things (IoT) era needs to realize the wide coverage and connections for the IoT nodes.However,the IoT communication technology cannot collect data timely in the remote area.UAV has been widely used in the IoT wireless sensor network for the data collection due to its flexibility and mobility.The trajectory design of the UAV assisted sensor network data acquisition was discussed in the proposed scheme,as well as the UAV charging demand in the data collection process was met.Specifically,based on the hierarchical reinforcement learning with the temporal abstraction,a novel option-DQN (option-deep Q-learning) algorithm targeted for the discrete action was proposed to improve the performance of the data collection and trajectory design,and control the UAV to recharge in time to ensure its normal flight.The simulation results show that the training rewards and speed of the proposed method are much better than the conventional DQN (deep Q-learning) algorithm.Besides,the proposed algorithm can guarantee the sufficient power supply of UAV by controlling it to recharge timely.
format Article
id doaj-art-cdc99fa8e253466ab77196474e904064
institution Kabale University
issn 2096-3750
language zho
publishDate 2020-09-01
publisher China InfoCom Media Group
record_format Article
series 物联网学报
spelling doaj-art-cdc99fa8e253466ab77196474e9040642025-01-15T02:52:54ZzhoChina InfoCom Media Group物联网学报2096-37502020-09-014425159645901Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learningZhiyu MOUYu ZHANGDian FANJun LIUFeifei GAOThe Internet of things (IoT) era needs to realize the wide coverage and connections for the IoT nodes.However,the IoT communication technology cannot collect data timely in the remote area.UAV has been widely used in the IoT wireless sensor network for the data collection due to its flexibility and mobility.The trajectory design of the UAV assisted sensor network data acquisition was discussed in the proposed scheme,as well as the UAV charging demand in the data collection process was met.Specifically,based on the hierarchical reinforcement learning with the temporal abstraction,a novel option-DQN (option-deep Q-learning) algorithm targeted for the discrete action was proposed to improve the performance of the data collection and trajectory design,and control the UAV to recharge in time to ensure its normal flight.The simulation results show that the training rewards and speed of the proposed method are much better than the conventional DQN (deep Q-learning) algorithm.Besides,the proposed algorithm can guarantee the sufficient power supply of UAV by controlling it to recharge timely.http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2020.00177/UAVtrajectory designdata collectioncharging
spellingShingle Zhiyu MOU
Yu ZHANG
Dian FAN
Jun LIU
Feifei GAO
Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning
物联网学报
UAV
trajectory design
data collection
charging
title Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning
title_full Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning
title_fullStr Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning
title_full_unstemmed Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning
title_short Research on the UAV-aided data collection and trajectory design based on the deep reinforcement learning
title_sort research on the uav aided data collection and trajectory design based on the deep reinforcement learning
topic UAV
trajectory design
data collection
charging
url http://www.wlwxb.com.cn/zh/article/doi/10.11959/j.issn.2096-3750.2020.00177/
work_keys_str_mv AT zhiyumou researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning
AT yuzhang researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning
AT dianfan researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning
AT junliu researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning
AT feifeigao researchontheuavaideddatacollectionandtrajectorydesignbasedonthedeepreinforcementlearning