1 / 12
文档名称:

基于强化学习的多无人机协同航迹规划方法 尹依伊.pdf

格式:pdf   大小:983KB   页数:12页
下载后只包含 1 个 PDF 格式的文档,没有任何的图纸或源代码,查看文件列表

如果您已付费下载过本站文档,您可以点这里二次下载

分享

预览

基于强化学习的多无人机协同航迹规划方法 尹依伊.pdf

上传人:李十儿 2022/7/10 文件大小:983 KB

下载得到文件列表

基于强化学习的多无人机协同航迹规划方法 尹依伊.pdf

相关文档

文档介绍

文档介绍:: .
兵工学报
到时间协同与碰撞避免的协同航迹,
并能对环境建模时所未探明的障碍物进行躲避;与 A*算法相比,针对在线应用问题,新算法具有更高的
求解效率。
关键词:航迹规划;Q 学****时间协同;碰撞避免
中图分类号: 文献标志码:A
DOI:.0606
Reinforcement Learning-based Multi-UAVs Path Planning Method
YIN Yiyi1,2,WANG Xiaofang1,ZHOU Jian3
( of Aerospace Engineering,Beijing Institute of Technology,Beijing 100081,China;
Institute of Electronic System Engineering,Beijing 100854,China;
’an Modern Control Technology Research Institute,Xi’an 710065,Shaanxi,China )
Abstract: To solve the path planning problem of multi-UAVs with time cooperation,the battlefield model as
well as the Markov model of a single-UAV path planning is established,and the optimal path is calculated on the
basis of the Q learning algorithm. The Q-table obtained based on the Q learning algorithm is used to calculate the
shortest path of each UAV and the cooperative range,and the time cooperative paths is obtained by adjusting the
action selection strategy of the orbiting UAVs. Considering the collision avoidance problem of multi-UAVs,the
partial area is determined by designing backward parameters,and based on the deep reinforcement learning theory,
neural network is used to replace Q-table to re-plan partial path for UAVs which can avoid the problem of
dimensional explosion. As for the previously unexplored obstacles,the obstacle matrix is designed based on the idea
of the artificial potential field theory,and it is superimposed on the original Q-table to realize the collision avoidance
for the unexplored obstacle. Simulation results verify that the propo