A deep reinforcement learning route planning (path planning) GitHub project

This project combines deep reinforcement learning with traditional algorithms to achieve obstacle-avoiding path planning for UAVs in both static and dynamic environments. In the static environment, fully decentralized DDPG and fully centralized TD3 perform best; in the dynamic environment, PPO, TD3, and DDPG perform comparably, with PPO converging in fewer episodes. Implementations of traditional algorithms such as A* search, RRT, and ant colony optimization are also provided.

GitHub repository: https://github.com/ZYunfeii/UAV_Obstacle_Avoiding_DRL
Accompanying graduation thesis: https://download.csdn.net/download/weixin_43145941/89025980

README

This is a project about a deep reinforcement learning autonomous obstacle avoidance algorithm for UAVs. The project covers obstacle avoidance in both static and dynamic environments. In the static environment, multi-agent reinforcement learning is combined with the artificial potential field algorithm. In the dynamic environment, a disturbed flow field algorithm is combined with single-agent reinforcement learning.
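As a purely illustrative sketch (the repository's actual coupling between the policy and the potential field may differ), one common way to combine the two is to bias the learned action with the APF force; the function name, gain k_apf, and obstacle format below are hypothetical:

```python
import numpy as np

# Hypothetical sketch: bias an RL policy's action with an artificial
# potential field force. Not taken from this repository.
def blended_action(rl_action, uav_pos, goal_pos, obstacles, k_apf=0.3):
    att = goal_pos - uav_pos                      # attraction toward the goal
    att = att / (np.linalg.norm(att) + 1e-9)
    rep = np.zeros_like(uav_pos)
    for center, radius in obstacles:              # obstacles: (center, radius)
        diff = uav_pos - center
        dist = np.linalg.norm(diff)
        if dist < 2.0 * radius:                   # repel only when close
            rep += diff / (dist**2 + 1e-9)
    return rl_action + k_apf * (att + rep)
```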

Static environment

Four methods are implemented:

  1. MADDPG
  2. Fully Centralized DDPG
  3. Fully Decentralized DDPG
  4. Fully Centralized TD3

The third and fourth methods perform better than the others.
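For reference, below is a minimal sketch of the target computation that distinguishes TD3 (clipped double-Q plus target policy smoothing); the network objects, tensor shapes, and hyper-parameters are placeholders, not this repository's classes:

```python
import torch

# Minimal TD3 target sketch: clipped double-Q with target policy smoothing.
# critic1_t/critic2_t/actor_t are target networks (placeholders here).
def td3_target(critic1_t, critic2_t, actor_t, next_obs, reward, done,
               gamma=0.99, noise_std=0.2, noise_clip=0.5, max_action=1.0):
    with torch.no_grad():
        next_act = actor_t(next_obs)
        noise = (torch.randn_like(next_act) * noise_std).clamp(-noise_clip,
                                                               noise_clip)
        next_act = (next_act + noise).clamp(-max_action, max_action)
        # take the minimum of the two target critics to curb overestimation
        q_next = torch.min(critic1_t(next_obs, next_act),
                           critic2_t(next_obs, next_act))
        return reward + gamma * (1.0 - done) * q_next
```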

Dynamic environment

Four methods are implemented:

  1. PPO + GAE (with multi-processing)
  2. TD3
  3. DDPG
  4. SAC

The first three methods perform about equally well. PPO needs fewer episodes to converge; TD3 and DDPG converge fast as well. Although Soft Actor-Critic is an outstanding DRL algorithm, it shows no obvious advantage in this environment.
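For reference, a minimal NumPy sketch of the GAE computation behind method 1; the trajectory arrays and the hyper-parameters gamma and lam are illustrative:

```python
import numpy as np

# Minimal Generalized Advantage Estimation (GAE) sketch for one trajectory.
# rewards[t], values[t] = V(s_t), dones[t] in {0,1}; last_value = V(s_T).
def compute_gae(rewards, values, dones, last_value, gamma=0.99, lam=0.95):
    T = len(rewards)
    advantages = np.zeros(T)
    gae = 0.0
    for t in reversed(range(T)):
        next_value = last_value if t == T - 1 else values[t + 1]
        nonterminal = 1.0 - dones[t]
        # TD residual, then exponentially weighted sum of residuals
        delta = rewards[t] + gamma * next_value * nonterminal - values[t]
        gae = delta + gamma * lam * nonterminal * gae
        advantages[t] = gae
    returns = advantages + np.asarray(values)   # targets for the value net
    return advantages, returns
```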

Traditional methods for UAV path planning

Three traditional methods are implemented in MATLAB:

  1. A* search algorithm
  2. RRT algorithm
  3. Ant colony algorithm

C++:

  1. D* algorithm

The experiments show that the A* search algorithm performs much better than the other traditional methods, but it is still less effective than reinforcement learning path planning.
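For readers unfamiliar with A*, here is a minimal Python sketch on a 2-D occupancy grid (the repository's benchmark is the MATLAB file Astarbenchmark.m; this sketch is only an illustration):

```python
import heapq

# Minimal A* on a 2-D occupancy grid: grid[r][c] == 1 means blocked.
def astar(grid, start, goal):
    def h(a, b):                                  # Manhattan heuristic
        return abs(a[0] - b[0]) + abs(a[1] - b[1])
    open_heap = [(h(start, goal), 0, start, None)]
    came_from, g_cost = {}, {start: 0}
    while open_heap:
        _, g, node, parent = heapq.heappop(open_heap)
        if node in came_from:                     # already expanded
            continue
        came_from[node] = parent
        if node == goal:                          # walk parents back to start
            path = []
            while node is not None:
                path.append(node)
                node = came_from[node]
            return path[::-1]
        r, c = node
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if (0 <= nr < len(grid) and 0 <= nc < len(grid[0])
                    and not grid[nr][nc]):
                ng = g + 1
                if ng < g_cost.get((nr, nc), float("inf")):
                    g_cost[(nr, nc)] = ng
                    heapq.heappush(open_heap,
                                   (ng + h((nr, nc), goal), ng, (nr, nc), node))
    return None                                   # no path exists
```

Example: astar([[0, 0], [1, 0]], (0, 0), (1, 1)) returns [(0, 0), (0, 1), (1, 1)].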

Artificial potential field algorithm

This project provides MATLAB and Python implementations of the artificial potential field algorithm.

Python implementation: ./APF/APFPy2.py, ./APF/APFPy3.py, ./APF/ApfAlgorithm.py (two-dimensional and three-dimensional)

MATLAB implementation: ./APF/APF_matlab (two-dimensional)
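As an illustration of what an APF planner computes, here is a minimal NumPy sketch of the classic attractive/repulsive forces and a gradient-descent path follower; all gains and thresholds (k_att, k_rep, d0, step) are illustrative, not the repository's values:

```python
import numpy as np

# Classic artificial potential field: attraction to the goal plus repulsion
# from obstacles within an influence range d0. Parameters are illustrative.
def apf_force(pos, goal, obstacles, k_att=1.0, k_rep=100.0, d0=2.0):
    force = k_att * (goal - pos)                  # attractive term
    for center, radius in obstacles:
        diff = pos - center
        d = np.linalg.norm(diff) - radius         # distance to obstacle surface
        if 0 < d < d0:                            # repulsion only inside d0
            force += (k_rep * (1.0 / d - 1.0 / d0) / d**2
                      * diff / np.linalg.norm(diff))
    return force

# Follow the negative potential gradient (i.e., the force) toward the goal.
def apf_path(start, goal, obstacles, step=0.05, max_iter=2000):
    pos, goal = np.asarray(start, float), np.asarray(goal, float)
    path = [pos.copy()]
    for _ in range(max_iter):
        f = apf_force(pos, goal, obstacles)
        pos = pos + step * f / (np.linalg.norm(f) + 1e-9)
        path.append(pos.copy())
        if np.linalg.norm(pos - goal) < 0.1:      # close enough to the goal
            break
    return np.array(path)
```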

IFDS and IIFDS algorithm

This is an obstacle avoidance planning algorithm based on flow fields, implemented in MATLAB. The code is in the folder IIFDS_and_IFDS.
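The IFDS/IIFDS equations themselves are in the MATLAB code. To give a flavor of the flow-field idea only (this is not the IFDS formulation), the sketch below evaluates the textbook potential flow of a uniform stream past a circular obstacle, whose streamlines bend smoothly around the obstacle:

```python
import numpy as np

# Textbook 2-D potential flow: uniform stream (speed u_inf along +x) past a
# circular obstacle of given radius. Streamlines curve around the circle,
# which is the intuition behind flow-field obstacle avoidance.
def flow_velocity(p, center, radius, u_inf=1.0):
    x, y = np.asarray(p, float) - np.asarray(center, float)
    r2 = x * x + y * y
    if r2 <= radius**2:                       # inside the obstacle: undefined
        return np.zeros(2)
    a2 = radius**2
    u = u_inf * (1.0 - a2 * (x * x - y * y) / r2**2)
    v = -u_inf * a2 * 2.0 * x * y / r2**2
    return np.array([u, v])
```

Integrating this velocity field from any start point yields a trajectory that slides around the obstacle instead of hitting it.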

How to begin training

For example, to train the agent in the dynamic environment with TD3, simply run main.py, then test.py, and finally open MATLAB and run test.m to draw the results.

If you want to test the model in the environment with 4 obstacles, just run Multi_obstacle_environment_test.py.

Requirements

numpy

torch

matplotlib

seaborn==0.11.1

File descriptions

calGs.m: calculates the index Gs, which measures the quality of a planned route.

calLs.m: calculates the index Ls, which measures the quality of a planned route.

draw.py: contains the Painter class, which draws the reward curves of the various methods.

config.py: defines the parameters used in the training process, such as MAX_EPISODE, batch_size, and so on.
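Purely as an illustration of the kind of settings such a config module centralizes (the actual names and values in config.py may differ):

```python
# Hypothetical training settings; the repository's config.py may differ.
MAX_EPISODE = 500          # number of training episodes
MAX_STEP = 500             # environment steps per episode
batch_size = 256           # minibatch size for network updates
memory_capacity = 100_000  # replay buffer size (off-policy methods)
gamma = 0.99               # reward discount factor
tau = 0.005                # soft target-network update rate
actor_lr = 1e-4            # actor learning rate
critic_lr = 1e-3           # critic learning rate
```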

Method.py: contains many important methods, such as how the agents' rewards are calculated.

static_obstacle_environment.py: defines the parameters of several static obstacle environments.

dynamic_obstacle_environment.py: defines the parameters of several dynamic obstacle environments.

Multi_obstacle_environment_test.py: tests the dynamic model in the environments defined in dynamic_obstacle_environment.py.

data_csv: this folder stores data such as the UAV trajectories and the training rewards.

AntColonybenchmark.m: ACO algorithm implemented in MATLAB.

Astarbenchmark.m: A* algorithm implemented in MATLAB.

RRTbenchmark.m: RRT algorithm implemented in MATLAB.

A simple simulation example

  • See the GitHub repository.

All rights reserved.
