目录
Lunar Lander 月球着陆器
This environment is part of the Box2D environments. Please read that page first for general information.
此环境是 Box2D 环境的一部分。请先阅读该页面以获取一般信息。
Action Space 动作空间 |
Discrete(4) |
Observation Shape 状态形状 |
(8,) |
Observation High 状态上限 |
[1.5 1.5 5. 5. 3.14 5. 1. 1. ] |
Observation Low 状态下限 |
[-1.5 -1.5 -5. -5. -3.14 -5. -0. -0. ] |
Import |
|
Description 描述
This environment is a classic rocket trajectory optimization problem. According to Pontryagin’s maximum principle, it is optimal to fire the engine at full throttle or turn it off. This is the reason why this environment has discrete actions: engine on or off.
这个环境是一个经典的火箭弹道优化问题。根据 Pontryagin 的最大值原理,最好在全油门时点燃发动机或将其关闭。这就是此