一、Deterministic Policy Gradient 理论 1.On-Policy Deterministic Actor-Critic 2.Off-Policy Deterministic Actor-Critic 二、Deep Deterministic Policy Gradient DDPG实现框架,如下图所示: DDPG算法流程如下: