DDPG（6）_ddpg

最新推荐文章于 2022-07-07 22:56:02 发布

度过冰河时期的远古族人

最新推荐文章于 2022-07-07 22:56:02 发布

阅读量1.5k

点赞数

分类专栏： PYTHON

本文链接：https://blog.csdn.net/qq_30626231/article/details/80730854

版权

1、引用Python库

import gym
import tensorflow as tf
import numpy as np
from ou_noise import OUNoise
from critic_network import CriticNetwork 
from actor_network_bn import ActorNetwork
from replay_buffer import ReplayBuffer

2、定义参数

# Hyper Parameters:

REPLAY_BUFFER_SIZE = 1000000
REPLAY_START_SIZE = 10000
BATCH_SIZE = 64
GAMMA = 0.99

3、定义类

class DDPG:
    """docstring for DDPG"""
    def __init__(self, env):
        self.name = 'DDPG' # name for uploading results
        self.environment = env
        # Randomly initialize actor network and critic network
        # with both their target networks
        self.state_dim = env.observation_space.shape[0]

（以下函数均在类DDPG中定义）

3.1 初始化函数

    def __init__(self,

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

度过冰河时期的远古族人

关注关注

0
点赞
踩
4

收藏

觉得还不错? 一键收藏
2
评论
DDPG（6）_ddpg

1、引用Python库import gymimport tensorflow as tfimport numpy as npfrom ou_noise import OUNoisefrom critic_network import CriticNetwork from actor_network_bn import ActorNetworkfrom replay_buffer imp...
复制链接

扫一扫