RLlib

最新推荐文章于 2023-12-27 09:47:20 发布

hanjialeOK

最新推荐文章于 2023-12-27 09:47:20 发布

阅读量1.5k

点赞数

分类专栏：强化学习文章标签： rllib

本文链接：https://blog.csdn.net/weixin_43742643/article/details/121255458

版权

强化学习专栏收录该内容

7 篇文章 1 订阅

订阅专栏

framework 默认使用 tf1

# tf: TensorFlow (static-graph)
# tf2: TensorFlow 2.x (eager)
# tfe: TensorFlow eager
# torch: PyTorch
--config='{"framework": "tf"}'
--config='{"framework": "tf2"}'
# Enable tracing in eager mode. This greatly improves performance, but
# makes it slightly harder to debug since Python code won't be evaluated
# after the initial eager pass. Only possible if framework=tfe.
--config='{"framework": "tf2", "eager_tracing": true}'
--config='{"framework": "tfe"/"tf2"}'
--config='{"framework": "torch"}'
--torch

tf 测试命令

rllib train --run PPO --env PongDeterministic-v4 --checkpoint-freq 100 \
    --config '{"num_workers": 8, "num_gpus": 1}'

pytorch 测试命令

rllib train --run PPO --env PongDeterministic-v4 --checkpoint-freq 100 \
    --config '{"framework": "torch", "num_workers": 8, "num_gpus": 1}'

tensorboard 可视化

tensorboard --logdir=~/ray_results --port 80

模型评估

rllib rollout \
    ~/ray_results/default/PPO_PongDeterministic-v4_0upjmdgr0/checkpoint_1/checkpoint-1 \
    --run PPO --env PongDeterministic-v4 --steps 10000

hanjialeOK

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
RLlib

framework 默认使用 tf1# tf: TensorFlow (static-graph)# tf2: TensorFlow 2.x (eager)# tfe: TensorFlow eager# torch: PyTorch--config='{"framework": "tf"}'--config='{"framework": "tf2"}'# Enable tracing in eager mode. This greatly improves performance, but
复制链接

扫一扫

专栏目录