Deep Reinforcement Learning

Inspired by a classmate, I decided today to start keeping a log of my graduation-project journey.

The project is CS294-112 Deep Reinforcement Learning HW2: Policy Gradients.

Win10+Anaconda3+Python3.6

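I am starting from zero. So far I have spent one week on Andrew Ng's Machine Learning course, and another week on the first course of Andrew Ng's Deep Learning series plus the TensorFlow lecture from the second course.

I have also gone through the first lecture of David Silver's reinforcement learning course, Hung-yi Lee's ML Lecture 28, and the reinforcement learning material from Morvan Python (莫烦Python).

At this point I mostly have theoretical concepts and cannot write a complete program on my own. Fortunately the assignment already provides the code skeleton, which lowers the difficulty.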


The biggest problem so far has been setting up the gym environment. After installing gym, I tested it with the simplest possible code:

import gym
env = gym.make('CartPole-v0')  # create the CartPole environment
env.reset()  # reset it to an initial state

This raised: NotImplementedError: abstract

After falling into countless pits, I finally found the fix in a GitHub issue. It is simple: in Anaconda Prompt, run

pip install pyglet==1.2.4

The cause is probably that the installed pyglet version was too new; rolling back to pyglet 1.2.4 fixes the problem.
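
To check whether an environment might be affected before reinstalling anything, a small script like the one below could help. It is only a sketch, not part of the assignment: the helper names (parse_version, is_newer_than_known_good, installed_pyglet_version) are made up for illustration, and it assumes pyglet exposes its version string as pyglet.version. Treating 1.2.4 as the known-good version comes from the fix above; I did not verify which newer versions actually break gym.

```python
# Hypothetical helper: compare the installed pyglet version with 1.2.4,
# the version that worked in this post. All names here are illustrative.

KNOWN_GOOD = (1, 2, 4)


def parse_version(text):
    """Turn '1.2.4' into (1, 2, 4); suffixes such as 'rc1' are ignored."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def is_newer_than_known_good(version_text):
    """True if version_text is strictly newer than 1.2.4."""
    return parse_version(version_text) > KNOWN_GOOD


def installed_pyglet_version():
    """Return the installed pyglet version string, or None if unavailable."""
    try:
        import pyglet
    except ImportError:
        return None
    # Assumption: pyglet stores its version string in pyglet.version.
    return getattr(pyglet, "version", None)


if __name__ == "__main__":
    version = installed_pyglet_version()
    if version is None:
        print("pyglet is not installed (or has no version attribute)")
    elif is_newer_than_known_good(version):
        print("pyglet", version, "is newer than 1.2.4; consider the downgrade")
    else:
        print("pyglet", version, "is not newer than 1.2.4")
```

If it reports a newer version and gym still fails with the error above, the pip command from the GitHub issue is the thing to try.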



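The second pitfall is installing MuJoCo and mujoco_py.

I still have not solved this one, and I cannot even bring myself to summarize what I have tried. I do not know whether anyone has managed to install it on 64-bit Windows; if you have, please let me know. I have activated MuJoCo's one-month free trial and hope to solve the problem within that month. (I also applied for the one-year free student license, but the email never arrived.)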




