深入浅出强化学习:原理入门_强化学习:表面解释

深入浅出强化学习:原理入门

Artificial Intelligence (AI) has become a huge buzz word in the past 5 years or more, and more and more people are being clued up about Artificial Neural Networks that can be trained in two different ways, namely supervised learning and unsupervised learning. However, there is one more that doesn’t really fall under either of the two mentioned categories and this is called reinforcement learning.

在过去的5年或更长的时间里,人工智能(AI)已成为一个热门话题,越来越多的人开始关注可以通过两种不同方式进行训练的人工神经网络,即监督学习和非监督学习。 但是,还有另外一种并没有真正属于上述两种类别,这称为强化学习。

Reinforcement learning is generally used on already established neural network models to encourage specific behaviors to achieve more of a favored outcome. Reinforcement learning currently has been used as a buzz word, and in these cases, it just placed into a black box.

强化学习通常用于已经建立的神经网络模型上,以鼓励特定的行为获得更多的满意结果。 强化学习目前已被用作流行语,在这些情况下,它只是放在一个黑盒子中。

In this article, I want to give a surface level explanation of what reinforcement learning is, by opening this “black box” that has been thrown around and expected to do amazing things.

在本文中,我想通过打开这个“黑匣子”来对强化学习是一个表面层面的解释,该“黑匣子”被扔来扔去,并有望做奇妙的事情。

Image for post

这个怎么运作?

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值