Rl-Camp-Recap

qq_43408107

Tags: machine learning

Article link: https://blog.csdn.net/qq_43408107/article/details/106954003

Reinforcement Learning Camp

Author: Yijia Shaw

Camp held by: Baidu Inc.

Brief intro

Reinforcement learning is a fast-developing branch of AI. Because the agent learns from reward signals rather than labeled data, its training and performance are not limited by the amount of labeled data available, which is a great advantage over supervised or semi-supervised learning.

Course review

Through this course, led by Instructors Ke and Xiao, I learned some fundamental algorithms such as Q-learning, DQN, and policy gradient (PG). What impressed me most was the Q-learning part taught by Ke. I had studied this basic algorithm before, but I didn’t really understand the ideas underlying the code. Through Instructor Ke’s explanations and live videos, I think I now have a better understanding of the Q-table, and of the relationship between the state transition probability and the probability in epsilon-greedy.
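To make the Q-table and epsilon-greedy ideas concrete for myself, here is a minimal sketch of tabular Q-learning. It is not the camp’s code: the five-state chain environment, the function names (`epsilon_greedy`, `step`, `train`) and all hyperparameter values are my own assumptions for illustration. Note that the environment here is deterministic, so the only randomness comes from the epsilon-greedy action choice, not from state transitions.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """With probability epsilon pick a random action, otherwise a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    # Break ties randomly so the untrained agent does not always pick action 0.
    return rng.choice([a for a, v in enumerate(q_row) if v == best])


def step(state, action, n_states):
    """Hypothetical deterministic chain: action 1 moves right, action 0 moves left.

    Reaching the last state ends the episode with reward 1; all other steps give 0.
    """
    if action == 1:
        next_state = min(state + 1, n_states - 1)
    else:
        next_state = max(state - 1, 0)
    done = next_state == n_states - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(n_states=5, episodes=300, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Run tabular Q-learning and return the learned Q-table (one row per state)."""
    rng = random.Random(seed)
    q_table = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = epsilon_greedy(q_table[state], epsilon, rng)
            next_state, reward, done = step(state, action, n_states)
            # Q-learning target: bootstrap from the best action in the next state.
            target = reward if done else reward + gamma * max(q_table[next_state])
            q_table[state][action] += alpha * (target - q_table[state][action])
            state = next_state
    return q_table


if __name__ == "__main__":
    for s, row in enumerate(train()):
        print(s, [round(v, 3) for v in row])
```

Each row of the table holds the estimated return for the two actions in that state. After training, the "move right" entry should be the larger one in every non-terminal state, since that is the only way to reach the reward in this toy chain.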

Reflection

Besides, I have to reflect on my performance in the camp. I do need to improve my self-regulation. I was working on another project, so I didn’t put much effort into the lessons and assignments, but to be honest, there was enough time for both. So I think I have to improve my time management. I plan to recap these later.

Some findings

Another interesting thing is that the reward signal/training curve in reinforcement learning looks different from the one in supervised learning: the latter is smoother. I think it’s because the agent needs to trade off exploration against exploitation. When it explores a new part of the environment, the reward can drop suddenly, but once it has fully exploited what it has learned, the performance gets better and the score rises.
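A toy experiment can illustrate this intuition. The sketch below is my own assumption, not something from the camp: a two-armed bandit with made-up deterministic payouts (`true_rewards`) and a hypothetical helper `run_bandit` that records the reward at every step under an epsilon-greedy policy.

```python
import random


def run_bandit(epsilon, steps=200, seed=0):
    """Play a two-armed bandit with epsilon-greedy and return the per-step rewards."""
    rng = random.Random(seed)
    true_rewards = [0.2, 1.0]  # hypothetical payouts, chosen only for illustration
    estimates = [0.0, 0.0]     # running average reward observed for each arm
    counts = [0, 0]
    rewards = []
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(2)  # explore: pick an arm at random
        else:
            arm = 0 if estimates[0] >= estimates[1] else 1  # exploit
        reward = true_rewards[arm]
        counts[arm] += 1
        # Incremental sample-average update of the chosen arm's estimate.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        rewards.append(reward)
    return rewards


if __name__ == "__main__":
    curve = run_bandit(epsilon=0.3)
    print("late average with exploration:", sum(curve[100:]) / 100)
    print("pure exploitation:", set(run_bandit(epsilon=0.0)))
```

With `epsilon=0.3` the curve mostly sits at the better payout but keeps dipping whenever the agent explores the worse arm, which is the kind of jaggedness I saw in the training curves. With `epsilon=0.0` this particular toy agent never tries the second arm at all and stays at the lower payout, so some exploration is still needed.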

Future

Reinforcement learning is an interesting and promising field in machine learning. I’m interested in it and will devote more time to it 😃
