Paper Reading 7: A Reinforcement-Learning-Based Recommender System. DRN: A Deep Reinforcement Learning Framework for News Recommendation
ABSTRACT
In this paper, we propose a novel Deep Reinforcement Learning framework for news recommendation.
The authors propose a reinforcement learning (RL) method for news recommendation.
Online personalized news recommendation is a highly challenging problem due to the dynamic nature of news features and user preferences. Although some online recommendation models have been proposed to address the dynamic nature of news recommendation, these methods have three major issues.
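News recommendation is very challenging because both news features and user preferences change quickly. Existing online recommendation methods have the following shortcomings.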
First, they only try to model current reward (e.g., Click Through Rate).
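1. They only model the current (immediate) reward. This sets up the RL method introduced later, since RL is designed to optimize long-term reward.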
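To make the difference between current and long-term reward concrete, here is a minimal sketch. It is not taken from the paper: the click sequences, the discount factor `gamma=0.9`, and the function name `discounted_return` are all assumptions chosen for illustration. It compares two hypothetical recommendation strategies by their first-step reward and by their discounted cumulative reward.

```python
def discounted_return(rewards, gamma=0.9):
    """Return sum over t of gamma**t * rewards[t] (gamma is an assumed value)."""
    total = 0.0
    # Accumulate from the last step backwards: G_t = r_t + gamma * G_{t+1}
    for r in reversed(rewards):
        total = r + gamma * total
    return total


# Hypothetical click sequences (1 = click, 0 = no click); not data from the paper.
clickbait = [1, 0, 0, 0]  # wins the first click, then the user disengages
relevant = [0, 1, 1, 1]   # no immediate click, but sustained engagement later

# A model that only looks at the current reward compares the first step.
print("immediate:", clickbait[0], relevant[0])

# A long-term view compares the discounted sum over the whole sequence.
print("long-term:", discounted_return(clickbait), discounted_return(relevant))
```

Under these made-up numbers, the first strategy looks better on immediate reward while the second looks better on discounted return, which is the kind of trade-off a model focused only on current CTR cannot see.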
Second, very few studies consider to use user feedback other than click / no click labels (e.g., how frequent user returns) to help improve recommendation.
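2. They make little use of user feedback; where feedback is considered at all, it is limited to click / no click labels. (The feedback signal is not rich enough, so the paper later adds how often the user returns as a supplementary signal.)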