Learning representation for personalization

最新推荐文章于 2023-05-05 16:33:04 发布

Ayang777

最新推荐文章于 2023-05-05 16:33:04 发布

阅读量222

点赞数

分类专栏：话题模型

话题模型专栏收录该内容

26 篇文章 2 订阅

订阅专栏

IR-web search 领域的用户表示，考虑用户检索的主题和行为分析（后者很有参考意义）

潜在变量模型，表示用户信息，同时构建（build）用户数据（user profile）有助于基于服务的个性化。

即结合了用户的话题兴趣和检索任务行为（coupling user topical interests with their search task behavior）——基于任务的用户分析可能更好的获取其行为、兴趣和偏好（a user representation based on the search tasks users' perform would better capture user actions, interests and preferences）

Prior work has primarily focused on mining general search behaviors but has considerably ignored the importance of identifying individual user's search preferences as well as user variability.（先前的工作关注于挖掘普遍的行为而忽略了识别个体偏好和动态性的重要性）

——基于ODP的手动分类

——基于关键词term：they often limit the scope of personalization as different users inherently follow different distributions over words and queries belonging to the same topic/interest might not contain any over-lapping terms. (限制了个性化的范围，因为用户的单词分布不同，相同话题/兴趣也可能包括不同的单词组）

——最后群组分析：which models users as a mixture over latent user groups wherein each group shares a common distribution over queries and a common click preference pattern.

本文中的方案：

1，话题建模用户的偏好

2，根据访问log文件，识别用户的检索（query）

3，融合话题和任务：利用LDA学习到潜在话题，然后推断用户的话题分布，得到topic profile；学习任务，task profile，用以区别不同的用户，当他们具有相同话题时-比如software engineer和pHD查询相同主题时，期待的结果是不一样的。构建《user，topic，task》张量，基于任务的用户话题偏好。

4，张量分解，得到20维度的向量，作为用户向量