Recommendation Algorithm Models: DUPN

Latest recommended article published on 2024-09-12 08:38:42

yuexin2020

Tags: algorithms, machine learning, artificial intelligence, deep learning, python

Original link: https://blog.csdn.net/XindiOntheWay/article/details/106869786

Paper: https://arxiv.org/pdf/1805.10727.pdf

Perceive Your Users in Depth: Learning Universal User Representations from Multiple E-commerce Tasks
DUPN (Deep User Perception Network) learns a universal user representation by training a single model on multiple tasks.

1. Model architecture

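The core of the model is embedding + LSTM + attention + multi-task learning. In the paper, the user representation is a 256-dimensional vector obtained by concatenating a 128-dimensional user feature vector with a 128-dimensional user behavior vector.
The input to DUPN is the user's behavior sequence in chronological order, $x = \{x_1, x_2, \cdots, x_N\}$.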
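
To make the dimensions concrete, here is a minimal NumPy sketch of the concatenation step only. The random vectors are hypothetical stand-ins for the real model outputs, and the concatenation order is an assumption, since the text above does not specify it.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins: in the real model these would come from the user
# feature embedding and from the pooled behavior sequence, respectively.
user_features = rng.normal(size=128)    # 128-d user feature vector
behavior_vector = rng.normal(size=128)  # 128-d summary of the behavior sequence

# Concatenate the two halves into the 256-d user representation.
# (Order is an assumption made for this sketch.)
user_repr = np.concatenate([user_features, behavior_vector])
print(user_repr.shape)  # (256,)
```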

2. Model features

2.1 Property Gated LSTM

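The LSTM is modified so that the item embedding $e_i$ and the behavior property embedding $p_i$ are handled separately, according to their different characteristics:

$p_i$ reflects the importance of a behavior, so it acts as a strong signal inside the LSTM. In other words, $p_i$ heavily influences what to extract, what to remember, and what to forward, and it is fed into the input gate, forget gate, and output gate (equations (2), (3), and (5)).
$e_i$ describes the item's features and reflects the user's interests; of the two embeddings, it is the only one that feeds the cell state $c_t$.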
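
The gating described above can be sketched as a single recurrent step in NumPy. This is an illustrative reading of the description, not the paper's reference implementation: the parameter layout, the helper names `init_params` and `pg_lstm_step`, and the toy dimensions are all assumptions. The point it shows is that the three gates see $p_t$, while the cell candidate sees only $e_t$ and the previous hidden state.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_params(d_e, d_p, d_h, rng):
    """Random weights for one cell; names and layout are illustrative only."""
    def w(rows, cols):
        return rng.normal(scale=0.1, size=(rows, cols))
    params = {}
    for gate in ("i", "f", "o"):
        # Gates see the item embedding, the previous hidden state AND the property embedding.
        params[gate] = (w(d_h, d_e), w(d_h, d_h), w(d_h, d_p), np.zeros(d_h))
    # The cell candidate sees only the item embedding and the previous hidden state.
    params["c"] = (w(d_h, d_e), w(d_h, d_h), np.zeros(d_h))
    return params

def pg_lstm_step(e_t, p_t, h_prev, c_prev, params):
    """One step of a property-gated LSTM cell (sketch)."""
    def gate(name):
        W_e, W_h, W_p, b = params[name]
        return sigmoid(W_e @ e_t + W_h @ h_prev + W_p @ p_t + b)
    i_t, f_t, o_t = gate("i"), gate("f"), gate("o")
    W_e, W_h, b = params["c"]
    c_tilde = np.tanh(W_e @ e_t + W_h @ h_prev + b)  # p_t does not appear here
    c_t = f_t * c_prev + i_t * c_tilde
    h_t = o_t * np.tanh(c_t)
    return h_t, c_t

rng = np.random.default_rng(0)
d_e, d_p, d_h, N = 8, 4, 6, 5           # toy sizes, not the paper's
params = init_params(d_e, d_p, d_h, rng)
items = rng.normal(size=(N, d_e))       # e_1 .. e_N
props = rng.normal(size=(N, d_p))       # p_1 .. p_N

h, c = np.zeros(d_h), np.zeros(d_h)
hidden_states = []
for e_t, p_t in zip(items, props):
    h, c = pg_lstm_step(e_t, p_t, h, c, params)
    hidden_states.append(h)
hidden_states = np.stack(hidden_states)
print(hidden_states.shape)  # (5, 6)
```

Running the loop over a behavior sequence yields one hidden state per behavior, which is the sequence the attention net consumes.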

2.2 Attention Net

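The output of the Property Gated LSTM is another sequence, $h$.
$h$ is fed into an attention network, which assigns a weight to each $h_i$ according to the current query.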
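
A minimal sketch of query-dependent attention pooling follows. It assumes a simple additive scoring function over the pair ($h_i$, query); the scoring form, the function name `attention_pool`, and the parameters `W` and `v` are assumptions made for illustration, and the paper's attention net may take additional inputs.

```python
import numpy as np

def attention_pool(H, q, W, v):
    """Weight each hidden state h_i by its relevance to the query q, then sum."""
    # Pair every h_i with the same query vector: shape (N, d_h + d_q).
    features = np.concatenate([H, np.tile(q, (H.shape[0], 1))], axis=1)
    scores = np.tanh(features @ W) @ v   # one scalar score per step, shape (N,)
    scores = scores - scores.max()       # stabilize the softmax numerically
    weights = np.exp(scores) / np.exp(scores).sum()
    return weights @ H, weights

rng = np.random.default_rng(0)
N, d_h, d_q, d_a = 5, 6, 4, 8            # toy sizes, not the paper's
H = rng.normal(size=(N, d_h))            # hidden states h_1 .. h_N
q = rng.normal(size=d_q)                 # embedding of the current query
W = rng.normal(scale=0.1, size=(d_h + d_q, d_a))
v = rng.normal(scale=0.1, size=d_a)

pooled, weights = attention_pool(H, q, W, v)
print(pooled.shape)  # (6,)
```

Because the weights depend on the query, the same behavior history can produce a different pooled vector for each query.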

2.3 Multi-task learning

The paper defines five tasks:
CTR prediction: takes the user representation and the current item representation as input and predicts the click-through rate (CTR)
L2R: Learning to Rank
PPP: Price Preference Prediction
FIFP: Fashion Icon Following Prediction
SPP: Shop Preference Prediction
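
The idea of several tasks sharing one user representation can be sketched as follows. Only two heads are shown, and almost everything here is an assumption made for illustration: the linear heads, treating PPP as classification over a made-up number of price levels, the item dimension, and summing the per-task losses into one objective (one common choice, not necessarily the paper's training scheme).

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

rng = np.random.default_rng(0)
d_user, d_item, n_price_levels = 256, 32, 7  # d_item and n_price_levels are made-up sizes

user_repr = rng.normal(size=d_user)  # shared by every task
item_repr = rng.normal(size=d_item)  # current item, used by the CTR head

# One small linear head per task; a real model would use deeper task networks.
w_ctr = rng.normal(scale=0.01, size=d_user + d_item)
W_ppp = rng.normal(scale=0.01, size=(n_price_levels, d_user))

# CTR head: user representation + current item representation -> click probability.
ctr_prob = sigmoid(w_ctr @ np.concatenate([user_repr, item_repr]))
# PPP head (assumed form): user representation -> distribution over price levels.
price_probs = softmax(W_ppp @ user_repr)

# Toy labels, then per-task cross-entropy losses summed into one objective,
# so gradients from every task would flow back into the shared representation.
clicked, price_level = 1, 3
loss_ctr = -(clicked * np.log(ctr_prob) + (1 - clicked) * np.log(1 - ctr_prob))
loss_ppp = -np.log(price_probs[price_level])
total_loss = loss_ctr + loss_ppp
print(total_loss)
```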

3. Code implementation