【推荐系统】DUPN:Deep User Perception Network

最新推荐文章于 2024-01-31 01:28:17 发布

布纸所云

最新推荐文章于 2024-01-31 01:28:17 发布

阅读量1.4k

点赞数 2

分类专栏：推荐系统

本文链接：https://blog.csdn.net/XindiOntheWay/article/details/106869786

版权

10 篇文章 3 订阅

订阅专栏

DUPN (Deep User Perception Network) 通过多任务模型来学习一个通用的用户表征。

召回系统的架构如下图所示：
在这里插入图片描述

在这里插入图片描述

DUPN 的输入是按时间顺序排列的用户的行为序列 $x=\{x_1, x_2, \cdots,x_N\}$

$x_i$ 为第 $i$ 个behavior， $x_i=<item_i, property_i>$
$item_i$ 表征商品的特征，包括：
- common features: shop ID, brand, category, item tags (对于长尾商品起主导作用)
- personalized features: item id (对于popular items起主导作用)
$property_i$ 描述行为，包括行为的类型(behavior type，比如点击/收藏/加购物车/购买）、场景(scenario，比如搜索、推荐、广告等)、行为的时间(当前search和行为发生时间间隔、工作日还是周末，早晨还是晚上等)

在这里插入图片描述

处理方式就是将 multi-hot 的 $x_i=[item_i, property_i]$ 通过 linear mapping transform到一个低维空间 $res_i=[e_i, p_i]$ ( $e_i$ 和 $p_i$ 分别表示 item 和 property 的 embedding)：

经过 embedding 层得到的用户行为序列 $\{res_1, res_2,\cdots,res_N\}$ 被喂进 LSTM， LSTM 根据当前的输入 $res_t$ 和之前的隐向量 $h_{t-1}$ 来更新当前的隐向量 $h_t\in R^{d_h}$
考虑到 item embedding $e_i$ 和 behavior property embedding $p_I$ 的不同特点，文章提出了 Property Gated LSTM 对 $e_i$ 和 $p_i$ 区别处理：
- $p_I$ 反映了行为的重要性，因此在 LSTM 中会作为一个强烈的信号，也就是说， $p_I$ 会极大地影响 what to extract, what to remember and what to forward，会输入到 input gate, forget gate 以及 output gate中(上式的(2), (3), (5))
- $e_i$ 描述商品的特征，反映了用户的兴趣，是 $c_t$ 的唯一输入

Property Gated LSTM 的输入是另外一个序列 $\{h_1,h_2,\cdots, h_N\}$ ， $h_i$ 可以视作第 $i$ 个item 的向量表征
$h$ 会被输入到一个注意力网络中，注意力网络会根据当前的query给予各个 $h_i$ 不同的权重，最后用加权和来表征序列 $h$
$rep_s$ 是整个序列 $h$ 的向量表征
$a_t$ 是 $h_t$ 的权重
$attention(:\omega)$ 是两层全连接构成的注意力网络，输入包括当前的query $q$ ，user profile embedding $u$ , behavior$ property $p_t$ 以及当前的隐向量 $p_t$
$u s e r$ $embedding = [rep_s, u]$

上述得到 user embedding 后，我们定义了一些任务以便同时能够学习，对于每个任务而言，其他的任务都视作regularization，多任务学习的共享表达，可以使得 user representation 泛化性更强，更可靠
文中定义了五个任务：
- CTR 预估：user representation $rep_i$ 和 current item resprentation $e_i$ 作为输入，预测 $rep_i$ 点击 $e_i$ 的概率
- L2R: learning to Rank
- PPP: Price Preference Prediction
- FIFP: Fashion Icon Following Prediction
- SPP: Shop Preference Prediction