SDM overview
A recommendation-model paper from Alibaba, published at CIKM '19.
Contributions
Building on prior sequence-based work, it addresses two problems:
- a session contains multiple interest tendencies
- long-term behaviors are varied and complex; to handle this, a long-short term gate is designed to fuse long- and short-term interests
Network architecture
user profile preference
The user vector is $e_u=\mathrm{concat}(\{e^p_u \mid p\in P\})$, where $P=\{age, gender, life\_stage\}$.
short-term preference
- The item vector is $e_i=\mathrm{concat}(\{e^f_i \mid f\in F\})$, where $F=\{id, cate\_first\_level, cate\_leaf\_level, shop, brand\}$. Because of the sparsity caused by the huge number of items, encoding items by item_id alone is far from satisfactory.
- The item vectors are fed into an LSTM, producing $[h^u_1, \ldots, h^u_t]$
- The hidden states are fed into self-attention, producing $[\hat h^u_1, \ldots, \hat h^u_t]$
- user_attention: the weights are given by $\mathrm{softmax}(\langle e_u, \hat h^u_i \rangle)$; unlike self-attention, there is no linear projection into Q/K spaces before the dot product. This step yields the short-term preference $s^u_t$.
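The user-attention step above can be sketched as follows (a minimal NumPy sketch; the dimensions and variable names are illustrative assumptions, not the authors' implementation):

```python
import numpy as np

def user_attention(e_u, H):
    """Attention pooling: query = user profile vector, keys/values = hidden states.

    e_u: (d,)   user profile vector
    H:   (t, d) sequence of hidden states (e.g. self-attention outputs)
    Returns the pooled short-term preference, shape (d,).
    """
    logits = H @ e_u                 # <e_u, h_i> for each time step, shape (t,)
    logits = logits - logits.max()   # subtract max for numerical stability
    weights = np.exp(logits)
    weights /= weights.sum()         # softmax over time steps
    return weights @ H               # weighted sum of hidden states

rng = np.random.default_rng(0)
e_u = rng.standard_normal(8)
H = rng.standard_normal((5, 8))
s_t = user_attention(e_u, H)         # short-term preference s_t^u, shape (8,)
```

Note that the query is the raw profile vector itself; there are no learned Q/K projection matrices in this step.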
long-term preference
- $L^u=\{L^u_f \mid f\in F\}$, with $l^u_k\in L^u_f$, $l^u_k\in \mathbb{R}^d$, where $F$ is the set of fields, same as above. $L^u_f$ is the preference list for one field; entries in the same field share the embedding matrix.
- $z^u_f = \mathrm{user\_attention}(e_u, L^u_f)\in \mathbb{R}^d$, which acts as a pooling step.
- $z^u=\mathrm{concat}(\{z^u_f \mid f\in F\})$, and the long-term preference is $p^u=\tanh(W z^u + b)$.
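The per-field pooling and final projection can be sketched similarly (a NumPy sketch with made-up list lengths and random untrained weights; `attn_pool` is a hypothetical helper mirroring the user-attention formula):

```python
import numpy as np

def attn_pool(e_u, L_f):
    """Pool one field's long-term behavior list with user-profile attention."""
    logits = L_f @ e_u
    w = np.exp(logits - logits.max())
    w /= w.sum()
    return w @ L_f                   # z_f^u, shape (d,)

rng = np.random.default_rng(1)
d = 8
fields = ["id", "cate_first_level", "cate_leaf_level", "shop", "brand"]
e_u = rng.standard_normal(d)
# one behavior embedding list per field; lengths may differ per field
L = {f: rng.standard_normal((int(rng.integers(3, 10)), d)) for f in fields}

z_u = np.concatenate([attn_pool(e_u, L[f]) for f in fields])  # (|F|*d,)
W = rng.standard_normal((d, len(fields) * d)) * 0.1           # projection weights
b = np.zeros(d)
p_u = np.tanh(W @ z_u + b)           # long-term preference p^u, shape (d,)
```

The tanh projection compresses the concatenated per-field summaries back to the same dimension as the short-term preference, so the two can be fused.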
long-short term fusion gate
"we elaborately design a gated neural network":

$G^u=\sigma(W_1 e_u + W_2 s^u_t + W_3 p^u + b), \quad G^u\in \mathbb{R}^d$

This gate controls the proportion of short-term interest in the final representation.
With $\odot$ denoting element-wise multiplication, the fused vector

$o^u_t = G^u \odot s^u_t + (1-G^u)\odot p^u$

is used for retrieval.
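The fusion is a learned per-dimension convex combination of the two preferences; a minimal sketch, with random untrained weights standing in for the learned parameters:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(2)
d = 8
e_u = rng.standard_normal(d)         # user profile vector
s_t = rng.standard_normal(d)         # short-term preference s_t^u
p_u = rng.standard_normal(d)         # long-term preference p^u
W1, W2, W3 = (rng.standard_normal((d, d)) * 0.1 for _ in range(3))
b = np.zeros(d)

# gate in (0, 1)^d: per-dimension share of short-term interest
G = sigmoid(W1 @ e_u + W2 @ s_t + W3 @ p_u + b)
# fused user vector used for retrieval
o_t = G * s_t + (1.0 - G) * p_u
```

Because each dimension of $G^u$ lies in $(0,1)$, every coordinate of $o^u_t$ interpolates between the short-term and long-term signals rather than hard-selecting one of them.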
candidate matching
$\mathrm{score}(\mathrm{item}_i)=\langle o^u_t, v_i\rangle$, $\mathrm{score}(\mathrm{item}_i)\in \mathbb{R}$, $v_i\in V$, where $V$ is a separate item embedding matrix.
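Retrieval then reduces to a maximum-inner-product search over the item matrix; a brute-force sketch (in production an approximate nearest-neighbor index would replace the full dot product):

```python
import numpy as np

rng = np.random.default_rng(3)
d, n_items = 8, 1000
o_t = rng.standard_normal(d)             # fused user vector o_t^u
V = rng.standard_normal((n_items, d))    # separate item embedding matrix

scores = V @ o_t                         # <o_t^u, v_i> for every item
top_k = np.argsort(-scores)[:20]         # indices of the 20 highest-scoring items
```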
Dataset
Interestingly, the offline experiments use JD's dataset rather than Alibaba's own.
Model comparison and ablation study
- SDMMA. Sequential Deep Matching with Multi-head Attention is our multi-head self-attention enhanced model.
- PSDMMA. Personalized SDMMA adds user attention module to mine fine-grained personalized information.
- PSDMMAL. PSDMMA combines representations of short-term sessions and Long-term behaviors.
- PSDMMAL-N. Based on PSDMMAL; during training, the following N items are taken as target classes at the current time step, as Tang and Wang [24] do. N = 5 in this experiment.
- PSDMMAL-NoS. PSDMMAL does Not contain the embeddings of Side information in short-term sessions and long-term behaviors, except for the ID features of item and user.
Summary
The metrics below are relative improvements.
- Adding user_profile: +1.7%
- Adding side_info: +8%
- Adding long-term interest fusion: +1.8%
- Taking the following N items as targets: achieves SOTA on the Taobao dataset; this version was also deployed online, with pCTR +7%, pGMV +4%, and discovery +24%.
- Gate design: SHAN also uses user_profile and long-term interest, but as att(query=user_profile, keys=[long_interest, short_interest]). That design is less expressive than the gate and worse at capturing the correlation between long- and short-term interests.
Official code
The code is of limited reference value. The authors say it is "only the core code; the data processing is not included because it relies on internal ODPS and was removed." But many core methods are also missing definitions, which makes it hard to use as a reference.