Paper Notes: Gemini

最新推荐文章于 2024-07-10 22:17:05 发布

HotChoc

最新推荐文章于 2024-07-10 22:17:05 发布

阅读量402

点赞数

分类专栏： GNN RECOMMENDER SYSTEM 文章标签：深度学习神经网络数据挖掘机器学习

本文链接：https://blog.csdn.net/HotChoc/article/details/109499275

版权

GNN 同时被 2 个专栏收录

9 篇文章 0 订阅

订阅专栏

RECOMMENDER SYSTEM

7 篇文章 0 订阅

订阅专栏

Gemini: A Novel and Universal Heterogeneous Graph Information Fusing Framework for Online Recommendations

LINK: https://doi.org/10.1145/3394486.3403388
CLASSIFICATION: RECOMMENDER-SYSTEM, HETEROGENEOUS NETWORK, GCN
YEAR: Submitted on Aug 2020
FROM: KDD 2020
WHAT PROBLEM TO SOLVE: Researchers have made efforts to utilize additional auxiliary information (e.g., social relations of users) to improve performance. However, such auxiliary information lacks compatibility for all recommendation scenarios, thus it is difficult to apply in some industrial scenarios where generality is required. Moreover, the heterogeneous nature between users and items aggravates the difficulty in network information fusion. In addition, the sparsity of user-item interactions is an urgent problem need to be solved.
SOLUTION: To solve the above problems, we propose a universal and effective framework named Gemini, which only relies on the common interaction logs, avoiding the dependence on auxiliary information and ensuring a better generality.
CORE POINT:
- The Main Contributions:
  1. We propose a new heterogeneous graph fusing framework, Gemini, which does not rely on any auxiliary information, and handles heterogeneous graph more effectively through a novel and effective network transformation. Thus, Gemini can be applied to all kinds of recommended scenario and achieve satisfactory results. To our best knowledge, this is the first work to transform heterogeneous graph to two semi homogeneous graphs that does not miss any key topology information.
  2. We propose a GCN based algorithm which effectively processes graph edge consisting of heterogeneous nodes by capturing the global importance and local importance of these nodes. Simultaneously, through an attention function, the algorithm focuses on more important homogeneous neighbors in aggregation stage. In addition, adding edge information while aggregating information from neighbor nodes can exchange heterogeneous topology information between Gemini-U and Gemini-I. Thus, the information fusion processes on the two graphs are interdependent. To our best knowledge, this is also the first work to take into account the above optimizations.
  3. To some extent, Gemini solves the sparsity problem of user-item interactions. Because, in addition to the first-order neighbor relations of user-item, the second-order neighbor relations of user-user and item-item are introduced to Gemini-U and Gemini-I.
  4. We design a training algorithm, Gemini-Collaboration, that enables the Gemini framework to run on a large-scale dataset.
  5. We conduct extensive offline experiments and deploy an online A/B tests at DiDiChuxing. Experimental results show the superiority of our Gemini over state-of-the-art algorithms.
- Network Transformation
  
  We transforms user-item heterogeneous graph into two semi homogeneous graphs, Gemini-U and Gemini-I, from the perspective of users and items respectively.
- Edge Embedding
  
  First of all, the edge attributes (i.e., nodes in Att-U and Att-I) describe the original first-order neighbor relationship and can be used to measure the strength of neighbor relationships of nodes in Gemini-U or Gemini-I. Second, through sharing node embeddings, the edge attributes can provide information of heterogeneous nodes in another graph.
  This brings two advantages: one is that the topology information of Gemini-U/Gemini-I, especially the high order neighbor relationship, is exchanged to each other; the other is that the separate graphs, Gemini-U and Gemini-I, are closely related to each other and their network information fusion affects each other.
  - Sum Pooling (quantity)
    
    The downside of this approach is that it ignores the importance of the different attribute node.
  - Local & Global Information (quality)
    
    The number of times a node appears on an edge describes the importance of the node to the edge, which we call Local Information because that this is from the perspective of a single edge. Conversely, the more edges a node appears on, the less important it is. We call the IDF of node Global Information because that this is from the perspective of all edges.
  - TF-IDF Pooling
- Information Convolution
  - Attention based Aggregating
    
    Our aggregator function is an attention-layer that combines edge embeddings and node embeddings, which can be formulated as follows:
    
    Edge Vectors:
    
    The attention aggregator can be calculated as follow:
  - Edge CONV
    
    We pass the neighbor information to self node by the following convolution function:
- Gemini Framework
  
  Line 2-12 is the sampling stage of Gemini. Each set $U^k$ contains the nodes that are needed to compute the representations of nodes $u$ ∈ $U^{k+1}$ , the same for each set $V^k$ . Lines 15-20 and 21-27 correspond to the aggregation stage of the user nodes and the item nodes, respectively.
- Gemini-Collaboration Framework
  
  The core idea of Gemini-Collaboration is that in one iteration, when the $z$ -th layer $z$ ∈ {1, · · · , $K$ } embedding is calculated, the embedding of its attribute nodes is the calculated value of this or last iteration and the attribute embedding is not updated in this lteration.
- Experiments
EXISTING PROBLEMS: 404
IMPROVEMENT IDEAS: Edge embedding with TF-IDF Pooling should be normalized or not?

HotChoc

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
2
评论
Paper Notes: Gemini

Gemini: A Novel and Universal Heterogeneous Graph Information Fusing Framework for Online RecommendationsLINK: https://doi.org/10.1145/3394486.3403388CLASSIFICATION: RECOMMENDER-SYSTEM, HETEROGENEOUS NETWORK, GCNYEAR: Submitted on Aug 2020FROM:
复制链接

扫一扫