CS224W - Colab 3 MessagePassing实现GraphSAGE_pyg massagepassing实现graphsage-CSDN博客

本文链接：https://blog.csdn.net/qq_31699791/article/details/127910613

Implement the GraphSAGE layer directly

1.GraphSage

对于一个具有编码 $h_v^{l-1}$ 的中心节点 $v$ ，进行下一步状态更新的规则为：
$h_v^{(l)} = W_l\cdot h_v^{(l-1)} + W_r \cdot AGG(\{h_u^{(l-1)}, \forall u \in N(v) \})$
$W_l$ 和 $W_r$ 为可学习的权重， $N (v)$ 代表 $v$ 的邻接节点。 $A G G (\cdot)$ 为消息聚合函数，当采用 mean aggregation时，有
$AGG(\{h_u^{(l-1)}, \forall u \in N(v) \}) = \frac{1}{|N(v)|} \sum_{u\in N(v)} h_u^{(l-1)}$

2.Implement

（1）实现方法

实现分三步，分别为

1）每一个邻居 $u$ 节点传递当前状态 $u^{l-1}$ ；

2）中心节点 $v$ 使用聚合函数聚合收到的消息，在GraphSage中为简单求平均；

3）中心节点使用聚合消息更新自己的状态，在GraphSage中为残差。

（2）实现步骤

pytorch提供了MessagePassing父类，我们借此可以简洁实现消息传递。

class GraphSage(MessagePassing):
    
    def __init__(self, in_channels, out_channels, normalize = True,
                 bias = False, **kwargs):  
        super(GraphSage, self).__init__(**kwargs)

        self.in_channels = in_channels
        self.out_channels = out_channels
        self.normalize = normalize

        self.lin_l=nn.Linear(in_features=in_channels, out_features=out_channels)
        self.lin_r=nn.Linear(in_features=in_channels, out_features=out_channels)
    
    def message(self, x_j):
        out = None
        out = self.lin_r(x_j)
        return out
   
    def aggregate(self, inputs, index, dim_size = None):
        out = None
        node_dim = self.node_dim
        out=torch_scatter.scatter(inputs, index, dim=node_dim,reduce='mean')
        return out
 
    def forward(self, x, edge_index, size = None):
        out=self.propagate(edge_index,x=(x,x))
        out=self.lin_l(x)+out
        if self.normalize:
            out=F.normalize(out)
        return out

①message函数定义全局消息传递的内容。参数x_j描述所有消息传递关系中源节点的特征，形状为 $[|\mathcal{E}|, d]$ ， $\in \mathcal{E}$ .

②aggregate函数定义了中心节点接收和聚合消息的方法。参数inputs是message函数的返回值，index描述了每个中心节点 $v$ 接收来自邻居节点 $u$ 的消息在inputs的哪一行行。scatter函数声明为

torch_scatter.scatter(input: Tensor, index: Tensor, dim: int = -1, out: Optional[Tensor] = None, dim_size: Optional[int] = None, reduce: str = 'sum')→ Tensor[source]

函数功能为用index在dim指定的维度索引张量input，再根据reduce规则计算返回值。

在这里插入图片描述

如图所示，中心节点0的邻居节点在input的第0、1、3个索引。

③propagate函数定义在MessagePassing父类。用于启动一次消息传递过程。edge_index为整张图的边索引信息，形状是 $[2,\mathcal{E}]$ 。参数x存放邻居节点和中心节点的特征。因为每个节点既是中心节点又是邻居节点，且采用一样的特征描述，所以元组的两个元素是一样的。propagate函数会自动调用message和aggregate完成消息传递和消息聚合。