Implementing RBF-Softmax in TensorFlow

醉意流年go

Categories: tensorflow, deep learning, NLP

Article link: https://blog.csdn.net/u010626747/article/details/108728038

Reference paper:

RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax


"""
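RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax
Cross-entropy is one of the most widely used losses in deep learning, but features learned with it tend to show large intra-class variation. The conventional softmax loss only maximizes the gap between intra-class and inter-class logits (which act as distances); it has no representative per-class vector with which to regularize intra-class distances. Earlier methods focused on making each class more compact and overlooked the relationships between different classes. The paper replaces the inner product in softmax with a Radial Basis Function (RBF) distance, which adaptively regularizes both intra-class and inter-class distances and yields better class prototype vectors, improving performance.

GitHub source: https://github.com/2han9x1a0release/RBF-Softmax
Main reference implementation: https://github.com/2han9x1a0release/RBF-Softmax/blob/master/pycls/losses/rbflogit.py

For details, see the original paper.
Chinese walkthrough: https://blog.csdn.net/u011984148/article/details/108688071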

"""

# TensorFlow re-implementation of RBF-Softmax (written against the TF 1.x API: tf.layers, tf.truncated_normal)


import tensorflow as tf


class RBFSoftmax(tf.layers.Layer):

    def __init__(self, feature_dim, num_classes, scale, gamma):
        super(RBFSoftmax, self).__init__()

        self.feature_dim = feature_dim
        self.num_classes = num_classes
        self.scale = scale
        self.gamma = gamma

    def build(self, input_shape):
        # One center (prototype) vector per class, used to compute the RBF score (a bias could also be added)
        self.weight = tf.Variable(tf.truncated_normal(shape=(self.num_classes, self.feature_dim), stddev=0.02))
        # self.bias = tf.Variable([0] * self.num_classes)
        self.built = True

    def call(self, inputs, training=None):
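        """Compute the RBF logits.

        :param inputs: Tensor, shape: (batch, feature_dim)
        :return: Tensor, shape: (batch, num_classes)

        logits[i, j] = scale * exp(-||inputs[i] - weight[j]||^2 / gamma)

        The inputs may first pass through several dense layers before RBFSoftmax is applied.

        """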

        diff = tf.expand_dims(self.weight, axis=0) - tf.expand_dims(inputs, axis=1)
        diff = tf.multiply(diff, diff)
        metric = tf.reduce_sum(diff, axis=-1)  # shape: (batch, num_classes)
        kernel_metric = tf.exp(-1.0 * metric / self.gamma)
        logits = self.scale * kernel_metric
        return logits


"""
Usage demo:

rbflogit = RBFSoftmax(...)
logits = rbflogit(inputs)
true_one_hot_labels = ...
# One-hot labels need the dense variant; sparse_softmax_cross_entropy_with_logits expects integer class ids
losses = tf.nn.softmax_cross_entropy_with_logits_v2(labels=true_one_hot_labels, logits=logits)
loss = tf.reduce_mean(losses)
"""
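
To sanity-check the arithmetic in `call`, here is a minimal pure-Python sketch of the same computation, `scale * exp(-||x - w||^2 / gamma)`, followed by a softmax cross-entropy on the result. The helper names `rbf_logits` and `softmax_cross_entropy`, and the toy prototypes, features, `scale` and `gamma` values, are assumptions made for illustration; they do not come from the paper or the reference repository.

```python
import math


def rbf_logits(features, prototypes, scale, gamma):
    """Return scale * exp(-||x - w||^2 / gamma) for every (sample, class) pair."""
    logits = []
    for x in features:
        row = []
        for w in prototypes:
            # Squared Euclidean distance between the feature and the class prototype
            sq_dist = sum((xi - wi) ** 2 for xi, wi in zip(x, w))
            row.append(scale * math.exp(-sq_dist / gamma))
        logits.append(row)
    return logits


def softmax_cross_entropy(logit_row, label):
    """Cross-entropy of one sample, given its logits and an integer class id."""
    m = max(logit_row)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(v - m) for v in logit_row))
    return log_sum - logit_row[label]


# Hypothetical toy data: two classes, two samples, 2-D features
prototypes = [[0.0, 0.0], [3.0, 4.0]]
features = [[0.0, 0.0], [3.0, 0.0]]
labels = [0, 1]

logits = rbf_logits(features, prototypes, scale=2.0, gamma=8.0)
losses = [softmax_cross_entropy(row, y) for row, y in zip(logits, labels)]
loss = sum(losses) / len(losses)
```

Because the kernel value lies in (0, 1], every logit is bounded by `scale`, and a feature that coincides with a prototype gets exactly `scale` for that class. In this toy setup the first sample sits on its own prototype and receives a small loss, while the second sample is closer to the wrong prototype and receives a larger one.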
