TensorFlow implementation of style loss and content loss

学术飙

Article link: https://blog.csdn.net/qq_25036523/article/details/107515538

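As other papers show, perceptual loss mainly consists of a content loss and a style loss.

The content loss is the L1 or L2 norm of the difference between the two objects being compared.

The style loss first computes the Gram matrix of each of the two objects, then takes the L1 or L2 norm of the difference between those Gram matrices.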
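
Written out, the two losses might look like the following. This is only a sketch: it assumes the L1 variant and a sum over several feature layers, and the symbols x_l, y_l, k_l and G are notation introduced here rather than taken from the post. G(·) stands for the Gram matrix of a feature map, and any extra normalization of the style term is left out because the post does not state one.

```latex
\mathcal{L}_{\text{content}} = \sum_{l} \frac{1}{k_l} \,\lVert x_l - y_l \rVert_1,
\qquad k_l = h_l \, w_l \, c_l

\mathcal{L}_{\text{style}} = \sum_{l} \lVert G(x_l) - G(y_l) \rVert_1
```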

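The Gram matrix computation can be understood as follows:

The content is a feature map extracted by a network such as VGG, with shape [b, h, w, c], i.e. [batch size, height, width, channels].

The required Gram matrix is obtained by multiplying a [b, c, hw] tensor by a [b, hw, c] tensor, which gives a [b, c, c] result.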
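
The shape bookkeeping above can be checked with a small framework-independent sketch. It uses NumPy rather than TensorFlow purely for illustration, and the function name gram_matrix_np and the example sizes are made up for this check.

```python
import numpy as np


def gram_matrix_np(features):
    """Batched Gram matrix of a [b, h, w, c] feature map, returned as [b, c, c]."""
    b, h, w, c = features.shape
    # [b, h, w, c] -> [b, c, h, w] -> [b, c, h*w]
    flat = features.transpose(0, 3, 1, 2).reshape(b, c, h * w)
    # [b, c, hw] @ [b, hw, c] -> [b, c, c]
    return flat @ flat.transpose(0, 2, 1)


rng = np.random.default_rng(0)
features = rng.standard_normal((2, 4, 5, 3))  # b=2, h=4, w=5, c=3
gram = gram_matrix_np(features)
print(gram.shape)  # (2, 3, 3)
```

Each [c, c] slice is symmetric, since entry (i, j) is the sum over all spatial positions of channel i times channel j.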

The code is as follows:

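import numpy as np
import tensorflow as tf


def ContentLoss(messageresult, compareresult):
    # Both arguments are expected to be sequences of feature-map tensors
    # (e.g. one per layer), each with a static shape [batch, h, w, c].
    result = 0

    for x, y in zip(messageresult, compareresult):
        shape = x.get_shape().as_list()
        k = np.prod(shape[1:])  # elements per sample: h * w * c
        diff = x - y
        # With no axis given, tf.norm treats the tensor as one flat vector,
        # so this is the sum of absolute differences divided by k.
        diff = tf.norm(diff, ord=1) / k
        result = result + diff

    return result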

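# Compute the Gram matrix
def gram_matrix(input):
    # input [batch, h, w, c]
    input = tf.transpose(input, perm=[0, 3, 1, 2])  # input [batch, c, h, w]
    shape = input.get_shape().as_list()
    channel = shape[1]
    dim = np.prod(shape[2:])  # h * w
    input = tf.reshape(input, [-1, channel, dim])   # input [batch, c, h*w]
    # The original listing is cut off at this point. The two lines below are
    # reconstructed from the shape description above
    # ([b, c, hw] x [b, hw, c] -> [b, c, c]); whether the author also
    # normalized the result is unknown.
    gram = tf.matmul(input, input, transpose_b=True)  # [batch, c, c]
    return gram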
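
The post's own style-loss function is not visible in the listing, so the following is only a guess at how one could be put together from the description: compute the Gram matrix of each feature map, then take the L1 norm of the difference. It is written in NumPy so it can be run without TensorFlow. The names gram_np and style_loss_np, and the division by the number of Gram entries, are assumptions made here to mirror ContentLoss, not the author's code.

```python
import numpy as np


def gram_np(features):
    # [b, h, w, c] -> [b, c, c]: sum over spatial positions of channel products
    return np.einsum("bhwi,bhwj->bij", features, features)


def style_loss_np(message_feats, compare_feats):
    """Hypothetical L1 style loss: sum over layers of |G(x) - G(y)|,
    divided by the number of Gram entries per sample (assumed normalization)."""
    total = 0.0
    for x, y in zip(message_feats, compare_feats):
        gx, gy = gram_np(x), gram_np(y)
        k = np.prod(gx.shape[1:])  # c * c entries per sample
        total += np.abs(gx - gy).sum() / k
    return total


rng = np.random.default_rng(0)
layers_a = [rng.standard_normal((1, 8, 8, 4)), rng.standard_normal((1, 4, 4, 6))]
layers_b = [rng.standard_normal((1, 8, 8, 4)), rng.standard_normal((1, 4, 4, 6))]

print(style_loss_np(layers_a, layers_a))      # 0.0 for identical inputs
print(style_loss_np(layers_a, layers_b) > 0)  # True
```

In a TensorFlow version the same structure would apply, with gram_matrix in place of gram_np and tf.norm(..., ord=1) in place of the absolute sum.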
