Paper reading notes: [CVPR2016] Convolutional Pose Machines

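1. Key features

  • Fully convolutional network.
  • No explicit model of the contextual relationships between keypoints is needed; the network learns them on its own as its receptive field grows.
  • Multi-stage. The receptive field grows as the network gets deeper, so early stages focus on local features while later stages focus on global ones.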
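
To make the "receptive field grows with depth" point concrete, here is a minimal sketch that tracks the receptive field through a stack of convolution and pooling layers using the standard recurrence (each layer adds `(kernel - 1) * jump` input pixels, and `jump` is multiplied by the stride). The function name `receptive_field` and the layer list `STAGE1_LIKE` are illustrative names of mine; the layer list is an assumed stage-1-style stack of 9x9 convolutions and 2x pooling, not a layout confirmed by these notes.

```python
def receptive_field(layers):
    """Return the receptive field (in input pixels) after each layer.

    layers: list of (kernel_size, stride) tuples, applied in order.
    """
    rf = 1    # a single input pixel sees only itself
    jump = 1  # distance in input pixels between neighbouring output units
    sizes = []
    for kernel, stride in layers:
        rf += (kernel - 1) * jump  # each extra kernel tap reaches `jump` pixels further
        jump *= stride             # striding spreads later taps further apart
        sizes.append(rf)
    return sizes


# Assumed (hypothetical) stage-1-style stack: conv 9x9, pool 2x, repeated,
# followed by 5x5, 9x9 and two 1x1 convolutions.
STAGE1_LIKE = [
    (9, 1), (2, 2),
    (9, 1), (2, 2),
    (9, 1), (2, 2),
    (5, 1), (9, 1),
    (1, 1), (1, 1),
]

if __name__ == "__main__":
    print(receptive_field(STAGE1_LIKE))
```

Under these assumed settings the receptive field is 9 pixels after the first layer and 160 pixels after the last. Pooling layers add little by themselves, but they double the contribution of every convolution that follows, which is why depth plus downsampling enlarges the context so quickly.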

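2. Key conclusions

  • The receptive field size affects keypoint prediction accuracy. A larger receptive field captures more context, so the predictions are more accurate. The paper uses a 368 * 368 input, and the receptive field reaches nearly 300 pixels, which covers context from almost the whole image.

  • Different types of keypoints reinforce each other and are learned jointly. Easy keypoints help localize the hard ones.

  • Intermediate supervision addresses the vanishing gradient problem that appears as the network gets deeper.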
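
The intermediate supervision idea can be sketched as follows: every stage outputs one belief map per keypoint, each stage gets its own squared-error loss against the same ground-truth maps, and the per-stage losses are summed, so gradient reaches the early stages directly. This is a minimal pure-Python sketch, not the authors' implementation; the helper names (`gaussian_map`, `stage_loss`, `total_loss`), the Gaussian target with `sigma=1.0`, and the toy 4x4 map size are all assumptions for illustration.

```python
import math


def gaussian_map(height, width, cx, cy, sigma=1.0):
    """Ground-truth belief map: a Gaussian peak centred on the keypoint (cx, cy)."""
    return [
        [math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
         for x in range(width)]
        for y in range(height)
    ]


def stage_loss(pred_maps, target_maps):
    """Sum of squared differences over all keypoints and all pixels for one stage."""
    return sum(
        (p - t) ** 2
        for pred, target in zip(pred_maps, target_maps)  # one map per keypoint
        for p_row, t_row in zip(pred, target)
        for p, t in zip(p_row, t_row)
    )


def total_loss(stage_preds, target_maps):
    """Intermediate supervision: add up the loss of every stage, not just the last."""
    return sum(stage_loss(pred_maps, target_maps) for pred_maps in stage_preds)


if __name__ == "__main__":
    targets = [gaussian_map(4, 4, cx=1, cy=2)]       # one keypoint, toy 4x4 map
    blank = [[[0.0] * 4 for _ in range(4)]]          # a stage that predicts nothing
    print(total_loss([blank, targets], targets))     # only stage 1 contributes
```

In a real training setup the same sum would be computed on the framework's tensors, so that backpropagation injects a gradient at the output of every stage.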
