CVPR 2017: Learning Detailed Face Reconstruction from a Single Image

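Authors

Elad Richardson and Matan Sela, graduate students at the Technion (Israel Institute of Technology). The two are close collaborators and have published several papers together on 3D face reconstruction.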

Abstract

Earlier work on generating a 3D face model from a single image has relied on additional data.

This paper uses an end-to-end CNN framework that works in two stages, from coarse to fine: CoarseNet -> FineNet.

Because no suitable dataset exists for training CNNs on face reconstruction, the authors train on synthetic data.

CoarseNet uses a 3D Morphable Model (3DMM) to generate a coarse facial geometry.
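As a rough illustration of what a 3DMM represents, the sketch below builds a face shape as a mean shape plus linear combinations of identity and expression bases. This is the generic linear 3DMM formulation, not code from the paper: the function name `morphable_shape`, the basis sizes, and the random toy bases are all assumptions made for illustration. In this setting a network such as CoarseNet would be the component that predicts the coefficients.

```python
import numpy as np

def morphable_shape(mean_shape, id_basis, exp_basis, alpha, beta):
    """Linear 3DMM: mean shape plus identity and expression offsets, as (N, 3) vertices."""
    flat = mean_shape + id_basis @ alpha + exp_basis @ beta
    return flat.reshape(-1, 3)

# Hypothetical toy model: 5 vertices, 4 identity and 2 expression components.
rng = np.random.default_rng(0)
n_vertices, n_id, n_exp = 5, 4, 2
mean_shape = rng.normal(size=3 * n_vertices)
id_basis = rng.normal(size=(3 * n_vertices, n_id))
exp_basis = rng.normal(size=(3 * n_vertices, n_exp))

# All-zero coefficients reproduce the mean face.
alpha = np.zeros(n_id)
beta = np.zeros(n_exp)
vertices = morphable_shape(mean_shape, id_basis, exp_basis, alpha, beta)
print(vertices.shape)
```

Because the shape is linear in the coefficients, a small coefficient vector is enough to describe a whole mesh, which is why this representation can only capture coarse geometry.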

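FineNet takes the coarse depth map produced by the previous stage as input and aims to recover fine facial detail; it is trained without supervision on unlabeled face images.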

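To connect CoarseNet and FineNet, the authors propose a new layer that takes the 3DMM representation and the pose parameters as input and produces a depth map, which is fed into FineNet.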

The new layer supports gradient backpropagation, so the two networks can be trained jointly.
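To make the role of this connecting layer more concrete, here is a heavily simplified forward pass: pose the vertices, project them orthographically, and keep the nearest depth per pixel. It is only a sketch under stated assumptions. It splats individual vertices instead of rasterizing triangles, it does not implement the backward pass, and the names `render_depth`, `rotation`, `translation`, `scale`, and `size` are hypothetical rather than taken from the paper.

```python
import numpy as np

def render_depth(vertices, rotation, translation, scale, size=8):
    """Pose the vertices, project orthographically, keep the nearest z per pixel."""
    posed = scale * vertices @ rotation.T + translation
    depth = np.full((size, size), np.inf)  # inf marks pixels no vertex lands on
    cols = np.round(posed[:, 0]).astype(int)
    rows = np.round(posed[:, 1]).astype(int)
    inside = (cols >= 0) & (cols < size) & (rows >= 0) & (rows < size)
    for r, c, z in zip(rows[inside], cols[inside], posed[inside, 2]):
        # Z-buffer test: assumption here is that a smaller z means closer to the camera.
        depth[r, c] = min(depth[r, c], z)
    return depth

# Toy input: two vertices fall on the same pixel, so only the nearer one survives.
toy_vertices = np.array([[1.0, 1.0, 5.0],
                         [1.0, 1.0, 2.0],
                         [3.0, 2.0, 4.0]])
depth_map = render_depth(toy_vertices, np.eye(3), np.zeros(3), 1.0)
print(depth_map[1, 1], depth_map[2, 3])
```

Each covered pixel's depth is a linear function of the posed vertex coordinates, which gives an intuition for why gradients can flow from a depth map back to shape and pose parameters once the pixel-to-surface assignment is fixed.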

The potential of CNNs for 3D face modeling was shown in:

3D Face Reconstruction by Learning from Synthetic Data (3D Vision 2016)

 However, their network can only produce the coarse geometry, and must be given an aligned template model as initialization. These limitations force their solution to depend on external algorithms for pose alignment and detail refinement.

Both parts of this paper are inspired by earlier work; what the authors add is stitching them together into one pipeline that works well.

 

Reposted from: https://www.cnblogs.com/lainey/p/8698008.html
