Open-source code for face alignment networks

A collection of open-source face alignment code, with brief comparisons.

1. MTCNN-based method

[code]: a MATLAB implementation, used for data preprocessing in projects such as SphereFace. It is fast; the paper dates from 2016.

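2. CMU's OpenPose

The related body pose and hand keypoint papers date from 2017. Face keypoint detection uses the same network as the hand detector, but trained on face datasets, for example Microsoft's COCO.

Installation

a). Install cmake-gui. If the Caffe path needs to be set explicitly, use the path settings shown in b).

b). Without cmake-gui, pass the OpenCV and Caffe paths to cmake on the command line (a and b are two alternative methods; choose one):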

mkdir build
cd build

cmake -DOpenCV_INCLUDE_DIRS=/usr/local/include/ -DOpenCV_LIBS_DIR=/usr/local/lib/ -DCaffe_INCLUDE_DIRS=/home/X/work/caffe_master/build/src -DCaffe_LIBS=/home/X/work/caffe_master/build/lib/libcaffe.so -DBUILD_CAFFE=OFF ..

make -j8

# make -j`nproc`

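A particularly troublesome error during make is that the Caffe .hpp headers are also required. On Ubuntu, the Caffe build output under build does not contain the headers, so the workaround is to copy the whole caffe_root/include/caffe directory into build/src/caffe, after which any of the headers can be included.

c). Run it, for example: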
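The header workaround above can be sketched as a small shell helper. This is only an illustration under stated assumptions: the function name copy_caffe_headers and the CAFFE_ROOT variable are hypothetical, and the directory layout (include/caffe copied into build/src/caffe) is taken from the note above rather than from the OpenPose or Caffe documentation.

```shell
#!/usr/bin/env bash
# Copy Caffe's public headers into its build tree so that
# -DCaffe_INCLUDE_DIRS=<caffe_root>/build/src can resolve them.
# copy_caffe_headers is a hypothetical helper, not part of Caffe or OpenPose.
copy_caffe_headers() {
    local caffe_root="$1"
    local src="$caffe_root/include/caffe"
    local dst="$caffe_root/build/src/caffe"

    if [ ! -d "$src" ]; then
        echo "header directory not found: $src" >&2
        return 1
    fi

    mkdir -p "$dst"
    # "src/." copies the directory contents and merges them with
    # any generated files that already exist in dst.
    cp -r "$src/." "$dst/"
}

# Runs only when CAFFE_ROOT is set (assumed variable),
# e.g. CAFFE_ROOT=/home/X/work/caffe_master
if [ -n "${CAFFE_ROOT:-}" ]; then
    copy_caffe_headers "$CAFFE_ROOT"
fi
```

Copying the contents rather than replacing the destination keeps any files the Caffe build already generated under build/src/caffe.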

./build/examples/openpose/openpose.bin --image_dir ~/work/face-alignment/test/assets/ --face --num_gpu 1 --write_keypoint ~/work/face-alignment/test/assets/

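The results are fairly poor, however, and worse than those of 1adrianb's code, although the test samples were deliberately chosen to be difficult.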

3. 1adrianb/face-alignment

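Face detection is based on the face detector in dlib. The 99.38% LFW accuracy reported in 2017 belongs to dlib_face_recognition_resnet_model_v1, which is dlib's face recognition model rather than its detector. The goal of the project is 3D face alignment.

Installation

  1. Install dlib. Keypoints are generated in this mode, but a different detection model can also be specified.
  2. The pretrained model archives (tar) must be downloaded; the 2D model path is linked here, and the downloaded files are saved to ~/.face_alignment. On Ubuntu the download works directly. On Windows it sometimes requires a proxy, so the files were also uploaded to the CSDN resource pool (file 1, file 2), together with the depth network model needed for 3D (link: https://pan.baidu.com/s/1Y10qL3Lcp6nPZb-uSlbAVg password: 9vqx).
  3. A reasonably recent PyTorch is required, because the code uses constructs such as with no_grad(), which does not exist in 0.2 (it was added in PyTorch 0.4.0). When upgrading PyTorch, pip may fail to install it, which is a fairly common problem; in that case download the package manually, upgrade wheel to 3, and install again.
  4. Usage: the example given in the official repository is sufficient. The output is a two-dimensional array of keypoints.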
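For the PyTorch version requirement in step 3, a quick check before installing can save a failed run. The sketch below is an illustration, not part of face-alignment: version_ge is a hypothetical helper, the 0.4.0 threshold is an assumption based on torch.no_grad() first appearing in that release, and the comparison relies on GNU sort -V, which may order unusual local version suffixes differently from pip.

```shell
#!/usr/bin/env bash
# version_ge A B: succeeds when version A >= version B.
# Uses GNU sort -V (version sort), which is available on Ubuntu.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Assumed minimum: torch.no_grad() was introduced in PyTorch 0.4.0.
required="0.4.0"

# Query the installed version; tolerate a missing torch or python3.
if installed="$(python3 -c 'import torch; print(torch.__version__)' 2>/dev/null)"; then
    if version_ge "$installed" "$required"; then
        echo "PyTorch $installed is new enough"
    else
        echo "PyTorch $installed is too old; need >= $required"
    fi
else
    echo "PyTorch is not installed for python3"
fi
```

The helper sorts the two version strings and checks that the required one comes first, so 0.10.0 is correctly treated as newer than 0.4.0, which a plain string comparison would get wrong.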

4. 1adrianb/binary-human-pose-estimation

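An implementation of the 2017 paper "Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources", which was presented as an oral at ICCV 2017. It covers both face alignment and body pose; the code for the two is separate.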

5. MarekKowalski/DeepAlignmentNetwork

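A 2017 paper; the authors state that DAN has "been accepted to the First Faces in-the-wild Workshop-Challenge at CVPR 2017".

As with face-alignment, pretrained network parameters must be downloaded, which also requires a proxy. They were uploaded to CSDN, three files in total (DAN, DAN-Menpo-tracking, DAN-Menpo).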

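6. kayamin/DR-GAN

An implementation of the paper "Disentangled Representation Learning GAN for Pose-Invariant Face Recognition"; the follow-up paper by the same authors is titled "Representation Learning by Rotating Your Faces". Other implementations exist and can be found on GitHub.

7. Deep Face Feature for Face Alignment (no code available)

This 2018 paper claims to outperform 3DDFA, Jourabloo & Liu, and ERT.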

8. tpys/face-everthing

A collection of earlier code, not all of it related to alignment; of little practical use here.

 

 

 
