"Deep Facial Expression Recognition: A Survey"论文笔记

introduction

  • FER systems can be divided into two main categories according to the feature representations: static image FER and dynamic sequence FER. (spatio-temporal information)
  • The majority of traditional methods have used handcrafted features or shallow learning (e.g., local binary patterns (LBP) [12], LBP on three orthogonal planes (LBP-TOP) [15], non-negative matrix factorization (NMF) [19] and sparse learning [20]) for FER.
    However, many competitions have collected relatively sufficient training data from challenging real-world scenarios.
    Meanwhile, due to dramatically increased chip processing abilities (e.g., GPUs) and well-designed network architectures, studies in various fields have begun to shift to deep learning methods.

database

deep facial expression recognition

  1.pre-processing

  • face alignment (face detector, then localization of landmark coordinates)

          

  • Kim et al. [76] considered different inputs (original image and histogram-equalized image) and different face detection models (V&J [72] and MoT [56]), and selected the landmark set with the highest confidence provided by Intraface [73].
  • data augmentation (to enlarge the database)
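
The selection strategy attributed to Kim et al. above can be sketched as a search over (input, detector) pairs. This is only an illustration under stated assumptions: `select_best_landmarks`, the `inputs` and `detectors` dictionaries, and the `localize` callable are hypothetical placeholders, not the API of V&J, MoT, or Intraface.

```python
from itertools import product


def select_best_landmarks(inputs, detectors, localize):
    """Try every (input, detector) pair and keep the most confident landmark set.

    inputs:    dict mapping a variant name to an image (hypothetical)
    detectors: dict mapping a detector name to callable(image) -> box or None
    localize:  callable(image, box) -> (landmarks, confidence)
    Returns None when no detector finds a face.
    """
    best = None
    for (in_name, image), (det_name, detect) in product(inputs.items(), detectors.items()):
        box = detect(image)
        if box is None:
            continue  # this detector found no face on this input variant
        landmarks, confidence = localize(image, box)
        if best is None or confidence > best["confidence"]:
            best = {
                "input": in_name,
                "detector": det_name,
                "landmarks": landmarks,
                "confidence": confidence,
            }
    return best
```

In practice the two inputs would be the original and the histogram-equalized image, and `localize` would wrap whatever landmark model reports a confidence score.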

  • Data augmentation techniques can be divided into two groups: on-the-fly data augmentation and offline data
    augmentation.
  • Usually, the on-the-fly data augmentation is embedded in deep learning toolkits to alleviate overfitting. During the training step, the input samples are randomly cropped from the four corners and center of the image and then flipped horizontally.
  • Besides the elementary on-the-fly data augmentation, various offline data augmentation operations have been designed to further expand the data in both size and diversity. The most frequently used operations include random perturbations and transforms, e.g., rotation, shifting, skew, scaling, noise, contrast and color jittering. Furthermore, deep-learning-based techniques can be applied for data augmentation, for example a CNN or a GAN (generative adversarial network).
  • face normalization (to reduce variations in illumination and head pose)
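
The on-the-fly step described above (a crop taken from one of the four corners or the center, followed by a horizontal flip) can be sketched with NumPy. This is a minimal sketch, not the implementation of any particular toolkit: the function names, the crop size, and the 0.5 flip probability are assumptions made for illustration.

```python
import numpy as np


def five_crop_positions(height, width, crop):
    """Top-left offsets of the four corner crops and the center crop."""
    return [
        (0, 0),                                        # top-left corner
        (0, width - crop),                             # top-right corner
        (height - crop, 0),                            # bottom-left corner
        (height - crop, width - crop),                 # bottom-right corner
        ((height - crop) // 2, (width - crop) // 2),   # center
    ]


def augment_on_the_fly(image, crop, rng):
    """Pick one of the five crops at random, then flip it with probability 0.5."""
    h, w = image.shape[:2]
    positions = five_crop_positions(h, w, crop)
    top, left = positions[rng.integers(len(positions))]
    patch = image[top:top + crop, left:left + crop]
    if rng.random() < 0.5:  # assumed flip probability
        patch = patch[:, ::-1]  # horizontal flip
    return patch


rng = np.random.default_rng(0)
face = np.zeros((48, 48), dtype=np.uint8)  # stand-in for a 48x48 face image
patch = augment_on_the_fly(face, 40, rng)
```

Because a fresh crop and flip are drawn every time a sample is loaded, the network rarely sees exactly the same input twice, which is how this step helps against overfitting.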

  • illumination normalization 

      a. several algorithms: isotropic diffusion (IS)-based normalization, discrete cosine transform (DCT)-based normalization [85] and difference of Gaussians (DoG)

      b. homomorphic-filtering-based normalization, histogram equalization combined with illumination normalization, etc.

      c. a weighted summation approach that combines histogram equalization and linear mapping (to avoid overemphasizing local contrast)

      d. global contrast normalization (GCN), local normalization and histogram equalization.
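
Some of the illumination normalization options listed above can be sketched in NumPy for an 8-bit grayscale face image: histogram equalization, a weighted sum of histogram equalization and linear mapping, and global contrast normalization. This is a rough sketch under stated assumptions: the function names, the default `weight=0.5`, and the `eps` guard are illustrative choices, not values taken from the survey.

```python
import numpy as np


def histogram_equalization(gray):
    """Map 8-bit gray levels through the normalized cumulative histogram."""
    hist = np.bincount(gray.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]              # CDF value at the darkest occupied level
    denom = max(cdf[-1] - cdf_min, 1)      # guard against constant images
    lut = np.clip(np.round((cdf - cdf_min) / denom * 255), 0, 255).astype(np.uint8)
    return lut[gray]


def linear_mapping(gray):
    """Stretch intensities linearly to the full [0, 255] range."""
    g = gray.astype(np.float64)
    lo, hi = g.min(), g.max()
    if hi == lo:
        return gray.copy()
    return np.round((g - lo) / (hi - lo) * 255).astype(np.uint8)


def weighted_equalization(gray, weight=0.5):
    """Blend histogram equalization with linear mapping (weight is an assumed parameter)."""
    mixed = weight * histogram_equalization(gray) + (1.0 - weight) * linear_mapping(gray)
    return np.round(mixed).astype(np.uint8)


def global_contrast_normalization(image, eps=1e-8):
    """Subtract the image mean and divide by the image standard deviation."""
    x = image.astype(np.float64)
    x = x - x.mean()
    return x / max(x.std(), eps)
```

Lowering `weight` moves the result toward the plain linear stretch, which is one way to temper the strong local contrast that pure histogram equalization can introduce.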

  • pose normalization

      a. Specifically, after localizing facial landmarks, a 3D texture reference model generic to all faces is generated to efficiently estimate visible facial components.
