READING NOTE: SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection

TITLE: SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection

AUTHER: Shengfeng He, Rynson W.H. Lau, Wenxi Liu, Zhe Huang, Qingxiong Yang Reid, Ian

FROM: IJCV2015

CONTRIBUTIONS

  1. A novel superpixel-wise convolutional neural network approach is proposed.
  2. Two kinds of sequence code are designed as input feature to CNN.

METHOD

  1. Superpixels are extracted via some methods such as oversegmentation.
  2. Extract Color Uniqueness Sequences (CU) for each superpixels to describe the color contrast between regions.
  3. Extract Color Distribution Sequences (CD) for each superpixels to measure the color compactness of colors.
  4. The two sequences are fed into a CNN to generate two saliency maps.
  5. A regressor is used to merge the two predicted saliency maps

SOME DETAILS

Color Uniqueness Sequence is used to describe the color contrast of a Region. Given an image I and the superpixels or regions R={r1,...,rx,...,rN}, each region rx contains a color uniqueness sequence QCx={qc1,...,qcj,...,qcN} . Each element, qcx is defined as

qcx=t(rj)|C(rx)C(rj)|w(P(rx),P(rj))

where t(rj) counts the total number of pixels in region rj . |C(rx)C(rj)| is a 3D vector storing the absolute differences of each color channel. P(rx is the mean position of region rj and w(P(rx),P(rj)) is defined as

w(P(rx),P(rj))=exp(12σ2sP(rx)P(rj)2)

The sequence QCx is sorted by the spatial distance to region rx .

Color Distribution Sequence is a sequence QDx={qd1,...,qdj,...,qdN} with the element qdj defined as:

qdj=t(rj)|P(rx)P(rj)|w(C(rx),C(rj))

where

w(C(rx),C(rj))=exp(12σ2sC(rx)C(rj)2)

the sequence is also sorted by the spatial distance.

Network Structure is briefly illustrated as below:

Saliency Inference is first to get the N predicted saliency scores of the N regions. Because of the two kinds of sequences, two sets of scores S1andS2 are predicted. The final saliency map can be obtained by:

S(rx)=u{1,2}vucSu(rx)

ADVANTAGES

  1. It is fast when infering.
  2. Large context are encoded in the sequences.

DISADVANTAGES

  1. The CNN is of a very light-weight structure. Deeper network may provide better performance.
  2. As the sequences are used to describe contrast information, which may lead to failure with the foreground and background having similar colors.
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值