cs231n lecture11 segmentation

Segmentation, Localization, Detection

Semantic Segmentation

  • label each pixel in the image with a category label
  • know classes
  • idea: sliding window
    • inefficient, Not reusing shared features between overlapping patches
  • idea: fully convolutional network (HxWxC)
    • whole image is computational heavy
    • downsampling and upsampling inside the network
      fc
sampling

downsampling

  • pooling
  • stride conv

upsampling

  • unpooling
    • NN(duplicate)
    • Bed of Nails(0s padding)
    • max: use positions from pooling layer

up

  • transpose convolution
    • sum where output overlaps
    • eg: input: [a b c d], filter(weight): [x y z]
      decov
conclusion

FCN + dilated conv + CRF

Classification + Localization

  • single object
    loc
  • model as regression problem
  • train seperately, when converge, then train jointly
  • Aside
    • Human Pose Estimation

Object Detection

  • various object
  • PASCAL VOC(too easy now)
  • as localization
    • fail, number of objects doesn’t fix
  • as classification:
    • sliding window
    • add class: bcakground
    • need to apply CNN to huge number of locations and scales

Region Based

  • Region Proposals
    • find “blobby” image regions that are likely to contain objects
    • Selective Search, fast
  • RCNN
    • Region Proposal + CNN
    • multi-task: also predict(correct) proposes bounding box
    • supervised
  • Fast R-CNN
    • RP on feature map(after convolved)
    • ROL pooling to crop image
  • Faster R-CNN
    • Region Proposal Network(RPN) to predict proposals(no ground truth)

frcnn

YOLO/ SSD

  • “You Only Look Once”/ “Single-shot MultiBox Detector”
  • treat as a regression problem
  • diff: bouding box are fixed, RCNN = region proposal + classification. SSD combine this.

yolo

Takeaways

  • Faster R-CNN is slower but more accurate
  • SSD is much faster but not as accurate

Instance Segmentation

Mask R-CNN

  • trained on Coco
    • 200000 training images
    • 80 categories per images

To Read

  • region proposal
  • ROI align / POOL

Thinks

  • multi task
  • end to end
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值