cs231n lecture11 segmentation

最新推荐文章于 2022-03-06 10:35:06 发布

feitianlzk

最新推荐文章于 2022-03-06 10:35:06 发布

阅读量287

点赞数

分类专栏： AI

本文链接：https://blog.csdn.net/feitianlzk/article/details/79638374

版权

19 篇文章 0 订阅

订阅专栏

Segmentation, Localization, Detection

label each pixel in the image with a category label
know classes
idea: sliding window
- inefficient, Not reusing shared features between overlapping patches
idea: fully convolutional network (HxWxC)
- whole image is computational heavy
- downsampling and upsampling inside the network

downsampling

upsampling

unpooling
- NN(duplicate)
- Bed of Nails(0s padding)
- max: use positions from pooling layer

transpose convolution
- sum where output overlaps
- eg: input: [a b c d], filter(weight): [x y z]

FCN + dilated conv + CRF

various object
PASCAL VOC(too easy now)
as localization
- fail, number of objects doesn’t fix
as classification:
- sliding window
- add class: bcakground
- need to apply CNN to huge number of locations and scales

Region Based

Region Proposals
- find “blobby” image regions that are likely to contain objects
- Selective Search, fast
RCNN
- Region Proposal + CNN
- multi-task: also predict(correct) proposes bounding box
- supervised
Fast R-CNN
- RP on feature map(after convolved)
- ROL pooling to crop image
Faster R-CNN
- Region Proposal Network(RPN) to predict proposals(no ground truth)

frcnn

YOLO/ SSD

“You Only Look Once”/ “Single-shot MultiBox Detector”
treat as a regression problem
diff: bouding box are fixed, RCNN = region proposal + classification. SSD combine this.

yolo

Takeaways

Mask R-CNN

关注