Scene Parsing

Scene Parsing

Problem

segment and parse an image into different image regions associated with semantic categories

Evaluation

  • mean of the pixel-wise accuracy
    the ratio of pixels which are correctly predicted.
  • class-wise IoU
    the Intersection of Union of pixels averaged over all the semantic categories.

Dataset

  • Stanford Background
    S. Gould, R. Fulton, and D. Koller. Decomposing a scene into geometric and semantically consistent regions. In Computer Vision, 2009 IEEE 12th International Conference on, pages 1–8, Sept 2009.

  • SIFT Flow
    C. Liu, J. Yuen, and A. Torralba. Nonparametric scene parsing via label transfer. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 33(12):2368–2382, Dec 2011.

  • PASCAL-Context
    Mottaghi, Roozbeh, et al. “The role of context for object detection and semantic segmentation in the wild.” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014.

  • ADE20K
    Semantic Understanding of Scenes through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. arXiv:1608.05442

DatasetStanford BackgroundSIFT FlowADE20K
No. of images715268825562
No. of train set572248820210
No. of val set002000
No. of test set1432003352
No. of classes833150

Samples of ADE20K

http://sceneparsing.csail.mit.edu/browse.php/?dirname=training/

ADE_train_00019523.jpg ADE_train_00019523.jpg

ADE_train_00000278.jpg ADE_train_00000278.jpg

Result

Stanford Background

MethodPixel Acc.Class Acc.averaged computing time per image
Single-scale ConvNet6656.50.35 (GPU)
Augmented CNNs71.9766.16-
Superparsing77.5-10 to 300
Deep 2D LSTM (window 5x5)77.7368.261.3 (CPU)
Deep 2D LSTM (window 3x3)78.5668.793.7 (CPU)
Multi-scale ConvNet78.872.40.6 (CPU)
RCNN2 (3 instances)80.269.910.7 (GPU)
N-ReNet80.471.80.07 (GPU)
Multi-CNN + rCPN Fast80.978.80.37 (GPU)
multiscale net + CRF on gPb81.476.060.5 (CPU)
Zoom-out82.177.3-
HGDN82.4172.980.02 (GPU)
RCNN_NIPS201583.174.80.03 (GPU)

SIFT Flow

MethodPixel Acc.Class Acc.mean IUf.w. IUaveraged computing time per image
Augmented CNNs49.3944.54---
Deep 2D LSTM (window 5x5)68.7422.59--1.2 (CPU)
Deep 2D LSTM (window 3x3)70.1120.90--3.1 (CPU)
RCNN2 (3 instances)77.729.8---
multiscale net + cover172.350.8---
multiscale net + cover278.529.6---
RCNN (balanced)79.357.1--0.03 (GPU)
HGDN79.6851.26--0.03 (GPU)
RCNN-large84.341.0--0.04 (GPU)
FCN-16s85.251.7--0.175 (GPUs)
VGG-conv5-DAG-RNN(8)85.355.7---
FCN-8s85.953.941.277.2-
patch CRF+CNN88.153.4---

PASCAL-Context

MethodPixel Acc.Class Acc.mean IUf.w. IU
CFM--18.1-
CFM--34.4-
FCN-32s65.549.136.750.9
FCN-16s66.951.338.452.3
FCN-8s67.552.339.153.0
patch CRF+CNN71.553.9--

Reference

MethodYearConferenceReference Paper
Superparsing2010ECCVSuperparsing: Scalable nonparametric image parsing with superpixels
Single-scale ConvNet2013PAMILearning hierarchical features for scene labeling
multiscale net2013PAMILearning hierarchical features for scene labeling
Augmented CNNs2014BMVCContextually constrained deep networks for scene labeling
RCNN2 (3 instances)2014ICMLRecurrent convolutional neural networks for scene labeling
Multi-CNN + rCPN Fast2014NIPSRecursive context propagation network for semantic scene labeling
RCNN (balanced)2015NIPSConvolutional Neural Networks with Intra-layer
RCNN-large2015NIPSConvolutional Neural Networks with Intra-layer
Deep 2D LSTM2015CVPRScene Labeling with LSTM Recurrent Neural Networks
Zoom-out2015CVPRFeedforward semantic segmentation with zoom-out features
FCN-16s2015CVPRFully convolutional networks for semantic segmentation
N-ReNet2016Combining the Best of Convolutional Layers and Recurrent Layers: A Hybrid Network for Semantic Segmentation
HGDN2016CVPRHierarchically Gated Deep Networks for Semantic Segmentation
VGG-conv5-DAG-RNN(8)2016CVPRDAG-Recurrent Neural Networks For Scene Labeling
patch CRF+CNN2016CVPREfficient Piecewise Training of Deep Structured Models for Semantic Segmentation
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值