人群计数:SFCN--Learning from Synthetic Data for Crowd Counting in the Wild

Now, there is a serious problem, the scarce data led many methods suffer from over-fitting to a different extent.

**

contribution:

**

  1. We are the first to develop a data collector and labeler for crowd counting, which can automatically collect and annotate images without any labor costs. By using them, we create the first large-scale, synthetic and diverse crowd counting dataset.

  2. We present a pretrained scheme to facilitate the original method’s performance on the real data, which can more effectively reduce the estimation errors compared with random initialization and ImageNet model. Further, through the strategy, our proposed SFCN achieves the state-of-the-art results.

  3. We are the first to propose a crowd counting method via domain adaptation, which does not use any label of the real data. By our designed SE Cycle GAN, the domain gap between the synthetic and real data can be significantly reduced. Finally, the proposed method outperforms the two baselines.

**

GCC dataset:

**
The full name of GCC is GTA5 Crowd Counting. It has four highlights:

  1. free collection and annotation
  2. larger data volume and higher resolution
  3. more diversified scenes
  4. more accurate annotations

The process of getting a image for training:
a) select a location and setup the cameras
b) segment Region of interest (ROI) for crowd
c) set weather and time.
Place persons:
a) create persons in the ROI and get the head positions
b) obtain the person mask from stencil
c) integrate multiple images into one image
d) remove the positions of occluded heads.

How to use GCC ?

  1. Random splitting the training set and testing set.
  2. Cross-camera splitting: as for a specific location, one surveillance camera is randomly selected for testing and the others for training.
  3. Cross-location splitting: we randomly choose 75/25 locations for training/testing.

This table shows the advantage of using GCC to pretrain their model:

在这里插入图片描述

**

generating density map:

**
There are two ways to estimate the destiny map:
1. superised crowd counting: pretrained GCC model on finetuning real dataset.
2. Crowd Counting via Domain Adaptation: learning mapping between the synthetic domain S and the real-world domain R, then training the SFCN just on GCC.

The relationship of them is shown in below:
在这里插入图片描述

superrised crowd counting:

A spatial encoder via a sequence of convolution on the four directions (down, up, left-to-right and right-to-left). After the spatial encoder, a regression layer is added, which directly outputs the density map with input’s 1/8 size.

We design a spatial FCN (SFCN) to produce the density map, which adopt VGG-16 or ResnNet-101 as the backbone. We modify the stride size to 1 in conv4 x of ResNet-101 backbone, which makes conv4 x output the feature maps with 1/8 size of the
input image.在这里插入图片描述

Crowd Counting via Domain Adaptation:

Propose a crowd counting method via domain adaptation learns specific patterns or features from the synthetic data and transfers them to the real world.
To be specific, we present a SSIM Embedding (SE) Cycle GAN to transform the synthetic image to the photo-realistic image. Then we will train a SFCN on the translated data. No finetune on the real dataset. Then we get a satisfactory result:
在这里插入图片描述

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值