Why data augmentation is necessary when training deep neural networks

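This post looks at the use of a Residual U-Net for accurate and robust left-ventricle segmentation, and stresses the role of a varied training dataset and data augmentation in preventing over-fitting. Images are selected at random and then flipped, rotated, and shifted, which increases the variety of the input data and keeps the network from fixating on features in specific regions. The post also notes that random elastic deformation is key to training with few annotated images, and that data augmentation teaches the network invariance and robustness.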

Left-Ventricle Quantification Using Residual U-Net

Kerfoot E., Clough J., Oksuz I., Lee J., King A.P., Schnabel J.A. (2019) Left-Ventricle Quantification Using Residual U-Net. In: Pop M. et al. (eds) Statistical Atlases and Computational Models of the Heart. Atrial Segmentation and LV Quantification Challenges. STACOM 2018. Lecture Notes in Computer Science, vol 11395. Springer, Cham. https://doi.org/10.1007/978-3-030-12029-0_40

3.1 Image Preprocessing

An accurate and robust segmentation network requires a large dataset so that it learns a general solution which can correctly segment images not seen in training, and does not become over-fitted to the input data. The key concept is variety since the network is being trained to identify geometry embedded in varying contexts. If some ancillary feature in this context is often present with important features the network will correlate these features and produce a poor result for inputs lacking the ancillary feature. To provide this varied data, we first create an expanded input dataset from the original challenge data and then apply data augmentation during the training process.

During training, we use data augmentation [1, 2] when creating an input batch of images at each step. Images from the expanded dataset are selected at random, then a random selection of transpose, flip, 90/180/270-degree rotation, and shift operations are applied to each image/segmentation pair. Applying these transformations essentially produces a further expanded dataset which contains increased image variation although not further geometric variation. This prevents the network from fixating on features in specific regions of its perceptive field since the random transformations move such features around the field during training. Augmentation is the only technique we use to prevent over-fitting, other techniques like dropout were found not to improve performance and so omitting them contributed to a simpler network architecture.
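The transpose, flip, rotation, and shift step described above can be sketched roughly as follows. This is a minimal NumPy illustration, not the authors' code: the function name `augment_pair`, the `max_shift` parameter, the 50% probabilities, and the use of a wrap-around shift (`np.roll`) are all assumptions, since the excerpt does not say how shifts treat the image border. The one point taken directly from the text is that the same transformation is applied to the image and its segmentation.

```python
import numpy as np


def augment_pair(image, seg, rng, max_shift=8):
    """Apply one random transpose/flip/rotation/shift to an image and its mask.

    Hypothetical helper: names, probabilities and the wrap-around shift are
    illustrative assumptions. Square 2D arrays keep their shape.
    """
    # Random transpose (swap rows and columns).
    if rng.random() < 0.5:
        image, seg = image.T, seg.T

    # Random flip along one randomly chosen axis.
    if rng.random() < 0.5:
        axis = int(rng.integers(0, 2))
        image, seg = np.flip(image, axis), np.flip(seg, axis)

    # Random rotation by 0, 90, 180 or 270 degrees.
    k = int(rng.integers(0, 4))
    image, seg = np.rot90(image, k), np.rot90(seg, k)

    # Random shift; np.roll wraps pixels around the border (an assumption).
    dy, dx = (int(v) for v in rng.integers(-max_shift, max_shift + 1, size=2))
    image = np.roll(image, (dy, dx), axis=(0, 1))
    seg = np.roll(seg, (dy, dx), axis=(0, 1))

    return np.ascontiguousarray(image), np.ascontiguousarray(seg)
```

Calling `augment_pair(image, seg, np.random.default_rng())` once per sample while building a batch gives each training step a differently transformed copy. None of these operations resample pixel values, which is consistent with the remark that they add image variation but no new geometric variation.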


U-Net: Convolutional Networks for Biomedical Image Segmentation

3.1 Data Augmentation

Data augmentation is essential to teach the network the desired invariance and robustness properties, when only few training samples are available. In case of microscopical images we primarily need shift and rotation invariance as well as robustness to deformations and gray value variations. Especially random elastic deformations of the training samples seem to be the key concept to train a segmentation network with very few annotated images. We generate smooth deformations using random displacement vectors on a coarse 3 by 3 grid. The displacements are sampled from a Gaussian distribution with 10 pixels standard deviation. Per-pixel displacements are then computed using bicubic interpolation. Drop-out layers at the end of the contracting path perform further implicit data augmentation.
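The elastic deformation described above (random displacements on a coarse 3 by 3 grid, Gaussian with a standard deviation of 10 pixels, interpolated to per-pixel displacements) might be sketched as below. This is an illustrative approximation under stated assumptions, not the U-Net authors' implementation: it uses SciPy's cubic spline interpolation (`map_coordinates` with `order=3`) as a stand-in for bicubic interpolation, and the function name `elastic_deform`, the bilinear resampling of the image, the nearest-neighbour resampling of the mask, and the border modes are all choices made here.

```python
import numpy as np
from scipy.ndimage import map_coordinates


def elastic_deform(image, mask, rng, grid_size=3, sigma=10.0):
    """Warp a 2D image and its label mask with one smooth random deformation.

    Hypothetical helper: interpolation orders and border modes are assumptions.
    """
    h, w = image.shape

    # Random displacement vectors (dy, dx) on a coarse grid_size x grid_size grid.
    coarse = rng.normal(0.0, sigma, size=(2, grid_size, grid_size))

    # Position of every pixel expressed in coarse-grid index coordinates.
    gy = np.linspace(0, grid_size - 1, h)
    gx = np.linspace(0, grid_size - 1, w)
    grid = np.meshgrid(gy, gx, indexing="ij")

    # Upsample the coarse field to per-pixel displacements (cubic spline).
    dy = map_coordinates(coarse[0], grid, order=3, mode="nearest")
    dx = map_coordinates(coarse[1], grid, order=3, mode="nearest")

    # Sample the inputs at the displaced pixel positions.
    yy, xx = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    coords = [yy + dy, xx + dx]
    warped_image = map_coordinates(image, coords, order=1, mode="reflect")
    # order=0 keeps mask labels discrete instead of blending them.
    warped_mask = map_coordinates(mask, coords, order=0, mode="reflect")
    return warped_image, warped_mask
```

Using one displacement field for both the image and the mask keeps the annotation aligned with the deformed image. Setting `sigma=0.0` returns the inputs unchanged, which is a convenient sanity check.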


  1. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems, NIPS 2012, vol. 1, pp. 1097–1105. Curran Associates Inc., USA (2012)

  2. Simard, P.Y., Steinkraus, D., Platt, J.: Best practices for convolutional neural networks applied to visual document analysis. Institute of Electrical and Electronics Engineers, Inc. August 2003
