【TensorFlow2.0】tf.keras.preprocessing.image.ImageDataGenerator

最新推荐文章于 2024-03-13 17:23:45 发布

Accelerating

最新推荐文章于 2024-03-13 17:23:45 发布

阅读量599

点赞数

分类专栏： TensorFlow 文章标签： keras

本文链接：https://blog.csdn.net/Accelerating/article/details/120459468

版权

TensorFlow 专栏收录该内容

13 篇文章 1 订阅

订阅专栏

@keras_export('keras.preprocessing.image.ImageDataGenerator')
class ImageDataGenerator(image.ImageDataGenerator):
  """Generate batches of tensor image data with real-time data augmentation.

   The data will be looped over (in batches).

  Args:
      featurewise_center: Boolean.
          Set input mean to 0 over the dataset, feature-wise.
      samplewise_center: Boolean. Set each sample mean to 0.
      featurewise_std_normalization: Boolean.
          Divide inputs by std of the dataset, feature-wise.
      samplewise_std_normalization: Boolean. Divide each input by its std.
      zca_epsilon: epsilon for ZCA whitening. Default is 1e-6.
      zca_whitening: Boolean. Apply ZCA whitening.
      rotation_range: Int. Degree range for random rotations.
      width_shift_range: Float, 1-D array-like or int
          - float: fraction of total width, if < 1, or pixels if >= 1.
          - 1-D array-like: random elements from the array.
          - int: integer number of pixels from interval
              `(-width_shift_range, +width_shift_range)`
          - With `width_shift_range=2` possible values
              are integers `[-1, 0, +1]`,
              same as with `width_shift_range=[-1, 0, +1]`,
              while with `width_shift_range=1.0` possible values are floats
              in the interval [-1.0, +1.0).
      height_shift_range: Float, 1-D array-like or int
          - float: fraction of total height, if < 1, or pixels if >= 1.
          - 1-D array-like: random elements from the array.
          - int: integer number of pixels from interval
              `(-height_shift_range, +height_shift_range)`
          - With `height_shift_range=2` possible values
              are integers `[-1, 0, +1]`,
              same as with `height_shift_range=[-1, 0, +1]`,
              while with `height_shift_range=1.0` possible values are floats
              in the interval [-1.0, +1.0).
      brightness_range: Tuple or list of two floats. Range for picking
          a brightness shift value from.
      shear_range: Float. Shear Intensity
          (Shear angle in counter-clockwise direction in degrees)
      zoom_range: Float or [lower, upper]. Range for random zoom.
          If a float, `[lower, upper] = [1-zoom_range, 1+zoom_range]`.
      channel_shift_range: Float. Range for random channel shifts.
      fill_mode: One of {"constant", "nearest", "reflect" or "wrap"}.
          Default is 'nearest'.
          Points outside the boundaries of the input are filled
          according to the given mode:
          - 'constant': kkkkkkkk|abcd|kkkkkkkk (cval=k)
          - 'nearest':  aaaaaaaa|abcd|dddddddd
          - 'reflect':  abcddcba|abcd|dcbaabcd
          - 'wrap':  abcdabcd|abcd|abcdabcd
      cval: Float or Int.
          Value used for points outside the boundaries
          when `fill_mode = "constant"`.
      horizontal_flip: Boolean. Randomly flip inputs horizontally.
      vertical_flip: Boolean. Randomly flip inputs vertically.
      rescale: rescaling factor. Defaults to None.
          If None or 0, no rescaling is applied,
          otherwise we multiply the data by the value provided
          (after applying all other transformations).
      preprocessing_function: function that will be applied on each input.
          The function will run after the image is resized and augmented.
          The function should take one argument:
          one image (Numpy tensor with rank 3),
          and should output a Numpy tensor with the same shape.
      data_format: Image data format,
          either "channels_first" or "channels_last".
          "channels_last" mode means that the images should have shape
          `(samples, height, width, channels)`,
          "channels_first" mode means that the images should have shape
          `(samples, channels, height, width)`.
          It defaults to the `image_data_format` value found in your
          Keras config file at `~/.keras/keras.json`.
          If you never set it, then it will be "channels_last".
      validation_split: Float. Fraction of images reserved for validation
          (strictly between 0 and 1).
      dtype: Dtype to use for the generated arrays.

  Raises:
    ValueError: If the value of the argument, `data_format` is other than
          `"channels_last"` or `"channels_first"`.
    ValueError: If the value of the argument, `validation_split` > 1
          or `validation_split` < 0.
          """
		def __init__(self,
		               featurewise_center=False,
		               samplewise_center=False,
		               featurewise_std_normalization=False,
		               samplewise_std_normalization=False,
		               zca_whitening=False,
		               zca_epsilon=1e-6,
		               rotation_range=0,
		               width_shift_range=0.,
		               height_shift_range=0.,
		               brightness_range=None,
		               shear_range=0.,
		               zoom_range=0.,
		               channel_shift_range=0.,
		               fill_mode='nearest',
		               cval=0.,
		               horizontal_flip=False,
		               vertical_flip=False,
		               rescale=None,
		               preprocessing_function=None,
		               data_format=None,
		               validation_split=0.0,
		               dtype=None)

具体含义如下：

featurewise_center：布尔值，使输入数据集去中心化（均值为0）

samplewise_center：布尔值，使输入数据的每个样本均值为0。

featurewise_std_normalization：布尔值，将输入除以数据集的标准差以完成标准化。

samplewise_std_normalization：布尔值，将输入的每个样本除以其自身的标准差。

zca_whitening：布尔值，对输入数据施加ZCA白化。

rotation_range：整数，数据增强时图片随机转动的角度。随机选择图片的角度，是一个0~180的度数，取值为0~180。

width_shift_range：浮点数，图片宽度的某个比例，数据增强时图片随机水平偏移的幅度。

height_shift_range：浮点数，图片高度的某个比例，数据增强时图片随机竖直偏移的幅度。 

shear_range：浮点数，剪切强度（逆时针方向的剪切变换角度）。是用来进行剪切变换的程度。

zoom_range：浮点数或形如[lower,upper]的列表，随机缩放的幅度，若为浮点数，则相当于[lower,upper] = [1 - zoom_range, 1+zoom_range]。用来进行随机的放大。

channel_shift_range：浮点数，随机通道偏移的幅度。

fill_mode：‘constant’，‘nearest’，‘reflect’或‘wrap’之一，当进行变换时超出边界的点将根据本参数给定的方法进行处理。

cval：浮点数或整数，当fill_mode=constant时，指定要向超出边界的点填充的值。

horizontal_flip：布尔值，进行随机水平翻转。随机的对图片进行水平翻转，这个参数适用于水平翻转不影响图片语义的时候。

vertical_flip：布尔值，进行随机竖直翻转。

rescale: 值将在执行其他处理前乘到整个图像上，我们的图像在RGB通道都是0~255的整数，这样的操作可能使图像的值过高或过低，所以我们将这个值定为0~1之间的数。

preprocessing_function: 将被应用于每个输入的函数。该函数将在任何其他修改之前运行。该函数接受一个参数，为一张图片（秩为3的numpy array），并且输出一个具有相同shape的numpy array。

Accelerating

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
【TensorFlow2.0】tf.keras.preprocessing.image.ImageDataGenerator

@keras_export('keras.preprocessing.image.ImageDataGenerator')class ImageDataGenerator(image.ImageDataGenerator): """Generate batches of tensor image data with real-time data augmentation. The data will be looped over (in batches). Args: fea
复制链接

扫一扫

专栏目录