使用tensorflow预处理输入图像得到神经网络的输入层

最新推荐文章于 2024-06-21 09:08:19 发布

AmibitionWei

最新推荐文章于 2024-06-21 09:08:19 发布

阅读量4.3k

点赞数

分类专栏：数字图像处理机器学习文章标签：机器学习 tensorflow 图像预处理

本文链接：https://blog.csdn.net/qq_30666517/article/details/79216858

版权

机器学习同时被 2 个专栏收录

12 篇文章 2 订阅

订阅专栏

数字图像处理

3 篇文章 0 订阅

订阅专栏

本文使用tensorflow自带的图像处理工具包来预处理训练集中的图片。目的是得到更多的训练不同数据，这些数据从色度，

饱和度，亮度，对比度等角度来生成更多的训练数据，期间也包括了图像的随机翻转，最后得到的图像是一个bounding box大

小的输入图像。代码比较简单，由于也是初学tensorflow,本文参考《从零开始学TensorFlow》一书，考虑到版本的不同，所以笔

者在运行时候出现了很多错误，在网上查找了很多资料才一一解决。现将笔者归纳的关键点总结如下：

1、使用tensorflow工具箱读取图片,tf.gfile.FastGFile(）函数里，图片的路径和用opencv读取时一样，不能含有中文字符，

但是如果是利用matplotlib和scipy工具箱读取则没有这个限制。另外读取模式mode='rb',如果只设置成只读模式，可能会报错.

2、路径设置问题，具体可以参考博客点击打开链接

3、调整色度和饱和度时，必须要求输入图像是rgb三通道图像，所以要确保你的图像是24位深度。

4、设置bounding box 起始位置坐标是，必须是一个三维数组

源代码如下:

# -*- coding: utf-8 -*-
"""
Created on Wed Jan 31 12:14:42 2018

@author: Administrator
"""
# 图像预处理的完整流程
import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt

# 转换图像的色彩(包括色度，饱和度，对比度， 亮度)，不同的顺序会得到不同的结果
# 转换的顺序随机
def distort_color(input_img, color_order=0):
    # 调整色度和饱和度的图像必须要求是rgb三通道图像
    # 所以要确保你的图像位深度是24位
    if color_order == 0:
        # 随机调整色度
        img = tf.image.random_hue(input_img, 0.2)
        # 随机调整饱和度
        img = tf.image.random_saturation(img, 0.5, 1.5)
        # 随机调整对比度
        img = tf.image.random_contrast(img, 0.5, 1.5)
        # 随机调整亮度
        img = tf.image.random_brightness(img, 0.5)
    elif color_order == 1:
        # 随机调整色度
        img = tf.image.random_hue(input_img, 0.2)
        # 随机调整对比度
        img = tf.image.random_contrast(img, 0.5, 1.5)
        # 随机调整亮度
        img = tf.image.random_brightness(img, 0.5)
        # 随机调整饱和度
        img = tf.image.random_saturation(img, 0.5, 1.5)
    elif color_order == 2:
        # 随机调整饱和度
        img = tf.image.random_saturation(input_img, 0.5, 1.5)
        # 随机调整亮度
        img = tf.image.random_brightness(img, 0.5)
        # 随机调整色度
        img = tf.image.random_hue(input_img, 0.2)
        # 随机调整对比度
        img = tf.image.random_contrast(img, 0.5, 1.5)
    image = tf.clip_by_value(img, 0.0, 1.0)
    return image

# 图像预处理函数
# 输入一张解码后的图像，目标图像的尺寸以及图像上的标注框
def image_preprocessing(input_img, height, width, bbox):
    # 如果没有输入边界框，则默认整个图像都需要关注
    if bbox is None:
        bbox = tf.constant([0.0, 0.0, 1.0, 1.0],shape=(1,1,4))
    # 转换图像的数据类型
    if input_img.dtype != tf.float32:
        input_img = tf.image.convert_image_dtype(input_img, tf.float32)
    # 随机截取图像, 减少需要关注的物体的大小对识别算法的影响
    # 随机生成一个bounding box(大小一定,位置随机)
    bbox_begin, bbox_size, _ = tf.image.sample_distorted_bounding_box(
            tf.shape(input_img), bbox)
    # 得到随机截取的图像.
    distorted_img = tf.slice(input_img, bbox_begin, bbox_size)
    
    # 将随机截取的图像调整到指定大小，内插算法是随机选择的
    distorted_img = tf.image.resize_images(distorted_img, (height, width), 
                                           method = np.random.randint(4))
    # 随机左右翻转图像
    distorted_img = tf.image.random_flip_left_right(distorted_img)
    # 随机上下翻转图像
    distorted_img = tf.image.random_flip_up_down(distorted_img)
    # 随机打乱图像的色彩
    distorted_img = distort_color(distorted_img, 
                                  color_order=np.random.randint(3))
    return distorted_img
    
# 定义主函数
def main():
    # 注意路径中不能有中文字符
    picpath = './1.jpg'
    # 加载图片，并解码得到三维数组, 注意打开模式必须是rb
    raw_pic = tf.gfile.FastGFile(picpath, 'rb').read()
    # 解码得到三维数组
    raw_data = tf.image.decode_jpeg(raw_pic)
    # print(raw_data.get_shape())
    # 设置bounding box 大小
    bbox = tf.constant([0.2, 0.2, 0.8, 0.8], shape=(1,1,4))
    plt.figure(figsize=(8,6))
    with tf.Session() as sess:
        # 随机6次截取
        for i in range(6):
            plt.subplot(2,3,i+1)
            croped_img = image_preprocessing(raw_data, 256, 256, bbox)
            print(tf.shape(croped_img))
            plt.imshow(sess.run(croped_img))
            plt.axis('off')

if __name__ == '__main__':
    main()

运行结果如下所示:

AmibitionWei

关注

0
点赞
踩
7

收藏

觉得还不错? 一键收藏
1
评论
使用tensorflow预处理输入图像得到神经网络的输入层

本文使用tensorflow自带的图像处理工具包来预处理训练集中的图片。目的是得到更多的训练不同数据，这些数据从色度，饱和度，亮度，对比度等角度来生成更多的训练数据，期间也包括了图像的随机翻转，最后得到的图像是一个bounding box大小的输入图像。代码比较简单，由于也是初学tensorflow,本文参考《从零开始学TensorFlow》一书，考虑到版本的不同，所以笔者在运行时候
复制链接

扫一扫

专栏目录