利用keras进行样本扩充

最新推荐文章于 2024-06-04 09:47:08 发布

背包客(wyq)

最新推荐文章于 2024-06-04 09:47:08 发布

阅读量2.8k

点赞数 2

分类专栏：深度学习文章标签：样本扩充 python

本文链接：https://blog.csdn.net/weixin_39954229/article/details/78305395

版权

深度学习专栏收录该内容

5 篇文章 0 订阅

订阅专栏

1.首先安装tensorflow和keras(建议通过conda进行安装)，网上有好多教程，以下面的教程为例

http://www.linuxidc.com/Linux/2016-07/133214.htm

2.将要扩充的样本放在data目录下的train文件夹下,并在data文件夹下新建一个preview的文件夹用来存放扩充后的样本，具体目录如下：

test/data/preview <and> train

train目录下为需要扩充的样本文件夹：如bus,flower,horse等等

3.其具体代码如下，只有30多行

empty#-*- coding:utf-8 -*-
import os
from keras.preprocessing.image import ImageDataGenerator, array_to_img, img_to_array, load_img

datagen = ImageDataGenerator(
        rotation_range=40,
        width_shift_range=0.2,
        height_shift_range=0.2,
        shear_range=0.2,
        zoom_range=0.2,
        horizontal_flip=True,
        fill_mode='nearest')
def expand(lable):
	j = 1
	if not os.path.isdir('/home/wyq/test/data/preview/'+lable): #判断preview目录下是否存在××该文件夹，
		os.mkdir('/home/wyq/test/data/preview/'+lable) #若不存在则创建一个文件夹来保存扩充后的样本
	for file_name in os.listdir('/home/wyq/test/data/train/'+lable): #要扩充的图片所在目录
		img = load_img('/home/wyq/test/data/train/'+lable+'/'+file_name) #this is a PIL image
		x = img_to_array(img)  # this is a Numpy array with shape (3,150,150)
		x = x.reshape((1,) + x.shape)  # this is a Numpy array with shape (1,3,150,150)
		
		i = 1
		for batch in datagen.flow(x, batch_size=1,
					 save_to_dir='/home/wyq/test/data/preview/'+lable, save_prefix=lable, save_format='jpg'):#设置扩充后的样本保存位置及属性
			i += 1
			if i > 10:  #每张图片扩充10张
				break  # otherwise the generator would loop indefinitely
		j +=1
		if j>100: #每个文件夹中有100张图片，故遍历100次
			break

expand("bus")# bus为车的图片的文件夹名称
expand("dinosaur")#dinosaur为恐龙图片的文件夹名称
expand("elephant")#同上
expand("flower")#同上
expand("horse")#同上