使用imgaug进行增强,项目地址:https://github.com/aleju/imgaug
由于在车牌识别中使用的增强方法有限,因此直接使用作者给的例子即可 https://imgaug.readthedocs.io/en/latest/source/examples_basics.html#a-standard-use-case
import imgaug as ia
from imgaug import augmenters as iaa
import numpy as np
from PIL import Image
ia.seed(1)
# Example batch of images.
# The array has shape (32, 64, 64, 3) and dtype uint8.
images = np.array(
[Image.open('path/to/image') for _ in range(32)],
dtype=np.uint8
)
seq = iaa.Sequential([
iaa.Fliplr(0.5), # horizontal flips
iaa.Crop(percent=(0, 0.1)), # random crops
# Small gaussian blur with random sigma between 0 and 0.5.
# But we only blur about 50% of all images.
iaa.Sometimes(0.5,
iaa.GaussianBlur(sigma=(0, 0.5))
),
# Strengthen or weaken the contrast in each image.
iaa.ContrastNormalization((0.75, 1.5)),
# Add gaussian noise.
# For 50% of all images, we sample the noise once per pixel.
# For the other 50% of all images, we sample the noise per pixel AND
# channel. This can change the color (not only brightness) of the
# pixels.
iaa.AdditiveGaussianNoise(loc=0, scale=(0.0, 0.05*255), per_channel=0.5),
# Make some images brighter and some darker.
# In 20% of all cases, we sample the multiplier once per channel,
# which can end up changing the color of the images.
iaa.Multiply((0.8, 1.2), per_channel=0.2),
# Apply affine transformations to each image.
# Scale/zoom them, translate/move them, rotate them and shear them.
iaa.Affine(
scale={
"x": (0.8, 1.2), "y": (0.8, 1.2)},
translate_percent={
"x": (-0.2, 0.2), "y": (-0.2, 0.2)},
rotate=(-25, 25),
shear=(-8, 8)
)
], random_order=True) # apply augmenters in random order
images_aug = seq.augment_images(images)
for i in range(len(images_aug)):
Image.fromarray(images_aug[i]).show()
将一张图片迭代32次,组成一个numpy进行处理。
由于在车牌识别中需要标定车牌的位置,因此在数据增强时需要将标定点也一起进行处理
# 对标记数据进行处理,获取车牌四个点坐标的位置
with open('index.yaml', 'r') as f:
pos = f.readlines()[3].split(':')[1].split()
x1 = int(