飞桨深度学习实践YOLOAI识虫

最新推荐文章于 2023-12-14 11:58:45 发布

yao_qian_shu

最新推荐文章于 2023-12-14 11:58:45 发布

阅读量1.3k

点赞数 1

分类专栏：深度学习文章标签： python 深度学习机器学习

本文链接：https://blog.csdn.net/yao_qian_shu/article/details/108291286

版权

该博客介绍了使用飞桨进行深度学习实践，特别是针对AI识虫任务。内容包括处理2183张图片的数据集，其中包含7种昆虫的训练、验证和测试集。首先，将昆虫类别字符串转换为数字，以便神经网络处理。接着，详细讨论了数据预处理的步骤，如随机调整亮度、对比度、颜色，随机填充、裁剪、缩放和翻转，以及真实框的随机排列，这些方法有助于图像增广，提升模型泛化能力。

摘要由CSDN通过智能技术生成

1.数据集处理部分
读取AI识虫数据集标注信息
AI识虫数据集结构如下：

提供了2183张图片，其中训练集1693张，验证集245，测试集245张。
包含7种昆虫，分别是Boerner、Leconte、Linnaeus、acuminatus、armandi、coleoptera和linnaeus。
包含了图片和标注，请读者先将数据解压，并存放在insects目录下。


# 解压数据脚本，第一次运行时打开注释，将文件解压到work目录下
!unzip -d /home/aistudio/work /home/aistudio/data/data19638/insects.zip

数据集中读取xml文件，将每张图片的标注信息读取出来。在读取具体的标注文件之前，我们先完成一件事情，就是将昆虫的类别名字（字符串）转化成数字表示的类别。因为神经网络里面计算时需要的输入类型是数值型的，所以需要将字符串表示的类别转化成具体的数字。昆虫类别名称的列表是：[‘Boerner’, ‘Leconte’, ‘Linnaeus’, ‘acuminatus’, ‘armandi’, ‘coleoptera’, ‘linnaeus’]，这里我们约定此列表中：'Boerner’对应类别0，'Leconte’对应类别1，…，'linnaeus’对应类别6。使用下面的程序可以得到表示名称字符串和数字类别之间映射关系的字典。

INSECT_NAMES = ['Boerner', 'Leconte', 'Linnaeus', 
                'acuminatus', 'armandi', 'coleoptera', 'linnaeus']

def get_insect_names():
    """
    return a dict, as following,
        {'Boerner': 0,
         'Leconte': 1,
         'Linnaeus': 2, 
         'acuminatus': 3,
         'armandi': 4,
         'coleoptera': 5,
         'linnaeus': 6
        }
    It can map the insect name into an integer label.
    """
    insect_category2id = {
   }
    for i, item in enumerate(INSECT_NAMES):
        insect_category2id[item] = i

    return insect_category2id`

cname2cid = get_insect_names()
cname2cid

数据读取和预处理

### 数据读取
import cv2

def get_bbox(gt_bbox, gt_class):
    # 对于一般的检测任务来说，一张图片上往往会有多个目标物体
    # 设置参数MAX_NUM = 50， 即一张图片最多取50个真实框；如果真实
    # 框的数目少于50个，则将不足部分的gt_bbox, gt_class和gt_score的各项数值全设置为0
    MAX_NUM = 50
    gt_bbox2 = np.zeros((MAX_NUM, 4))
    gt_class2 = np.zeros((MAX_NUM,))
    for i in range(len(gt_bbox)):
        gt_bbox2[i, :] = gt_bbox[i, :]
        gt_class2[i] = gt_class[i]
        if i >= MAX_NUM:
            break
    return gt_bbox2, gt_class2

def get_img_data_from_file(record):#读取真实框的信息
    """
    record is a dict as following,
      record = {
            'im_file': img_file,
            'im_id': im_id,
            'h': im_h,
            'w': im_w,
            'is_crowd': is_crowd,
            'gt_class': gt_class,
            'gt_bbox': gt_bbox,
            'gt_poly': [],
            'difficult': difficult
            }
    """
    im_file = record['im_file']
    h = record['h']
    w = record['w']
    is_crowd = record['is_crowd']
    gt_class = record['gt_class']
    gt_bbox = record['gt_bbox']
    difficult = record['difficult']

    img = cv2.imread(im_file)#opencv2读取文件路径
    img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)#把文件BGR格式转为RGB格式

    # check if h and w in record equals that read from img
    assert img.shape[0] == int(h), \
             "image height of {} inconsistent in record({}) and img file({})".format(
               im_file, h, img.shape[0])#看读取出来的数据是否和标注的信息是否一致，进行高的检查

    assert img.shape[1] == int(w), \
             "image width of {} inconsistent in record({}) and img file({})".format(
               im_file, w, img.shape[1])#看读取出来的数据是否和标注的信息是否一致，进行宽的检查

    gt_boxes, gt_labels = get_bbox(gt_bbox, gt_class)

    # gt_bbox 用相对值
    gt_boxes[:, 0] = gt_boxes[:, 0] / float(w)
    gt_boxes[:, 1] = gt_boxes[:, 1] / float(h)
    gt_boxes[:, 2] = gt_boxes[:, 2] / float(w)
    gt_boxes[:, 3] = gt_boxes[:, 3] / float(h)
  
    return img, gt_boxes, gt_labels, (h, w)

数据预处理
在计算机视觉中，通常会对图像做一些随机的变化，产生相似但又不完全相同的样本。主要作用是扩大训练数据集，抑制过拟合，提升模型的泛化能力，常用的方法见下面的程序。
随机改变亮暗、对比度和颜色等`

import numpy as np
import cv2
from PIL import Image, ImageEnhance
import random

# 随机改变亮暗、对比度和颜色等
def random_distort(img):
    # 随机改变亮度
    def random_brightness(img, lower=0.5

最低0.47元/天解锁文章

yao_qian_shu

关注

1
点赞
踩
8

收藏

觉得还不错? 一键收藏
打赏
2
评论
飞桨深度学习实践YOLOAI识虫

1.数据集处理部分读取AI识虫数据集标注信息AI识虫数据集结构如下：提供了2183张图片，其中训练集1693张，验证集245，测试集245张。包含7种昆虫，分别是Boerner、Leconte、Linnaeus、acuminatus、armandi、coleoptera和linnaeus。包含了图片和标注，请读者先将数据解压，并存放在insects目录下。# 解压数据脚本，第一次运行时打开注释，将文件解压到work目录下!unzip -d /home/aistudio/work /home/
复制链接

扫一扫