Yolov5（用labelimg制作数据集)

ABlurry_Face

已于 2022-01-25 21:36:00 修改

阅读量2k

点赞数

分类专栏： yolov5 文章标签：目标检测

于 2022-01-24 10:29:56 首次发布

本文链接：https://blog.csdn.net/weixin_50461778/article/details/122662350

版权

本文介绍了如何使用labelimg工具创建Yolov5目标检测数据集，包括建立文件夹、标注图片、转换xml为txt以及训练模型的详细步骤。在训练模型部分，涉及预训练权重的选择、数据配置文件和模型配置文件的修改，以及使用tensorboard监控训练过程。

摘要由CSDN通过智能技术生成

一、制作数据集

1.1建立存放数据集的文件夹

├── VOCdevkit

│├── VOC2007
││├── JPEGImages 存放需要打标签的图片文件
││├── Annotations 存放标注的标签文件
││├── predefined_classes.txt 定义自己要标注的所有类别

在VOC2007中cmd输入代码打开labelimg：

labelimg JPEGImages predefined_classes.txt

1.2labelimg中点击View，选择以下内容：

Auto Save mode：切换到下一张图的时候，会自动保存标签。

Display Labels：会显示标注框和标签

Advanced Mode：标注的十字架会一直悬浮在窗口。

1.3labelimg常用快捷键如下：

A：切换到上一张图片

D：切换到下一张图片

W：调出标注十字架

del ：删除标注框框

Ctrl+u：选择标注的图片文件夹

Ctrl+r：选择标注好的label标签存在的文件夹

1.4将xml转换为txt，划分训练集和验证集

修改classes类别

把代码放在文件夹VOCdevkit 同一目录下运行

import xml.etree.ElementTree as ET
import pickle
import os
from os import listdir, getcwd
from os.path import join
import random
from shutil import copyfile
 
classes = ["hat", "person"]
#classes=["ball"]
 
TRAIN_RATIO = 80
 
def clear_hidden_files(path):
    dir_list = os.listdir(path)
    for i in dir_list:
        abspath = os.path.join(os.path.abspath(path), i)
        if os.path.isfile(abspath):
            if i.startswith("._"):
                os.remove(abspath)
        else:
            clear_hidden_files(abspath)
 
def convert(size, box):
    dw = 1./size[0]
    dh = 1./size[1]
    x = (box[0] + box[1])/2.0
    y = (box[2] + box[3])/2.0
    w = box[1] - box[0]
    h = box[3] - box[2]
    x = x*dw
    w = w*dw
    y = y*dh
    h = h*dh
    return (x,y,w,h)
 
def convert_annotation(image_id):
    in_file = open('VOCdevkit/VOC2007/Annotations/%s.xml' %image_id)
    out_file = open('VOCdevkit/VOC2007/YOLOLabels/%s.txt' %image_id, 'w')
    tree=ET.parse(in_file)
    root = tree.getroot()
    size = root.find('size')
    w = int(size.find('width').text)
    h = int(size.find('height').text)
 
    for obj in root.iter('object'):
        difficult = obj.find('difficult').text
        cls = obj.find('name').text
        if cls not in classes or int(difficult) == 1:
            continue
        cls_id = classes.index(cls)
        xmlbox = obj.find('bndbox')
        b = (float(xmlbox.find('xmin').text), float(xmlbox.find('xmax').text), float(xmlbox.find('ymin').text), float(xmlbox.find('ymax').text))
        bb = convert((w,h), b)
        out_file.write(str(cls_id) + " " + " ".join([str(a) for a in bb]) + '\n')
    in_file.close()
    out_file.close()
 
wd = os.getcwd()
wd = os.getcwd()
data_base_dir = os.path.join(wd, "VOCdevkit/")
if not os.path.isdir(data_base_dir):
    os.mkdir(data_base_dir)
work_sapce_dir = os.path.join(data_base_dir, "VOC2007/")
if not os.path.isdir(work_sapce_dir):
    os.mkdir(work_sapce_dir)
annotation_dir = os.path.join(work_sapce_