yolov5 xml标签转pytorch的txt标签

最新推荐文章于 2024-04-15 09:27:03 发布

CodingWZP

最新推荐文章于 2024-04-15 09:27:03 发布

阅读量1.3k

点赞数

本文链接：https://blog.csdn.net/m0_37940759/article/details/117235526

版权

深度学习专栏收录该内容

2 篇文章 1 订阅

订阅专栏

xml转pytorch标签

文件结构
代码实现
转换原理

文件结构

在这里插入图片描述

代码实现

修改为自己的xml数据集和类别，代码执行完毕后会在txt文件夹下生成用于pytorch训练的.txt文件。

import os
import xml.etree.ElementTree as ET


def ConverCoordinate(imgshape, bbox):
    # 将xml像素坐标转换为txt归一化后的坐标
    xmin, xmax, ymin, ymax = bbox
    width = imgshape[0]
    height = imgshape[1]
    dw = 1. / width
    dh = 1. / height
    x = (xmin + xmax) / 2.0
    y = (ymin + ymax) / 2.0
    w = xmax - xmin
    h = ymax - ymin

    # 归一化
    x = x * dw
    y = y * dh
    w = w * dw
    h = h * dh

    return (x,y,w,h)

def readxml(image_set, filename):
    outfile = open('{}/txt/{}.txt'.format(image_set, filename), 'w')
    filetree = ET.parse('{}/Annotations/{}.xml'.format(image_set, filename))
    root = filetree.getroot()
    size = root.find('size')
    width = int(size.find('width').text)
    height = int(size.find('height').text)
    imgshape = (width, height)

    for obj in root.findall('object'):
        # 获取类别名，判断是否在classes中，不存在则跳过。
        obj_name = obj.find('name').text
        if obj_name not in classes:
            continue
        obj_id = classes.index(obj_name)
        # 获取每个obj的bbox框的左上和右下坐标
        bbox = obj.find('bndbox')
        xmin = float(bbox.find('xmin').text)
        xmax = float(bbox.find('xmax').text)
        ymin = float(bbox.find('ymin').text)
        ymax = float(bbox.find('ymax').text)
        bbox_coor = (xmin, xmax, ymin, ymax)

        txtvalue = ConverCoordinate(imgshape, bbox_coor)
        outfile.write('{}'.format(obj_id) + ' ' + ' '.join([str(i) for i in txtvalue]) + '\n')


if __name__ == '__main__':
    # 超参数
    image_set = 'train'
    classes = ['person', 'root']

    # 配置JPEG文件路径
    localdir = os.getcwd()
    datasetdir = os.path.join(localdir, image_set)
    JPEGImagefiledir = os.path.join(datasetdir, 'JPEGImages')
    for filename in os.listdir(JPEGImagefiledir):
        readxml(image_set, filename[:-4])

转换原理

参照yolov5的官网数据转换文档https://github.com/ultralytics/yolov5/wiki/Train-Custom-Data
wXzM3OTQwNzU5,size_16,color_FFFFFF,t_70)

CodingWZP

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
yolov5 xml标签转pytorch的txt标签

xml转pytorch标签文件结构代码实现转换原理文件结构代码实现修改为自己的xml数据集和类别，代码执行完毕后会在txt文件夹下生成用于pytorch训练的.txt文件。import xml.etree.ElementTree as ETimport osimage_set = 'train'classes = ['person']def convert(size, box): dw = 1. / size[0] dh = 1. / size[1] x =
复制链接

扫一扫