VOC数据集转 YOLO数据集

zmysunshine

已于 2023-04-18 19:25:37 修改

阅读量233

点赞数

文章标签： YOLO 深度学习目标检测

于 2023-04-18 19:22:55 首次发布

本文链接：https://blog.csdn.net/qq_46542320/article/details/130229165

版权

该文章提供了一段Python代码，用于将VOC数据集的XML标注文件转换成YOLO数据集所需的TXT格式。代码处理了类名映射，并按YOLO格式要求计算并输出边界框坐标。适用于图像识别模型训练过程中的数据预处理。

摘要由CSDN通过智能技术生成

VOC数据集的格式为 .xml

YOLO数据集的格式为 .txt

此博客讲解 VOC格式转 YOLO格式

直接复制下列代码即可转换成功，亲测有效。

import os.path
import xml.etree.ElementTree as ET
import os
import random
# class_names = ['palm', 'stone', 'scissor', 'awesome', 'heartB', 'OK', 'ROCK', 'one', 'swear', 'thanks', 'heartA',
#                'heartC', 'good', 'bad', 'pray', 'call', 'take_picture', 'salute']
class_names = ['aeroplane', 'bicycle', 'bird', 'boat', 'bottle', 'bus', 'car', 'cat', 'chair',
               'cow', 'diningtable', 'dog', 'horse', 'motorbike', 'person', 'pottedplant',
               'sheep', 'sofa', 'train', 'tvmonitor']
xmlpath = 'D:/pycharm/PythonProject/datasets/VOC/images/VOCtest_06-Nov-2007/VOCdevkit/VOC2007/Annotations/'  # 原xml路径
txtpath = 'D:/pycharm/PythonProject/datasets/VOC/images/VOCtest_06-Nov-2007/VOCdevkit/VOC2007/labels_copy/'  # 转换后txt文件存放路径
if not os.path.exists(txtpath):
    os.makedirs(txtpath)
files = []

for root, dirs, files in os.walk(xmlpath):
    None

number = len(files)
print(number)
i = 0
while i < number:

    name = files[i][0:-4]
    xml_name = name + ".xml"
    txt_name = name + ".txt"
    xml_file_name = xmlpath + xml_name
    txt_file_name = txtpath + txt_name

    xml_file = open(xml_file_name)
    tree = ET.parse(xml_file)
    root = tree.getroot()
    filename = root.find('filename').text

    image_name = root.find('filename').text
    w = int(root.find('size').find('width').text)
    h = int(root.find('size').find('height').text)

    f_txt = open(txt_file_name, 'w+')
    content = ""

    first = True

    for obj in root.iter('object'):

        name = obj.find('name').text
        class_num = class_names.index(name)

        xmlbox = obj.find('bndbox')

        x1 = int(xmlbox.find('xmin').text)
        x2 = int(xmlbox.find('xmax').text)
        y1 = int(xmlbox.find('ymin').text)
        y2 = int(xmlbox.find('ymax').text)

        if first:
            content += str(class_num) + " " + \
                       str((x1 + x2) / 2 / w) + " " + str((y1 + y2) / 2 / h) + " " + \
                       str((x2 - x1) / w) + " " + str((y2 - y1) / h)
            first = False
        else:
            content += "\n" + \
                       str(class_num) + " " + \
                       str((x1 + x2) / 2 / w) + " " + str((y1 + y2) / 2 / h) + " " + \
                       str((x2 - x1) / w) + " " + str((y2 - y1) / h)

    # print(str(i / (number - 1) * 100) + "%\n")
    print(content)
    f_txt.write(content)
    f_txt.close()
    xml_file.close()
    i += 1

print("done!")

运行上述代码，即可转换成功！如下图所示，直截取了部分。

温馨小提示：代码中的路径记得更改为自己的路径哦！

加油！每天学会一点点！！

zmysunshine

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
打赏
0
评论
VOC数据集转 YOLO数据集

运行上述代码，即可转换成功！如下图所示，直截取了部分。此博客讲解 VOC格式转 YOLO格式。直接复制下列代码即可转换成功，亲测有效。温馨小提示：记得更改为自己的路径哦！YOLO数据集的格式为 .txt。VOC数据集的格式为 .xml。
复制链接

扫一扫