TensorFlow 对数据集标记的xml文件解析记录

ʚVVcatɞ

已于 2022-01-22 23:29:22 修改

阅读量1.8k

点赞数

分类专栏：深度学习文章标签： tensorflow xml python

于 2021-12-22 00:38:03 首次发布

本文链接：https://blog.csdn.net/qq_44989881/article/details/122076000

版权

深度学习专栏收录该内容

29 篇文章 7 订阅

订阅专栏

环境

Windows：10
Python 3.7.10
TensorFlow：2.3
matplotlib：3.3.4
lxml：4.7.1

最近要用TensorFlow做20种水果识别，对刚入手的数据集，开始对数据集进行检验。

原图如下：
在这里插入图片描述
以下是通过精灵标注助手生成的xml 文件

<?xml version="1.0" ?>
<annotation>
<folder>菠萝</folder>
<filename>pineapple.jpg</filename>
<path>C:\Users\Desktop\pineapple.jpg</path>
<source>
    <database>Unknown</database>
</source>
<size>
    <width>730</width>
    <height>413</height>
    <depth>3</depth>
</size>

<segmented>0</segmented>
    <object>
    <name>菠萝</name>
    <pose>Unspecified</pose>
    <truncated>0</truncated>
    <difficult>0</difficult>
    <bndbox>
        <xmin>125</xmin>
        <ymin>112</ymin>
        <xmax>543</xmax>
        <ymax>400</ymax>
    </bndbox>
</object>
    <object>
    <name>菠萝</name>
    <pose>Unspecified</pose>
    <truncated>0</truncated>
    <difficult>0</difficult>
    <bndbox>
        <xmin>547</xmin>
        <ymin>97</ymin>
        <xmax>721</xmax>
        <ymax>390</ymax>
    </bndbox>
</object>
</annotation>

安装 matplotlib

pip install matplotlib

安装 lxml

pip install lxml

通过以下代码将xml中绘画的矩形框显示到图片中。

import tensorflow as tf
import matplotlib.pyplot as plt
from lxml import etree
from matplotlib.patches import Rectangle  # 绘制矩形框

img = tf.io.read_file(r'./pineapple.jpg')

img = tf.image.decode_jpeg(img)  # 对图像进行解码
print(img.shape)
plt.imshow(img)
plt.show()

xml = open(r'./pineapple.xml', encoding='utf-8').read()  # 读取 xml文件
sel = etree.HTML(xml)  # 对 xml 文件进行解析
width = sel.xpath('//size/width/text()')[0]  # 获取图片的宽
height = sel.xpath('//size/height/text()')[0]  # 获取图片的高
bndbox = sel.xpath('//bndbox')
ax = plt.gca()  # 获取当前图像
for i in range(0, len(bndbox)):
    xmin = sel.xpath('//bndbox/xmin/text()')[i]
    ymin = sel.xpath('//bndbox/ymin/text()')[i]
    xmax = sel.xpath('//bndbox/xmax/text()')[i]
    ymax = sel.xpath('//bndbox/ymax/text()')[i]
    xmin = int(xmin)
    ymin = int(ymin)
    xmax = int(xmax)
    ymax = int(ymax)
    plt.imshow(img.numpy())
    rect = Rectangle((xmin, ymin), (xmax - xmin), (ymax - ymin), fill=False, color='red')  # fill=False 不需要填充
    ax.axes.add_patch(rect)  # 添加矩形框
plt.show()

还原出入手的数据集用精灵标注助手标记的效果如下：
在这里插入图片描述

由于发现数据集中有多边形和矩形框数据混合，所以通过以下代码区分开来
在这里插入图片描述
以上xml文件一个一个的点开查看比较麻烦，用以下代码进行处理查看：

import os

try:
    import xml.etree.cElementTree as ET
except ImportError:
    import xml.etree.ElementTree as ET

# xml文件路径
txt_path = 'C:\\Users\\vvcat\\Desktop\\xxx\\xxxxx\\outputs2\\'

for txt_file in os.listdir(txt_path):
    txt_name = os.path.splitext(txt_file)[0]  #获取文件名
    txt_suffix = os.path.splitext(txt_file)[1]  # 获取后缀
    # print(txt_name, txt_suffix)
    file_name_path = txt_path + txt_name + txt_suffix
    root = ET.parse(file_name_path)
    bndboxs = root.getiterator("bndbox")

    if bndboxs == []:
        print(txt_name + txt_suffix)   # 打印包含多边形框的xml文件

效果如下：
在这里插入图片描述
打开A（1）.xml文件，内容如下：

通过以下代码批量将xml中绘画的矩形框显示到图片中，并保存成新的图片。

import tensorflow as tf
import matplotlib.pyplot as plt
from lxml import etree
from matplotlib.patches import Rectangle  # 绘制矩形框
import glob
import os

images = glob.glob('./inputs/*.jpg')
xmls = glob.glob('./outputs/*.xml')
xmls_names = [x.split('\\')[-1].split('.xml')[0] for x in xmls]
images_names = [x.split('\\')[-1].split('.jpg')[0] for x in images]
names = list(set(images_names) & set(xmls_names))
imgs = [img for img in images if img.split('\\')[-1].split('.jpg')[0] in names]  #根据名称排序
imgs.sort(key=lambda x: x.split('\\')[-1].split('.jpg')[0])
xmls.sort(key=lambda x: x.split('\\')[-1].split('.xml')[0])

dstfile = './output_image/'
fpath = os.path.dirname(dstfile)  # 获取文件路径
if not os.path.exists(fpath):
    os.makedirs(fpath)  # 没有就创建路径
images_names = ''

for i in range(0, len(xmls)):
    img = tf.io.read_file(imgs[i])
    img = tf.image.decode_jpeg(img)  # 对图像进行解码
    xml = open(xmls[i], encoding='utf-8').read()  # 读取 xml文件
    sel = etree.HTML(xml)  # 对 xml 文件进行解析
    width = sel.xpath('//size/width/text()')[0]  # 获取图片的宽
    height = sel.xpath('//size/height/text()')[0]  # 获取图片的高
    bndbox = sel.xpath('//bndbox')
    ax = plt.gca()  # 获取当前图像
    for j in range(0, len(bndbox)):
        xmin = sel.xpath('//bndbox/xmin/text()')[j]
        ymin = sel.xpath('//bndbox/ymin/text()')[j]
        xmax = sel.xpath('//bndbox/xmax/text()')[j]
        ymax = sel.xpath('//bndbox/ymax/text()')[j]
        xmin = int(xmin)
        ymin = int(ymin)
        xmax = int(xmax)
        ymax = int(ymax)
        plt.imshow(img.numpy())
        rect = Rectangle((xmin, ymin), (xmax - xmin), (ymax - ymin), fill=False, color='red')  # fill=False 不需要填充
        ax.axes.add_patch(rect)  # 添加矩形框
    images_names = imgs[i].split('\\')[-1]
    plt.savefig(dstfile + images_names)
    # plt.show()
    plt.close()

ʚVVcatɞ

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
TensorFlow 对数据集标记的xml文件解析记录

环境Windows：10Python 3.7.10TensorFlow：2.3matplotlib：3.3.4lxml：4.7.1原图以下是通过精灵标记助手生成的xml 文件<?xml version="1.0" ?><annotation><folder>菠萝</folder><filename>pineapple.jpg</filename><path>C:\Users\Desktop\pi
复制链接

扫一扫

专栏目录