修改数据集xml改变标注框位置

最新推荐文章于 2023-03-03 11:01:40 发布

Allen2401

最新推荐文章于 2023-03-03 11:01:40 发布

阅读量913

点赞数 1

分类专栏：机器学习

本文链接：https://blog.csdn.net/sinat_38439143/article/details/98854210

版权

机器学习专栏收录该内容

13 篇文章 0 订阅

订阅专栏

1.xml.dom.minidom相关知识:

1.parse返回dom对象，使用DOM的documentElement属性可以获得root Element。

DOMTree = xml.dom.minidom.parse(path) 
collection = DOMTree.documentElement

2.DOM为树形结构，每一个node 有nodeName,nodeValue和nodeValue。nodeType有：

'ATTRIBUTE_NODE'
'CDATA_SECTION_NODE'
'COMMENT_NODE'
'DOCUMENT_FRAGMENT_NODE'
'DOCUMENT_NODE'
'DOCUMENT_TYPE_NODE'
'ELEMENT_NODE'
'ENTITY_NODE'
'ENTITY_REFERENCE_NODE'
'NOTATION_NODE'
'PROCESSING_INSTRUCTION_NODE'
'TEXT_NODE'

但是nodeValue只对textNode 有效，对别的节点无效。textNode同样可以使用data属性来获得文本内容，其它节点没有data属性。

<width>zh is smart</width>

for width in collection.getElementsByTagName("width"):
    print(width.nodeValue)     #none
    print(width.firstChild.nodeValue)　　#zh is smart
    widthVal = width.firstChild.data
 #width的type为element node,没有data属性，nodeValue属性对其无效
#width.firstChild 为text node,"zh is smart",其data 属性和nodeValue属性均为"zh is smart"

3.getElementsByTagName()可以根据名字来查找elements,可以得到一个nodeList.

widths = collection.getElementByTagName("width")

4.chilsNodes可以得到所有的子nodes,其中所有的文本皆为textNode。firstChild返回第一个子节点。

5.writexml()addindent=’ ‘表示子元素的缩进，newl=’\n’表示元素间的换行，encoding='utf-8’表示生成的xml的编码格式（<?xml version="1.0" encoding="utf-8"?>）。

 f = open( 'image2.xml' , 'w') 
 DOMTree.writexml(f, addindent = ' ' , newl = '\n' ,encoding = 'utf-8' )

2.下面贴应用代码：

应用背景：制作pascal voc数据集之前忘记resize,所以只能修改xml 文件中框的位置来使其匹配resize之后的图片。

import os
import xml.dom.minidom
#resize之后的宽和高
newheight = 256
newwidth = 256　
root = "/home/danale/Desktop/six/"　　#视频文件夹
for xmlfile in os.listdir(root):
    DOMTree = xml.dom.minidom.parse(os.path.join(root, xmlfile))
    collection = DOMTree.documentElement
    for width in collection.getElementsByTagName("width"):
        oldwidth = int(width.firstChild.data)
        width.firstChild.data = str(newwidth)
    for height in collection.getElementsByTagName("height"):
        oldheight = int(height.firstChild.data)
        height.firstChild.data = str(newheight)
    for xmin in collection.getElementsByTagName("xmin"):
        xmin.firstChild.data = int(int(xmin.firstChild.data) / 1.0 / oldwidth* newwidth)
    for xmax in collection.getElementsByTagName("xmax"):
        xmax.firstChild.data = int(int(xmax.firstChild.data) / 1.0 / oldwidth * newwidth)
    for ymin in collection.getElementsByTagName("ymin"):
        ymin.firstChild.data = int(int(ymin.firstChild.data) / 1.0 / oldheight * newheight)
    for ymax in collection.getElementsByTagName("ymax"):
        ymax.firstChild.data = int(int(ymax.firstChild.data) / 1.0 / oldheight * newheight)
    with open(os.path.join(root, xmlfile), 'w') as fh:
        DOMTree.writexml(fh)
        print('写入name/pose OK!')

Allen2401

关注

1
点赞
踩
7

收藏

觉得还不错? 一键收藏
3
评论
修改数据集xml改变标注框位置

1.xml.dom.minidom相关知识:1.parse返回dom对象，使用DOM的documentElement属性可以获得root Element。DOMTree = xml.dom.minidom.parse(path) collection = DOMTree.documentElement2.DOM为树形结构，每一个node 有nodeName,nodeValue和nodeV...
复制链接

扫一扫

专栏目录