python对PASCAL VOC标注数据进行统计

最新推荐文章于 2024-02-26 10:06:47 发布

竹子熊猫

最新推荐文章于 2024-02-26 10:06:47 发布

阅读量1.1k

点赞数 3

分类专栏： python

本文链接：https://blog.csdn.net/summermaoz/article/details/79815179

版权

python 专栏收录该内容

63 篇文章 1 订阅

订阅专栏

用于统计训练数据中的类别，以及所有目标的个数：

# coding:utf-8
import xml.etree.cElementTree as ET
import os
from collections import Counter
import shutil

# Counter({'towCounter({'tower': 3074, 'windpower': 2014, 'thermalpower': 689, 'hydropower': 261, 'transformer': 225})
# total_num: 6263

def count(pathdir,despath):
	category = []
	path = pathdir + '/XML/'
	for index,xml in enumerate(os.listdir(path)):
		# print(str(index) + ' xml: '+ xml)
		root = ET.parse(os.path.join(path, xml))
		objects = root.findall('object')
		
		# ==================select images which has a special object=============
		for obj in objects:
			obj_label = obj.find('name').text
			if obj_label == 'transformer':
				print(xml)
				imgfile = pathdir + 'JPEG/' + xml.replace('xml', 'jpg')
				img_despath = despath + xml.replace('xml', 'jpg')
				# if not os.path.exists(img_despath):
				shutil.copyfile(imgfile, img_despath)

		# ==================select images which has a special object=============

		category += [ob.find('name').text for ob in objects]
	print(Counter(category))
	total_num = sum([value for key, value in Counter(category).items()])
	print('total_num:',total_num)

if __name__ == '__main__':
	# pathdirs = list(set(os.listdir('./')) ^ set(['tools','count.py']))
	# print(pathdirs)
	# for pathdir in pathdirs:
	pathdir = '/summer/Desktop/power_traindata/'
	despath = '/transformer/'
	count(pathdir,despath)

竹子熊猫

关注

3
点赞
踩
7

收藏

觉得还不错? 一键收藏
0
评论
python对PASCAL VOC标注数据进行统计

用于统计训练数据中的类别，以及所有目标的个数：# coding:utf-8import xml.etree.cElementTree as ETimport osfrom collections import Counterimport shutil# Counter({'towCounter({'tower': 3074, 'windpower': 2014, 'thermalpow...
复制链接

扫一扫

专栏目录