1.首先得到所有的文件名称(这里获取所有的xml文件,其中xml文件名已类别+序列命名)
cd Annotations_Root
find ./ -name "*" |sort >>../ImageSets/Main/all.txt
进入标签xml的文件夹,将所有文件保存在all.txt中。
2.
import os
num = 0
for tmpline in open("./all.txt"):
num += 1
# print num
if num==1:
continue
tmp = tmpline.strip('\n')
tmp = tmp.replace(".xml","")
tmp = tmp.replace("./","")
tmpp=tmp.split('_',3)
number= int(tmpp[2])
aa = number%10
if aa==0 or aa==1:
with open('/data_2/data/train_data/ss_big_obj_object_bdf/ssd_big_obj/ImageSets/Main/test.txt', 'a') as f:
f.writelines(tmp + '\n')
elif aa==2:
with open('/data_2/data/train_data/ss_big_obj_object_bdf/ssd_big_obj/ImageSets/Main/val.txt', 'a') as f:
f.writelines(tmp + '\n')
else:
with open('/data_2/data/train_data/ss_big_obj_object_bdf/ssd_big_obj/ImageSets/Main/trainval.txt', 'a') as f:
f.writelines(tmp + '\n')
将序号尾号为0或者1的文件作为测试集;
将序号尾号为2的文件作为验证集;
将序号尾号为其它(除了0,1,2)的作为训练集。