Configuring the SSD Model and Fixing Related Errors

_Endymion

Last modified: 2024-05-09 12:38:08

Tags: python, pytorch, deep learning, object detection, computer vision

First published: 2023-06-22 09:14:41

Article link: https://blog.csdn.net/C2580390720/article/details/131337967

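I. About the code

(1) This repository contains a PyTorch SSD model that I configured for my own project. I ran into many errors while getting it to run, so I set up the repository to share that experience. It fixes some of the code that is likely to cause bugs and collects the blog posts and discussion threads that solve the remaining problems. The configured model is open source: https://github.com/Charlie020/ssd.pytorch_configured

(2) The original SSD source code and its author's repository: https://github.com/amdegroot/ssd.pytorch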

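II. Configuring the model and fixing problems that may come up

You do not need to complete every step below; read only the ones you need.

1. Training

(1) Configure SSD: follow the part of https://blog.csdn.net/m0_47452894/article/details/112783858 that comes before running eval.py. Once that is done, run train.py and check whether it runs successfully.

(2) Finish the SSD setup: https://blog.csdn.net/dear_queen/article/details/114301614 covers the error in which the target machine actively refuses the connection. The fix is to start visdom in a terminal with the command: python -m visdom.server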
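
The "actively refused" error means nothing is listening on the visdom port when train.py tries to connect. Below is a minimal sketch for checking this before training. It assumes visdom is running on its default address, localhost:8097; the helper name `server_is_up` is hypothetical and not part of the repository.

```python
import socket


def server_is_up(host="localhost", port=8097, timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError)
        # when no server is listening on the given address.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

If `server_is_up()` returns False, start the server with `python -m visdom.server` in a separate terminal before launching train.py.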

(3) To fix "Expected a 'cuda' device type for generator but found 'cpu'", see https://github.com/amdegroot/ssd.pytorch/issues/561

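(4) If your dataset is already split in VOC format, skip this step. Otherwise, open make.py under data/VOCdevkit/VOC2007, set the split ratios in the code (the default is train : val : test = 6 : 2 : 2), and run it. The script generates trainval.txt, test.txt, train.txt and val.txt under ImageSets/Main, which tell the model which images are used for training, validation and testing.

The VOC dataset inside the data folder is laid out as follows (Annotations holds all the label files of the dataset, and JPEGImages holds all the images):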

--data
|   --VOCdevkit
|         --VOC2007
|               --Annotations
|                     --1.xml,2.xml,....
|               --JPEGImages
|                     --1.jpg,2.jpg,....
|               --ImageSets
|                     --Main
|                           --test.txt        # test set; each of these files stores sample indices such as 1,2,3,4...; appending .xml or .jpg gives the label and image file names
|                           --train.txt       # training set
|                           --trainval.txt    # training plus validation
|                           --val.txt         # validation set
|               --debug.py
|               --make.py
|               --clear.py
|   ······ # other files
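
The split described in (4) can be sketched as follows. This is not the repository's make.py, only an illustration of the idea under stated assumptions: sample ids are taken from the file names in Annotations, and the function names `split_ids` and `write_split_files` are hypothetical.

```python
import random
from pathlib import Path


def split_ids(ids, train=0.6, val=0.2, seed=0):
    """Shuffle ids and split them into train / val / test lists."""
    ids = sorted(ids)
    random.Random(seed).shuffle(ids)  # fixed seed keeps the split reproducible
    n_train = int(len(ids) * train)
    n_val = int(len(ids) * val)
    return ids[:n_train], ids[n_train:n_train + n_val], ids[n_train + n_val:]


def write_split_files(voc_dir, train=0.6, val=0.2, seed=0):
    """Write train/val/trainval/test.txt under <voc_dir>/ImageSets/Main."""
    voc_dir = Path(voc_dir)
    ids = [p.stem for p in (voc_dir / "Annotations").glob("*.xml")]
    tr, va, te = split_ids(ids, train, val, seed)
    out = voc_dir / "ImageSets" / "Main"
    out.mkdir(parents=True, exist_ok=True)
    for name, subset in (("train", tr), ("val", va),
                         ("trainval", tr + va), ("test", te)):
        # One sample id per line, without a file extension.
        (out / f"{name}.txt").write_text("".join(f"{i}\n" for i in subset))
```

Calling `write_split_files("data/VOCdevkit/VOC2007")` would produce the four index files with the default 6 : 2 : 2 ratio.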

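(5) Fixing the error:

img, boxes, labels = self.transform(img, target[:, :4], target[:, 4])
IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed

This error comes from the dataset. The most likely cause is that the XML file of an image used as a background sample contains no object element.
Solution: https://github.com/amdegroot/ssd.pytorch/issues/224. The code from that thread is saved in the repository as data/VOCdevkit/VOC2007/debug.py, in the same directory as the Annotations and JPEGImages folders of the VOC2007 dataset. Run it; whenever it stops at an XML file, delete or fix that XML file and its image, and also delete the corresponding index from train.txt, val.txt or trainval.txt, otherwise you will hit the error in (6). Once debug.py no longer prints "INDEX ERROR HERE !", train.py will no longer raise this error.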
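
As an alternative way to locate the offending files, the annotations can be scanned directly for XML files without any object element. This is a minimal sketch, not the code of debug.py; it assumes standard VOC annotations whose objects are direct `<object>` children of the root, and the function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def has_objects(root):
    """Return True if the annotation root has at least one <object> child."""
    return root.find("object") is not None


def find_empty_annotations(annotations_dir):
    """List the ids (file stems) of annotation files without any <object>."""
    return sorted(
        p.stem
        for p in Path(annotations_dir).glob("*.xml")
        if not has_objects(ET.parse(p).getroot())
    )
```

The ids returned by `find_empty_annotations("data/VOCdevkit/VOC2007/Annotations")` are the candidates to delete or fix.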

(6) Fixing the error:
source = open(source, "rb") FileNotFoundError: [Errno 2] No such file or directory: 'E:\\...\\ssd.pytorch-master\\data\\VOCdevkit\\VOC2007\\Annotations\\00077.xml'
This happens when the XML file and image were deleted in (5) but the corresponding index is still listed in train.txt, val.txt or trainval.txt. Delete that index from those files.
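
Removing stale indices by hand is error-prone when many files were deleted. The sketch below does it automatically, assuming the VOC layout shown in (4), with images stored as .jpg; the function name `prune_index_file` is hypothetical.

```python
from pathlib import Path


def prune_index_file(index_file, voc_dir):
    """Drop ids whose .xml or .jpg no longer exists; return the removed ids."""
    index_file, voc_dir = Path(index_file), Path(voc_dir)
    kept, removed = [], []
    for line in index_file.read_text().splitlines():
        image_id = line.strip()
        if not image_id:
            continue  # skip blank lines
        xml = voc_dir / "Annotations" / f"{image_id}.xml"
        jpg = voc_dir / "JPEGImages" / f"{image_id}.jpg"
        (kept if xml.exists() and jpg.exists() else removed).append(image_id)
    # Rewrite the index file with only the ids that still have both files.
    index_file.write_text("".join(f"{i}\n" for i in kept))
    return removed
```

Run it once for each of train.txt, val.txt, trainval.txt and test.txt under ImageSets/Main.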

(7) clear.py fixes mismatches between the number of images and the number of labels.
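
The implementation of clear.py is not shown in this post. As a rough illustration of how such a mismatch can be detected, the sketch below compares the file stems in Annotations and JPEGImages; it assumes .xml labels and .jpg images, and the function name `find_unmatched` is hypothetical.

```python
from pathlib import Path


def find_unmatched(voc_dir):
    """Return (images without a label, labels without an image) as id lists."""
    voc_dir = Path(voc_dir)
    xml_ids = {p.stem for p in (voc_dir / "Annotations").glob("*.xml")}
    jpg_ids = {p.stem for p in (voc_dir / "JPEGImages").glob("*.jpg")}
    return sorted(jpg_ids - xml_ids), sorted(xml_ids - jpg_ids)
```

Both lists are empty when every image has a label file and vice versa.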

In general, once the steps above are done, you can start training.

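2. Testing

First apply the eval.py changes described in the post linked in (1) of the training section, then run eval.py. If it fails, it may raise TypeError: forward() takes 4 positional arguments but 9 were given. If you hit this error, make the changes below.

(1) Modify eval.py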

# Line 39: replace the default model file with your own trained model file
parser.add_argument('--trained_model',
                    default='weights/ssd300_VOC_20000.pth', type=str,
                    help='Trained state_dict file path to open')

# Change line 71 from
imgsetpath = os.path.join(args.voc_root, 'VOC2007', 'ImageSets','Main', '{:s}.txt')
# to
imgsetpath = os.path.join(args.voc_root, 'VOC2007', 'ImageSets', 'Main') + os.sep + '{:s}.txt'
# Without this change, imgsetpath may not be assembled correctly, which causes an error
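
The modified line builds a path template that is later filled in with the name of an image set. A small standalone sketch of that behaviour follows; the value of `voc_root` is an assumption standing in for `args.voc_root`.

```python
import os

# Assumed stand-in for args.voc_root in eval.py.
voc_root = os.path.join("data", "VOCdevkit")

# Directory part is joined normally; the '{:s}.txt' template is appended
# with an explicit separator, as in the modified line above.
imgsetpath = os.path.join(voc_root, "VOC2007", "ImageSets", "Main") + os.sep + "{:s}.txt"

# Filling in the image-set name yields the path of the index file to read.
test_list = imgsetpath.format("test")
print(test_list)
```

Printing the formatted path is a quick way to confirm that it points at an existing file under ImageSets/Main before running the evaluation.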

(2) Modify ssd.py as follows:

# Around line 51
        if phase == 'test':
            self.softmax = nn.Softmax(dim=-1)  # softmax over the class dimension
            self.detect = Detect(num_classes, 0, 200, 0.25, 0.45)  # same arguments as the test-time code in note 3


# Around line 120
        if self.phase == "test":
            output = self.detect.forward(
                loc.view(loc.size(0), -1, 4),                   # loc preds
                self.softmax(conf.view(conf.size(0), -1,
                             self.num_classes)),                # conf preds
                self.priors.type(type(x.data))                  # default boxes
            )

After these changes, check whether eval.py runs. If it still fails, the items below may help.

(3) You may hit "RuntimeError: index_select(): functions with out=... arguments don't support automatic differentiation, but one of the arguments requires grad." For the fix, see https://blog.csdn.net/XiaoGShou/article/details/125253471

(4) You may hit:

cv2.error: OpenCV(4.5.2)  :-1 error: (-5:Bad argument) in function 'rectangle'
> Overload resolution failed:
> - Can't parse 'pt1'. Sequence item with index 0 has a wrong type
> - Can't parse 'pt1'. Sequence item with index 0 has a wrong type
······

For the fix, see https://stackoverflow.com/questions/67921192/5bad-argument-in-function-rectangle-cant-parse-pt1-sequence-item-wit
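
This error typically appears when the corner points passed to cv2.rectangle contain floats (for example box coordinates computed from network outputs), and a common remedy is to cast them to plain Python ints first. A minimal sketch of that cast, assuming this is the cause in your case; the helper name `to_int_point` is hypothetical and cv2 itself is not imported here.

```python
def to_int_point(x, y):
    """Convert box coordinates (float, NumPy scalar, ...) to a tuple of ints."""
    return int(round(float(x))), int(round(float(y)))


# Example box corners as floats, as they might come out of a detector.
pt1 = to_int_point(12.7, 30.2)
pt2 = to_int_point(99.4, 150.0)
# pt1 and pt2 can now be passed as the corner points of cv2.rectangle.
```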

(5) You may hit "AttributeError: 'NoneType' object has no attribute 'text'". For the fix, see https://blog.csdn.net/qq_55535816/article/details/121456901 and change the failing code as described in method 2 of that post.
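
This AttributeError is raised when `find()` on an XML element returns None because the requested tag is missing, and `.text` is then read from that None. The sketch below shows one defensive way to read such a tag. It is not necessarily the approach of the linked post; the helper name `child_text` and the choice of default values are assumptions.

```python
import xml.etree.ElementTree as ET


def child_text(element, tag, default=""):
    """Return the stripped text of a child tag, or default if it is missing."""
    child = element.find(tag)
    if child is None or child.text is None:
        return default
    return child.text.strip()


# An <object> entry that lacks the optional <difficult> tag.
obj = ET.fromstring("<object><name>cat</name></object>")
name = child_text(obj, "name")
difficult = int(child_text(obj, "difficult", default="0"))
```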

(6) You may hit R = [obj for obj in recs[imagename] if obj['name'] == classname] KeyError: '1000'. For the fix, see https://github.com/amdegroot/ssd.pytorch/issues/482 (delete annotations_cache/annots.pkl).
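
The cached annots.pkl was built from an earlier version of the dataset, so it has to be deleted whenever the test set changes. A minimal sketch for doing that from Python; the location of the annotations_cache folder depends on your setup, so it is passed in as a parameter, and the function name is hypothetical.

```python
from pathlib import Path


def remove_annotation_cache(cache_dir):
    """Delete annots.pkl in cache_dir if present; return True if it was deleted."""
    cache_file = Path(cache_dir) / "annots.pkl"
    if cache_file.is_file():
        cache_file.unlink()
        return True
    return False
```

After the file is removed, the next run of eval.py rebuilds the cache from the current annotations.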

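3. Note: the test-time version of ssd.py does not run successfully during training. To alternate between training and testing, use the matching code for each phase. Both versions are collected here, so you only need to adjust the contents of `ssd.py` (the code below is set up for training; the test-time parts are commented out):

(For training, comment out the test-time code and uncomment the training code; for testing, do the reverse.)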

# Around line 51
        # Training-time code
        if phase == 'test':
            self.softmax = nn.Softmax(dim=-1)
            # ORIGINAL IMPLEMENTATION DEPRECATED
            # self.detect = Detect(num_classes, 0, 200, 0.01, 0.45)
            self.detect = Detect()  # corrected implementation
            # my comments
            # this 'Detect' Function is not compatible with new Pytorch version,
            # generates error 'Legacy autograd function with non-static forward
            # method is deprecated. Please use new-style autograd function with
            # static forward method.'
            # Correction is implemented by passing above arguments directly to
            # command .apply() at the forward method below.
        """
        # Test-time code
        if phase == 'test':
            self.softmax = nn.Softmax(dim=-1)
            self.detect = Detect(num_classes, 0, 200, 0.25, 0.45)
        """


# Around line 120
        # Training-time code
        if self.phase == "test":
            # ORIGINAL LINE IS DEPRECATED
            # output = self.detect(
            # corrected implementation:
            output = self.detect.apply(self.num_classes, 0, 200, 0.01, 0.45,
                # loc preds
                loc.view(loc.size(0), -1, 4),
                # conf preds
                self.softmax(conf.view(conf.size(0), -1, self.num_classes)),
                # default boxes
                self.priors.type(type(x.data))
            )
        """
        # Test-time code
        if self.phase == "test":
            output = self.detect.forward(
                loc.view(loc.size(0), -1, 4),                   # loc preds
                self.softmax(conf.view(conf.size(0), -1,
                             self.num_classes)),                # conf preds
                self.priors.type(type(x.data))                  # default boxes
            )
        """