问题解决心路历程:yolov4 tesnorrt client 运行yolov5engine时预测输出不对

最新推荐文章于 2023-06-26 01:24:30 发布

滑稽的柴犬

最新推荐文章于 2023-06-26 01:24:30 发布

阅读量335

点赞数

分类专栏： python 神经网络机器学习文章标签： python 深度学习图像识别

本文链接：https://blog.csdn.net/qq_32545287/article/details/116602390

版权

神经网络同时被 3 个专栏收录

23 篇文章 0 订阅

订阅专栏

机器学习

23 篇文章 0 订阅

订阅专栏

python

16 篇文章 0 订阅

订阅专栏

经过同事提醒发现是预处理的问题。

yolov4

def preprocess(image,input_height,input_width):
    image = cv2.resize(image, ( input_width,input_height))
    image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    image = np.transpose(np.array(image, dtype=np.float32, order='C'), (2, 0, 1))
    image /= 255.0
    return image

yolov5

https://gitee.com/doge_ac_cn/yolov5/blob/master/utils/datasets.py

这里找的时候的思路是直接找train,py里数据集生成的函数

找到对应的dataloader，然后进文件找__next__函数，这里肯定有预处理操作。

最终把yolov5的预处理修改如下:

def  preprocess(img, new_shape=(640, 640), color=(128, 128, 128), auto=True, scaleFill=False, scaleup=True) :
    # Resize image to a 32-pixel-multiple rectangle https://github.com/ultralytics/yolov3/issues/232
    shape = img.shape[:2]  # current shape [height, width]
    if isinstance(new_shape, int):
        new_shape = (new_shape, new_shape)

    # Scale ratio (new / old)
    r = min(new_shape[0] / shape[0], new_shape[1] / shape[1])
    if not scaleup:  # only scale down, do not scale up (for better test mAP)
        r = min(r, 1.0)

    # Compute padding
    ratio = r, r  # width, height ratios
    new_unpad = int(round(shape[1] * r)), int(round(shape[0] * r))
    dw, dh = new_shape[1] - new_unpad[0], new_shape[0] - new_unpad[1]  # wh padding
    # if auto:  # minimum rectangle
    #     dw, dh = np.mod(dw, 32), np.mod(dh, 32)  # wh padding
    # elif scaleFill:  # stretch
    #     dw, dh = 0.0, 0.0
    #     new_unpad = (new_shape[1], new_shape[0])
    #     ratio = new_shape[1] / shape[1], new_shape[0] / shape[0]  # width, height ratios

    dw /= 2  # divide padding into 2 sides
    dh /= 2

    if shape[::-1] != new_unpad:  # resize
        img = cv2.resize(img, new_unpad, interpolation=cv2.INTER_LINEAR)
    top, bottom = int(round(dh - 0.1)), int(round(dh + 0.1))
    left, right = int(round(dw - 0.1)), int(round(dw + 0.1))
    img = cv2.copyMakeBorder(img, top, bottom, left, right, cv2.BORDER_CONSTANT, value=color)  # add border
    # Convert
    img = img[:, :, ::-1].transpose(2, 0, 1)  # BGR to RGB, to 3x416x416
    img = np.ascontiguousarray(img)

    img = np.array(img, dtype=np.float32)

    return img/255.0

yolov5 的预处理需要会对长宽中较短的一边补Padding。

比如如果图片是 640*360，就会上下各补140的padding填充，从而获得640*640的图片，而不是直接resize。

因此后处理也必须把预测目标的boundding box 位置进行更改。

先把xy各自移动长宽的的填充量dw,dh。

box_xy = ( (selected_bboxes_keep[idx, :2]) -[dw,dh] )

再把xywh都乘以缩放倍数。

box *= img_scale

后处理

def postprocess(buffer, image_width, image_height,dw,dh,padding_w,padding_h,conf_threshold=0.8, nms_threshold=0.5):
    detected_objects = []
    img_scale = [image_width / (INPUT_WIDTH-dw*2), image_height / (INPUT_HEIGHT -dh*2), image_width / (INPUT_WIDTH-dw*2), image_height / (INPUT_HEIGHT-dh*2)]
    num_bboxes = int(buffer[0, 0, 0, 0])

    if num_bboxes:
        bboxes = buffer[0, 1 : (num_bboxes * 6 + 1), 0, 0].reshape(-1, 6)
        # print(bboxes)
        labels = set(bboxes[:, 5].astype(int))
        for label in labels:
            selected_bboxes = bboxes[np.where((bboxes[:, 5] == label) & ((bboxes[:, 4] ) >= conf_threshold))]
            selected_bboxes_keep = selected_bboxes[nms(selected_bboxes[:, :4], selected_bboxes[:, 4] , nms_threshold)]
            for idx in range(selected_bboxes_keep.shape[0]):
                print (selected_bboxes_keep[idx, :2])
                box_xy = ( (selected_bboxes_keep[idx, :2]) \
                    -[dw,dh] ) 
                print(box_xy)
                box_wh = selected_bboxes_keep[idx, 2:4]
                score = selected_bboxes_keep[idx, 4] 
                # print(score)
                box_x1y1 = np.maximum([0,0],box_xy - (box_wh / 2))
                box_x2y2 = np.minimum(box_xy + (box_wh / 2), [INPUT_WIDTH, INPUT_HEIGHT]) 
                box = np.concatenate([box_x1y1, box_x2y2])
                box *= img_scale
                if box[0] == box[2]:
                    continue
                if box[1] == box[3]:
                    continue

                detected_objects.append(BoundingBox(label, score, box[0], box[2], box[1], box[3], image_height, image_width))
    return detected_objects

滑稽的柴犬

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
问题解决心路历程:yolov4 tesnorrt client 运行yolov5engine时预测输出不对

经过同事提醒发现是预处理的问题。yolov4def preprocess(image,input_height,input_width): image = cv2.resize(image, ( input_width,input_height)) image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB) image = np.transpose(np.array(image, dtype=np.float32, order=..
复制链接

扫一扫