Hand Keypoint Detection Based on YOLOv8-pose (3) - Real-Time Hand Keypoint Detection

paradoxjun

Last modified: 2024-08-17 13:59:12

Categories: Object Detection, Gesture Recognition. Tags: YOLO, object detection, computer vision, artificial intelligence, deep learning

First published: 2024-08-16 23:57:20

Article link: https://blog.csdn.net/qq_40387714/article/details/141270112

Preface

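A YOLOv8-m model first detects hands in the image. Each detection box is then enlarged, and YOLOv8-s-pose runs keypoint detection on the enlarged region, which gives real-time hand keypoint detection.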

Result:

1. Expanding the detection box

Reference: Hand Keypoint Detection Based on YOLOv8-pose (2) - Model Training, Result Analysis, and Hyperparameter Optimization

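The hand region needs to be enlarged by 2/3. The box-expansion function is shown below; the default in its signature is scale=0.1, so call it with scale=2/3: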

def expand_bbox(xyxy, img_width, img_height, scale=0.1):
    # Compute the box width, height, and center point
    width = xyxy[2] - xyxy[0]
    height = xyxy[3] - xyxy[1]
    center_x = xyxy[0] + width / 2
    center_y = xyxy[1] + height / 2

    # Grow the width and height by `scale` (10% with the default of 0.1)
    new_width = width * (1 + scale)
    new_height = height * (1 + scale)

    # Compute the new corners and clamp them so the box stays inside the image.
    # A 2-pixel margin is kept on the left, top, and right edges; the bottom
    # edge is clamped to img_height itself.
    new_x1 = max(2, int(center_x - new_width / 2))
    new_y1 = max(2, int(center_y - new_height / 2))
    new_x2 = min(int(img_width) - 2, int(center_x + new_width / 2))
    new_y2 = min(int(img_height), int(center_y + new_height / 2))

    return new_x1, new_y1, new_x2, new_y2
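When a frame contains several hands, the same expansion can be applied to all boxes at once. The following is only a sketch of that idea, not the author's code: the name `expand_bboxes` and the `margin` parameter are hypothetical, the default `scale=2/3` follows the value recommended above, and the margin is applied symmetrically on all four sides, which differs slightly from the function above.

```python
import numpy as np


def expand_bboxes(xyxy, img_width, img_height, scale=2 / 3, margin=2):
    """Grow each (x1, y1, x2, y2) box about its center by `scale` (hypothetical helper)."""
    boxes = np.asarray(xyxy, dtype=np.float64).reshape(-1, 4)

    # Box centers and the enlarged half-sizes
    center_x = (boxes[:, 0] + boxes[:, 2]) / 2
    center_y = (boxes[:, 1] + boxes[:, 3]) / 2
    half_w = (boxes[:, 2] - boxes[:, 0]) * (1 + scale) / 2
    half_h = (boxes[:, 3] - boxes[:, 1]) * (1 + scale) / 2

    out = np.stack(
        [center_x - half_w, center_y - half_h, center_x + half_w, center_y + half_h],
        axis=1,
    )
    # Truncate toward zero, as int() does in the scalar version
    out = np.trunc(out).astype(int)

    # Assumption: keep the same margin on every side of the image
    out[:, [0, 2]] = np.clip(out[:, [0, 2]], margin, img_width - margin)
    out[:, [1, 3]] = np.clip(out[:, [1, 3]], margin, img_height - margin)
    return out


if __name__ == "__main__":
    # A 100 x 160 box centered at (150, 180) in a 1280 x 720 frame
    print(expand_bboxes([[100, 100, 200, 260]], 1280, 720))
```

With scale=2/3, the 100 x 160 box in the example grows to roughly 167 x 267 pixels, giving corners (66, 46, 233, 313); boxes near the image border are clipped rather than shifted, so they can end up smaller than the nominal enlargement.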

2. Detect the hand first, then the hand keypoints

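The key code is below. The results that follow also show how tolerant the pipeline is to a smaller crop, because I mistakenly wrote scale as 1/3 instead of 2/3: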

    # Resize the frame (resize_image is a helper not shown here) and read its size
    frame, _ = resize_image(frame, 720)
    img_height, img_width, _ = frame.shape

    # Stage 1: detect hands on the whole frame.
    # Note: Ultralytics documents NumPy inputs as BGR, so this conversion may be
    # unnecessary; it is kept as in the original.
    img = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
    hand_all = det_shou(img)[0]

    for i, bbox in enumerate(hand_all.boxes.xyxy):
        x1, y1, x2, y2 = list(map(int, bbox))
        x11, y11, x22, y22 = expand_bbox(bbox, img_width, img_height, scale=1 / 3)
        conf = hand_all.boxes.conf[i]
        cls = hand_all.boxes.cls[i]
        label = f'{hand_all.names[int(cls)]} {float(conf):.2f}'

        # Draw the expanded bounding box and its label
        cv2.rectangle(frame, (x11, y11), (x22, y22), (0, 255, 0), 2)
        cv2.putText(frame, label, (x11, y11 - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 2)

        # Stage 2: run the pose model on the expanded crop
        image_shou = frame[y11:y22, x11:x22]
        shou_all = pose_shou(image_shou)[0].cpu().numpy()

        if len(shou_all.boxes.conf) > 0:
            # Flatten the first detection's 21 (x, y) keypoints into a list of 42 ints
            kpts = [list(map(int, shou_all.keypoints.xy[0].reshape(1, 42)[0].tolist()))]

            image_shou = draw_bboxes_and_keypoints(image_shou, shou_all.boxes.xyxy, shou_all.boxes.conf,
                                                   shou_all.boxes.cls,
                                                   kpts=kpts, cat_order=_connections, line_color=line_color)

            # Paste the annotated crop back into the frame
            frame[y11:y22, x11:x22] = image_shou

    cv2.imshow('Frame', frame)
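The code above draws the keypoints on the crop and pastes the crop back, so the keypoint coordinates themselves stay relative to the crop. If later steps need them in full-frame coordinates, the crop's top-left corner (x11, y11) can be added back. A minimal sketch, assuming the keypoints arrive as (x, y) pairs; the helper name `keypoints_to_frame` is hypothetical and not part of the original code:

```python
import numpy as np


def keypoints_to_frame(kpts_xy, x_offset, y_offset):
    """Shift crop-relative (x, y) keypoints into full-frame coordinates (hypothetical helper)."""
    # Accept either an (N, 2) array or a flat [x0, y0, x1, y1, ...] list
    kpts = np.asarray(kpts_xy, dtype=np.float32).reshape(-1, 2)
    # The crop starts at (x_offset, y_offset) in the frame, so add that corner back
    return kpts + np.array([x_offset, y_offset], dtype=np.float32)


if __name__ == "__main__":
    # Two keypoints in crop coordinates, with the crop starting at (100, 50)
    crop_kpts = [10, 20, 30, 40]
    print(keypoints_to_frame(crop_kpts, 100, 50))
```

In the loop above this would be called as `keypoints_to_frame(shou_all.keypoints.xy[0], x11, y11)`, yielding one row per keypoint.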

In the results below, green boxes are the detection boxes obtained by expanding the YOLOv8 bbox;

red boxes are the YOLOv8-pose bbox.