Object Recognition on Raspberry Pi 5 with YOLO (v5, v8, v10, 11) and Computer Vision

Latest recommended article published on 2025-03-30 20:42:40

熊猫小账本App

Article link: https://blog.csdn.net/u013633921/article/details/144223296

0. Goal

Get the Raspberry Pi to recognize everyday objects around you, such as a computer, mouse, keyboard, cup, suitcase, backpack, bed, or chair.
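
The pretrained yolov8n.pt weights used in this post are trained on the COCO dataset, so the items above correspond to COCO labels ("computer" maps most closely to "laptop"). Here is a minimal sketch of narrowing detections down to just those items, assuming the detections have already been reduced to (label, confidence) pairs. The function name `filter_detections` and the 0.5 threshold are illustrative assumptions, not part of the original script.

```python
# COCO labels matching the items listed above ("computer" is closest to "laptop").
WANTED_LABELS = {
    "laptop", "mouse", "keyboard", "cup",
    "suitcase", "backpack", "bed", "chair",
}


def filter_detections(detections, wanted=WANTED_LABELS, min_confidence=0.5):
    """Keep (label, confidence) pairs whose label is wanted and confident enough."""
    return [
        (label, conf)
        for label, conf in detections
        if label in wanted and conf >= min_confidence
    ]


# Hypothetical sample data, only to show the shape of the input.
sample = [("cup", 0.91), ("person", 0.88), ("chair", 0.42), ("laptop", 0.77)]
print(filter_detections(sample))
```

Here "person" is dropped because it is not in the wanted set, and "chair" is dropped because its confidence is below the threshold.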

1. Hardware

Raspberry Pi 5 raspberrypi.com/products/raspberry-pi-5/
Official Raspberry Pi Camera Module 3 raspberrypi.com/products/camera-module-3/
Your own computer, Windows or Mac

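2. Prerequisites

Flash an operating system onto the Raspberry Pi. This post uses Bookworm, the latest release at the time of writing (December 2024).
Connect to the Pi over VNC, with an HDMI cable to a monitor, or through the official Raspberry Pi Connect service.

I have written a post about flashing an operating system onto the Raspberry Pi: blog.csdn.net/u013633921/article/details/121433186

There is also a post about VNC: blog.csdn.net/u013633921/article/details/129677105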

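3. Let's start!

Update the system first. The four screenshots below are self-explanatory; if anything is unclear, ask an AI assistant.

[4 screenshots: system update steps]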

The following command installs OpenCV and the infrastructure needed to run YOLO:

pip install ultralytics[export]

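It also pulls in a large number of other packages, so the install can fail partway through.
If it fails (you will see a wall of red text),
just run the command again; packages that are already installed are skipped.
Mine succeeded on the first try, hahaha (the whole process took about 2 hours 🤔)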
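
The "run it again if it fails" advice can also be automated. The sketch below is one possible approach, not something the original post does: a small hypothetical helper, `run_with_retries`, reruns a command until it exits with code 0 or a retry limit is hit. The limit of 3 attempts is an arbitrary assumption.

```python
import subprocess
import sys


def run_with_retries(command, max_attempts=3):
    """Run `command` until it exits with code 0 or the attempts run out.

    Returns the number of attempts used; raises RuntimeError if every attempt fails.
    """
    for attempt in range(1, max_attempts + 1):
        result = subprocess.run(command)
        if result.returncode == 0:
            return attempt
        print(f"Attempt {attempt} failed with exit code {result.returncode}",
              file=sys.stderr)
    raise RuntimeError(f"Command failed after {max_attempts} attempts: {command}")


# Example call (commented out because the install can take hours on a Pi):
# run_with_retries([sys.executable, "-m", "pip", "install", "ultralytics[export]"])
```

Because pip skips packages that are already installed, each retry only has to redo the part that failed.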

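After the install finishes, reboot the Raspberry Pi.
The Pi 5 has a physical power button: press it twice in a row to shut down, wait a moment, then press it once more to boot.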
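
After the reboot, it can be worth confirming that the modules the detection script imports are actually visible to Python. The sketch below uses only the standard library and checks for modules without importing them; the helper name `missing_packages` is an assumption for illustration.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level module names from `names` that Python cannot find."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Modules imported by the yolo.py script in this post.
REQUIRED = ["cv2", "picamera2", "ultralytics"]

missing = missing_packages(REQUIRED)
if missing:
    print("Missing:", ", ".join(missing))
else:
    print("All required modules found.")
```

If anything is reported missing, rerunning the install command is the first thing to try.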

4. Thonny

Switch Thonny to regular mode.
Close Thonny and open it again.

Create a file named yolo.py in Thonny:

import cv2
from picamera2 import Picamera2
from ultralytics import YOLO

# Set up the camera with Picam
picam2 = Picamera2()
picam2.preview_configuration.main.size = (1280, 1280)
picam2.preview_configuration.main.format = "RGB888"
picam2.preview_configuration.align()
picam2.configure("preview")
picam2.start()

# Load YOLOv8
model = YOLO("yolov8n.pt")

while True:
    # Capture a frame from the camera
    frame = picam2.capture_array()
    
    # Run YOLO model on the captured frame and store the results
    results = model(frame)
    
    # Output the visual detection data, we will draw this on our camera preview window
    annotated_frame = results[0].plot()
    
    # Get inference time (Ultralytics reports it in milliseconds)
    inference_time = results[0].speed['inference']
    fps = 1000 / inference_time  # Convert ms per frame to frames per second
    text = f'FPS: {fps:.1f}'

    # Define font and position
    font = cv2.FONT_HERSHEY_SIMPLEX
    text_size = cv2.getTextSize(text, font, 1, 2)[0]
    text_x = annotated_frame.shape[1] - text_size[0] - 10  # 10 pixels from the right
    text_y = text_size[1] + 10  # 10 pixels from the top

    # Draw the text on the annotated frame
    cv2.putText(annotated_frame, text, (text_x, text_y), font, 1, (255, 255, 255), 2, cv2.LINE_AA)

    # Display the resulting frame
    cv2.imshow("Camera", annotated_frame)

    # Exit the program if q is pressed (mask to the low byte for portability)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break

# Stop the camera and close all windows
picam2.stop()
cv2.destroyAllWindows()
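
The FPS value drawn in the loop comes from a single frame's inference time, so it can jump around from frame to frame, and it does not include capture or drawing time. The sketch below shows one way to steady the readout with an exponential moving average. The `FpsMeter` class and the `alpha` smoothing factor are assumptions for illustration, not part of the original script.

```python
class FpsMeter:
    """Smooth per-frame inference times (in milliseconds) into an FPS estimate."""

    def __init__(self, alpha=0.1):
        self.alpha = alpha    # weight given to the newest sample (0 < alpha <= 1)
        self.avg_ms = None    # running average; None until the first sample arrives

    def update(self, inference_ms):
        """Fold in one inference time and return the smoothed FPS."""
        if self.avg_ms is None:
            self.avg_ms = inference_ms
        else:
            self.avg_ms = self.alpha * inference_ms + (1 - self.alpha) * self.avg_ms
        # 1000 ms per second divided by ms per frame gives frames per second.
        return 1000.0 / self.avg_ms if self.avg_ms > 0 else 0.0
```

To use it, create `meter = FpsMeter()` before the loop and replace the FPS line with `fps = meter.update(results[0].speed['inference'])`. A smaller `alpha` gives a steadier but slower-reacting number.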