基于深度学习的障碍物检测系统：YOLOv5/v6/v7/v8/v10模型实现与UI界面集成

深度学习实战项目

于 2024-10-03 14:20:10 发布

阅读量302

点赞数 14

分类专栏： YOLOv10/v8/v7/v6/v5实战项目文章标签：深度学习 YOLO ui 人工智能目标检测

本文链接：https://blog.csdn.net/m0_52343631/article/details/142691747

版权

YOLOv10/v8/v7/v6/v5实战项目专栏收录该内容

60 篇文章 0 订阅

订阅专栏

摘要

本文将详细介绍如何基于YOLO系列（YOLOv5、YOLOv6、YOLOv7、YOLOv8和YOLOv10）构建一个障碍物检测系统。我们将涵盖从数据集准备、模型训练到用户界面（UI）开发的每一个步骤。最后，提供完整的代码和data.yaml文件，以便读者能够独立复现该项目。

1. 引言

随着计算机视觉技术的发展，障碍物检测在自动驾驶、机器人导航、安防监控等领域得到了广泛应用。YOLO（You Only Look Once）是一种流行的实时目标检测系统，因其高效性和准确性而受到青睐。本项目将介绍如何利用YOLO系列模型，结合Python和OpenCV，构建一个完整的障碍物检测系统。

2. 环境准备

2.1 软件环境

在开始之前，请确保您的计算机上已安装以下软件：

Python 3.x
PyTorch
OpenCV
Tkinter（用于UI界面）
YOLOv5/YOLOv6/YOLOv7/YOLOv8/YOLOv10

可以通过以下命令安装所需的库：

pip install torch torchvision torchaudio opencv-python Pillow PyQt5

2.2 硬件要求

建议使用具有GPU加速的计算机以提高训练和推理速度。NVIDIA显卡和CUDA驱动是必需的。

3. 数据集准备

为了训练YOLO模型，您需要一个包含障碍物的图像数据集。常用的数据集包括 COCO 数据集、Pascal VOC 数据集以及自定义数据集。

3.1 自定义数据集

若要创建自定义数据集，您需要：

图像：收集多种场景下的障碍物图像。
标注：使用LabelImg等工具为图像中的障碍物创建标注。标注格式应为YOLO格式（即每个图像对应一个.txt文件，文件中包含每个障碍物的类别和坐标）。

YOLO格式示例

每个标注文件内容如下：

<class_id> <x_center> <y_center> <width> <height>

class_id：障碍物类别（从0开始）
x_center、y_center：障碍物中心相对图像宽高的比例
width、height：障碍物宽高相对图像宽高的比例

3.2 创建 `data.yaml` 文件

data.yaml 文件用于定义数据集的结构和参数。下面是一个 data.yaml 文件示例：

train: ../data/images/train
val: ../data/images/val

nc: 2  # 类别数
names: ['obstacle1', 'obstacle2']  # 类别名称

在此示例中，train和val指定了训练和验证图像的路径，nc表示类别数，names列出了所有类别的名称。

4. 模型训练

接下来，我们将使用YOLO模型进行训练。这里以YOLOv5为例，其他版本的训练过程类似。

4.1 克隆YOLOv5仓库

首先，克隆YOLOv5仓库并安装依赖：

git clone https://github.com/ultralytics/yolov5
cd yolov5
pip install -r requirements.txt

4.2 开始训练

使用以下命令开始训练：

python train.py --img 640 --batch 16 --epochs 50 --data data.yaml --weights yolov5s.pt

--img：输入图像大小
--batch：每批训练的样本数
--epochs：训练轮数
--data：指定数据集配置文件
--weights：预训练模型权重

5. 模型推理

在模型训练完成后，可以使用以下命令进行推理测试：

python detect.py --weights runs/train/exp/weights/best.pt --img 640 --source ../data/images/test

--weights：指定训练后保存的权重文件
--source：待检测的图像或视频源

6. UI界面开发

使用Python的Tkinter库创建一个简单的用户界面，以展示检测结果。

6.1 UI界面代码示例

以下是一个简单的Tkinter UI示例代码：

import tkinter as tk
from tkinter import filedialog
import cv2
from PIL import Image, ImageTk

class ObjectDetectionApp:
    def __init__(self, master):
        self.master = master
        master.title("Obstacle Detection System")

        self.label = tk.Label(master, text="Select an image for detection:")
        self.label.pack()

        self.select_button = tk.Button(master, text="Select Image", command=self.select_image)
        self.select_button.pack()

        self.canvas = tk.Canvas(master, width=640, height=480)
        self.canvas.pack()

    def select_image(self):
        file_path = filedialog.askopenfilename()
        self.detect_objects(file_path)

    def detect_objects(self, image_path):
        # 使用YOLO模型进行检测
        model = 'runs/train/exp/weights/best.pt'  # 替换为训练后的权重路径
        img = cv2.imread(image_path)
        results = self.run_yolo_detection(img, model)

        # 转换为PIL格式并在canvas上显示
        img_pil = Image.fromarray(results)
        img_tk = ImageTk.PhotoImage(img_pil)
        self.canvas.create_image(0, 0, anchor='nw', image=img_tk)
        self.canvas.image = img_tk

    def run_yolo_detection(self, img, model):
        # 这里调用YOLO模型进行检测
        # 返回处理后的图像
        return img  # 替换为实际检测结果

if __name__ == "__main__":
    root = tk.Tk()
    app = ObjectDetectionApp(root)
    root.mainloop()

6.2 代码说明

选择图像：用户通过按钮选择待检测的图像。
检测对象：调用YOLO模型进行检测，并将结果展示在界面上。

7. 完整代码示例

将以上所有代码片段整合，以下是一个完整的示例，包括训练、检测和UI部分。

# main.py
import tkinter as tk
from tkinter import filedialog
import cv2
from PIL import Image, ImageTk
import torch

class ObjectDetectionApp:
    def __init__(self, master):
        self.master = master
        master.title("Obstacle Detection System")

        self.label = tk.Label(master, text="Select an image for detection:")
        self.label.pack()

        self.select_button = tk.Button(master, text="Select Image", command=self.select_image)
        self.select_button.pack()

        self.canvas = tk.Canvas(master, width=640, height=480)
        self.canvas.pack()

    def select_image(self):
        file_path = filedialog.askopenfilename()
        self.detect_objects(file_path)

    def detect_objects(self, image_path):
        # 使用YOLOv5进行检测
        model = torch.hub.load('ultralytics/yolov5', 'custom', path='runs/train/exp/weights/best.pt')  # 替换为训练后的权重路径
        img = cv2.imread(image_path)
        
        # YOLO模型检测
        results = model(img)
        img_result = results.render()[0]  # 获取检测结果图像

        # 转换为PIL格式并在canvas上显示
        img_pil = Image.fromarray(img_result)
        img_tk = ImageTk.PhotoImage(img_pil)
        self.canvas.create_image(0, 0, anchor='nw', image=img_tk)
        self.canvas.image = img_tk

if __name__ == "__main__":
    root = tk.Tk()
    app = ObjectDetectionApp(root)
    root.mainloop()