快速上手yolov8项目

§ 简介

YOLOv8是由Ultralytics公司开发的最新一代目标检测模型,它是YOLO(You Only Look Once)系列的延续,旨在提供实时、高精度的目标检测能力。YOLOv8在保持高速度的同时,也注重精度的提升,适用于各种物体检测任务。
在这里插入图片描述

YOLOv8系列提供多种模型,每种模型都专门用于计算机视觉中的特定任务,如物体检测、实例分割、姿态/关键点检测、定向物体检测和分类等。
在这里插入图片描述
yolov5和yolov8为同一作者,yolov5在yolov8上作了集成,可以在yolov8上运行yolov5模型。

§ 简单上手

YOLOv8 环境安装参考教程:【手把手带你实战YOLOv8-入门篇】YOLOv8 环境安装

pip源码安装:

cmd激活yolov8环境,cd到存放代码的文件夹,然后输入命令:

pip install -e .

查看虚拟环境中安装的包

pip list

在这里插入图片描述
使用:

yolo predict model=yolov8n.pt source=ultralytics/assets/bus.jpg

在这里插入图片描述

命令解释:

yolo task=detect model=predict model=./yolov8n.pt source="./ultralytics/assets/bus.jpg"
  • yolo:启动命令
  • task=detect:指定了要执行的任务类型,这里是 detect,意味着进行目标检测。
  • model=predict:模型的作用,预测、训练、验证……
  • model=./yolov8n.pt:使用什么模型
  • source=“./ultralytics/assets/bus.jpg”:指定输入图像的路径,指向 ultralytics/assets/bus.jpg 文件。

§ 数据集构建

数据准备

图片类型数据:无需额外处理,直接可以进行标注
视频类型数据:进行抽帧处理,导出为图片

抽帧处理

import cv2
import matplotlib.pyplot as plt

video = cv2.VideoCapture('1.avi')
num = 0
step = 44 # 间隔帧数
while True:
    ret, frame = video.read()
    if not ret:
        break
    num += 1
    if num % step == 0:
        cv2.imwrite("./1/" + "" + str(num) + ".jpg", frame)

在这里插入图片描述

1、./是当前目录(可省略)
2、…/是父级目录
3、/是根目录

pip install labelimg

打开labelimg进行标注

快捷键:
w:标框
a:上一张图片
d:下一张图片

标图网站

makesense

在线标注,加载预训练模型辅助标注

参考教程
在这里插入图片描述

寻找公开数据集

roboflow 公开数据集
地址:
https://public.roboflow.com/object-detection
https://universe.roboflow.com/

§ 模型训练

训练前准备

数据集准备:

images:存放图片

  • trgin:训练集图片
  • val:验证集图片

labels:存放标签

  • train:训练集标签文件,要与训练集图片名称-一对应
  • val:验证集标签文件,要与验证集图片名称–对应

在这里插入图片描述
配置yaml文件

# Ultralytics YOLO 🚀, AGPL-3.0 license
# COCO8 dataset (first 8 images from COCO train2017) by Ultralytics
# Documentation: https://docs.ultralytics.com/datasets/detect/coco8/
# Example usage: yolo train data=coco8.yaml
# parent
# ├── ultralytics
# └── datasets
#     └── coco8  ← downloads here (1 MB)

# Train/val/test sets as 1) dir: path/to/imgs, 2) file: path/to/imgs.txt, or 3) list: [path/to/imgs1, path/to/imgs2, ..]
path: ../datasets/bvn # dataset root dir
train: images/train # train images (relative to 'path') 4 images
val: images/val # val images (relative to 'path') 4 images
test: # test images (optional)

# Classes
names:
  0: dong
  1: fa

在这里插入图片描述

模型训练

以python文件运行:

from ultralytics import YOLO

model = YOLO('yolov8n.pt')

model.train(data='yolo-bvn.yaml',workers=0,epochs=50,batch=8)

在这里插入图片描述

以命令行方式运行:

yolo task=detect mode=train model=./yolov8n.pt data="yolo-bvn.yaml” workers=1 epochs=50 batch=16
  • yolo: 这是执行命令的程序或脚本的名称,通常是一个可执行文件或脚本,用于运行 YOLO 模型。

  • task=detect: 这个参数指定了任务类型。在这里,detect 表示进行目标检测。

  • mode=train: 这个参数指定了模式。train 表示训练模式,即命令将用于训练模型而不是进行推理。

  • model=./yolov8n.pt: 这个参数指定了模型的路径。在这里,./yolov8n.pt 表示模型文件位于当前目录下,文件名为 yolov8n.pt。

  • data=“yolo-bvn.yaml”: 这个参数指定了数据配置文件的路径。yolo-bvn.yaml 是一个 YAML 格式的文件,包含了训练数据的路径、类别标签、训练参数等信息。

  • workers=1: 这个参数指定了数据加载时使用的子进程数量。在这里,1 表示使用一个子进程来加载数据。

  • epochs=50: 这个参数指定了训练过程中要遍历整个训练集的次数。在这里,50 表示模型将被训练 50 个周期。

  • batch=16: 这个参数指定了每个训练批次中的样本数量。在这里,16 表示每次迭代将使用 16 个样本进行训练。

综上所述,这个命令将开始一个训练过程,使用名为 yolov8n.pt 的模型,配置文件 yolo-bvn.yaml,训练 50 个周期,每个批次 16 个样本,并且使用一个子进程来加载数据。

第三种方式:
复制yaml文件:

yolo copy-cfg

配置default_copy.yaml文件

# Ultralytics YOLO 🚀, AGPL-3.0 license
# Default training settings and hyperparameters for medium-augmentation COCO training

task: detect # (str) YOLO task, i.e. detect, segment, classify, pose
mode: train # (str) YOLO mode, i.e. train, val, predict, export, track, benchmark

# Train settings -------------------------------------------------------------------------------------------------------
model: yolov8n.pt # (str, optional) path to model file, i.e. yolov8n.pt, yolov8n.yaml
data: yolo-bvn.yaml # (str, optional) path to data file, i.e. coco8.yaml
epochs: 50 # (int) number of epochs to train for
time: # (float, optional) number of hours to train for, overrides epochs if supplied
patience: 50 # (int) epochs to wait for no observable improvement for early stopping of training
batch: 16 # (int) number of images per batch (-1 for AutoBatch)
imgsz: 640 # (int | list) input images size as int for train and val modes, or list[h,w] for predict and export modes
save: True # (bool) save train checkpoints and predict results
save_period: -1 # (int) Save checkpoint every x epochs (disabled if < 1)
cache: False # (bool) True/ram, disk or False. Use cache for data loading
device: # (int | str | list, optional) device to run on, i.e. cuda device=0 or device=0,1,2,3 or device=cpu
workers: 0 # (int) number of worker threads for data loading (per RANK if DDP)
project: # (str, optional) project name
name: # (str, optional) experiment name, results saved to 'project/name' directory
exist_ok: False # (bool) whether to overwrite existing experiment
pretrained: True # (bool | str) whether to use a pretrained model (bool) or a model to load weights from (str)
optimizer: auto # (str) optimizer to use, choices=[SGD, Adam, Adamax, AdamW, NAdam, RAdam, RMSProp, auto]
verbose: True # (bool) whether to print verbose output
seed: 0 # (int) random seed for reproducibility
deterministic: True # (bool) whether to enable deterministic mode
single_cls: False # (bool) train multi-class data as single-class
rect: False # (bool) rectangular training if mode='train' or rectangular validation if mode='val'
cos_lr: False # (bool) use cosine learning rate scheduler
close_mosaic: 10 # (int) disable mosaic augmentation for final epochs (0 to disable)
resume: False # (bool) resume training from last checkpoint
amp: True # (bool) Automatic Mixed Precision (AMP) training, choices=[True, False], True runs AMP check
fraction: 1.0 # (float) dataset fraction to train on (default is 1.0, all images in train set)
profile: False # (bool) profile ONNX and TensorRT speeds during training for loggers
freeze: None # (int | list, optional) freeze first n layers, or freeze list of layer indices during training
multi_scale: False # (bool) Whether to use multiscale during training
# Segmentation
overlap_mask: True # (bool) masks should overlap during training (segment train only)
mask_ratio: 4 # (int) mask downsample ratio (segment train only)
# Classification
dropout: 0.0 # (float) use dropout regularization (classify train only)

# Val/Test settings ----------------------------------------------------------------------------------------------------
val: True # (bool) validate/test during training
split: val # (str) dataset split to use for validation, i.e. 'val', 'test' or 'train'
save_json: False # (bool) save results to JSON file
save_hybrid: False # (bool) save hybrid version of labels (labels + additional predictions)
conf: # (float, optional) object confidence threshold for detection (default 0.25 predict, 0.001 val)
iou: 0.7 # (float) intersection over union (IoU) threshold for NMS
max_det: 300 # (int) maximum number of detections per image
half: False # (bool) use half precision (FP16)
dnn: False # (bool) use OpenCV DNN for ONNX inference
plots: True # (bool) save plots and images during train/val

# Predict settings -----------------------------------------------------------------------------------------------------
source: # (str, optional) source directory for images or videos
vid_stride: 1 # (int) video frame-rate stride
stream_buffer: False # (bool) buffer all streaming frames (True) or return the most recent frame (False)
visualize: False # (bool) visualize model features
augment: False # (bool) apply image augmentation to prediction sources
agnostic_nms: False # (bool) class-agnostic NMS
classes: # (int | list[int], optional) filter results by class, i.e. classes=0, or classes=[0,2,3]
retina_masks: False # (bool) use high-resolution segmentation masks
embed: # (list[int], optional) return feature vectors/embeddings from given layers

# Visualize settings ---------------------------------------------------------------------------------------------------
show: False # (bool) show predicted images and videos if environment allows
save_frames: False # (bool) save predicted individual video frames
save_txt: False # (bool) save results as .txt file
save_conf: False # (bool) save results with confidence scores
save_crop: False # (bool) save cropped images with results
show_labels: True # (bool) show prediction labels, i.e. 'person'
show_conf: True # (bool) show prediction confidence, i.e. '0.99'
show_boxes: True # (bool) show prediction boxes
line_width: # (int, optional) line width of the bounding boxes. Scaled to image size if None.

# Export settings ------------------------------------------------------------------------------------------------------
format: torchscript # (str) format to export to, choices at https://docs.ultralytics.com/modes/export/#export-formats
keras: False # (bool) use Kera=s
optimize: False # (bool) TorchScript: optimize for mobile
int8: False # (bool) CoreML/TF INT8 quantization
dynamic: False # (bool) ONNX/TF/TensorRT: dynamic axes
simplify: False # (bool) ONNX: simplify model using `onnxslim`
opset: # (int, optional) ONNX: opset version
workspace: 4 # (int) TensorRT: workspace size (GB)
nms: False # (bool) CoreML: add NMS

# Hyperparameters ------------------------------------------------------------------------------------------------------
lr0: 0.01 # (float) initial learning rate (i.e. SGD=1E-2, Adam=1E-3)
lrf: 0.01 # (float) final learning rate (lr0 * lrf)
momentum: 0.937 # (float) SGD momentum/Adam beta1
weight_decay: 0.0005 # (float) optimizer weight decay 5e-4
warmup_epochs: 3.0 # (float) warmup epochs (fractions ok)
warmup_momentum: 0.8 # (float) warmup initial momentum
warmup_bias_lr: 0.1 # (float) warmup initial bias lr
box: 7.5 # (float) box loss gain
cls: 0.5 # (float) cls loss gain (scale with pixels)
dfl: 1.5 # (float) dfl loss gain
pose: 12.0 # (float) pose loss gain
kobj: 1.0 # (float) keypoint obj loss gain
label_smoothing: 0.0 # (float) label smoothing (fraction)
nbs: 64 # (int) nominal batch size
hsv_h: 0.015 # (float) image HSV-Hue augmentation (fraction)
hsv_s: 0.7 # (float) image HSV-Saturation augmentation (fraction)
hsv_v: 0.4 # (float) image HSV-Value augmentation (fraction)
degrees: 0.0 # (float) image rotation (+/- deg)
translate: 0.1 # (float) image translation (+/- fraction)
scale: 0.5 # (float) image scale (+/- gain)
shear: 0.0 # (float) image shear (+/- deg)
perspective: 0.0 # (float) image perspective (+/- fraction), range 0-0.001
flipud: 0.0 # (float) image flip up-down (probability)
fliplr: 0.5 # (float) image flip left-right (probability)
bgr: 0.0 # (float) image channel BGR (probability)
mosaic: 1.0 # (float) image mosaic (probability)
mixup: 0.0 # (float) image mixup (probability)
copy_paste: 0.0 # (float) segment copy-paste (probability)
auto_augment: randaugment # (str) auto augmentation policy for classification (randaugment, autoaugment, augmix)
erasing: 0.4 # (float) probability of random erasing during classification training (0-0.9), 0 means no erasing, must be less than 1.0.
crop_fraction: 1.0 # (float) image crop fraction for classification (0.1-1), 1.0 means no crop, must be greater than 0.

# Custom config.yaml ---------------------------------------------------------------------------------------------------
cfg: # (str, optional) for overriding defaults.yaml

# Tracker settings ------------------------------------------------------------------------------------------------------
tracker: botsort.yaml # (str) tracker type, choices=[botsort.yaml, bytetrack.yaml]

控制台输入:

yolo cfg=default_copy.yaml

开始训练

训练结果

训练结果位置:
在这里插入图片描述
在这里插入图片描述
best.pt:最好的模型
last.pt:最后一个模型

模型效果查看:

yolo detect predict model=runs/detect/train/weights/best.pt source=./BvN.mp4 show=True

数据集描述文件
数据地址从datasets目录里开始写起,且就放在根目录下,会避免很多坑

调整数据集目录后再次训练:
删除 ~/AppData/Roaming/Ultralytics文件夹下的settings.yam!

可使用everything搜索

运行成功了,开心😆

参考

ultralytics——GitHub地址
【手把手带你实战YOLOv8-入门篇】YOLOv8 环境安装——哔哩哔哩

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

kaixin_啊啊

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值