2021SC@SDUSC山东大学软件学院软件工程应用与实践--YOLOV5代码分析（五）general.py-3

最新推荐文章于 2024-04-28 20:44:49 发布

xjunjin

最新推荐文章于 2024-04-28 20:44:49 发布

阅读量1.6k

点赞数

分类专栏： yolov5 文章标签： pytorch python

本文链接：https://blog.csdn.net/xjunjin/article/details/120828464

版权

深度学习模型优化训练技巧数据预处理学习率调度

关键词由CSDN通过智能技术生成

yolov5 专栏收录该内容

18 篇文章 25 订阅

订阅专栏

2021SC@SDUSC

labels_to_class_weights函数

labels_to_image_weights函数

url2file函数

def url2file(url):
    # Convert URL to filename, i.e. https://url.com/file.txt?auth -> file.txt
    url = str(Path(url)).replace(':/', '://')  # Pathlib turns :// -> :/
    file = Path(urllib.parse.unquote(url)).name.split('?')[0]  # '%2F' to '/', split https://url.com/file.txt?auth
    return file

url：目标网址

讲网址转换为文件名称，例如将 https://url.com/file.txt?auth转换为file.txt

首先将://都替换为:/，再对网址进行解码以及切割，从而获取文件名称

download函数

def download(url, dir='.', unzip=True, delete=True, curl=False, threads=1):
    # Multi-threaded file download and unzip function, used in data.yaml for autodownload
    def download_one(url, dir):
        # Download 1 file
        f = dir / Path(url).name  # filename
        if Path(url).is_file():  # exists in current path
            Path(url).rename(f)  # move to dir
        elif not f.exists():
            print(f'Downloading {url} to {f}...')
            if curl:
                os.system(f"curl -L '{url}' -o '{f}' --retry 9 -C -")  # curl download, retry and resume on fail
            else:
                torch.hub.download_url_to_file(url, f, progress=True)  # torch download
        if unzip and f.suffix in ('.zip', '.gz'):
            print(f'Unzipping {f}...')
            if f.suffix == '.zip':
                s = f'unzip -qo {f} -d {dir}'  # unzip -quiet -overwrite
            elif f.suffix == '.gz':
                s = f'tar xfz {f} --directory {f.parent}'  # unzip
            if delete:  # delete zip file after unzip
                s += f' && rm {f}'
            os.system(s)

    dir = Path(dir)
    dir.mkdir(parents=True, exist_ok=True)  # make directory
    if threads > 1:
        pool = ThreadPool(threads)
        pool.imap(lambda x: download_one(*x), zip(url, repeat(dir)))  # multi-threaded
        pool.close()
        pool.join()
    else:
        for u in [url] if isinstance(url, (str, Path)) else url:
            download_one(u, dir)

url：下载目标的网址

dir：下载文件的根目录

unzip：下载之后自动解压

delete：解压之后删除下载的压缩包

curl：true时用cUrl来进行下载

threads：线程数

该函数实现了多线程下载并解压文件

函数内部又定义了下载一个文件的函数，参数为url和dir。

将url转换为路径+文件名称，当该文件已经存在时，将该文件转移到给定的路径下；不存在则开始下载，如果curl为true则用curl进行下载，失败时重试，否则用pytorch函数进行下载。

如果unzip为true并且文件的后缀为zip或gz，则进行解压。若delete为true，删除压缩包。

os.system是os模块中常用的函数，可以以字符串形式执行命令行操作。

下载单个文件的外部：

将dir转换为路径并创建一个空文件夹。当线程数大于1时，创建一个线程池，每个线程都执行下载一个文件的操作，否则就一个for循环依次下载单个文件。

check_dataset函数

def check_dataset(data, autodownload=True):
    # Download and/or unzip dataset if not found locally
    # Usage: https://github.com/ultralytics/yolov5/releases/download/v1.0/coco128_with_yaml.zip

    # Download (optional)
    extract_dir = ''
    if isinstance(data, (str, Path)) and str(data).endswith('.zip'):  # i.e. gs://bucket/dir/coco128.zip
        download(data, dir='../datasets', unzip=True, delete=False, curl=False, threads=1)
        data = next((Path('../datasets') / Path(data).stem).rglob('*.yaml'))
        extract_dir, autodownload = data.parent, False

    # Read yaml (optional)
    if isinstance(data, (str, Path)):
        with open(data, errors='ignore') as f:
            data = yaml.safe_load(f)  # dictionary

    # Parse yaml
    path = extract_dir or Path(data.get('path') or '')  # optional 'path' default to '.'
    for k in 'train', 'val', 'test':
        if data.get(k):  # prepend path
            data[k] = str(path / data[k]) if isinstance(data[k], str) else [str(path / x) for x in data[k]]

    assert 'nc' in data, "Dataset 'nc' key missing."
    if 'names' not in data:
        data['names'] = [f'class{i}' for i in range(data['nc'])]  # assign class names if missing
    train, val, test, s = [data.get(x) for x in ('train', 'val', 'test', 'download')]
    if val:
        val = [Path(x).resolve() for x in (val if isinstance(val, list) else [val])]  # val path
        if not all(x.exists() for x in val):
            print('\nWARNING: Dataset not found, nonexistent paths: %s' % [str(x) for x in val if not x.exists()])
            if s and autodownload:  # download script
                if s.startswith('http') and s.endswith('.zip'):  # URL
                    f = Path(s).name  # filename
                    print(f'Downloading {s} ...')
                    torch.hub.download_url_to_file(s, f)
                    root = path.parent if 'path' in data else '..'  # unzip directory i.e. '../'
                    Path(root).mkdir(parents=True, exist_ok=True)  # create root
                    r = os.system(f'unzip -q {f} -d {root} && rm {f}')  # unzip
                elif s.startswith('bash '):  # bash script
                    print(f'Running {s} ...')
                    r = os.system(s)
                else:  # python script
                    r = exec(s, {'yaml': data})  # return None
                print('Dataset autodownload %s\n' % ('success' if r in (0, None) else 'failure'))  # print result
            else:
                raise Exception('Dataset not found.')

    return data  # dictionary

data：下载数据网址

autodownload：自动下载

检查数据集是否存在，不存在下载并解压数据集

当data是字符串或路径时，并且是以zip结尾，下载该数据集，并设置extract_dir和autodownload

当data是字符串或路径，打开该文件，并读取文件

接下来解析yaml文件，返回data

clean_str函数

def clean_str(s):
    # Cleans a string by replacing special characters with underscore _
    return re.sub(pattern="[|@#!¡·$€%&()=?¿^*;:,¨´><+]", repl="_", string=s)

s：需要处理的字符串

将字符串中的特殊符号替换为_

one_cycle函数

def one_cycle(y1=0.0, y2=1.0, steps=100):
    # lambda function for sinusoidal ramp from y1 to y2 https://arxiv.org/pdf/1812.01187.pdf
    return lambda x: ((1 - math.cos(x * math.pi / steps)) / 2) * (y2 - y1) + y1

返回一个lambda函数，该函数用于定义lr_scheduler，在train.py中进行调用

# Scheduler
    if opt.linear_lr:
        lf = lambda x: (1 - x / (epochs - 1)) * (1.0 - hyp['lrf']) + hyp['lrf']  # linear
    else:
        lf = one_cycle(1, hyp['lrf'], epochs)  # cosine 1->hyp['lrf']
    scheduler = lr_scheduler.LambdaLR(optimizer, lr_lambda=lf)  # plot_lr_scheduler(optimizer, scheduler, epochs)

这里是在定义学习率调度器，用以调整学习率。

该函数实现了从y1到y2以cos曲线进行平滑地过度

colorstr函数

def colorstr(*input):
    # Colors a string https://en.wikipedia.org/wiki/ANSI_escape_code, i.e.  colorstr('blue', 'hello world')
    *args, string = input if len(input) > 1 else ('blue', 'bold', input[0])  # color arguments, string
    colors = {'black': '\033[30m',  # basic colors
              'red': '\033[31m',
              'green': '\033[32m',
              'yellow': '\033[33m',
              'blue': '\033[34m',
              'magenta': '\033[35m',
              'cyan': '\033[36m',
              'white': '\033[37m',
              'bright_black': '\033[90m',  # bright colors
              'bright_red': '\033[91m',
              'bright_green': '\033[92m',
              'bright_yellow': '\033[93m',
              'bright_blue': '\033[94m',
              'bright_magenta': '\033[95m',
              'bright_cyan': '\033[96m',
              'bright_white': '\033[97m',
              'end': '\033[0m',  # misc
              'bold': '\033[1m',
              'underline': '\033[4m'}
    return ''.join(colors[x] for x in args) + f'{string}' + colors['end']

为字符串着色，更好的美观性

labels_to_class_weights函数

def labels_to_class_weights(labels, nc=80):
    # Get class weights (inverse frequency) from training labels
    if labels[0] is None:  # no labels loaded
        return torch.Tensor()

    labels = np.concatenate(labels, 0)  # labels.shape = (866643, 5) for COCO
    classes = labels[:, 0].astype(np.int)  # labels = [class xywh]
    weights = np.bincount(classes, minlength=nc)  # occurrences per class

    # Prepend gridpoint count (for uCE training)
    # gpi = ((320 / 32 * np.array([1, 2, 4])) ** 2 * 3).sum()  # gridpoints per image
    # weights = np.hstack([gpi * len(labels)  - weights.sum() * 9, weights * 9]) ** 0.5  # prepend gridpoints to start

    weights[weights == 0] = 1  # replace empty bins with 1
    weights = 1 / weights  # number of targets per class
    weights /= weights.sum()  # normalize
    return torch.from_numpy(weights)

labels：需要获取权重的标签

nc：类别的数量

每一个label都是一个五维的向量，包含了[class x y w h]，分别为类别，中心点的横、纵坐标，预测框的宽、高，因此我们需要将class提取出来，并转换为int类型。接下来对labels里的每个类别进行计数，初始化权重，将空的类别的权重置为1，再对其取倒数，也就是反频，再返回。

eg：

classes为[2 1 3 4 4 3]，weights为[0 1 1 2 2]，即将每个类别的数量作为初始化权重，接下来将所有的0置换为1，weights为[1 1 1 2 2]，再取倒数[1 1 1 0.5 0.5]，归一化权重并返回。

labels_to_image_weights函数

def labels_to_image_weights(labels, nc=80, class_weights=np.ones(80)):
    # Produces image weights based on class_weights and image contents
    class_counts = np.array([np.bincount(x[:, 0].astype(np.int), minlength=nc) for x in labels])
    image_weights = (class_weights.reshape(1, nc) * class_counts).sum(1)
    # index = random.choices(range(n), weights=image_weights, k=1)  # weight image sample
    return image_weights

根据类别权重以及图像内容对图像创建一个权重

xyxy2xywh函数

def xyxy2xywh(x):
    # Convert nx4 boxes from [x1, y1, x2, y2] to [x, y, w, h] where xy1=top-left, xy2=bottom-right
    y = x.clone() if isinstance(x, torch.Tensor) else np.copy(x)
    y[:, 0] = (x[:, 0] + x[:, 2]) / 2  # x center
    y[:, 1] = (x[:, 1] + x[:, 3]) / 2  # y center
    y[:, 2] = x[:, 2] - x[:, 0]  # width
    y[:, 3] = x[:, 3] - x[:, 1]  # height
    return y

将预测框的左上角和右下角的表示转换为中心点与宽高的表示

中心点坐标为左上角与右下角的中点，相加除以2即可，宽高为两个点的横、纵坐标之差

如两个点的坐标为[0,3],[2,1]，那么中心点坐标就是[1,2]，宽为2，高为2

xywh2xyxy函数

def xywh2xyxy(x):
    # Convert nx4 boxes from [x, y, w, h] to [x1, y1, x2, y2] where xy1=top-left, xy2=bottom-right
    y = x.clone() if isinstance(x, torch.Tensor) else np.copy(x)
    y[:, 0] = x[:, 0] - x[:, 2] / 2  # top left x
    y[:, 1] = x[:, 1] - x[:, 3] / 2  # top left y
    y[:, 2] = x[:, 0] + x[:, 2] / 2  # bottom right x
    y[:, 3] = x[:, 1] + x[:, 3] / 2  # bottom right y
    return y

将预测框的xywh形式转换为xyxy形式。左上角坐标为中心点坐标的横坐标减去宽的一半，纵坐标加上高的一半，右下角为加上宽的一半和减去高的一半

clip_coords函数

def clip_coords(boxes, shape):
    # Clip bounding xyxy bounding boxes to image shape (height, width)
    if isinstance(boxes, torch.Tensor):  # faster individually
        boxes[:, 0].clamp_(0, shape[1])  # x1
        boxes[:, 1].clamp_(0, shape[0])  # y1
        boxes[:, 2].clamp_(0, shape[1])  # x2
        boxes[:, 3].clamp_(0, shape[0])  # y2
    else:  # np.array (faster grouped)
        boxes[:, [0, 2]] = boxes[:, [0, 2]].clip(0, shape[1])  # x1, x2
        boxes[:, [1, 3]] = boxes[:, [1, 3]].clip(0, shape[0])  # y1, y2

boxes：预测宽，内容为左上角坐标与右下角坐标，即2*2的矩阵

shape：图片的大小

将包围盒的大小限定在图片大小之内，防止预测框超出图片

shape[0]是图片高，shape[1]是图片的宽

boxes中0 1 2 3分别是x1 y1 x2 y2的值

xywhn2xyxy函数

def xywhn2xyxy(x, w=640, h=640, padw=0, padh=0):
    # Convert nx4 boxes from [x, y, w, h] normalized to [x1, y1, x2, y2] where xy1=top-left, xy2=bottom-right
    y = x.clone() if isinstance(x, torch.Tensor) else np.copy(x)
    y[:, 0] = w * (x[:, 0] - x[:, 2] / 2) + padw  # top left x
    y[:, 1] = h * (x[:, 1] - x[:, 3] / 2) + padh  # top left y
    y[:, 2] = w * (x[:, 0] + x[:, 2] / 2) + padw  # bottom right x
    y[:, 3] = h * (x[:, 1] + x[:, 3] / 2) + padh  # bottom right y
    return y

x：xywh的值

w：相对宽的值

h“相对高的值

padw：相对点的横坐标

padh：相对点的纵坐标

与上面的函数内容差不多，只不过计算的预测框是相对偏移量的。

w和h是在宽和高的方向上相对的倍数，而padw和padh为相对的左上角的坐标。

计算出左上角和右下角的坐标后再加上偏移量。

xyxy2xywhn函数

def xyxy2xywhn(x, w=640, h=640, clip=False, eps=0.0):
    # Convert nx4 boxes from [x1, y1, x2, y2] to [x, y, w, h] normalized where xy1=top-left, xy2=bottom-right
    if clip:
        clip_coords(x, (h - eps, w - eps))  # warning: inplace clip
    y = x.clone() if isinstance(x, torch.Tensor) else np.copy(x)
    y[:, 0] = ((x[:, 0] + x[:, 2]) / 2) / w  # x center
    y[:, 1] = ((x[:, 1] + x[:, 3]) / 2) / h  # y center
    y[:, 2] = (x[:, 2] - x[:, 0]) / w  # width
    y[:, 3] = (x[:, 3] - x[:, 1]) / h  # height
    return y

x：坐标值

w：相对宽的值

h：相对高的值

clip：为true时将坐标值限定在w-eps和h-eps之内

eps：偏差

将左上角和右下角坐标转换为中心点坐标和宽高的形式。

计算方法同上，计算完之后再将其缩放。

这么做的目的是使计算出的预测框的值尽量小，较小的值更容易收敛，可以提高精确度。

xyn2xy函数

def xyn2xy(x, w=640, h=640, padw=0, padh=0):
    # Convert normalized segments into pixel segments, shape (n,2)
    y = x.clone() if isinstance(x, torch.Tensor) else np.copy(x)
    y[:, 0] = w * x[:, 0] + padw  # top left x
    y[:, 1] = h * x[:, 1] + padh  # top left y
    return y

将缩小后的值放大至真实值

segment2box函数

def segment2box(segment, width=640, height=640):
    # Convert 1 segment label to 1 box label, applying inside-image constraint, i.e. (xy1, xy2, ...) to (xyxy)
    x, y = segment.T  # segment xy
    inside = (x >= 0) & (y >= 0) & (x <= width) & (y <= height)
    x, y, = x[inside], y[inside]
    return np.array([x.min(), y.min(), x.max(), y.max()]) if any(x) else np.zeros((1, 4))  # xyxy

segment：输入数据，形如((x1,y1),(x2,y2),...)

将一段的xy的值转换为对应的框的形式。

首先将segment转置取得x和y的值，然后取在限定范围的值，转换成左上角右下角坐标的形式

segments2boxes函数

def segments2boxes(segments):
    # Convert segment labels to box labels, i.e. (cls, xy1, xy2, ...) to (cls, xywh)
    boxes = []
    for s in segments:
        x, y = s.T  # segment xy
        boxes.append([x.min(), y.min(), x.max(), y.max()])  # cls, xyxy
    return xyxy2xywh(np.array(boxes))  # cls, xywh

将segment转换为xywh的形式。首先将segment转换成xyxy的形式，做法同上，再将xyxy转换成xywh的形式

resample_segments函数

def resample_segments(segments, n=1000):
    # Up-sample an (n,2) segment
    for i, s in enumerate(segments):
        x = np.linspace(0, len(s) - 1, n)
        xp = np.arange(len(s))
        segments[i] = np.concatenate([np.interp(x, xp, s[:, i]) for i in range(2)]).reshape(2, -1).T  # segment xy
    return segments

对segment重新进行采样

scale_coords函数

def scale_coords(img1_shape, coords, img0_shape, ratio_pad=None):
    # Rescale coords (xyxy) from img1_shape to img0_shape
    if ratio_pad is None:  # calculate from img0_shape
        gain = min(img1_shape[0] / img0_shape[0], img1_shape[1] / img0_shape[1])  # gain  = old / new
        pad = (img1_shape[1] - img0_shape[1] * gain) / 2, (img1_shape[0] - img0_shape[0] * gain) / 2  # wh padding
    else:
        gain = ratio_pad[0][0]
        pad = ratio_pad[1]

    coords[:, [0, 2]] -= pad[0]  # x padding
    coords[:, [1, 3]] -= pad[1]  # y padding
    coords[:, :4] /= gain
    clip_coords(coords, img0_shape)
    return coords

把img1的xy的值重新缩放至img0的大小

如果没有指定ratio_pad，根据img1和img0继续计算，否则直接根据ratio_pad的值来计算

先将横纵坐标减去偏移量，再统一除以gain获取新的值，最后再限定在img0的大小内。

xjunjin

关注

0
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
2021SC@SDUSC山东大学软件学院软件工程应用与实践--YOLOV5代码分析（五）general.py-3

2021SC@SDUSC目录url2file函数download函数check_dataset函数clean_str函数one_cycle函数colorstr函数labels_to_class_weights函数labels_to_image_weights函数xyxy2xywh函数xywh2xyxy函数clip_coords函数xywhn2xyxy函数xyxy2xywhn函数xyn2xy函数segment2box函数segments2
复制链接

扫一扫