Datawhale Zero-Basics Introduction to CV, Task 2: Data Reading and Data Augmentation

ccclllyyyy

Category: Datawhale team learning

Article link: https://blog.csdn.net/qq_38167814/article/details/106306692

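Learning objectives

Learn how to read images in Python and PyTorch.
Learn data augmentation methods and how to read the competition data with PyTorch.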

Image reading

Common libraries:
Pillow (PIL)
OpenCV
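
To make the difference between the two libraries concrete, here is a minimal sketch that reads the same file with both. The file is a small synthetic image written to a temporary directory (an assumption for illustration, not part of the competition data), and the OpenCV part is skipped if `cv2` is not installed.

```python
import os
import tempfile

import numpy as np
from PIL import Image

# Write a tiny synthetic RGB image so the example does not depend on any dataset files.
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, "sample.png")
pixels = np.zeros((32, 64, 3), dtype=np.uint8)  # height 32, width 64
pixels[..., 0] = 255                            # pure red in RGB order
Image.fromarray(pixels).save(sample_path)

# Pillow: returns an Image object; size is (width, height), channels are RGB.
pil_img = Image.open(sample_path).convert("RGB")
pil_array = np.array(pil_img)                   # shape (height, width, 3)

# OpenCV: imread returns a NumPy array directly, with channels in BGR order,
# so convert to RGB before mixing it with Pillow or torchvision code.
try:
    import cv2
    bgr = cv2.imread(sample_path)
    cv_array = cv2.cvtColor(bgr, cv2.COLOR_BGR2RGB)
except ImportError:
    cv_array = None

print(pil_img.size, pil_array.shape)
```

The practical points are that Pillow reports size as (width, height) while the array shape is (height, width, channels), and that OpenCV images need a BGR-to-RGB conversion before they match Pillow output.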

Reading data with PyTorch

import os, sys, glob, shutil, json
import cv2

from PIL import Image
import numpy as np

import torch
from torch.utils.data.dataset import Dataset
import torchvision.transforms as transforms

class SVHNDataset(Dataset):
    def __init__(self, img_path, img_label, transform=None):
        self.img_path = img_path
        self.img_label = img_label
        # transform may be None, in which case images are returned unchanged
        self.transform = transform

    def __getitem__(self, index):
        img = Image.open(self.img_path[index]).convert('RGB')

        if self.transform is not None:
            img = self.transform(img)

        # In the original SVHN dataset, class 10 stands for the digit 0
        lbl = np.array(self.img_label[index], dtype=int)
        # Pad with 10 up to a fixed length of 5 characters
        lbl = list(lbl) + (5 - len(lbl)) * [10]

        return img, torch.from_numpy(np.array(lbl[:5]))

    def __len__(self):
        return len(self.img_path)

train_path = glob.glob('../input/train/*.png')
train_path.sort()
with open('../input/train.json') as f:
    train_json = json.load(f)
# Assumes the JSON keys appear in the same order as the sorted image paths
train_label = [train_json[x]['label'] for x in train_json]

data = SVHNDataset(train_path, train_label,
          transforms.Compose([
              # Resize to a fixed size
              transforms.Resize((64, 128)),

              # Random color jitter
              transforms.ColorJitter(0.2, 0.2, 0.2),

              # Random rotation
              transforms.RandomRotation(5),

              # Convert the image to a PyTorch tensor
              # transforms.ToTensor(),

              # Normalize the image pixels
              # transforms.Normalize([0.485,0.456,0.406],[0.229,0.224,0.225])
            ]))
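
The label handling inside `__getitem__` is easy to miss, so here is a small standalone sketch of it. The helper name `pad_label` is hypothetical; it mirrors the padding and truncation done in the class, assuming 10 is used as the "no digit" filler and 5 is the maximum number of characters.

```python
def pad_label(digits, max_len=5, pad_value=10):
    """Pad or truncate a digit sequence to a fixed length.

    Mirrors the logic in SVHNDataset.__getitem__: short labels are filled
    with pad_value, long labels are cut to max_len.
    """
    digits = list(digits)[:max_len]
    return digits + [pad_value] * (max_len - len(digits))


short = pad_label([1, 9])             # two digits, three fillers
exact = pad_label([4, 2, 0, 7, 3])    # already five digits
long_ = pad_label([2, 0, 1, 7, 3, 5]) # six digits, last one dropped

print(short, exact, long_)
```

Giving every sample a label of the same length is what allows the labels to be stacked into a single batch tensor later on.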

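Image augmentation

Purpose: increase the number of training samples.

Common data augmentation methods:
Color space, scale space, and sample space.
For image classification, augmentation generally leaves the label unchanged; for object detection, geometric augmentation also changes the object coordinates; for image segmentation, it also changes the per-pixel labels.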
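
As an illustration of the detection case, the sketch below flips an image horizontally and remaps a bounding box to match. The function name `hflip_with_box` and the `(x_min, y_min, x_max, y_max)` pixel-edge box convention are assumptions made for this example, not something defined by the competition.

```python
import numpy as np


def hflip_with_box(image, box):
    """Flip an H x W x C image left-to-right and remap a bounding box.

    box is (x_min, y_min, x_max, y_max) in pixel-edge coordinates,
    so the box covers columns x_min .. x_max - 1.
    """
    width = image.shape[1]
    flipped = image[:, ::-1].copy()
    x_min, y_min, x_max, y_max = box
    # A point at x maps to width - x, so the left and right edges swap roles.
    return flipped, (width - x_max, y_min, width - x_min, y_max)


# A 4 x 10 image with a bright patch in rows 1-2, columns 2-4.
image = np.zeros((4, 10, 3), dtype=np.uint8)
image[1:3, 2:5] = 255
box = (2, 1, 5, 3)

flipped, new_box = hflip_with_box(image, box)
print(new_box)
```

For classification the flipped image would simply keep its class label, whereas here the box has to move with the pixels or it would no longer cover the object.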

Data augmentation libraries:
torchvision
imgaug
albumentations

Common methods:
[Figure: common augmentation methods]
