PyTorch官方Tutorials Transforms

最新推荐文章于 2023-10-22 21:25:08 发布

Mendax_chen

最新推荐文章于 2023-10-22 21:25:08 发布

阅读量147

点赞数

分类专栏： # PyTorch

本文链接：https://blog.csdn.net/qq_43209726/article/details/118190697

版权

PyTorch 专栏收录该内容

9 篇文章 2 订阅

订阅专栏

PyTorch官方Tutorials

跟着PyTorch官方Tutorials码的，便于理解自己稍有改动代码并添加注释，IDE用的jupyter notebook
链接：Tutorials-Transforms

TRANSFORMS

Data does not always come in its final processed form that is required for training machine learning algorithms. We use transforms to perform some manipulation of the data and make it suitable for training.

All TorchVision datasets have two parameters -transform to modify the features and target_transform to modify the labels - that accept callables containing the transformation logic. The torchvision.transforms module offers several commonly-used transforms out of the box.

The FashionMNIST features are in PIL Image format, and the labels are integers. For training, we need the features as normalized tensors, and the labels as one-hot encoded tensors. To make these transformations, we use ToTensor and Lambda.

import torch
from torchvision import datasets
from torchvision.transforms import ToTensor, Lambda

ds = datasets.FashionMNIST(
    root="data",
    train=True,
    download=True,
    transform=ToTensor(),
    target_transform=Lambda(lambda y: torch.zeros(10, dtype=torch.float).scatter_(0, torch.tensor(y), value=1))
)

ToTensor()

ToTensorconverts a PIL image or NumPy ndarray into a FloatTensor. and scales the image’s pixel intensity values in the range [0., 1.]

Lambda Transforms

注意这里不是python的lambda函数

Lambda transforms apply any user-defined lambda function. Here, we define a function to turn the integer into a one-hot encoded tensor. It first creates a zero tensor of size 10 (the number of labels in our dataset) and calls scatter_ which assigns a value=1 on the index as given by the label y.

    target_transform = Lambda(lambda y: torch.zeros(
        10, dtype=torch.float).scatter_(dim=0, index=torch.tensor(y), value=1))

总结

一下就是PyTorch提供transform这个操作，可以对数据集进行某些处理，官方给的例子里用到了ToTensor()将ndarray或PIL图像转为FloatTensor，并将像素密度值映射到0,1之间。Lambda()则是将整型label转为one-hot编码，这里用到了scatter_()函数。