CNN中——PILImage和tensor转换

最新推荐文章于 2023-10-13 17:06:25 发布

giganticpower

最新推荐文章于 2023-10-13 17:06:25 发布

阅读量3.4k

点赞数 6

文章标签：深度学习 python pytorch 神经网络

本文链接：https://blog.csdn.net/giganticpower/article/details/108195052

版权

PILImage和tensor转换

在我们使用图片训练时，或者使用图片进行预测时，我们都需要过一遍net。
然而图片形式是PILImage的格式。
net需要的时tensor格式的图片。并且形式为（channel, height, width）
我们需要进行转换。
转换的方法有两种。
一、使用numpy->tensor
二、使用PILImage->tensor
使用到两个依赖包：
torch.from_numpy,
torchvision.transforms.

numpy->tensor

PILImage->numpy->tensor

image = Image.open(image_path)  # 读取图片
image = np.array(image, dtype=np.float32)  # PILImage->numpy 输出(h,w,c)
image /= 255.0  # 网络输入需要归一化
image = np.transpos	e(image, (2, 0, 1))  # np下维度转换使用transpose
image = torch.from_numpy(image)  # numpy->tensor
print(image.shape)

使用cv读：

image = cv2.imread(image_path)  # 使用cv读直接为numpy格式
image = image.astype(np.float32)
image /= 255.0
image = np.transpose(image, (2, 0, 1))
image = torch.from_numpy(image)

输出

torch.Size([3, 575, 551])  # (c,h,w)

PILImage->tensor

image = Image.open(image_path)  # 读取图片
image = transforms.ToTensor()(image)  # PILImage->tensor
# image = torch.Tensor.permute(image, (0, 1, 2))  # tensor下维度转换
print(image.shape)

使用torchvision下transforms依赖包，一步到位。
在第二行中，集成处理了以下步骤：

1，img.tobytes() 将图片转化成内存中的存储格式
2，torch.BytesStorage.frombuffer(img.tobytes() ) 将字节以流的形式输入，转化成一维的张量
3，对张量进行reshape
4，对张量进行permute（2,0,1）
5，将当前张量的每个元素除以255
6，输出张量

第三行备注的那句，是在tensor下如果想要转换维度使用的代码。跟之前numpy下不同。

若要将tensor->PILImage
使用相反的：

image = transforms.ToPILImage()(image)

结束~

giganticpower

关注

6
点赞
踩
15

收藏

觉得还不错? 一键收藏
3
评论
CNN中——PILImage和tensor转换

PILImage和tensor转换在我们使用图片训练时，或者使用图片进行预测时，我们都需要过一遍net。然而图片形式是PILImage的格式。net需要的时tensor格式的图片。并且形式为（channel, width, height）我们需要进行转换。转换的方法有两种。一、使用numpy->tensor二、使用PILImage->tensor使用到两个依赖包：torch.from_numpy,torchvision.transforms.numpy->tensor
复制链接

扫一扫