pytorch 中使用 torch.nn.functional.interpolate实现插值和上采样

最新推荐文章于 2025-01-10 13:23:15 发布

weixin_41956906

最新推荐文章于 2025-01-10 13:23:15 发布

阅读量1.2w

点赞数 4

分类专栏：深度学习文章标签： pytorch 深度学习 python

原文链接：https://www.cnblogs.com/wanghui-garcia/p/11399034.html

版权

深度学习专栏收录该内容

2 篇文章

订阅专栏

pytorch 中使用 torch.nn.functional实现插值和上采样

interpolate的用法
interpolate的参数说明
注意
栗子

interpolate的用法

torch.nn.functional.interpolate(input, size=None, scale_factor=None, mode='nearest', align_corners=None)

输入要进行上下采样的feature（input），根据给定的size或scale_factor参数来对输入进行下/上采样

支持目前的temporal(1D, 如向量数据), spatial(2D, 如jpg、png等图像数据)和volumetric(3D, 如点云数据)类型的采样数据作为输入，输入数据的格式为minibatch x channels x [optional depth] x [optional height] x width，具体为：

对于一个temporal输入，期待着3D张量的输入，即minibatch x channels x width
对于一个空间spatial输入，期待着4D张量的输入，即minibatch x channels x height x width
对于体积volumetric输入，则期待着5D张量的输入，即minibatch x channels x depth x height x width

interpolate的参数说明

input (Tensor) – 输入张量
size (int or Tuple[int] or Tuple[int, int] or Tuple[int, int, int]) – 输出大小
scale_factor (float or Tuple[float]) – 指定输出为输入的多少倍数。如果输入为tuple，其也要制定为tuple类型
mode (str) – 可使用的上采样算法，有’nearest’, ‘linear’, ‘bilinear’, ‘bicubic’ , ‘trilinear’和’area’. 默认使用’nearest’
align_corners (bool, optional) –
几何上，我们认为输入和输出的像素是正方形，而不是点。如果设置为True，则输入和输出张量由其角像素的中心点对齐，从而保留角像素处的值。如果设置为False，则输入和输出张量由它们的角像素的角点对齐，插值使用边界外值的边值填充;当scale_factor保持不变时，使该操作独立于输入大小。仅当使用的算法为’linear’, ‘bilinear’, 'bilinear’or 'trilinear’时可以使用。默认设置为False

注意

使用mode='bicubic’时，可能会导致overshoot问题，即它可以为图像生成负值或大于255的值。如果你想在显示图像时减少overshoot问题，可以显式地调用result.clamp(min=0,max=255)。

When using the CUDA backend, this operation may induce nondeterministic behaviour in be backward that is not easily switched off. Please see the notes on Reproducibility for background.

栗子

上采样

import torch
from torch import nn
import torch.nn.functional as F
input = torch.arange(1, 5, dtype=torch.float32).view(1, 1, 2, 2)
print('input:', input)
x = F.interpolate(input, scale_factor=2, mode='nearest')
print('x:', x)

input:
tensor([[[[1., 2.],
          [3., 4.]]]])
x:
tensor([[[[1., 1., 2., 2.],
          [1., 1., 2., 2.],
          [3., 3., 4., 4.],
          [3., 3., 4., 4.]]]])