Using the stride and padding Parameters of Conv2d

Author: 晓亮

Last modified: 2022-06-06 18:52:19

Category: PyTorch Learning. Tags: python, programming languages, deep learning, cnn, neural networks

First published: 2022-05-31 16:44:24

Article link: https://blog.csdn.net/m0_51816252/article/details/125068354

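This article uses worked examples to explain the stride and padding parameters of PyTorch's Conv2d module. It first introduces basic Conv2d usage, then shows in code how different stride and padding settings change the output, to help readers understand the convolution operation.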

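The most commonly used Conv2d parameters are in_channels, out_channels, kernel_size, stride, and padding. These five usually have to be set by hand. This article uses code to show how the stride and padding parameters work.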

Preface

The Convolution Layers section of torch.nn contains many modules, for example:
nn.Conv1d for 1-D data
nn.Conv2d for 2-D data such as images

Of these, Conv2d is the one used most often.

I. The official documentation for conv2d

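torch.nn.functional.conv2d(input: Tensor,
weight: Tensor,
bias: Optional[Tensor]=None,
stride: Union[_int, _size]=1,
padding: str="valid",
dilation: Union[_int, _size]=1,
groups: _int=1)

Note that the functional form is spelled conv2d in lowercase; Conv2d is the nn module. The signature above is the overload in which padding is given as a string. padding can also be an int or a tuple of ints, and "valid" means no padding, which matches the documented default of padding=0.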

II. Practice

1. Entering the data

With hand-written arrays as input, we can run conv2d from torch.nn.functional and see how convolution works.

The code is as follows:

import torch

# input data
input = torch.tensor([[1, 2, 0, 3, 1],
                      [0, 1, 2, 3, 1],
                      [1, 2, 1, 0, 0],
                      [5, 2, 3, 1, 1],
                      [2, 1, 0, 1, 1]])
# convolution kernel
kernal = torch.tensor([[1, 2, 1],
                       [0, 1, 0],
                       [2, 1, 0]])
# tensor shapes
print(input.shape)
print(kernal.shape)

Output:
torch.Size([5, 5])
torch.Size([3, 3])

conv2d expects its input to have the shape (N, C, H, W), where:
N is the batch size
C is the number of channels
H is the height of the input
W is the width of the input

So the data has to be reshaped first. PyTorch's torch.reshape() can do this.
The code is as follows:

import torch

input = torch.tensor([[1, 2, 0, 3, 1],
[0, 1, 2, 3, 1],
[1, 2, 1, 0, 0],
[5, 2, 3, 1, 1],
[2, 1, 0, 1, 1]])

kernal = torch.tensor([[1, 2, 1],
[0, 1, 0],
[2, 1, 0]])
input = torch.reshape(input, (1, 1, 5, 5))
kernal = torch.reshape(kernal, (1, 1, 3, 3))
print(input.shape)
print(kernal.shape)

Output:
torch.Size([1, 1, 5, 5])
torch.Size([1, 1, 3, 3])

The tensors now meet conv2d's input requirements.
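To see what the two extra leading dimensions mean without PyTorch, here is a minimal sketch using plain nested lists. The helper nested_shape is a hypothetical name introduced only for this illustration; it is not a PyTorch function.

```python
def nested_shape(data):
    """Return the shape of a regular (non-ragged) nested list as a tuple."""
    shape = []
    while isinstance(data, list):
        shape.append(len(data))
        data = data[0]
    return tuple(shape)


image = [[1, 2, 0],
         [0, 1, 2]]          # a single 2-D image: H=2, W=3

batched = [[image]]          # wrap twice: adds N=1 and C=1 in front

print(nested_shape(image))    # (2, 3)
print(nested_shape(batched))  # (1, 1, 2, 3)
```

Reshaping a (5, 5) tensor to (1, 1, 5, 5) does the same thing: the values are unchanged, and the tensor is described as a batch of one image with one channel.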

2. Practicing stride with conv2d

Now call the conv2d function. The code is as follows:

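import torch.nn.functional as F

# stride=1: the kernel moves 1 step horizontally and 1 step vertically
output = F.conv2d(input, kernal, stride=1)
print(output)

# stride=2: the kernel moves 2 steps horizontally and 2 steps vertically
output2 = F.conv2d(input, kernal, stride=2)
print(output2)

# stride=(1, 2) is ordered (height, width): 1 step vertically, 2 steps horizontally
output3 = F.conv2d(input, kernal, stride=(1, 2))
print(output3)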
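The sliding-window behaviour behind these three calls can be reproduced by hand. The following is a minimal pure-Python sketch, not PyTorch's implementation. The function name cross_correlate2d and the variables image and kernel are assumptions made for this illustration. Like F.conv2d, it multiplies the kernel with each window without flipping it.

```python
def cross_correlate2d(image, kernel, stride=(1, 1)):
    """Slide kernel over image (both 2-D lists); stride is (step_h, step_w)."""
    step_h, step_w = stride
    k_h, k_w = len(kernel), len(kernel[0])
    out_h = (len(image) - k_h) // step_h + 1
    out_w = (len(image[0]) - k_w) // step_w + 1
    result = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            top, left = i * step_h, j * step_w  # top-left corner of the window
            total = sum(image[top + a][left + b] * kernel[a][b]
                        for a in range(k_h) for b in range(k_w))
            row.append(total)
        result.append(row)
    return result


image = [[1, 2, 0, 3, 1],
         [0, 1, 2, 3, 1],
         [1, 2, 1, 0, 0],
         [5, 2, 3, 1, 1],
         [2, 1, 0, 1, 1]]
kernel = [[1, 2, 1],
          [0, 1, 0],
          [2, 1, 0]]

print(cross_correlate2d(image, kernel, (1, 1)))  # [[10, 12, 12], [18, 16, 16], [13, 9, 3]]
print(cross_correlate2d(image, kernel, (2, 2)))  # [[10, 12], [13, 3]]
print(cross_correlate2d(image, kernel, (1, 2)))  # [[10, 12], [18, 16], [13, 3]]
```

With stride (1, 2) the result keeps all three rows but only every second column, which shows that the first element of the tuple is the vertical step and the second is the horizontal step.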

3. Practicing padding with conv2d

The code is as follows:

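# padding=1: add one row at the top and bottom and one column at the left and right; the fill value is 0
output4 = F.conv2d(input, kernal, stride=1, padding=1)
print(output4)

# padding=(1, 2) is ordered (height, width): one row at the top and bottom, two columns at the left and right; the fill value is 0
output5 = F.conv2d(input, kernal, stride=1, padding=(1, 2))
print(output5)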
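Zero padding can also be sketched by hand: surround the input with zeros, then run the same stride-1 sliding window over it. The helpers zero_pad and cross_correlate2d below are hypothetical names for this illustration only, written in plain Python rather than PyTorch.

```python
def zero_pad(image, pad_h, pad_w):
    """Surround a 2-D list with pad_h rows and pad_w columns of zeros per side."""
    width = len(image[0]) + 2 * pad_w
    padded = [[0] * width for _ in range(pad_h)]
    padded += [[0] * pad_w + list(row) + [0] * pad_w for row in image]
    padded += [[0] * width for _ in range(pad_h)]
    return padded


def cross_correlate2d(image, kernel):
    """Stride-1 sliding-window sum of products over 2-D lists."""
    k_h, k_w = len(kernel), len(kernel[0])
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(k_h) for b in range(k_w))
             for j in range(len(image[0]) - k_w + 1)]
            for i in range(len(image) - k_h + 1)]


image = [[1, 2, 0, 3, 1],
         [0, 1, 2, 3, 1],
         [1, 2, 1, 0, 0],
         [5, 2, 3, 1, 1],
         [2, 1, 0, 1, 1]]
kernel = [[1, 2, 1],
          [0, 1, 0],
          [2, 1, 0]]

out4 = cross_correlate2d(zero_pad(image, 1, 1), kernel)  # like padding=1
out5 = cross_correlate2d(zero_pad(image, 1, 2), kernel)  # like padding=(1, 2)

print(len(out4), len(out4[0]))  # 5 5
print(out4[0])                  # [1, 3, 4, 10, 8]
print(len(out5), len(out5[0]))  # 5 7
print(out5[0])                  # [0, 1, 3, 4, 10, 8, 2]
```

Padding by one on every side keeps the 5x5 input size for a 3x3 kernel, while padding the width by two makes the output two columns wider than the input.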

4. Complete code and results

import torch
import torch.nn.functional as F
# input image
input = torch.tensor([[1, 2, 0, 3, 1],
[0, 1, 2, 3, 1],
[1, 2, 1, 0, 0],
[5, 2, 3, 1, 1],
[2, 1, 0, 1, 1]])
# convolution kernel
kernal = torch.tensor([[1, 2, 1],
[0, 1, 0],
[2, 1, 0]])

# reshape both tensors to the 4-D layout that conv2d expects

input = torch.reshape(input, (1, 1, 5, 5))  # (N, C, H, W)
kernal = torch.reshape(kernal, (1, 1, 3, 3))  # (out_channels, in_channels, kH, kW)

print(input.shape)
print(kernal.shape)

# stride exercise
output = F.conv2d(input, kernal, stride=1)
print(output)

output2 = F.conv2d(input, kernal, stride=2)
print(output2)

output3 = F.conv2d(input, kernal, stride=(1, 2))
print(output3)

# padding exercise
output4 = F.conv2d(input, kernal, stride=1, padding=1)
print(output4)

output5 = F.conv2d(input, kernal, stride=1, padding=(1, 2))
print(output5)

Output:
torch.Size([1, 1, 5, 5])
torch.Size([1, 1, 3, 3])
tensor([[[[10, 12, 12],
          [18, 16, 16],
          [13,  9,  3]]]])
tensor([[[[10, 12],
          [13,  3]]]])
tensor([[[[10, 12],
          [18, 16],
          [13,  3]]]])
tensor([[[[ 1,  3,  4, 10,  8],
          [ 5, 10, 12, 12,  6],
          [ 7, 18, 16, 16,  8],
          [11, 13,  9,  3,  4],
          [14, 13,  9,  7,  4]]]])
tensor([[[[ 0,  1,  3,  4, 10,  8,  2],
          [ 1,  5, 10, 12, 12,  6,  1],
          [ 0,  7, 18, 16, 16,  8,  3],
          [ 1, 11, 13,  9,  3,  4,  2],
          [ 5, 14, 13,  9,  7,  4,  1]]]])
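The shapes of the five results above follow the output-size formula given in the PyTorch conv2d documentation, applied separately to height and width. The sketch below checks it against each call. The function name conv_output_size is an assumption made for this illustration.

```python
def conv_output_size(size, kernel, stride=1, padding=0, dilation=1):
    """Output length along one dimension: floor((size + 2p - d*(k-1) - 1) / s) + 1."""
    return (size + 2 * padding - dilation * (kernel - 1) - 1) // stride + 1


# (stride_h, stride_w), (pad_h, pad_w) for output, output2, ..., output5
cases = [((1, 1), (0, 0)),
         ((2, 2), (0, 0)),
         ((1, 2), (0, 0)),
         ((1, 1), (1, 1)),
         ((1, 1), (1, 2))]

shapes = []
for (s_h, s_w), (p_h, p_w) in cases:
    h = conv_output_size(5, 3, stride=s_h, padding=p_h)
    w = conv_output_size(5, 3, stride=s_w, padding=p_w)
    shapes.append((h, w))

print(shapes)  # [(3, 3), (2, 2), (3, 2), (5, 5), (5, 7)]
```

Each pair matches the number of rows and columns in the corresponding tensor printed above.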