神经网络算子的张量维度对应关系

Aresiii

已于 2023-07-08 11:14:46 修改

阅读量918

点赞数

分类专栏：机器学习文章标签：神经网络深度学习机器学习

于 2023-03-31 15:51:24 首次发布

本文链接：https://blog.csdn.net/Aresiii/article/details/129882157

版权

机器学习专栏收录该内容

84 篇文章 7 订阅

订阅专栏

神经网络中的算子操作通常都是基于张量（Tensor）的运算，因此了解算子与张量之间的维度对应关系非常重要。以下是常见算子的张量维度对应关系：

一）线性层

输入张量的维度为(batch_size, input_dim)，权重张量的维度为(input_dim, output_dim)，偏置张量的维度为(output_dim,)，输出张量的维度为(batch_size, output_dim)。

以下是一个使用PyTorch实现线性层的示例代码：

import torch
import torch.nn as nn

# 定义线性层模型
class LinearLayer(nn.Module):
    def __init__(self, input_dim, output_dim):
        super(LinearLayer, self).__init__()
        self.linear = nn.Linear(input_dim, output_dim)

    def forward(self, x):
        out = self.linear(x)
        return out

# 创建输入张量
batch_size = 32
input_dim = 10
input_tensor = torch.randn(batch_size, input_dim)

# 创建线性层模型
output_dim = 5
linear_layer = LinearLayer(input_dim, output_dim)

# 执行推理
output_tensor = linear_layer(input_tensor)

# 打印输出张量的维度
print(output_tensor.size())  # 输出: torch.Size([32, 5])

在上述代码中，我们首先定义了一个LinearLayer类，该类继承自PyTorch的nn.Module类。在__init__方法中，我们使用nn.Linear定义了一个线性层，它的输入维度为input_dim，输出维度为output_dim。在forward方法中，我们将输入张量x传递给线性层，并返回输出张量。

然后，我们创建了一个输入张量input_tensor，其维度为(batch_size, input_dim)，其中batch_size表示批处理大小，input_dim表示输入维度。

接下来，我们实例化了LinearLayer类，传入输入维度input_dim和输出维度output_dim，得到线性层模型linear_layer。

最后，我们将输入张量input_tensor传递给线性层模型，并得到输出张量output_tensor。通过打印输出张量的维度，我们可以验证输出的维度为(batch_size, output_dim)。

二）卷积层

输入张量的维度为(batch_size, input_channels, height, width)，卷积核张量的维度为(output_channels, input_channels, kernel_height, kernel_width)，偏置张量的维度为(output_channels,)，输出张量的维度为(batch_size, output_channels, output_height, output_width)。

以下是一个使用PyTorch实现卷积层的示例代码：

import torch
import torch.nn as nn

# 定义卷积层模型
class ConvolutionalLayer(nn.Module):
    def __init__(self, input_channels, output_channels, kernel_size, stride=1, padding=0):
        super(ConvolutionalLayer, self).__init__()
        self.conv = nn.Conv2d(input_channels, output_channels, kernel_size, stride, padding)

    def forward(self, x):
        out = self.conv(x)
        return out

# 创建输入张量
batch_size = 32
input_channels = 3
height = 32
width = 32
input_tensor = torch.randn(batch_size, input_channels, height, width)

# 创建卷积层模型
output_channels = 16
kernel_size = 3
conv_layer = ConvolutionalLayer(input_channels, output_channels, kernel_size)

# 执行推理
output_tensor = conv_layer(input_tensor)

# 打印输出张量的维度
print(output_tensor.size())  # 输出: torch.Size([32, 16, 30, 30])

在上述代码中，我们定义了一个ConvolutionalLayer类，该类继承自PyTorch的nn.Module类。在__init__方法中，我们使用nn.Conv2d定义了一个二维卷积层，其中input_channels表示输入通道数，output_channels表示输出通道数，kernel_size表示卷积核大小，stride表示步长，padding表示填充大小。

在forward方法中，我们将输入张量x传递给卷积层，并返回输出张量。

然后，我们创建了一个输入张量input_tensor，其维度为(batch_size, input_channels, height, width)，其中batch_size表示批处理大小，input_channels表示输入通道数，height表示输入图像的高度，width表示输入图像的宽度。

接下来，我们实例化了ConvolutionalLayer类，传入输入通道数input_channels、输出通道数output_channels以及卷积核大小kernel_size，得到卷积层模型conv_layer。

最后，我们将输入张量input_tensor传递给卷积层模型，并得到输出张量output_tensor。通过打印输出张量的维度，我们可以验证输出的维度为(batch_size, output_channels, output_height, output_width)。

三）池化层

输入张量的维度为(batch_size, input_channels, height, width)，输出张量的维度为(batch_size, input_channels, output_height, output_width)，其中output_height和output_width分别是池化后的特征图高度和宽度。

以下是一个使用PyTorch实现池化层的示例代码：

import torch
import torch.nn as nn

# 定义池化层模型
class PoolingLayer(nn.Module):
    def __init__(self, kernel_size, stride=None, padding=0):
        super(PoolingLayer, self).__init__()
        self.pool = nn.MaxPool2d(kernel_size, stride, padding)

    def forward(self, x):
        out = self.pool(x)
        return out

# 创建输入张量
batch_size = 32
input_channels = 3
height = 32
width = 32
input_tensor = torch.randn(batch_size, input_channels, height, width)

# 创建池化层模型
kernel_size = 2
pool_layer = PoolingLayer(kernel_size)

# 执行推理
output_tensor = pool_layer(input_tensor)

# 打印输出张量的维度
print(output_tensor.size())  # 输出: torch.Size([32, 3, 16, 16])

在上述代码中，我们定义了一个PoolingLayer类，该类继承自PyTorch的nn.Module类。在__init__方法中，我们使用nn.MaxPool2d定义了一个最大池化层，其中kernel_size表示池化核大小，stride表示步长，padding表示填充大小。

在forward方法中，我们将输入张量x传递给池化层，并返回输出张量。

接下来，我们实例化了PoolingLayer类，传入池化核大小kernel_size，得到池化层模型pool_layer。

最后，我们将输入张量input_tensor传递给池化层模型，并得到输出张量output_tensor。通过打印输出张量的维度，我们可以验证输出的维度为(batch_size, input_channels, output_height, output_width)。

四）循环层

输入张量的维度为(batch_size, seq_length, input_dim)，输出张量的维度为(batch_size, seq_length, output_dim)。隐藏状态张量的维度为(batch_size, hidden_dim)，门控张量的维度为(batch_size, 2 * hidden_dim)。

以下是一个使用PyTorch实现循环层（以LSTM为例）的示例代码：

import torch
import torch.nn as nn

# 定义循环层模型（LSTM）
class RecurrentLayer(nn.Module):
    def __init__(self, input_dim, hidden_dim, output_dim):
        super(RecurrentLayer, self).__init__()
        self.lstm = nn.LSTM(input_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, output_dim)

    def forward(self, x):
        out, _ = self.lstm(x)
        out = self.fc(out)
        return out

# 创建输入张量
batch_size = 32
seq_length = 10
input_dim = 5
input_tensor = torch.randn(batch_size, seq_length, input_dim)

# 创建循环层模型
hidden_dim = 20
output_dim = 8
recurrent_layer = RecurrentLayer(input_dim, hidden_dim, output_dim)

# 执行推理
output_tensor = recurrent_layer(input_tensor)

# 打印输出张量的维度
print(output_tensor.size())  # 输出: torch.Size([32, 10, 8])

在上述代码中，我们定义了一个RecurrentLayer类，该类继承自PyTorch的nn.Module类。在__init__方法中，我们使用nn.LSTM定义了一个LSTM循环层，其中input_dim表示输入维度，hidden_dim表示隐藏状态的维度，batch_first=True表示输入张量的第一个维度是批处理大小。我们还使用nn.Linear定义了一个全连接层，将隐藏状态转换为输出维度。

在forward方法中，我们将输入张量x传递给LSTM循环层，并将LSTM的输出通过全连接层进行线性变换，得到最终的输出张量。

然后，我们创建了一个输入张量input_tensor，其维度为(batch_size, seq_length, input_dim)，其中batch_size表示批处理大小，seq_length表示序列长度，input_dim表示输入维度。

接下来，我们实例化了RecurrentLayer类，传入输入维度input_dim、隐藏状态维度hidden_dim和输出维度output_dim，得到循环层模型recurrent_layer。

最后，我们将输入张量input_tensor传递给循环层模型，并得到输出张量output_tensor。通过打印输出张量的维度，我们可以验证输出的维度为(batch_size, seq_length, output_dim)。

五）归一化层

输入张量和输出张量的维度都与线性层相同。

以下是一个使用PyTorch实现归一化层（以Batch Normalization为例）的示例代码：

import torch
import torch.nn as nn

# 定义归一化层模型（Batch Normalization）
class NormalizationLayer(nn.Module):
    def __init__(self, num_features):
        super(NormalizationLayer, self).__init__()
        self.norm = nn.BatchNorm1d(num_features)

    def forward(self, x):
        out = self.norm(x)
        return out

# 创建输入张量
batch_size = 32
input_dim = 10
input_tensor = torch.randn(batch_size, input_dim)

# 创建归一化层模型
num_features = input_dim
normalization_layer = NormalizationLayer(num_features)

# 执行推理
output_tensor = normalization_layer(input_tensor)

# 打印输出张量的维度
print(output_tensor.size())  # 输出: torch.Size([32, 10])

在上述代码中，我们定义了一个NormalizationLayer类，该类继承自PyTorch的nn.Module类。在__init__方法中，我们使用nn.BatchNorm1d定义了一个一维批归一化层，其中num_features表示输入特征的数量。

在forward方法中，我们将输入张量x传递给批归一化层，并返回归一化后的输出张量。

然后，我们创建了一个输入张量input_tensor，其维度为(batch_size, input_dim)，其中batch_size表示批处理大小，input_dim表示输入特征的数量。

接下来，我们实例化了NormalizationLayer类，传入输入特征的数量num_features，得到归一化层模型normalization_layer。

最后，我们将输入张量input_tensor传递给归一化层模型，并得到输出张量output_tensor。通过打印输出张量的维度，我们可以验证输出的维度与输入张量的维度相同。

六）激活函数

输入张量和输出张量的维度一致。

七）损失函数

输入张量的维度取决于具体的任务类型和数据格式，而输出张量的维度通常为(batch_size,)，表示每个样本的损失值。

需要注意的是，不同算子的输入和输出张量维度可能会有所不同，具体的实现方式也可能因不同框架而异。在实际使用中，需要根据具体情况来确定每个算子的张量维度，以确保算子之间的输入输出能够匹配和正确传递。

Aresiii

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
神经网络算子的张量维度对应关系

卷积层：输入张量的维度为(batch_size, input_channels, height, width)，卷积核张量的维度为(output_channels, input_channels, kernel_height, kernel_width)，偏置张量的维度为(output_channels,)，输出张量的维度为(batch_size, output_channels, output_height, output_width)。归一化层：输入张量和输出张量的维度都与线性层相同。
复制链接

扫一扫