线性网络模型

Joe-Stalin

已于 2023-07-17 10:42:56 修改

阅读量218

点赞数

分类专栏：深度学习文章标签：深度学习 python 机器学习 pytorch

于 2023-07-16 19:11:35 首次发布

本文链接：https://blog.csdn.net/qq_44378386/article/details/131753663

版权

深度学习专栏收录该内容

2 篇文章 0 订阅

订阅专栏

文章介绍了线性网络模型的基础知识，包括Y=wx+b的公式表示，权重w和偏置b的作用，以及如何根据输入特征和输出特征构建权重矩阵w和偏置向量b。此外，还详细解析了PyTorch中实现全连接层的nn.Linear模块，强调了in_features和out_features参数，并通过代码示例展示了如何创建和应用全连接层。

摘要由CSDN通过智能技术生成

一：线性网络模型

线性网络模型的基本表现形式为:
$Y = w x + b$
其中 $w$ 表示权重， $b$ 表示偏置。

如果对于样本 $X = [x_1, x_2, x_3......x_n]$ 代表样本 $X$ 的 $n$ 个特征，那么我们对每个特征都设置一个相对应的权重 $w_i$ , 则改写成:
$\begin{aligned} Y &= w_1x_1 + w_2x_2 + w_3x_3 + ...... + w_n*x_n + b \\ &= \sum_{i=1}^n {w_ix_i} + b \end{aligned}$
将上面的公式改写成矩阵的形式得到:
$Y = w^Tx + b$
其中 $w^T = [w_1, w_2, ...... , w_n], x^T = [x_1, x_2, x_3,......, x_n]$

如果形象化的表示就是:
在这里插入图片描述
在上面这个过程中我们处理的只是一个特征，现在我们将特征扩展为三个:
此时 $w, b$ 变成了:
$\begin{bmatrix} w_{11} & w_{12} & w_{13} \\ w_{21} & w_{22} & w_{23} \\ w_{31} & w_{32} & w_{33} \end{bmatrix} {\;}\;\; b = \begin{bmatrix} b_{1} \\ b_{2} \\ b_{3} \end{bmatrix}$
$w$ 的行数代表输出的特征个数，列数代表的是输入的特征数，每个特征对应的权重值。例如，假如我们的样本特征是4维，我们想在第一层神经网络模型构造一个6维的特征。那么这中间 $w = (6 * 4), b = (1 * 6)$ 。

二：Linear Model

全连接层: torch.nn.Linear

全连接层就是我们上面提到的线性网络模型:

class Linear(Module):
  
    __constants__ = ['in_features', 'out_features']
    in_features: int
    out_features: int
    weight: Tensor

    def __init__(self, in_features: int, out_features: int, bias: bool = True,
                 device=None, dtype=None) -> None:
        factory_kwargs = {'device': device, 'dtype': dtype}
        super().__init__()
        self.in_features = in_features
        self.out_features = out_features
        self.weight = Parameter(torch.empty((out_features, in_features), **factory_kwargs))
        if bias:
            self.bias = Parameter(torch.empty(out_features, **factory_kwargs))
        else:
            self.register_parameter('bias', None)
        self.reset_parameters()

全连接层最主要的两个参数是 in_features, out_features分别代表输入特征和输出特征。根据上面的推导我们可以知道根据这两个参数我们就可以构造出矩阵 $w = (out\_features * in\_features), b = (1 * out\_features)$

使用

if __name__ == '__main__':
    # 构造有四个特征的样本
    sample = torch.tensor([1., 1., 1., 1.])
    # 构造一个全连接层, 输入是dim(sample), 输出是6维
    linear = torch.nn.Linear(4, 6)
    print(linear.weight)
    print(linear.bias)
    output = linear(sample)
    print(output)
    
"""
weight = tensor([ [ 0.0172,  0.0627,  0.1316,  0.4917],
                  [ 0.4408,  0.0173,  0.0681, -0.2942],
                  [ 0.1543, -0.2808, -0.4453, -0.4480],
                  [-0.4821, -0.0123,  0.0235, -0.1196],
                  [ 0.3925, -0.4515,  0.4247, -0.1525],
                  [ 0.1604,  0.0124,  0.1822, -0.4387] ])

bias = tensor([-0.0178,  0.3837, -0.0688,  0.2666, -0.0250,  0.4139])

output = tensor([ 0.6855,  0.6157, -1.0886, -0.3239,  0.1883,  0.3301])
"""