向量、矩阵范数

最新推荐文章于 2024-01-19 14:12:19 发布

来日可期1314

最新推荐文章于 2024-01-19 14:12:19 发布

阅读量348

点赞数 1

分类专栏：机器学习文章标签：线性代数机器学习

本文链接：https://blog.csdn.net/ssjq123/article/details/120270034

版权

机器学习专栏收录该内容

17 篇文章 1 订阅

订阅专栏

范数

1. 向量范数
- $L_p$ 范数
2. 矩阵范数
3. pytorch计算范数[^3]

1. 向量范数

对于向量 $\mathbf{x}=(x_1, x_2, \dots, x_n)$

$L_p$ 范数

$L_p$ 范数是一系列范数的一般表示形式，包括 $L_0$ 范数， $L_1$ 范数， $L_2$ 范数…
$\|\mathbf{x}\|_p=\sqrt[p]{\sum_i{|x_i|^p}}$

1.1 $L_0$ 范数

$\|\mathbf{x}\|_0=\sqrt[0]{\sum_i{|x_i|^0}}=\vert\{1\leq i\leq n \vert x_i\neq0\}\vert$
表示向量中的非零元素个数

1.2 $L_1$ 范数

$\|\mathbf{x}\|_1=\sqrt[1]{\sum_i{|x_i|^1}}=\sum_i{|x_i|}$
表示向量中元素的绝对值之和

1.3 $L_2$ 范数

$\|\mathbf{x}\|_1=\sqrt[2]{\sum_i{|x_i|^2}}$

可以类比为向量 $\mathbf{x}$ 与原点之间的欧氏距离。

1.4 $L_{\infty}$ 范数

$\|\mathbf{x}\|_{\infty}=\max{(|x_i|)}$
正无穷范数表示求取向量元素绝对值中的最大值

1.5 $L_{-\infty}$ 范数

$\|\mathbf{x}\|_{-\infty}=\min{(|x_i|)}$
负无穷范数表示求取所有向量元素绝对值中的最小值

2. 矩阵范数

对于矩阵 $\mathbf{A}\in \mathbb{R}^{m\times n}$

2.1 1-范数

$\|\mathbf{A}\|_1=\max_j{\sum_{i=1}^m{|a_{ij}|}}$
矩阵元素也可以表示为： $a_{i,j}$

2.2 2-范数

$\Vert\mathbf{A}\Vert_2=\sqrt[2]{\lambda_1}$
其中 $\lambda_1$ 表示 $\mathbf{A}^{\mathrm{T}}\mathbf{A}$ 的最大特征值，称为谱函数。

2.3 $\infty$ 范数

$\|\mathbf{A}\|_{\infty}=\max_i{\sum_{j=1}^n{|a_{ij}|}}$

2.4 Fro(Frobenius)范数

F-范数表示方法是否粗体问题，参考已有的论文¹，可以使用粗体。
论文中的F-范数

$\|\mathbf{A}\|_{\mathbf{F}}=(\sum_{i=1}^m{\sum_{j=1}^n{a_{ij}}^2})^{\frac{1}{2}}$
经常取其平方, 即
$\|\mathbf{A}\|_{\mathbf{F}}^2=(\sum_{i=1}^m{\sum_{j=1}^n{a_{ij}}^2})$

2.5 核范数

核范数是矩阵奇异值的和，用于约束矩阵的低秩，对于稀疏性质的数据言，其矩阵是低秩且会包含大量冗余信息，这些信息可被用于恢复数据和提取特征。²

2.6 $l_{2,1}$ 范数³

对每个行向量求 $l_2$ 范数，再对列向量求 $l_1$ 范数。
$\Vert\mathbf{A}\Vert_{2,1}=\sum_{i=1}^m\sqrt{\sum_{j=1}^n\vert a_{ij}\vert^2}$

3. pytorch计算范数⁴

配置pytorch环境，参见上期博客。

def norm(input, p="fro", dim=None, keepdim=False, out=None, dtype=None):  # noqa: F811
    r"""Returns the matrix norm or vector norm of a given tensor.

    .. warning::

        torch.norm is deprecated and may be removed in a future PyTorch release.

        Use :func:`torch.linalg.norm`, instead, or :func:`torch.linalg.vector_norm`
        when computing vector norms and :func:`torch.linalg.matrix_norm` when
        computing matrix norms. Note, however, the signature for these functions
        is slightly different than the signature for torch.norm.

    Args:
        input (Tensor): The input tensor. Its data type must be either a floating
            point or complex type. For complex inputs, the norm is calculated using the
            absolute value of each element. If the input is complex and neither
            :attr:`dtype` nor :attr:`out` is specified, the result's data type will
            be the corresponding floating point type (e.g. float if :attr:`input` is
            complexfloat).

        p (int, float, inf, -inf, 'fro', 'nuc', optional): the order of norm. Default: ``'fro'``
            The following norms can be calculated:

            ======  ==============  ==========================
            ord     matrix norm     vector norm
            ======  ==============  ==========================
            'fro'   Frobenius norm  --
            'nuc'   nuclear norm    --
            Number  --              sum(abs(x)**ord)**(1./ord)
            ======  ==============  ==========================

            The vector norm can be calculated across any number of dimensions.
            The corresponding dimensions of :attr:`input` are flattened into
            one dimension, and the norm is calculated on the flattened
            dimension.

            Frobenius norm produces the same result as ``p=2`` in all cases
            except when :attr:`dim` is a list of three or more dims, in which
            case Frobenius norm throws an error.

            Nuclear norm can only be calculated across exactly two dimensions.

        dim (int, tuple of ints, list of ints, optional):
            Specifies which dimension or dimensions of :attr:`input` to
            calculate the norm across. If :attr:`dim` is ``None``, the norm will
            be calculated across all dimensions of :attr:`input`. If the norm
            type indicated by :attr:`p` does not support the specified number of
            dimensions, an error will occur.
        keepdim (bool, optional): whether the output tensors have :attr:`dim`
            retained or not. Ignored if :attr:`dim` = ``None`` and
            :attr:`out` = ``None``. Default: ``False``
        out (Tensor, optional): the output tensor. Ignored if
            :attr:`dim` = ``None`` and :attr:`out` = ``None``.
        dtype (:class:`torch.dtype`, optional): the desired data type of
            returned tensor. If specified, the input tensor is casted to
            :attr:'dtype' while performing the operation. Default: None.

    .. note::
        Even though ``p='fro'`` supports any number of dimensions, the true
        mathematical definition of Frobenius norm only applies to tensors with
        exactly two dimensions. :func:`torch.linalg.norm` with ``ord='fro'`` aligns
        with the mathematical definition, since it can only be applied across
        exactly two dimensions.

    Example::

        >>> import torch
        >>> a = torch.arange(9, dtype= torch.float) - 4
        >>> b = a.reshape((3, 3))
        >>> torch.norm(a)
        tensor(7.7460)
        >>> torch.norm(b)
        tensor(7.7460)
        >>> torch.norm(a, float('inf'))
        tensor(4.)
        >>> torch.norm(b, float('inf'))
        tensor(4.)
        >>> c = torch.tensor([[ 1, 2, 3],[-1, 1, 4]] , dtype= torch.float)
        >>> torch.norm(c, dim=0)
        tensor([1.4142, 2.2361, 5.0000])
        >>> torch.norm(c, dim=1)
        tensor([3.7417, 4.2426])
        >>> torch.norm(c, p=1, dim=1)
        tensor([6., 6.])
        >>> d = torch.arange(8, dtype= torch.float).reshape(2,2,2)
        >>> torch.norm(d, dim=(1,2))
        tensor([ 3.7417, 11.2250])
        >>> torch.norm(d[0, :, :]), torch.norm(d[1, :, :])
        (tensor(3.7417), tensor(11.2250))
    """

import torch
import cmath

x = torch.arange(9, dtype=torch.float) - 4
y = x.reshape((3, 3))
# 默认是Fro范数
print("torch.norm(x) = {}".format(torch.norm(x)))
print("torch.norm(y) = {}".format(torch.norm(y)))
sum = 0.
for i in x:
   sum += i**2
print(cmath.sqrt(sum))

# 无穷范数
print("torch.norm(y, float('inf')) = {}".format(torch.norm(y, float('inf'))))
print("torch.norm(y, float('-inf')) = {}".format(torch.norm(y, float('-inf'))))
c = torch.tensor([[1, 2, 3], [-1, 1, 4]], dtype=torch.float)
print("torch.norm(c, dim=0) = {}".format(torch.norm(c, dim=0)))
print("torch.norm(c, dim=1) = {}".format(torch.norm(c, dim=1)))
print("torch.norm(c, p=1, dim=1) = {}".format(torch.norm(c, p=1, dim=1)))
d = torch.arange(8, dtype=torch.float).reshape(2, 2, 2)
print("torch.norm(d[0, :, :]) = {}".format( torch.norm(d[0, :, :])))

运行结果：

torch.norm(x) = 7.745966911315918
torch.norm(y) = 7.745966911315918
(7.745966692414834+0j)
torch.norm(y, float(‘inf’)) = 4.0
torch.norm(y, float(’-inf’)) = 0.0
torch.norm(c, dim=0) = tensor([1.4142, 2.2361, 5.0000])
torch.norm(c, dim=1) = tensor([3.7417, 4.2426])
torch.norm(c, p=1, dim=1) = tensor([6., 6.])
torch.norm(d, dim=(1,2)) = tensor([ 3.7417, 11.2250])

pytorch的norm方法注释中解释的已经很清楚了，值得注意的是注释中提到torch.norm() is deprecated，意思就是该方法不再维护了。 $l_{2,1}$ 范数的计算可以通过控制dim参数分两次计算。

伍俊良,刘飞.实对称矩阵和与差的一些特征值与F-范数不等式[J].高等学校计算数学学报,2004(04):365-370. ↩︎
https://hyper.ai/wiki/2687 ↩︎
https://blog.csdn.net/minfanphd/article/details/119106708?spm=1001.2014.3001.5501 ↩︎
https://zhuanlan.zhihu.com/p/35897775 ↩︎