深度学习处理数据中维度变换操作手册

挨打且不服66

已于 2024-08-03 22:17:25 修改

阅读量289

点赞数 1

分类专栏： python 文章标签：深度学习人工智能

于 2024-07-26 16:07:21 首次发布

本文链接：https://blog.csdn.net/lf_78910jqk/article/details/140718188

版权

python 专栏收录该内容

50 篇文章 0 订阅

订阅专栏

好的，我们可以通过具体数字来更清楚地了解维度变换。假设以下变量有以下形状：

batch_size = 4
max_ent_cnt = 97
num_labels = 97
feature_size = 768
proto_dim = 768

初始形状如下：

ent_head: [97, 4, 768]（[max_ent_cnt, batch_size, feature_size]）
ent_tail: [97, 4, 768]（[max_ent_cnt, batch_size, feature_size]）
proto: [4, 97, 768]（[batch_size, num_labels, proto_dim]）

Step-by-Step Transformation

ent_head 和 ent_tail 变换

# Original shape: [max_ent_cnt, batch_size, feature_size] -> [97, 4, 768]
ent_head = ent_head.permute(1, 0, 2).unsqueeze(2).repeat(1, 1, self.max_ent_cnt, 1)
# After permute and unsqueeze: [4, 97, 1, 768]
# After repeat: [4, 97, 97, 768]

# Original shape: [max_ent_cnt, batch_size, feature_size] -> [97, 4, 768]
ent_tail = ent_tail.permute(1, 0, 2).unsqueeze(1).repeat(1, self.max_ent_cnt, 1, 1)
# After permute and unsqueeze: [4, 1, 97, 768]
# After repeat: [4, 97, 97, 768]

具体过程：

permute(1, 0, 2) 将 [97, 4, 768] 变为 [4, 97, 768]
unsqueeze(2) 将 [4, 97, 768] 变为 [4, 97, 1, 768]（在第三维添加一个维度）
repeat(1, 1, self.max_ent_cnt, 1) 将 [4, 97, 1, 768] 变为 [4, 97, 97, 768]（在第三维重复 self.max_ent_cnt 次）

类似的步骤对 ent_tail 进行处理，只是 unsqueeze 和 repeat 的维度不同。

proto 变换

# Original shape: [batch_size, num_labels, proto_dim] -> [4, 97, 768]
proto = proto.unsqueeze(1).repeat(1, self.max_ent_cnt, 1, 1)
# After unsqueeze: [4, 1, 97, 768]
# After repeat: [4, 97, 97, 768]

具体过程：

unsqueeze(1) 将 [4, 97, 768] 变为 [4, 1, 97, 768]（在第一维添加一个维度）
repeat(1, self.max_ent_cnt, 1, 1) 将 [4, 1, 97, 768] 变为 [4, 97, 97, 768]（在第二维重复 self.max_ent_cnt 次）

调整 einsum 等式以匹配形状

logits = torch.einsum("xyz,bhtx,bcy,bhtz->bhtc", self.core_tensor, ent_head, proto.float(), ent_tail) + self.cls_bias

einsum 计算说明：

self.core_tensor 形状 [768, 768, 768] -> xyz
ent_head 形状 [4, 97, 97, 768] -> bhtx
proto 形状 [4, 97, 97, 768] -> bcy
ent_tail 形状 [4, 97, 97, 768] -> bhtz

einsum 将按照如下方式进行计算：

x 与 x 相乘，结果的形状是 [4, 97, 97, 97]
y 与 y 相乘，结果的形状是 [4, 97, 97, 97]
z 与 z 相乘，结果的形状是 [4, 97, 97, 97]
结果最终聚合为 [4, 97, 97, 97]

最终形状

logits 的最终形状为 [4, 97, 97, 97]。

在使用 PyTorch 的 reshape 或 view 函数时，不会丢失数据。这两个函数的主要作用是改变张量的形状，而不会改变张量的数据内容。

示例代码

让我们通过一个具体的例子来说明这一点：

import torch

# 生成一个形状为 [1, 97] 的张量，使用从 0 到 96 的整数
tensor = torch.arange(97).reshape(1, 97)

# 打印生成的张量及其形状
print("Original tensor:")
print(tensor)
print("Shape:", tensor.shape)

# 将张量转换为形状 [97, 1]
reshaped_tensor = tensor.reshape(97, 1)

# 打印转换后的张量及其形状
print("\nReshaped tensor:")
print(reshaped_tensor)
print("Shape:", reshaped_tensor.shape)

输出

运行上述代码后，您会看到输出如下：

Original tensor:
tensor([[ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15, 16, 17,
         18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35,
         36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53,
         54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71,
         72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89,
         90, 91, 92, 93, 94, 95, 96]])
Shape: torch.Size([1, 97])

Reshaped tensor:
tensor([[ 0],
        [ 1],
        [ 2],
        [ 3],
        [ 4],
        [ 5],
        [ 6],
        [ 7],
        [ 8],
        [ 9],
        [10],
        [11],
        [12],
        [13],
        [14],
        [15],
        [16],
        [17],
        [18],
        [19],
        [20],
        [21],
        [22],
        [23],
        [24],
        [25],
        [26],
        [27],
        [28],
        [29],
        [30],
        [31],
        [32],
        [33],
        [34],
        [35],
        [36],
        [37],
        [38],
        [39],
        [40],
        [41],
        [42],
        [43],
        [44],
        [45],
        [46],
        [47],
        [48],
        [49],
        [50],
        [51],
        [52],
        [53],
        [54],
        [55],
        [56],
        [57],
        [58],
        [59],
        [60],
        [61],
        [62],
        [63],
        [64],
        [65],
        [66],
        [67],
        [68],
        [69],
        [70],
        [71],
        [72],
        [73],
        [74],
        [75],
        [76],
        [77],
        [78],
        [79],
        [80],
        [81],
        [82],
        [83],
        [84],
        [85],
        [86],
        [87],
        [88],
        [89],
        [90],
        [91],
        [92],
        [93],
        [94],
        [95],
        [96]])
Shape: torch.Size([97, 1])