活到老学到老之index操作

最新推荐文章于 2024-08-22 18:03:46 发布

ox180x

最新推荐文章于 2024-08-22 18:03:46 发布

阅读量228

点赞数

文章标签：深度学习 pytorch 机器学习神经网络自然语言处理

本文链接：https://blog.csdn.net/ox180x/article/details/124095601

版权

快速想一想，你能想到torch有哪些常见的index操作？？

1. gather

>>> a = torch.tensor([[1, 2, 3],
        [4, 5, 6]])
>>> a.gather(dim=1, index=torch.tensor([[0,1], [1,2]]))
tensor([[1, 2],
        [5, 6]])

2. index_select

>>> a
tensor([[1, 2, 3],
        [4, 5, 6]])
>>> a.index_select(dim=1, index=torch.tensor([1,2]))
tensor([[2, 3],
        [5, 6]])

3. 骚气的来了哦

根据上面例子可以看到，a为矩阵，选择a中的index，但是下面介绍一个map操作.

>>> index
tensor([[1, 2, 3],
        [4, 5, 6]])

>>> a = torch.tensor([11, 22, 33, 44, 55, 66, 77])
>>> a
tensor([11, 22, 33, 44, 55, 66, 77])
>>> index
tensor([[1, 2, 3],
        [4, 5, 6]])
>>> a[index]
tensor([[22, 33, 44],
        [55, 66, 77]])

这种操作有一个真实场景，比如：

# 1. 这是两个特征
>>> words = ['我', '爱', '中', '国']
>>> pos = ['n', 'v', 'n', 'n']

# 2. 假设words变成了一个4 * 4的临接矩阵，用于表示每个token和其他token的一个关联重要程度

>>> words_attn = torch.rand(4,4)

>>> words_attn
tensor([[0.6279, 0.6234, 0.9831, 0.5267],
        [0.2265, 0.8453, 0.5740, 0.4772],
        [0.7759, 0.6952, 0.1758, 0.3800],
        [0.9998, 0.3138, 0.5078, 0.5565]])


>>> scores, indices = words_attn.topk(k=2, dim=1)

>>> indices
tensor([[2, 0],
        [1, 2],
        [0, 1],
        [0, 3]])

# 3. 假设pos转为了
>>> pos_tensor = torch.tensor([111, 222, 333, 444])

# 4. map操作
>>> pos_tensor[indices]
tensor([[333, 111],
        [222, 333],
        [111, 222],
        [111, 444]])

# 5. 随后就可以接一个embedding搞事情了
pos_embedding(pos_tensor[indices])

# 6. 总结，这个示例的优点可以看出是快速计算，取topK然后再结合其他的特征进行操作。

ox180x

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
活到老学到老之index操作

快速想一想，你能想到torch有哪些常见的index操作？？1. gather12345>>> a = torch.tensor([[1, 2, 3], [4, 5, 6]])>>> a.gather(dim=1, index=torch.tensor([[0,1], [1,2]]))tensor([[1, 2],...
复制链接

扫一扫