报错：RuntimeError: view size is not compatible with input tensor‘s size and stride

最新推荐文章于 2023-05-17 09:47:43 发布

呀比小饼干

最新推荐文章于 2023-05-17 09:47:43 发布

阅读量766

点赞数

分类专栏： Debug 文章标签： python 机器学习数据挖掘

本文链接：https://blog.csdn.net/qq_45850131/article/details/123717159

版权

Debug 专栏收录该内容

4 篇文章 0 订阅

订阅专栏

报错：RuntimeError: view size is not compatible with input tensor‘s size and stride

def attention_net(self, lstm_output, final_state):
        batch_size = lstm_output.size(0)
        hidden = final_state.view(batch_size, -1, 1)
        
        attn_weights = torch.bmm(lstm_output, hidden).squeeze(2)
        soft_attn_weights = F.softmax(attn_weights, 1)
        context = torch.bmm(lstm_output.transpose(1, 2), soft_attn_weights.unsqueeze(2)).squeeze(2)
        print(context.shape)
        print(soft_attn_weights.shape)
        return context, soft_attn_weights

python报错：RuntimeError: view size is not compatible with input tensor‘s size and stride

这是因为view()需要Tensor中的元素地址是连续的，但可能出现Tensor不连续的情况，所以先用 .contiguous() 将其在内存中变成连续分布，改为：

hidden = final_state.contiguous().view(batch_size, -1, 1)

这样就能成功运行了；

关于连续性的解释：
多维张量在内存中存储是线性数组，而怎么去解读它，就需要自己定义shape，比如对于a[6]数组，可以解释为两行三列(2,3)或者三行两列(3,2)。而这里的索引和在内存中的线性数组的一一对应关系要符合一定的规则，符合就是连续的，不符合就不是，就得需要调用contiguous。而在训练网络中，并不需要理解背后的机理，这属于张量计算框架的内容，想要进一步了解的可以参考Github上的miniTorch教程。

参考文章：https://blog.csdn.net/tiao_god/article/details/108189879

呀比小饼干

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
报错：RuntimeError: view size is not compatible with input tensor‘s size and stride

报错：RuntimeError: view size is not compatible with input tensor‘s size and stridedef attention_net(self, lstm_output, final_state): batch_size = lstm_output.size(0) hidden = final_state.view(batch_size, -1, 1) attn_weights
复制链接

扫一扫