tensorflow matmul函数

最新推荐文章于 2024-04-28 18:05:44 发布

starfarmingSuZhou

最新推荐文章于 2024-04-28 18:05:44 发布

阅读量1.5k

点赞数 2

分类专栏：深度学习文章标签：深度学习人工智能 tensorflow

本文链接：https://blog.csdn.net/qq_35597259/article/details/118679389

版权

深度学习专栏收录该内容

2 篇文章 0 订阅

订阅专栏

这篇博客探讨了TensorFlow中matmul函数的要求和一些隐性行为。通常，`tf.matmul()`函数要求操作数的最后两维可乘，其余维度相等。然而，它也支持一些隐性的维度匹配，例如通过扩展维度、堆叠或平铺来使得形状兼容。博主展示了几个例子，说明即使维度不完全符合标准要求，TensorFlow仍能通过内部处理使运算成功。但并非所有情况都适用，当无法通过这些方式调整形状时，将会抛出错误。

摘要由CSDN通过智能技术生成

tensorflow 函数matmul要求

a=tf.constant([[[1,2,3],[4,5,6]],[[1,2,3],[4,5,6]]])
b=tf.constant([[[1,2],[3,4],[5,6]],[[1,2],[3,4],[5,6]]])
print(a.shape) #2*2*3
print(b.shape) #2*3*2
c=tf.matmul(a,b)
print(c)

结论：要求 a、b的最后两维可乘，其他维度相等。但有时会有一些隐性的东西：

a=tf.constant([[[1,2,3],[4,5,6]],[[1,2,3],[4,5,6]]])
b=tf.constant([[1,2],[3,4],[5,6]])
print(a.shape) 
print(b.shape) #3*2
c=tf.matmul(a,b)
print(c)

比如这个是成立的，tf内部实现时使用了expend_dim以及tile后stack在一起，所以发生了有意思的事情，就是：

a=tf.constant([[[1,2,3],[4,5,6]],[[1,2,3],[4,5,6]]])
b=tf.constant([[1,2],[3,4],[5,6]])#3*2

a=tf.constant([[[1,2,3],[4,5,6]],[[1,2,3],[4,5,6]]])
b=tf.constant([[[1,2],[3,4],[5,6]]]) #1*3*2

a=tf.constant([[[1,2,3],[4,5,6]],[[1,2,3],[4,5,6]]])
b=tf.constant([[[1,2],[3,4],[5,6]],[[1,2],[3,4],[5,6]]]) #2*3*2

这三种方式是等效的，但是当b的维度不可以通过tile达到a的维度时会报错：

a=tf.constant([[[1,2,3],[4,5,6]],[[1,2,3],[4,5,6]]])
b=tf.constant([[[1,2],[3,4],[5,6]],[[1,2],[3,4],[5,6]],,[[1,2],[3,4],[5,6]]]) #3*3*2

tensorflow.python.framework.errors_impl.InvalidArgumentError: In[0] and In[1] must have compatible batch dimensions: [2,2,3] vs. [3,3,2] [Op:BatchMatMulV2]