Comparing the dot performance of int and float NumPy arrays

sunnyyan

Article link: https://blog.csdn.net/sunnyyan/article/details/84964023

While using NumPy's dot function I ran into a very interesting problem: dot on int-typed NumPy arrays is far slower than dot on float-typed NumPy arrays.

The arrays have the same shape, yet the difference in dtype alone leads to a performance gap of several tens of times.
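A minimal sketch of how the gap can be measured. The helper name `time_dot`, the matrix size, and the repeat count are my own choices for illustration and are not from the original measurement; the actual ratio depends on the machine and on which BLAS library NumPy was built against.

```python
import timeit

import numpy as np


def time_dot(dtype, n=200, repeats=3):
    """Return the best wall-clock time (seconds) of np.dot on two n x n arrays."""
    rng = np.random.default_rng(0)
    # Same shape and same values for every dtype; only the element type changes.
    a = rng.integers(0, 10, size=(n, n)).astype(dtype)
    b = rng.integers(0, 10, size=(n, n)).astype(dtype)
    return min(timeit.repeat(lambda: np.dot(a, b), number=1, repeat=repeats))


int_time = time_dot(np.int64)
float_time = time_dot(np.float64)
print(f"int64:   {int_time:.6f} s")
print(f"float64: {float_time:.6f} s")
print(f"ratio (int / float): {int_time / float_time:.1f}")
```

Increasing `n` usually makes the difference more visible, since the accelerated path benefits most on large matrices.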

I found the reason in a Stack Overflow post:

“When Numpy is built with an accelerated BLAS like ATLAS, these functions are replaced to make use of the faster implementations. The faster implementations only affect float32, float64, complex64, and complex128 arrays. Furthermore, the BLAS API only includes matrix-matrix, matrix-vector, and vector-vector products. Products of arrays with larger dimensionalities use the built in functions and are not accelerated.”

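In other words, when NumPy is built against an accelerated BLAS (Basic Linear Algebra Subprograms) library such as ATLAS, dot is dispatched to the faster BLAS routines. Those routines only cover float32, float64, complex64, and complex128 arrays, so integer arrays fall back to NumPy's slower built-in implementation. In addition, the BLAS API only provides matrix-matrix, matrix-vector, and vector-vector products, so products of higher-dimensional arrays are not accelerated either.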
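Given this explanation, one possible workaround is to cast integer arrays to float64 before calling dot and convert the result back afterwards. This is only a sketch and is not from the Stack Overflow post. The helper name `int_dot_via_float` is hypothetical, and it assumes every product and partial sum stays below 2**53 in magnitude, the range in which float64 represents integers exactly.

```python
import numpy as np


def int_dot_via_float(a, b):
    """Compute an integer dot product through the float64 code path.

    Assumption: all intermediate values stay below 2**53 in magnitude,
    so float64 holds them exactly and rounding back to int is lossless.
    """
    result = np.dot(a.astype(np.float64), b.astype(np.float64))
    # Round to the nearest integer before casting back to int64.
    return np.rint(result).astype(np.int64)


rng = np.random.default_rng(0)
a = rng.integers(0, 100, size=(50, 40))
b = rng.integers(0, 100, size=(40, 30))

# For these small values the float path matches the pure-integer result.
print(np.array_equal(int_dot_via_float(a, b), np.dot(a, b)))
```

For values that can exceed that range, the cast loses precision and the plain integer dot is the safer choice.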