GotoBlas2库

GotoBLAS2是HPC领域的一个高性能矩阵库,旨在优化BLAS子例程的执行速度。它支持多种硬件平台和操作系统,如Intel Nehalem、AMD Shanghai等,并具有NUMA控制等功能。开发者Kazushige Goto已将其开源,尽管不再活跃开发,但仍然是优化计算速度的重要资源。文章介绍了如何在Windows Mingw环境下编译库,并提供了使用CBLAS库进行矩阵乘法的示例代码。
摘要由CSDN通过智能技术生成

在High Performance Computing (HPC)领域,最有影响的矩阵库-GotoBLAS,在长久的等待后终于有了更新,而且是直接从1.26跳到了GotoBLAS2,似乎Goto重写了,目前可在官网上下载的最新版本是GotoBLAS2-1.13_bsd。

今天下载了,在mingw下近20分钟的编译过程后成功生成了libgoto2_penrynp-r1.13.lib库文件。

 

GotoBLAS2 has been released by the Texas Advanced Computing Center as open source software under the BSD license. This product is no longer under active development by TACC, but it is being made available to the community to use, study, and extend. GotoBLAS2 uses new algorithms and memory techniques for optimal performance of the BLAS routines. The changes in this final version target new architecture features in microprocessors and interprocessor communication techniques; also, NUMA controls enhance multi-threaded execution of BLAS routines on node. The library features optimal performance on the following platforms:

Intel Nehalem and Atom systems
VIA Nanoprocessor
AMD Shanghai and Istanbul

The library includes the following features:

  • Configurations for a variety of hardware platforms
  • Incorporation of features of many ISAs (Instruction Set Architecture)
  • Implementation of NUMA controls to assure best process affinity and memory policy
  • Dynamic detection of multiple architecture components, which can be included in a single binary (for binary distributions)

 

 

 

有机会研究一下源码。

 

 

What are the GotoBLAS?

评论 26
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值