OpenMP + MPI 编程 (二)Stampede2 RAM Arrangement

Stampede2 RAM Arrangement

Stampede2 includes 4,200 Intel Xeon Phi compute nodes with KNL processors, and 1,736 Intel Xeon compute nodes with Skylake processors. While the two sides of the system have notable differences, the general organization of Stampede2 follows the typical pattern for clusters, which can be summarized and illustrated as follows:

  • KNL nodes: distributed memory
    Each KNL node has 96GB DDR4 RAM
    Each KNL node has 16GB MCDRAM high bandwidth memory (HBM) as well
    Memory is local to each node and is not directly accessible from other nodes

  • Skylake nodes: distributed memory
    Each Skylake Xeon (SKX) node has 192GB DDR4 RAM
    Again, memory is local to each node and is not directly accessible from other nodes

  • Memory spans all cores on a node (of either type): shared memory
    A node’s full local memory is addressable from any core in any socket

  • One or two sockets per node
    Each KNL node has one socket (to hold one Intel “KNL” processor)
    Each Xeon node has two sockets (to hold two Intel “Skylake” processors)

  • Multiple cores per socket
    Each KNL socket (processor) has 68 cores
    Each Xeon socket (processor) has 24 cores

  • Memory is attached to sockets
    Cores sharing the socket have fastest access to attached memory
    KNL has additional core-level memory locality (to be discussed later)

In most of the following diagrams, we will use the Xeon Skylake processor as a model in order to make the figures easier to follow (which wouldn’t be the case if we always drew KNL cores!).

真有钱。。。

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值