总结报告
叶子心情你不懂
帅哥
展开
-
OpenMP + MPI 编程 (七)Quiz
Hybrid programming combines what two models of computing?Shared-memory parallel and distributed-memory parallelParallel and serialMessage-passing and NUMAAll of the aboveCorrect.An MPI process is generally single-threaded unless the code has been..原创 2020-11-13 16:29:58 · 435 阅读 · 0 评论 -
OpenMP + MPI 编程 (六)Hybrid Program Template & Types of MPI Calls Among Threads
Hybrid Program TemplateStart with MPI initialization(Serial regions are executed by themain thread of the MPI process)Create OMP parallel regions withineach MPI processMPI calls may be allowed here tooMPI rank is known to all threadsCall MPI in sin原创 2020-11-13 16:29:22 · 435 阅读 · 0 评论 -
OpenMP + MPI 编程 (五)NUMA Operations&using numactl&SMP Nodes&SMP Sockets
NUMA OperationsIf memory were completely uniform, there would be no need to worry about questions like “where do processes go?”. Only for NUMA is the placement of processes/threads and allocated memory (NUMA control) of any importance.The default NUMA co原创 2020-11-13 16:28:51 · 585 阅读 · 0 评论 -
OpenMP + MPI 编程 (四)Two Ways to View a Node When Writing Parallel ProgramsANDCreating Hybrid Configur
Two Ways to View a Node When Writing Parallel ProgramsOpenMP (or pthreads) onlylaunch one process per nodehave each process fork one thread (or maybe more) per coreshare data using shared memorycan’t share data with a different process (except mayb原创 2020-11-13 16:28:31 · 169 阅读 · 0 评论 -
OpenMP + MPI 编程 (三)Dealing with NUMA and Why Hybrid? Or Why Not?
Dealing with NUMAHow do we deal with NUMA (Non-Uniform Memory Access)?Standard models for parallel programs assume a uniform architectureThreads for shared memoryparent process uses pthreads or OpenMP to fork multiple threadsthreads share the same原创 2020-11-13 16:27:59 · 237 阅读 · 0 评论 -
OpenMP + MPI 编程 (二)Stampede2 RAM Arrangement
Stampede2 RAM ArrangementStampede2 includes 4,200 Intel Xeon Phi compute nodes with KNL processors, and 1,736 Intel Xeon compute nodes with Skylake processors. While the two sides of the system have notable differences, the general organization of Stamped原创 2020-11-13 16:27:11 · 205 阅读 · 0 评论 -
OpenMP + MPI 编程 (一)Introduction
Yun (Helen) He and Chris Ding, Lawrence Berkeley National Laboratory, June 24, 2004: Hybrid OpenMP and MPI Programming and Tuning (NUG2004).Texas Advanced Computing Center: Stampede2 User Guide: Hybrid Model.Message Passing Interface Forum: MPI-2: M.原创 2020-11-13 16:26:38 · 277 阅读 · 0 评论 -
GNU中用于原子内存访问的内置函数
https://gcc.gnu.org/onlinedocs/gcc-4.1.2/gcc/Atomic-Builtins.htmlAPI功能type __sync_fetch_and_add (type *ptr, type value, …)type __sync_fetch_and_sub (type *ptr, type value, …)type __sync_fetch_and_or (type *ptr, type value, …)type __sync_fetch_an原创 2020-08-13 23:10:54 · 199 阅读 · 0 评论 -
libnuma详解(A NUMA API for LINUX)
白皮书numa3手册numa3手册一、什么是NUMA关于这个很多博客都有,基本都是这个白皮书的翻译或者扩充。在传统的SMP(对称多处理)系统中,计算机有一个由所有cpu共享的内存控制器。当所有处理器同时访问内存时,这种单一内存连接常常成为瓶颈。对于具有更多cpu的大型系统,它也不能很好地扩展。由于这个原因,越来越多的现代系统正在使用CC/NUMA(缓存一致性/非一致性内存访问)架构。例如AMD* Opteron*、IBM* Power5*、HP* Superdome和SGI* Altix*。原创 2020-08-05 22:23:01 · 10503 阅读 · 0 评论 -
图计算系统小调研
仔细看看图计算系统是干嘛的,做笔记,决定未来方向。一. 费马科技 五篇知乎文 图计算系统发展简史https://zhuanlan.zhihu.com/p/83202666社交网络分析、网页排序、社区发现、天体物理学、计算化学、生物信息学、超大规模集成电路的设计、运输路线的规划、电力网络的仿真模拟等等早期的图计算框架:早期的面向图计算的程序库缺乏对用户友好的编程模型,需要介入和管理的细节较多,上手难度大(BGL、PBGL)。通用大数据处理系统通过较为廉价的普通服务器集群进行大规模数原创 2020-08-04 22:39:32 · 809 阅读 · 0 评论 -
OpenMP学习笔记
一、OpenMP简介1. 内存共享并行模型。机器必须多处理器/核,共享内存。底层架构可以是UMA和NUMA。UMANUMA2. Fork-Join模型重复fork-join操作。3. OpenMP特性OpenMP仅通过线程来完成并行。OpenMP的程序一般由串行等价性,也就是并行运算最终的结果会和某个串行程序相等。支持 c / c++ 等语言。 编译器VS、gcc、clang等都支持OpenMP。可移植性好。编译命令一般加上-fopenmp就可以。二、Op原创 2020-08-04 22:34:59 · 3246 阅读 · 1 评论 -
c++ 实现linear和CNN神经网络(数学公式)
https://www.cnblogs.com/pinard/p/6422831.htmlhttps://www.cnblogs.com/pinard/p/6494810.htmlhttps://www.cnblogs.com/pinard/p/10750718.htmlhttps://www.cnblogs.com/pinard/p/10773942.html理论多层感知机前向传播及导数al=σ(wlTal−1+bl)ml×n=σ(ml×ml−1×ml−1×n+ml×1)a^l=\sigma(原创 2020-07-18 13:20:28 · 930 阅读 · 0 评论 -
贝叶斯概率模型
一、贝叶斯模型思想贝叶斯理论的思想,简单点讲就是要在已知条件的前提下,先设定一个假设,然后通过先验实验来更新这个概率。工作流程优点1. 基于古典数学理论,分类效率稳定2. 对缺失数据不敏感3. 准确度高,速度快4. 小规模数据表现好5. 能处理多分类6. 增量训练,分批训练缺点1. 依赖训练数据2. 属性不能相关3. 需要基于先验概率,假设出来的先验概率可能出...原创 2020-07-18 13:18:02 · 4798 阅读 · 0 评论 -
牛顿广义二项式定理-母函数
好久没写博客了,有好多都是写成了草稿没写完。列个清单慢慢补。。数论专题。概率\期望专题划分树专题省赛训练补题机器学习的记录课程要求写的一些东西cf的题 好了进入正题广义组合数 数据结构老师让搜 (70.5)\binom{7}{0.5}(0.57) 这种东西。搜出来是广义组合数,对应的有广义二项式定理。一看这个玩意儿ACM经常用就学一下。...原创 2019-03-28 23:54:26 · 5744 阅读 · 4 评论 -
一点小感触
明天要考PAT甲级,刷了五道题,惊讶发现好水的题。再想想去年的我乙级只有不到五十分真的有点感触。平时做acm题一直觉得自己一题都不会,只是顶着压力学,其实还是有进步的啊,进步都是悄悄的。...原创 2018-09-07 22:58:10 · 215 阅读 · 1 评论