March 2017

Original: VPR

LUT; BLE (1:1); CLB (1:n): logic cluster / slice. Block RAM (1:32): storage. VPR: package produces logic units; placement: physical position; routing.

2017-03-31 14:29:59 1287

Original: GPGPU-Sim build error: cannot find -lcutil_x86_64 -lshrutil_x86_64

"/usr/bin/ld: cannot find -lcutil_x86_64" and "/usr/bin/ld: cannot find -lshrutil_x86_64": I run into this build error often with GPGPU-Sim. What causes it? During the build, the linker cannot find libcutil_x86_64.a and libshrutil_x86_64.a. Why can't it find them? Because in ma

2017-03-30 23:23:13 2074 6

Original: The main function: prototype and parameters

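Contents: main function prototype; main function parameters; main function return value; main is not the first function executed. Main function prototype: in C/C++, the main function often takes the parameters argc and argv, as follows: int main(int argc, char** argv); int main(int argc, char* argv[]); int main(int argc, char* argv[], char* env[])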

2017-03-28 20:56:16 3258
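
The prototypes listed in the entry above all pass the command line as a count plus an array of C strings. As a minimal sketch (the helper name `collect_args` is hypothetical and not from the post), the following shows how such a pair could be read. It relies on the standard convention that `argv[0]` through `argv[argc - 1]` are the arguments; the three-parameter form with `env` is a common platform extension rather than part of the C or C++ standards.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper for illustration: copies the argc/argv pair that
// main receives into a vector of strings.
std::vector<std::string> collect_args(int argc, char* argv[]) {
    std::vector<std::string> args;
    for (int i = 0; i < argc; ++i) {
        // argv[i] is a NUL-terminated C string; argv[0] is conventionally
        // the program name.
        args.emplace_back(argv[i]);
    }
    return args;
}
```

A program would typically forward its own parameters, for example `auto args = collect_args(argc, argv);` as the first statement of `int main(int argc, char* argv[])`.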

Translation: CUDA programming: binary utilities and decompilation

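1. Overview. 1.1 What is a CUDA binary? A CUDA binary file (.cubin, .elf) is an ELF-format file that contains executable code sections as well as symbols, relocators, debug info, and other content. By default, the CUDA compiler nvcc embeds cubin files into the host executable. You can also compile with nvcc -cubin to get an x.cubin file directly. Note: for more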

2017-03-15 23:22:55 7371

Original: Getting started with Perl, part 1

1. History of Perl. 2. Data types: scalar types

2017-03-15 22:23:30 453

Original: Causes of cache coherence problems

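Incoherence has three causes: sharing of writable data, process migration, and I/O transfers. 1. Incoherence caused by sharing of writable data. As the figures show, we abbreviate a core as P and a core's private cache as C. Figure 1 shows the state before the update: the private caches C1 and C2 of P1 and P2 both hold the shared data x, consistent with shared memory. Figure 2: under a write-through write policy, when P1 takes the data x in C1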

2017-03-10 08:58:16 2262

Original: The concept of cache coherence

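Introduction: first, a word on consistency. In everyday life, problems arise from poor communication: one party changes something without promptly telling the other, so the two sides end up with different information. That is a consistency problem. Concept: multi-core processors have the same problem. For example, suppose a value x in memory is cached in different private caches. With two cores, core0 and core1, and x = 1 initially, if core0 reassigns x to 3 while core1 still believes x = 1, the program will go wrong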

2017-03-09 16:28:19 1730
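
The x = 1 to x = 3 scenario in the entry above can be sketched with a toy model. This is not a hardware model: the type names and the `invalidate_others` flag are hypothetical, and write-invalidate is used here only as one common way to restore coherence (the excerpt itself does not name a protocol).

```cpp
#include <cassert>

// Toy private cache holding a single variable x (illustrative only).
struct PrivateCache {
    int x;
    bool valid;
};

// Two cores, each with a private cache, sharing one memory location x.
struct ToySystem {
    int memory_x = 1;           // x = 1 initially, as in the example
    PrivateCache c0{1, true};   // core0's cached copy
    PrivateCache c1{1, true};   // core1's cached copy

    // core0 writes x with write-through: its own cache and memory are
    // both updated. core1's copy is only invalidated if requested.
    void core0_write(int value, bool invalidate_others) {
        c0.x = value;
        c0.valid = true;
        memory_x = value;
        if (invalidate_others) {
            c1.valid = false;
        }
    }

    // core1 reads x: a valid cached copy is returned without consulting
    // memory; an invalid one is refilled from memory first.
    int core1_read() {
        if (!c1.valid) {
            c1.x = memory_x;
            c1.valid = true;
        }
        return c1.x;
    }
};
```

Without invalidation, `core1_read()` keeps returning the old value even though memory already holds the new one; with invalidation, the next read misses and fetches the updated value.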

Original: PTX ISA synchronization instructions: bar & membar

bar: Barrier synchronization. Syntax: bar.sync a{, b}; bar.arrive a, b; bar.red.popc.u32 d, a{, b}, {!}c; bar.red.op.pred p, a{, b}, {!}c; .op = { .and, .or }; http://docs.nvidia.com/cuda/parallel-t

2017-03-07 23:37:00 2457

Original: L1 Data Cache in Nvidia

Nvidia architecture | local data | global loads | global store | for L1 cache | reference white paper. Fermi | caching | caching | caching | L1/shared mem not coherent. Kepler | caching | Not caching | Not cachi

2017-03-07 20:11:56 425

Original: GPU architecture basics: Concurrent Kernel Execution in Fermi arch & later

Fermi supports concurrent kernel execution, where different kernels of the same application context can execute on the GPU at the same time. Concurrent kernel execution allows programs that execute a

2017-03-07 14:35:42 1050

Original: GPU architecture basics: L1 data cache & Unified L2 cache in Fermi arch

NVIDIA Parallel DataCache TM with Configurable L1 and Unified L2 Cache Working with hundreds of GPU computing applications from various industries, we learned that while Shared memory benefits many

2017-03-07 10:11:20 2248

Repost: install typora for linux

for Linux
# optional, but recommended
sudo apt-key adv --keyserver keyserver.ubuntu.com --recv-keys BA300B7755AFCFAE
# add Typora's repository
sudo add-apt-repository 'deb https://typora.io ./linux/'
sud

2017-03-04 22:50:41 2568

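UEFI BIOS supports two boot modes: Legacy BIOS boot mode and UEFI boot mode. Some devices and operating systems do not yet support UEFI-based BIOS and can boot only in Legacy BIOS boot mode. Depending on your situation, you may have to specify which UEFI BIOS boot mode to use: Legacy BIOS boot mode or UEFI boot mode. Choosing Legacy BIOS boot mode allows HBAs and Express Mo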

2017-03-04 11:15:18 1334

Original: PTX ISA atomic operations: a summary of atom

A summary of the atomic operation atom. atom: Atomic reduction operations for thread-to-thread communication. Syntax: atom{.space}{.scope}.op.type d, [a], b; atom{.space}{.scope}.op.type d, [a], b, c; .space = { .global, .shared }; .s

2017-03-01 16:15:13 915
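
The two syntax forms in the entry above (one source operand `b`, or two source operands `b, c`) can be loosely mirrored on the host with `std::atomic`. This is an analogy in standard C++, not PTX and not the post's own code: the helper names are hypothetical, and the mapping assumes that `d` receives the value stored at `[a]` before the operation, with the four-operand form read as compare-and-swap (`b` the compare value, `c` the new value).

```cpp
#include <atomic>
#include <cassert>

// Analogy for the "d, [a], b" form with an add operation: the location is
// updated atomically and the previous value is returned (the role of d).
int atomic_add_returning_old(std::atomic<int>& a, int b) {
    return a.fetch_add(b);
}

// Analogy for the "d, [a], b, c" form read as compare-and-swap: if the
// location equals `compare`, it is replaced by `swap`. Either way the
// value that was observed in the location is returned.
int atomic_cas_returning_old(std::atomic<int>& a, int compare, int swap) {
    int expected = compare;
    // On failure, compare_exchange_strong writes the current value into
    // `expected`; on success, `expected` already equals the old value.
    a.compare_exchange_strong(expected, swap);
    return expected;
}
```

For example, with a location holding 5, `atomic_add_returning_old(v, 2)` returns 5 and leaves 7 behind; a following `atomic_cas_returning_old(v, 7, 10)` returns 7 and stores 10.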

Parallel Programming

Parallel Programming slides (PPT), for use alongside Parallel Programming.

2015-11-06

OpenCV (Chinese edition)

Image processing: intended for image processing, video detection, camera applications, and similar uses.

2013-03-17


This is Chuanqiz's Blog