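On Ubuntu 16, setting up GPU support takes three main steps:
- Install the NVIDIA graphics driver
- Install the CUDA Toolkit
- Install cuDNN
Installing the NVIDIA graphics driver
Option 1: Let the system install the driver
Open System Settings >> Software & Updates >> Additional Drivers, select the first entry, then click Apply Changes.
A driver installed this way may not be the latest version. After rebooting, run the following command in a terminal: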
nvidia-smi
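If NVIDIA information is displayed, the installation succeeded.
Option 2: Manually download the .run file from the official site and install it
Find out your graphics card model, download the matching driver from the NVIDIA website, and install it.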
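One way to find the card model is to filter the output of lspci (from the pciutils package). The sketch below is only an illustration: nvidia_gpus is a hypothetical helper name, and the device-class strings it matches are an assumption about how lspci labels display adapters on your machine.

```shell
#!/bin/sh
# nvidia_gpus: hypothetical helper that reads lspci output on stdin and
# prints the NVIDIA display devices without the leading bus address.
nvidia_gpus() {
    grep -i -E 'VGA compatible controller|3D controller' \
        | grep -i 'nvidia' \
        | sed 's/^[^ ]* //'
}

# Only query the hardware when lspci is actually available.
if command -v lspci >/dev/null 2>&1; then
    lspci | nvidia_gpus
fi
```

The model name printed here (for example the part in square brackets) is what you select on the NVIDIA download page.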
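Option 3: Install from the command line
1. Remove any previously installed NVIDIA driver:
sudo apt-get purge 'nvidia-*'
2. Add the third-party driver PPA:
sudo add-apt-repository ppa:graphics-drivers/ppa
3. Update the package index:
sudo apt-get update
4. List the driver versions supported by your card:
ubuntu-drivers devices
5. Install the driver:
sudo apt-get install nvidia-384 # 384 is the driver version; pick one listed by the previous command
sudo apt-get update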
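Steps 4 and 5 can be tied together by reading the recommended package name out of the ubuntu-drivers listing instead of typing it by hand. This is a minimal sketch: recommended_driver is a hypothetical helper, and it assumes the listing contains lines of the form "driver : nvidia-384 - third-party free recommended".

```shell
#!/bin/sh
# recommended_driver: hypothetical helper that reads `ubuntu-drivers devices`
# output on stdin and prints the first package flagged as recommended.
recommended_driver() {
    awk '$1 == "driver" && /recommended/ { print $3; exit }'
}

# Only run the query on systems that have ubuntu-drivers installed.
if command -v ubuntu-drivers >/dev/null 2>&1; then
    pkg=$(ubuntu-drivers devices | recommended_driver)
    echo "Recommended driver package: ${pkg:-none found}"
    # To install it afterwards: sudo apt-get install "$pkg"
fi
```

The install command is left as a comment so that nothing is changed on the system until you have checked the package name.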
Installing CUDA
Before installing, note that each CUDA version requires a matching NVIDIA driver version; see the figure below.
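Because the requirement is a comparison between version numbers, it can be checked with a short script. The sketch below is an illustration only: version_ge is a hypothetical helper built on GNU sort -V, and MIN_DRIVER is an assumed placeholder value that you should replace with the minimum listed for your CUDA release in NVIDIA's release notes.

```shell
#!/bin/sh
# version_ge A B: succeeds when version A >= version B.
# Hypothetical helper; relies on the -V (version sort) option of GNU sort.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Assumed placeholder, not an authoritative requirement.
MIN_DRIVER="375.26"

# Only query the driver when nvidia-smi is installed.
if command -v nvidia-smi >/dev/null 2>&1; then
    installed=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader | head -n 1)
    if version_ge "$installed" "$MIN_DRIVER"; then
        echo "Driver $installed meets the assumed minimum $MIN_DRIVER"
    else
        echo "Driver $installed is older than the assumed minimum $MIN_DRIVER"
    fi
fi
```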
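Installation steps:
- First, download the CUDA 8.0 installer from https://developer.nvidia.com/cuda-toolkit-archive
- Then install CUDA 8.0. Open a terminal, change into the directory that contains the downloaded file, and run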
sudo dpkg -i cuda-repo-ubuntu1604-8-0-rc_8.0.27-1_amd64.deb
sudo apt-get update
sudo apt-get install cuda
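- Configure the environment variables. In a terminal, run
gedit ~/.bashrc
and append the following lines at the end:
export PATH="$PATH:/usr/local/cuda-8.0/bin"
export LD_LIBRARY_PATH="/usr/local/cuda-8.0/lib64${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}"
Then reload the environment variables: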
source ~/.bashrc
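If you would rather not edit the file by hand, the same lines can be appended from a script. The sketch below shows one possible approach: append_once is a hypothetical helper that adds a line only when the file does not already contain it, so running it twice does not duplicate the exports.

```shell
#!/bin/sh
# append_once FILE LINE: hypothetical helper that appends LINE to FILE
# unless an identical line is already present.
append_once() {
    grep -qxF -- "$2" "$1" 2>/dev/null || printf '%s\n' "$2" >> "$1"
}

# Example calls, commented out so that pasting this block changes nothing:
# append_once "$HOME/.bashrc" 'export PATH="$PATH:/usr/local/cuda-8.0/bin"'
# append_once "$HOME/.bashrc" 'export LD_LIBRARY_PATH="/usr/local/cuda-8.0/lib64${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}"'
```

The single quotes matter: they keep $PATH from being expanded at the time the line is written, so the expansion happens each time the shell starts.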
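- Test
In a terminal, run
cd /usr/local/cuda-8.0/samples/1_Utilities/deviceQuery
make
./deviceQuery
On a correct installation the output looks like the following (this sample was captured with CUDA 10.1, so its version numbers differ from a CUDA 8.0 setup):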
./deviceQuery Starting...
CUDA Device Query (Runtime API) version (CUDART static linking)
Detected 1 CUDA Capable device(s)
Device 0: "GeForce GTX 1080 Ti"
CUDA Driver Version / Runtime Version 10.1 / 10.1
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory: 11178 MBytes (11721506816 bytes)
(28) Multiprocessors, (128) CUDA Cores/MP: 3584 CUDA Cores
GPU Max Clock rate: 1582 MHz (1.58 GHz)
Memory Clock rate: 5505 Mhz
Memory Bus Width: 352-bit
L2 Cache Size: 2883584 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device supports Compute Preemption: Yes
Supports Cooperative Kernel Launch: Yes
Supports MultiDevice Co-op Kernel Launch: Yes
Device PCI Domain ID / Bus ID / location ID: 0 / 1 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.1, CUDA Runtime Version = 10.1, NumDevs = 1
Result = PASS
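The line that matters in this output is the final "Result = PASS". A small sketch of checking for it automatically, where device_query_passed is a hypothetical helper name:

```shell
#!/bin/sh
# device_query_passed: hypothetical helper that reads deviceQuery output on
# stdin and succeeds only if it contains the line "Result = PASS".
device_query_passed() {
    grep -q '^Result = PASS'
}

# Run the check only from a directory where deviceQuery has been built.
if [ -x ./deviceQuery ]; then
    if ./deviceQuery | device_query_passed; then
        echo "deviceQuery passed"
    else
        echo "deviceQuery did not report PASS"
    fi
fi
```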
You can also run the following in a terminal:
nvcc -V
If the installation succeeded, the CUDA compiler version information is displayed.
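To confirm that the nvcc on your PATH is the release you meant to install, the release number can be pulled out of that output. This is a sketch under one assumption: that the output contains a line of the form "Cuda compilation tools, release 8.0, V8.0.44". The helper name nvcc_release is hypothetical.

```shell
#!/bin/sh
# nvcc_release: hypothetical helper that reads `nvcc -V` output on stdin
# and prints only the release number (for example 8.0).
nvcc_release() {
    sed -n 's/.*release \([0-9][0-9.]*\),.*/\1/p'
}

# Only call nvcc when it is on the PATH.
if command -v nvcc >/dev/null 2>&1; then
    echo "nvcc release: $(nvcc -V | nvcc_release)"
fi
```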
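Installing cuDNN
First download the package from https://developer.nvidia.com/cudnn. You need to log in; follow the steps on the site, and make sure to download the build that matches your CUDA version.
Then open a terminal and:
- cd to the download directory;
- extract the cuDNN archive for CUDA 8.0;
- copy the header and library files into the CUDA directory;
- the commands to enter in the terminal are:
cd /media/your-username/ # go to the directory containing the cuDNN archive
tar xvzf cudnn-8.0-linux-x64-v6.0.tgz # extract
sudo cp cuda/include/cudnn.h /usr/local/cuda/include # copy the header into include
sudo cp -P cuda/lib64/libcudnn* /usr/local/cuda/lib64 # copy the libraries into lib64, keeping symlinks
sudo chmod a+r /usr/local/cuda/include/cudnn.h /usr/local/cuda/lib64/libcudnn* # make the header and libraries readable by everyone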
Testing cuDNN
Once the files are copied, you can check the cuDNN version with:
cat /usr/local/cuda/include/cudnn.h | grep CUDNN_MAJOR -A 2
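The grep above prints the raw #define lines. If a single version string is more convenient, the three values can be joined with awk. A minimal sketch, assuming the header defines CUDNN_MAJOR, CUDNN_MINOR and CUDNN_PATCHLEVEL as plain numbers; cudnn_version is a hypothetical helper name.

```shell
#!/bin/sh
# cudnn_version HEADER: hypothetical helper that prints MAJOR.MINOR.PATCH
# from the #define lines in the given cudnn.h.
cudnn_version() {
    awk '
        $1 == "#define" && $2 == "CUDNN_MAJOR"      { major = $3 }
        $1 == "#define" && $2 == "CUDNN_MINOR"      { minor = $3 }
        $1 == "#define" && $2 == "CUDNN_PATCHLEVEL" { patch = $3 }
        END { if (major != "") print major "." minor "." patch }
    ' "$1"
}

# Only read the header if it has been copied into place.
if [ -r /usr/local/cuda/include/cudnn.h ]; then
    echo "cuDNN version: $(cudnn_version /usr/local/cuda/include/cudnn.h)"
fi
```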
For another test, see https://blog.csdn.net/bat67/article/details/84065261