Installing the NVIDIA driver
Download: https://www.nvidia.com/Download/index.aspx
Install the driver
sudo sh NVIDIA-Linux-x86_64-440.118.02.run
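If Docker containers or other services are still using the driver, run the following commands first:
Stop the services that hold the driver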
sudo service lightdm stop
sudo stop nvidia-digits-server
sudo service docker stop
sudo rmmod nvidia-uvm
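To see whether anything still holds the driver after these services are stopped, the "Used by" column of lsmod can be inspected. A minimal sketch, assuming the usual three-column lsmod layout; the helper name busy_nvidia_modules is made up for this example:

```shell
# Hypothetical helper: list NVIDIA kernel modules that are still in use.
# Reads lsmod-formatted text on stdin and prints "<module> <use count>"
# for every module whose name starts with "nvidia" and whose count is > 0.
busy_nvidia_modules() {
    awk '$1 ~ /^nvidia/ && $3 + 0 > 0 { print $1, $3 }'
}

# On a live system (not executed here):
#   lsmod | busy_nvidia_modules
```

If this prints nothing, no NVIDIA module reports a user, and rmmod should be able to unload it.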
You may also need to blacklist nouveau and stop any existing Docker services that use the GPU.
Blacklist nouveau
sudo vim /etc/modprobe.d/blacklist-nouveau.conf
Add the following lines:
blacklist nouveau
options nouveau modeset=0
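The same two lines can also be written without opening an editor. A minimal sketch; the function name nouveau_blacklist_lines is an illustrative choice, and the target path in the comment is the one used above:

```shell
# Hypothetical helper: emit the two blacklist lines on stdout.
nouveau_blacklist_lines() {
    printf '%s\n' 'blacklist nouveau' 'options nouveau modeset=0'
}

# On a real system (needs root, not executed here):
#   nouveau_blacklist_lines | sudo tee /etc/modprobe.d/blacklist-nouveau.conf
```

Piping through sudo tee is used because a plain shell redirection would run with the unprivileged user's permissions.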
Regenerate the initramfs so that the change takes effect
sudo update-initramfs -u
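update-initramfs is the Debian/Ubuntu tool. The upgrade section of this guide uses yum, and on RHEL-family systems the initramfs is normally rebuilt with dracut instead. The sketch below picks whichever tool is present; the function name initramfs_rebuild_cmd is hypothetical, and it only prints the command rather than running it:

```shell
# Hypothetical helper: print the initramfs rebuild command for this system.
# The lookup relies on the sbin directories being on PATH.
initramfs_rebuild_cmd() {
    if command -v update-initramfs >/dev/null 2>&1; then
        echo "sudo update-initramfs -u"   # Debian / Ubuntu
    elif command -v dracut >/dev/null 2>&1; then
        echo "sudo dracut --force"        # RHEL / CentOS / Fedora
    else
        echo "unknown"
    fi
}

initramfs_rebuild_cmd
```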
Switch the default runlevel to text mode
sudo systemctl set-default multi-user.target
Reboot
sudo reboot
Verify that the change took effect: run the following command. If it prints nothing, nouveau is no longer loaded.
lsmod | grep nouveau
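Because an empty result is easy to misread, the check can be wrapped so that it prints an explicit message. A sketch, with the hypothetical function nouveau_status reading lsmod output from stdin:

```shell
# Hypothetical helper: report whether nouveau appears in lsmod output (stdin).
nouveau_status() {
    if grep -q '^nouveau[[:space:]]'; then
        echo "nouveau is still loaded"
    else
        echo "nouveau is not loaded"
    fi
}

# On a live system (not executed here):
#   lsmod | nouveau_status
```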
Check the output of nvidia-smi
[root@localhost sbin]# nvidia-smi
Sat Oct  9 14:54:38 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 440.33.01    Driver Version: 440.33.01    CUDA Version: 10.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla V100-FHHL...  Off  | 00000000:AF:00.0 Off |                    0 |
| N/A   55C    P0    32W / 150W |     12MiB / 16160MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
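It is also worth confirming that the version nvidia-smi reports is the one that was just installed. The sketch below extracts the version from the installer file name and shows, in a comment, how it could be compared with the value from nvidia-smi's query interface. The function name runfile_version is an assumption made for illustration.

```shell
# Hypothetical helper: extract the driver version from an installer file name
# such as NVIDIA-Linux-x86_64-440.118.02.run.
runfile_version() {
    name=$(basename "$1" .run)   # strip directory and the .run suffix
    echo "${name##*-}"           # keep what follows the last hyphen
}

runfile_version NVIDIA-Linux-x86_64-440.118.02.run   # prints 440.118.02

# On a machine with the driver loaded (not executed here):
#   installed=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader | head -n 1)
#   if [ "$installed" = "$(runfile_version NVIDIA-Linux-x86_64-440.118.02.run)" ]; then
#       echo "versions match"
#   else
#       echo "versions differ"
#   fi
```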
Upgrading the NVIDIA driver
Uninstall the existing driver before upgrading.
Remove all NVIDIA-related packages and their dependencies
sudo yum remove "nvidia-*"
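yum only removes packages that were installed through the package manager. A driver installed from a .run file, as in the first section, ships its own nvidia-uninstall program. A sketch that chooses between the two; the function name driver_removal_cmd is hypothetical, and it prints the command rather than running it:

```shell
# Hypothetical helper: print a suitable removal command for the current driver.
driver_removal_cmd() {
    if command -v nvidia-uninstall >/dev/null 2>&1; then
        echo "sudo nvidia-uninstall"          # driver came from a .run installer
    else
        echo "sudo yum remove 'nvidia-*'"     # driver came from yum packages
    fi
}

driver_removal_cmd
```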
Clean up the gcc environment
sudo yum autoremove -y "gcc*"
sudo yum autoremove -y "cpp*"
sudo yum autoremove -y dkms
sudo rm -rf /opt/rh # remove the gcc installed by devtoolset
sudo rm -rf /usr/lib/gcc
sudo rm -rf /usr/libexec/gcc
whereis gcc
sudo yum install -y gcc
sudo yum install -y gcc-c++
sudo yum install -y glibc-devel
gcc -v
Then follow the installation procedure in the first section.
Official guide:
CUDA Installation Guide for Linux, 12.1 documentation (nvidia.com)