Ubuntu 18.04 LTS: Reinstalling the NVIDIA Driver and CUDA Toolkit
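Recently, while carelessly installing an NVIDIA Docker image, I broke my existing environment and had to reinstall the driver. The acceleration libraries I had installed along with it were removed as well. I went back to some blog posts on "completely uninstalling" the driver. I remember those steps working the first time I installed, but this time they did not work on my machine. The steps were: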
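# Edit the configuration file
sudo vim /etc/modprobe.d/blacklist.conf
# Append these two lines at the end of the file:
blacklist nouveau
options nouveau modeset=0
# Regenerate the initramfs so the change takes effect
sudo update-initramfs -u
# Reboot the machine
sudo reboot
# Test: no output means nouveau is no longer loaded
lsmod | grep nouveau
# Install the driver; note the options at the end
sudo ./NVIDIA-Linux-x86_64-390.48.run -no-opengl-files -no-nouveau-check
# Verify that the driver is installed:
nvidia-smi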
The procedure above no longer works on my machine, so the rest of this post follows the method given by NVIDIA:
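1. Before installing:
- Check that the system is supported, that gcc and the kernel development packages are installed, and that the machine has a CUDA-capable GPU such as a GTX 1080. The command for checking the GPU is: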
lspci | grep -i nvidia
If you already know your GPU model, you can skip this step.
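For use in a script, the same check can be wrapped so that it prints an explicit result instead of relying on empty output. This is only a sketch: `has_nvidia_gpu` is a hypothetical helper name, and it reads `lspci` output on standard input so it can be tried without the hardware.

```shell
#!/bin/sh
# Hypothetical helper: reads `lspci` output on stdin and reports whether
# any line mentions NVIDIA (case-insensitive, like `grep -i nvidia`).
has_nvidia_gpu() {
    if grep -qi nvidia; then
        echo "NVIDIA device found"
    else
        echo "no NVIDIA device found"
        return 1
    fi
}

# Usage on a real machine (lspci comes from the pciutils package):
#   lspci | has_nvidia_gpu
```

A non-zero return status lets a calling script stop before it downloads the installer.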
Check the system information:
uname -m && cat /etc/*release
My output:
(base) mlc@mlc:~$ uname -m && cat /etc/*release
x86_64
DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=18.04
DISTRIB_CODENAME=bionic
DISTRIB_DESCRIPTION="Ubuntu 18.04.5 LTS"
NAME="Ubuntu"
VERSION="18.04.5 LTS (Bionic Beaver)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 18.04.5 LTS"
VERSION_ID="18.04"
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
VERSION_CODENAME=bionic
UBUNTU_CODENAME=bionic
Check the gcc installation:
(base) mlc@mlc:~$ gcc --version
gcc (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0
Copyright (C) 2017 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
Check the version of the running kernel:
(base) mlc@mlc:~$ uname -r
5.4.0-64-generic
The kernel headers and development packages for the currently running kernel can be installed with:
$ sudo apt-get install linux-headers-$(uname -r)
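The driver installer builds a kernel module, so the headers have to match the running kernel. A rough way to check before installing is sketched below. It assumes the usual layout in which /lib/modules/<release>/build points at the installed headers. `headers_present` and its optional root argument are hypothetical, added only so the check can be tried against a scratch directory.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the headers for the given kernel
# release appear to be installed. The optional second argument is a root
# directory prefix (empty by default, i.e. the real filesystem).
headers_present() {
    release=$1
    root=${2:-}
    [ -e "$root/lib/modules/$release/build" ]
}

# Usage on a real machine:
#   headers_present "$(uname -r)" || sudo apt-get install "linux-headers-$(uname -r)"
```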
# Adjust the version numbers to match the download link you choose:
wget https://developer.download.nvidia.com/compute/cuda/11.3.1/local_installers/cuda_11.3.1_465.19.01_linux.run
sudo sh cuda_11.3.1_465.19.01_linux.run
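The runfile name carries two version numbers, so when changing the link it helps to confirm which toolkit and bundled driver you are about to install. Below is a minimal sketch using POSIX parameter expansion. `parse_runfile` is a hypothetical helper, and it assumes the `cuda_<toolkit>_<driver>_linux.run` naming seen in the URL above.

```shell
#!/bin/sh
# Hypothetical helper: splits a runfile name or URL into the toolkit
# version and the bundled driver version.
parse_runfile() {
    name=${1##*/}             # strip any directory or URL prefix
    name=${name#cuda_}        # drop the leading "cuda_"
    name=${name%_linux.run}   # drop the trailing "_linux.run"
    toolkit=${name%%_*}       # text before the first underscore
    driver=${name#*_}         # text after the first underscore
    echo "toolkit=$toolkit driver=$driver"
}

# Usage:
#   parse_runfile cuda_11.3.1_465.19.01_linux.run
```

For the file used here it reports toolkit 11.3.1 and driver 465.19.01.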
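The CUDA Toolkit contains the CUDA driver and the tools needed to build CUDA applications.
Before installing CUDA, completely uninstall any previously installed CUDA files: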
Use the following command to uninstall a Toolkit runfile installation:
$ sudo /usr/local/cuda-X.Y/bin/cuda-uninstaller
Use the following command to uninstall a Driver runfile installation:
$ sudo /usr/bin/nvidia-uninstall
Use the following commands to uninstall a RPM/Deb installation:
$ sudo dnf remove <package_name> # RHEL8/CentOS8
$ sudo yum remove <package_name> # RHEL7/CentOS7
$ sudo dnf remove <package_name> # Fedora
$ sudo zypper remove <package_name> # OpenSUSE/SLES
$ sudo apt-get --purge remove <package_name> # Ubuntu
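Since `X.Y` depends on what was installed earlier, it can be convenient to list the toolkit uninstallers that actually exist before running any of them. The sketch below only prints the commands and does not execute them. `list_uninstall_commands` is a hypothetical helper, and the `/usr/local` default is assumed from the path pattern quoted above.

```shell
#!/bin/sh
# Hypothetical helper: prints one "sudo <path>" line for every executable
# cuda-uninstaller found under <prefix>/cuda-*/bin. Nothing is removed.
list_uninstall_commands() {
    prefix=${1:-/usr/local}
    for u in "$prefix"/cuda-*/bin/cuda-uninstaller; do
        # If the glob matched nothing, $u is the literal pattern and is skipped.
        if [ -x "$u" ]; then
            echo "sudo $u"
        fi
    done
    return 0
}

# Usage:
#   list_uninstall_commands            # looks under /usr/local
```

Review the printed lines and run the ones you want by hand.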
- Install from the runfile (this step is the same as at the beginning of the post):
- Create a file at /etc/modprobe.d/blacklist-nouveau.conf with the following contents:
blacklist nouveau
options nouveau modeset=0
Regenerate the kernel initramfs:
$ sudo update-initramfs -u
Installation command: sudo sh cuda_<version>_linux.run
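The blacklist file from the step above can also be written without opening an editor. This is a sketch, not part of NVIDIA's instructions: `write_nouveau_blacklist` is a hypothetical helper, and the optional path argument exists only so it can be tried on a scratch file first.

```shell
#!/bin/sh
# Hypothetical helper: writes the two nouveau blacklist lines to the given
# file (default: the path named in the step above), replacing its contents.
write_nouveau_blacklist() {
    conf=${1:-/etc/modprobe.d/blacklist-nouveau.conf}
    printf '%s\n' 'blacklist nouveau' 'options nouveau modeset=0' > "$conf"
}

# Writing under /etc/modprobe.d requires root. Afterwards regenerate the
# initramfs and reboot:
#   sudo update-initramfs -u
```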