NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
据大伙儿反应公共服务器一直显示连接超时,无法正常连接,便去七楼查看情况,发现并不是网络问题,使用万能办法:重启!
重启过后果然解决了连接问题,但又出现了新的问题,输入nvidia-smi后报错:
NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA
driver. Make sure that the latest NVIDIA driver is installed and
running.
搜了几个帖子,解决了该问题。
1,首先使用 nvcc- V 命令查看驱动与CUDA。
nvcc -V
nvcc: NVIDIA ® Cuda compiler driver
Copyright © 2005-2020 NVIDIA Corporation
Built on Thu_Jun_11_22:26:38_PDT_2020
Cuda compilation tools, release 11.0, V11.0.194
Build cuda_11.0_bu.TC445_37.28540450_0
发现驱动存在,进行下一步。
2, 进行查看驱动版本信息。
ls /usr/src | grep nvidia
我的版本显示为:
nvidia-460.73.01
3, 最后一步。
sudo apt-get install dkms
sudo dkms install -m nvidia -v 460.73.01
最后输入gpustat检查一下,发现可以正常 显示了:
ubun Mon Jun 21 09:30:46 2021 460.73.01
[0] GeForce RTX 2080 Ti | 39’C, 0 % | 0 / 11019 MB |
[1] GeForce RTX 2080 Ti | 45’C, 0 % | 0 / 11019 MB |
[2] GeForce RTX 2080 Ti | 44’C, 0 % | 0 / 11019 MB |
[3] GeForce RTX 2080 Ti | 45’C, 0 % | 0 / 11019 MB |