莫名其妙NIVDIA驱动失效了,搜了好多方法,只有这个方法work了,参考:https://medium.com/@uraibeef/how-to-solve-nvidia-smi-has-failed-because-it-couldnt-communicate-with-the-nvidia-driver-deaecb956a8c
- 第一步,卸载已有驱动
sudo apt-get purge nvidia-*
sudo apt-get update
sudo apt-get autoremove
- 第二步,查看合适nvidia-driver版本
ubuntu-drivers devices
得到结果:
== /sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0 ==
modalias : pci:v000010DEd00002684sv00001043sd000088EFbc03sc00i00
vendor : NVIDIA Corporation
driver : nvidia-driver-535-server-open - distro non-free
driver : nvidia-driver-535-open - distro non-free
driver : nvidia-driver-525 - distro non-free
driver : nvidia-driver-525-server - distro non-free
driver : nvidia-driver-525-open - distro non-free
driver : nvidia-driver-535 - distro non-free recommended
driver : nvidia-driver-535-server - distro non-free
driver : xserver-xorg-video-nouveau - distro free builtin
看 "recommend" ,说明应该安装535版本.
- 第三步,安装nvidia-driver
安装推荐的适合的版本:
sudo apt install libnvidia-common-535
sudo apt-get -y install libnvidia-gl-535
sudo apt install nvidia-driver-535-server
- 第四步,重启
sudo reboot
之后再输入
nvidia-smi
应该可以显示正常的显卡信息