下载合适的nvidia驱动以使用nvidia-smi

三只佩奇不结义

已于 2023-08-05 11:37:49 修改

阅读量412

点赞数

文章标签： nvidia

于 2023-05-07 20:40:56 首次发布

本文链接：https://blog.csdn.net/qq_41196612/article/details/130547529

版权

文章描述了解决NVIDIA-SMI无法与NVIDIA驱动通信的问题。解决方案包括彻底卸载所有NVIDIA和CUDA相关包，安装Linux内核头文件，然后从NVIDIA官网下载最新运行文件进行安装。在某些情况下，可能需要重启电脑以使nvidia-smi正常工作。此问题发生在执行系统更新并改变Linux内核后。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

方法1

参考：参考资料，可能没有效

方法2

具体参考NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running里面已解决所使用的方法，主要的流程为：先删除所有与cuda有关的内容，再安装CUDA Toolkit：

Even with those commands, the issue wasn’t solved.

Eventually, the fastest way to fix 2 machines with a package manager is to purge all Nvidia & Cuda,did it by:

sudo apt-get remove --purge '^nvidia-.*'
sudo apt-get remove --purge '^libnvidia-.*'
sudo apt-get remove --purge '^cuda-.*'

Then after it’s clean ran that:

sudo apt-get install linux-headers-$(uname -r)

From here - it’s the same for all VMs:
Download latest run file from Nvidia site, and run it, accept if needed to upgrade current, or install from scratch.

The driver is back to work. （有时候需要重启电脑才能用nvidia-smi）

The issue was started after did some updates, and the Linux kernel was changed.