Linux系统 训练模型时cuda不存在 nvidia-smi 报错
报错:RuntimeError: cuda runtime error (38) : no CUDA-capable device is detected at /tmp/pip-req-build-ocx5vxk7/aten/src/THC/THCGeneral.cpp:50
nvidia-smi:
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver.
Make sure that the latest NVIDIA driver is installed and running.
已经装过nvidia 显卡驱动,并且之前训练过模型,nvidia-smi报错,避免重装驱动的繁琐,找到以下解决方案:
先查看自己之前安装的驱动版本
ls /usr/src | grep nvidia
nvidia-418-418.39
我的驱动版本是nvidia-418-418.39,在终端输入命令:
sudo apt install dkms
sudo dkms install -m nvidia -v 418.39
问题解决
@Elaine