使用linux服务器时出现如题问题,说明显卡驱动出现错误
1、sudo nvidia-bug-report.sh 在当前路径下生成显卡问题报告
2、对问题报告解压并查看
3、翻阅到下面部分,发现从下午16:11之后就没有记录,说明在这里出现问题
journalctl -b -0:
Dec 16 15:24:20 ubuntu kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 244
Dec 16 15:24:21 ubuntu kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module 418.43 Tue Feb 19 01:12:11 CST 2019
Dec 16 15:24:21 ubuntu kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 418.43 Tue Feb 19 01:05:14 CST 2019
Dec 16 15:24:21 ubuntu kernel: [drm] [nvidia-drm] [GPU ID 0x00000400] Loading driver
Dec 16 15:24:21 ubuntu kernel: [drm] [nvidia-drm] [GPU ID 0x00000500] Loading driver
Dec 16 15:24:21 ubuntu kernel: [drm] [nvidia-drm] [GPU ID 0x00000600] Loading driver
Dec 16 15:24:21 ubuntu kernel: [drm] [nvidia-drm] [GPU ID 0x00000700] Loading driver
Dec 16 15:24:21 ubuntu kernel: [drm] [nvidia-drm] [