The error is as follows:
(base) user@slave:~$ docker run -it --runtime=nvidia e6039addee71 /bin/bash
docker: Error response from daemon: OCI runtime create failed: unable to retrieve OCI runtime error (open /run/containerd/io.containerd.runtime.v1.linux/moby/d9f69fa38a697ffbc276caefea82f0e3262683c815f95b783ad835a19461696b/log.json: no such file or directory): fork/exec /usr/bin/nvidia-container-runtime: no such file or directory: : unknown.
ERRO[0000] error waiting for container: context canceled
Solution on Ubuntu:
Installation
sudo apt-get install nvidia-container-runtime
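The error above comes from Docker failing to fork/exec /usr/bin/nvidia-container-runtime, so after installing it is worth confirming that the binary is really there. A minimal sketch; check_runtime_binary is a hypothetical helper name, not part of any NVIDIA or Docker tool, and the default path is simply the one shown in the error message:

```shell
# Hypothetical helper: report whether the runtime binary exists and is executable.
# The default path is taken from the error message; pass another path to override.
check_runtime_binary() {
    runtime_path="${1:-/usr/bin/nvidia-container-runtime}"
    if [ -x "$runtime_path" ]; then
        echo "found: $runtime_path"
    else
        echo "missing: $runtime_path"
        return 1
    fi
}

# Usage: print a hint if the package is not installed yet.
check_runtime_binary || echo "install nvidia-container-runtime first"
```

If this prints "missing", the package did not install to the expected path, and registering the runtime with Docker will keep failing with the same error.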
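Systemd drop-in file (one way to register the runtime with Docker; use either this or the daemon configuration file below, not both)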
sudo mkdir -p /etc/systemd/system/docker.service.d
sudo tee /etc/systemd/system/docker.service.d/override.conf <<EOF
[Service]
ExecStart=
ExecStart=/usr/bin/dockerd --host=fd:// --add-runtime=nvidia=/usr/bin/nvidia-container-runtime
EOF
sudo systemctl daemon-reload
sudo systemctl restart docker
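Daemon configuration file (alternative to the systemd drop-in file above; note that tee overwrites any existing /etc/docker/daemon.json, so merge the "runtimes" entry by hand if you already have one)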
sudo tee /etc/docker/daemon.json <<EOF
{
    "runtimes": {
        "nvidia": {
            "path": "/usr/bin/nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}
EOF
sudo pkill -SIGHUP dockerd
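After either method, you can check that Docker now knows about the runtime. The sketch below assumes that `docker info` prints a "Runtimes:" line listing the registered runtime names; has_nvidia_runtime is a hypothetical helper name used only for illustration:

```shell
# Hypothetical helper: read `docker info`-style text on stdin and succeed
# if the "Runtimes:" line contains a runtime named "nvidia".
has_nvidia_runtime() {
    grep -i 'runtimes:' | grep -qw 'nvidia'
}

# Only query Docker when the CLI is available on this machine.
if command -v docker >/dev/null 2>&1; then
    if docker info 2>/dev/null | has_nvidia_runtime; then
        echo "nvidia runtime is registered"
    else
        echo "nvidia runtime is NOT registered"
    fi
fi
```

If the runtime is reported as registered, the original docker run --runtime=nvidia command should no longer fail with the "no such file or directory" error.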
Official reference:
https://github.com/nvidia/nvidia-container-runtime#docker-engine-setup