在使用docker执行run.sh文件时,出现了错误
Job for docker.service failed because the control process exited with error code.
之后我就按照网上的操作作了一些修改,然后doker就崩了,首先是docker的卸载重装,总是会出现dockecer重装失败的问题,找到如下解决办法:
https://zhidao.baidu.com/question/533997170.html
到这里 dockerce的安装就不会出现问题了
但在执行sudo docker run hello-world
时,出现如下错误:
docker: Cannot connect to the Docker daemon at unix:///var/run/docker.sock. Is the docker daemon running?.
See 'docker run --help'.
解决方案:
https://blog.csdn.net/qq_45527118/article/details/106131071
运行sh./build.sh又出现错误
W: GPG error: https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64 Release: The following signatures were invalid: BADSIG F60F4B3D7FA2AF80 cudatools <cudatools@nvidia.com>
W: The repository 'https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64 Release' is not signed.
E: Failed to fetch https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64/Packages Writing more data than expected (1588456 > 1585719)
E: Some index files failed to download. They have been ignored, or old ones used instead.
解决方法:
https://www.cnblogs.com/vontroy/p/5622718.html
运行sh./build.sh成功
运行sh ./run.sh又出现错误
docker: Error response from daemon: Unknown runtime specified nvidia.
See 'docker run --help'.
解决方案:
https://www.cnblogs.com/luckforefforts/p/13642682.html
又出现新的错误:
docker: Error response from daemon: OCI runtime
create failed: unable to retrieve OCI runtime error
(open /run/containerd/io.containerd.runtime.v1.linux
/moby/11f291bee172d93c0743731f389650afa6741fdac20b8d
67b857f6374691cb0d/log.json: no such file or
directory): fork/exec /usr/bin/nvidia-container-
runtime: no such file or directory: : unknown.
经过一番搜索,终于找到了问题所在,我需要先安装一个能在GPU环境下运行的nvidia docker ,安装过程
https://blog.csdn.net/weixin_38369492/article/details/105809571
然后再安装一个nvidia-container-runtime
sudo apt-get install nvidia-container-runtime
重启docker
sudo systemctl restart docker
运行sh ./run.sh出现如下结果:
(pytorch_gpu) chzy@server2:~/anaconda3/pkgs/SATNet-master/docker$ sh ./run.sh
root@server2:/data#