Starting the container:
docker run -it --name xxx --gpus all tensorflow/tensorflow:1.10.0-gpu-py3 /bin/bash
With --gpus all, the container may fail to start and report this error:
docker: Error response from daemon: could not select device driver "" with ...
If nvidia-smi runs correctly on the host, the most likely cause is that nvidia-container-toolkit is not installed.
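The diagnosis above can be sketched as a small shell check. This is only an illustration: the function names is_gpu_driver_error and check_gpu_prereqs are hypothetical, and the rpm query assumes a yum/rpm-based host, as the repository path used in the installation steps suggests.

```shell
#!/bin/sh
# Succeeds (exit 0) if the given text contains the Docker error quoted above.
is_gpu_driver_error() {
    case "$1" in
        *"could not select device driver"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical helper: report which prerequisite appears to be missing.
# Assumes an rpm-based host; adapt the package query for other distributions.
check_gpu_prereqs() {
    if ! nvidia-smi >/dev/null 2>&1; then
        echo "nvidia-smi failed on the host: check the NVIDIA driver first"
        return 1
    fi
    if ! rpm -q nvidia-container-toolkit >/dev/null 2>&1; then
        echo "nvidia-container-toolkit is not installed"
        return 2
    fi
    echo "driver and toolkit both look present"
}
```

Running check_gpu_prereqs on the host before retrying docker run shows whether the driver or the toolkit is the missing piece.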
Installation steps:
distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
&& curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.repo | sudo tee /etc/yum.repos.d/nvidia-docker.repo
sudo yum clean expire-cache
sudo yum install nvidia-container-toolkit -y
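After the install, the Docker daemon usually has to be restarted before --gpus all takes effect. A minimal sketch of that follow-up, assuming a systemd host and reusing the image from the run command above (the function name verify_gpu_container is hypothetical, and the check assumes nvidia-smi is exposed inside the container by the NVIDIA runtime):

```shell
#!/bin/sh
# Assumed image: the same one used in the docker run command above.
IMAGE="tensorflow/tensorflow:1.10.0-gpu-py3"

# Hypothetical helper: restart Docker, then check GPU visibility in a container.
verify_gpu_container() {
    # Restart Docker so the daemon picks up the newly installed toolkit.
    sudo systemctl restart docker || return 1
    # Run nvidia-smi in a throwaway container; a GPU table in the output
    # suggests the toolkit is wired up correctly.
    docker run --rm --gpus all "$IMAGE" nvidia-smi
}
```

Calling verify_gpu_container should print the same GPU listing as nvidia-smi on the host; if the original error still appears, the toolkit installation is worth rechecking.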
Reference: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker