1. 使用目的以及注意事项
- 主要用于打包工作中的ai模型,方便在同事的linux电脑上进行迁移
- 最后一个测试的时候,比较新的显卡(rtx 4090)需要使用更高的cuda版本,否则容器会运行失败
2. docker 安装
https://docs.docker.com/engine/install/ubuntu
- 添加docker源
sudo apt update
sudo apt install ca-certificates curl
sudo install -m 0755 -d /etc/apt/keyrings
sudo curl -fsSL https://download.docker.com/linux/ubuntu/gpg -o /etc/apt/keyrings/docker.asc
sudo chmod a+r /etc/apt/keyrings/docker.asc
echo \
"deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.asc] https://download.docker.com/linux/ubuntu \
$(. /etc/os-release && echo "$VERSION_CODENAME") stable" | \
sudo tee /etc/apt/sources.list.d/docker.list > /dev/null
sudo apt update
- 安装依赖
sudo apt install docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin
- 测试
sudo docker run hello-world
- docker 408请求超时
https://blog.csdn.net/qq_21359467/article/details/136173584
3. 安装nvidia container
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/1.11.0/install-guide.html
- 添加源
distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
&& curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
&& curl -s -L https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | \
sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
- 安装依赖
sudo apt-get update
sudo apt-get install -y nvidia-docker2
sudo systemctl restart docker
- 测试
sudo docker run --rm --gpus all nvidia/cuda:11.6.2-base-ubuntu20.04 nvidia-smi