在宿主机中新版docker使用nvidia-smi
安装docker
1.关闭docker
sudo systemctl stop docker
2.卸载旧版本
首先卸载旧版本docker
sudo apt-get remove docker docker-engine docker.io containerd runc
或
sudo apt-get purge docker-ce
3.更新索引
更新apt包索引
或
sudo apt-get update
sudo apt-get -y install apt-transport-https ca-certificates curl software-properties-common
4.更换docker源
选择国内的云服务商,这里选择阿里云为例
curl -sSL http://acs-public-mirror.oss-cn-hangzhou.aliyuncs.com/docker-engine/internet | sh -
//或者
curl -fsSL http://mirrors.aliyun.com/docker-ce/linux/ubuntu/gpg | sudo apt-key add
5、 安装所需要的包
sudo apt-get install linux-image-extra-$(uname -r) linux-image-extra-virtual
6、 添加使用 HTTPS 传输的软件包以及 CA 证书
sudo apt-get update
sudo apt-get install apt-transport-https ca-certificates
7、 添加GPG密钥
sudo apt-key adv --keyserver hkp://p80.pool.sks-keyservers.net:80 --recv-keys 58118E89F3A912897C070ADBF76221572C52609D
8、 添加软件源
echo “deb https://apt.dockerproject.org/repo ubuntu-xenial main” | sudo tee /etc/apt/sources.list.d/docker.list
//或者
sudo add-apt-repository "deb [arch=amd64] http://mirrors.aliyun.com/docker-ce/linux/ubuntu $(lsb_release -cs) stable
9、 添加成功后更新软件包缓存
sudo apt-get update
//或者
apt-cache madison docker-ce
10、 安装最新的docker
sudo apt-get install docker-ce
//启动docker
sudo systemctl start docker
// 加入开机启动
sudo systemctl enable docker
11、 查看安装后信息
sudo docker info
查看docker版本
Client:
Context: default
Debug Mode: false
Plugins:
app: Docker App (Docker Inc., v0.9.1-beta3)
buildx: Build with BuildKit (Docker Inc., v0.5.1-docker)
scan: Docker Scan (Docker Inc., v0.8.0)
Server:
Containers: 3
Running: 1
Paused: 0
Stopped: 2
Images: 4
Server Version: 20.10.7
Storage Driver: overlay2
Backing Filesystem: extfs
Supports d_type: true
Native Overlay Diff: true
userxattr: false
Logging Driver: json-file
Cgroup Driver: cgroupfs
Cgroup Version: 1
Plugins:
Volume: local
Network: bridge host ipvlan macvlan null overlay
Log: awslogs fluentd gcplogs gelf journald json-file local logentries splunk syslog
Swarm: inactive
Runtimes: io.containerd.runc.v2 io.containerd.runtime.v1.linux runc
Default Runtime: runc
Init Binary: docker-init
containerd version: 7eba5930496d9bbe375fdf71603e610ad737d2b2
runc version: v1.0.0-0-g84113ee
init version: de40ad0
Security Options:
apparmor
seccomp
Profile: default
Kernel Version: 5.4.0-80-generic
Operating System: Ubuntu 18.04.5 LTS
OSType: linux
Architecture: x86_64
CPUs: 4
Total Memory: 31.24GiB
Name: narijd
ID: C7YI:BXHY:BT2N:OPYM:VKKN:GFAP:UT4O:SDF2:TYPY:NMN5:4P2F:CYPM
Docker Root Dir: /var/lib/docker
Debug Mode: false
Registry: https://index.docker.io/v1/
Labels:
Experimental: false
Insecure Registries:
127.0.0.0/8
Registry Mirrors:
https://1nj0zren.mirror.aliyuncs.com/
https://docker.mirrors.ustc.edu.cn/
http://f1361db2.m.daocloud.io/
https://registry.docker-cn.com/
Live Restore Enabled: false
12、 Docker免sudo
sudo groupadd docker
sudo gpasswd -a ${USER} docker
sudo service docker restart
newgrp – docker
14、 列出GPU设备
docker run -it --rm --gpus all ubuntu nvidia-smi –L
添加用户组以及安装docker镜像
1、 docker安装Ubuntu镜像
docker search ubuntu
2、 添加/删除用户以及密码
sudo useradd –d /home/username –m –s /bin/bash username
#设置或者更改密码
sudo passwd username
#删除用户:
sudo userdel –r username
#username为你添加的用户名同时也是镜像名
3、 docker共享主机nvidia驱动
docker run -itd --gpus all --name bs1 -e NVIDIA_DRIVER_CAPABILITIES=compute,utility -e NVIDIA_VISIBLE_DEVICES=all ubuntu
#bs1为我创建的容器名 ubuntu为拉的镜像名
原理参考(***这篇文章***)
4、 将用户添加到docker组
sudo adduser bs1 docker
#正在添加用户"bs1"到"docker"组...
#正在将用户“bs1”加入到“docker”组中
#完成。
#启动容器,开机或者重启之后
docker start bs1
#关闭容器
docker stop bs1