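I had been using CUDA 8.0, but work recently required tensorflow-gpu 1.11.0, which needs CUDA 9.0. I ran into quite a few problems during the upgrade, so here are the steps that worked:
#cuDNN installation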
wget https://developer.nvidia.com/compute/machine-learning/cudnn/secure/v7.4.1.5/prod/9.0_20181108/cudnn-9.0-linux-x64-v7.4.1.5.tgz
tar zxvf cudnn-9.0-linux-x64-v7.4.1.5.tgz
sudo cp cuda/include/cudnn.h /usr/local/cuda/include/
sudo cp cuda/lib64/libcudnn* /usr/local/cuda/lib64/
sudo chmod a+r /usr/local/cuda/lib64/libcudnn*
echo "/usr/local/cuda/lib64" | sudo tee -a /etc/ld.so.conf
sudo ldconfig
Then run ldconfig -p | grep libcudnn; if the output contains a path like /usr/local/cuda/lib64/libcudnn.so.7, cuDNN is installed.
#NCCL installation
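As an extra check, the header copied in the steps above records the cuDNN version. The sketch below defines a hypothetical helper, cudnn_version (not part of cuDNN or CUDA), that reads CUDNN_MAJOR, CUDNN_MINOR and CUDNN_PATCHLEVEL from cudnn.h. It assumes the header is at /usr/local/cuda/include/cudnn.h unless another path is passed.

```shell
# Hypothetical helper: print the cuDNN version recorded in a cudnn.h header
# as MAJOR.MINOR.PATCHLEVEL.
# Assumption: the header itself defines CUDNN_MAJOR, CUDNN_MINOR and
# CUDNN_PATCHLEVEL, as cuDNN 7.x does (cuDNN 8 and later keep these macros
# in cudnn_version.h instead).
cudnn_version() {
    header="${1:-/usr/local/cuda/include/cudnn.h}"
    awk '
        $1 == "#define" && $2 == "CUDNN_MAJOR"      { major = $3 }
        $1 == "#define" && $2 == "CUDNN_MINOR"      { minor = $3 }
        $1 == "#define" && $2 == "CUDNN_PATCHLEVEL" { patch = $3 }
        END {
            # Fail if any of the three macros was not found.
            if (major == "" || minor == "" || patch == "") exit 1
            print major "." minor "." patch
        }
    ' "$header"
}
```

With the archive used here, calling cudnn_version with no arguments should print 7.4.1.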
wget https://developer.nvidia.com/compute/machine-learning/nccl/secure/v2.1/prod/nccl-repo-ubuntu1604-2.1.15-ga-cuda9.0_1-1_amd64
sudo dpkg -i nccl-repo-ubuntu1604-2.1.15-ga-cuda9.0_1-1_amd64
sudo apt-key add /var/nccl*/*.pub
sudo apt-get update
sudo apt-get install -y libnccl2 libnccl-dev
dpkg -s libnccl2
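Once both libraries are installed, it can help to confirm in one pass that the dynamic linker sees everything. The sketch below defines a hypothetical helper, missing_libs, that reads an ldconfig -p style listing on standard input and prints each library name given as an argument that is absent from it. Which sonames tensorflow-gpu 1.11.0 actually loads is an assumption here; libcudart.so.9.0, libcudnn.so.7 and libnccl.so.2 are the names that CUDA 9.0, cuDNN 7 and NCCL 2 normally install.

```shell
# Hypothetical helper: print every library name passed as an argument that
# does not appear in the `ldconfig -p` style listing read from standard input.
missing_libs() {
    listing="$(cat)"
    for lib in "$@"; do
        # ldconfig -p prints "<soname> (<arch>) => <path>", so the soname is
        # followed by a space; match it as a fixed string, not a regex.
        if ! printf '%s\n' "$listing" | grep -qF -- "$lib "; then
            printf '%s\n' "$lib"
        fi
    done
}
```

A possible invocation is ldconfig -p | missing_libs libcudart.so.9.0 libcudnn.so.7 libnccl.so.2, which prints nothing when all three are registered.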
cuDNN download link: http://zsms.ml:13390/cudnn/
CUDA download link: https://developer.nvidia.com/compute/cuda/9.0/Prod/local_installers/cuda_9.0.176_384.81_linux-run