AI环境搭建（nvidia + cuda + cudnn + tensorflow+其他）

最新推荐文章于 2024-10-01 20:23:45 发布

南有木兮木不知

最新推荐文章于 2024-10-01 20:23:45 发布

阅读量764

点赞数

分类专栏： AI linux

本文链接：https://blog.csdn.net/mhsszm/article/details/82148606

版权

linux 同时被 2 个专栏收录

54 篇文章

订阅专栏

36 篇文章

订阅专栏

AI环境搭建（nvidia + cuda + cudnn + tensorflow+其他）

1 安装显卡驱动

sudo add-apt-repository ppa:graphics-drivers/ppa  
sudo apt-get update  
sudo apt-get install nvidia-375  #此处要根据上面查询到的版本适当更改    （当前使用的是384版本）
sudo apt-get install mesa-common-dev  
sudo apt-get install freeglut3-dev

安装完成后重启电脑，后输入：

nvidia-smi

或者输入：

nvidia-settings

显示显卡列表，表示安装成功

2 安装cuda

官网下载：cuda_8.0.61_375.26_linux.run

chmod +x cuda_8.0.61_375.26_linux.run
sh cuda_8.0.61_375.26_linux.run
accept
n
后面一直yes

当出现如下提示：

Please make sure that
– PATH includes /usr/local/cuda-8.0/bin
– LD_LIBRARY_PATH includes /usr/local/cuda-8.0/lib64, or, add /usr/local/cuda-8.0/lib64 to /etc/ld.so.conf and run ldconfig as root

To uninstall the CUDA Toolkit, run the uninstall script in /usr/local/cuda-8.0/bin

Please see CUDA_Installation_Guide_Linux.pdf in /usr/local/cuda-8.0/doc/pdf for detailed information on setting up CUDA.

***WARNING: Incomplete installation! This installation did not install the CUDA Driver. A driver of version at least 361.00 is required for CUDA 8.0 functionality to work.
To install the driver using this installer, run the following command, replacing with the name of this run file:
sudo .run -silent -driver

Logfile is /tmp/cuda_install_6583.log

表示安装完毕，可以进行cuda测试

nvcc -V

或者跑cuda测试程序：
进入到Samples安装目录，然后在该目录下终端输入make，等待十来分钟。
可以在Samples里面找到bin/x86_64/linux/release/目录，并切换到该目录
运行deviceQuery程序

sudo ./deviceQuery

查看输出结果，重点关注最后一行，Pass表示通过测试

3 安装cudnn

去官网下载与cuda相应的安装包，解压

tar zxvf cudnn-8.0-linux-x64-v6.0.tgz

将文件拷贝到cuda的相应位置：

cd cuda
sudo cp include/*  /usr/local/cuda-8.0/include/
sudo cp lib64/*  /usr/local/cuda-8.0/lib64/

设置环境变量：

vim ~/.bashrc
export PATH=/usr/local/cuda-8.0/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda-8.0/lib64:$LD_LIBRARY_PATH

source ~/.bashrc

4 安装tensorflow

sudo pip install tensorflow-gpu==1.4.1

测试：

$ python
Python 2.7.12 (default, Dec  4 2017, 14:50:18) 
[GCC 5.4.0 20160609] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import tensorflow as tf 
>>> tf.__version__
'1.4.1'
>>>

进一步测试：

# coding: utf-8
import tensorflow as tf

with tf.device('/cpu:0'):
    a = tf.constant([1.0,2.0,3.0],shape=[3],name='a')
    b = tf.constant([1.0,2.0,3.0],shape=[3],name='b')
with tf.device('/gpu:0'):
    c = a+b

#注意：allow_soft_placement=True表明：计算设备可自行选择，如果没有这个参数，会报错。
#因为不是所有的操作都可以被放在GPU上，如果强行将无法放在GPU上的操作指定到GPU上，将会报错。
sess = tf.Session(config=tf.ConfigProto(allow_soft_placement=True,log_device_placement=True))
#sess = tf.Session(config=tf.ConfigProto(log_device_placement=True))
sess.run(tf.global_variables_initializer())
print(sess.run(c))

5 安装其他一些AI库，scipy，sklearn等

sudo apt install python-opencv -y
sudo pip install scipy (or sudo apt-get install python-scipy)
sudo pip install sklearn
sudo pip install sysv_ipc
sudo pip install matplotlib 
sudo apt install libffi-dev
sudo pip install ipcqueue
sudo pip install tornado==4.5.2
sudo pip install Ipy
sudo apt-get install inetutils-ping