在ubuntu机器安装keras cuda

最新推荐文章于 2024-07-28 13:47:26 发布

wuhuaiyu

最新推荐文章于 2024-07-28 13:47:26 发布

阅读量512

点赞数

分类专栏：算法、技术

本文链接：https://blog.csdn.net/wuhuaiyu/article/details/80284037

版权

算法、技术专栏收录该内容

29 篇文章 0 订阅

订阅专栏

在ubuntu机器安装keras cuda

查看网卡

命令lspci 看到有3D controller: NVIDIA Corporation Device

00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] (rev 02)
00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II]
00:01.1 IDE interface: Intel Corporation 82371SB PIIX3 IDE [Natoma/Triton II]
00:01.2 USB controller: Intel Corporation 82371SB PIIX3 USB [Natoma/Triton II] (rev 01)
00:01.3 Bridge: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 03)
00:02.0 VGA compatible controller: Cirrus Logic GD 5446
00:03.0 Ethernet controller: Red Hat, Inc Virtio network device
00:04.0 Communication controller: Red Hat, Inc Virtio console
00:05.0 SCSI storage controller: Red Hat, Inc Virtio block device
00:06.0 SCSI storage controller: Red Hat, Inc Virtio block device
00:07.0 3D controller: NVIDIA Corporation Device 1b38 (rev a1)
00:08.0 Unclassified device [00ff]: Red Hat, Inc Virtio memory balloon

命令nvidia-smi，能看到网卡型号T40

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 384.81                 Driver Version: 384.81                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla P40           On   | 00000000:00:07.0 Off |                    0 |
| N/A   32C    P8     9W / 250W |      0MiB / 22912MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

查看cuda版本

命令：cat /usr/local/cuda/version.txt 
现在阡陌机器默认cuda为8.0，升级方式：
http://wiki.baidu.com/pages/viewpage.action?pageId=328257217，按照第一种方式升级到9.0

安装anacoda

省略此过程

安装tensorflow

最新的tensorflow1.6，注意下载gpu版本，参考
https://www.tensorflow.org/install/install_linux
因为tensorflow1.5以上需要cuda9.0

安装keras

keras只支持tensorflow1.5以上版本

~/.theanorc

[cuda]
root = /usr/local/cuda

[lib]
cnmem=100

[global]
device=cuda
floatX=float32

测试

from theano import function, config, shared, sandbox
import theano.tensor as T
import numpy
import time
vlen = 10 * 30 * 768 # 10 x #cores x # threads per core
iters = 1000
rng = numpy.random.RandomState(22)
x = shared(numpy.asarray(rng.rand(vlen), config.floatX))
f = function([], T.exp(x))
print(f.maker.fgraph.toposort())
t0 = time.time()
for i in xrange(iters):
 r = f()
t1 = time.time()
print("Looping %d times took %f seconds" % (iters, t1 - t0))
print("Result is %s" % (r,))
if numpy.any([isinstance(x.op, T.Elemwise) for x in f.maker.fgraph.toposort()]):
 print('Used the cpu')
else:
print('Used the gpu')

import tensorflow as tf  
sess = tf.Session(config=tf.ConfigProto(log_device_placement=True))

输出：

2018-05-11 17:00:43.348559: I tensorflow/core/platform/cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2018-05-11 17:00:43.733815: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:898] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2018-05-11 17:00:43.735674: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1212] Found device 0 with properties:
name: Tesla P40 major: 6 minor: 1 memoryClockRate(GHz): 1.531
pciBusID: 0000:00:07.0
totalMemory: 22.38GiB freeMemory: 22.21GiB
2018-05-11 17:00:43.735771: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1312] Adding visible gpu devices: 0
2018-05-11 17:00:44.414397: I tensorflow/core/common_runtime/gpu/gpu_device.cc:993] Creating TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 21553 MB memory) -> physical GPU (device: 0, name: Tesla P40, pci bus id: 0000:00:07.0, compute capability: 6.1)
Device mapping:
/job:localhost/replica:0/task:0/device:GPU:0 -> device: 0, name: Tesla P40, pci bus id: 0000:00:07.0, compute capability: 6.1
2018-05-11 17:00:45.277718: I tensorflow/core/common_runtime/direct_session.cc:297] Device mapping:
/job:localhost/replica:0/task:0/device:GPU:0 -> device: 0, name: Tesla P40, pci bus id: 0000:00:07.0, compute capability: 6.1

证明tensorflow可以看到GPU