2017CS231n笔记8.深度学习软件

最新推荐文章于 2022-05-27 11:18:11 发布

oldmao_2000

最新推荐文章于 2022-05-27 11:18:11 发布

阅读量322

点赞数

分类专栏：李飞飞CS231n学习笔记（太监）

本文链接：https://blog.csdn.net/oldmao_2001/article/details/93463509

版权

李飞飞CS231n学习笔记（太监）专栏收录该内容

12 篇文章 6 订阅

订阅专栏

文章目录

概述
CPU vs GPU（硬件）
Deep Learning Frameworks（软件）
- TensorFlow
- PyTorch
静态计算图（TensorFlow）与动态计算图（PyTorch）
其他框架
- Caffe
- Caffe2
总结

概述

在线Latex公式
本节包含两个内容：

CPU vs GPU
Deep Learning Frameworks
Caffe / Caffe2
Theano / TensorFlow
Torch / PyTorch
本节从软件、硬件两个方面来看待学习深度学习的工具，虽然目前还没真正开始实践，只是在学ng的深度学习是用的TF，其他的框架还不熟悉，希望学完这个课程会有一个大概的了解。
重点问题：比较静态图框架和动态图框架的优劣。

CPU vs GPU（硬件）

基本概念介绍（CS专业或者硬件玩家基本都了解，这里就不多写了）
在这里插入图片描述
读取数据很容易成为瓶颈

Deep Learning Frameworks（软件）

这块发展相对迅速，每年都会有不同的框架出现，这个视频是17年的：
在这里插入图片描述
主流的还是PyTorch和TensorFlow。框架的好处在于：
(1) Easily build big computational graphs
(2) Easily compute gradients in computational graphs
(3) Run it all efficiently on GPU (wrap cuDNN, cuBLAS, etc)
如果不用框架，直接用numpy实现dl：

Can’t run on GPU
Have to compute our own gradients

TensorFlow

Running example: Train a two-layer ReLU network on random data with L2 loss
大概步骤分两步
在这里插入图片描述
上面红框分三段，这三段都只是定义计算图，并没有进行实际的计算。
第一段：Create placeholders for input x, weights w1 and w2, and targets y
第二段：Forward pass: compute prediction for y and loss (L2 distance between y and y_pred)
第三段：Tell TensorFlow to compute loss of gradient with respect to w1 and w2.
蓝色框values这句是用数据填充placeholders：
Create numpy arrays that will fill in the placeholders above.
后面两句话：
Run the graph: feed in the numpy arrays for x, y, w1, and w2; get numpy arrays for loss, grad_w1, and grad_w2
如果要训练：Run the graph over and over, use gradient to update weights
老师给的例子里面权重等参数很少，如果很多，在运行GPU版本的时候会有CPU和GPU之间拷贝数据的性能瓶颈，所以给出了改进建议，就是把权重放到计算图里面，包括学习率的更新。
Change w1 and w2 from placeholder (fed on each call) to Variable (persists in the graph between calls)
在这里插入图片描述
Add assign operations to update w1 and w2 as part of the graph!

最后：

这里还是有一个问题，就是权重计算后从gpu拷贝到cpu又有性能瓶颈，这里老师用了一个技巧，就是把计算后权重看做个dummy node，让TensorFlow来维护。
在这里插入图片描述
最后的代码：

问题：为什么X/Y没有放入计算图？
答：因为使用mini batch的话，每次丢进网络中计算的数据是变化的，所以没有必要放入。
还可以调用更加高层次的函数来进一步简化代码，例如关于初始化：
Use Xavier initializer
在这里插入图片描述
还可以使用更加高层次的接口，例如Keras、TFLearn。
课上还提到了：
Tensorboard: Add logging to code to record loss, stats, etc Run server and get pretty graphs!
Distributed Version: Split one graph over multiple machines!

PyTorch

三个抽象层与TensorFlow的对比：
在这里插入图片描述
几个要点：
To run on GPU, just cast tensors to a cuda datatype!
PyTorch 的高级接口只有一个叫nn
可以利用modules自定义网络构架

Modules can contain modules，上图中不需要定义反向传播autograd可以自动搞定。
A DataLoader wraps a Dataset and provides minibatching, shuffling, multithreading, for you When you need to load custom data, just write your own Dataset class.这里的DataLoader 在加载数据的时候划分batch会自动帮你搞定随机乱序，很方便。
使用pretrain的模型超方便，只需要一行代码，就会帮你后台下载相关数据和权重
在这里插入图片描述
可视化工具看上去比TensorFlow的好看，但是17年的版本还不支持计算图：

静态计算图（TensorFlow）与动态计算图（PyTorch）

在这里插入图片描述

优化

With static graphs, framework can optimize the graph for you before it runs!
静态图可以预先做优化！

在这里插入图片描述

执行序列化

在这里插入图片描述
序列化好的静态计算图可以存储下来，也许可以拿给不同语言上运行，也就是模型可以更好的复用。
动态计算图在运行的时候才确定计算图，所以语言不能切换（因为图的构架是这个语言写的）

简洁性

动态图简洁性好些，静态图要预先定义好计算图，所以所有的操作符，例如if，都要写到计算图中（看红箭头），而动态图直接用的Python代码。
在这里插入图片描述
这里补一个深度之眼学员的优秀回答：

其他框架

Caffe

● Core written in C++
● Has Python and MATLAB bindings
● Good for training or finetuning
● feedforward classification models
● Often no need to write code!
● Not used as much in research anymore, still popular for deploying models
总而言之，这个玩意工业应用较多，研究用的少，因为它不依赖Python。
No need to write code!

Convert data (run a script)
Define net (edit prototxt)
Define solver (edit prototxt)
Train (with pretrained weights) (run a script)

Caffe2

● Very new - released a week ago =)
● Static graphs, somewhat similar to TensorFlow
● Core written in C++
● Nice Python interface
● Can train model in Python, then serialize and deploy without Python
● Works on iOS / Android, etc

总结

TensorFlow is a safe bet for most projects. Not perfect but has huge community, wide usage. Maybe pair with high-level wrapper (Keras, Sonnet, etc)
I think PyTorch is best for research. However still new, there can be rough patches.
Use TensorFlow for one graph over many machines
Consider Caffe, Caffe2, or TensorFlow for production deployment
Consider TensorFlow or Caffe2 for mobile

oldmao_2000

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
打赏
0
评论
2017CS231n笔记8.深度学习软件

本节包含两个内容：- CPU vs GPU- Deep Learning FrameworksCaffe / Caffe2Theano / TensorFlowTorch / PyTorch本节从软件、硬件两个方面来看待学习深度学习的工具，虽然目前还没真正开始实践，只是在学ng的深度学习是用的TF，其他的框架还不熟悉，希望学完这个课程会有一个大概的了解。
复制链接

扫一扫