[TensorFlow Analysis] - [1]

Latest recommended article published on 2022-04-05 16:45:20

七爷OK

Categories: deep learning, computing resource management. Tags: tensorflow

Article link: https://blog.csdn.net/weixin_32820767/article/details/82259749

1 TensorFlow GPU invocation architecture
As shown in the figure:

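From the figure above we can see that TensorFlow provides two ways to call NVIDIA GPUs, and that access to NVIDIA GPUs relies mainly on the CUDA parallel computing framework.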

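2 StreamExecutor
StreamExecutor is a subproject: a library open-sourced by Google for parallel numerical computation. It is a unified API, built on top of the CUDA API and the OpenCL API, for managing various GPU devices. This unified GPU wrapper is suited to libraries that need to communicate with GPU devices, although within TensorFlow only CUDA support is provided. The code lives in the tensorflow\stream_executor folder. Part of stream.h: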

......
// Represents a stream of dependent computations on a GPU device.
//
// The operations within a stream execute linearly and asynchronously until
// BlockHostUntilDone() is invoked, which synchronously joins host code with
// the execution of the stream.
//
// If any given operation fails when entraining work for the stream, ok() will
// indicate that an error has occurred. After initialization, once a stream is
// !ok(), it will never be ok().
//
// Thread-safe post-initialization.
class Stream {
 public:
  // Instantiate a stream tied to parent as a platform executor. Work
  // entrained onto this stream will be launched/managed on that
  // StreamExecutor's platform.
  explicit Stream(StreamExecutor *parent);

  // Test only. Use an externally-populated value (like a mock) for the
  // platform-specific stream implementation.
  Stream(StreamExecutor *parent, internal::StreamInterface *implementation);

  // Deallocates any stream resources that the parent StreamExecutor has
  // bestowed upon this object.
  ~Stream();

  // Returns whether any errors have occurred while entraining work for this
  // stream.
  bool ok() const { return !InErrorState(); }

  ......
};
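The comments in the header describe two behaviors that are easy to miss: work is only queued ("entrained") on the stream and the host joins it later through BlockHostUntilDone(), and once the stream is !ok() it never becomes ok() again. The sketch below is a minimal, single-threaded toy model of those two rules, not the real StreamExecutor code. ToyStream, ThenDo and RunToyStreamDemo are hypothetical names invented for illustration, and for simplicity the queued work runs at the join point rather than asynchronously on a device.

```cpp
#include <cassert>
#include <functional>
#include <queue>
#include <utility>
#include <vector>

// Hypothetical toy class for illustration only; this is NOT
// stream_executor::Stream and does not touch any GPU.
class ToyStream {
 public:
  // Entrains (queues) an operation and returns immediately.
  // Assumption of this sketch: an empty std::function stands in for
  // "entraining failed" and puts the stream into its sticky error state.
  ToyStream& ThenDo(std::function<void()> op) {
    if (!ok_) return *this;  // once !ok(), later work is ignored
    if (!op) {
      ok_ = false;           // the error state is never cleared
      return *this;
    }
    pending_.push(std::move(op));
    return *this;
  }

  // Joins the host with the stream: runs everything queued so far in
  // FIFO order, then reports whether the stream is still ok.
  bool BlockHostUntilDone() {
    while (!pending_.empty()) {
      std::function<void()> op = std::move(pending_.front());
      pending_.pop();
      op();
    }
    return ok_;
  }

  bool ok() const { return ok_; }

 private:
  std::queue<std::function<void()>> pending_;
  bool ok_ = true;
};

// Queues two operations, joins, then triggers an entraining failure and
// shows that work queued afterwards never runs. Returns the run order.
std::vector<int> RunToyStreamDemo() {
  std::vector<int> log;
  ToyStream stream;
  stream.ThenDo([&log] { log.push_back(1); })
      .ThenDo([&log] { log.push_back(2); });
  stream.BlockHostUntilDone();                  // runs 1, then 2
  stream.ThenDo(std::function<void()>());       // entraining fails
  stream.ThenDo([&log] { log.push_back(3); });  // dropped: stream is !ok()
  stream.BlockHostUntilDone();
  return log;
}
```

Calling RunToyStreamDemo() from a main function returns the operations that actually ran, in order. The chained ThenDo calls only mirror the general "enqueue now, join later" style that the header comment describes; the real class does its work through the platform-specific implementation held by the parent StreamExecutor.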

The main functions of StreamExecutor: