Python Multiprocessing with PyCUDA

最新推荐文章于 2024-08-12 23:18:14 发布

AI算法网奇

最新推荐文章于 2024-08-12 23:18:14 发布

阅读量3.2k

点赞数 1

分类专栏： cuda

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/jacke121/article/details/79705484

版权

cuda 专栏收录该内容

121 篇文章 15 订阅

订阅专栏

Python Multiprocessing with PyCUDA

参考：https://stackoverflow.com/questions/5904872/python-multiprocessing-with-pycuda

You need to get all your bananas lined up on the CUDA side of things first, then think about the best way to get this done in Python [shameless rep whoring, I know].

The CUDA multi-GPU model is pretty straightforward pre 4.0 - each GPU has its own context, and each context must be established by a different host thread. So the idea in pseudocode is:

Application starts, process uses the API to determine the number of usable GPUS (beware things like compute mode in Linux)
Application launches a new host thread per GPU, passing a GPU id. Each thread implicitly/explicitly calls equivalent of cuCtxCreate() passing the GPU id it has been assigned
Profit!

In Python, this might look something like this:

import threading
from pycuda import driver

class gpuThread(threading.Thread):
    def __init__(self, gpuid):
        threading.Thread.__init__(self)
        self.ctx  = driver.Device(gpuid).make_context()
        self.device = self.ctx.get_device()

    def run(self):
        print "%s has device %s, api version %s"  \
             % (self.getName(), self.device.name(), self.ctx.get_api_version())
        # Profit!

    def join(self):
        self.ctx.detach()
        threading.Thread.join(self)

driver.init()
ngpus = driver.Device.count()
for i in range(ngpus):
    t = gpuThread(i)
    t.start()
    t.join()

This assumes it is safe to just establish a context without any checking of the device beforehand. Ideally you would check the compute mode to make sure it is safe to try, then use an exception handler in case a device is busy. But hopefully this gives the basic idea.

关注

1
点赞
踩
1

收藏

觉得还不错? 一键收藏
打赏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

AI算法网奇

CSDN认证博客专家 CSDN认证企业博客

码龄15年

Python领域优质创作者

5032: 原创

720: 周排名

4: 总排名

2842万+: 访问

: 等级

24万+: 积分

8万+: 粉丝

8748: 获赞

3296: 评论

2万+: 收藏

私信

关注

热门文章

分类专栏

最新评论

好用的关键点标注工具
weixin_46808613: 所以请问怎么标注关键点呢
去水印算法学习笔记
CSDN-Ada助手: 哇, 你的文章质量真不错，值得学习！不过这么高质量的文章, 还值得进一步提升, 以下的改进点你可以参考下: (1)提升标题与正文的相关性；(2)增加除了各种控件外，文章正文的字数。
bn层学习笔记卷积层和BN层融合
哈曼卡顿并不卡: 合并后的公式中μ和σ时利用什么计算出来的，用卷积前的特征图里的参数吗？
yolov3训练loss为0
欣之助23: 你好，我跑的时候中途loss下降到0，最开始不是0，也是因为标签为空吗？
python 获取类名文件名
CSDN-Ada助手: 哇, 你的文章质量真不错，值得学习！不过这么高质量的文章, 还值得进一步提升, 以下的改进点你可以参考下: (1)增加除了各种控件外，文章正文的字数；(2)使用更多的站内链接；(3)提升标题与正文的相关性。

最新文章

2024

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

AI算法网奇 你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。