python多线程中使用TensorRT（解决报错：invalid device context - no currently active context? ）

本初-ben

已于 2024-03-12 10:06:44 修改

阅读量608

点赞数 10

分类专栏：深度学习实践文章标签： python TensorRT 深度学习

于 2024-02-21 13:35:56 首次发布

本文链接：https://blog.csdn.net/qq_43673118/article/details/136207112

版权

深度学习实践专栏收录该内容

10 篇文章 1 订阅

订阅专栏

文章讲述了在Python多线程环境中使用TensorRT时遇到的逻辑错误，主要介绍了如何在工作线程中手动创建CUDA上下文以及如何在TensorRT类的初始化和推理前后正确管理context，以避免explicit_context_dependentfailed错误。

摘要由CSDN通过智能技术生成

python多线程中使用TensorRT

- 问题描述
- 解决办法

问题描述

在不涉及多线程时，使用TensorRT模型推理，如下在开头import即可自动创建上下文：

import pycuda.driver as cuda
import pycuda.autoinit

而当TensorRT在线程中运行时（比如写在软件中、或者通过多个线程使模型并行推理），代码会报错：
pycuda._driver.LogicError: explicit_context_dependent failed: invalid device context - no currently active context?
报错显示在工作线程中没有建立上下文context，原因是import pycuda.autoinit在线程中不起作用。

解决办法

在TensorRT的工作线程中手动创建context上下文即可：
（1）删掉import pycuda.autoinit，添加cuda.init()：
在这里插入图片描述
（2）在调用TensorRT的类的__init__()的首行加入下面语句：

# 1. 手动创建context上下文
self.cfx = cuda.Devvice(0).make_context()
# 2. 模型预加载[可选]
# 3. 执行self.cfx.pop()，否则会报错
self.cfx.pop()

在这里插入图片描述
（3）在进行推理前加上self.cfx.push()，推理结束后加上self.cfx.pop()：

本初-ben

关注

10
点赞
踩
9

收藏

觉得还不错? 一键收藏
打赏
2
评论
python多线程中使用TensorRT（解决报错：invalid device context - no currently active context? ）

报错显示在工作线程中没有建立上下文context，原因是。（3）在进行推理前加上。
复制链接

扫一扫