batch为1时,如下:
void* buffers[2];
buffers[inputIndex] = inputbuffer;
buffers[outputIndex] = outputBuffer;
但是batch_size大于1时,inputbuffer和outputBuffer如何设置,以及如何构建引擎?
批量推理准备数据:
void parseYolov5(cv::Mat& img,ICudaEngine* engine,IExecutionContext* context,std::vector<Yolo::Detection>& batch_res)
{
// 准备数据 ---------------------------
static float data[BATCH_SIZE * 3 * INPUT_H * INPUT_W]; //输入
static float prob[BATCH_SIZE * OUTPUT_SIZE]; //输出
assert(engine->getNbBindings() == 2);
void* buffers[2];
// In order to bind the buffers, we need to know the names of the input and output tensors.
// No