YOLOv3已出,不管哪个版本,其封装应该大同小异。之前在windows环境下,以https://github.com/AlexeyAB/darknet版本进行封装,linux下封装也是差异不大,而且linux环境下编译工程更简单。以YOLOv2为例,主要找出test_detector函数
void test_detector(char *datacfg, char *cfgfile, char *weightfile, char *filename, float thresh, float hier_thresh, char *outfile, int fullscreen)
{
list *options = read_data_cfg(datacfg);
char *name_list = option_find_str(options, "names", "data/names.list");
char **names = get_labels(name_list);
image **alphabet = load_alphabet();
network *net = load_network(cfgfile, weightfile, 0);
set_batch_network(net, 1);
srand(2222222);
double time;
char buff[256];
char *input = buff;
float nms=.45;
while(1){
if(filename){
strncpy(input, filename, 256);
} else {
printf("Enter Image Path: ");
fflush(stdout);
input = fgets(input, 256, stdin);
if(!input) return;
strtok(input, "\n");
}
image im = load_image_color(input,0,0);
image sized = letterbox_image(im, net->w, net->h);
//image sized = resize_image(im, net->w, net->h);
//image sized2 = resize_max(im, net->w);
//image sized = crop_image(sized2, -((net->w - sized2.w)/2), -((net->h - sized2.h)/2), net->w, net->h);
//resize_network(net, sized.w, sized.h);
layer l = net->layers[net->n-1];
float *X = sized.data;
time=what_time_is_it_now();
network_predict(net, X);
printf("%s: Predicted in %f seconds.\n", input, what_time_is_it_now()-time);
int nboxes = 0;
detection *dets = get_network_boxes(net, im.w, im.h, thresh, hier_thresh, 0, 1, &nboxes);
//printf("%d\n", nboxes);
//if (nms) do_nms_obj(boxes, probs, l.w*l.h*l.n, l.classes, nms);
if (nms) do_nms_sort(dets, nboxes, l.classes, nms);
draw_detections(im, dets, nboxes, thresh, names, alphabet, l.classes);
free_detections(dets, nboxes);
if(outfile){
save_image(im, outfile);
}
else{
save_image(im, "predictions");
#ifdef OPENCV
cvNamedWindow("predictions", CV_WINDOW_NORMAL);
if(fullscreen){
cvSetWindowProperty("predictions", CV_WND_PROP_FULLSCREEN, CV_WINDOW_FULLSCREEN);
}
show_image(im, "predictions");
cvWaitKey(0);
cvDestroyAllWindows();
#endif
}
free_image(im);
free_image(sized);
if (filename) break;
}
}
其中
list *options = read_data_cfg(datacfg);
char *name_list = option_find_str(options, "names", "data/names.list");
char **names = get_labels(name_list);
image **alphabet = load_alphabet();
network *net = load_network(cfgfile, weightfile, 0);
set_batch_network(net, 1);
放入初始化函数,其余部分就是当做接口函数,建一个cpp文件,写一个main函数,调用接口函数,需要注意的几点:
1、注意输入的图像格式,BGR需转化为RGB顺序存储,即按照R->G->B通道顺序存储。当然这块可以自己写,也可以直接使用源代码里面的模块,只是输入参数不同。
2、几个变量需要定义为全局变量,因为初始化和接口函数都使用。
3、因为以c++方式编译,所以封装所用头文件写成,extern “C”{ 头文件},https://github.com/pjreddie/darknet/issues/375
4、源码都是纯c,所以可把string格式的输入变量转为char格式。消除警告。
5、注意初始化函数和接口函数均要以c文件格式来写,然后在linux下对其进行封装成动态库,在用main函数调用,如果初始化和接口函数和main在同一个cpp文件中,则只能跑通cpu模式,gpu模式无法跑通。