opencv调用caffe/tensorflow模型

最新推荐文章于 2023-11-05 11:06:32 发布

通信程序猿

最新推荐文章于 2023-11-05 11:06:32 发布

阅读量4.6k

点赞数 1

分类专栏： OpenCV Python 机器学习

本文链接：https://blog.csdn.net/u011285477/article/details/89030957

版权

Python 同时被 3 个专栏收录

30 篇文章 0 订阅

订阅专栏

机器学习

22 篇文章 2 订阅

订阅专栏

OpenCV

15 篇文章 1 订阅

订阅专栏

我的环境配置是python 3.6.2 + opencv 3.4.5。

下面是crowd counting计算人群密度图的代码。

# coding:utf-8

from __future__ import print_function

import numpy as np
import pylab
import matplotlib.pyplot as plt
import cv2
from cv2 import dnn
import time


cm_path = 'C:\\Users\\admin\\Desktop\\'



if __name__ == "__main__":

    fn = r'C:\Users\admin\Desktop\ShanghaiTech_Crowd_Counting_Dataset\part_B_final\test_data\images\IMG_191.jpg'

    im_ori = cv2.imread(fn)
    plt.figure(1)
    plt.imshow(im_ori)
    plt.axis('off')
    pylab.show()


    blob = dnn.blobFromImage(im_ori, 1, (1024, 768), (0, 0, 0), True)

    print("Input:", blob.shape, blob.dtype)

    net = dnn.readNetFromCaffe(cm_path + 'B_testdemo.prototxt', cm_path + 'B2_iter_93000.caffemodel')

    t = time.time()
    net.setInput(blob)
    density = net.forward()
    elapsed = time.time() - t

    print('inference image: %.4f seconds.' % elapsed)

    density = density/1000.0

    print("Output:", density.shape, density.dtype)

    person_num = np.sum(density[:])
    print("number: ",person_num)

    plt.figure(1)
    plt.imshow(density[0, 0, :, :])
    plt.axis('off')
    pylab.show()

dnn.blobFromImage(input_img,scalefactor, (width, height), mean, swapRB)

mean和scalefactor是用来对图像做标准化的，先减均值，再乘以一个系数。images -= mean；images *= scalefactor

swapRB是选择是否交换R与B颜色通道，opencv默认读取的图片是BGR格式，而训练模型时，往往是转换成RGB输入，所以这里通常设置为True，调换R与B通道。

dnn.readNetFromCaffe(modelTxt, caffe_modelBin)

输入的两个参数分别是网络结构.prototxt文件和模型文件。

程序运行结果如下，可以看到网络模型的输入格式是N*C*H*W（Numbers*Channels*Height*Width）

C:\Users\admin\Desktop\Crowd-Counting-master>python opencv_caffe_crowd_density_map.py
Input: (1, 3, 768, 1024) float32
inference image: 0.2005 seconds.
Output: (1, 1, 192, 256) float32
number:  285.41965

tensorflow 运行.pb模型，前向运行100次耗时0.691 s

init = tf.global_variables_initializer()
sess.run(init)

input_x = sess.graph.get_tensor_by_name("input_x:0")
out_softmax = sess.graph.get_tensor_by_name("predictions/Reshape_1:0")

img = cv2.imread(jpg_path)
img_ori = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
test_img = cv2.resize(img_ori, (width, height))
test_img = np.asarray(test_img, np.float32)
test_img = test_img[np.newaxis, :] / 255.

time_start = time.time()
img_out_softmax = sess.run(out_softmax, feed_dict={input_x:test_img})
time_end = time.time()
print('run time: ', time_end - time_start, 's')
print("pred:",img_out_softmax)

run time:  0.04388284683227539 s
pred: [[9.8955745e-01 1.0416225e-02 2.6317310e-05]]

// example
String weights = "nn.pb";
dnn::Net net = cv::dnn::readNetFromTensorflow(weights);
Mat img = imread(files[i], 1);
Mat inputBlob = dnn::blobFromImage(img, 0.00390625f, Size(256, 256), Scalar(), false,false); 
net.setInput(inputBlob, "data");//set the network input, "data" is the name of the input layer     
Mat pred = net.forward("fc2/prob");

opencv调用tensorflow的.pb模型也是类似的。以图像分类为例，如下所示：

img = cv2.imread(jpg_path)

net = dnn.readNetFromTensorflow(pb_file_path)

net.setInput(cv2.dnn.blobFromImage(img, 1/255.0, (width, height), (0, 0, 0), swapRB=True, crop=False))
time_start = time.time()
pred = net.forward()
time_end = time.time()
print('run time: ', time_end - time_start, 's')
print("pred:",pred)

这里做的是三分类，运行100次耗时0.279 s，计算耗时减少了一半左右。

run time:  0.023903846740722656 s
pred: [[9.8955745e-01 1.0416246e-02 2.6317308e-05]]

【参考资料】

[1] 使用OpenCV_python中的DNN调用CaffeModel识别图像

[2] https://github.com/linzhirui1992/Crowd-Counting

[3] https://github.com/opencv/opencv/wiki/TensorFlow-Object-Detection-API

[4] opencv 调用 TensorFlow 训练好的模型

通信程序猿

关注

1
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
opencv调用caffe/tensorflow模型

我的环境配置是python 3.6.2 + opencv 3.4.5。下面是crowd counting计算人群密度图的代码。# coding:utf-8from __future__ import print_functionimport numpy as npimport pylabimport matplotlib.pyplot as pltimport cv2fr...
复制链接

扫一扫

专栏目录