情绪识别的相关问题（一）

最新推荐文章于 2024-01-07 02:09:20 发布

雪影晴(◕ˇ∀ˇ◕)林丹青

最新推荐文章于 2024-01-07 02:09:20 发布

阅读量398

点赞数

文章标签： tensorflow object detection

本文链接：https://blog.csdn.net/weixin_38102882/article/details/103287310

版权

目标检测

从视频中提取人和动作

参考：利用TensorFlow Object Detection API实现图片和视频物体检测
文章链接地址：https://blog.csdn.net/Hampton_Chen/article/details/93538912
获取坐标位置将目标检测框里的内容截图保存下来
参考：利用object detection API训练模型的输出坐标
文章链接地址：https://www.cnblogs.com/White-xzx/p/9508535.html

以下代码实现对视频的目标检测并将目标检测框里的内容截图保存

import numpy as np
import tensorflow as tf
import cv2
from utils import label_map_util
from utils import visualization_utils as vis_util


cap = cv2.VideoCapture(r'E:\emotion_recognition\videos\videos_sum\201811_V1.mp4')
ret, image_np = cap.read()
out = cv2.VideoWriter(r'E:\emotion_recognition\videos\videos_sum\output_video.mp4', -1, cap.get(cv2.CAP_PROP_FPS), (image_np.shape[1], image_np.shape[0]))

PATH_TO_CKPT = './ssd_mobilenet_v1_coco_2018_01_28/frozen_inference_graph.pb'
PATH_TO_LABELS = './data/mscoco_label_map.pbtxt'
NUM_CLASSES = 90
save_path='./img/'
i = 0

detection_graph = tf.Graph()
with detection_graph.as_default():
    od_graph_def = tf.GraphDef()
    with tf.gfile.GFile(PATH_TO_CKPT, 'rb') as fid:
        od_graph_def.ParseFromString(fid.read())
        tf.import_graph_def(od_graph_def, name='')

label_map = label_map_util.load_labelmap(PATH_TO_LABELS)
categories = label_map_util.convert_label_map_to_categories(label_map, max_num_classes=NUM_CLASSES,
                                                            use_display_name=True)
category_index = label_map_util.create_category_index(categories)


with detection_graph.as_default():
    with tf.Session(graph=detection_graph) as sess:
        image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
        detection_boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
        detection_scores = detection_graph.get_tensor_by_name('detection_scores:0')
        detection_classes = detection_graph.get_tensor_by_name('detection_classes:0')
        num_detections = detection_graph.get_tensor_by_name('num_detections:0')
        while cap.isOpened():
            ret, image_np = cap.read()
            if len((np.array(image_np)).shape) == 0:
                break

            image_np = cv2.cvtColor(image_np, cv2.COLOR_BGR2RGB)
            image_np_expanded = np.expand_dims(image_np, axis=0)

            (boxes, scores, classes, num) = sess.run(
                [detection_boxes, detection_scores, detection_classes, num_detections],
                feed_dict={image_tensor: image_np_expanded})
            # print(boxes[0].shape)
            # print(boxes[0][1])
            print(image_np.shape)
            print(scores[0][0])
            print(classes[0][0])
            box = boxes[0][0]
            ymin = int(box[0] * image_np.shape[0])
            xmin = int(box[1] * image_np.shape[1])
            ymax = int(box[2] * image_np.shape[0])
            xmax = int(box[3] * image_np.shape[1])
            # print("picture:",image_np.shape)
            # print("xmin",xmin)
            # print("xmax:", xmax)
            # print("ymin:",ymin)
            # print("ymax:",ymax)
            if scores[0][0]>=0.6 and classes[0][0]==1:
                new_img = cv2.rectangle(image_np, (xmin,ymin),(xmax,ymax), (0, 0, 255), 2)
                img = image_np[ymin:ymax,xmin:xmax]
                img = cv2.cvtColor(img,cv2.COLOR_BGR2RGB)
                cv2.imwrite('E:/emotion_recognition/pictures/img/photo_{}.jpg'.format(i), img)
                i+=1
            vis_util.visualize_boxes_and_labels_on_image_array(image_np, np.squeeze(boxes),
                                                               np.squeeze(classes).astype(np.int32), np.squeeze(scores),
                                                               category_index, use_normalized_coordinates=True,
                                                               line_thickness=8)
            out.write(cv2.cvtColor(image_np, cv2.COLOR_RGB2BGR))

cap.release()
out.release()
cv2.destroyAllWindows()

代码完整运行需要自定义文件路径，安装所需要的包。下载模型ssd_mobilenet_v1_coco_2018_01_28/frozen_inference_graph.pb，和对应的类别标签文件mscoco_label_map.pbtxt
模型下载地址：
https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/detection_model_zoo.md

运行结果

目标检测的结果
基于视频的目标检测的输出结果（模糊是后期处理，视频输出的还是带检测框的视频）在这里插入图片描述只截取框里的尺寸大小的图片的运行结果。

雪影晴(◕ˇ∀ˇ◕)林丹青

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
情绪识别的相关问题（一）

情绪识别的相关问题（一）从视频中提取人和动作参考：利用TensorFlow Object Detection API实现图片和视频物体检测文章链接地址：https://blog.csdn.net/Hampton_Chen/article/details/93538912获取坐标位置将目标检测框里的内容截图保存下来参考：利用object detection API训练模型的输出坐标文章链...
复制链接

扫一扫