基于Tensorflow Object Detection API的宠物检测实践

ALL 2 WELL

已于 2022-02-27 09:37:22 修改

阅读量358

点赞数

分类专栏：经验文章标签：深度学习机器学习图像识别

于 2020-09-04 09:21:42 首次发布

本文链接：https://blog.csdn.net/WSQ_2000/article/details/108396897

版权

经验专栏收录该内容

1 篇文章 0 订阅

订阅专栏

使用Tensorflow Object Detection API进行宠物检测

API简介

一个很方便的物体检测模型训练、推理API, 很适合深度学习新手使用。建议搭配图书教程《深度学习图像识别技术：基于Tensorflow Object Detection API 与OpenVINO工作条件》食用

文章目录

使用Tensorflow Object Detection API进行宠物检测

官方教程

https://colab.research.google.com/github/tensorflow/models/blob/master/research/object_detection/colab_tutorials/object_detection_tutorial.ipynb

这里官方是在Google Colab中进行的操作，便于学习，我们也使用Google Colab进行学习实践。

接下来我们跟着教程一步一步走,目前还只是在google环境中跑，没有什么bug，等换个环境再来记录bug

1. 环境安装

!pip install -U --pre tensorflow=="2.*"
!pip install tf_slim
!pip install pycocotools

2. git获取代码

import os
import pathlib


if "models" in pathlib.Path.cwd().parts:
  while "models" in pathlib.Path.cwd().parts:
    os.chdir('..')
elif not pathlib.Path('models').exists():
  !git clone --depth 1 https://github.com/tensorflow/models

编译并安装目标检测包

%%bash
cd models/research/
protoc object_detection/protos/*.proto --python_out=.
%%bash 
cd models/research
pip install .

3. 相关接口封装

导入相关模块

import numpy as np
import os
import six.moves.urllib as urllib
import sys
import tarfile
import tensorflow as tf
import zipfile

from collections import defaultdict
from io import StringIO
from matplotlib import pyplot as plt
from PIL import Image
from IPython.display import display

from object_detection.utils import ops as utils_ops
from object_detection.utils import label_map_util
from object_detection.utils import visualization_utils as vis_util

不知道什么意思

# patch tf1 into `utils.ops`
utils_ops.tf = tf.compat.v1

# Patch the location of gfile
tf.gfile = tf.io.gfile

加载模型

def load_model(model_name):
  base_url = 'http://download.tensorflow.org/models/object_detection/'
  model_file = model_name + '.tar.gz'
  model_dir = tf.keras.utils.get_file(
    fname=model_name, 
    origin=base_url + model_file,
    untar=True)

  model_dir = pathlib.Path(model_dir)/"saved_model"

  model = tf.saved_model.load(str(model_dir))

  return model

加载标签与序号的对应关系

# List of the strings that is used to add correct label for each box.
PATH_TO_LABELS = 'models/research/object_detection/data/mscoco_label_map.pbtxt'
category_index = label_map_util.create_category_index_from_labelmap(PATH_TO_LABELS, use_display_name=True)

填写测试图片所在路径

# If you want to test the code with your images, just add path to the images to the TEST_IMAGE_PATHS.
PATH_TO_TEST_IMAGES_DIR = pathlib.Path('models/research/object_detection/test_images')
TEST_IMAGE_PATHS = sorted(list(PATH_TO_TEST_IMAGES_DIR.glob("*.jpg")))
TEST_IMAGE_PATHS

4. 开始检测

加载模型文件

model_name = 'ssd_mobilenet_v1_coco_2017_11_17'
detection_model = load_model(model_name)

查看输入层与输出层

print(detection_model.signatures['serving_default'].inputs)
detection_model.signatures['serving_default'].output_shapes
detection_model.signatures['serving_default'].output_dtypes

对于单张图片执行推理

def run_inference_for_single_image(model, image):
  image = np.asarray(image)
  # The input needs to be a tensor, convert it using `tf.convert_to_tensor`.
  input_tensor = tf.convert_to_tensor(image)
  # The model expects a batch of images, so add an axis with `tf.newaxis`.
  input_tensor = input_tensor[tf.newaxis,...]

  # Run inference
  model_fn = model.signatures['serving_default']
  output_dict = model_fn(input_tensor)

  # All outputs are batches tensors.
  # Convert to numpy arrays, and take index [0] to remove the batch dimension.
  # We're only interested in the first num_detections.
  num_detections = int(output_dict.pop('num_detections'))
  output_dict = {key:value[0, :num_detections].numpy() 
                 for key,value in output_dict.items()}
  output_dict['num_detections'] = num_detections

  # detection_classes should be ints.
  output_dict['detection_classes'] = output_dict['detection_classes'].astype(np.int64)
   
  # Handle models with masks:
  if 'detection_masks' in output_dict:
    # Reframe the the bbox mask to the image size.
    detection_masks_reframed = utils_ops.reframe_box_masks_to_image_masks(
              output_dict['detection_masks'], output_dict['detection_boxes'],
               image.shape[0], image.shape[1])      
    detection_masks_reframed = tf.cast(detection_masks_reframed > 0.5,
                                       tf.uint8)
    output_dict['detection_masks_reframed'] = detection_masks_reframed.numpy()
    
  return output_dict

根据推理结果在图像上画bounding box

def show_inference(model, image_path):
  # the array based representation of the image will be used later in order to prepare the
  # result image with boxes and labels on it.
  image_np = np.array(Image.open(image_path))
  # Actual detection.
  output_dict = run_inference_for_single_image(model, image_np)
  # Visualization of the results of a detection.
  vis_util.visualize_boxes_and_labels_on_image_array(
      image_np,
      output_dict['detection_boxes'],
      output_dict['detection_classes'],
      output_dict['detection_scores'],
      category_index,
      instance_masks=output_dict.get('detection_masks_reframed', None),
      use_normalized_coordinates=True,
      line_thickness=8)

  display(Image.fromarray(image_np))

对于测试图片路径下的每一张图片执行推理并显示结果

for image_path in TEST_IMAGE_PATHS:
  show_inference(detection_model, image_path)

5.最终结果

在这里插入图片描述

ALL 2 WELL

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
1
评论
基于Tensorflow Object Detection API的宠物检测实践

使用Tensorflow Object Detection API进行宠物检测API简介一个很方便的物体检测模型训练、推理API, 很适合深度学习新手使用。建议搭配图书教程《深度学习图像识别技术：基于Tensorflow Object Detection API 与OpenVINO工作条件》食用文章目录使用Tensorflow Object Detection API进行宠物检测API简介官方教程1. 环境安装2. git获取代码3. 相关接口封装4. 开始检测5.最终结果官方教程https
复制链接

扫一扫