Faster RCNN源码解读（4）全连接与分类

最新推荐文章于 2023-09-24 21:07:11 发布

葛葛葛立鹏啊

最新推荐文章于 2023-09-24 21:07:11 发布

阅读量1k

点赞数

文章标签：深度学习 tensorflow

本文链接：https://blog.csdn.net/weixin_44929101/article/details/105723617

版权

本文深入解析目标检测网络中Classification部分的工作原理，详细介绍了如何利用已获得的proposal feature maps通过全连接层与softmax进行分类，以及如何通过bounding box regression提高目标检测框的精度。

摘要由CSDN通过智能技术生成

前言

现在是最后的收尾部分：
Classification部分利用已经获得的proposal feature maps，通过full connect层与softmax计算每个proposal具体属于那个类别（如人，车，电视等），输出cls_prob概率向量；同时再次利用bounding box regression获得每个proposal的位置偏移量bbox_pred，用于回归更加精确的目标检测框。
在这里插入图片描述

从RoI Pooling获取到7x7=49大小的proposal feature maps后，送入后续网络，可以看到做了如下2件事：

通过全连接和softmax对proposals进行分类，这实际上已经是识别的范畴了
再次对proposals进行bounding box regression，获取更高精度的rect box

_head_to_tail

 def _head_to_tail(self, pool5, is_training, reuse=None):
    with tf.variable_scope(self._scope, self._scope, reuse=reuse):
      pool5_flat = slim.flatten(pool5, scope='flatten')
      fc6 = slim.fully_connected(pool5_flat, 4096, scope='fc6')
      if is_training:
        fc6 = slim.dropout(fc6, keep_prob=0.5, is_training=True, 
                            scope='dropout6')
      fc7 = slim.fully_connected(fc6, 4096, scope='fc7')
      if is_training:
        fc7 = slim.dropout(fc7, keep_prob=0.5, is_training=True, 
                            scope='dropout7')
 
    return fc7

这里经过两次全连接操作，这里有两个函数一是slim.flatten作用是扁平化输出dropout是训练部分。为了后续分类做准备。

_region_classification()

def _region_classification(self, fc7, is_training, initializer, initializer_bbox):
    cls_score = slim.fully_connected(fc7, self._num_classes, 
                                       weights_initializer=initializer,
                                       trainable=is_training,
                                       activation_fn=None, scope='cls_score')
    cls_prob = self._softmax_layer(cls_score, "cls_prob")
    cls_pred = tf.argmax(cls_score, axis=1, name="cls_pred")
    bbox_pred = slim.fully_connected(fc7, self._num_classes * 4, 
                                     weights_initializer=initializer_bbox,
                                     trainable=is_training,
                                     activation_fn=None, scope='bbox_pred')
 
    self._predictions["cls_score"] = cls_score
    self._predictions["cls_pred"] = cls_pred
    self._predictions["cls_prob"] = cls_prob
    self._predictions["bbox_pred"] = bbox_pred
 
    return cls_prob, bbox_pred