FasterRCNN源码解析（五）——RPN（上）RPNHead及anchor生成

最新推荐文章于 2024-05-07 14:40:17 发布

在学习的王哈哈

最新推荐文章于 2024-05-07 14:40:17 发布

阅读量2.7k

点赞数 9

分类专栏：计算机视觉

本文链接：https://blog.csdn.net/prague6695/article/details/115111975

版权

FasterRCNN源码解析（五）——RPN部分

在这里插入图片描述

文章目录

FasterRCNN源码解析（五）——RPN部分
之前数据类型
一、RPNHead部分
二、AnchorsGenerator部分
- 1. forword部分
- 2. 生成ancher的方法
数据在RPN的传输路线

之前数据类型

我们给RPN模块所需要传入的参数有

# 将特征层以及标注target信息传入rpn中
# proposals: List[tensor], tensor_shape:[num_proposals, 4]
# 每个proposal是绝对坐标，且为（x1, y1, x2, y2）格式
proposals, proposal_losses = self.rpn(images, features, targets)

一、RPNHead部分

计算预测的目标分数以及预测的目标bbox regression参数

class RPNHead(nn.Module):
    """
    add a RPN head with classification and regression
    通过滑动窗口计算预测目标概率与bbox regression参数

    Arguments:
        in_channels: number of channels of the input feature
        num_anchors: number of anchors to be predicted
    """

    def __init__(self, in_channels, num_anchors):
        super(RPNHead, self).__init__()
        # 3x3 滑动窗口
        self.conv = nn.Conv2d(in_channels, in_channels, kernel_size=3, stride=1, padding=1)
        # 计算预测的目标分数（这里的目标只是指前景或者背景）
        self.cls_logits = nn.Conv2d(in_channels, num_anchors, kernel_size=1, stride=1)
        # 计算预测的目标bbox regression参数
        self.bbox_pred = nn.Conv2d(in_channels, num_anchors * 4, kernel_size=1, stride=1)

        for layer in self.children(): # 参数初始化
            if isinstance(layer, nn.Conv2d):
                torch.nn.init.normal_(layer.weight, std=0.01)
                torch.nn.init.constant_(layer.bias, 0)

    def forward(self, x):
        # type: (List[Tensor]) -> Tuple[List[Tensor], List[Tensor]]
        logits = []
        bbox_reg = []
        for i, feature in enumerate(x):
            t = F.relu(self.conv(feature))
            logits.append(self.cls_logits(t))
            bbox_reg.append(self.bbox_pred(t))
        return logits, bbox_reg

二、AnchorsGenerator部分

Anchor生成器

class AnchorsGenerator(nn.Module):
    __annotations__ = {
   
        "cell_anchors": Optional[List[torch.Tensor]],
        "_cache": Dict[str, List[torch.Tensor]]
    }

    """
    anchors生成器
    Module that generates anchors for a set of feature maps and
    image sizes.

    The module support computing anchors at multiple sizes and aspect ratios
    per feature map.

    sizes and aspect_ratios should have the same number of elements, and it should
    correspond to the number of feature maps.

    sizes[i] and aspect_ratios[i] can have an arbitrary number of elements,
    and AnchorGenerator will output a set of sizes[i] * aspect_ratios[i] anchors
    per spatial location for feature map i.

    Arguments:
        sizes (Tuple[Tuple[int]]):
        aspect_ratios (Tuple[Tuple[float]]):
    """

    def __init__(self, sizes=(128

最低0.47元/天解锁文章

在学习的王哈哈

关注

9
点赞
踩
24

收藏

觉得还不错? 一键收藏
0
评论
FasterRCNN源码解析（五）——RPN（上）RPNHead及anchor生成

FasterRCNN源码解析（五）——RPN部分文章目录FasterRCNN源码解析（五）——RPN部分之前数据类型一、RPNHead部分二、AnchorsGenerator部分1. forword部分1.引入库2.读入数据数据在RPN的传输路线之前数据类型我们给RPN模块所需要传入的参数有# 将特征层以及标注target信息传入rpn中# proposals: List[tensor], tensor_shape:[num_proposals, 4]# 每个proposal是绝对坐标，且
复制链接

扫一扫

专栏目录