CuPy series (2): implementing roi_pooling

农夫山泉2号

Categories: PYTHON, deep learning, cupy. Tags: roi_pooling, chainer

Article link: https://blog.csdn.net/u011622208/article/details/91042243

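The code below is copied from Chainer, which officially ships separate CPU and GPU implementations of ROI pooling.
Out of respect for the original authors, please read the source: chainer_roipooling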

Example

import cupy as cp
import numpy as np

# Input feature map with shape (batch, channels, height, width)
bottom_data = cp.random.randn(1, 3, 40, 40, dtype=np.float32)
batch, channels, height, width = bottom_data.shape
# Scale applied to ROI coordinates to map them onto the feature map
spatial_scale = 1.0
# Each ROI is (batch_index, x_min, y_min, x_max, y_max)
rois = cp.array([[0, 2, 2, 10, 10],
                 [0, 3, 3, 20, 20]], dtype=np.float32)
pooled_width = 7     # width after pooling
pooled_height = 7    # height after pooling

# Output feature map: one pooled_height x pooled_width grid per ROI and channel
top_data = cp.zeros((rois.shape[0], channels, pooled_height, pooled_width),
                    dtype=np.float32)
# Index of the maximum inside each pooling bin
argmax_data = cp.zeros(top_data.shape, np.int32)

# Define the kernel function
roi_pooling_2d_fwd = cp.ElementwiseKernel(
            '''
            raw T bottom_data, T spatial_scale, int32 channels,
            int32 height, int32 width, int32 pooled_height, int32 pooled_width,
            raw T bottom_rois
            ''',
            'T top_data, int32 argmax_data',
            '''
            // pos in output filter
            int pw = i % pooled_width;
            int ph = (i / pooled_width) % pooled_height;
            int c = (i / pooled_width / pooled_height) % channels;
            int num = i / pooled_width / pooled_height / channels;
            int roi_batch_ind = bottom_rois[num * 5 + 0];
            int roi_start_w = round(bottom_rois[num * 5 + 1] * spatial_scale);
            int roi_start_h = round(bottom_rois[num * 5 + 2] * spatial_scale);
            int roi_end_w = round(bottom_rois[num * 5 + 3] * spatial_scale);
            int roi_end_h = round(bottom_rois[num * 5 + 4] * spatial_scale);
            // Force malformed ROIs to be 1x1
            int roi_width = max(roi_end_w - roi_start_w + 1, 1);
            int roi_height = max(roi_end_h - roi_start_h + 1, 1);
            float bin_size_h = static_cast<float>(roi_height)
                           / static_cast<float>(pooled_height);
            float bin_size_w = static_cast<float>(roi_width)
                           / static_cast<float>(pooled_width);
            int hstart = static_cast<int>(floor(static_cast<float>(ph)
                                          * bin_size_h));
            int wstart = static_cast<int>(floor(static_cast<float>(pw)
                                          * bin_size_w));
            int hend = static_cast<int>(ceil(static_cast<float>(ph + 1)
                                        * bin_size_h));
            int wend = static_cast<int>(ceil(static_cast<float>(pw + 1)
                                        * bin_size_w));
            // Add roi offsets and clip to input boundaries
            hstart = min(max(hstart + roi_start_h, 0), height);
            hend = min(max(hend + roi_start_h, 0), height);
            wstart = min(max(wstart + roi_start_w, 0), width);
            wend = min(max(wend + roi_start_w, 0), width);
            bool is_empty = (hend <= hstart) || (wend <= wstart);
            // Define an empty pooling region to be zero
            float maxval = is_empty ? 0 : -1E+37;
            // If nothing is pooled, argmax=-1 causes nothing to be backprop'd
            int maxidx = -1;
            int data_offset = (roi_batch_ind * channels + c) * height * width;
            for (int h = hstart; h < hend; ++h) {
                for (int w = wstart; w < wend; ++w) {
                    int bottom_index = h * width + w;
                    if (bottom_data[data_offset + bottom_index] > maxval) {
                        maxval = bottom_data[data_offset + bottom_index];
                        maxidx = bottom_index;
                    }
                }
            }
            top_data = maxval;
            argmax_data = maxidx;
            ''', 'roi_pooling_2d_fwd'
        )
# Launch one thread per output element; the last two arguments are the outputs
roi_pooling_2d_fwd(bottom_data, spatial_scale, channels, height, width,
          pooled_height, pooled_width, rois, top_data, argmax_data)

print(top_data.shape)
# (2, 3, 7, 7)
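To sanity-check the kernel's output, it can help to have a slow CPU version of the same pooling rule. The following is a minimal pure-Python sketch, not Chainer's CPU implementation. The names `c_round` and `roi_max_pool_reference` are hypothetical helpers introduced here for illustration. It assumes the feature map can be indexed as `feature[b][c][h][w]` (nested lists or a NumPy array), and it computes bin sizes in double precision, whereas the kernel uses `float`, so bin edges could in principle differ in rare rounding cases.

```python
import math


def c_round(x):
    """Round half away from zero, like C's round() (Python's round() differs)."""
    return int(math.copysign(math.floor(abs(x) + 0.5), x))


def roi_max_pool_reference(feature, rois, pooled_height, pooled_width,
                           spatial_scale=1.0):
    """Hypothetical CPU reference; `feature` is indexed as feature[b][c][h][w]."""
    channels = len(feature[0])
    height = len(feature[0][0])
    width = len(feature[0][0][0])
    top, argmax = [], []
    for roi in rois:
        batch_ind = int(roi[0])
        start_w, start_h, end_w, end_h = (
            c_round(v * spatial_scale) for v in roi[1:5])
        # Force malformed ROIs to be at least 1x1, as the kernel does.
        roi_w = max(end_w - start_w + 1, 1)
        roi_h = max(end_h - start_h + 1, 1)
        bin_h = roi_h / pooled_height
        bin_w = roi_w / pooled_width
        roi_top, roi_arg = [], []
        for c in range(channels):
            plane = feature[batch_ind][c]
            vals, idxs = [], []
            for ph in range(pooled_height):
                # Bin rows, shifted by the ROI offset and clipped to the input.
                hs = min(max(math.floor(ph * bin_h) + start_h, 0), height)
                he = min(max(math.ceil((ph + 1) * bin_h) + start_h, 0), height)
                row_vals, row_idxs = [], []
                for pw in range(pooled_width):
                    ws = min(max(math.floor(pw * bin_w) + start_w, 0), width)
                    we = min(max(math.ceil((pw + 1) * bin_w) + start_w, 0),
                             width)
                    # An empty bin yields value 0 and argmax -1.
                    is_empty = he <= hs or we <= ws
                    best, best_idx = (0.0 if is_empty else -1e37), -1
                    for h in range(hs, he):
                        for w in range(ws, we):
                            if plane[h][w] > best:
                                best = plane[h][w]
                                best_idx = h * width + w
                    row_vals.append(best)
                    row_idxs.append(best_idx)
                vals.append(row_vals)
                idxs.append(row_idxs)
            roi_top.append(vals)
            roi_arg.append(idxs)
        top.append(roi_top)
        argmax.append(roi_arg)
    return top, argmax


# A 1x1x4x4 feature map holding 0..15, pooled to 2x2 over the whole map.
feature = [[[[float(h * 4 + w) for w in range(4)] for h in range(4)]]]
top, argmax = roi_max_pool_reference(feature, [[0, 0, 0, 3, 3]], 2, 2)
print(top[0][0])     # [[5.0, 7.0], [13.0, 15.0]]
print(argmax[0][0])  # [[5, 7], [13, 15]]
```

To compare against the GPU result from the example, one option is to copy the arrays to the host with `cp.asnumpy(bottom_data)` and `cp.asnumpy(rois)`, pass them to the reference together with `pooled_height` and `pooled_width`, and check the output against `cp.asnumpy(top_data)`.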
