py-faster-rcnn-cpu
When testing a model, py-faster-rcnn lets you choose between CPU mode and GPU mode, but if you want to train your own model with the framework, a GPU is mandatory. Presumably for the sake of training speed, the author wrote and shipped only GPU implementations of roi_pooling_layer and smooth_L1_loss_layer.
Both files live under py-faster-rcnn/caffe-fast-rcnn/src/caffe/layers. Open them and you will see that in smooth_L1_loss_layer.cpp both the forward and backward passes are NOT_IMPLEMENTED, so without a sufficiently capable GPU, training is simply not possible.
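For context, the math the missing smooth L1 pieces have to implement is just the piecewise Huber-style function and its derivative. Here is a minimal standalone C++ sketch of both (not the Caffe layer itself; the function names are my own):

#include <cmath>

// smooth_l1(x) = 0.5 * x^2   if |x| < 1
//              = |x| - 0.5   otherwise
// Its derivative is x inside (-1, 1) and sign(x) outside,
// which is what a CPU Backward pass applies element-wise.
inline float smooth_l1(float x) {
  float ax = std::fabs(x);
  return ax < 1.0f ? 0.5f * x * x : ax - 0.5f;
}

inline float smooth_l1_grad(float x) {
  if (std::fabs(x) < 1.0f) return x;
  return x > 0.0f ? 1.0f : -1.0f;
}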
Below are my modifications to these two files, implementing the CPU versions of the missing functions; if you spot a mistake, corrections and discussion are welcome. Both files can also be found on my GitHub. To use them, simply replace the original files and re-run make.
roi_pooling_layer.cpp
// ------------------------------------------------------------------
// Fast R-CNN
// Copyright (c) 2015 Microsoft
// Licensed under The MIT License [see fast-rcnn/LICENSE for details]
// Written by Ross Girshick
// ------------------------------------------------------------------
#include <cfloat>
#include "caffe/fast_rcnn_layers.hpp"
using std::max;
using std::min;
using std::floor;
using std::ceil;
namespace caffe {
template <typename Dtype>
void ROIPoolingLayer<Dtype>::LayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  ROIPoolingParameter roi_pool_param = this->layer_param_.roi_pooling_param();
  CHECK_GT(roi_pool_param.pooled_h(), 0)
      << "pooled_h must be > 0";
  CHECK_GT(roi_pool_param.pooled_w(), 0)
      << "pooled_w must be > 0";
  pooled_height_ = roi_pool_param.pooled_h();
  pooled_width_ = roi_pool_param.pooled_w();
  spatial_scale_ = roi_pool_param.spatial_scale();
  LOG(INFO) << "Spatial scale: " << spatial_scale_;
}

template <typename Dtype>
void ROIPoolingLayer<Dtype>::Reshape(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  channels_ = bottom[0]->channels();
  height_ = bottom[0]->height();
  width_ = bottom[0]->width();
  top[0]->Reshape(bottom[1]->num(), channels_, pooled_height_, pooled_width_);
  // max_idx_ records, per output element, the argmax index used in backprop
  max_idx_.Reshape(bottom[1]->num(), channels_, pooled_height_, pooled_width_);
}

template <typename Dtype>
void ROIPoolingLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top) {
  const Dtype* bottom_data = bottom[0]->cpu_data();
  const Dtype* bottom_rois = bottom[1]->cpu_data();
  // Number of ROIs
  int num_rois = bottom[1]->num();
  int batch_size = bottom[0]->num();
  int top_count = top[0]->count();
  Dtype* top_data = top[0]->mutable_cpu_data();
  // Init top_data to -FLT_MAX so any real value wins the running max
  caffe_set(top_count, Dtype(-FLT_MAX), top_data);
  int* argmax_data = max_idx_.mutable_cpu_data();
  // Init argmax_data to -1 ("no max recorded yet")
  caffe_set(top_count, -1, argmax_data);
  // For each ROI R = [batch_index x1 y1 x2 y2]: max pool over R
  for (int n = 0; n < num_rois; ++n) {
    int roi_batch_ind = bottom_rois[0];
    int roi_start_w = round(bottom_rois[1] * spatial_scale_);
    int roi_start_h = round(bottom_rois[2] * spatial_scale_);
    int roi_end_w = round(bottom_rois[3] * spatial_scale_);
    int roi_end_h = round(bottom_rois[4] * spatial_scale_);
    CHECK_GE(roi_batch_ind, 0);
    CHECK_LT(roi_batch_ind, batch_size);
    // Force malformed ROIs to be at least 1x1
    int roi_height = max(roi_end_h - roi_start_h + 1, 1);
    int roi_width = max(roi_end_w - roi_start_w + 1, 1);
    const Dtype bin_size_h = static_cast<Dtype>(roi_height)
                             / static_cast<Dtype>(pooled_height_);
    const Dtype bin_size_w = static_cast<Dtype>(roi_width)
                             / static_cast<Dtype>(pooled_width_);
    const Dtype* batch_data = bottom_data + bottom[0]->offset(roi_batch_ind);
    for (int c = 0; c < channels_; ++c) {
      for (int ph = 0; ph < pooled_height_; ++ph) {
        for (int pw = 0; pw < pooled_width_; ++pw) {
          // Bin (ph, pw) covers [floor(ph * bin_h), ceil((ph + 1) * bin_h))
          int hstart = static_cast<int>(floor(ph * bin_size_h));
          int wstart = static_cast<int>(floor(pw * bin_size_w));
          int hend = static_cast<int>(ceil((ph + 1) * bin_size_h));
          int wend = static_cast<int>(ceil((pw + 1) * bin_size_w));
          // Shift by the ROI offset and clip to the feature-map bounds
          hstart = min(max(hstart + roi_start_h, 0), height_);
          hend = min(max(hend + roi_start_h, 0), height_);
          wstart = min(max(wstart + roi_start_w, 0), width_);
          wend = min(max(wend + roi_start_w, 0), width_);
          bool is_empty = (hend <= hstart) || (wend <= wstart);
          const int pool_index = ph * pooled_width_ + pw;
          if (is_empty) {
            top_data[pool_index] = 0;
            argmax_data[pool_index] = -1;
          }
          // Max pool over the bin, remembering the argmax for backprop
          for (int h = hstart; h < hend; ++h) {
            for (int w = wstart; w < wend; ++w) {
              const int index = h * width_ + w;
              if (batch_data[index] > top_data[pool_index]) {
                top_data[pool_index] = batch_data[index];
                argmax_data[pool_index] = index;
              }
            }
          }
        }
      }
      // Advance all pointers by one channel
      batch_data += bottom[0]->offset(0, 1);
      top_data += top[0]->offset(0, 1);
      argmax_data += max_idx_.offset(0, 1);
    }
    // Each ROI row holds 5 values
    bottom_rois += bottom[1]->offset(1);
  }
}
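
// Training also needs the backward pass, which the stock file leaves as
// NOT_IMPLEMENTED. Below is a minimal sketch of a Backward_cpu, assuming
// the max_idx_ bookkeeping above: each top gradient is routed back to the
// bottom element whose index was recorded during the forward pass. ROIs may
// overlap on the same bottom element, so gradients accumulate with +=.
template <typename Dtype>
void ROIPoolingLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
      const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
  if (!propagate_down[0]) { return; }
  const Dtype* bottom_rois = bottom[1]->cpu_data();
  const Dtype* top_diff = top[0]->cpu_diff();
  Dtype* bottom_diff = bottom[0]->mutable_cpu_diff();
  // Zero the whole bottom diff before accumulating
  caffe_set(bottom[0]->count(), Dtype(0), bottom_diff);
  const int* argmax_data = max_idx_.cpu_data();
  int num_rois = top[0]->num();
  for (int n = 0; n < num_rois; ++n) {
    int roi_batch_ind = bottom_rois[0];
    Dtype* batch_diff = bottom_diff + bottom[0]->offset(roi_batch_ind);
    for (int c = 0; c < channels_; ++c) {
      for (int ph = 0; ph < pooled_height_; ++ph) {
        for (int pw = 0; pw < pooled_width_; ++pw) {
          const int pool_index = ph * pooled_width_ + pw;
          const int bottom_index = argmax_data[pool_index];
          if (bottom_index >= 0) {
            batch_diff[bottom_index] += top_diff[pool_index];
          }
        }
      }
      // Advance all pointers by one channel
      batch_diff += bottom[0]->offset(0, 1);
      top_diff += top[0]->offset(0, 1);
      argmax_data += max_idx_.offset(0, 1);
    }
    bottom_rois += bottom[1]->offset(1);
  }
}

#ifdef CPU_ONLY
STUB_GPU(ROIPoolingLayer);
#endif

INSTANTIATE_CLASS(ROIPoolingLayer);
REGISTER_LAYER_CLASS(ROIPooling);

}  // namespace caffe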