This article follows the official GitHub documentation; the reference is:
https://github.com/rbgirshick/py-faster-rcnn
I. Software requirements:
1. Caffe must be built with support for Python layers:
# Makefile.config, make sure to have this line uncommented
WITH_PYTHON_LAYER := 1
# Unrelatedly, it's also recommended that you use CUDNN
USE_CUDNN := 1
2. The following Python packages are required: cython, python-opencv, easydict (an example install is sketched below).
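These can usually be installed with pip and the system package manager; for example, on Ubuntu (package names and the need for sudo may vary on other systems):
# Cython and easydict from pip
pip install cython easydict
# python-opencv is typically installed through the distribution's packages
sudo apt-get install python-opencv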
II. Hardware requirements:
1. For training smaller networks (ZF, VGG_CNN_M_1024), a good GPU with about 3 GB of memory is sufficient (e.g., Titan, K20, K40, ...).
2. For training Fast R-CNN with VGG16, you will need a K40 (~11 GB of memory).
3. For end-to-end training of Faster R-CNN built on VGG16, about 3 GB of GPU memory is sufficient when cuDNN is used.
III. Installation
1. Clone the Faster R-CNN source code from GitHub. Note that you must clone from the command line with --recursive rather than downloading the archive in a browser; otherwise the caffe-fast-rcnn submodule and other pieces will be missing:
# Make sure to clone with --recursive
git clone --recursive https://github.com/rbgirshick/py-faster-rcnn.git
2. Build the Cython modules:
cd $FRCN_ROOT/lib
make
3. Build Caffe and pycaffe:
cd $FRCN_ROOT/caffe-fast-rcnn
make -j8 && make pycaffe
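Note that make expects a Makefile.config in caffe-fast-rcnn. If you have not created one yet, below is a minimal sketch of preparing it before running the commands above, assuming the stock Makefile.config.example that ships with Caffe (edit the file by hand if the commented lines look different in your copy):
cd $FRCN_ROOT/caffe-fast-rcnn
cp Makefile.config.example Makefile.config
# enable Python layers, and cuDNN if it is installed
sed -i 's/^# WITH_PYTHON_LAYER := 1/WITH_PYTHON_LAYER := 1/' Makefile.config
sed -i 's/^# USE_CUDNN := 1/USE_CUDNN := 1/' Makefile.config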
Note: two kinds of errors commonly occur during compilation:
(1) Compilation fails because of an incompatible cuDNN version; cuDNN v4 is recommended when building this copy of Caffe.
(2) Errors during make such as string.h 'memcpy' was not declared in this scope are caused by a GCC version that is too new. The fix is to open the Makefile and replace:
NVCCFLAGS += -ccbin=$(CXX) -Xcompiler -fPIC $(COMMON_FLAGS)
with
NVCCFLAGS += -D_FORCE_INLINES -ccbin=$(CXX) -Xcompiler -fPIC $(COMMON_FLAGS)
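One way to apply this replacement from the command line (a sketch only; back up the Makefile first, and check that the flags line in your copy matches before running it):
cd $FRCN_ROOT/caffe-fast-rcnn
# prepend -D_FORCE_INLINES to the NVCC flags
sed -i 's/NVCCFLAGS += -ccbin=$(CXX)/NVCCFLAGS += -D_FORCE_INLINES -ccbin=$(CXX)/' Makefile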
Of course, if Caffe is set up correctly, these problems should not occur.
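Optionally, a quick sanity check that pycaffe is importable (a sketch, assuming the build above finished without errors):
cd $FRCN_ROOT/caffe-fast-rcnn
# if this exits cleanly with no error, pycaffe is usable
PYTHONPATH=python python -c "import caffe"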
4. Download the pre-trained Faster R-CNN models:
cd $FRCN_ROOT
./data/scripts/fetch_faster_rcnn_models.sh
This step downloads a compressed archive of roughly 700 MB into the $FRCN_ROOT/data folder and extracts it into faster_rcnn_models/ for later use.
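You can verify that the weights were extracted (the file names below are what the script is expected to produce; they may differ between releases):
ls $FRCN_ROOT/data/faster_rcnn_models
# expected output, roughly:
# VGG16_faster_rcnn_final.caffemodel  ZF_faster_rcnn_final.caffemodel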
IV. Testing
After completing the basic installation, test it with the demo:
cd $FRCN_ROOT
./tools/demo.py
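demo.py also takes a few options (as defined in tools/demo.py); for example, on a GPU with less memory you can run the smaller ZF network, or fall back to CPU mode:
# use the smaller ZF model instead of VGG16
./tools/demo.py --net zf
# run in CPU-only mode (much slower)
./tools/demo.py --cpu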
If the output looks like the following, the installation succeeded:
/usr/lib/python2.7/dist-packages/matplotlib/font_manager.py:273: UserWarning: Matplotlib is building the font cache using fc-list. This may take a moment.
warnings.warn('Matplotlib is building the font cache using fc-list. This may take a moment.')
WARNING: Logging before InitGoogleLogging() is written to STDERR
I0908 20:21:59.205785 4215 net.cpp:49] Initializing net from parameters:
name: "VGG_ILSVRC_16_layers"
input: "data"
input: "im_info"
state {
phase: TEST
}
input_shape {
dim: 1
dim: 3
dim: 224
dim: 224
}
input_shape {
dim: 1
dim: 3
}
layer {
name: "conv1_1"
type: "Convolution"
bottom: "data"
top: "conv1_1"
convolution_param {
num_output: 64
pad: 1
kernel_size: 3
}
}
layer {
name: "relu1_1"
type: "ReLU"
bottom: "conv1_1"
top: "conv1_1"
}
layer {
name: "conv1_2"
type: "Convolution"
bottom: "conv1_1"
top: "conv1_2"
convolution_param {
num_output: 64
pad: 1
kernel_size: 3
}
}
layer {
name: "relu1_2"
type: "ReLU"
bottom: "conv1_2"
top: "conv1_2"
}
layer {
name: "pool1"
type: "Pooling"
bottom: "conv1_2"
top: "pool1"
pooling_param {
pool: MAX
kernel_size: 2
stride: 2
}
}
layer {
name: "conv2_1"
type: "Convolution"
bottom: "pool1"
top: "conv2_1"
convolution_param {
num_output: 128
pad: 1
kernel_size: 3
}
}
layer {
name: "relu2_1"
type: "ReLU"
bottom: "conv2_1"
top: "conv2_1"
}
layer {
name: "conv2_2"
type: "Convolution"
bottom: "conv2_1"
top: "conv2_2"
convolution_param {
num_output: 128
pad: 1
kernel_size: 3
}
}
layer {
name: "relu2_2"
type: "ReLU"
bottom: "conv2_2"
top: "conv2_2"
}
layer {
name: "pool2"
type: "Pooling"
bottom: "conv2_2"
top: "pool2"
pooling_param {
pool: MAX
kernel_size: 2
stride: 2
}
}
layer {
name: "conv3_1"
type: "Convolution"
bottom: "pool2"
top: "conv3_1"
convolution_param {
num_output: 256
pad: 1
kernel_size: 3
}
}
layer {
name: "relu3_1"
type: "ReLU"
bottom: "conv3_1"
top: "conv3_1"
}
layer {
name: "conv3_2"
type: "Convolution"
bottom: "conv3_1"
top: "conv3_2"
convolution_param {
num_output: 256
pad: 1
kernel_size: 3
}
}
layer {
name: "relu3_2"
type: "ReLU"
bottom: "conv3_2"
top: "conv3_2"
}
layer {
name: "conv3_3"
type: "Convolution"
bottom: "conv3_2"
top: "conv3_3"
convolution_param {
num_output: 256
pad: 1
kernel_size: 3
}
}
layer {
name: "relu3_3"
type: "ReLU"
bottom: "conv3_3"
top: "conv3_3"
}
layer {
name: "pool3"
type: "Pooling"
bottom: "conv3_3"
top: "pool3"
pooling_param {
pool: MAX
kernel_size: 2
stride: 2
}
}
layer {
name: "conv4_1"
type: "Convolution"
bottom: "pool3"
top: "conv4_1"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu4_1"
type: "ReLU"
bottom: "conv4_1"
top: "conv4_1"
}
layer {
name: "conv4_2"
type: "Convolution"
bottom: "conv4_1"
top: "conv4_2"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu4_2"
type: "ReLU"
bottom: "conv4_2"
top: "conv4_2"
}
layer {
name: "conv4_3"
type: "Convolution"
bottom: "conv4_2"
top: "conv4_3"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu4_3"
type: "ReLU"
bottom: "conv4_3"
top: "conv4_3"
}
layer {
name: "pool4"
type: "Pooling"
bottom: "conv4_3"
top: "pool4"
pooling_param {
pool: MAX
kernel_size: 2
stride: 2
}
}
layer {
name: "conv5_1"
type: "Convolution"
bottom: "pool4"
top: "conv5_1"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu5_1"
type: "ReLU"
bottom: "conv5_1"
top: "conv5_1"
}
layer {
name: "conv5_2"
type: "Convolution"
bottom: "conv5_1"
top: "conv5_2"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu5_2"
type: "ReLU"
bottom: "conv5_2"
top: "conv5_2"
}
layer {
name: "conv5_3"
type: "Convolution"
bottom: "conv5_2"
top: "conv5_3"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu5_3"
type: "ReLU"
bottom: "conv5_3"
top: "conv5_3"
}
layer {
name: "rpn_conv/3x3"
type: "Convolution"
bottom: "conv5_3"
top: "rpn/output"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
stride: 1
}
}
layer {
name: "rpn_relu/3x3"
type: "ReLU"
bottom: "rpn/output"
top: "rpn/output"
}
layer {
name: "rpn_cls_score"
type: "Convolution"
bottom: "rpn/output"
top: "rpn_cls_score"
convolution_param {
num_output: 18
pad: 0
kernel_size: 1
stride: 1
}
}
layer {
name: "rpn_bbox_pred"
type: "Convolution"
bottom: "rpn/output"
top: "rpn_bbox_pred"
convolution_param {
num_output: 36
pad: 0
kernel_size: 1
stride: 1
}
}
layer {
name: "rpn_cls_score_reshape"
type: "Reshape"
bottom: "rpn_cls_score"
top: "rpn_cls_score_reshape"
reshape_param {
shape {
dim: 0
dim: 2
dim: -1
dim: 0
}
}
}
layer {
name: "rpn_cls_prob"
type: "Softmax"
bottom: "rpn_cls_score_reshape"
top: "rpn_cls_prob"
}
layer {
name: "rpn_cls_prob_reshape"
type: "Reshape"
bottom: "rpn_cls_prob"
top: "rpn_cls_prob_reshape"
reshape_param {
shape {
dim: 0
dim: 18
dim: -1
dim: 0
}
}
}
layer {
name: "proposal"
type: "Python"
bottom: "rpn_cls_prob_reshape"
bottom: "rpn_bbox_pred"
bottom: "im_info"
top: "rois"
python_param {
module: "rpn.proposal_layer"
layer: "ProposalLayer"
param_str: "\'feat_stride\': 16"
}
}
layer {
name: "roi_pool5"
type: "ROIPooling"
bottom: "conv5_3"
bottom: "rois"
top: "pool5"
roi_pooling_param {
pooled_h: 7
pooled_w: 7
spatial_scale: 0.0625
}
}
layer {
name: "fc6"
type: "InnerProduct"
bottom: "pool5"
top: "fc6"
inner_product_param {
num_output: 4096
}
}
layer {
name: "relu6"
type: "ReLU"
bottom: "fc6"
top: "fc6"
}
layer {
name: "fc7"
type: "InnerProduct"
bottom: "fc6"
top: "fc7"
inner_product_param {
num_output: 4096
}
}
layer {
name: "relu7"
type: "ReLU"
bottom: "fc7"
top: "fc7"
}
layer {
name: "cls_score"
type: "InnerProduct"
bottom: "fc7"
top: "cls_score"
inner_product_param {
num_output: 21
}
}
layer {
name: "bbox_pred"
type: "InnerProduct"
bottom: "fc7"
top: "bbox_pred"
inner_product_param {
num_output: 84
}
}
layer {
name: "cls_prob"
type: "Softmax"
bottom: "cls_score"
top: "cls_prob"
}
I0908 20:21:59.206061 4215 net.cpp:413] Input 0 -> data
I0908 20:21:59.309164 4215 net.cpp:413] Input 1 -> im_info
I0908 20:21:59.309209 4215 layer_factory.hpp:77] Creating layer conv1_1
I0908 20:21:59.309229 4215 net.cpp:106] Creating Layer conv1_1
I0908 20:21:59.309232 4215 net.cpp:454] conv1_1 <- data
I0908 20:21:59.309237 4215 net.cpp:411] conv1_1 -> conv1_1
I0908 20:22:00.151002 4215 net.cpp:150] Setting up conv1_1
I0908 20:22:00.151026 4215 net.cpp:157] Top shape: 1 64 224 224 (3211264)
I0908 20:22:00.151029 4215 net.cpp:165] Memory required for data: 12845056
I0908 20:22:00.151042 4215 layer_factory.hpp:77] Creating layer relu1_1
I0908 20:22:00.151053 4215 net.cpp:106] Creating Layer relu1_1
I0908 20:22:00.151057 4215 net.cpp:454] relu1_1 <- conv1_1
I0908 20:22:00.151062 4215 net.cpp:397] relu1_1 -> conv1_1 (in-place)
I0908 20:22:00.151320 4215 net.cpp:150] Setting up relu1_1
I0908 20:22:00.151327 4215 net.cpp:157] Top shape: 1 64 224 224 (3211264)
I0908 20:22:00.151329 4215 net.cpp:165] Memory required for data: 25690112
I0908 20:22:00.151331 4215 layer_factory.hpp:77] Creating layer conv1_2
I0908 20:22:00.151340 4215 net.cpp:106] Creating Layer conv1_2
I0908 20:22:00.151342 4215 net.cpp:454] conv1_2 <- conv1_1
I0908 20:22:00.151347 4215 net.cpp:411] conv1_2 -> conv1_2
I0908 20:22:00.152340 4215 net.cpp:150] Setting up conv1_2
I0908 20:22:00.152350 4215 net.cpp:157] Top shape: 1 64 224 224 (3211264)
I0908 20:22:00.152354 4215 net.cpp:165] Memory required for data: 38535168
I0908 20:22:00.152359 4215 layer_factory.hpp:77] Creating layer relu1_2
I0908 20:22:00.152364 4215 net.cpp:106] Creating Layer relu1_2
I0908 20:22:00.152367 4215 net.cpp:454] relu1_2 <- conv1_2
I0908 20:22:00.152371 4215 net.cpp:397] relu1_2 -> conv1_2 (in-place)
I0908 20:22:00.152498 4215 net.cpp:150] Setting up relu1_2
I0908 20:22:00.152503 4215 net.cpp:157] Top shape: 1 64 224 224 (3211264)
I0908 20:22:00.152504 4215 net.cpp:165] Memory required for data: 51380224
I0908 20:22:00.152506 4215 layer_factory.hpp:77] Creating layer pool1
I0908 20:22:00.152513 4215 net.cpp:106] Creating Layer pool1
I0908 20:22:00.152513 4215 net.cpp:454] pool1 <- conv1_2
I0908 20:22:00.152518 4215 net.cpp:411] pool1 -> pool1
I0908 20:22:00.152549 4215 net.cpp:150] Setting up pool1
I0908 20:22:00.152552 4215 net.cpp:157] Top shape: 1 64 112 112 (802816)
I0908 20:22:00.152554 4215 net.cpp:165] Memory required for data: 54591488
I0908 20:22:00.152555 4215 layer_factory.hpp:77] Creating layer conv2_1
I0908 20:22:00.152561 4215 net.cpp:106] Creating Layer conv2_1
I0908 20:22:00.152564 4215 net.cpp:454] conv2_1 <- pool1
I0908 20:22:00.152566 4215 net.cpp:411] conv2_1 -> conv2_1
I0908 20:22:00.154081 4215 net.cpp:150] Setting up conv2_1
I0908 20:22:00.154090 4215 net.cpp:157] Top shape: 1 128 112 112 (1605632)
I0908 20:22:00.154093 4215 net.cpp:165] Memory required for data: 61014016
I0908 20:22:00.154098 4215 layer_factory.hpp:77] Creating layer relu2_1
I0908 20:22:00.154103 4215 net.cpp:106] Creating Layer relu2_1
I0908 20:22:00.154105 4215 net.cpp:454] relu2_1 <- conv2_1
I0908 20:22:00.154110 4215 net.cpp:397] relu2_1 -> conv2_1 (in-place)
I0908 20:22:00.154381 4215 net.cpp:150] Setting up relu2_1
I0908 20:22:00.154388 4215 net.cpp:157] Top shape: 1 128