Adding a Custom Layer to Caffe

Huo的藏经阁

Article link: https://blog.csdn.net/weixin_42730667/article/details/103221685

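As described in 《剖析Caffe源码之Layer》 (Dissecting the Caffe Source: Layer), Layer is the base class of all layers, and the various concrete layers derive from it, as shown in the figure below:

[Figure: classes derived from Layer]

The layers built this way cover most needs, but sometimes a network topology calls for a layer that has not been implemented, and the user has to add a new one. Adding a layer is fairly simple; the example below walks through the process.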

Example

The hyperbolic cosine function, cosh(x) = (e^x + e^(-x)) / 2, serves as the example for adding a custom layer.

The layer is named cosh.

Adding the header file

First add the header for the cosh layer. The file is named cosh_layer.hpp and is placed in the \include\caffe\layers\ folder.

Its contents are as follows:

#ifndef CAFFE_COSH_LAYER_HPP_
#define CAFFE_COSH_LAYER_HPP_

#include <vector>

#include "caffe/blob.hpp"
#include "caffe/layer.hpp"
#include "caffe/proto/caffe.pb.h"



namespace caffe {

/**
 * @brief Computes the hyperbolic cosine of each input element, y = cosh(x).
 */
template <typename Dtype>
class CoshLayer : public Layer<Dtype> {
 public:
  /**
   * @param param the LayerParameter for this layer; Cosh defines no options
   *        of its own.
   */
  explicit CoshLayer(const LayerParameter& param)
      : Layer<Dtype>(param) {}
  virtual void LayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);
  virtual void Reshape(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);

  virtual inline const char* type() const { return "Cosh"; }
  virtual inline int ExactNumBottomBlobs() const { return 1; }

  virtual inline int MinTopBlobs() const { return 1; }
  virtual inline int MaxTopBlobs() const { return 1; }

 protected:

  virtual void Forward_cpu(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);
  virtual void Forward_gpu(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);


  /// @brief Left empty in this example, so no gradient is propagated.
  virtual void Backward_cpu(const vector<Blob<Dtype>*>& top,
      const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) ;
  virtual void Backward_gpu(const vector<Blob<Dtype>*>& top,
      const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);

};

}  // namespace caffe

#endif  // CAFFE_COSH_LAYER_HPP_

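A few points deserve special mention:

1: inline const char* type() returns the layer type. It should be set to this layer's own name (here "Cosh"), the same name that is later registered and that a network definition uses to select the layer, so it matters for locating the layer.

2: ExactNumBottomBlobs() states how many Blobs the layer takes as input. The cosh layer takes exactly one.

3: MinTopBlobs and MaxTopBlobs give the minimum and maximum number of output Blobs. Both are 1 here, so the layer produces exactly one output.

4: The layer has no parameters, so no parameter definitions need to be added.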

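Next add the CPP file, which holds the layer's implementation. It is named cosh_layer.cpp and is placed in the src\caffe\layers directory. It mainly implements the following functions: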

LayerSetUp

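The setup function prepares whatever the layer needs before it runs, such as reading the layer's parameters. This layer has no parameters, so the function body is empty.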

Reshape

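The Reshape() function defines the shape of the layer's output. The rule in Caffe is that a layer sets the shape of its output (top) and never the shape of its input (bottom). Since cosh is applied element by element, the output has the same shape as the input, so the implementation is:

template <typename Dtype>
void CoshLayer<Dtype>::Reshape(
    const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top) {
  // Give the output blob the same shape as the input blob.
  top[0]->ReshapeLike(*bottom[0]);
}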

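Forward propagation

The forward pass computes cosh(x) = (e^x + e^(-x)) / 2 for every element:

template <typename Dtype>
void CoshLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  const Dtype* bottom_data = bottom[0]->cpu_data();
  // The output is written to, so a mutable pointer is required.
  Dtype* top_data = top[0]->mutable_cpu_data();
  const int count = bottom[0]->count();
  for (int i = 0; i < count; ++i) {
    top_data[i] = (std::exp(bottom_data[i]) + std::exp(-bottom_data[i])) / 2;
  }
}

#ifndef CPU_ONLY
// No CUDA kernel is provided, so the GPU entry point reuses the CPU code.
// In CPU_ONLY builds STUB_GPU supplies this definition instead.
template <typename Dtype>
void CoshLayer<Dtype>::Forward_gpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  this->Forward_cpu(bottom, top);
}
#endif  // CPU_ONLY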

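If an NVIDIA GPU is available, the forward pass can be implemented in CUDA. Without a GPU, Forward_gpu can simply call the CPU implementation.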
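The element-wise formula can be checked outside Caffe before it is wired into a layer. The following is a minimal standalone sketch, not part of the original post: CoshForward and MatchesStdCosh are hypothetical helper names, and plain std::vector stands in for Caffe's Blob.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of Caffe): applies the same element-wise
// formula as Forward_cpu to a plain std::vector.
std::vector<double> CoshForward(const std::vector<double>& bottom) {
  std::vector<double> top(bottom.size());
  for (std::size_t i = 0; i < bottom.size(); ++i) {
    top[i] = (std::exp(bottom[i]) + std::exp(-bottom[i])) / 2;
  }
  return top;
}

// Hypothetical helper: returns true if every output is within tol of the
// standard library's std::cosh for the same input.
bool MatchesStdCosh(const std::vector<double>& bottom, double tol) {
  const std::vector<double> top = CoshForward(bottom);
  assert(top.size() == bottom.size());
  for (std::size_t i = 0; i < bottom.size(); ++i) {
    if (std::fabs(top[i] - std::cosh(bottom[i])) > tol) return false;
  }
  return true;
}
```

Calling MatchesStdCosh with a handful of positive and negative inputs is a cheap way to confirm the loop body before debugging it inside a full Caffe build. For large |x| the exponential overflows, which is one reason a production layer might prefer std::cosh directly.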

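Backward propagation

This example does not use backpropagation, so both functions are left empty. As written, no gradient reaches the bottom blob, so the layer is only suitable for inference.

template <typename Dtype>
void CoshLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
  // Intentionally empty: no gradient is computed in this example.
}

#ifndef CPU_ONLY
// In CPU_ONLY builds STUB_GPU supplies this definition instead.
template <typename Dtype>
void CoshLayer<Dtype>::Backward_gpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
  // Intentionally empty: no gradient is computed in this example.
}
#endif  // CPU_ONLY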
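
If the layer were ever needed for training, the backward pass would follow from the chain rule: the derivative of cosh(x) is sinh(x), so the gradient with respect to the input is the incoming gradient multiplied by sinh(x). The following standalone sketch illustrates that computation on plain vectors. CoshBackward is a hypothetical helper name and this is not the original author's code.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of Caffe): gradient of an element-wise cosh.
// bottom_data holds the layer inputs x, top_diff holds dLoss/dy, and the
// result is dLoss/dx = top_diff * sinh(x).
std::vector<double> CoshBackward(const std::vector<double>& bottom_data,
                                 const std::vector<double>& top_diff) {
  assert(bottom_data.size() == top_diff.size());
  std::vector<double> bottom_diff(bottom_data.size());
  for (std::size_t i = 0; i < bottom_data.size(); ++i) {
    bottom_diff[i] = top_diff[i] * std::sinh(bottom_data[i]);
  }
  return bottom_diff;
}
```

Inside Backward_cpu the same loop would read top[0]->cpu_diff() and bottom[0]->cpu_data(), write to bottom[0]->mutable_cpu_diff(), and be skipped when propagate_down[0] is false.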

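Registering the layer

Register the layer so that Caffe can create it by its type name. INSTANTIATE_CLASS instantiates the template for float and double, REGISTER_LAYER_CLASS(Cosh) registers CoshLayer under the type string "Cosh", and STUB_GPU provides placeholder GPU functions in CPU_ONLY builds: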

#ifdef CPU_ONLY
STUB_GPU(CoshLayer);
#endif

INSTANTIATE_CLASS(CoshLayer);
REGISTER_LAYER_CLASS(Cosh);
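
To make the link between the type string and layer creation more concrete, here is a deliberately simplified sketch of a string-keyed factory. It only illustrates the general idea and is not Caffe's actual registry code; every name in it (ToyLayer, ToyCoshLayer, ToyRegistry, RegisterToyLayer, CreateToyLayer) is hypothetical.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <string>
#include <utility>

// Hypothetical stand-in for a layer base class.
struct ToyLayer {
  virtual ~ToyLayer() {}
  virtual const char* type() const = 0;
};

struct ToyCoshLayer : ToyLayer {
  const char* type() const override { return "Cosh"; }
};

using ToyCreator = std::function<std::unique_ptr<ToyLayer>()>;

// One global table from type string to creator function.
std::map<std::string, ToyCreator>& ToyRegistry() {
  static std::map<std::string, ToyCreator> registry;
  return registry;
}

// Registers a creator; a duplicate type string is treated as a bug.
bool RegisterToyLayer(const std::string& type, ToyCreator creator) {
  assert(ToyRegistry().count(type) == 0);
  ToyRegistry()[type] = std::move(creator);
  return true;
}

// Looks up the type string, as a network definition would supply it.
// Returns nullptr when nothing was registered under that name.
std::unique_ptr<ToyLayer> CreateToyLayer(const std::string& type) {
  auto it = ToyRegistry().find(type);
  if (it == ToyRegistry().end()) return nullptr;
  return it->second();
}

// Registration runs during static initialization, before main().
static const bool kCoshRegistered = RegisterToyLayer(
    "Cosh", [] { return std::unique_ptr<ToyLayer>(new ToyCoshLayer()); });
```

In this sketch the lookup is case-sensitive, so CreateToyLayer("cosh") returns nullptr while CreateToyLayer("Cosh") succeeds. That is the reason the string returned by type() and the registered name should agree exactly.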

The complete implementation is as follows:

#include <cmath>
#include <vector>

#include "caffe/layers/cosh_layer.hpp"
#include "caffe/util/math_functions.hpp"

namespace caffe {

// Cosh has no layer-specific parameters, so there is nothing to set up.
template <typename Dtype>
void CoshLayer<Dtype>::LayerSetUp(
    const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top) {
}

template <typename Dtype>
void CoshLayer<Dtype>::Reshape(
    const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top) {
  // Give the output blob the same shape as the input blob.
  top[0]->ReshapeLike(*bottom[0]);
}

// Element-wise cosh(x) = (e^x + e^(-x)) / 2.
template <typename Dtype>
void CoshLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  const Dtype* bottom_data = bottom[0]->cpu_data();
  // The output is written to, so a mutable pointer is required.
  Dtype* top_data = top[0]->mutable_cpu_data();
  const int count = bottom[0]->count();
  for (int i = 0; i < count; ++i) {
    top_data[i] = (std::exp(bottom_data[i]) + std::exp(-bottom_data[i])) / 2;
  }
}

#ifndef CPU_ONLY
// No CUDA kernel is provided, so the GPU entry point reuses the CPU code.
template <typename Dtype>
void CoshLayer<Dtype>::Forward_gpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  this->Forward_cpu(bottom, top);
}
#endif  // CPU_ONLY

template <typename Dtype>
void CoshLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
  // Intentionally empty: no gradient is computed in this example.
}

#ifndef CPU_ONLY
template <typename Dtype>
void CoshLayer<Dtype>::Backward_gpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
  // Intentionally empty: no gradient is computed in this example.
}
#endif  // CPU_ONLY

// In CPU_ONLY builds the GPU functions are replaced by stubs.
#ifdef CPU_ONLY
STUB_GPU(CoshLayer);
#endif

INSTANTIATE_CLASS(CoshLayer);
REGISTER_LAYER_CLASS(Cosh);

}  // namespace caffe

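Because this layer needs no parameters, there is no need to modify caffe.proto to add a layer-specific message. If a custom layer does need parameters, a corresponding message must be added to LayerParameter. Readers unfamiliar with LayerParameter can refer to 《剖析Caffe源码之Net---NetParameter参数》.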
