以resnet作为前置网络的ssd目标提取检测

最新推荐文章于 2024-08-17 20:13:54 发布

girafffeee

最新推荐文章于 2024-08-17 20:13:54 发布

阅读量2.8w

点赞数 10

文章标签： resnet ssd 目标检测神经网络 caffe

本文链接：https://blog.csdn.net/zhangjunbob/article/details/53119959

版权

该博客介绍了如何使用ResNet作为前置网络进行目标检测。首先，通过预训练ResNet模型在ImageNet数据集上生成lmdb数据。接着，详细描述了编写solver和prototxt文件以进行训练。最后，讨论了如何将预训练的ResNet模型接入SSD网络，进行目标检测的finetuning，并提供了训练SSD网络的步骤。

摘要由CSDN通过智能技术生成

以resnet作为前置网络的ssd目标提取检测

1.目标

本文的目标是将resnet结构作为前置网络，在imagenet数据集上进行预训练，随后将ssd目标提取检测网络（一部分）接在resnet前置网络之后，形成一个完整的ssd网络。

ssd网络下载和配置参考点击打开链接

2.resnet前置网络pretrain

2.1 利用imagenet数据生成lmdb，采用create_imagenet.sh生成，内容如下：

#!/usr/bin/env sh
# Create the imagenet lmdb inputs
# N.B. set the path to the imagenet train + val data dirs
set -e

EXAMPLE=models/resnet
DATA=/home/jzhang/data/VOCdevkit/VOC2007
TOOLS=build/tools

TRAIN_DATA_ROOT=/home/jzhang/data/VOCdevkit/VOC2007/JPEGImages/


# Set RESIZE=true to resize the images to 256x256. Leave as false if images have
# already been resized using another tool.
RESIZE=true
if $RESIZE; then
  RESIZE_HEIGHT=224
  RESIZE_WIDTH=224
else
  RESIZE_HEIGHT=0
  RESIZE_WIDTH=0
fi

if [ ! -d "$TRAIN_DATA_ROOT" ]; then
  echo "Error: TRAIN_DATA_ROOT is not a path to a directory: $TRAIN_DATA_ROOT"
  echo "Set the TRAIN_DATA_ROOT variable in create_imagenet.sh to the path" \
       "where the ImageNet training data is stored."
  exit 1
fi



echo "Creating train lmdb..."

GLOG_logtostderr=1 $TOOLS/convert_imageset \
    --resize_height=$RESIZE_HEIGHT \
    --resize_width=$RESIZE_WIDTH \
    --shuffle \
    $TRAIN_DATA_ROOT \
    $DATA/train.txt \
    $EXAMPLE/resnet_train_lmdb



echo "Done."

生成的过程采用TRAIN_DATA_ROOT下的图片，具体的图片目录在train.txt中：

train.txt的内容大致如下：

000001.jpg 0
000002.jpg 1
000003.jpg 2
000004.jpg 3
000005.jpg 4
000006.jpg 5
000007.jpg 6
000008.jpg 7
000009.jpg 8
000010.jpg 9

前面的为 TRAIN_DATA_ROOT下的图片文件名，后面的数字代表其标签label。

运行create_imagenet.sh后就会在EXAMPLE目录下生成lmdb文件夹，其中包含data.mdb和lock.mdb。这些都是caffe需要使用的数据格式。

2.2 编写solver和prototxt

先写各层网络结构的定义res_pretrain.prototxt：

name: "ResNet-50"

layer {
  name: "imagenet"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TRAIN
  }
  
  data_param {
    source: "models/resnet/resnet_train_lmdb"         //刚才产生的train的lmdb
    batch_size: 8
    backend: LMDB
  }
}
layer {
  name: "imagenet"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TEST
  }
  
  data_param {
    source: "models/resnet/resnet_test_lmdb"          //同理可以产生的test的lmdb
    batch_size: 1
    backend: LMDB
  }
}


/
                 resnet结构                              
/

layer {
	bottom: "data"
	top: "conv1"
	name: "conv1"
	type: "Convolution"
	convolution_param {
		num_output: 64
		kernel_size: 7
		pad: 3
		stride: 2
	}
}

layer {
	bottom: "conv1"
	top: "conv1"
	name: "bn_conv1"
	type: "BatchNorm"
	batch_norm_param {
		use_global_stats: true
	}
}

layer {
	bottom: "conv1"
	top: "conv1"
	name: "scale_conv1"
	type: "Scale"
	scale_param {
		bias_term: true
	}
}

layer {
	bottom: "conv1"
	top: "conv1"
	name: "conv1_relu"
	type: "ReLU"
}

layer {
	bottom: "conv1"
	top: "pool1"
	name: "pool1"
	type: "Pooli