Fine-tuning Caffe for binary classification: creating train.txt and val.txt

Author: 倒鸡毛的臭小子

Column: 深度学习之caffe情节 (Deep Learning: Caffe)

Article link: https://blog.csdn.net/banluqingchun/article/details/56845918

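Earlier I used the second example bundled with Caffe to learn fine-tuning and got a rough idea of what fine-tuning is. I did both binary and multi-class classification with it, but because that example is implemented in a notebook, the classification workflow was still hazy to me even after I got it running.
So afterwards I went through the whole classification process systematically. Without further ado, here is the procedure.
Note: my machine's performance is limited, so no GPU acceleration was used; everything here runs on the CPU.
[References]
Article series: http://blog.csdn.net/sinat_30071459/article/details/51613304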
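The process breaks down into three main steps, followed by some extra scripts:
1. Building the dataset
  (1) Create the txt files
  (2) Convert to LMDB format
  (3) Compute the mean file
2. Training the model
  (1) Edit the *train_val.prototxt network definition
  (2) Edit the *solver.prototxt configuration file
  (3) Edit and run train.sh
3. Testing
  (1) Edit deploy.prototxt
  (2) Test a single image
  (3) Test multiple images
4. Useful scripts
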
======================================================================
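1. Building the dataset
The key point of this step is to write each image's path, file name, and label into a txt file. Tutorials often generate train.txt, train_val.txt, val.txt, test.txt, and labels.txt, but in my view train.txt and val.txt are usually enough. There is no need to prepare a test list in advance: to test an image, just put it somewhere you can find it, as long as it was not used for training.
PS: so three files are needed: train.txt, val.txt, and labels.txt.
(1) Create the txt files
Prepare two classes of images and put them in two separate folders, for example false and true. Paste the code below into a text file, change its extension to .sh, and run it. The usual form is: sh **.sh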

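# Recursively list every file under $1 and append "<path> <label>" to $NAME.
deepls(){
for x in "$1/"*
do
    #echo $x
    if [ -f "$x" ]
    then
        # cut keeps the first 8 '/'-separated fields of the "<path> <label>" line
        echo "$x $I"|cut -d '/' -f1-8 >> "$NAME"
    fi
    if [ -d "$x" ]
    then
        (deepls "$x")
        # after each subdirectory the label is incremented by one
        I=`expr $I + 1`
    fi
done
}
I=0                                               # label for this class
DEST_PATH="/home/name/caffe/secondclassify/true"  # folder holding this class's images
NAME="./train1.txt"                               # output list; >> appends, so delete it before re-running
deepls "$DEST_PATH"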

This produces train1.txt, which lists only one of the two image classes.
In the same way, generate train2.txt for the other class.

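# Same helper as above, run for the second class.
deepls(){
for x in "$1/"*
do
    #echo $x
    if [ -f "$x" ]
    then
        # cut keeps the first 8 '/'-separated fields of the "<path> <label>" line
        echo "$x $I"|cut -d '/' -f1-8 >> "$NAME"
    fi
    if [ -d "$x" ]
    then
        (deepls "$x")
        # after each subdirectory the label is incremented by one
        I=`expr $I + 1`
    fi
done
}
I=1                                                # label for this class
DEST_PATH="/home/name/caffe/secondclassify/false"  # folder holding this class's images
NAME="./train2.txt"                                # output list; >> appends, so delete it before re-running
deepls "$DEST_PATH"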

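Notes:
Changing the field range in -f1-8 changes how many '/'-separated components of the image path are kept in the txt file.
Changing the value of I, for example I=0 or I=1, sets the label.
Each line of the resulting .txt file holds an image path followed by its label. [Figure: sample of the generated .txt file]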
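
The same listing can also be produced without the shell helper. The sketch below is a minimal Python equivalent, assuming one folder per class. The function name `write_image_list` and the extension filter are illustrative choices and do not come from the original scripts.

```python
import os


def write_image_list(class_dir, label, out_path, exts=(".jpg", ".jpeg", ".png")):
    """Append one '<absolute image path> <label>' line per image under class_dir.

    Returns the number of lines written. Like the shell version, the output
    file is opened in append mode, so remove it before re-running.
    """
    count = 0
    with open(out_path, "a") as out:
        for root, _dirs, files in sorted(os.walk(class_dir)):
            for name in sorted(files):
                if name.lower().endswith(exts):  # skip non-image files
                    path = os.path.abspath(os.path.join(root, name))
                    out.write("%s %d\n" % (path, label))
                    count += 1
    return count
```

With the folder layout used in this post, it would be called once per class, for example `write_image_list("/home/name/caffe/secondclassify/true", 0, "train1.txt")` and `write_image_list("/home/name/caffe/secondclassify/false", 1, "train2.txt")`.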
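Merge the contents of train1.txt and train2.txt into a single file named train.txt. Then take a number of label-0 and label-1 lines out of train.txt and put them into a new file, val.txt, to serve as the validation set; removing them from train.txt keeps the two sets from overlapping. For how many to take, see this Baidu Zhidao answer: https://zhidao.baidu.com/question/1834153784184657700.h
In machine learning and pattern recognition, the samples are generally divided into three independent parts: a training set, a validation set, and a test set. The training set is used to fit the model, the validation set is used to choose the network structure or the parameters that control model complexity, and the test set measures the performance of the finally selected model. A typical split puts 50% of the samples in the training set and 25% in each of the other two, with all three drawn at random from the samples.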
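
The manual split can be automated. The following is a minimal sketch, assuming each line has the form `<path> <label>`; the function name `split_train_val`, the default `val_fraction=0.25`, and the fixed seed are assumptions made for illustration. It samples each label separately so that both classes appear in val.txt in proportion.

```python
import random


def split_train_val(lines, val_fraction=0.25, seed=0):
    """Split '<path> <label>' lines into (train, val) lists, label by label."""
    by_label = {}
    for line in lines:
        line = line.strip()
        if line:
            label = line.rsplit(" ", 1)[1]  # the label is the last field
            by_label.setdefault(label, []).append(line)

    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, val = [], []
    for label in sorted(by_label):
        items = by_label[label]
        rng.shuffle(items)
        n_val = int(round(len(items) * val_fraction))
        val.extend(items[:n_val])    # these lines go to val.txt
        train.extend(items[n_val:])  # the rest stay in train.txt
    return train, val
```

The two returned lists are disjoint, so writing them out one line each gives a train.txt and val.txt that share no images.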
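labels.txt can usually be written by hand, as in this binary case: one class name per line, in label order, so the first line names label 0 and the second names label 1. [Figure: example labels.txt]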
(2) Convert to LMDB format
Edit create_imagenet.sh (originally located in /home/name/caffe/examples/imagenet/), then run the script.

#!/usr/bin/env sh
# Create the imagenet lmdb inputs
# N.B. set the path to the imagenet train + val data dirs
set -e

# where the LMDBs are saved
EXAMPLE=/home/name/caffe/secondclassify
# where train.txt and val.txt are
DATA=/home/name/caffe/secondclassify
TOOLS=/home/name/caffe/build/tools
# where the images are: train.txt and val.txt already hold absolute paths, so the root is /
TRAIN_DATA_ROOT=/
VAL_DATA_ROOT=/

# Set RESIZE=true to resize the images to 256x256. Leave as false if images have
# already been resized using another tool.
RESIZE=true
if $RESIZE; then
  RESIZE_HEIGHT=256
  RESIZE_WIDTH=256
else
  RESIZE_HEIGHT=0
  RESIZE_WIDTH=0
fi

if [ ! -d "$TRAIN_DATA_ROOT" ]; then
  echo "Error: TRAIN_DATA_ROOT is not a path to a directory: $TRAIN_DATA_ROOT"
  echo "Set the TRAIN_DATA_ROOT variable in create_imagenet.sh to the path" \
       "where the ImageNet training data is stored."
  exit 1
fi

if [ ! -d "$VAL_DATA_ROOT" ]; then
  echo "Error: VAL_DATA_ROOT is not a path to a directory: $VAL_DATA_ROOT"
  echo "Set the VAL_DATA_ROOT variable in create_imagenet.sh to the path" \
       "where the ImageNet validation data is stored."
  exit 1
fi

echo "Creating train lmdb..."

GLOG_logtostderr=1 $TOOLS/convert_imageset \
    --resize_height=$RESIZE_HEIGHT \
    --resize_width=$RESIZE_WIDTH \
    --shuffle \
    $TRAIN_DATA_ROOT \
    $DATA/train.txt \
    $EXAMPLE/secondclassify_train_lmdb

echo "Creating val lmdb..."

GLOG_logtostderr=1 $TOOLS/convert_imageset \
    --resize_height=$RESIZE_HEIGHT \
    --resize_width=$RESIZE_WIDTH \
    --shuffle \
    $VAL_DATA_ROOT \
    $DATA/val.txt \
    $EXAMPLE/secondclassify_val_lmdb

echo "Done."
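
Before running the conversion it can help to check the list files, since a malformed line or a missing image only surfaces while convert_imageset is running. The sketch below is one possible check, assuming the list holds absolute paths (as it does here, with the data root set to /). The function name `check_image_list` and the `num_classes` parameter are hypothetical.

```python
import os


def check_image_list(list_path, num_classes=2):
    """Return (line_number, problem) tuples for a '<path> <label>' list file."""
    problems = []
    with open(list_path) as f:
        for lineno, line in enumerate(f, 1):
            line = line.rstrip("\n")
            if not line.strip():
                continue  # ignore blank lines
            parts = line.rsplit(" ", 1)
            if len(parts) != 2 or not parts[1].isdigit():
                problems.append((lineno, "expected '<path> <integer label>'"))
                continue
            path, label = parts[0], int(parts[1])
            if label >= num_classes:
                problems.append((lineno, "label %d out of range" % label))
            if not os.path.isfile(path):
                problems.append((lineno, "missing file: %s" % path))
    return problems
```

An empty result for both train.txt and val.txt means every line parses and every image exists on disk.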

(3) Compute the mean file
Edit make_imagenet_mean.sh (originally located in /home/name/caffe/examples/imagenet/), then run the script.

#!/usr/bin/env sh
# Compute the mean image from the imagenet training lmdb
# N.B. this is available in data/ilsvrc12
EXAMPLE=/home/name/caffe/secondclassify
DATA=/home/name/caffe/secondclassify
# relative path: run this script from /home/name/caffe
TOOLS=build/tools
rm -rf $DATA/secondclassify_mean.binaryproto
$TOOLS/compute_image_mean $EXAMPLE/secondclassify_train_lmdb \
  $DATA/secondclassify_mean.binaryproto
echo "Done."
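
For intuition about what the mean file holds: compute_image_mean averages the training images pixel by pixel, giving one mean image of shape channels x height x width. The sketch below reproduces only that arithmetic on in-memory NumPy arrays. It does not read an LMDB or write a binaryproto, and the function names are illustrative.

```python
import numpy as np


def mean_image(images):
    """Per-pixel mean of a batch shaped (N, C, H, W); the result is (C, H, W)."""
    stack = np.asarray(images, dtype=np.float64)
    return stack.mean(axis=0)


def channel_means(mean_img):
    """Collapse a (C, H, W) mean image to one value per channel."""
    return mean_img.mean(axis=(1, 2))
```

The per-channel form is the kind of three-value vector that the test scripts later pass to `transformer.set_mean`.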

2. Training the model
Copy solver.prototxt and train_val.prototxt from /home/name/caffe/models/bvlc_reference_caffenet into the folder you created, and rename them to secondclassify_solver.prototxt and secondclassify_train_val.prototxt.
(1) Open secondclassify_train_val.prototxt and change the following.

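1. name: "CaffeNet_secondclassify" (optional)
2. mean_file: "/home/name/caffe/secondclassify/secondclassify_mean.binaryproto" (TRAIN data layer)
3. source: "/home/name/caffe/secondclassify/secondclassify_train_lmdb" (TRAIN data layer)
4. mean_file: "/home/name/caffe/secondclassify/secondclassify_mean.binaryproto" (TEST data layer)
5. source: "/home/name/caffe/secondclassify/secondclassify_val_lmdb" (TEST data layer)
6. name: "fc8_secondclassify" (only the eighth layer, fc8, is replaced here, so every reference to fc8 must be renamed; Caffe copies pretrained weights by layer name, so the renamed layer starts from freshly initialized weights with the new output size)
7. num_output: 2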
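
The fc8 edits are easy to get partly wrong by hand, because the name appears both in the layer itself and in every layer that consumes its output. A minimal sketch of doing the rename with text substitution follows. The function name `retarget_fc8` is hypothetical, and it assumes that `num_output: 1000` occurs only in the fc8 layer, which holds for the reference CaffeNet definition but should be checked for other networks.

```python
import re


def retarget_fc8(prototxt_text, new_name="fc8_secondclassify", num_classes=2):
    """Rename every quoted "fc8" and shrink the 1000-way output to num_classes."""
    # Covers name:, top: and bottom: entries, which all quote the blob name.
    text = prototxt_text.replace('"fc8"', '"%s"' % new_name)
    # Assumption: only the final classifier layer has 1000 outputs.
    text = re.sub(r"num_output:\s*1000\b", "num_output: %d" % num_classes, text)
    return text
```

It takes the file contents as a string and returns the edited string, so the caller decides where to read from and write to.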

(2) Open secondclassify_solver.prototxt and change the following.
This covers the paths and every parameter; take the time to look up what each parameter means.

net: "/home/name/caffe/secondclassify/secondclassify_train_val.prototxt"
test_iter: 2
test_interval: 100
base_lr: 0.01
lr_policy: "step"
gamma: 0.1
stepsize: 100
display: 20
max_iter: 2000
momentum: 0.9
weight_decay: 0.0005
snapshot: 200
snapshot_prefix: "/home/name/caffe/secondclassify/secondclassify_caffenet_train"
solver_mode: CPU
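
To make the learning-rate settings concrete: with lr_policy "step", Caffe uses base_lr * gamma ^ floor(iter / stepsize). The sketch below evaluates that formula with the values from this solver file. The helper names are illustrative, and `images_per_test` only spells out that each test pass runs test_iter batches of the TEST data layer's batch_size, which is set in train_val.prototxt and not in the solver.

```python
def step_lr(iteration, base_lr=0.01, gamma=0.1, stepsize=100):
    """Learning rate under Caffe's "step" policy at a given iteration."""
    return base_lr * gamma ** (iteration // stepsize)


def images_per_test(test_iter, batch_size):
    """Number of validation images seen in one test pass."""
    return test_iter * batch_size
```

With these settings the rate drops tenfold every 100 iterations: 0.01 for iterations 0 to 99, 0.001 for 100 to 199, and so on. After a few hundred iterations it is close to zero, so a larger stepsize may be worth trying if training stalls early.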

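(3) Edit and run train.sh to start training (the file is originally located in /home/name/caffe/examples/imagenet/)
Fine-tune the eighth layer, starting from the pretrained model bvlc_reference_caffenet.caffemodel, which is located in /home/name/caffe/models/bvlc_reference_caffenet/. The command below uses relative paths, so it is run from /home/name/caffe and expects a copy of that model in secondclassify/.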

./build/tools/caffe train \
--solver=secondclassify/secondclassify_solver.prototxt \
--weights=secondclassify/bvlc_reference_caffenet.caffemodel

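3. Testing
(1) Edit deploy.prototxt
Essentially, rename fc8 to the name used during training and set num_output: 2. The scripts below load the file as secondclassify_deploy.prototxt.
(2) Test a single image (the multi-image code also works; just change the loop count)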

import os
import sys
import time

import numpy as np
import matplotlib.pyplot as plt

start = time.time()

plt.rcParams['figure.figsize'] = (10, 10)        # large images
plt.rcParams['image.interpolation'] = 'nearest'  # don't interpolate: show square pixels
plt.rcParams['image.cmap'] = 'gray'  # use grayscale output rather than a (potentially misleading) color heatmap

caffe_root = '/home/name/caffe/secondclassify/'  # project folder with the model, labels and test images
os.chdir(caffe_root)  # switch the working directory to the project folder
sys.path.insert(0, '/home/name/caffe/python')  # make pycaffe importable

import caffe

caffe.set_mode_cpu()

model_def = caffe_root + 'secondclassify_deploy.prototxt'
model_weights = caffe_root + 'secondclassify_caffenet_train_iter_2000.caffemodel'

net = caffe.Net(model_def,      # defines the structure of the model
                model_weights,  # contains the trained weights
                caffe.TEST)     # use test mode (e.g., don't perform dropout)

transformer = caffe.io.Transformer({'data': net.blobs['data'].data.shape})
transformer.set_transpose('data', (2,0,1))  # move image channels to outermost dimension
# ImageNet per-channel BGR mean; the per-channel mean of
# secondclassify_mean.binaryproto could be used here instead
transformer.set_mean('data', np.array([104,117,123]))  # subtract the mean value in each channel
transformer.set_raw_scale('data', 255)      # rescale from [0, 1] to [0, 255]
transformer.set_channel_swap('data', (2,1,0))  # swap channels from RGB to BGR

net.blobs['data'].reshape(1,  # batch size
                          3,   # 3-channel (BGR) images
                          227, 227)  # image size is 227x227
image = caffe.io.load_image(caffe_root + '111/10.jpg')
transformed_image = transformer.preprocess('data', image)
plt.imshow(image)
plt.show()  # blocks until the window is closed

# copy the image data into the memory allocated for the net
net.blobs['data'].data[...] = transformed_image
### perform classification
output = net.forward()

output_prob = output['prob'][0]  # the output probability vector for the first image in the batch
print('predicted class is: %d' % output_prob.argmax())

# load the class names (line i of labels.txt names label i)
labels_file = caffe_root + 'labels.txt'
if not os.path.exists(labels_file):
    raise IOError('labels file not found: ' + labels_file)

labels = np.loadtxt(labels_file, str, delimiter='\t')

print('output label: %s' % labels[output_prob.argmax()])
print(output_prob)

end = time.time()
print('Running time: %s seconds' % (end - start))
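
The Transformer calls in the script can be hard to picture. The sketch below reproduces in plain NumPy, in the order pycaffe applies them, what `transformer.preprocess` does with these settings: transpose, channel swap, raw scale, then mean subtraction. It assumes the image is already at the network's input size, so the resize that the real Transformer also performs is omitted. The function name `preprocess` is illustrative.

```python
import numpy as np


def preprocess(image_rgb01, mean_bgr=(104.0, 117.0, 123.0)):
    """Turn an (H, W, 3) RGB image with values in [0, 1] into Caffe's (3, H, W) BGR input."""
    chw = np.asarray(image_rgb01, dtype=np.float32).transpose(2, 0, 1)  # HxWxC -> CxHxW
    bgr = chw[[2, 1, 0], :, :]                                          # RGB -> BGR
    scaled = bgr * 255.0                                                # [0, 1] -> [0, 255]
    mean = np.asarray(mean_bgr, dtype=np.float32).reshape(3, 1, 1)
    return scaled - mean                                                # subtract per-channel mean
```

The order matters: the mean values are on the 0 to 255 scale and in BGR order, which is why scaling and channel swapping happen before the subtraction.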

(3) Test multiple images
The folder 111 contains 10 images; adjust the count to however many you want to test.

import os
import sys
import time

import numpy as np
import matplotlib.pyplot as plt

caffe_root = '/home/name/caffe/secondclassify/'  # project folder with the model, labels and test images
sys.path.insert(0, '/home/name/caffe/python')    # make pycaffe importable
import caffe
os.chdir(caffe_root)

net_file = caffe_root + 'secondclassify_deploy.prototxt'
caffe_model = caffe_root + 'secondclassify_caffenet_train_iter_2000.caffemodel'
net = caffe.Net(net_file, caffe_model, caffe.TEST)

# same preprocessing as in the single-image script
transformer = caffe.io.Transformer({'data': net.blobs['data'].data.shape})
transformer.set_transpose('data', (2,0,1))
transformer.set_mean('data', np.array([104,117,123]))
transformer.set_raw_scale('data', 255)
transformer.set_channel_swap('data', (2,1,0))
net.blobs['data'].reshape(1, 3, 227, 227)

image_dir = caffe_root + '111/'
labels = np.loadtxt(caffe_root + 'labels.txt', str, delimiter='\t')

num_images = 10  # the images are named 1.jpg ... 10.jpg
for i in range(num_images):
    start = time.time()
    img_path = image_dir + str(i + 1) + '.jpg'
    im = caffe.io.load_image(img_path, color=True)
    net.blobs['data'].data[...] = transformer.preprocess('data', im)
    out = net.forward()
    predictions = out['prob']
    print(predictions)
    predicted = predictions.argmax()  # index of the most probable class
    print(predicted)
    prob = 100 * predictions[0][predicted]
    print('%d.jpg style: %s  (prob= %5.2f%%)' % (i + 1, labels[predicted], prob))
    end = time.time()
    print('times = %s' % (end - start))
    plt.figure(i + 1)
    plt.title('Image Style: %s (prob= %5.2f%%) times = %s' % (labels[predicted], prob, end - start),
              fontsize=15, color='r')
    plt.imshow(im)
    plt.pause(5)  # show each result for 5 seconds
    plt.close()
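
The loop relies on the images being named 1.jpg, 2.jpg and so on, with the count hard-coded. One way to lift that restriction is to collect whatever images the folder contains, as in this minimal sketch. The helper name `list_test_images` and its sort rule are assumptions made for illustration.

```python
import glob
import os


def list_test_images(folder, pattern="*.jpg"):
    """Return image paths in folder, numeric file names first in numeric order."""
    def sort_key(path):
        stem = os.path.splitext(os.path.basename(path))[0]
        # 2.jpg sorts before 10.jpg; non-numeric names follow alphabetically
        return (0, int(stem), "") if stem.isdigit() else (1, 0, stem)

    return sorted(glob.glob(os.path.join(folder, pattern)), key=sort_key)
```

The loop would then iterate over the returned paths directly, with no fixed image count.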


4. Useful scripts
(1) Resizing images

#!/bin/bash
# resize_image.sh
# Used to resize the images
help_message()
{
        echo "Usage: resize_image -d <dir> -w <width> -h <height>"
        echo "-d The directory contains the images"
        echo "-w The width after convert"
        echo "-h The height after convert"
        exit 1
}
if [ $# -lt 6 ]
then
        help_message
fi
while getopts d:w:h: opt
do
        case "$opt" in
                d) dir="$OPTARG";;
                w) w="$OPTARG";;
                h) h="$OPTARG";;
                *) help_message;;
        esac
done
if [ -d "$dir" ]
then
        cd "$dir" || exit 1
        for image in *.jpg
        do
                [ -f "$image" ] || continue   # no .jpg files matched
                echo "Convert file $image"
                # ImageMagick: WxH keeps the aspect ratio; append ! (e.g. 256x256!) to force the exact size
                convert -resize "${w}x${h}" "$image" "$image"
        done
        nautilus .   # open the folder in the GNOME file manager
else
        echo "The directory $dir does not exist"
        exit 1
fi

(2) Batch-renaming images

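#!/bin/bash
#written by mofansheng@2016-02-17
# Append _p to the base name of every file in $path (a.jpg -> a_p.jpg)
path=/goodboy
cd "$path" || exit 1   # stop instead of renaming files in the wrong folder
for file in *
do
 [ -f "$file" ] || continue
 new=$(echo "$file" | sed 's/\(.*\)\.\(.*\)/\1_p.\2/')
 [ "$new" = "$file" ] || mv -- "$file" "$new"   # names without a dot are left alone
done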
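
For readers who prefer to stay in Python, the rename can be sketched as follows. It is a rough equivalent of the shell script, not a port of it: the function name `add_suffix` is hypothetical, files without an extension are skipped, and an existing target name is never overwritten.

```python
import os


def add_suffix(folder, suffix="_p"):
    """Rename name.ext to name<suffix>.ext for every file in folder.

    Returns a list of (old_name, new_name) pairs for the files renamed.
    """
    renamed = []
    for name in sorted(os.listdir(folder)):  # snapshot taken before any rename
        src = os.path.join(folder, name)
        stem, ext = os.path.splitext(name)
        if not os.path.isfile(src) or not ext:
            continue  # skip subfolders and names without an extension
        new_name = stem + suffix + ext
        dst = os.path.join(folder, new_name)
        if os.path.exists(dst):
            continue  # never overwrite an existing file
        os.rename(src, dst)
        renamed.append((name, new_name))
    return renamed
```

Calling `add_suffix("/goodboy")` would mirror the shell script's hard-coded path.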