训练LCNN

最新推荐文章于 2024-09-13 08:14:55 发布

mathilde27

最新推荐文章于 2024-09-13 08:14:55 发布

阅读量2.1k

点赞数

分类专栏：深度学习

本文链接：https://blog.csdn.net/Chunying27/article/details/55103065

版权

27 篇文章 0 订阅

订阅专栏

使用CASIA-WebFace数据集进行人脸识别训练，包含10575人的493456张图片。训练过程调整学习率为0.001并采用inv方式衰减，Dropout设置为0.7，最终在GTX980上训练两周达到较高准确率。

摘要由CSDN通过智能技术生成

144x144: 眼镜到嘴巴48pixel, 眼睛距离顶部48pixel
test: 128x128,  眼镜到嘴巴48pixel, 眼睛距离顶部40pixel

对学习率的设置 :
初始学习率设置为0.01，训练过程中，发现初始loss为9.3，约为-log(1/10575)正常，稍加训练后，loss上升到80+，说明学习率设置过大，调整为0.001，并以inv方式进行衰减。发现loss逐渐衰减了。

全连接层Dropout设置为0.7。不同层SGD的参数也不一样，前面除了fc2层，momentum设为0.9，weight decay为5e-4，fc2层为了防止过拟合，weight decay为5e-3。learning rate从1e-3降到5e-5。最终在GTX980上训练了两周。

如何用snapshot继续训练

Firstly, you need to generate snapshots. This can be done by specifing in solver.prototxt file.

snapshot: 500
snapshot_prefix: "snapshot/"

This means that it will take a snapshot every 500 iterations. And you will see snapshots in the your defined folder snapshot_prefix :

_iter_500.solverstate 
_iter_500.caffemodel 

_iter_1000.solverstate 
_iter_1000.caffemodel 

...

Once you have the snapshot, you can specify to use the snapshot in the training script.

$caffe train -solver="xxx.prototxt" –snapshot=cifar10_quick_iter_3000.solvers

This will start the training at the 3000th iteration

关注

专栏目录