Parameters in Caffe


dengyue470288


Original link: http://www.cnblogs.com/taokongcn/p/4549504.html


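base_lr: the initial learning rate.

momentum: the weight given to the previous update when computing the current one.

weight_decay: the coefficient of the regularization term.

These three parameters are the core of SGD. For base_lr and momentum, see: http://caffe.berkeleyvision.org/tutorial/solver.html

For weight_decay, see: http://stats.stackexchange.com/questions/29130/difference-between-neural-net-weight-decay-and-learning-rate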
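To make the interplay of the three parameters concrete, here is a minimal sketch of one SGD step with momentum and L2 weight decay. It assumes the update form V_next = momentum * V - base_lr * gradient and W_next = W + V_next, with the regularization term folded into the gradient. The function name `sgd_step` and the use of plain Python lists are illustrative choices, not part of Caffe.

```python
def sgd_step(weights, grads, velocity, base_lr=0.01, momentum=0.9, weight_decay=0.0005):
    """One SGD update with momentum and L2 weight decay (illustrative sketch).

    weights, grads, velocity are equal-length lists of floats.
    Returns the new weights and the new velocity (the update just applied).
    """
    new_weights, new_velocity = [], []
    for w, g, v in zip(weights, grads, velocity):
        # L2 regularization contributes weight_decay * w to the gradient.
        g_total = g + weight_decay * w
        # Blend the previous update (scaled by momentum) with the current gradient step.
        v_next = momentum * v - base_lr * g_total
        new_velocity.append(v_next)
        new_weights.append(w + v_next)
    return new_weights, new_velocity


if __name__ == "__main__":
    w, v = [1.0], [0.0]
    for _ in range(3):
        # A constant gradient of 0.5 is assumed purely for demonstration.
        w, v = sgd_step(w, [0.5], v, base_lr=0.1, momentum=0.9, weight_decay=0.0)
        print(w, v)
```

With a constant gradient, the size of each update grows from step to step because the momentum term keeps adding a fraction of the previous update.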

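lr_policy: the learning rate update rule, controlled by gamma, power, and step (the `stepsize` field in the solver). See the comment in the Caffe source code: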

// Return the current learning rate. The currently implemented learning rate
// policies are as follows:
//    - fixed: always return base_lr.
//    - step: return base_lr * gamma ^ (floor(iter / step))
//    - exp: return base_lr * gamma ^ iter
//    - inv: return base_lr * (1 + gamma * iter) ^ (- power)
//    - multistep: similar to step but it allows non uniform steps defined by
//      stepvalue
//    - poly: the effective learning rate follows a polynomial decay, to be
//      zero by the max_iter. return base_lr * (1 - iter/max_iter) ^ (power)
//    - sigmoid: the effective learning rate follows a sigmoid decay
//      return base_lr * (1/(1 + exp(-gamma * (iter - stepsize))))
//
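The policies in the comment above can be transcribed into a small Python function for experimenting with schedules. This is a sketch of the formulas as written, not Caffe's own code. The `multistep` branch assumes the rate is multiplied by gamma once for every entry of `stepvalues` that the iteration has reached, and the function name and argument defaults are hypothetical.

```python
import math


def learning_rate(policy, it, base_lr, gamma=0.0, power=0.0,
                  stepsize=1, max_iter=1, stepvalues=()):
    """Return the learning rate at iteration `it` for the given policy name."""
    if policy == "fixed":
        return base_lr
    if policy == "step":
        # gamma is applied once per completed block of `stepsize` iterations.
        return base_lr * gamma ** (it // stepsize)
    if policy == "exp":
        return base_lr * gamma ** it
    if policy == "inv":
        return base_lr * (1.0 + gamma * it) ** (-power)
    if policy == "multistep":
        # Assumption: count how many of the non-uniform step boundaries were reached.
        passed = sum(1 for s in stepvalues if it >= s)
        return base_lr * gamma ** passed
    if policy == "poly":
        return base_lr * (1.0 - it / max_iter) ** power
    if policy == "sigmoid":
        return base_lr * (1.0 / (1.0 + math.exp(-gamma * (it - stepsize))))
    raise ValueError("unknown lr_policy: %r" % policy)


if __name__ == "__main__":
    for it in (0, 1000, 2500, 5000):
        print(it, learning_rate("step", it, base_lr=0.01, gamma=0.1, stepsize=1000))
```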

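lr_mult: a layer with weights and a bias takes two lr_mult values that scale the learning rate for that layer. The first applies to the layer's weights, whose effective learning rate is base_lr * lr_mult; the second sets the learning rate of the bias in the same way.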
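As a small worked example of the multiplication, the sketch below computes the per-parameter rates for one layer. The helper name `effective_learning_rates` and the multiplier values 1 and 2 are assumptions chosen for illustration.

```python
def effective_learning_rates(current_lr, lr_mults):
    """Scale the solver's current learning rate by each lr_mult of a layer."""
    return [current_lr * m for m in lr_mults]


if __name__ == "__main__":
    # First multiplier: weights; second multiplier: bias (illustrative values).
    weight_lr, bias_lr = effective_learning_rates(0.01, [1.0, 2.0])
    print("weights:", weight_lr, "bias:", bias_lr)
```

Setting a multiplier to 0 gives that parameter an effective learning rate of 0, so it is left unchanged during training.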

xavier: a weight-initialization trick; see "Understanding the difficulty of training deep feedforward neural networks" (Glorot and Bengio).
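A minimal sketch of the idea follows, using the normalized uniform initialization from that paper: weights are drawn from U(-a, a) with a = sqrt(6 / (fan_in + fan_out)). Caffe's own `xavier` filler may normalize differently (for instance by fan-in only), so the bound used here is an assumption, and the function name `xavier_uniform` is hypothetical.

```python
import math
import random


def xavier_uniform(fan_in, fan_out, seed=None):
    """Return a fan_out x fan_in weight matrix drawn from U(-a, a).

    a = sqrt(6 / (fan_in + fan_out)), which keeps the variance of
    activations and gradients roughly constant from layer to layer.
    """
    rng = random.Random(seed)
    bound = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-bound, bound) for _ in range(fan_in)]
            for _ in range(fan_out)]


if __name__ == "__main__":
    weights = xavier_uniform(fan_in=4, fan_out=3, seed=0)
    for row in weights:
        print(row)
```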

Reposted from: https://www.cnblogs.com/taokongcn/p/4549504.html
