1. scaffolds the optimization bookkeeping and creates the training network for learning and test network(s) for evaluation.
2. iteratively optimizes by calling forward / backward and updating parameters
3. (periodically) evaluates the test networks
4. snapshots the model and solver state throughout the optimization
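Conceptually, the solver interleaves these responsibilities in a single loop. Below is a minimal sketch of that control flow in Python; the helper names (`update`, `test`, `snapshot`) and the interval parameters are illustrative assumptions, not Caffe's actual API.

<pre>
# Sketch of the solver loop; all names are illustrative, not Caffe's API.
def solve(net, max_iter, test_interval, snapshot_interval):
    for it in range(max_iter):
        loss = net.forward()      # compute the loss on the current batch
        net.backward()            # compute parameter gradients
        update(net, it)           # apply the solver update (e.g. SGD)
        if it % test_interval == 0:
            test(net)             # periodically run the test network(s)
        if it % snapshot_interval == 0:
            snapshot(net, it)     # save model and solver state
</pre>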
Solver Parameters
<pre>
base_lr: 0.01     # begin training at a learning rate of 0.01 = 1e-2
lr_policy: "step" # learning rate policy: drop the learning rate in "steps"
                  # by a factor of gamma every stepsize iterations
gamma: 0.1        # drop the learning rate by a factor of 10
                  # (i.e., multiply it by a factor of gamma = 0.1)
stepsize: 100000  # drop the learning rate every 100K iterations
max_iter: 350000  # train for 350K iterations total
momentum: 0.9
</pre>
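With these values, training starts at a learning rate of 1e-2, which is multiplied by 0.1 every 100K iterations. A quick check of the resulting schedule, using plain Python arithmetic rather than any Caffe API:

<pre>
base_lr, gamma, stepsize = 0.01, 0.1, 100000

def step_lr(it):
    # step policy: base_lr * gamma ^ floor(iter / stepsize)
    return base_lr * gamma ** (it // stepsize)

for it in (0, 100000, 200000, 300000):
    print(it, step_lr(it))
# lr is 0.01 for iterations 0-100K, 0.001 for 100K-200K, 0.0001 for
# 200K-300K, and 1e-5 for the final 300K-350K (modulo float rounding)
</pre>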
<pre>
The learning rate decay policy. The currently implemented learning rate
policies are as follows:
  - fixed: always return base_lr.
  - step: return base_lr * gamma ^ (floor(iter / stepsize))
  - exp: return base_lr * gamma ^ iter
  - inv: return base_lr * (1 + gamma * iter) ^ (-power)
  - multistep: similar to step, but allows non-uniform steps defined by
    stepvalue
  - poly: the effective learning rate follows a polynomial decay, reaching
    zero by max_iter. return base_lr * (1 - iter/max_iter) ^ power
  - sigmoid: the effective learning rate follows a sigmoid decay.
    return base_lr * (1 / (1 + exp(-gamma * (iter - stepsize))))
where base_lr, max_iter, gamma, stepsize, stepvalue and power are defined
in the solver parameter protocol buffer, and iter is the current iteration.
</pre>
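For reference, these policies translate directly into code. The sketch below implements the formulas as written above; the function signature and the `stepvalues` sequence are assumptions for illustration, not Caffe's interface.

<pre>
import math

def lr(policy, it, base_lr, gamma=0.1, power=1.0, stepsize=100000,
       max_iter=350000, stepvalues=()):
    """Evaluate the learning rate at iteration `it` under the named policy."""
    if policy == "fixed":
        return base_lr
    if policy == "step":
        return base_lr * gamma ** (it // stepsize)
    if policy == "exp":
        return base_lr * gamma ** it
    if policy == "inv":
        return base_lr * (1 + gamma * it) ** -power
    if policy == "multistep":
        # like step, but the rate drops at each (non-uniform) stepvalue
        return base_lr * gamma ** sum(1 for s in stepvalues if it >= s)
    if policy == "poly":
        return base_lr * (1 - it / max_iter) ** power
    if policy == "sigmoid":
        return base_lr / (1 + math.exp(-gamma * (it - stepsize)))
    raise ValueError("unknown lr_policy: " + policy)
</pre>

Sweeping `it` from 0 to max_iter for each policy reproduces the decay curves described above.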