Training Protocol
-
backbone:
ResNet-101 or modified aligned Xception -
pretrain:
ImageNet-1K -
dataset:
PASCAL VOC 2012 (20 foreground object classes, 1 background class)
10582 (trainaug) training images, 1449 (val), 1456 (test) -
lr schedule:
“poly” policy (initial lr:
0.007)
initial learning rate is multiplied by ( 1 − i t e r m a x _ i t e r )