1、msra 不适合InnerProduct, 并且是配合Relu使用
[He, Zhang, Ren and Sun 2015]: Specifically accounts for ReLU nonlinearities
2、xavier 比较常见
[Bengio and Glorot 2010]: Understanding the difficulty of training deep feedforward neural networks