Understanding the softmax loss function https://blog.csdn.net/as472780551/article/details/86554478
L1, L2, and smooth L1 loss https://blog.csdn.net/yang_daxia/article/details/91360606
Differences between L1 and L2 loss functions and regularization https://www.cnblogs.com/jclian91/p/9824310.html