Training Neural Networks (Part 1): Batch Normalization

Batch Normalization
Babysitting the Learning Process
    Step 1: Preprocess the data
    Step 2: Choose the network architecture
    Step 3: Double-check that the loss is reasonable
Summary