MindSpore (昇思) 25-Day Check-in Training Camp, Day 14 | ShuffleNet Image Classification

Note: running in static graph mode on GPU devices is not supported.

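Introduction to ShuffleNet
ShuffleNetV1 is a computation-efficient CNN model proposed by Megvii (旷视科技). Like MobileNet and SqueezeNet, it mainly targets mobile devices, so its design goal is to reach the best possible accuracy with limited computing resources. The core of ShuffleNetV1's design is two operations, Pointwise Group Convolution and Channel Shuffle, which greatly reduce the model's computational cost while maintaining accuracy. Like MobileNet, ShuffleNetV1 therefore compresses and accelerates the model by designing a more efficient network structure.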

For more details on ShuffleNet, see the ShuffleNet paper.

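As shown in the figure below, ShuffleNet brings its parameter count close to the minimum while keeping accuracy reasonably high, so it runs fast and each parameter contributes a great deal to model accuracy.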

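Model Architecture
ShuffleNet's most distinctive feature is that it rearranges channels to overcome the drawbacks of Group Convolution. By modifying the Bottleneck unit of ResNet, it reaches relatively high accuracy with a small amount of computation.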

Pointwise Group Convolution
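The principle of Group Convolution is shown in the figure below. Compared with an ordinary convolution, each kernel in a group convolution has size in_channels/g*k*k, there are g groups, and all groups together hold (in_channels/g*k*k)*out_channels parameters, which is 1/g of the parameters of an ordinary convolution. Each kernel processes only part of the channels of the input feature map, which lowers the parameter count, while the number of output channels still equals the number of kernels.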
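The 1/g saving can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original tutorial: `conv_params` is an illustrative helper, bias terms are ignored, and the channel counts are example values.

```python
def conv_params(in_channels, out_channels, k, groups=1):
    """Weight count of a k x k convolution, ignoring bias terms."""
    if in_channels % groups or out_channels % groups:
        raise ValueError("channel counts must be divisible by groups")
    # Each of the out_channels kernels only sees in_channels / groups inputs.
    return (in_channels // groups) * k * k * out_channels


dense = conv_params(240, 240, 3)              # ordinary convolution
grouped = conv_params(240, 240, 3, groups=3)  # same layer split into 3 groups
print(dense, grouped, dense // grouped)       # 518400 172800 3
```

With g = 3 the grouped layer holds exactly one third of the weights of the ordinary layer.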

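Depthwise Convolution (the per-channel step of a depthwise separable convolution) sets the number of groups g equal to the number of input channels in_channels and convolves each input channel separately, so each kernel processes exactly one channel. With a kernel size of 1*k*k, the kernel parameter count is in_channels*k*k, and the resulting feature maps have the same number of channels as the input.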

Pointwise Group Convolution builds on group convolution by setting the kernel size of every group to 1×1, so the kernel parameter count is (in_channels/g*1*1)*out_channels.
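Depthwise and pointwise group convolution are both special cases of the same group-convolution formula. The sketch below compares them for one illustrative layer; the helper name `group_conv_params` and the width of 240 channels are assumptions made for the example, and bias terms are ignored.

```python
def group_conv_params(in_channels, out_channels, k, groups):
    """(in_channels / g * k * k) * out_channels, ignoring bias terms."""
    return (in_channels // groups) * k * k * out_channels


c = 240  # example channel count

# Depthwise: g equals the channel count, one 3x3 kernel per channel.
depthwise = group_conv_params(c, c, 3, groups=c)
# Dense pointwise: an ordinary 1x1 convolution (g = 1).
pointwise_dense = group_conv_params(c, c, 1, groups=1)
# Pointwise group convolution: 1x1 kernels split into g = 3 groups.
pointwise_group = group_conv_params(c, c, 1, groups=3)

print(depthwise, pointwise_dense, pointwise_group)  # 2160 57600 19200
```

In this example the dense 1×1 layer is far larger than the depthwise layer, which is why grouping the 1×1 convolutions is where the saving comes from.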

Channel Shuffle
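The drawback of Group Convolution is that channels in different groups cannot exchange information. When GConv layers are stacked, the feature maps of different groups never communicate, as if the network were split into g unrelated roads with everyone travelling their own. This may weaken the network's feature extraction ability, and it is also why networks such as Xception and MobileNet use dense 1x1 convolutions (Dense Pointwise Convolution).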

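To solve this "inbreeding" among channels confined to their own groups, ShuffleNet optimizes away the large number of dense 1x1 convolutions (where they are used, they account for as much as 93.4% of the computation) and introduces the Channel Shuffle mechanism. Intuitively, this operation evenly scatters and regroups the channels of the different groups, so that the next layer can process information from every group.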

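As shown in the figure below, a feature map with g groups of n channels each is first reshaped into a matrix with g rows and n columns, then transposed into n rows and g columns, and finally flattened to give the new channel order. These operations are differentiable and cheap to compute, so they solve the information-exchange problem while staying consistent with ShuffleNet's lightweight design.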
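The reshape, transpose and flatten steps can be traced on plain channel indices. This is a minimal sketch in pure Python, not the MindSpore implementation; `channel_shuffle` is an illustrative name, and a real layer would apply the same permutation to the channel axis of a tensor.

```python
def channel_shuffle(channels, groups):
    """Reorder a flat list of channels: reshape (g, n) -> transpose -> flatten."""
    if len(channels) % groups:
        raise ValueError("channel count must be divisible by groups")
    n = len(channels) // groups
    # Reshape into g rows of n channels each (one row per group).
    rows = [channels[i * n:(i + 1) * n] for i in range(groups)]
    # Transpose into n rows of g channels each.
    cols = [[rows[r][c] for r in range(groups)] for c in range(n)]
    # Flatten back into a single channel list.
    return [ch for col in cols for ch in col]


# 3 groups of 2 channels: group 0 = [0, 1], group 1 = [2, 3], group 2 = [4, 5]
print(channel_shuffle(list(range(6)), groups=3))  # [0, 2, 4, 1, 3, 5]
```

After the shuffle, every run of g consecutive channels contains one channel from each original group, so the next group convolution sees all groups.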

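ShuffleNet Unit
As shown in the figure below, ShuffleNet modifies the Bottleneck structure of ResNet from (a) to (b) and (c):

The first and last 1×1 convolution modules (dimension reduction and dimension expansion) are replaced with Pointwise Group Convolution;

To let different channels exchange information, Channel Shuffle is applied after the dimension reduction;

In the downsampling unit, the 3×3 Depthwise Convolution uses a stride of 2, which halves the height and width. The shortcut therefore uses a 3×3 average pooling with stride 2, and the addition is replaced by concatenation.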
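Switching from addition to concatenation changes the channel bookkeeping of the unit. The sketch below is an illustration under one assumption: the residual branch of the downsampling unit is sized so that the concatenated output reaches the target width. The function names are hypothetical and do not come from the original code.

```python
def add_unit_channels(in_channels, out_channels):
    """Stride-1 unit (b): shortcut and branch are added element-wise."""
    if in_channels != out_channels:
        raise ValueError("element-wise addition needs matching channel counts")
    branch_channels = out_channels
    return branch_channels, out_channels


def concat_unit_channels(in_channels, out_channels):
    """Stride-2 unit (c): pooled shortcut and branch are concatenated."""
    # The average-pooled shortcut keeps all in_channels, so the branch only
    # has to supply the remaining channels (assumption stated above).
    branch_channels = out_channels - in_channels
    if branch_channels <= 0:
        raise ValueError("out_channels must exceed in_channels")
    return branch_channels, in_channels + branch_channels


print(add_unit_channels(240, 240))     # (240, 240)
print(concat_unit_channels(24, 240))   # (216, 240)
print(concat_unit_channels(240, 480))  # (240, 480)
```

Concatenation lets the unit widen the feature map without asking the branch to produce every output channel itself.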

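Building the ShuffleNet Network
The ShuffleNet network structure is shown in the figure below. Take a 224×224 input image and 3 groups (g = 3) as an example. The input first passes through a convolution layer with 24 kernels of size 3×3 and stride 2, giving a 112×112 feature map with 24 channels. It then passes through a max-pooling layer with stride 2, giving a 56×56 feature map with the channel count unchanged. Next come three stacked ShuffleNet stages (Stage2, Stage3, Stage4), whose units are repeated 4, 8 and 4 times respectively. Each stage starts with one downsampling unit ((c) in the figure above), which halves the height and width of the feature map and doubles the channel count (except for the downsampling unit of Stage2, which changes the channel count from 24 to 240). Global average pooling then produces an output of size 1×1×960, and a fully connected layer followed by softmax gives the class probabilities.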
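The shapes quoted above can be reproduced with a small trace. This is a sketch under stated assumptions: every stride-2 layer is taken to use a 3×3 window with padding 1, and `halve` and `trace_shapes` are illustrative helpers rather than functions from the tutorial.

```python
def halve(size, kernel=3, stride=2, padding=1):
    """Output size of a stride-2 layer (kernel 3 and padding 1 are assumed)."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_shapes(input_size=224, stage_channels=(240, 480, 960)):
    """Return (layer, channels, spatial size) for the g = 3 configuration."""
    shapes = []
    size = halve(input_size)
    shapes.append(("conv1", 24, size))
    size = halve(size)
    shapes.append(("maxpool", 24, size))
    for index, channels in enumerate(stage_channels, start=2):
        # Only the first unit of a stage downsamples; the repeated units
        # that follow keep both the size and the channel count.
        size = halve(size)
        shapes.append((f"stage{index}", channels, size))
    shapes.append(("global_avg_pool", stage_channels[-1], 1))
    return shapes


for name, channels, size in trace_shapes():
    print(f"{name:16s} {channels:4d} x {size} x {size}")
```

The trace ends with 960 channels at 1×1, matching the 1×1×960 output described in the text.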

Model Training and Evaluation
ShuffleNet is pre-trained on the CIFAR-10 dataset.

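Preparing and Loading the Training Set
CIFAR-10 contains 60,000 color images of 32*32 pixels, evenly divided into 10 classes, with 50,000 images used as the training set and 10,000 images as the test set. The example uses the mindspore.dataset.Cifar10Dataset interface to load the CIFAR-10 training set once it has been downloaded. Only the binary version (CIFAR-10 binary version) is currently supported.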
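For readers unfamiliar with the binary version: each record in a CIFAR-10 binary batch file is one label byte followed by 3072 pixel bytes (the red, green and blue planes, 1024 bytes each, in row-major order). The sketch below parses one such record in pure Python. It only illustrates the file layout and is not how mindspore.dataset.Cifar10Dataset is implemented; `parse_record` and the synthetic record are made up for the example.

```python
IMAGE_SIDE = 32
PLANE_BYTES = IMAGE_SIDE * IMAGE_SIDE  # 1024 bytes per color plane
RECORD_BYTES = 1 + 3 * PLANE_BYTES     # 1 label byte + 3072 pixel bytes


def parse_record(record):
    """Split one CIFAR-10 binary record into its label and color planes."""
    if len(record) != RECORD_BYTES:
        raise ValueError(f"expected {RECORD_BYTES} bytes, got {len(record)}")
    label = record[0]
    pixels = record[1:]
    red = pixels[:PLANE_BYTES]
    green = pixels[PLANE_BYTES:2 * PLANE_BYTES]
    blue = pixels[2 * PLANE_BYTES:]
    return label, (red, green, blue)


# Synthetic record for demonstration only: label 7 and a repeating byte ramp.
fake_record = bytes([7]) + bytes(range(256)) * 12
label, (red, green, blue) = parse_record(fake_record)
print(label, len(red), len(green), len(blue))  # 7 1024 1024 1024
```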

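Model Training
This section pre-trains the network from randomly initialized parameters. First, ShuffleNetV1 is called to define the network, with the model size set to "2.0x". The loss function is cross-entropy loss, the learning rate follows cosine annealing after a 4-epoch warmup, and the optimizer is Momentum. Finally, the Model interface from train.model wraps the network, loss function and optimizer in model, and model.train() trains the network. ModelCheckpoint (configured through CheckpointConfig), TimeMonitor and LossMonitor are passed in as callbacks; they print the epoch number, loss and time, and save the ckpt files in the current directory.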
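A warmup followed by cosine annealing can be written as a plain function of the step index. This is a minimal sketch rather than the schedule used in the original run: the linear shape of the warmup, the base learning rate of 0.05 and the 250-epoch length are assumptions chosen for illustration, while 390 steps per epoch matches the log for this run.

```python
import math


def warmup_cosine_lr(step, total_steps, warmup_steps, base_lr, min_lr=0.0):
    """Learning rate at a given step: linear warmup, then cosine annealing."""
    if step < warmup_steps:
        # Linear ramp that reaches base_lr on the last warmup step (assumed shape).
        return base_lr * (step + 1) / warmup_steps
    # Fraction of the annealing phase completed so far, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


steps_per_epoch = 390                 # matches the training log
warmup_steps = 4 * steps_per_epoch    # 4 warmup epochs
total_steps = 250 * steps_per_epoch   # hypothetical run length
schedule = [warmup_cosine_lr(s, total_steps, warmup_steps, base_lr=0.05)
            for s in range(total_steps)]
print(schedule[0], max(schedule), schedule[-1])
```

A per-step list like this is the usual way to hand a precomputed schedule to an optimizer.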

Training output:
2024-07-26 12:55:25 littlewoodhu
model size is  2.0x
============== Starting Training ==============
epoch: 1 step: 1, loss is 2.6074867248535156
epoch: 1 step: 2, loss is 2.5673539638519287
epoch: 1 step: 3, loss is 2.445343255996704
epoch: 1 step: 4, loss is 2.3593506813049316
epoch: 1 step: 5, loss is 2.4094467163085938
epoch: 1 step: 6, loss is 2.481452465057373
epoch: 1 step: 7, loss is 2.458271026611328
epoch: 1 step: 8, loss is 2.5503995418548584
epoch: 1 step: 9, loss is 2.37678599357605
epoch: 1 step: 10, loss is 2.334716796875
epoch: 1 step: 11, loss is 2.284836769104004
epoch: 1 step: 12, loss is 2.3663182258605957
epoch: 1 step: 13, loss is 2.412055253982544
epoch: 1 step: 14, loss is 2.387876510620117
epoch: 1 step: 15, loss is 2.3900086879730225
epoch: 1 step: 16, loss is 2.3132822513580322
epoch: 1 step: 17, loss is 2.306142807006836
epoch: 1 step: 18, loss is 2.26301908493042
epoch: 1 step: 19, loss is 2.31077241897583
epoch: 1 step: 20, loss is 2.296069383621216
epoch: 1 step: 21, loss is 2.3578920364379883
epoch: 1 step: 22, loss is 2.250715732574463
epoch: 1 step: 23, loss is 2.264376401901245
epoch: 1 step: 24, loss is 2.2397103309631348
epoch: 1 step: 25, loss is 2.2723278999328613
epoch: 1 step: 26, loss is 2.280475616455078
epoch: 1 step: 27, loss is 2.2611913681030273
epoch: 1 step: 28, loss is 2.201805353164673
epoch: 1 step: 29, loss is 2.225273609161377
epoch: 1 step: 30, loss is 2.1677911281585693
epoch: 1 step: 31, loss is 2.2753074169158936
epoch: 1 step: 32, loss is 2.3028290271759033
epoch: 1 step: 33, loss is 2.2406489849090576
epoch: 1 step: 34, loss is 2.1358611583709717
epoch: 1 step: 35, loss is 2.194544553756714
epoch: 1 step: 36, loss is 2.2103586196899414
epoch: 1 step: 37, loss is 2.2556700706481934
epoch: 1 step: 38, loss is 2.1384105682373047
epoch: 1 step: 39, loss is 2.235822916030884
epoch: 1 step: 40, loss is 2.2255942821502686
epoch: 1 step: 41, loss is 2.177318572998047
epoch: 1 step: 42, loss is 2.1832027435302734
epoch: 1 step: 43, loss is 2.1653120517730713
epoch: 1 step: 44, loss is 2.2017037868499756
epoch: 1 step: 45, loss is 2.097541332244873
epoch: 1 step: 46, loss is 2.174957036972046
epoch: 1 step: 47, loss is 2.1471595764160156
epoch: 1 step: 48, loss is 2.1461291313171387
epoch: 1 step: 49, loss is 2.1069915294647217
epoch: 1 step: 50, loss is 2.148634910583496
epoch: 1 step: 51, loss is 2.1617047786712646
epoch: 1 step: 52, loss is 2.1390111446380615
epoch: 1 step: 53, loss is 2.1788032054901123
epoch: 1 step: 54, loss is 2.1838719844818115
epoch: 1 step: 55, loss is 2.1878702640533447
epoch: 1 step: 56, loss is 2.254121780395508
epoch: 1 step: 57, loss is 2.1568925380706787
epoch: 1 step: 58, loss is 2.1664700508117676
epoch: 1 step: 59, loss is 2.2042391300201416
epoch: 1 step: 60, loss is 2.138359546661377
epoch: 1 step: 61, loss is 2.162292003631592
epoch: 1 step: 62, loss is 2.107999801635742
epoch: 1 step: 63, loss is 2.20550799369812
epoch: 1 step: 64, loss is 2.13793683052063
epoch: 1 step: 65, loss is 2.1301920413970947
epoch: 1 step: 66, loss is 2.1473281383514404
epoch: 1 step: 67, loss is 2.1408190727233887
epoch: 1 step: 68, loss is 2.1125881671905518
epoch: 1 step: 69, loss is 2.1237294673919678
epoch: 1 step: 70, loss is 2.0798583030700684
epoch: 1 step: 71, loss is 2.0484025478363037
epoch: 1 step: 72, loss is 2.0448365211486816
epoch: 1 step: 73, loss is 2.1569836139678955
epoch: 1 step: 74, loss is 2.1584362983703613
epoch: 1 step: 75, loss is 2.0341908931732178
epoch: 1 step: 76, loss is 2.17295241355896
epoch: 1 step: 77, loss is 2.1361145973205566
epoch: 1 step: 78, loss is 2.095043182373047
epoch: 1 step: 79, loss is 2.0424888134002686
epoch: 1 step: 80, loss is 2.147951126098633
epoch: 1 step: 81, loss is 2.0803651809692383
epoch: 1 step: 82, loss is 1.9917545318603516
epoch: 1 step: 83, loss is 2.065797805786133
epoch: 1 step: 84, loss is 2.120495557785034
epoch: 1 step: 85, loss is 2.156358242034912
epoch: 1 step: 86, loss is 2.1676712036132812
epoch: 1 step: 87, loss is 2.088245153427124
epoch: 1 step: 88, loss is 2.117332935333252
epoch: 1 step: 89, loss is 2.166799783706665
epoch: 1 step: 90, loss is 2.0351638793945312
epoch: 1 step: 91, loss is 2.0535051822662354
epoch: 1 step: 92, loss is 2.0263636112213135
epoch: 1 step: 93, loss is 2.0360491275787354
epoch: 1 step: 94, loss is 2.0493998527526855
epoch: 1 step: 95, loss is 2.093966484069824
epoch: 1 step: 96, loss is 2.0876405239105225
epoch: 1 step: 97, loss is 2.0177552700042725
epoch: 1 step: 98, loss is 2.0328681468963623
epoch: 1 step: 99, loss is 2.1284565925598145
epoch: 1 step: 100, loss is 2.038817882537842
epoch: 1 step: 101, loss is 2.1064395904541016
epoch: 1 step: 102, loss is 2.0493288040161133
epoch: 1 step: 103, loss is 2.0016369819641113
epoch: 1 step: 104, loss is 2.004840850830078
epoch: 1 step: 105, loss is 2.1012399196624756
epoch: 1 step: 106, loss is 1.9893112182617188
epoch: 1 step: 107, loss is 1.9789358377456665
epoch: 1 step: 108, loss is 2.016775608062744
epoch: 1 step: 109, loss is 2.039645195007324
epoch: 1 step: 110, loss is 2.033100128173828
epoch: 1 step: 111, loss is 2.0751750469207764
epoch: 1 step: 112, loss is 2.054032325744629
epoch: 1 step: 113, loss is 2.0449352264404297
epoch: 1 step: 114, loss is 2.1600353717803955
epoch: 1 step: 115, loss is 1.9872206449508667
epoch: 1 step: 116, loss is 2.1373400688171387
epoch: 1 step: 117, loss is 2.125105857849121
epoch: 1 step: 118, loss is 2.0569660663604736
epoch: 1 step: 119, loss is 2.0372209548950195
epoch: 1 step: 120, loss is 2.0964741706848145
epoch: 1 step: 121, loss is 1.9715644121170044
epoch: 1 step: 122, loss is 2.0824663639068604
epoch: 1 step: 123, loss is 1.9248427152633667
epoch: 1 step: 124, loss is 2.064785957336426
epoch: 1 step: 125, loss is 2.0272512435913086
epoch: 1 step: 126, loss is 1.881690263748169
epoch: 1 step: 127, loss is 2.079745292663574
epoch: 1 step: 128, loss is 1.9766654968261719
epoch: 1 step: 129, loss is 2.0431463718414307
epoch: 1 step: 130, loss is 2.0141639709472656
epoch: 1 step: 131, loss is 2.036090850830078
epoch: 1 step: 132, loss is 2.042241096496582
epoch: 1 step: 133, loss is 1.9501811265945435
epoch: 1 step: 134, loss is 1.8937513828277588
epoch: 1 step: 135, loss is 2.0809664726257324
epoch: 1 step: 136, loss is 1.9941296577453613
epoch: 1 step: 137, loss is 2.07505202293396
epoch: 1 step: 138, loss is 2.0699925422668457
epoch: 1 step: 139, loss is 1.99008047580719
epoch: 1 step: 140, loss is 1.9953538179397583
epoch: 1 step: 141, loss is 1.9193512201309204
epoch: 1 step: 142, loss is 1.9427956342697144
epoch: 1 step: 143, loss is 1.9388988018035889
epoch: 1 step: 144, loss is 1.973027229309082
epoch: 1 step: 145, loss is 2.048292875289917
epoch: 1 step: 146, loss is 2.0957119464874268
epoch: 1 step: 147, loss is 2.0850741863250732
epoch: 1 step: 148, loss is 1.9283853769302368
epoch: 1 step: 149, loss is 2.019087076187134
epoch: 1 step: 150, loss is 1.9871562719345093
epoch: 1 step: 151, loss is 1.9826608896255493
epoch: 1 step: 152, loss is 2.0052707195281982
epoch: 1 step: 153, loss is 2.0426347255706787
epoch: 1 step: 154, loss is 1.9916598796844482
epoch: 1 step: 155, loss is 1.9741183519363403
epoch: 1 step: 156, loss is 2.0702977180480957
epoch: 1 step: 157, loss is 2.0325732231140137
epoch: 1 step: 158, loss is 1.9691928625106812
epoch: 1 step: 159, loss is 2.0533018112182617
epoch: 1 step: 160, loss is 2.09694766998291
epoch: 1 step: 161, loss is 1.9766359329223633
epoch: 1 step: 162, loss is 1.9411063194274902
epoch: 1 step: 163, loss is 1.9895853996276855
epoch: 1 step: 164, loss is 2.015362501144409
epoch: 1 step: 165, loss is 2.064490556716919
epoch: 1 step: 166, loss is 1.9311487674713135
epoch: 1 step: 167, loss is 2.0468943119049072
epoch: 1 step: 168, loss is 1.9442241191864014
epoch: 1 step: 169, loss is 1.9285390377044678
epoch: 1 step: 170, loss is 1.9346100091934204
epoch: 1 step: 171, loss is 2.1009116172790527
epoch: 1 step: 172, loss is 2.0505332946777344
epoch: 1 step: 173, loss is 2.0046796798706055
epoch: 1 step: 174, loss is 2.024062156677246
epoch: 1 step: 175, loss is 1.9987787008285522
epoch: 1 step: 176, loss is 1.9816559553146362
epoch: 1 step: 177, loss is 1.9588998556137085
epoch: 1 step: 178, loss is 1.9840435981750488
epoch: 1 step: 179, loss is 1.9875191450119019
epoch: 1 step: 180, loss is 1.9972584247589111
epoch: 1 step: 181, loss is 1.9971120357513428
epoch: 1 step: 182, loss is 1.9379992485046387
epoch: 1 step: 183, loss is 2.124464511871338
epoch: 1 step: 184, loss is 2.132340908050537
epoch: 1 step: 185, loss is 1.951235294342041
epoch: 1 step: 186, loss is 1.9694455862045288
epoch: 1 step: 187, loss is 1.9557605981826782
epoch: 1 step: 188, loss is 2.0418901443481445
epoch: 1 step: 189, loss is 1.9324007034301758
epoch: 1 step: 190, loss is 1.9891881942749023
epoch: 1 step: 191, loss is 1.8896417617797852
epoch: 1 step: 192, loss is 1.9871134757995605
epoch: 1 step: 193, loss is 1.9819241762161255
epoch: 1 step: 194, loss is 2.067260265350342
epoch: 1 step: 195, loss is 1.9608522653579712
epoch: 1 step: 196, loss is 2.0522453784942627
epoch: 1 step: 197, loss is 2.029884099960327
epoch: 1 step: 198, loss is 2.145172119140625
epoch: 1 step: 199, loss is 1.9631972312927246
epoch: 1 step: 200, loss is 2.002553701400757
epoch: 1 step: 201, loss is 2.0363729000091553
epoch: 1 step: 202, loss is 2.058772563934326
epoch: 1 step: 203, loss is 2.0027172565460205
epoch: 1 step: 204, loss is 1.9935832023620605
epoch: 1 step: 205, loss is 2.090135097503662
epoch: 1 step: 206, loss is 2.0681467056274414
epoch: 1 step: 207, loss is 2.0434460639953613
epoch: 1 step: 208, loss is 1.9367997646331787
epoch: 1 step: 209, loss is 2.147993564605713
epoch: 1 step: 210, loss is 1.9816290140151978
epoch: 1 step: 211, loss is 1.985154390335083
epoch: 1 step: 212, loss is 2.055087089538574
epoch: 1 step: 213, loss is 2.0359644889831543
epoch: 1 step: 214, loss is 1.950507640838623
epoch: 1 step: 215, loss is 1.95393967628479
epoch: 1 step: 216, loss is 1.991260051727295
epoch: 1 step: 217, loss is 1.9864580631256104
epoch: 1 step: 218, loss is 1.9721453189849854
epoch: 1 step: 219, loss is 1.9942901134490967
epoch: 1 step: 220, loss is 1.8883270025253296
epoch: 1 step: 221, loss is 1.8978381156921387
epoch: 1 step: 222, loss is 1.9188950061798096
epoch: 1 step: 223, loss is 1.9739024639129639
epoch: 1 step: 224, loss is 1.9460232257843018
epoch: 1 step: 225, loss is 2.071077823638916
epoch: 1 step: 226, loss is 1.915285348892212
epoch: 1 step: 227, loss is 1.990149736404419
epoch: 1 step: 228, loss is 1.8998172283172607
epoch: 1 step: 229, loss is 1.9340561628341675
epoch: 1 step: 230, loss is 2.058088541030884
epoch: 1 step: 231, loss is 1.8666603565216064
epoch: 1 step: 232, loss is 1.9561322927474976
epoch: 1 step: 233, loss is 1.9107087850570679
epoch: 1 step: 234, loss is 1.911312460899353
epoch: 1 step: 235, loss is 1.9710066318511963
epoch: 1 step: 236, loss is 1.9095042943954468
epoch: 1 step: 237, loss is 2.0002048015594482
epoch: 1 step: 238, loss is 1.9421601295471191
epoch: 1 step: 239, loss is 1.9567835330963135
epoch: 1 step: 240, loss is 1.9102545976638794
epoch: 1 step: 241, loss is 1.9200680255889893
epoch: 1 step: 242, loss is 1.9566885232925415
epoch: 1 step: 243, loss is 1.8946560621261597
epoch: 1 step: 244, loss is 2.032424211502075
epoch: 1 step: 245, loss is 1.8792742490768433
epoch: 1 step: 246, loss is 1.8991434574127197
epoch: 1 step: 247, loss is 1.8859573602676392
epoch: 1 step: 248, loss is 1.804595708847046
epoch: 1 step: 249, loss is 1.969016671180725
epoch: 1 step: 250, loss is 2.0294530391693115
epoch: 1 step: 251, loss is 1.9516113996505737
epoch: 1 step: 252, loss is 1.9055083990097046
epoch: 1 step: 253, loss is 1.9428240060806274
epoch: 1 step: 254, loss is 1.9211325645446777
epoch: 1 step: 255, loss is 1.9290534257888794
epoch: 1 step: 256, loss is 1.927079677581787
epoch: 1 step: 257, loss is 2.036738872528076
epoch: 1 step: 258, loss is 1.9652936458587646
epoch: 1 step: 259, loss is 1.9190199375152588
epoch: 1 step: 260, loss is 1.9562625885009766
epoch: 1 step: 261, loss is 1.92203688621521
epoch: 1 step: 262, loss is 1.901296854019165
epoch: 1 step: 263, loss is 1.9890611171722412
epoch: 1 step: 264, loss is 2.0135769844055176
epoch: 1 step: 265, loss is 1.936768651008606
epoch: 1 step: 266, loss is 1.914853572845459
epoch: 1 step: 267, loss is 1.942449688911438
epoch: 1 step: 268, loss is 1.908927083015442
epoch: 1 step: 269, loss is 1.8932807445526123
epoch: 1 step: 270, loss is 2.0091052055358887
epoch: 1 step: 271, loss is 1.9299839735031128
epoch: 1 step: 272, loss is 1.970038890838623
epoch: 1 step: 273, loss is 1.9781885147094727
epoch: 1 step: 274, loss is 1.8617650270462036
epoch: 1 step: 275, loss is 2.008246421813965
epoch: 1 step: 276, loss is 1.9354159832000732
epoch: 1 step: 277, loss is 1.9291837215423584
epoch: 1 step: 278, loss is 2.012862205505371
epoch: 1 step: 279, loss is 1.8868024349212646
epoch: 1 step: 280, loss is 1.8654429912567139
epoch: 1 step: 281, loss is 1.9085495471954346
epoch: 1 step: 282, loss is 2.0358519554138184
epoch: 1 step: 283, loss is 1.9203890562057495
epoch: 1 step: 284, loss is 1.9827072620391846
epoch: 1 step: 285, loss is 1.8735699653625488
epoch: 1 step: 286, loss is 1.9019166231155396
epoch: 1 step: 287, loss is 2.0084218978881836
epoch: 1 step: 288, loss is 2.000614881515503
epoch: 1 step: 289, loss is 2.0126662254333496
epoch: 1 step: 290, loss is 1.9186487197875977
epoch: 1 step: 291, loss is 1.8803479671478271
epoch: 1 step: 292, loss is 1.873417854309082
epoch: 1 step: 293, loss is 1.8859927654266357
epoch: 1 step: 294, loss is 1.9195986986160278
epoch: 1 step: 295, loss is 1.9440782070159912
epoch: 1 step: 296, loss is 1.9989290237426758
epoch: 1 step: 297, loss is 1.9591376781463623
epoch: 1 step: 298, loss is 2.0234837532043457
epoch: 1 step: 299, loss is 1.948691964149475
epoch: 1 step: 300, loss is 2.0098040103912354
epoch: 1 step: 301, loss is 1.9991217851638794
epoch: 1 step: 302, loss is 1.8470983505249023
epoch: 1 step: 303, loss is 1.9883179664611816
epoch: 1 step: 304, loss is 1.9590797424316406
epoch: 1 step: 305, loss is 1.9329893589019775
epoch: 1 step: 306, loss is 2.0018374919891357
epoch: 1 step: 307, loss is 2.001723289489746
epoch: 1 step: 308, loss is 1.8369154930114746
epoch: 1 step: 309, loss is 1.9257686138153076
epoch: 1 step: 310, loss is 1.9213684797286987
epoch: 1 step: 311, loss is 1.91152024269104
epoch: 1 step: 312, loss is 1.9036773443222046
epoch: 1 step: 313, loss is 1.9819672107696533
epoch: 1 step: 314, loss is 1.9471276998519897
epoch: 1 step: 315, loss is 1.9795304536819458
epoch: 1 step: 316, loss is 1.8561713695526123
epoch: 1 step: 317, loss is 2.045935869216919
epoch: 1 step: 318, loss is 1.9417272806167603
epoch: 1 step: 319, loss is 2.10587215423584
epoch: 1 step: 320, loss is 1.82456636428833
epoch: 1 step: 321, loss is 1.9213714599609375
epoch: 1 step: 322, loss is 1.8929128646850586
epoch: 1 step: 323, loss is 2.016751289367676
epoch: 1 step: 324, loss is 1.8333282470703125
epoch: 1 step: 325, loss is 1.904754877090454
epoch: 1 step: 326, loss is 2.0365891456604004
epoch: 1 step: 327, loss is 1.8032381534576416
epoch: 1 step: 328, loss is 1.889327049255371
epoch: 1 step: 329, loss is 1.8690977096557617
epoch: 1 step: 330, loss is 1.9579219818115234
epoch: 1 step: 331, loss is 1.897598147392273
epoch: 1 step: 332, loss is 1.936568021774292
epoch: 1 step: 333, loss is 1.9168888330459595
epoch: 1 step: 334, loss is 1.9970436096191406
epoch: 1 step: 335, loss is 1.8795043230056763
epoch: 1 step: 336, loss is 1.8512083292007446
epoch: 1 step: 337, loss is 1.9615848064422607
epoch: 1 step: 338, loss is 1.8836190700531006
epoch: 1 step: 339, loss is 1.8904942274093628
epoch: 1 step: 340, loss is 2.0177059173583984
epoch: 1 step: 341, loss is 1.9037587642669678
epoch: 1 step: 342, loss is 1.9891458749771118
epoch: 1 step: 343, loss is 1.901099681854248
epoch: 1 step: 344, loss is 1.9564093351364136
epoch: 1 step: 345, loss is 2.050227165222168
epoch: 1 step: 346, loss is 1.9546282291412354
epoch: 1 step: 347, loss is 1.8253402709960938
epoch: 1 step: 348, loss is 1.881971001625061
epoch: 1 step: 349, loss is 1.915027379989624
epoch: 1 step: 350, loss is 1.9205769300460815
epoch: 1 step: 351, loss is 1.9755690097808838
epoch: 1 step: 352, loss is 1.9711400270462036
epoch: 1 step: 353, loss is 1.9381728172302246
epoch: 1 step: 354, loss is 1.746055245399475
epoch: 1 step: 355, loss is 1.942064881324768
epoch: 1 step: 356, loss is 1.8913757801055908
epoch: 1 step: 357, loss is 1.8841240406036377
epoch: 1 step: 358, loss is 1.978926420211792
epoch: 1 step: 359, loss is 2.0343894958496094
epoch: 1 step: 360, loss is 1.8985085487365723
epoch: 1 step: 361, loss is 1.889115810394287
epoch: 1 step: 362, loss is 1.9065433740615845
epoch: 1 step: 363, loss is 1.9389437437057495
epoch: 1 step: 364, loss is 1.9310613870620728
epoch: 1 step: 365, loss is 1.965096354484558
epoch: 1 step: 366, loss is 2.001617908477783
epoch: 1 step: 367, loss is 1.9463062286376953
epoch: 1 step: 368, loss is 1.9296867847442627
epoch: 1 step: 369, loss is 1.9059423208236694
epoch: 1 step: 370, loss is 1.9741623401641846
epoch: 1 step: 371, loss is 1.9527919292449951
epoch: 1 step: 372, loss is 1.8885220289230347
epoch: 1 step: 373, loss is 1.87715482711792
epoch: 1 step: 374, loss is 2.00614333152771
epoch: 1 step: 375, loss is 1.8491806983947754
epoch: 1 step: 376, loss is 1.8577344417572021
epoch: 1 step: 377, loss is 2.001436948776245
epoch: 1 step: 378, loss is 1.8928618431091309
epoch: 1 step: 379, loss is 2.0045433044433594
epoch: 1 step: 380, loss is 2.115802049636841
epoch: 1 step: 381, loss is 1.8844345808029175
epoch: 1 step: 382, loss is 1.9139431715011597
epoch: 1 step: 383, loss is 2.0448434352874756
epoch: 1 step: 384, loss is 2.0351815223693848
epoch: 1 step: 385, loss is 1.9286075830459595
epoch: 1 step: 386, loss is 1.908137321472168
epoch: 1 step: 387, loss is 1.8745241165161133
epoch: 1 step: 388, loss is 1.9043793678283691
epoch: 1 step: 389, loss is 1.8875763416290283
epoch: 1 step: 390, loss is 1.8309433460235596
Train epoch time: 301169.930 ms, per step time: 772.231 ms
epoch: 2 step: 1, loss is 2.0086636543273926
epoch: 2 step: 2, loss is 1.8561439514160156
epoch: 2 step: 3, loss is 1.959549903869629
epoch: 2 step: 4, loss is 1.845109462738037
epoch: 2 step: 5, loss is 1.859502911567688
epoch: 2 step: 6, loss is 1.8266891241073608
epoch: 2 step: 7, loss is 1.9940757751464844
epoch: 2 step: 8, loss is 1.887444257736206
epoch: 2 step: 9, loss is 1.941131353378296
epoch: 2 step: 10, loss is 2.024829387664795
epoch: 2 step: 11, loss is 1.8800655603408813
epoch: 2 step: 12, loss is 1.9109078645706177
epoch: 2 step: 13, loss is 1.9579572677612305
epoch: 2 step: 14, loss is 1.941686749458313
epoch: 2 step: 15, loss is 1.8821125030517578
epoch: 2 step: 16, loss is 1.8421412706375122
epoch: 2 step: 17, loss is 1.814638376235962
epoch: 2 step: 18, loss is 1.8196184635162354
epoch: 2 step: 19, loss is 1.9221185445785522
epoch: 2 step: 20, loss is 1.9255518913269043
epoch: 2 step: 21, loss is 1.8777854442596436
epoch: 2 step: 22, loss is 1.8691197633743286
epoch: 2 step: 23, loss is 1.9205360412597656
epoch: 2 step: 24, loss is 1.8332617282867432
epoch: 2 step: 25, loss is 1.88694429397583
epoch: 2 step: 26, loss is 1.8043739795684814
epoch: 2 step: 27, loss is 1.939123511314392
epoch: 2 step: 28, loss is 1.844259262084961
epoch: 2 step: 29, loss is 1.8621373176574707
epoch: 2 step: 30, loss is 1.9329479932785034
epoch: 2 step: 31, loss is 1.8549472093582153
epoch: 2 step: 32, loss is 1.8989877700805664
epoch: 2 step: 33, loss is 1.9532358646392822
epoch: 2 step: 34, loss is 1.8432390689849854
epoch: 2 step: 35, loss is 1.9938075542449951
epoch: 2 step: 36, loss is 1.8990700244903564
epoch: 2 step: 37, loss is 1.8663583993911743
epoch: 2 step: 38, loss is 1.8983309268951416
epoch: 2 step: 39, loss is 1.832546591758728
epoch: 2 step: 40, loss is 1.8912301063537598
epoch: 2 step: 41, loss is 1.889316439628601
epoch: 2 step: 42, loss is 1.9854687452316284
epoch: 2 step: 43, loss is 1.8240479230880737
epoch: 2 step: 44, loss is 1.9101498126983643
epoch: 2 step: 45, loss is 1.797684669494629
epoch: 2 step: 46, loss is 1.8577277660369873
epoch: 2 step: 47, loss is 1.8356904983520508
epoch: 2 step: 48, loss is 1.923057198524475
epoch: 2 step: 49, loss is 1.9273544549942017
epoch: 2 step: 50, loss is 1.8730634450912476
epoch: 2 step: 51, loss is 1.7962920665740967
epoch: 2 step: 52, loss is 1.840844988822937
epoch: 2 step: 53, loss is 1.9192214012145996
epoch: 2 step: 54, loss is 1.7557487487792969
epoch: 2 step: 55, loss is 1.8856722116470337
epoch: 2 step: 56, loss is 1.9432092905044556
epoch: 2 step: 57, loss is 1.858530879020691
epoch: 2 step: 58, loss is 2.002671718597412
epoch: 2 step: 59, loss is 1.856890082359314
epoch: 2 step: 60, loss is 1.820954442024231
epoch: 2 step: 61, loss is 1.946556568145752
epoch: 2 step: 62, loss is 1.8445112705230713
epoch: 2 step: 63, loss is 1.7778434753417969
epoch: 2 step: 64, loss is 1.8148187398910522
epoch: 2 step: 65, loss is 1.8800560235977173
epoch: 2 step: 66, loss is 1.7955501079559326
epoch: 2 step: 67, loss is 1.9290978908538818
epoch: 2 step: 68, loss is 1.9206385612487793
epoch: 2 step: 69, loss is 1.8472034931182861
epoch: 2 step: 70, loss is 1.8666832447052002
epoch: 2 step: 71, loss is 1.9787473678588867
epoch: 2 step: 72, loss is 1.8786567449569702
epoch: 2 step: 73, loss is 1.7307939529418945
epoch: 2 step: 74, loss is 1.9424220323562622
epoch: 2 step: 75, loss is 1.8803863525390625
epoch: 2 step: 76, loss is 1.8787437677383423
epoch: 2 step: 77, loss is 1.8715336322784424
epoch: 2 step: 78, loss is 1.8748095035552979
epoch: 2 step: 79, loss is 1.8724067211151123
epoch: 2 step: 80, loss is 1.8271408081054688
epoch: 2 step: 81, loss is 2.0027823448181152
epoch: 2 step: 82, loss is 1.7880679368972778
epoch: 2 step: 83, loss is 1.8629825115203857
epoch: 2 step: 84, loss is 1.7959777116775513
epoch: 2 step: 85, loss is 1.7876453399658203
epoch: 2 step: 86, loss is 1.9502708911895752
epoch: 2 step: 87, loss is 1.833869218826294
epoch: 2 step: 88, loss is 1.8148149251937866
epoch: 2 step: 89, loss is 1.86910879611969
epoch: 2 step: 90, loss is 1.8948606252670288
epoch: 2 step: 91, loss is 1.8240607976913452
epoch: 2 step: 92, loss is 1.7175065279006958
epoch: 2 step: 93, loss is 1.940483570098877
epoch: 2 step: 94, loss is 1.7632392644882202
epoch: 2 step: 95, loss is 1.737947940826416
epoch: 2 step: 96, loss is 1.7186986207962036
epoch: 2 step: 97, loss is 1.8999754190444946
epoch: 2 step: 98, loss is 1.7857352495193481
epoch: 2 step: 99, loss is 1.9178612232208252
epoch: 2 step: 100, loss is 1.8766601085662842
epoch: 2 step: 101, loss is 1.82723069190979
epoch: 2 step: 102, loss is 1.9207415580749512
epoch: 2 step: 103, loss is 1.8620266914367676
epoch: 2 step: 104, loss is 1.8713843822479248
epoch: 2 step: 105, loss is 1.8156766891479492
epoch: 2 step: 106, loss is 1.8680598735809326
epoch: 2 step: 107, loss is 1.8187801837921143
epoch: 2 step: 108, loss is 1.7260297536849976
epoch: 2 step: 109, loss is 1.8599132299423218
epoch: 2 step: 110, loss is 1.881721019744873
epoch: 2 step: 111, loss is 1.8597749471664429
epoch: 2 step: 112, loss is 1.821563720703125
epoch: 2 step: 113, loss is 1.8555083274841309
epoch: 2 step: 114, loss is 1.9005643129348755
epoch: 2 step: 115, loss is 1.7498300075531006
epoch: 2 step: 116, loss is 1.9095624685287476
epoch: 2 step: 117, loss is 2.02589750289917
epoch: 2 step: 118, loss is 1.7067241668701172
epoch: 2 step: 119, loss is 1.9239228963851929
epoch: 2 step: 120, loss is 1.9379498958587646
epoch: 2 step: 121, loss is 1.7686045169830322
epoch: 2 step: 122, loss is 1.8032222986221313
epoch: 2 step: 123, loss is 1.8437695503234863
epoch: 2 step: 124, loss is 1.851049780845642
epoch: 2 step: 125, loss is 1.8590387105941772
epoch: 2 step: 126, loss is 1.816530466079712
epoch: 2 step: 127, loss is 1.7972630262374878
epoch: 2 step: 128, loss is 1.9503204822540283
epoch: 2 step: 129, loss is 1.8170355558395386
epoch: 2 step: 130, loss is 1.9414114952087402
epoch: 2 step: 131, loss is 1.9720697402954102
epoch: 2 step: 132, loss is 1.8431159257888794
epoch: 2 step: 133, loss is 1.8431402444839478
epoch: 2 step: 134, loss is 1.880516529083252
epoch: 2 step: 135, loss is 1.9251197576522827
epoch: 2 step: 136, loss is 1.7931485176086426
epoch: 2 step: 137, loss is 1.8366456031799316
epoch: 2 step: 138, loss is 1.8162732124328613
epoch: 2 step: 139, loss is 1.766951084136963
epoch: 2 step: 140, loss is 1.9067617654800415
epoch: 2 step: 141, loss is 1.8185125589370728
epoch: 2 step: 142, loss is 1.7571754455566406
epoch: 2 step: 143, loss is 1.895729422569275
epoch: 2 step: 144, loss is 1.8479565382003784
epoch: 2 step: 145, loss is 1.851345419883728
epoch: 2 step: 146, loss is 1.7962510585784912
epoch: 2 step: 147, loss is 1.8403642177581787
epoch: 2 step: 148, loss is 1.9102624654769897
epoch: 2 step: 149, loss is 1.8668683767318726
epoch: 2 step: 150, loss is 1.8768565654754639
epoch: 2 step: 151, loss is 1.803843379020691
epoch: 2 step: 152, loss is 1.80448579788208
epoch: 2 step: 153, loss is 1.9157750606536865
epoch: 2 step: 154, loss is 1.7746250629425049
epoch: 2 step: 155, loss is 1.9414875507354736
epoch: 2 step: 156, loss is 1.8375035524368286
epoch: 2 step: 157, loss is 1.7554367780685425
epoch: 2 step: 158, loss is 1.849830985069275
epoch: 2 step: 159, loss is 1.9036533832550049
epoch: 2 step: 160, loss is 1.9142879247665405
epoch: 2 step: 161, loss is 1.8832579851150513
epoch: 2 step: 162, loss is 1.792252779006958
epoch: 2 step: 163, loss is 1.862593412399292
epoch: 2 step: 164, loss is 1.8466187715530396
epoch: 2 step: 165, loss is 1.7753026485443115
epoch: 2 step: 166, loss is 1.7677288055419922
epoch: 2 step: 167, loss is 1.9642395973205566
epoch: 2 step: 168, loss is 1.9213532209396362
epoch: 2 step: 169, loss is 1.9132721424102783
epoch: 2 step: 170, loss is 1.8411797285079956
epoch: 2 step: 171, loss is 1.8710469007492065
epoch: 2 step: 172, loss is 1.8579764366149902
epoch: 2 step: 173, loss is 1.8785043954849243
epoch: 2 step: 174, loss is 1.792521595954895
epoch: 2 step: 175, loss is 1.8440773487091064
epoch: 2 step: 176, loss is 1.8609850406646729
epoch: 2 step: 177, loss is 1.8927359580993652
epoch: 2 step: 178, loss is 1.7732219696044922
epoch: 2 step: 179, loss is 1.8476505279541016
epoch: 2 step: 180, loss is 1.8040062189102173
epoch: 2 step: 181, loss is 1.7327547073364258
epoch: 2 step: 182, loss is 1.7375595569610596
epoch: 2 step: 183, loss is 1.819338083267212
epoch: 2 step: 184, loss is 1.8011999130249023
epoch: 2 step: 185, loss is 1.830836534500122
epoch: 2 step: 186, loss is 1.7516945600509644
epoch: 2 step: 187, loss is 1.8135254383087158
epoch: 2 step: 188, loss is 1.8124158382415771
epoch: 2 step: 189, loss is 1.6668801307678223
epoch: 2 step: 190, loss is 1.7354092597961426
epoch: 2 step: 191, loss is 1.8319144248962402
epoch: 2 step: 192, loss is 1.7852344512939453
epoch: 2 step: 193, loss is 1.8413450717926025
epoch: 2 step: 194, loss is 1.7621142864227295
epoch: 2 step: 195, loss is 1.893214464187622
epoch: 2 step: 196, loss is 1.85415518283844
epoch: 2 step: 197, loss is 1.8773202896118164
epoch: 2 step: 198, loss is 1.7623281478881836
epoch: 2 step: 199, loss is 1.7395901679992676
epoch: 2 step: 200, loss is 1.8567042350769043
epoch: 2 step: 201, loss is 1.778634786605835
epoch: 2 step: 202, loss is 1.7063484191894531
epoch: 2 step: 203, loss is 1.749333143234253
epoch: 2 step: 204, loss is 1.8264789581298828
epoch: 2 step: 205, loss is 1.8365423679351807
epoch: 2 step: 206, loss is 1.8969218730926514
epoch: 2 step: 207, loss is 1.8710927963256836
epoch: 2 step: 208, loss is 1.7963848114013672
epoch: 2 step: 209, loss is 1.8325331211090088
epoch: 2 step: 210, loss is 1.8086352348327637
epoch: 2 step: 211, loss is 1.7635247707366943
epoch: 2 step: 212, loss is 1.786803960800171
epoch: 2 step: 213, loss is 1.86027193069458
epoch: 2 step: 214, loss is 1.8134807348251343
epoch: 2 step: 215, loss is 1.6304543018341064
epoch: 2 step: 216, loss is 1.9413152933120728
epoch: 2 step: 217, loss is 1.8634803295135498
epoch: 2 step: 218, loss is 1.7682809829711914
epoch: 2 step: 219, loss is 1.7050912380218506
epoch: 2 step: 220, loss is 1.8277148008346558
epoch: 2 step: 221, loss is 1.765455961227417
epoch: 2 step: 222, loss is 1.8628239631652832
epoch: 2 step: 223, loss is 1.7774879932403564
epoch: 2 step: 224, loss is 1.7936487197875977
epoch: 2 step: 225, loss is 1.8603432178497314
epoch: 2 step: 226, loss is 1.8159501552581787
epoch: 2 step: 227, loss is 1.8525795936584473
epoch: 2 step: 228, loss is 1.777559757232666
epoch: 2 step: 229, loss is 1.7203599214553833
epoch: 2 step: 230, loss is 1.7874581813812256
epoch: 2 step: 231, loss is 1.772555947303772
epoch: 2 step: 232, loss is 1.6849935054779053
epoch: 2 step: 233, loss is 1.9093971252441406
epoch: 2 step: 234, loss is 1.771955966949463
epoch: 2 step: 235, loss is 1.8456029891967773
epoch: 2 step: 236, loss is 1.7137638330459595
epoch: 2 step: 237, loss is 1.9477996826171875
epoch: 2 step: 238, loss is 1.8464946746826172
epoch: 2 step: 239, loss is 1.7354912757873535
epoch: 2 step: 240, loss is 1.7765865325927734
epoch: 2 step: 241, loss is 1.8965409994125366
epoch: 2 step: 242, loss is 1.8762569427490234
epoch: 2 step: 243, loss is 1.7814252376556396
epoch: 2 step: 244, loss is 1.73980712890625
epoch: 2 step: 245, loss is 1.8301444053649902
epoch: 2 step: 246, loss is 1.811732292175293
epoch: 2 step: 247, loss is 1.8318946361541748
epoch: 2 step: 248, loss is 1.8942484855651855
epoch: 2 step: 249, loss is 1.730800747871399
epoch: 2 step: 250, loss is 1.7132985591888428
epoch: 2 step: 251, loss is 1.7972244024276733
epoch: 2 step: 252, loss is 1.8195381164550781
epoch: 2 step: 253, loss is 1.6059634685516357
epoch: 2 step: 254, loss is 1.7207951545715332
epoch: 2 step: 255, loss is 1.7502498626708984
epoch: 2 step: 256, loss is 1.755635380744934
epoch: 2 step: 257, loss is 1.758680820465088
epoch: 2 step: 258, loss is 1.8161877393722534
epoch: 2 step: 259, loss is 1.847859263420105
epoch: 2 step: 260, loss is 1.7462278604507446
epoch: 2 step: 261, loss is 1.7590373754501343
epoch: 2 step: 262, loss is 1.8493421077728271
epoch: 2 step: 263, loss is 1.7797712087631226
epoch: 2 step: 264, loss is 1.8671611547470093
epoch: 2 step: 265, loss is 1.848267912864685
epoch: 2 step: 266, loss is 1.7634716033935547
epoch: 2 step: 267, loss is 1.6971874237060547
epoch: 2 step: 268, loss is 1.699387788772583
epoch: 2 step: 269, loss is 1.791347622871399
epoch: 2 step: 270, loss is 1.801035761833191
epoch: 2 step: 271, loss is 1.851177453994751
epoch: 2 step: 272, loss is 1.9037847518920898
epoch: 2 step: 273, loss is 1.8303537368774414
epoch: 2 step: 274, loss is 1.8420476913452148
epoch: 2 step: 275, loss is 1.764127492904663
epoch: 2 step: 276, loss is 1.888632893562317
epoch: 2 step: 277, loss is 1.8250199556350708
epoch: 2 step: 278, loss is 1.8460564613342285
epoch: 2 step: 279, loss is 1.767008900642395
epoch: 2 step: 280, loss is 1.9221689701080322
epoch: 2 step: 281, loss is 1.8578596115112305
epoch: 2 step: 282, loss is 1.8321722745895386
epoch: 2 step: 283, loss is 1.738450288772583
epoch: 2 step: 284, loss is 1.8421459197998047
epoch: 2 step: 285, loss is 1.7913256883621216
epoch: 2 step: 286, loss is 1.7157938480377197
epoch: 2 step: 287, loss is 1.768418312072754
epoch: 2 step: 288, loss is 1.8244106769561768
epoch: 2 step: 289, loss is 1.900506615638733
epoch: 2 step: 290, loss is 1.73917555809021
epoch: 2 step: 291, loss is 1.6665081977844238
epoch: 2 step: 292, loss is 1.8142967224121094
epoch: 2 step: 293, loss is 1.8735641241073608
epoch: 2 step: 294, loss is 1.729585886001587
epoch: 2 step: 295, loss is 1.8190531730651855
epoch: 2 step: 296, loss is 1.820863127708435
epoch: 2 step: 297, loss is 1.800586462020874
epoch: 2 step: 298, loss is 1.8346083164215088
epoch: 2 step: 299, loss is 1.8632863759994507
epoch: 2 step: 300, loss is 1.7105364799499512
epoch: 2 step: 301, loss is 1.8319177627563477
epoch: 2 step: 302, loss is 1.8042783737182617
epoch: 2 step: 303, loss is 1.9537193775177002
epoch: 2 step: 304, loss is 1.7980555295944214
epoch: 2 step: 305, loss is 1.6234668493270874
epoch: 2 step: 306, loss is 1.7497445344924927
epoch: 2 step: 307, loss is 1.650848627090454
epoch: 2 step: 308, loss is 1.7433573007583618
epoch: 2 step: 309, loss is 1.8352850675582886
epoch: 2 step: 310, loss is 1.7705419063568115
epoch: 2 step: 311, loss is 1.7893061637878418
epoch: 2 step: 312, loss is 1.8017258644104004
epoch: 2 step: 313, loss is 1.8343088626861572
epoch: 2 step: 314, loss is 1.863562822341919
epoch: 2 step: 315, loss is 1.7683537006378174
epoch: 2 step: 316, loss is 1.7513844966888428
epoch: 2 step: 317, loss is 1.8488166332244873
epoch: 2 step: 318, loss is 1.6568918228149414
epoch: 2 step: 319, loss is 1.7907938957214355
epoch: 2 step: 320, loss is 1.842228889465332
epoch: 2 step: 321, loss is 1.740508794784546
epoch: 2 step: 322, loss is 1.8639365434646606
epoch: 2 step: 323, loss is 1.7938766479492188
epoch: 2 step: 324, loss is 1.7994264364242554
epoch: 2 step: 325, loss is 1.7769482135772705
epoch: 2 step: 326, loss is 1.7914179563522339
epoch: 2 step: 327, loss is 1.8817980289459229
epoch: 2 step: 328, loss is 1.780959129333496
epoch: 2 step: 329, loss is 1.7240921258926392
epoch: 2 step: 330, loss is 1.8149974346160889
epoch: 2 step: 331, loss is 1.7758080959320068
epoch: 2 step: 332, loss is 1.850448489189148
epoch: 2 step: 333, loss is 1.8151671886444092
epoch: 2 step: 334, loss is 1.6135375499725342
epoch: 2 step: 335, loss is 1.8676254749298096
epoch: 2 step: 336, loss is 1.796992540359497
epoch: 2 step: 337, loss is 1.7397630214691162
epoch: 2 step: 338, loss is 1.8294241428375244
epoch: 2 step: 339, loss is 1.918345332145691
epoch: 2 step: 340, loss is 1.7349798679351807
epoch: 2 step: 341, loss is 1.831843376159668
epoch: 2 step: 343, loss is 1.8684601783752441
epoch: 2 step: 344, loss is 1.7865076065063477
epoch: 2 step: 345, loss is 1.7199623584747314
epoch: 2 step: 346, loss is 1.7887533903121948
epoch: 2 step: 347, loss is 1.92432701587677
epoch: 2 step: 348, loss is 1.7560091018676758
epoch: 2 step: 349, loss is 1.8518497943878174
epoch: 2 step: 350, loss is 1.7869535684585571
epoch: 2 step: 351, loss is 1.7211872339248657
epoch: 2 step: 352, loss is 1.985832691192627
epoch: 2 step: 353, loss is 1.8105260133743286
epoch: 2 step: 354, loss is 1.835160255432129
epoch: 2 step: 355, loss is 1.7800557613372803
epoch: 2 step: 356, loss is 1.7522625923156738
epoch: 2 step: 357, loss is 1.7488387823104858
epoch: 2 step: 358, loss is 1.7606761455535889
epoch: 2 step: 359, loss is 1.8526567220687866
epoch: 2 step: 360, loss is 1.7670416831970215
epoch: 2 step: 361, loss is 1.8308148384094238
epoch: 2 step: 362, loss is 1.7068452835083008
epoch: 2 step: 363, loss is 1.8467777967453003
epoch: 2 step: 364, loss is 1.7617813348770142
epoch: 2 step: 365, loss is 1.8768242597579956
epoch: 2 step: 366, loss is 1.7613787651062012
epoch: 2 step: 367, loss is 1.670498251914978
epoch: 2 step: 368, loss is 1.785229206085205
epoch: 2 step: 369, loss is 1.794966459274292
epoch: 2 step: 370, loss is 1.78444504737854
epoch: 2 step: 371, loss is 1.8249762058258057
epoch: 2 step: 372, loss is 1.7419517040252686
epoch: 2 step: 373, loss is 1.7378820180892944
epoch: 2 step: 374, loss is 1.8318363428115845
epoch: 2 step: 375, loss is 1.8721061944961548
epoch: 2 step: 376, loss is 1.6173243522644043
epoch: 2 step: 377, loss is 1.7758114337921143
epoch: 2 step: 378, loss is 1.756319522857666
epoch: 2 step: 379, loss is 1.6763895750045776
epoch: 2 step: 380, loss is 1.7565263509750366
epoch: 2 step: 381, loss is 1.705394983291626
epoch: 2 step: 382, loss is 1.7143198251724243
epoch: 2 step: 383, loss is 1.6517314910888672
epoch: 2 step: 384, loss is 1.722870111465454
epoch: 2 step: 385, loss is 1.7791318893432617
epoch: 2 step: 386, loss is 1.7326841354370117
epoch: 2 step: 387, loss is 1.7197530269622803
epoch: 2 step: 388, loss is 1.6298965215682983
epoch: 2 step: 389, loss is 1.8236207962036133
epoch: 2 step: 390, loss is 1.7148630619049072
Train epoch time: 155993.727 ms, per step time: 399.984 ms
epoch: 3 step: 1, loss is 1.7641422748565674
epoch: 3 step: 2, loss is 1.7547181844711304
epoch: 3 step: 3, loss is 1.793548345565796
epoch: 3 step: 4, loss is 1.8427743911743164
epoch: 3 step: 5, loss is 1.6967132091522217
epoch: 3 step: 6, loss is 1.750718355178833
epoch: 3 step: 7, loss is 1.75998055934906
epoch: 3 step: 8, loss is 1.760133981704712
epoch: 3 step: 9, loss is 1.7812654972076416
epoch: 3 step: 10, loss is 1.716094732284546
epoch: 3 step: 11, loss is 1.7356353998184204
epoch: 3 step: 12, loss is 1.729407787322998
epoch: 3 step: 13, loss is 1.7473516464233398
epoch: 3 step: 14, loss is 1.7516013383865356
epoch: 3 step: 15, loss is 1.7336506843566895
epoch: 3 step: 16, loss is 1.7298648357391357
epoch: 3 step: 17, loss is 1.8397022485733032
epoch: 3 step: 18, loss is 1.824225664138794
epoch: 3 step: 19, loss is 1.7612080574035645
epoch: 3 step: 20, loss is 1.8021020889282227
epoch: 3 step: 21, loss is 1.7924782037734985
epoch: 3 step: 22, loss is 1.820109248161316
epoch: 3 step: 23, loss is 1.8755810260772705
epoch: 3 step: 24, loss is 1.8197954893112183
epoch: 3 step: 25, loss is 1.761940360069275
epoch: 3 step: 26, loss is 1.8842095136642456
epoch: 3 step: 27, loss is 1.7360103130340576
epoch: 3 step: 28, loss is 1.8140599727630615
epoch: 3 step: 29, loss is 1.7043712139129639
epoch: 3 step: 30, loss is 1.7635387182235718
epoch: 3 step: 31, loss is 1.8017815351486206
epoch: 3 step: 32, loss is 1.720243215560913
epoch: 3 step: 33, loss is 1.8202407360076904
epoch: 3 step: 34, loss is 1.775327444076538
epoch: 3 step: 35, loss is 1.674959659576416
epoch: 3 step: 36, loss is 1.562670111656189
epoch: 3 step: 37, loss is 1.7781317234039307
epoch: 3 step: 38, loss is 1.7011034488677979
epoch: 3 step: 39, loss is 1.8195388317108154
epoch: 3 step: 40, loss is 1.6667931079864502
epoch: 3 step: 41, loss is 1.7732675075531006
epoch: 3 step: 42, loss is 1.8425345420837402
epoch: 3 step: 43, loss is 1.6280186176300049
epoch: 3 step: 44, loss is 1.7534148693084717
epoch: 3 step: 45, loss is 1.749954104423523
epoch: 3 step: 46, loss is 1.7335509061813354
epoch: 3 step: 47, loss is 1.8067073822021484
epoch: 3 step: 48, loss is 1.6626200675964355
epoch: 3 step: 49, loss is 1.715445637702942
epoch: 3 step: 50, loss is 1.762415885925293
epoch: 3 step: 51, loss is 1.7296688556671143
epoch: 3 step: 52, loss is 1.7437942028045654
epoch: 3 step: 53, loss is 1.731560468673706
epoch: 3 step: 54, loss is 1.8204193115234375
epoch: 3 step: 55, loss is 1.7309167385101318
epoch: 3 step: 56, loss is 1.7726114988327026
epoch: 3 step: 57, loss is 1.681417465209961
epoch: 3 step: 58, loss is 1.7564997673034668
epoch: 3 step: 59, loss is 1.6605130434036255
epoch: 3 step: 60, loss is 1.7276595830917358
epoch: 3 step: 61, loss is 1.7239662408828735
epoch: 3 step: 62, loss is 1.796515703201294
epoch: 3 step: 63, loss is 1.6727538108825684
epoch: 3 step: 64, loss is 1.7312177419662476
epoch: 3 step: 65, loss is 1.7125099897384644
epoch: 3 step: 66, loss is 1.6296565532684326
epoch: 3 step: 67, loss is 1.7298835515975952
epoch: 3 step: 68, loss is 1.6555956602096558
epoch: 3 step: 69, loss is 1.7559605836868286
epoch: 3 step: 70, loss is 1.802220344543457
epoch: 3 step: 71, loss is 1.8298285007476807
epoch: 3 step: 72, loss is 1.6363513469696045
epoch: 3 step: 73, loss is 1.7824245691299438
epoch: 3 step: 74, loss is 1.7274856567382812
epoch: 3 step: 75, loss is 1.6231114864349365
epoch: 3 step: 76, loss is 1.6169097423553467
epoch: 3 step: 77, loss is 1.7331278324127197
epoch: 3 step: 78, loss is 1.9007054567337036
epoch: 3 step: 79, loss is 1.5682278871536255
epoch: 3 step: 80, loss is 1.6923885345458984
epoch: 3 step: 81, loss is 1.7956039905548096
epoch: 3 step: 82, loss is 1.7601070404052734
epoch: 3 step: 83, loss is 1.7837741374969482
epoch: 3 step: 84, loss is 1.7951622009277344
epoch: 3 step: 85, loss is 1.8333876132965088
epoch: 3 step: 86, loss is 1.819688320159912
epoch: 3 step: 87, loss is 1.6919232606887817
epoch: 3 step: 88, loss is 1.621659278869629
epoch: 3 step: 89, loss is 1.7886335849761963
epoch: 3 step: 90, loss is 1.751739263534546
epoch: 3 step: 91, loss is 1.6526401042938232
epoch: 3 step: 92, loss is 1.7481526136398315
epoch: 3 step: 93, loss is 1.8013017177581787
epoch: 3 step: 94, loss is 1.6293811798095703
epoch: 3 step: 95, loss is 1.578917384147644
epoch: 3 step: 96, loss is 1.7730318307876587
epoch: 3 step: 97, loss is 1.7906396389007568
epoch: 3 step: 98, loss is 1.7818952798843384
epoch: 3 step: 99, loss is 1.8343371152877808
epoch: 3 step: 100, loss is 1.8384268283843994
epoch: 3 step: 101, loss is 1.7740890979766846
epoch: 3 step: 102, loss is 1.693620204925537
epoch: 3 step: 103, loss is 1.6320619583129883
epoch: 3 step: 104, loss is 1.758415699005127
epoch: 3 step: 105, loss is 1.732062578201294
epoch: 3 step: 106, loss is 1.7792495489120483
epoch: 3 step: 107, loss is 1.6865469217300415
epoch: 3 step: 108, loss is 1.830143690109253
epoch: 3 step: 109, loss is 1.7891284227371216
epoch: 3 step: 110, loss is 1.7002507448196411
epoch: 3 step: 111, loss is 1.76425302028656
epoch: 3 step: 112, loss is 1.7251293659210205
epoch: 3 step: 113, loss is 1.7241863012313843
epoch: 3 step: 114, loss is 1.758310079574585
epoch: 3 step: 115, loss is 1.6906688213348389
epoch: 3 step: 116, loss is 1.6078766584396362
epoch: 3 step: 117, loss is 1.7742276191711426
epoch: 3 step: 118, loss is 1.7505218982696533
epoch: 3 step: 119, loss is 1.8065091371536255
epoch: 3 step: 120, loss is 1.77422297000885
epoch: 3 step: 121, loss is 1.6769416332244873
epoch: 3 step: 122, loss is 1.6930099725723267
epoch: 3 step: 123, loss is 1.721684217453003
epoch: 3 step: 124, loss is 1.8611409664154053
epoch: 3 step: 125, loss is 1.7026392221450806
epoch: 3 step: 126, loss is 1.73283851146698
epoch: 3 step: 127, loss is 1.6186188459396362
epoch: 3 step: 128, loss is 1.7966184616088867
epoch: 3 step: 129, loss is 1.65543794631958
epoch: 3 step: 130, loss is 1.7405024766921997
epoch: 3 step: 131, loss is 1.8584294319152832
epoch: 3 step: 132, loss is 1.7835478782653809
epoch: 3 step: 133, loss is 1.8870224952697754
epoch: 3 step: 134, loss is 1.8632557392120361
epoch: 3 step: 135, loss is 1.8035264015197754
epoch: 3 step: 136, loss is 1.7724859714508057
epoch: 3 step: 137, loss is 1.7620857954025269
epoch: 3 step: 138, loss is 1.7977924346923828
epoch: 3 step: 139, loss is 1.7587790489196777
epoch: 3 step: 140, loss is 1.781812310218811
epoch: 3 step: 141, loss is 1.843968391418457
epoch: 3 step: 142, loss is 1.7268784046173096
epoch: 3 step: 143, loss is 1.6952234506607056
epoch: 3 step: 144, loss is 1.6672649383544922
epoch: 3 step: 145, loss is 1.7690939903259277
epoch: 3 step: 146, loss is 1.8385887145996094
epoch: 3 step: 147, loss is 1.705284833908081
epoch: 3 step: 148, loss is 1.669526219367981
epoch: 3 step: 149, loss is 1.7944085597991943
epoch: 3 step: 150, loss is 1.7651945352554321
epoch: 3 step: 151, loss is 1.6766066551208496
epoch: 3 step: 152, loss is 1.5969195365905762
epoch: 3 step: 153, loss is 1.6448216438293457
epoch: 3 step: 154, loss is 1.751434564590454
epoch: 3 step: 155, loss is 1.7247257232666016
epoch: 3 step: 156, loss is 1.7154661417007446
epoch: 3 step: 157, loss is 1.7714577913284302
epoch: 3 step: 158, loss is 1.7265665531158447
epoch: 3 step: 159, loss is 1.7543656826019287
epoch: 3 step: 160, loss is 1.8084063529968262
epoch: 3 step: 161, loss is 1.705984115600586
epoch: 3 step: 162, loss is 1.7250216007232666
epoch: 3 step: 163, loss is 1.6781470775604248
epoch: 3 step: 164, loss is 1.7092280387878418
epoch: 3 step: 165, loss is 1.6585453748703003
epoch: 3 step: 166, loss is 1.8353822231292725
epoch: 3 step: 167, loss is 1.827893853187561
epoch: 3 step: 168, loss is 1.672469139099121
epoch: 3 step: 169, loss is 1.7827669382095337
epoch: 3 step: 170, loss is 1.6130626201629639
epoch: 3 step: 171, loss is 1.6939573287963867
epoch: 3 step: 172, loss is 1.724724531173706
epoch: 3 step: 173, loss is 1.8338401317596436
epoch: 3 step: 174, loss is 1.7745299339294434
epoch: 3 step: 175, loss is 1.8342199325561523
epoch: 3 step: 176, loss is 1.8572815656661987
epoch: 3 step: 177, loss is 1.6647486686706543
epoch: 3 step: 178, loss is 1.7664051055908203
epoch: 3 step: 179, loss is 1.7424143552780151
epoch: 3 step: 180, loss is 1.7544529438018799
epoch: 3 step: 181, loss is 1.7419337034225464
epoch: 3 step: 182, loss is 1.7064985036849976
epoch: 3 step: 183, loss is 1.7526941299438477
epoch: 3 step: 184, loss is 1.6765990257263184
epoch: 3 step: 185, loss is 1.7984998226165771
epoch: 3 step: 186, loss is 1.8013088703155518
epoch: 3 step: 187, loss is 1.8446683883666992
epoch: 3 step: 188, loss is 1.695939302444458
epoch: 3 step: 189, loss is 1.5769917964935303
epoch: 3 step: 190, loss is 1.765929937362671
epoch: 3 step: 191, loss is 1.760894775390625
epoch: 3 step: 192, loss is 1.765155553817749
epoch: 3 step: 193, loss is 1.6571260690689087
epoch: 3 step: 194, loss is 1.782891035079956
epoch: 3 step: 195, loss is 1.822501301765442
epoch: 3 step: 196, loss is 1.685598611831665
epoch: 3 step: 197, loss is 1.7144808769226074
epoch: 3 step: 198, loss is 1.7803773880004883
epoch: 3 step: 199, loss is 1.6685271263122559
epoch: 3 step: 200, loss is 1.7109404802322388
epoch: 3 step: 201, loss is 1.7236040830612183
epoch: 3 step: 202, loss is 1.6281933784484863
epoch: 3 step: 203, loss is 1.6821147203445435
epoch: 3 step: 204, loss is 1.6687819957733154
epoch: 3 step: 205, loss is 1.7789862155914307
epoch: 3 step: 206, loss is 1.769896149635315
epoch: 3 step: 207, loss is 1.823211908340454
epoch: 3 step: 208, loss is 1.6506959199905396
epoch: 3 step: 209, loss is 1.5910999774932861
epoch: 3 step: 210, loss is 1.636285662651062
epoch: 3 step: 211, loss is 1.734756588935852
epoch: 3 step: 212, loss is 1.6787893772125244
epoch: 3 step: 213, loss is 1.8897600173950195
epoch: 3 step: 214, loss is 1.7878351211547852
epoch: 3 step: 215, loss is 1.809739589691162
epoch: 3 step: 216, loss is 1.618238925933838
epoch: 3 step: 217, loss is 1.835951328277588
epoch: 3 step: 218, loss is 1.7431846857070923
epoch: 3 step: 219, loss is 1.777294635772705
epoch: 3 step: 220, loss is 1.7508474588394165
epoch: 3 step: 221, loss is 1.7766541242599487
epoch: 3 step: 222, loss is 1.7049247026443481
epoch: 3 step: 223, loss is 1.681959867477417
epoch: 3 step: 224, loss is 1.8347787857055664
epoch: 3 step: 225, loss is 1.7920269966125488
epoch: 3 step: 226, loss is 1.6991766691207886
epoch: 3 step: 227, loss is 1.6482125520706177
epoch: 3 step: 228, loss is 1.7189157009124756
epoch: 3 step: 229, loss is 1.6653929948806763
epoch: 3 step: 230, loss is 1.6467382907867432
epoch: 3 step: 231, loss is 1.7463375329971313
epoch: 3 step: 232, loss is 1.7395448684692383
epoch: 3 step: 233, loss is 1.5935211181640625
epoch: 3 step: 234, loss is 1.6529710292816162
epoch: 3 step: 235, loss is 1.6924574375152588
epoch: 3 step: 236, loss is 1.782355785369873
epoch: 3 step: 237, loss is 1.7127463817596436
epoch: 3 step: 238, loss is 1.688645362854004
epoch: 3 step: 239, loss is 1.7146425247192383
epoch: 3 step: 240, loss is 1.7599706649780273
epoch: 3 step: 241, loss is 1.804896593093872
epoch: 3 step: 242, loss is 1.704947829246521
epoch: 3 step: 243, loss is 1.6239776611328125
epoch: 3 step: 244, loss is 1.700961947441101
epoch: 3 step: 245, loss is 1.6744734048843384
epoch: 3 step: 246, loss is 1.7533576488494873
epoch: 3 step: 247, loss is 1.7620048522949219
epoch: 3 step: 248, loss is 1.6602070331573486
epoch: 3 step: 249, loss is 1.7579090595245361
epoch: 3 step: 250, loss is 1.8472994565963745
epoch: 3 step: 251, loss is 1.7926963567733765
epoch: 3 step: 252, loss is 1.6725696325302124
epoch: 3 step: 253, loss is 1.7283315658569336
epoch: 3 step: 254, loss is 1.6876342296600342
epoch: 3 step: 255, loss is 1.685721755027771
epoch: 3 step: 256, loss is 1.781528115272522
epoch: 3 step: 257, loss is 1.6120579242706299
epoch: 3 step: 258, loss is 1.741850733757019
epoch: 3 step: 259, loss is 1.7349295616149902
epoch: 3 step: 260, loss is 1.6147620677947998
epoch: 3 step: 261, loss is 1.7662999629974365
epoch: 3 step: 262, loss is 1.570710301399231
epoch: 3 step: 263, loss is 1.7080087661743164
epoch: 3 step: 264, loss is 1.7875734567642212
epoch: 3 step: 265, loss is 1.7477959394454956
epoch: 3 step: 266, loss is 1.6720731258392334
epoch: 3 step: 267, loss is 1.6743179559707642
epoch: 3 step: 268, loss is 1.7317537069320679
epoch: 3 step: 269, loss is 1.6166419982910156
epoch: 3 step: 270, loss is 1.703937292098999
epoch: 3 step: 271, loss is 1.7867095470428467
epoch: 3 step: 272, loss is 1.6708333492279053
epoch: 3 step: 273, loss is 1.6910622119903564
epoch: 3 step: 274, loss is 1.8313789367675781
epoch: 3 step: 275, loss is 1.734154462814331
epoch: 3 step: 276, loss is 1.5862051248550415
epoch: 3 step: 277, loss is 1.842326283454895
epoch: 3 step: 278, loss is 1.7333091497421265
epoch: 3 step: 279, loss is 1.6914715766906738
epoch: 3 step: 280, loss is 1.7590086460113525
epoch: 3 step: 281, loss is 1.782589316368103
epoch: 3 step: 282, loss is 1.671002745628357
epoch: 3 step: 283, loss is 1.7889219522476196
epoch: 3 step: 284, loss is 1.7865937948226929
epoch: 3 step: 285, loss is 1.840965986251831
epoch: 3 step: 286, loss is 1.6704823970794678
epoch: 3 step: 287, loss is 1.7697992324829102
epoch: 3 step: 288, loss is 1.796462893486023
epoch: 3 step: 289, loss is 1.6941657066345215
epoch: 3 step: 290, loss is 1.7072052955627441
epoch: 3 step: 291, loss is 1.7631559371948242
epoch: 3 step: 292, loss is 1.766723394393921
epoch: 3 step: 293, loss is 1.8078240156173706
epoch: 3 step: 294, loss is 1.7114737033843994
epoch: 3 step: 295, loss is 1.6542181968688965
epoch: 3 step: 296, loss is 1.7234742641448975
epoch: 3 step: 297, loss is 1.753818154335022
epoch: 3 step: 298, loss is 1.6865930557250977
epoch: 3 step: 299, loss is 1.7363752126693726
epoch: 3 step: 300, loss is 1.5910722017288208
epoch: 3 step: 301, loss is 1.6719822883605957
epoch: 3 step: 302, loss is 1.6721827983856201
epoch: 3 step: 303, loss is 1.6900347471237183
epoch: 3 step: 304, loss is 1.716599702835083
epoch: 3 step: 305, loss is 1.6665544509887695
epoch: 3 step: 306, loss is 1.6840133666992188
epoch: 3 step: 307, loss is 1.6914595365524292
epoch: 3 step: 308, loss is 1.6977345943450928
epoch: 3 step: 309, loss is 1.7174962759017944
epoch: 3 step: 310, loss is 1.7119249105453491
epoch: 3 step: 311, loss is 1.7550396919250488
epoch: 3 step: 312, loss is 1.6406378746032715
epoch: 3 step: 313, loss is 1.6643996238708496
epoch: 3 step: 314, loss is 1.6515495777130127
epoch: 3 step: 315, loss is 1.7749066352844238
epoch: 3 step: 316, loss is 1.6713601350784302
epoch: 3 step: 317, loss is 1.6499664783477783
epoch: 3 step: 318, loss is 1.6136360168457031
epoch: 3 step: 319, loss is 1.635528564453125
epoch: 3 step: 320, loss is 1.7853888273239136
epoch: 3 step: 321, loss is 1.6467822790145874
epoch: 3 step: 322, loss is 1.61776864528656
epoch: 3 step: 323, loss is 1.6731338500976562
epoch: 3 step: 324, loss is 1.6838327646255493
epoch: 3 step: 325, loss is 1.5910794734954834
epoch: 3 step: 326, loss is 1.6860685348510742
epoch: 3 step: 327, loss is 1.6163591146469116
epoch: 3 step: 328, loss is 1.8347625732421875
epoch: 3 step: 329, loss is 1.6064834594726562
epoch: 3 step: 330, loss is 1.7510273456573486
epoch: 3 step: 331, loss is 1.7274161577224731
epoch: 3 step: 332, loss is 1.8178198337554932
epoch: 3 step: 333, loss is 1.7871026992797852
epoch: 3 step: 334, loss is 1.6706526279449463
epoch: 3 step: 335, loss is 1.6998460292816162
epoch: 3 step: 336, loss is 1.7630598545074463
epoch: 3 step: 337, loss is 1.6557632684707642
epoch: 3 step: 338, loss is 1.7528998851776123
epoch: 3 step: 339, loss is 1.7049301862716675
epoch: 3 step: 340, loss is 1.594982385635376
epoch: 3 step: 341, loss is 1.772371530532837
epoch: 3 step: 342, loss is 1.685744047164917
epoch: 3 step: 343, loss is 1.731490969657898
epoch: 3 step: 344, loss is 1.6554176807403564
epoch: 3 step: 345, loss is 1.8010971546173096
epoch: 3 step: 346, loss is 1.5836352109909058
epoch: 3 step: 347, loss is 1.7499160766601562
epoch: 3 step: 348, loss is 1.6829849481582642
epoch: 3 step: 349, loss is 1.5817031860351562
epoch: 3 step: 350, loss is 1.703901767730713
epoch: 3 step: 351, loss is 1.8326001167297363
epoch: 3 step: 352, loss is 1.7085936069488525
epoch: 3 step: 353, loss is 1.6891402006149292
epoch: 3 step: 354, loss is 1.7434598207473755
epoch: 3 step: 355, loss is 1.677656888961792
epoch: 3 step: 356, loss is 1.8194599151611328
epoch: 3 step: 357, loss is 1.6406285762786865
epoch: 3 step: 358, loss is 1.6195244789123535
epoch: 3 step: 359, loss is 1.5961867570877075
epoch: 3 step: 360, loss is 1.5857040882110596
epoch: 3 step: 361, loss is 1.6756489276885986
epoch: 3 step: 362, loss is 1.6826366186141968
epoch: 3 step: 363, loss is 1.695976734161377
epoch: 3 step: 364, loss is 1.7368282079696655
epoch: 3 step: 365, loss is 1.6196650266647339
epoch: 3 step: 366, loss is 1.6286401748657227
epoch: 3 step: 367, loss is 1.5430071353912354
epoch: 3 step: 368, loss is 1.5972182750701904
epoch: 3 step: 369, loss is 1.7685972452163696
epoch: 3 step: 370, loss is 1.7089072465896606
epoch: 3 step: 371, loss is 1.6556273698806763
epoch: 3 step: 372, loss is 1.637276291847229
epoch: 3 step: 373, loss is 1.7345547676086426
epoch: 3 step: 374, loss is 1.701845407485962
epoch: 3 step: 375, loss is 1.7591440677642822
epoch: 3 step: 376, loss is 1.6914732456207275
epoch: 3 step: 377, loss is 1.7411625385284424
epoch: 3 step: 378, loss is 1.6531426906585693
epoch: 3 step: 379, loss is 1.688938856124878
epoch: 3 step: 380, loss is 1.7634522914886475
epoch: 3 step: 381, loss is 1.7102078199386597
epoch: 3 step: 382, loss is 1.653367280960083
epoch: 3 step: 383, loss is 1.7889350652694702
epoch: 3 step: 384, loss is 1.77799391746521
epoch: 3 step: 385, loss is 1.6914746761322021
epoch: 3 step: 386, loss is 1.7278087139129639
epoch: 3 step: 387, loss is 1.7500312328338623
epoch: 3 step: 388, loss is 1.5245214700698853
epoch: 3 step: 389, loss is 1.574200987815857
epoch: 3 step: 390, loss is 1.750159502029419
Train epoch time: 145058.088 ms, per step time: 371.944 ms
epoch: 4 step: 1, loss is 1.6695165634155273
epoch: 4 step: 2, loss is 1.7273000478744507
epoch: 4 step: 3, loss is 1.7329519987106323
epoch: 4 step: 4, loss is 1.7230921983718872
epoch: 4 step: 5, loss is 1.5585060119628906
epoch: 4 step: 6, loss is 1.645129680633545
epoch: 4 step: 7, loss is 1.8048557043075562
epoch: 4 step: 8, loss is 1.669471263885498
epoch: 4 step: 9, loss is 1.7026921510696411
epoch: 4 step: 10, loss is 1.7005460262298584
epoch: 4 step: 11, loss is 1.570026159286499
epoch: 4 step: 12, loss is 1.6120418310165405
epoch: 4 step: 13, loss is 1.7042216062545776
epoch: 4 step: 14, loss is 1.6466180086135864
epoch: 4 step: 15, loss is 1.656440019607544
epoch: 4 step: 16, loss is 1.638288974761963
epoch: 4 step: 17, loss is 1.7356566190719604
epoch: 4 step: 18, loss is 1.6732922792434692
epoch: 4 step: 19, loss is 1.8225958347320557
epoch: 4 step: 20, loss is 1.6666510105133057
epoch: 4 step: 21, loss is 1.7109061479568481
epoch: 4 step: 22, loss is 1.6872361898422241
epoch: 4 step: 23, loss is 1.656557559967041
epoch: 4 step: 24, loss is 1.7234022617340088
epoch: 4 step: 25, loss is 1.6205995082855225
epoch: 4 step: 26, loss is 1.791795015335083
epoch: 4 step: 27, loss is 1.8158366680145264
epoch: 4 step: 28, loss is 1.72850501537323
epoch: 4 step: 29, loss is 1.7563163042068481
epoch: 4 step: 30, loss is 1.7978169918060303
epoch: 4 step: 31, loss is 1.642251968383789
epoch: 4 step: 32, loss is 1.7088783979415894
epoch: 4 step: 33, loss is 1.7903128862380981
epoch: 4 step: 34, loss is 1.6428248882293701
epoch: 4 step: 35, loss is 1.5747734308242798
epoch: 4 step: 36, loss is 1.730666160583496
epoch: 4 step: 37, loss is 1.752526879310608
epoch: 4 step: 38, loss is 1.5840187072753906
epoch: 4 step: 39, loss is 1.6418838500976562
epoch: 4 step: 40, loss is 1.6454088687896729
epoch: 4 step: 41, loss is 1.519988775253296
epoch: 4 step: 42, loss is 1.667872667312622
epoch: 4 step: 43, loss is 1.6414804458618164
epoch: 4 step: 44, loss is 1.5913159847259521
epoch: 4 step: 45, loss is 1.6626425981521606
epoch: 4 step: 46, loss is 1.6450990438461304
epoch: 4 step: 47, loss is 1.647599697113037
epoch: 4 step: 48, loss is 1.5927698612213135
epoch: 4 step: 49, loss is 1.7867192029953003
epoch: 4 step: 50, loss is 1.539243459701538
epoch: 4 step: 51, loss is 1.6675819158554077
epoch: 4 step: 52, loss is 1.6236523389816284
epoch: 4 step: 53, loss is 1.6589174270629883
epoch: 4 step: 54, loss is 1.6060881614685059
epoch: 4 step: 55, loss is 1.6542298793792725
epoch: 4 step: 56, loss is 1.771117925643921
epoch: 4 step: 57, loss is 1.736961007118225
epoch: 4 step: 58, loss is 1.6171984672546387
epoch: 4 step: 59, loss is 1.8743512630462646
epoch: 4 step: 60, loss is 1.6800825595855713
epoch: 4 step: 61, loss is 1.666163682937622
epoch: 4 step: 62, loss is 1.6979151964187622
epoch: 4 step: 63, loss is 1.6055777072906494
epoch: 4 step: 64, loss is 1.5906095504760742
epoch: 4 step: 65, loss is 1.757987380027771
epoch: 4 step: 66, loss is 1.6081726551055908
epoch: 4 step: 67, loss is 1.676893949508667
epoch: 4 step: 68, loss is 1.7651326656341553
epoch: 4 step: 69, loss is 1.8206675052642822
epoch: 4 step: 70, loss is 1.661590337753296
epoch: 4 step: 71, loss is 1.6335906982421875
epoch: 4 step: 72, loss is 1.764451026916504
epoch: 4 step: 73, loss is 1.7256431579589844
epoch: 4 step: 74, loss is 1.6364463567733765
epoch: 4 step: 75, loss is 1.7188787460327148
epoch: 4 step: 76, loss is 1.6673758029937744
epoch: 4 step: 77, loss is 1.664868950843811
epoch: 4 step: 78, loss is 1.6314384937286377
epoch: 4 step: 79, loss is 1.763289451599121
epoch: 4 step: 80, loss is 1.6337754726409912
epoch: 4 step: 81, loss is 1.6575185060501099
epoch: 4 step: 82, loss is 1.622018575668335
epoch: 4 step: 83, loss is 1.6691434383392334
epoch: 4 step: 84, loss is 1.6801186800003052
epoch: 4 step: 85, loss is 1.7004921436309814
epoch: 4 step: 86, loss is 1.5824577808380127
epoch: 4 step: 87, loss is 1.6731836795806885
epoch: 4 step: 88, loss is 1.7173842191696167
epoch: 4 step: 89, loss is 1.6667479276657104
epoch: 4 step: 90, loss is 1.6506376266479492
epoch: 4 step: 91, loss is 1.6161320209503174
epoch: 4 step: 92, loss is 1.72096848487854
epoch: 4 step: 93, loss is 1.6653797626495361
epoch: 4 step: 94, loss is 1.5721209049224854
epoch: 4 step: 95, loss is 1.8106515407562256
epoch: 4 step: 96, loss is 1.631089448928833
epoch: 4 step: 97, loss is 1.6779348850250244
epoch: 4 step: 98, loss is 1.738546371459961
epoch: 4 step: 99, loss is 1.5996187925338745
epoch: 4 step: 100, loss is 1.6185498237609863
epoch: 4 step: 101, loss is 1.7036750316619873
epoch: 4 step: 102, loss is 1.6072015762329102
epoch: 4 step: 103, loss is 1.5683672428131104
epoch: 4 step: 104, loss is 1.573211431503296
epoch: 4 step: 105, loss is 1.6568917036056519
epoch: 4 step: 106, loss is 1.6942692995071411
epoch: 4 step: 107, loss is 1.701817274093628
epoch: 4 step: 108, loss is 1.6250083446502686
epoch: 4 step: 109, loss is 1.6087796688079834
epoch: 4 step: 110, loss is 1.7560656070709229
epoch: 4 step: 111, loss is 1.6171553134918213
epoch: 4 step: 112, loss is 1.53173828125
epoch: 4 step: 113, loss is 1.6244738101959229
epoch: 4 step: 114, loss is 1.6011207103729248
epoch: 4 step: 115, loss is 1.5669540166854858
epoch: 4 step: 116, loss is 1.7150381803512573
epoch: 4 step: 117, loss is 1.6521196365356445
epoch: 4 step: 118, loss is 1.6225037574768066
epoch: 4 step: 119, loss is 1.683433175086975
epoch: 4 step: 120, loss is 1.6779825687408447
epoch: 4 step: 121, loss is 1.7159409523010254
epoch: 4 step: 122, loss is 1.6259791851043701
epoch: 4 step: 123, loss is 1.7155523300170898
epoch: 4 step: 124, loss is 1.60838782787323
epoch: 4 step: 125, loss is 1.6821777820587158
epoch: 4 step: 126, loss is 1.6594120264053345
epoch: 4 step: 127, loss is 1.6180691719055176
epoch: 4 step: 128, loss is 1.6326568126678467
epoch: 4 step: 129, loss is 1.5710145235061646
epoch: 4 step: 130, loss is 1.600823163986206
epoch: 4 step: 131, loss is 1.630587100982666
epoch: 4 step: 132, loss is 1.767482042312622
epoch: 4 step: 133, loss is 1.540154218673706
epoch: 4 step: 134, loss is 1.6062562465667725
epoch: 4 step: 135, loss is 1.6783620119094849
epoch: 4 step: 136, loss is 1.7135610580444336
epoch: 4 step: 137, loss is 1.4579838514328003
epoch: 4 step: 138, loss is 1.6518511772155762
epoch: 4 step: 139, loss is 1.7175908088684082
epoch: 4 step: 140, loss is 1.7394475936889648
epoch: 4 step: 141, loss is 1.5693387985229492
epoch: 4 step: 142, loss is 1.6008765697479248
epoch: 4 step: 143, loss is 1.5412213802337646
epoch: 4 step: 144, loss is 1.6418453454971313
epoch: 4 step: 145, loss is 1.4967046976089478
epoch: 4 step: 146, loss is 1.7385708093643188
epoch: 4 step: 147, loss is 1.7062017917633057
epoch: 4 step: 148, loss is 1.617221474647522
epoch: 4 step: 149, loss is 1.6853370666503906
epoch: 4 step: 150, loss is 1.5452866554260254
epoch: 4 step: 151, loss is 1.755713939666748
epoch: 4 step: 152, loss is 1.7040354013442993
epoch: 4 step: 153, loss is 1.617229700088501
epoch: 4 step: 154, loss is 1.7074425220489502
epoch: 4 step: 155, loss is 1.7260022163391113
epoch: 4 step: 156, loss is 1.6384129524230957
epoch: 4 step: 157, loss is 1.6073106527328491
epoch: 4 step: 158, loss is 1.5754611492156982
epoch: 4 step: 159, loss is 1.715348243713379
epoch: 4 step: 160, loss is 1.7685743570327759
epoch: 4 step: 161, loss is 1.6414381265640259
epoch: 4 step: 162, loss is 1.7249082326889038
epoch: 4 step: 163, loss is 1.6838865280151367
epoch: 4 step: 164, loss is 1.7254281044006348
epoch: 4 step: 165, loss is 1.6781275272369385
epoch: 4 step: 166, loss is 1.702733039855957
epoch: 4 step: 167, loss is 1.6596628427505493
epoch: 4 step: 168, loss is 1.610282301902771
epoch: 4 step: 169, loss is 1.6470028162002563
epoch: 4 step: 170, loss is 1.611755132675171
epoch: 4 step: 171, loss is 1.6413168907165527
epoch: 4 step: 172, loss is 1.8116881847381592
epoch: 4 step: 173, loss is 1.6114356517791748
epoch: 4 step: 174, loss is 1.7080440521240234
epoch: 4 step: 175, loss is 1.7400500774383545
epoch: 4 step: 176, loss is 1.6361223459243774
epoch: 4 step: 177, loss is 1.6080799102783203
epoch: 4 step: 178, loss is 1.7389817237854004
epoch: 4 step: 179, loss is 1.6394613981246948
epoch: 4 step: 180, loss is 1.6609821319580078
epoch: 4 step: 181, loss is 1.7083775997161865
epoch: 4 step: 182, loss is 1.6260814666748047
epoch: 4 step: 183, loss is 1.670709490776062
epoch: 4 step: 184, loss is 1.733322024345398
epoch: 4 step: 185, loss is 1.6730746030807495
epoch: 4 step: 186, loss is 1.7290054559707642
epoch: 4 step: 187, loss is 1.6578032970428467
epoch: 4 step: 188, loss is 1.607759714126587
epoch: 4 step: 189, loss is 1.6292657852172852
epoch: 4 step: 190, loss is 1.6871564388275146
epoch: 4 step: 191, loss is 1.6059832572937012
epoch: 4 step: 192, loss is 1.673574686050415
epoch: 4 step: 193, loss is 1.6916306018829346
epoch: 4 step: 194, loss is 1.6836265325546265
epoch: 4 step: 195, loss is 1.7154841423034668
epoch: 4 step: 196, loss is 1.7032926082611084
epoch: 4 step: 197, loss is 1.5874526500701904
epoch: 4 step: 198, loss is 1.5883681774139404
epoch: 4 step: 199, loss is 1.636406660079956
epoch: 4 step: 200, loss is 1.6678447723388672
epoch: 4 step: 201, loss is 1.506903886795044
epoch: 4 step: 202, loss is 1.694869041442871
epoch: 4 step: 203, loss is 1.7244055271148682
epoch: 4 step: 204, loss is 1.6359894275665283
epoch: 4 step: 205, loss is 1.63210928440094
epoch: 4 step: 206, loss is 1.6406002044677734
epoch: 4 step: 207, loss is 1.6306772232055664
epoch: 4 step: 208, loss is 1.6416488885879517
epoch: 4 step: 209, loss is 1.7348030805587769
epoch: 4 step: 210, loss is 1.7382646799087524
epoch: 4 step: 211, loss is 1.6398556232452393
epoch: 4 step: 212, loss is 1.7287929058074951
epoch: 4 step: 213, loss is 1.5922458171844482
epoch: 4 step: 214, loss is 1.7324360609054565
epoch: 4 step: 215, loss is 1.6540369987487793
epoch: 4 step: 216, loss is 1.6112148761749268
epoch: 4 step: 217, loss is 1.6772197484970093
epoch: 4 step: 218, loss is 1.848122477531433
epoch: 4 step: 219, loss is 1.764024257659912
epoch: 4 step: 220, loss is 1.563981533050537
epoch: 4 step: 221, loss is 1.5943589210510254
epoch: 4 step: 222, loss is 1.6494523286819458
epoch: 4 step: 223, loss is 1.6289910078048706
epoch: 4 step: 224, loss is 1.7430896759033203
epoch: 4 step: 225, loss is 1.6236622333526611
epoch: 4 step: 226, loss is 1.6899158954620361
epoch: 4 step: 227, loss is 1.6302130222320557
epoch: 4 step: 228, loss is 1.6347051858901978
epoch: 4 step: 229, loss is 1.5936694145202637
epoch: 4 step: 230, loss is 1.699882984161377
epoch: 4 step: 231, loss is 1.6149802207946777
epoch: 4 step: 232, loss is 1.6307373046875
epoch: 4 step: 233, loss is 1.660753846168518
epoch: 4 step: 234, loss is 1.6779448986053467
epoch: 4 step: 235, loss is 1.7419378757476807
epoch: 4 step: 236, loss is 1.646225094795227
epoch: 4 step: 237, loss is 1.566084623336792
epoch: 4 step: 238, loss is 1.611794114112854
epoch: 4 step: 239, loss is 1.6340889930725098
epoch: 4 step: 240, loss is 1.735528826713562
epoch: 4 step: 241, loss is 1.6432706117630005
epoch: 4 step: 242, loss is 1.6555874347686768
epoch: 4 step: 243, loss is 1.700577974319458
epoch: 4 step: 244, loss is 1.5956780910491943
epoch: 4 step: 245, loss is 1.6119542121887207
epoch: 4 step: 246, loss is 1.6775035858154297
epoch: 4 step: 247, loss is 1.6030528545379639
epoch: 4 step: 248, loss is 1.5934855937957764
epoch: 4 step: 249, loss is 1.6230729818344116
epoch: 4 step: 250, loss is 1.7012053728103638
epoch: 4 step: 251, loss is 1.5713801383972168
epoch: 4 step: 252, loss is 1.5452080965042114
epoch: 4 step: 253, loss is 1.6255402565002441
epoch: 4 step: 254, loss is 1.6258832216262817
epoch: 4 step: 255, loss is 1.6188931465148926
epoch: 4 step: 256, loss is 1.6053047180175781
epoch: 4 step: 257, loss is 1.6065144538879395
epoch: 4 step: 258, loss is 1.7241655588150024
epoch: 4 step: 259, loss is 1.6181094646453857
epoch: 4 step: 260, loss is 1.5777652263641357
epoch: 4 step: 261, loss is 1.6306815147399902
epoch: 4 step: 262, loss is 1.5533638000488281
epoch: 4 step: 263, loss is 1.6355679035186768
epoch: 4 step: 264, loss is 1.631014108657837
epoch: 4 step: 265, loss is 1.5684889554977417
epoch: 4 step: 266, loss is 1.6852974891662598
epoch: 4 step: 267, loss is 1.7343311309814453
epoch: 4 step: 268, loss is 1.6117439270019531
epoch: 4 step: 269, loss is 1.6637423038482666
epoch: 4 step: 270, loss is 1.6423864364624023
epoch: 4 step: 271, loss is 1.6112496852874756
epoch: 4 step: 272, loss is 1.6339552402496338
epoch: 4 step: 273, loss is 1.605186939239502
epoch: 4 step: 274, loss is 1.6171804666519165
epoch: 4 step: 275, loss is 1.6028566360473633
epoch: 4 step: 276, loss is 1.665071964263916
epoch: 4 step: 277, loss is 1.6269690990447998
epoch: 4 step: 278, loss is 1.5817041397094727
epoch: 4 step: 279, loss is 1.5939949750900269
epoch: 4 step: 280, loss is 1.6965526342391968
epoch: 4 step: 281, loss is 1.5738329887390137
epoch: 4 step: 282, loss is 1.6287593841552734
epoch: 4 step: 283, loss is 1.6668312549591064
epoch: 4 step: 284, loss is 1.6443724632263184
epoch: 4 step: 285, loss is 1.6746225357055664
epoch: 4 step: 286, loss is 1.5106184482574463
epoch: 4 step: 287, loss is 1.761210322380066
epoch: 4 step: 288, loss is 1.6763477325439453
epoch: 4 step: 289, loss is 1.6645381450653076
epoch: 4 step: 290, loss is 1.5973329544067383
epoch: 4 step: 291, loss is 1.5937137603759766
epoch: 4 step: 292, loss is 1.6388037204742432
epoch: 4 step: 293, loss is 1.687715768814087
epoch: 4 step: 294, loss is 1.7647030353546143
epoch: 4 step: 295, loss is 1.6563464403152466
epoch: 4 step: 296, loss is 1.6363365650177002
epoch: 4 step: 297, loss is 1.6472853422164917
epoch: 4 step: 298, loss is 1.6601572036743164
epoch: 4 step: 299, loss is 1.6924667358398438
epoch: 4 step: 300, loss is 1.6505777835845947
epoch: 4 step: 301, loss is 1.6166727542877197
epoch: 4 step: 302, loss is 1.7392905950546265
epoch: 4 step: 303, loss is 1.7075598239898682
epoch: 4 step: 304, loss is 1.6641656160354614
epoch: 4 step: 305, loss is 1.6933059692382812
epoch: 4 step: 306, loss is 1.5921763181686401
epoch: 4 step: 307, loss is 1.6138687133789062
epoch: 4 step: 308, loss is 1.711918592453003
epoch: 4 step: 309, loss is 1.5096725225448608
epoch: 4 step: 310, loss is 1.6413471698760986
epoch: 4 step: 311, loss is 1.6407179832458496
epoch: 4 step: 312, loss is 1.7054457664489746
epoch: 4 step: 313, loss is 1.6631145477294922
epoch: 4 step: 314, loss is 1.5111362934112549
epoch: 4 step: 315, loss is 1.536847472190857
epoch: 4 step: 316, loss is 1.5711177587509155
epoch: 4 step: 317, loss is 1.7044873237609863
epoch: 4 step: 318, loss is 1.621565580368042
epoch: 4 step: 319, loss is 1.5832188129425049
epoch: 4 step: 320, loss is 1.56040358543396
epoch: 4 step: 321, loss is 1.5922833681106567
epoch: 4 step: 322, loss is 1.6988639831542969
epoch: 4 step: 323, loss is 1.6180733442306519
epoch: 4 step: 324, loss is 1.682058334350586
epoch: 4 step: 325, loss is 1.5848283767700195
epoch: 4 step: 326, loss is 1.5157688856124878
epoch: 4 step: 327, loss is 1.7158801555633545
epoch: 4 step: 328, loss is 1.6353027820587158
epoch: 4 step: 329, loss is 1.879155158996582
epoch: 4 step: 330, loss is 1.5287706851959229
epoch: 4 step: 331, loss is 1.648926019668579
epoch: 4 step: 332, loss is 1.6658390760421753
epoch: 4 step: 333, loss is 1.6448297500610352
epoch: 4 step: 334, loss is 1.652221918106079
epoch: 4 step: 335, loss is 1.5456767082214355
epoch: 4 step: 336, loss is 1.5358079671859741
epoch: 4 step: 337, loss is 1.6559476852416992
epoch: 4 step: 338, loss is 1.6299281120300293
epoch: 4 step: 339, loss is 1.745874285697937
epoch: 4 step: 340, loss is 1.610398292541504
epoch: 4 step: 341, loss is 1.6859097480773926
epoch: 4 step: 342, loss is 1.6728047132492065
epoch: 4 step: 343, loss is 1.5981897115707397
epoch: 4 step: 344, loss is 1.7159134149551392
epoch: 4 step: 345, loss is 1.605245590209961
epoch: 4 step: 346, loss is 1.6806387901306152
epoch: 4 step: 347, loss is 1.538119912147522
epoch: 4 step: 348, loss is 1.586168646812439
epoch: 4 step: 349, loss is 1.6935168504714966
epoch: 4 step: 350, loss is 1.5582554340362549
epoch: 4 step: 351, loss is 1.6914149522781372
epoch: 4 step: 352, loss is 1.5586116313934326
epoch: 4 step: 353, loss is 1.6963200569152832
epoch: 4 step: 354, loss is 1.6159112453460693
epoch: 4 step: 355, loss is 1.6362673044204712
epoch: 4 step: 356, loss is 1.5766600370407104
epoch: 4 step: 357, loss is 1.6794461011886597
epoch: 4 step: 358, loss is 1.5064743757247925
epoch: 4 step: 359, loss is 1.7019294500350952
epoch: 4 step: 360, loss is 1.5504231452941895
epoch: 4 step: 361, loss is 1.6210225820541382
epoch: 4 step: 362, loss is 1.7284070253372192
epoch: 4 step: 363, loss is 1.6535985469818115
epoch: 4 step: 364, loss is 1.5663567781448364
epoch: 4 step: 365, loss is 1.7884958982467651
epoch: 4 step: 366, loss is 1.5132598876953125
epoch: 4 step: 367, loss is 1.7188857793807983
epoch: 4 step: 368, loss is 1.565704345703125
epoch: 4 step: 369, loss is 1.724658489227295
epoch: 4 step: 370, loss is 1.6204599142074585
epoch: 4 step: 371, loss is 1.6468414068222046
epoch: 4 step: 372, loss is 1.565652847290039
epoch: 4 step: 373, loss is 1.6791701316833496
epoch: 4 step: 374, loss is 1.5494015216827393
epoch: 4 step: 375, loss is 1.5870392322540283
epoch: 4 step: 376, loss is 1.688801884651184
epoch: 4 step: 377, loss is 1.6855888366699219
epoch: 4 step: 378, loss is 1.6413899660110474
epoch: 4 step: 379, loss is 1.5933438539505005
epoch: 4 step: 380, loss is 1.7512990236282349
epoch: 4 step: 381, loss is 1.5685606002807617
epoch: 4 step: 382, loss is 1.6456449031829834
epoch: 4 step: 383, loss is 1.5981110334396362
epoch: 4 step: 384, loss is 1.6993358135223389
epoch: 4 step: 385, loss is 1.6400423049926758
epoch: 4 step: 386, loss is 1.6877739429473877
epoch: 4 step: 387, loss is 1.7578630447387695
epoch: 4 step: 388, loss is 1.6703307628631592
epoch: 4 step: 389, loss is 1.64776611328125
epoch: 4 step: 390, loss is 1.6677515506744385
Train epoch time: 135002.970 ms, per step time: 346.161 ms
epoch: 5 step: 1, loss is 1.6414284706115723
epoch: 5 step: 2, loss is 1.6133499145507812
epoch: 5 step: 3, loss is 1.6073421239852905
epoch: 5 step: 4, loss is 1.5657422542572021
epoch: 5 step: 5, loss is 1.5712566375732422
epoch: 5 step: 6, loss is 1.7336235046386719
epoch: 5 step: 7, loss is 1.6461542844772339
epoch: 5 step: 8, loss is 1.5539560317993164
epoch: 5 step: 9, loss is 1.6430338621139526
epoch: 5 step: 10, loss is 1.5809056758880615
epoch: 5 step: 11, loss is 1.5767440795898438
epoch: 5 step: 12, loss is 1.4443682432174683
epoch: 5 step: 13, loss is 1.7491919994354248
epoch: 5 step: 14, loss is 1.5007736682891846
epoch: 5 step: 15, loss is 1.6383237838745117
epoch: 5 step: 16, loss is 1.595729947090149
epoch: 5 step: 17, loss is 1.6138263940811157
epoch: 5 step: 18, loss is 1.582689642906189
epoch: 5 step: 19, loss is 1.5651130676269531
epoch: 5 step: 20, loss is 1.591292381286621
epoch: 5 step: 21, loss is 1.576625943183899
epoch: 5 step: 22, loss is 1.6917927265167236
epoch: 5 step: 23, loss is 1.6040232181549072
epoch: 5 step: 24, loss is 1.546478271484375
epoch: 5 step: 25, loss is 1.6349976062774658
epoch: 5 step: 26, loss is 1.6656715869903564
epoch: 5 step: 27, loss is 1.6220475435256958
epoch: 5 step: 28, loss is 1.661820650100708
epoch: 5 step: 29, loss is 1.543447494506836
epoch: 5 step: 30, loss is 1.5551376342773438
epoch: 5 step: 31, loss is 1.5874927043914795
epoch: 5 step: 32, loss is 1.66179621219635
epoch: 5 step: 33, loss is 1.6630723476409912
epoch: 5 step: 34, loss is 1.6434706449508667
epoch: 5 step: 35, loss is 1.6164604425430298
epoch: 5 step: 36, loss is 1.5964670181274414
epoch: 5 step: 37, loss is 1.5660440921783447
epoch: 5 step: 38, loss is 1.6367616653442383
epoch: 5 step: 39, loss is 1.6313118934631348
epoch: 5 step: 40, loss is 1.5402803421020508
epoch: 5 step: 41, loss is 1.6043792963027954
epoch: 5 step: 42, loss is 1.5902783870697021
epoch: 5 step: 43, loss is 1.6030004024505615
epoch: 5 step: 44, loss is 1.51835298538208
epoch: 5 step: 45, loss is 1.6403353214263916
epoch: 5 step: 46, loss is 1.5426907539367676
epoch: 5 step: 47, loss is 1.6529593467712402
epoch: 5 step: 48, loss is 1.7222962379455566
epoch: 5 step: 49, loss is 1.5422757863998413
epoch: 5 step: 50, loss is 1.6748955249786377
epoch: 5 step: 51, loss is 1.7007254362106323
epoch: 5 step: 52, loss is 1.5466275215148926
epoch: 5 step: 53, loss is 1.600034236907959
epoch: 5 step: 54, loss is 1.6911506652832031
epoch: 5 step: 55, loss is 1.5752044916152954
epoch: 5 step: 56, loss is 1.5590535402297974
epoch: 5 step: 57, loss is 1.675789713859558
epoch: 5 step: 58, loss is 1.4998986721038818
epoch: 5 step: 59, loss is 1.5317859649658203
epoch: 5 step: 60, loss is 1.6225993633270264
epoch: 5 step: 61, loss is 1.5759437084197998
epoch: 5 step: 62, loss is 1.5726115703582764
epoch: 5 step: 63, loss is 1.6666594743728638
epoch: 5 step: 64, loss is 1.636668086051941
epoch: 5 step: 65, loss is 1.46066415309906
epoch: 5 step: 66, loss is 1.6098400354385376
epoch: 5 step: 67, loss is 1.494192361831665
epoch: 5 step: 68, loss is 1.6731780767440796
epoch: 5 step: 69, loss is 1.5911043882369995
epoch: 5 step: 70, loss is 1.640868067741394
epoch: 5 step: 71, loss is 1.6505470275878906
epoch: 5 step: 72, loss is 1.5993077754974365
epoch: 5 step: 73, loss is 1.58303701877594
epoch: 5 step: 74, loss is 1.567945957183838
epoch: 5 step: 75, loss is 1.6461838483810425
epoch: 5 step: 76, loss is 1.5701122283935547
epoch: 5 step: 77, loss is 1.578521490097046
epoch: 5 step: 78, loss is 1.5445590019226074
epoch: 5 step: 79, loss is 1.6084973812103271
epoch: 5 step: 80, loss is 1.5336830615997314
epoch: 5 step: 81, loss is 1.5514973402023315
epoch: 5 step: 82, loss is 1.5627862215042114
epoch: 5 step: 83, loss is 1.4522783756256104
epoch: 5 step: 84, loss is 1.6608734130859375
epoch: 5 step: 85, loss is 1.5586647987365723
epoch: 5 step: 86, loss is 1.4971017837524414
epoch: 5 step: 87, loss is 1.6389094591140747
epoch: 5 step: 88, loss is 1.649534821510315
epoch: 5 step: 89, loss is 1.7511341571807861
epoch: 5 step: 90, loss is 1.7455153465270996
epoch: 5 step: 91, loss is 1.7708572149276733
epoch: 5 step: 92, loss is 1.633859395980835
epoch: 5 step: 93, loss is 1.5422446727752686
epoch: 5 step: 94, loss is 1.6506986618041992
epoch: 5 step: 95, loss is 1.6532657146453857
epoch: 5 step: 96, loss is 1.6974544525146484
epoch: 5 step: 97, loss is 1.5629090070724487
epoch: 5 step: 98, loss is 1.6426180601119995
epoch: 5 step: 99, loss is 1.6129649877548218
epoch: 5 step: 100, loss is 1.6500146389007568
epoch: 5 step: 101, loss is 1.6136314868927002
epoch: 5 step: 102, loss is 1.6230109930038452
epoch: 5 step: 103, loss is 1.5851149559020996
epoch: 5 step: 104, loss is 1.5509576797485352
epoch: 5 step: 105, loss is 1.648830771446228
epoch: 5 step: 106, loss is 1.5736055374145508
epoch: 5 step: 107, loss is 1.563952088356018
epoch: 5 step: 108, loss is 1.6155402660369873
epoch: 5 step: 109, loss is 1.601898431777954
epoch: 5 step: 110, loss is 1.6058074235916138
epoch: 5 step: 111, loss is 1.6439316272735596
epoch: 5 step: 112, loss is 1.7084676027297974
epoch: 5 step: 113, loss is 1.5610415935516357
epoch: 5 step: 114, loss is 1.6334338188171387
epoch: 5 step: 115, loss is 1.5861258506774902
epoch: 5 step: 116, loss is 1.4557653665542603
epoch: 5 step: 117, loss is 1.6719934940338135
epoch: 5 step: 118, loss is 1.7527364492416382
epoch: 5 step: 119, loss is 1.5504150390625
epoch: 5 step: 120, loss is 1.5088813304901123
epoch: 5 step: 121, loss is 1.5114814043045044
epoch: 5 step: 122, loss is 1.564155101776123
epoch: 5 step: 123, loss is 1.6420687437057495
epoch: 5 step: 124, loss is 1.721785306930542
epoch: 5 step: 125, loss is 1.6231032609939575
epoch: 5 step: 126, loss is 1.6007492542266846
epoch: 5 step: 127, loss is 1.5860202312469482
epoch: 5 step: 128, loss is 1.6081591844558716
epoch: 5 step: 129, loss is 1.4897911548614502
epoch: 5 step: 130, loss is 1.6453657150268555
epoch: 5 step: 131, loss is 1.7042666673660278
epoch: 5 step: 132, loss is 1.6355501413345337
epoch: 5 step: 133, loss is 1.6283111572265625
epoch: 5 step: 134, loss is 1.5827581882476807
epoch: 5 step: 135, loss is 1.590122938156128
epoch: 5 step: 136, loss is 1.5546517372131348
epoch: 5 step: 137, loss is 1.451050877571106
epoch: 5 step: 138, loss is 1.8012257814407349
epoch: 5 step: 139, loss is 1.6254920959472656
epoch: 5 step: 140, loss is 1.5159741640090942
epoch: 5 step: 141, loss is 1.5933189392089844
epoch: 5 step: 142, loss is 1.728009819984436
epoch: 5 step: 143, loss is 1.5668749809265137
epoch: 5 step: 144, loss is 1.7010149955749512
epoch: 5 step: 145, loss is 1.6198683977127075
epoch: 5 step: 146, loss is 1.6176905632019043
epoch: 5 step: 147, loss is 1.5649861097335815
epoch: 5 step: 148, loss is 1.627281665802002
epoch: 5 step: 149, loss is 1.417607069015503
epoch: 5 step: 150, loss is 1.7324036359786987
epoch: 5 step: 151, loss is 1.648249864578247
epoch: 5 step: 152, loss is 1.5773850679397583
epoch: 5 step: 153, loss is 1.5680890083312988
epoch: 5 step: 154, loss is 1.5867016315460205
epoch: 5 step: 155, loss is 1.630353569984436
epoch: 5 step: 156, loss is 1.6103355884552002
epoch: 5 step: 157, loss is 1.620666265487671
epoch: 5 step: 158, loss is 1.5707900524139404
epoch: 5 step: 159, loss is 1.5001070499420166
epoch: 5 step: 160, loss is 1.52289617061615
epoch: 5 step: 161, loss is 1.5643471479415894
epoch: 5 step: 162, loss is 1.5430119037628174
epoch: 5 step: 163, loss is 1.579246163368225
epoch: 5 step: 164, loss is 1.6190059185028076
epoch: 5 step: 165, loss is 1.5879192352294922
epoch: 5 step: 166, loss is 1.6042240858078003
epoch: 5 step: 167, loss is 1.5793931484222412
epoch: 5 step: 168, loss is 1.5850214958190918
epoch: 5 step: 169, loss is 1.4722135066986084
epoch: 5 step: 170, loss is 1.4814708232879639
epoch: 5 step: 171, loss is 1.6623531579971313
epoch: 5 step: 172, loss is 1.7225098609924316
epoch: 5 step: 173, loss is 1.5727636814117432
epoch: 5 step: 174, loss is 1.5415968894958496
epoch: 5 step: 175, loss is 1.5758564472198486
epoch: 5 step: 176, loss is 1.7106969356536865
epoch: 5 step: 177, loss is 1.4958528280258179
epoch: 5 step: 178, loss is 1.6774400472640991
epoch: 5 step: 179, loss is 1.69835364818573
epoch: 5 step: 180, loss is 1.6368367671966553
epoch: 5 step: 181, loss is 1.691902756690979
epoch: 5 step: 182, loss is 1.621715784072876
epoch: 5 step: 183, loss is 1.668091058731079
epoch: 5 step: 184, loss is 1.5842854976654053
epoch: 5 step: 185, loss is 1.6971570253372192
epoch: 5 step: 186, loss is 1.6501247882843018
epoch: 5 step: 187, loss is 1.6283658742904663
epoch: 5 step: 188, loss is 1.61997652053833
epoch: 5 step: 189, loss is 1.706433653831482
epoch: 5 step: 190, loss is 1.5875580310821533
epoch: 5 step: 191, loss is 1.585848093032837
epoch: 5 step: 192, loss is 1.6266297101974487
epoch: 5 step: 193, loss is 1.6387460231781006
epoch: 5 step: 194, loss is 1.648606538772583
epoch: 5 step: 195, loss is 1.5681453943252563
epoch: 5 step: 196, loss is 1.6784400939941406
epoch: 5 step: 197, loss is 1.6807732582092285
epoch: 5 step: 198, loss is 1.5594044923782349
epoch: 5 step: 199, loss is 1.6462832689285278
epoch: 5 step: 200, loss is 1.5839227437973022
epoch: 5 step: 201, loss is 1.704714298248291
epoch: 5 step: 202, loss is 1.539453387260437
epoch: 5 step: 203, loss is 1.6192526817321777
epoch: 5 step: 204, loss is 1.714280366897583
epoch: 5 step: 205, loss is 1.6327862739562988
epoch: 5 step: 206, loss is 1.5994987487792969
epoch: 5 step: 207, loss is 1.6471309661865234
epoch: 5 step: 208, loss is 1.5992558002471924
epoch: 5 step: 209, loss is 1.5258523225784302
epoch: 5 step: 210, loss is 1.589914321899414
epoch: 5 step: 211, loss is 1.569895625114441
epoch: 5 step: 212, loss is 1.5933986902236938
epoch: 5 step: 213, loss is 1.467947244644165
epoch: 5 step: 214, loss is 1.5026377439498901
epoch: 5 step: 215, loss is 1.5750608444213867
epoch: 5 step: 216, loss is 1.5915688276290894
epoch: 5 step: 217, loss is 1.6002682447433472
epoch: 5 step: 218, loss is 1.6803622245788574
epoch: 5 step: 219, loss is 1.577143907546997
epoch: 5 step: 220, loss is 1.6161741018295288
epoch: 5 step: 221, loss is 1.5676504373550415
epoch: 5 step: 222, loss is 1.5366853475570679
epoch: 5 step: 223, loss is 1.6090028285980225
epoch: 5 step: 224, loss is 1.5079410076141357
epoch: 5 step: 225, loss is 1.5618683099746704
epoch: 5 step: 226, loss is 1.638017177581787
epoch: 5 step: 227, loss is 1.5741792917251587
epoch: 5 step: 228, loss is 1.5870435237884521
epoch: 5 step: 229, loss is 1.6679387092590332
epoch: 5 step: 230, loss is 1.4960081577301025
epoch: 5 step: 231, loss is 1.538419246673584
epoch: 5 step: 232, loss is 1.5862116813659668
epoch: 5 step: 233, loss is 1.5973320007324219
epoch: 5 step: 234, loss is 1.5232126712799072
epoch: 5 step: 235, loss is 1.5877140760421753
epoch: 5 step: 236, loss is 1.6911592483520508
epoch: 5 step: 237, loss is 1.6973609924316406
epoch: 5 step: 238, loss is 1.5384098291397095
epoch: 5 step: 239, loss is 1.564143180847168
epoch: 5 step: 240, loss is 1.513045072555542
epoch: 5 step: 241, loss is 1.6131844520568848
epoch: 5 step: 242, loss is 1.494818925857544
epoch: 5 step: 243, loss is 1.523600697517395
epoch: 5 step: 244, loss is 1.5096313953399658
epoch: 5 step: 245, loss is 1.6016907691955566
epoch: 5 step: 246, loss is 1.4984893798828125
epoch: 5 step: 247, loss is 1.5625672340393066
epoch: 5 step: 248, loss is 1.5749375820159912
epoch: 5 step: 249, loss is 1.5304640531539917
epoch: 5 step: 250, loss is 1.5693752765655518
epoch: 5 step: 251, loss is 1.705450415611267
epoch: 5 step: 252, loss is 1.5074079036712646
epoch: 5 step: 253, loss is 1.5789763927459717
epoch: 5 step: 254, loss is 1.530361294746399
epoch: 5 step: 255, loss is 1.7976694107055664
epoch: 5 step: 256, loss is 1.6102001667022705
epoch: 5 step: 257, loss is 1.4735286235809326
epoch: 5 step: 258, loss is 1.6302123069763184
epoch: 5 step: 259, loss is 1.5966647863388062
epoch: 5 step: 260, loss is 1.6233352422714233
epoch: 5 step: 261, loss is 1.5608391761779785
epoch: 5 step: 262, loss is 1.606345534324646
epoch: 5 step: 263, loss is 1.5095794200897217
epoch: 5 step: 264, loss is 1.509196162223816
epoch: 5 step: 265, loss is 1.563521385192871
epoch: 5 step: 266, loss is 1.5619696378707886
epoch: 5 step: 267, loss is 1.6058316230773926
epoch: 5 step: 268, loss is 1.6006145477294922
epoch: 5 step: 269, loss is 1.53438401222229
epoch: 5 step: 270, loss is 1.671415090560913
epoch: 5 step: 271, loss is 1.5893216133117676
epoch: 5 step: 272, loss is 1.501842975616455
epoch: 5 step: 273, loss is 1.7832317352294922
epoch: 5 step: 274, loss is 1.4561469554901123
epoch: 5 step: 275, loss is 1.5942699909210205
epoch: 5 step: 276, loss is 1.4835069179534912
epoch: 5 step: 277, loss is 1.5552664995193481
epoch: 5 step: 278, loss is 1.538780689239502
epoch: 5 step: 279, loss is 1.523700475692749
epoch: 5 step: 280, loss is 1.5699076652526855
epoch: 5 step: 281, loss is 1.566076397895813
epoch: 5 step: 282, loss is 1.595410943031311
epoch: 5 step: 283, loss is 1.5802452564239502
epoch: 5 step: 284, loss is 1.5255502462387085
epoch: 5 step: 285, loss is 1.6844873428344727
epoch: 5 step: 286, loss is 1.5715992450714111
epoch: 5 step: 287, loss is 1.529794454574585
epoch: 5 step: 288, loss is 1.5810072422027588
epoch: 5 step: 289, loss is 1.7997422218322754
epoch: 5 step: 290, loss is 1.5457688570022583
epoch: 5 step: 291, loss is 1.592775583267212
epoch: 5 step: 292, loss is 1.6729099750518799
epoch: 5 step: 293, loss is 1.4456005096435547
epoch: 5 step: 294, loss is 1.682685375213623
epoch: 5 step: 295, loss is 1.559996485710144
epoch: 5 step: 296, loss is 1.587217092514038
epoch: 5 step: 297, loss is 1.4631487131118774
epoch: 5 step: 298, loss is 1.6209064722061157
epoch: 5 step: 299, loss is 1.6023503541946411
epoch: 5 step: 300, loss is 1.5702555179595947
epoch: 5 step: 301, loss is 1.656314730644226
epoch: 5 step: 302, loss is 1.6821871995925903
epoch: 5 step: 303, loss is 1.6096503734588623
epoch: 5 step: 304, loss is 1.6399579048156738
epoch: 5 step: 305, loss is 1.6309127807617188
epoch: 5 step: 306, loss is 1.5221701860427856
epoch: 5 step: 307, loss is 1.5615696907043457
epoch: 5 step: 308, loss is 1.5472203493118286
epoch: 5 step: 309, loss is 1.521003007888794
epoch: 5 step: 310, loss is 1.612450361251831
epoch: 5 step: 311, loss is 1.5359975099563599
epoch: 5 step: 312, loss is 1.631341814994812
epoch: 5 step: 313, loss is 1.575690746307373
epoch: 5 step: 314, loss is 1.5513726472854614
epoch: 5 step: 315, loss is 1.5994372367858887
epoch: 5 step: 316, loss is 1.533798098564148
epoch: 5 step: 317, loss is 1.5343968868255615
epoch: 5 step: 318, loss is 1.4302661418914795
epoch: 5 step: 319, loss is 1.5225865840911865
epoch: 5 step: 320, loss is 1.624767541885376
epoch: 5 step: 321, loss is 1.501556396484375
epoch: 5 step: 322, loss is 1.5757759809494019
epoch: 5 step: 324, loss is 1.578230857849121
epoch: 5 step: 325, loss is 1.729386568069458
epoch: 5 step: 326, loss is 1.5106805562973022
epoch: 5 step: 327, loss is 1.6114463806152344
epoch: 5 step: 328, loss is 1.5462558269500732
epoch: 5 step: 329, loss is 1.5644663572311401
epoch: 5 step: 330, loss is 1.6486071348190308
epoch: 5 step: 331, loss is 1.6583094596862793
epoch: 5 step: 332, loss is 1.4188274145126343
epoch: 5 step: 333, loss is 1.485639214515686
epoch: 5 step: 334, loss is 1.558194637298584
epoch: 5 step: 335, loss is 1.7039393186569214
epoch: 5 step: 336, loss is 1.7015631198883057
epoch: 5 step: 337, loss is 1.5780913829803467
epoch: 5 step: 338, loss is 1.5033587217330933
epoch: 5 step: 339, loss is 1.537819743156433
epoch: 5 step: 340, loss is 1.8121535778045654
epoch: 5 step: 341, loss is 1.5806516408920288
epoch: 5 step: 342, loss is 1.5796537399291992
epoch: 5 step: 343, loss is 1.6168782711029053
epoch: 5 step: 344, loss is 1.641409158706665
epoch: 5 step: 345, loss is 1.5133163928985596
epoch: 5 step: 346, loss is 1.6856012344360352
epoch: 5 step: 347, loss is 1.7320266962051392
epoch: 5 step: 348, loss is 1.5903704166412354
epoch: 5 step: 349, loss is 1.7494301795959473
epoch: 5 step: 350, loss is 1.6118680238723755
epoch: 5 step: 351, loss is 1.5314308404922485
epoch: 5 step: 352, loss is 1.556494951248169
epoch: 5 step: 353, loss is 1.714756727218628
epoch: 5 step: 354, loss is 1.5310161113739014
epoch: 5 step: 355, loss is 1.7872475385665894
epoch: 5 step: 356, loss is 1.687711238861084
epoch: 5 step: 357, loss is 1.7892705202102661
epoch: 5 step: 358, loss is 1.5611436367034912
epoch: 5 step: 359, loss is 1.5671555995941162
epoch: 5 step: 360, loss is 1.6102842092514038
epoch: 5 step: 361, loss is 1.5782215595245361
epoch: 5 step: 362, loss is 1.6032049655914307
epoch: 5 step: 363, loss is 1.6011167764663696
epoch: 5 step: 364, loss is 1.4326987266540527
epoch: 5 step: 365, loss is 1.5786182880401611
epoch: 5 step: 366, loss is 1.61308753490448
epoch: 5 step: 367, loss is 1.5704137086868286
epoch: 5 step: 368, loss is 1.5738422870635986
epoch: 5 step: 369, loss is 1.6481573581695557
epoch: 5 step: 370, loss is 1.5465118885040283
epoch: 5 step: 371, loss is 1.5927613973617554
epoch: 5 step: 372, loss is 1.5779811143875122
epoch: 5 step: 373, loss is 1.6706527471542358
epoch: 5 step: 374, loss is 1.658896565437317
epoch: 5 step: 375, loss is 1.6150617599487305
epoch: 5 step: 376, loss is 1.706315517425537
epoch: 5 step: 377, loss is 1.5464962720870972
epoch: 5 step: 378, loss is 1.5608888864517212
epoch: 5 step: 379, loss is 1.610548973083496
epoch: 5 step: 380, loss is 1.610349416732788
epoch: 5 step: 381, loss is 1.6361488103866577
epoch: 5 step: 382, loss is 1.5933778285980225
epoch: 5 step: 383, loss is 1.5234771966934204
epoch: 5 step: 384, loss is 1.636739730834961
epoch: 5 step: 385, loss is 1.6566827297210693
epoch: 5 step: 386, loss is 1.5916155576705933
epoch: 5 step: 387, loss is 1.6105828285217285
epoch: 5 step: 388, loss is 1.483259916305542
epoch: 5 step: 389, loss is 1.6239800453186035
epoch: 5 step: 390, loss is 1.6295044422149658
Train epoch time: 141315.200 ms, per step time: 362.347 ms
total time:0h 14m 38s
============== Train Success ==============
2024-07-26 13:10:06 littlewoodhu
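
The per-step log above is long, so a per-epoch summary can make the trend easier to read. The following is a minimal sketch, assuming the log has been saved as text in the same `epoch: N step: M, loss is X` format shown above; the helper name `mean_loss_per_epoch` and the `sample` string are illustrative and not part of the original notebook.

```python
import re
from collections import defaultdict

# Matches lines such as "epoch: 4 step: 389, loss is 1.64776611328125".
# Timing lines ("Train epoch time: ...") do not match and are skipped.
LOG_PATTERN = re.compile(r"epoch:\s*(\d+)\s+step:\s*(\d+),\s*loss is\s*([0-9.eE+-]+)")


def mean_loss_per_epoch(log_text):
    """Return {epoch: mean loss} for every epoch found in the log text."""
    losses = defaultdict(list)
    for line in log_text.splitlines():
        match = LOG_PATTERN.search(line)
        if match:
            epoch, _step, loss = match.groups()
            losses[int(epoch)].append(float(loss))
    return {epoch: sum(vals) / len(vals) for epoch, vals in sorted(losses.items())}


# A few lines copied from the log above, used only as a small demonstration.
sample = """\
epoch: 4 step: 389, loss is 1.64776611328125
epoch: 4 step: 390, loss is 1.6677515506744385
Train epoch time: 135002.970 ms, per step time: 346.161 ms
epoch: 5 step: 1, loss is 1.6414284706115723
epoch: 5 step: 2, loss is 1.6133499145507812
"""

for epoch, mean_loss in mean_loss_per_epoch(sample).items():
    print(f"epoch {epoch}: mean loss {mean_loss:.4f}")
```

Running the same function over the full log would give one averaged value per epoch, which is less noisy than comparing individual steps.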

The trained model is saved in the current directory as shufflenetv1-5_390.ckpt and is used for evaluation.


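Model Evaluation
Evaluate the model on the CIFAR-10 test set.

After setting the path of the checkpoint to evaluate, load the dataset, set Top-1 and Top-5 accuracy as the evaluation metrics, and finally call the model.eval() interface to evaluate the model.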
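
To make the two metrics concrete, here is a minimal pure-Python sketch of how Top-1 and Top-5 accuracy can be computed from per-class scores. It is not the MindSpore metric implementation used by model.eval(); the function name `top_k_accuracy` and the scores and labels below are hypothetical values chosen only for illustration.

```python
def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by score, highest first, truncated to the top k.
        top_k = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        hits += label in top_k
    return hits / len(labels)


# Hypothetical scores for 3 samples over 10 classes (CIFAR-10 has 10 classes).
scores = [
    [0.1, 9.0, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9],  # best class: 1
    [5.0, 0.1, 0.2, 4.0, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8],  # best class: 0, second: 3
    [3.0, 2.9, 2.8, 2.7, 2.6, 2.5, 2.4, 2.3, 2.2, 0.1],  # class 9 scores lowest
]
labels = [1, 3, 9]

print("Top-1 accuracy:", top_k_accuracy(scores, labels, 1))
print("Top-5 accuracy:", top_k_accuracy(scores, labels, 5))
```

In this toy data the first sample counts for both metrics, the second only for Top-5 (its true class has the second-highest score), and the third for neither, so Top-5 accuracy is never lower than Top-1.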

2024-07-26 13:12:47 littlewoodhu
model size is  2.0x
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.175.441 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.176.033 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.178.767 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.182.675 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.184.922 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.186.321 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.188.563 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.190.344 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.191.313 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.193.074 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.194.901 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.197.086 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.198.914 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.203.316 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.203.821 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.209.014 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.209.532 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.210.503 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.213.997 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.214.081 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.216.718 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.218.101 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.221.238 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.223.003 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.226.538 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.228.291 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.229.274 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.234.036 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.235.840 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.236.370 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.238.174 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.240.785 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
result:{'Loss': 1.57918219994276, 'Top_1_Acc': 0.5193309294871795, 'Top_5_Acc': 0.9412059294871795}, ckpt:'./shufflenetv1-5_390.ckpt', time: 0h 0m 59s
2024-07-26 13:13:48 littlewoodhu
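
The `Top_1_Acc` and `Top_5_Acc` values in the result line above are top-k accuracies: a sample counts as correct when its true label is among the k highest-scoring classes. The following is a minimal pure-Python sketch of that metric. The function name `top_k_accuracy` and the toy scores are illustrative assumptions, not the notebook's evaluation code.

```python
def top_k_accuracy(logits, labels, k=1):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    hits = 0
    for scores, label in zip(logits, labels):
        # Indices of the k largest scores for this sample.
        top_k = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
        hits += label in top_k
    return hits / len(labels)


# Toy data: 4 samples, 5 classes (illustrative values, not model outputs).
toy_logits = [
    [0.1, 2.0, 0.3, 0.0, 0.5],
    [1.5, 0.2, 1.4, 0.1, 0.0],
    [0.0, 0.1, 0.2, 0.3, 3.0],
    [0.9, 0.8, 0.7, 0.6, 0.5],
]
toy_labels = [1, 2, 0, 2]

top1 = top_k_accuracy(toy_logits, toy_labels, k=1)
top3 = top_k_accuracy(toy_logits, toy_labels, k=3)
print(f"top-1: {top1:.2f}, top-3: {top3:.2f}")
```

With 10 classes, as in CIFAR-10, top-5 accuracy is a much looser criterion than top-1, which is consistent with the gap between the two numbers reported above.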

Model Prediction
Run the trained model on the CIFAR-10 test set and visualize the predictions.
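
The prediction step can be sketched as follows: take the per-class scores for each test image, pick the highest-scoring class, and map the index to a CIFAR-10 class name for display. This is a minimal pure-Python sketch under stated assumptions. The helper names `predict_labels` and `describe_predictions` and the toy scores are hypothetical. In the notebook the scores would come from the trained network's output on a test batch.

```python
# The ten CIFAR-10 classes, in label-index order.
CIFAR10_CLASSES = [
    "airplane", "automobile", "bird", "cat", "deer",
    "dog", "frog", "horse", "ship", "truck",
]


def predict_labels(logits):
    """Return the index of the highest-scoring class for each sample."""
    return [max(range(len(scores)), key=scores.__getitem__) for scores in logits]


def describe_predictions(logits, true_labels):
    """Pair each predicted class name with the true class name and a correctness flag."""
    rows = []
    for pred, true in zip(predict_labels(logits), true_labels):
        rows.append((CIFAR10_CLASSES[pred], CIFAR10_CLASSES[true], pred == true))
    return rows


# Toy scores for two images (illustrative values, not real model outputs).
toy_scores = [
    [0.1, 0.0, 0.2, 2.5, 0.3, 1.1, 0.0, 0.1, 0.2, 0.0],  # highest score at index 3
    [0.9, 0.1, 0.0, 0.2, 0.1, 0.0, 0.3, 0.1, 1.7, 0.4],  # highest score at index 8
]
toy_true = [3, 0]

rows = describe_predictions(toy_scores, toy_true)
for predicted, actual, correct in rows:
    print(f"predicted: {predicted:10s} actual: {actual:10s} correct: {correct}")
```

When plotting, the correctness flag can drive the title color of each image (for example one color for correct predictions and another for wrong ones), which makes misclassifications easy to spot.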

2024-07-26 13:17:47 littlewoodhu
model size is  2.0x
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.143.404 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/1681751341.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.143.520 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/1681751341.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.147.209 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.147.744 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.149.133 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/778396864.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.149.664 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.153.573 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.155.817 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.157.214 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.159.441 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.161.215 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.162.186 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.163.952 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.165.782 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.167.964 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.169.789 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.172.813 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/778396864.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.173.357 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.173.857 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.179.055 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.179.561 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.180.587 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.184.078 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.184.164 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.186.834 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.188.188 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.189.530 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/778396864.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.190.495 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.192.257 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.195.771 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.197.553 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.198.532 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.203.338 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.205.223 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.205.756 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.207.547 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.210.215 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]

2024-07-26 13:18:28 littlewoodhu
 
