不支持GPU设备静态图模式运行
ShuffleNet网络介绍
ShuffleNetV1是旷视科技提出的一种计算高效的CNN模型,和MobileNet, SqueezeNet等一样主要应用在移动端,所以模型的设计目标就是利用有限的计算资源来达到最好的模型精度。ShuffleNetV1的设计核心是引入了两种操作:Pointwise Group Convolution和Channel Shuffle,这在保持精度的同时大大降低了模型的计算量。因此,ShuffleNetV1和MobileNet类似,都是通过设计更高效的网络结构来实现模型的压缩和加速。
了解ShuffleNet更多详细内容,详见论文ShuffleNet。
如下图所示,ShuffleNet在保持不低的准确率的前提下,将参数量几乎降低到了最小,因此其运算速度较快,单位参数量对模型准确率的贡献非常高。
模型架构
ShuffleNet最显著的特点在于对不同通道进行重排来解决Group Convolution带来的弊端。通过对ResNet的Bottleneck单元进行改进,在较小的计算量的情况下达到了较高的准确率。
Pointwise Group Convolution
Group Convolution(分组卷积)原理如下图所示,相比于普通的卷积操作,分组卷积的情况下,每一组的卷积核大小为in_channels/g*k*k,一共有g组,所有组共有(in_channels/g*k*k)*out_channels个参数,是正常卷积参数的1/g。分组卷积中,每个卷积核只处理输入特征图的一部分通道,其优点在于参数量会有所降低,但输出通道数仍等于卷积核的数量。
Depthwise Convolution(深度可分离卷积)将组数g分为和输入通道相等的in_channels,然后对每一个in_channels做卷积操作,每个卷积核只处理一个通道,记卷积核大小为1*k*k,则卷积核参数量为:in_channels*k*k,得到的feature maps通道数与输入通道数相等;
Pointwise Group Convolution(逐点分组卷积)在分组卷积的基础上,令每一组的卷积核大小为 1×1,卷积核参数量为(in_channels/g*1*1)*out_channels。
Channel Shuffle
Group Convolution的弊端在于不同组别的通道无法进行信息交流,堆积GConv层后一个问题是不同组之间的特征图是不通信的,这就好像分成了g个互不相干的道路,每一个人各走各的,这可能会降低网络的特征提取能力。这也是Xception,MobileNet等网络采用密集的1x1卷积(Dense Pointwise Convolution)的原因。
为了解决不同组别通道“近亲繁殖”的问题,ShuffleNet优化了大量密集的1x1卷积(在使用的情况下计算量占用率达到了惊人的93.4%),引入Channel Shuffle机制(通道重排)。这项操作直观上表现为将不同分组通道均匀分散重组,使网络在下一层能处理不同组别通道的信息。
如下图所示,对于g组,每组有n个通道的特征图,首先reshape成g行n列的矩阵,再将矩阵转置成n行g列,最后进行flatten操作,得到新的排列。这些操作都是可微分可导的且计算简单,在解决了信息交互的同时符合了ShuffleNet轻量级网络设计的轻量特征。
ShuffleNet模块
如下图所示,ShuffleNet对ResNet中的Bottleneck结构进行由(a)到(b), (c)的更改:
将开始和最后的 1×1卷积模块(降维、升维)改成Point Wise Group Convolution;
为了进行不同通道的信息交流,再降维之后进行Channel Shuffle;
降采样模块中, 3×3Depth Wise Convolution的步长设置为2,长宽降为原来的一半,因此shortcut中采用步长为2的 3×3平均池化,并把相加改成拼接。
构建ShuffleNet网络
ShuffleNet网络结构如下图所示,以输入图像 224×224组数3(g = 3)为例,首先通过数量24,卷积核大小为 3×3,stride为2的卷积层,输出特征图大小为 112×112,channel为24;然后通过stride为2的最大池化层,输出特征图大小为 56×56,channel数不变;再堆叠3个ShuffleNet模块(Stage2, Stage3, Stage4),三个模块分别重复4次、8次、4次,其中每个模块开始先经过一次下采样模块(上图(c)),使特征图长宽减半,channel翻倍(Stage2的下采样模块除外,将channel数从24变为240);随后经过全局平均池化,输出大小为 1×1×960,再经过全连接层和softmax,得到分类概率。
模型训练和评估
采用CIFAR-10数据集对ShuffleNet进行预训练。
训练集准备与加载
采用CIFAR-10数据集对ShuffleNet进行预训练。CIFAR-10共有60000张32*32的彩色图像,均匀地分为10个类别,其中50000张图片作为训练集,10000图片作为测试集。如下示例使用mindspore.dataset.Cifar10Dataset接口下载并加载CIFAR-10的训练集。目前仅支持二进制版本(CIFAR-10 binary version)。
模型训练
本节用随机初始化的参数做预训练。首先调用ShuffleNetV1定义网络,参数量选择"2.0x",并定义损失函数为交叉熵损失,学习率经过4轮的warmup后采用余弦退火,优化器采用Momentum。最后用train.model中的Model接口将模型、损失函数、优化器封装在model中,并用model.train()对网络进行训练。将ModelCheckpoint、CheckpointConfig、TimeMonitor和LossMonitor传入回调函数中,将会打印训练的轮数、损失和时间,并将ckpt文件保存在当前目录下。
2024-07-26 12:55:25 littlewoodhu model size is 2.0x ============== Starting Training ============== epoch: 1 step: 1, loss is 2.6074867248535156 epoch: 1 step: 2, loss is 2.5673539638519287 epoch: 1 step: 3, loss is 2.445343255996704 epoch: 1 step: 4, loss is 2.3593506813049316 epoch: 1 step: 5, loss is 2.4094467163085938 epoch: 1 step: 6, loss is 2.481452465057373 epoch: 1 step: 7, loss is 2.458271026611328 epoch: 1 step: 8, loss is 2.5503995418548584 epoch: 1 step: 9, loss is 2.37678599357605 epoch: 1 step: 10, loss is 2.334716796875 epoch: 1 step: 11, loss is 2.284836769104004 epoch: 1 step: 12, loss is 2.3663182258605957 epoch: 1 step: 13, loss is 2.412055253982544 epoch: 1 step: 14, loss is 2.387876510620117 epoch: 1 step: 15, loss is 2.3900086879730225 epoch: 1 step: 16, loss is 2.3132822513580322 epoch: 1 step: 17, loss is 2.306142807006836 epoch: 1 step: 18, loss is 2.26301908493042 epoch: 1 step: 19, loss is 2.31077241897583 epoch: 1 step: 20, loss is 2.296069383621216 epoch: 1 step: 21, loss is 2.3578920364379883 epoch: 1 step: 22, loss is 2.250715732574463 epoch: 1 step: 23, loss is 2.264376401901245 epoch: 1 step: 24, loss is 2.2397103309631348 epoch: 1 step: 25, loss is 2.2723278999328613 epoch: 1 step: 26, loss is 2.280475616455078 epoch: 1 step: 27, loss is 2.2611913681030273 epoch: 1 step: 28, loss is 2.201805353164673 epoch: 1 step: 29, loss is 2.225273609161377 epoch: 1 step: 30, loss is 2.1677911281585693 epoch: 1 step: 31, loss is 2.2753074169158936 epoch: 1 step: 32, loss is 2.3028290271759033 epoch: 1 step: 33, loss is 2.2406489849090576 epoch: 1 step: 34, loss is 2.1358611583709717 epoch: 1 step: 35, loss is 2.194544553756714 epoch: 1 step: 36, loss is 2.2103586196899414 epoch: 1 step: 37, loss is 2.2556700706481934 epoch: 1 step: 38, loss is 2.1384105682373047 epoch: 1 step: 39, loss is 2.235822916030884 epoch: 1 step: 40, loss is 2.2255942821502686 epoch: 1 step: 41, loss is 2.177318572998047 epoch: 1 step: 42, loss is 2.1832027435302734 epoch: 1 step: 43, loss is 2.1653120517730713 epoch: 1 step: 44, loss is 2.2017037868499756 epoch: 1 step: 45, loss is 2.097541332244873 epoch: 1 step: 46, loss is 2.174957036972046 epoch: 1 step: 47, loss is 2.1471595764160156 epoch: 1 step: 48, loss is 2.1461291313171387 epoch: 1 step: 49, loss is 2.1069915294647217 epoch: 1 step: 50, loss is 2.148634910583496 epoch: 1 step: 51, loss is 2.1617047786712646 epoch: 1 step: 52, loss is 2.1390111446380615 epoch: 1 step: 53, loss is 2.1788032054901123 epoch: 1 step: 54, loss is 2.1838719844818115 epoch: 1 step: 55, loss is 2.1878702640533447 epoch: 1 step: 56, loss is 2.254121780395508 epoch: 1 step: 57, loss is 2.1568925380706787 epoch: 1 step: 58, loss is 2.1664700508117676 epoch: 1 step: 59, loss is 2.2042391300201416 epoch: 1 step: 60, loss is 2.138359546661377 epoch: 1 step: 61, loss is 2.162292003631592 epoch: 1 step: 62, loss is 2.107999801635742 epoch: 1 step: 63, loss is 2.20550799369812 epoch: 1 step: 64, loss is 2.13793683052063 epoch: 1 step: 65, loss is 2.1301920413970947 epoch: 1 step: 66, loss is 2.1473281383514404 epoch: 1 step: 67, loss is 2.1408190727233887 epoch: 1 step: 68, loss is 2.1125881671905518 epoch: 1 step: 69, loss is 2.1237294673919678 epoch: 1 step: 70, loss is 2.0798583030700684 epoch: 1 step: 71, loss is 2.0484025478363037 epoch: 1 step: 72, loss is 2.0448365211486816 epoch: 1 step: 73, loss is 2.1569836139678955 epoch: 1 step: 74, loss is 2.1584362983703613 epoch: 1 step: 75, loss is 2.0341908931732178 epoch: 1 step: 76, loss is 2.17295241355896 epoch: 1 step: 77, loss is 2.1361145973205566 epoch: 1 step: 78, loss is 2.095043182373047 epoch: 1 step: 79, loss is 2.0424888134002686 epoch: 1 step: 80, loss is 2.147951126098633 epoch: 1 step: 81, loss is 2.0803651809692383 epoch: 1 step: 82, loss is 1.9917545318603516 epoch: 1 step: 83, loss is 2.065797805786133 epoch: 1 step: 84, loss is 2.120495557785034 epoch: 1 step: 85, loss is 2.156358242034912 epoch: 1 step: 86, loss is 2.1676712036132812 epoch: 1 step: 87, loss is 2.088245153427124 epoch: 1 step: 88, loss is 2.117332935333252 epoch: 1 step: 89, loss is 2.166799783706665 epoch: 1 step: 90, loss is 2.0351638793945312 epoch: 1 step: 91, loss is 2.0535051822662354 epoch: 1 step: 92, loss is 2.0263636112213135 epoch: 1 step: 93, loss is 2.0360491275787354 epoch: 1 step: 94, loss is 2.0493998527526855 epoch: 1 step: 95, loss is 2.093966484069824 epoch: 1 step: 96, loss is 2.0876405239105225 epoch: 1 step: 97, loss is 2.0177552700042725 epoch: 1 step: 98, loss is 2.0328681468963623 epoch: 1 step: 99, loss is 2.1284565925598145 epoch: 1 step: 100, loss is 2.038817882537842 epoch: 1 step: 101, loss is 2.1064395904541016 epoch: 1 step: 102, loss is 2.0493288040161133 epoch: 1 step: 103, loss is 2.0016369819641113 epoch: 1 step: 104, loss is 2.004840850830078 epoch: 1 step: 105, loss is 2.1012399196624756 epoch: 1 step: 106, loss is 1.9893112182617188 epoch: 1 step: 107, loss is 1.9789358377456665 epoch: 1 step: 108, loss is 2.016775608062744 epoch: 1 step: 109, loss is 2.039645195007324 epoch: 1 step: 110, loss is 2.033100128173828 epoch: 1 step: 111, loss is 2.0751750469207764 epoch: 1 step: 112, loss is 2.054032325744629 epoch: 1 step: 113, loss is 2.0449352264404297 epoch: 1 step: 114, loss is 2.1600353717803955 epoch: 1 step: 115, loss is 1.9872206449508667 epoch: 1 step: 116, loss is 2.1373400688171387 epoch: 1 step: 117, loss is 2.125105857849121 epoch: 1 step: 118, loss is 2.0569660663604736 epoch: 1 step: 119, loss is 2.0372209548950195 epoch: 1 step: 120, loss is 2.0964741706848145 epoch: 1 step: 121, loss is 1.9715644121170044 epoch: 1 step: 122, loss is 2.0824663639068604 epoch: 1 step: 123, loss is 1.9248427152633667 epoch: 1 step: 124, loss is 2.064785957336426 epoch: 1 step: 125, loss is 2.0272512435913086 epoch: 1 step: 126, loss is 1.881690263748169 epoch: 1 step: 127, loss is 2.079745292663574 epoch: 1 step: 128, loss is 1.9766654968261719 epoch: 1 step: 129, loss is 2.0431463718414307 epoch: 1 step: 130, loss is 2.0141639709472656 epoch: 1 step: 131, loss is 2.036090850830078 epoch: 1 step: 132, loss is 2.042241096496582 epoch: 1 step: 133, loss is 1.9501811265945435 epoch: 1 step: 134, loss is 1.8937513828277588 epoch: 1 step: 135, loss is 2.0809664726257324 epoch: 1 step: 136, loss is 1.9941296577453613 epoch: 1 step: 137, loss is 2.07505202293396 epoch: 1 step: 138, loss is 2.0699925422668457 epoch: 1 step: 139, loss is 1.99008047580719 epoch: 1 step: 140, loss is 1.9953538179397583 epoch: 1 step: 141, loss is 1.9193512201309204 epoch: 1 step: 142, loss is 1.9427956342697144 epoch: 1 step: 143, loss is 1.9388988018035889 epoch: 1 step: 144, loss is 1.973027229309082 epoch: 1 step: 145, loss is 2.048292875289917 epoch: 1 step: 146, loss is 2.0957119464874268 epoch: 1 step: 147, loss is 2.0850741863250732 epoch: 1 step: 148, loss is 1.9283853769302368 epoch: 1 step: 149, loss is 2.019087076187134 epoch: 1 step: 150, loss is 1.9871562719345093 epoch: 1 step: 151, loss is 1.9826608896255493 epoch: 1 step: 152, loss is 2.0052707195281982 epoch: 1 step: 153, loss is 2.0426347255706787 epoch: 1 step: 154, loss is 1.9916598796844482 epoch: 1 step: 155, loss is 1.9741183519363403 epoch: 1 step: 156, loss is 2.0702977180480957 epoch: 1 step: 157, loss is 2.0325732231140137 epoch: 1 step: 158, loss is 1.9691928625106812 epoch: 1 step: 159, loss is 2.0533018112182617 epoch: 1 step: 160, loss is 2.09694766998291 epoch: 1 step: 161, loss is 1.9766359329223633 epoch: 1 step: 162, loss is 1.9411063194274902 epoch: 1 step: 163, loss is 1.9895853996276855 epoch: 1 step: 164, loss is 2.015362501144409 epoch: 1 step: 165, loss is 2.064490556716919 epoch: 1 step: 166, loss is 1.9311487674713135 epoch: 1 step: 167, loss is 2.0468943119049072 epoch: 1 step: 168, loss is 1.9442241191864014 epoch: 1 step: 169, loss is 1.9285390377044678 epoch: 1 step: 170, loss is 1.9346100091934204 epoch: 1 step: 171, loss is 2.1009116172790527 epoch: 1 step: 172, loss is 2.0505332946777344 epoch: 1 step: 173, loss is 2.0046796798706055 epoch: 1 step: 174, loss is 2.024062156677246 epoch: 1 step: 175, loss is 1.9987787008285522 epoch: 1 step: 176, loss is 1.9816559553146362 epoch: 1 step: 177, loss is 1.9588998556137085 epoch: 1 step: 178, loss is 1.9840435981750488 epoch: 1 step: 179, loss is 1.9875191450119019 epoch: 1 step: 180, loss is 1.9972584247589111 epoch: 1 step: 181, loss is 1.9971120357513428 epoch: 1 step: 182, loss is 1.9379992485046387 epoch: 1 step: 183, loss is 2.124464511871338 epoch: 1 step: 184, loss is 2.132340908050537 epoch: 1 step: 185, loss is 1.951235294342041 epoch: 1 step: 186, loss is 1.9694455862045288 epoch: 1 step: 187, loss is 1.9557605981826782 epoch: 1 step: 188, loss is 2.0418901443481445 epoch: 1 step: 189, loss is 1.9324007034301758 epoch: 1 step: 190, loss is 1.9891881942749023 epoch: 1 step: 191, loss is 1.8896417617797852 epoch: 1 step: 192, loss is 1.9871134757995605 epoch: 1 step: 193, loss is 1.9819241762161255 epoch: 1 step: 194, loss is 2.067260265350342 epoch: 1 step: 195, loss is 1.9608522653579712 epoch: 1 step: 196, loss is 2.0522453784942627 epoch: 1 step: 197, loss is 2.029884099960327 epoch: 1 step: 198, loss is 2.145172119140625 epoch: 1 step: 199, loss is 1.9631972312927246 epoch: 1 step: 200, loss is 2.002553701400757 epoch: 1 step: 201, loss is 2.0363729000091553 epoch: 1 step: 202, loss is 2.058772563934326 epoch: 1 step: 203, loss is 2.0027172565460205 epoch: 1 step: 204, loss is 1.9935832023620605 epoch: 1 step: 205, loss is 2.090135097503662 epoch: 1 step: 206, loss is 2.0681467056274414 epoch: 1 step: 207, loss is 2.0434460639953613 epoch: 1 step: 208, loss is 1.9367997646331787 epoch: 1 step: 209, loss is 2.147993564605713 epoch: 1 step: 210, loss is 1.9816290140151978 epoch: 1 step: 211, loss is 1.985154390335083 epoch: 1 step: 212, loss is 2.055087089538574 epoch: 1 step: 213, loss is 2.0359644889831543 epoch: 1 step: 214, loss is 1.950507640838623 epoch: 1 step: 215, loss is 1.95393967628479 epoch: 1 step: 216, loss is 1.991260051727295 epoch: 1 step: 217, loss is 1.9864580631256104 epoch: 1 step: 218, loss is 1.9721453189849854 epoch: 1 step: 219, loss is 1.9942901134490967 epoch: 1 step: 220, loss is 1.8883270025253296 epoch: 1 step: 221, loss is 1.8978381156921387 epoch: 1 step: 222, loss is 1.9188950061798096 epoch: 1 step: 223, loss is 1.9739024639129639 epoch: 1 step: 224, loss is 1.9460232257843018 epoch: 1 step: 225, loss is 2.071077823638916 epoch: 1 step: 226, loss is 1.915285348892212 epoch: 1 step: 227, loss is 1.990149736404419 epoch: 1 step: 228, loss is 1.8998172283172607 epoch: 1 step: 229, loss is 1.9340561628341675 epoch: 1 step: 230, loss is 2.058088541030884 epoch: 1 step: 231, loss is 1.8666603565216064 epoch: 1 step: 232, loss is 1.9561322927474976 epoch: 1 step: 233, loss is 1.9107087850570679 epoch: 1 step: 234, loss is 1.911312460899353 epoch: 1 step: 235, loss is 1.9710066318511963 epoch: 1 step: 236, loss is 1.9095042943954468 epoch: 1 step: 237, loss is 2.0002048015594482 epoch: 1 step: 238, loss is 1.9421601295471191 epoch: 1 step: 239, loss is 1.9567835330963135 epoch: 1 step: 240, loss is 1.9102545976638794 epoch: 1 step: 241, loss is 1.9200680255889893 epoch: 1 step: 242, loss is 1.9566885232925415 epoch: 1 step: 243, loss is 1.8946560621261597 epoch: 1 step: 244, loss is 2.032424211502075 epoch: 1 step: 245, loss is 1.8792742490768433 epoch: 1 step: 246, loss is 1.8991434574127197 epoch: 1 step: 247, loss is 1.8859573602676392 epoch: 1 step: 248, loss is 1.804595708847046 epoch: 1 step: 249, loss is 1.969016671180725 epoch: 1 step: 250, loss is 2.0294530391693115 epoch: 1 step: 251, loss is 1.9516113996505737 epoch: 1 step: 252, loss is 1.9055083990097046 epoch: 1 step: 253, loss is 1.9428240060806274 epoch: 1 step: 254, loss is 1.9211325645446777 epoch: 1 step: 255, loss is 1.9290534257888794 epoch: 1 step: 256, loss is 1.927079677581787 epoch: 1 step: 257, loss is 2.036738872528076 epoch: 1 step: 258, loss is 1.9652936458587646 epoch: 1 step: 259, loss is 1.9190199375152588 epoch: 1 step: 260, loss is 1.9562625885009766 epoch: 1 step: 261, loss is 1.92203688621521 epoch: 1 step: 262, loss is 1.901296854019165 epoch: 1 step: 263, loss is 1.9890611171722412 epoch: 1 step: 264, loss is 2.0135769844055176 epoch: 1 step: 265, loss is 1.936768651008606 epoch: 1 step: 266, loss is 1.914853572845459 epoch: 1 step: 267, loss is 1.942449688911438 epoch: 1 step: 268, loss is 1.908927083015442 epoch: 1 step: 269, loss is 1.8932807445526123 epoch: 1 step: 270, loss is 2.0091052055358887 epoch: 1 step: 271, loss is 1.9299839735031128 epoch: 1 step: 272, loss is 1.970038890838623 epoch: 1 step: 273, loss is 1.9781885147094727 epoch: 1 step: 274, loss is 1.8617650270462036 epoch: 1 step: 275, loss is 2.008246421813965 epoch: 1 step: 276, loss is 1.9354159832000732 epoch: 1 step: 277, loss is 1.9291837215423584 epoch: 1 step: 278, loss is 2.012862205505371 epoch: 1 step: 279, loss is 1.8868024349212646 epoch: 1 step: 280, loss is 1.8654429912567139 epoch: 1 step: 281, loss is 1.9085495471954346 epoch: 1 step: 282, loss is 2.0358519554138184 epoch: 1 step: 283, loss is 1.9203890562057495 epoch: 1 step: 284, loss is 1.9827072620391846 epoch: 1 step: 285, loss is 1.8735699653625488 epoch: 1 step: 286, loss is 1.9019166231155396 epoch: 1 step: 287, loss is 2.0084218978881836 epoch: 1 step: 288, loss is 2.000614881515503 epoch: 1 step: 289, loss is 2.0126662254333496 epoch: 1 step: 290, loss is 1.9186487197875977 epoch: 1 step: 291, loss is 1.8803479671478271 epoch: 1 step: 292, loss is 1.873417854309082 epoch: 1 step: 293, loss is 1.8859927654266357 epoch: 1 step: 294, loss is 1.9195986986160278 epoch: 1 step: 295, loss is 1.9440782070159912 epoch: 1 step: 296, loss is 1.9989290237426758 epoch: 1 step: 297, loss is 1.9591376781463623 epoch: 1 step: 298, loss is 2.0234837532043457 epoch: 1 step: 299, loss is 1.948691964149475 epoch: 1 step: 300, loss is 2.0098040103912354 epoch: 1 step: 301, loss is 1.9991217851638794 epoch: 1 step: 302, loss is 1.8470983505249023 epoch: 1 step: 303, loss is 1.9883179664611816 epoch: 1 step: 304, loss is 1.9590797424316406 epoch: 1 step: 305, loss is 1.9329893589019775 epoch: 1 step: 306, loss is 2.0018374919891357 epoch: 1 step: 307, loss is 2.001723289489746 epoch: 1 step: 308, loss is 1.8369154930114746 epoch: 1 step: 309, loss is 1.9257686138153076 epoch: 1 step: 310, loss is 1.9213684797286987 epoch: 1 step: 311, loss is 1.91152024269104 epoch: 1 step: 312, loss is 1.9036773443222046 epoch: 1 step: 313, loss is 1.9819672107696533 epoch: 1 step: 314, loss is 1.9471276998519897 epoch: 1 step: 315, loss is 1.9795304536819458 epoch: 1 step: 316, loss is 1.8561713695526123 epoch: 1 step: 317, loss is 2.045935869216919 epoch: 1 step: 318, loss is 1.9417272806167603 epoch: 1 step: 319, loss is 2.10587215423584 epoch: 1 step: 320, loss is 1.82456636428833 epoch: 1 step: 321, loss is 1.9213714599609375 epoch: 1 step: 322, loss is 1.8929128646850586 epoch: 1 step: 323, loss is 2.016751289367676 epoch: 1 step: 324, loss is 1.8333282470703125 epoch: 1 step: 325, loss is 1.904754877090454 epoch: 1 step: 326, loss is 2.0365891456604004 epoch: 1 step: 327, loss is 1.8032381534576416 epoch: 1 step: 328, loss is 1.889327049255371 epoch: 1 step: 329, loss is 1.8690977096557617 epoch: 1 step: 330, loss is 1.9579219818115234 epoch: 1 step: 331, loss is 1.897598147392273 epoch: 1 step: 332, loss is 1.936568021774292 epoch: 1 step: 333, loss is 1.9168888330459595 epoch: 1 step: 334, loss is 1.9970436096191406 epoch: 1 step: 335, loss is 1.8795043230056763 epoch: 1 step: 336, loss is 1.8512083292007446 epoch: 1 step: 337, loss is 1.9615848064422607 epoch: 1 step: 338, loss is 1.8836190700531006 epoch: 1 step: 339, loss is 1.8904942274093628 epoch: 1 step: 340, loss is 2.0177059173583984 epoch: 1 step: 341, loss is 1.9037587642669678 epoch: 1 step: 342, loss is 1.9891458749771118 epoch: 1 step: 343, loss is 1.901099681854248 epoch: 1 step: 344, loss is 1.9564093351364136 epoch: 1 step: 345, loss is 2.050227165222168 epoch: 1 step: 346, loss is 1.9546282291412354 epoch: 1 step: 347, loss is 1.8253402709960938 epoch: 1 step: 348, loss is 1.881971001625061 epoch: 1 step: 349, loss is 1.915027379989624 epoch: 1 step: 350, loss is 1.9205769300460815 epoch: 1 step: 351, loss is 1.9755690097808838 epoch: 1 step: 352, loss is 1.9711400270462036 epoch: 1 step: 353, loss is 1.9381728172302246 epoch: 1 step: 354, loss is 1.746055245399475 epoch: 1 step: 355, loss is 1.942064881324768 epoch: 1 step: 356, loss is 1.8913757801055908 epoch: 1 step: 357, loss is 1.8841240406036377 epoch: 1 step: 358, loss is 1.978926420211792 epoch: 1 step: 359, loss is 2.0343894958496094 epoch: 1 step: 360, loss is 1.8985085487365723 epoch: 1 step: 361, loss is 1.889115810394287 epoch: 1 step: 362, loss is 1.9065433740615845 epoch: 1 step: 363, loss is 1.9389437437057495 epoch: 1 step: 364, loss is 1.9310613870620728 epoch: 1 step: 365, loss is 1.965096354484558 epoch: 1 step: 366, loss is 2.001617908477783 epoch: 1 step: 367, loss is 1.9463062286376953 epoch: 1 step: 368, loss is 1.9296867847442627 epoch: 1 step: 369, loss is 1.9059423208236694 epoch: 1 step: 370, loss is 1.9741623401641846 epoch: 1 step: 371, loss is 1.9527919292449951 epoch: 1 step: 372, loss is 1.8885220289230347 epoch: 1 step: 373, loss is 1.87715482711792 epoch: 1 step: 374, loss is 2.00614333152771 epoch: 1 step: 375, loss is 1.8491806983947754 epoch: 1 step: 376, loss is 1.8577344417572021 epoch: 1 step: 377, loss is 2.001436948776245 epoch: 1 step: 378, loss is 1.8928618431091309 epoch: 1 step: 379, loss is 2.0045433044433594 epoch: 1 step: 380, loss is 2.115802049636841 epoch: 1 step: 381, loss is 1.8844345808029175 epoch: 1 step: 382, loss is 1.9139431715011597 epoch: 1 step: 383, loss is 2.0448434352874756 epoch: 1 step: 384, loss is 2.0351815223693848 epoch: 1 step: 385, loss is 1.9286075830459595 epoch: 1 step: 386, loss is 1.908137321472168 epoch: 1 step: 387, loss is 1.8745241165161133 epoch: 1 step: 388, loss is 1.9043793678283691 epoch: 1 step: 389, loss is 1.8875763416290283 epoch: 1 step: 390, loss is 1.8309433460235596 Train epoch time: 301169.930 ms, per step time: 772.231 ms epoch: 2 step: 1, loss is 2.0086636543273926 epoch: 2 step: 2, loss is 1.8561439514160156 epoch: 2 step: 3, loss is 1.959549903869629 epoch: 2 step: 4, loss is 1.845109462738037 epoch: 2 step: 5, loss is 1.859502911567688 epoch: 2 step: 6, loss is 1.8266891241073608 epoch: 2 step: 7, loss is 1.9940757751464844 epoch: 2 step: 8, loss is 1.887444257736206 epoch: 2 step: 9, loss is 1.941131353378296 epoch: 2 step: 10, loss is 2.024829387664795 epoch: 2 step: 11, loss is 1.8800655603408813 epoch: 2 step: 12, loss is 1.9109078645706177 epoch: 2 step: 13, loss is 1.9579572677612305 epoch: 2 step: 14, loss is 1.941686749458313 epoch: 2 step: 15, loss is 1.8821125030517578 epoch: 2 step: 16, loss is 1.8421412706375122 epoch: 2 step: 17, loss is 1.814638376235962 epoch: 2 step: 18, loss is 1.8196184635162354 epoch: 2 step: 19, loss is 1.9221185445785522 epoch: 2 step: 20, loss is 1.9255518913269043 epoch: 2 step: 21, loss is 1.8777854442596436 epoch: 2 step: 22, loss is 1.8691197633743286 epoch: 2 step: 23, loss is 1.9205360412597656 epoch: 2 step: 24, loss is 1.8332617282867432 epoch: 2 step: 25, loss is 1.88694429397583 epoch: 2 step: 26, loss is 1.8043739795684814 epoch: 2 step: 27, loss is 1.939123511314392 epoch: 2 step: 28, loss is 1.844259262084961 epoch: 2 step: 29, loss is 1.8621373176574707 epoch: 2 step: 30, loss is 1.9329479932785034 epoch: 2 step: 31, loss is 1.8549472093582153 epoch: 2 step: 32, loss is 1.8989877700805664 epoch: 2 step: 33, loss is 1.9532358646392822 epoch: 2 step: 34, loss is 1.8432390689849854 epoch: 2 step: 35, loss is 1.9938075542449951 epoch: 2 step: 36, loss is 1.8990700244903564 epoch: 2 step: 37, loss is 1.8663583993911743 epoch: 2 step: 38, loss is 1.8983309268951416 epoch: 2 step: 39, loss is 1.832546591758728 epoch: 2 step: 40, loss is 1.8912301063537598 epoch: 2 step: 41, loss is 1.889316439628601 epoch: 2 step: 42, loss is 1.9854687452316284 epoch: 2 step: 43, loss is 1.8240479230880737 epoch: 2 step: 44, loss is 1.9101498126983643 epoch: 2 step: 45, loss is 1.797684669494629 epoch: 2 step: 46, loss is 1.8577277660369873 epoch: 2 step: 47, loss is 1.8356904983520508 epoch: 2 step: 48, loss is 1.923057198524475 epoch: 2 step: 49, loss is 1.9273544549942017 epoch: 2 step: 50, loss is 1.8730634450912476 epoch: 2 step: 51, loss is 1.7962920665740967 epoch: 2 step: 52, loss is 1.840844988822937 epoch: 2 step: 53, loss is 1.9192214012145996 epoch: 2 step: 54, loss is 1.7557487487792969 epoch: 2 step: 55, loss is 1.8856722116470337 epoch: 2 step: 56, loss is 1.9432092905044556 epoch: 2 step: 57, loss is 1.858530879020691 epoch: 2 step: 58, loss is 2.002671718597412 epoch: 2 step: 59, loss is 1.856890082359314 epoch: 2 step: 60, loss is 1.820954442024231 epoch: 2 step: 61, loss is 1.946556568145752 epoch: 2 step: 62, loss is 1.8445112705230713 epoch: 2 step: 63, loss is 1.7778434753417969 epoch: 2 step: 64, loss is 1.8148187398910522 epoch: 2 step: 65, loss is 1.8800560235977173 epoch: 2 step: 66, loss is 1.7955501079559326 epoch: 2 step: 67, loss is 1.9290978908538818 epoch: 2 step: 68, loss is 1.9206385612487793 epoch: 2 step: 69, loss is 1.8472034931182861 epoch: 2 step: 70, loss is 1.8666832447052002 epoch: 2 step: 71, loss is 1.9787473678588867 epoch: 2 step: 72, loss is 1.8786567449569702 epoch: 2 step: 73, loss is 1.7307939529418945 epoch: 2 step: 74, loss is 1.9424220323562622 epoch: 2 step: 75, loss is 1.8803863525390625 epoch: 2 step: 76, loss is 1.8787437677383423 epoch: 2 step: 77, loss is 1.8715336322784424 epoch: 2 step: 78, loss is 1.8748095035552979 epoch: 2 step: 79, loss is 1.8724067211151123 epoch: 2 step: 80, loss is 1.8271408081054688 epoch: 2 step: 81, loss is 2.0027823448181152 epoch: 2 step: 82, loss is 1.7880679368972778 epoch: 2 step: 83, loss is 1.8629825115203857 epoch: 2 step: 84, loss is 1.7959777116775513 epoch: 2 step: 85, loss is 1.7876453399658203 epoch: 2 step: 86, loss is 1.9502708911895752 epoch: 2 step: 87, loss is 1.833869218826294 epoch: 2 step: 88, loss is 1.8148149251937866 epoch: 2 step: 89, loss is 1.86910879611969 epoch: 2 step: 90, loss is 1.8948606252670288 epoch: 2 step: 91, loss is 1.8240607976913452 epoch: 2 step: 92, loss is 1.7175065279006958 epoch: 2 step: 93, loss is 1.940483570098877 epoch: 2 step: 94, loss is 1.7632392644882202 epoch: 2 step: 95, loss is 1.737947940826416 epoch: 2 step: 96, loss is 1.7186986207962036 epoch: 2 step: 97, loss is 1.8999754190444946 epoch: 2 step: 98, loss is 1.7857352495193481 epoch: 2 step: 99, loss is 1.9178612232208252 epoch: 2 step: 100, loss is 1.8766601085662842 epoch: 2 step: 101, loss is 1.82723069190979 epoch: 2 step: 102, loss is 1.9207415580749512 epoch: 2 step: 103, loss is 1.8620266914367676 epoch: 2 step: 104, loss is 1.8713843822479248 epoch: 2 step: 105, loss is 1.8156766891479492 epoch: 2 step: 106, loss is 1.8680598735809326 epoch: 2 step: 107, loss is 1.8187801837921143 epoch: 2 step: 108, loss is 1.7260297536849976 epoch: 2 step: 109, loss is 1.8599132299423218 epoch: 2 step: 110, loss is 1.881721019744873 epoch: 2 step: 111, loss is 1.8597749471664429 epoch: 2 step: 112, loss is 1.821563720703125 epoch: 2 step: 113, loss is 1.8555083274841309 epoch: 2 step: 114, loss is 1.9005643129348755 epoch: 2 step: 115, loss is 1.7498300075531006 epoch: 2 step: 116, loss is 1.9095624685287476 epoch: 2 step: 117, loss is 2.02589750289917 epoch: 2 step: 118, loss is 1.7067241668701172 epoch: 2 step: 119, loss is 1.9239228963851929 epoch: 2 step: 120, loss is 1.9379498958587646 epoch: 2 step: 121, loss is 1.7686045169830322 epoch: 2 step: 122, loss is 1.8032222986221313 epoch: 2 step: 123, loss is 1.8437695503234863 epoch: 2 step: 124, loss is 1.851049780845642 epoch: 2 step: 125, loss is 1.8590387105941772 epoch: 2 step: 126, loss is 1.816530466079712 epoch: 2 step: 127, loss is 1.7972630262374878 epoch: 2 step: 128, loss is 1.9503204822540283 epoch: 2 step: 129, loss is 1.8170355558395386 epoch: 2 step: 130, loss is 1.9414114952087402 epoch: 2 step: 131, loss is 1.9720697402954102 epoch: 2 step: 132, loss is 1.8431159257888794 epoch: 2 step: 133, loss is 1.8431402444839478 epoch: 2 step: 134, loss is 1.880516529083252 epoch: 2 step: 135, loss is 1.9251197576522827 epoch: 2 step: 136, loss is 1.7931485176086426 epoch: 2 step: 137, loss is 1.8366456031799316 epoch: 2 step: 138, loss is 1.8162732124328613 epoch: 2 step: 139, loss is 1.766951084136963 epoch: 2 step: 140, loss is 1.9067617654800415 epoch: 2 step: 141, loss is 1.8185125589370728 epoch: 2 step: 142, loss is 1.7571754455566406 epoch: 2 step: 143, loss is 1.895729422569275 epoch: 2 step: 144, loss is 1.8479565382003784 epoch: 2 step: 145, loss is 1.851345419883728 epoch: 2 step: 146, loss is 1.7962510585784912 epoch: 2 step: 147, loss is 1.8403642177581787 epoch: 2 step: 148, loss is 1.9102624654769897 epoch: 2 step: 149, loss is 1.8668683767318726 epoch: 2 step: 150, loss is 1.8768565654754639 epoch: 2 step: 151, loss is 1.803843379020691 epoch: 2 step: 152, loss is 1.80448579788208 epoch: 2 step: 153, loss is 1.9157750606536865 epoch: 2 step: 154, loss is 1.7746250629425049 epoch: 2 step: 155, loss is 1.9414875507354736 epoch: 2 step: 156, loss is 1.8375035524368286 epoch: 2 step: 157, loss is 1.7554367780685425 epoch: 2 step: 158, loss is 1.849830985069275 epoch: 2 step: 159, loss is 1.9036533832550049 epoch: 2 step: 160, loss is 1.9142879247665405 epoch: 2 step: 161, loss is 1.8832579851150513 epoch: 2 step: 162, loss is 1.792252779006958 epoch: 2 step: 163, loss is 1.862593412399292 epoch: 2 step: 164, loss is 1.8466187715530396 epoch: 2 step: 165, loss is 1.7753026485443115 epoch: 2 step: 166, loss is 1.7677288055419922 epoch: 2 step: 167, loss is 1.9642395973205566 epoch: 2 step: 168, loss is 1.9213532209396362 epoch: 2 step: 169, loss is 1.9132721424102783 epoch: 2 step: 170, loss is 1.8411797285079956 epoch: 2 step: 171, loss is 1.8710469007492065 epoch: 2 step: 172, loss is 1.8579764366149902 epoch: 2 step: 173, loss is 1.8785043954849243 epoch: 2 step: 174, loss is 1.792521595954895 epoch: 2 step: 175, loss is 1.8440773487091064 epoch: 2 step: 176, loss is 1.8609850406646729 epoch: 2 step: 177, loss is 1.8927359580993652 epoch: 2 step: 178, loss is 1.7732219696044922 epoch: 2 step: 179, loss is 1.8476505279541016 epoch: 2 step: 180, loss is 1.8040062189102173 epoch: 2 step: 181, loss is 1.7327547073364258 epoch: 2 step: 182, loss is 1.7375595569610596 epoch: 2 step: 183, loss is 1.819338083267212 epoch: 2 step: 184, loss is 1.8011999130249023 epoch: 2 step: 185, loss is 1.830836534500122 epoch: 2 step: 186, loss is 1.7516945600509644 epoch: 2 step: 187, loss is 1.8135254383087158 epoch: 2 step: 188, loss is 1.8124158382415771 epoch: 2 step: 189, loss is 1.6668801307678223 epoch: 2 step: 190, loss is 1.7354092597961426 epoch: 2 step: 191, loss is 1.8319144248962402 epoch: 2 step: 192, loss is 1.7852344512939453 epoch: 2 step: 193, loss is 1.8413450717926025 epoch: 2 step: 194, loss is 1.7621142864227295 epoch: 2 step: 195, loss is 1.893214464187622 epoch: 2 step: 196, loss is 1.85415518283844 epoch: 2 step: 197, loss is 1.8773202896118164 epoch: 2 step: 198, loss is 1.7623281478881836 epoch: 2 step: 199, loss is 1.7395901679992676 epoch: 2 step: 200, loss is 1.8567042350769043 epoch: 2 step: 201, loss is 1.778634786605835 epoch: 2 step: 202, loss is 1.7063484191894531 epoch: 2 step: 203, loss is 1.749333143234253 epoch: 2 step: 204, loss is 1.8264789581298828 epoch: 2 step: 205, loss is 1.8365423679351807 epoch: 2 step: 206, loss is 1.8969218730926514 epoch: 2 step: 207, loss is 1.8710927963256836 epoch: 2 step: 208, loss is 1.7963848114013672 epoch: 2 step: 209, loss is 1.8325331211090088 epoch: 2 step: 210, loss is 1.8086352348327637 epoch: 2 step: 211, loss is 1.7635247707366943 epoch: 2 step: 212, loss is 1.786803960800171 epoch: 2 step: 213, loss is 1.86027193069458 epoch: 2 step: 214, loss is 1.8134807348251343 epoch: 2 step: 215, loss is 1.6304543018341064 epoch: 2 step: 216, loss is 1.9413152933120728 epoch: 2 step: 217, loss is 1.8634803295135498 epoch: 2 step: 218, loss is 1.7682809829711914 epoch: 2 step: 219, loss is 1.7050912380218506 epoch: 2 step: 220, loss is 1.8277148008346558 epoch: 2 step: 221, loss is 1.765455961227417 epoch: 2 step: 222, loss is 1.8628239631652832 epoch: 2 step: 223, loss is 1.7774879932403564 epoch: 2 step: 224, loss is 1.7936487197875977 epoch: 2 step: 225, loss is 1.8603432178497314 epoch: 2 step: 226, loss is 1.8159501552581787 epoch: 2 step: 227, loss is 1.8525795936584473 epoch: 2 step: 228, loss is 1.777559757232666 epoch: 2 step: 229, loss is 1.7203599214553833 epoch: 2 step: 230, loss is 1.7874581813812256 epoch: 2 step: 231, loss is 1.772555947303772 epoch: 2 step: 232, loss is 1.6849935054779053 epoch: 2 step: 233, loss is 1.9093971252441406 epoch: 2 step: 234, loss is 1.771955966949463 epoch: 2 step: 235, loss is 1.8456029891967773 epoch: 2 step: 236, loss is 1.7137638330459595 epoch: 2 step: 237, loss is 1.9477996826171875 epoch: 2 step: 238, loss is 1.8464946746826172 epoch: 2 step: 239, loss is 1.7354912757873535 epoch: 2 step: 240, loss is 1.7765865325927734 epoch: 2 step: 241, loss is 1.8965409994125366 epoch: 2 step: 242, loss is 1.8762569427490234 epoch: 2 step: 243, loss is 1.7814252376556396 epoch: 2 step: 244, loss is 1.73980712890625 epoch: 2 step: 245, loss is 1.8301444053649902 epoch: 2 step: 246, loss is 1.811732292175293 epoch: 2 step: 247, loss is 1.8318946361541748 epoch: 2 step: 248, loss is 1.8942484855651855 epoch: 2 step: 249, loss is 1.730800747871399 epoch: 2 step: 250, loss is 1.7132985591888428 epoch: 2 step: 251, loss is 1.7972244024276733 epoch: 2 step: 252, loss is 1.8195381164550781 epoch: 2 step: 253, loss is 1.6059634685516357 epoch: 2 step: 254, loss is 1.7207951545715332 epoch: 2 step: 255, loss is 1.7502498626708984 epoch: 2 step: 256, loss is 1.755635380744934 epoch: 2 step: 257, loss is 1.758680820465088 epoch: 2 step: 258, loss is 1.8161877393722534 epoch: 2 step: 259, loss is 1.847859263420105 epoch: 2 step: 260, loss is 1.7462278604507446 epoch: 2 step: 261, loss is 1.7590373754501343 epoch: 2 step: 262, loss is 1.8493421077728271 epoch: 2 step: 263, loss is 1.7797712087631226 epoch: 2 step: 264, loss is 1.8671611547470093 epoch: 2 step: 265, loss is 1.848267912864685 epoch: 2 step: 266, loss is 1.7634716033935547 epoch: 2 step: 267, loss is 1.6971874237060547 epoch: 2 step: 268, loss is 1.699387788772583 epoch: 2 step: 269, loss is 1.791347622871399 epoch: 2 step: 270, loss is 1.801035761833191 epoch: 2 step: 271, loss is 1.851177453994751 epoch: 2 step: 272, loss is 1.9037847518920898 epoch: 2 step: 273, loss is 1.8303537368774414 epoch: 2 step: 274, loss is 1.8420476913452148 epoch: 2 step: 275, loss is 1.764127492904663 epoch: 2 step: 276, loss is 1.888632893562317 epoch: 2 step: 277, loss is 1.8250199556350708 epoch: 2 step: 278, loss is 1.8460564613342285 epoch: 2 step: 279, loss is 1.767008900642395 epoch: 2 step: 280, loss is 1.9221689701080322 epoch: 2 step: 281, loss is 1.8578596115112305 epoch: 2 step: 282, loss is 1.8321722745895386 epoch: 2 step: 283, loss is 1.738450288772583 epoch: 2 step: 284, loss is 1.8421459197998047 epoch: 2 step: 285, loss is 1.7913256883621216 epoch: 2 step: 286, loss is 1.7157938480377197 epoch: 2 step: 287, loss is 1.768418312072754 epoch: 2 step: 288, loss is 1.8244106769561768 epoch: 2 step: 289, loss is 1.900506615638733 epoch: 2 step: 290, loss is 1.73917555809021 epoch: 2 step: 291, loss is 1.6665081977844238 epoch: 2 step: 292, loss is 1.8142967224121094 epoch: 2 step: 293, loss is 1.8735641241073608 epoch: 2 step: 294, loss is 1.729585886001587 epoch: 2 step: 295, loss is 1.8190531730651855 epoch: 2 step: 296, loss is 1.820863127708435 epoch: 2 step: 297, loss is 1.800586462020874 epoch: 2 step: 298, loss is 1.8346083164215088 epoch: 2 step: 299, loss is 1.8632863759994507 epoch: 2 step: 300, loss is 1.7105364799499512 epoch: 2 step: 301, loss is 1.8319177627563477 epoch: 2 step: 302, loss is 1.8042783737182617 epoch: 2 step: 303, loss is 1.9537193775177002 epoch: 2 step: 304, loss is 1.7980555295944214 epoch: 2 step: 305, loss is 1.6234668493270874 epoch: 2 step: 306, loss is 1.7497445344924927 epoch: 2 step: 307, loss is 1.650848627090454 epoch: 2 step: 308, loss is 1.7433573007583618 epoch: 2 step: 309, loss is 1.8352850675582886 epoch: 2 step: 310, loss is 1.7705419063568115 epoch: 2 step: 311, loss is 1.7893061637878418 epoch: 2 step: 312, loss is 1.8017258644104004 epoch: 2 step: 313, loss is 1.8343088626861572 epoch: 2 step: 314, loss is 1.863562822341919 epoch: 2 step: 315, loss is 1.7683537006378174 epoch: 2 step: 316, loss is 1.7513844966888428 epoch: 2 step: 317, loss is 1.8488166332244873 epoch: 2 step: 318, loss is 1.6568918228149414 epoch: 2 step: 319, loss is 1.7907938957214355 epoch: 2 step: 320, loss is 1.842228889465332 epoch: 2 step: 321, loss is 1.740508794784546 epoch: 2 step: 322, loss is 1.8639365434646606 epoch: 2 step: 323, loss is 1.7938766479492188 epoch: 2 step: 324, loss is 1.7994264364242554 epoch: 2 step: 325, loss is 1.7769482135772705 epoch: 2 step: 326, loss is 1.7914179563522339 epoch: 2 step: 327, loss is 1.8817980289459229 epoch: 2 step: 328, loss is 1.780959129333496 epoch: 2 step: 329, loss is 1.7240921258926392 epoch: 2 step: 330, loss is 1.8149974346160889 epoch: 2 step: 331, loss is 1.7758080959320068 epoch: 2 step: 332, loss is 1.850448489189148 epoch: 2 step: 333, loss is 1.8151671886444092 epoch: 2 step: 334, loss is 1.6135375499725342 epoch: 2 step: 335, loss is 1.8676254749298096 epoch: 2 step: 336, loss is 1.796992540359497 epoch: 2 step: 337, loss is 1.7397630214691162 epoch: 2 step: 338, loss is 1.8294241428375244 epoch: 2 step: 339, loss is 1.918345332145691 epoch: 2 step: 340, loss is 1.7349798679351807 epoch: 2 step: 341, loss is 1.831843376159668 epoch: 2 step: 343, loss is 1.8684601783752441 epoch: 2 step: 344, loss is 1.7865076065063477 epoch: 2 step: 345, loss is 1.7199623584747314 epoch: 2 step: 346, loss is 1.7887533903121948 epoch: 2 step: 347, loss is 1.92432701587677 epoch: 2 step: 348, loss is 1.7560091018676758 epoch: 2 step: 349, loss is 1.8518497943878174 epoch: 2 step: 350, loss is 1.7869535684585571 epoch: 2 step: 351, loss is 1.7211872339248657 epoch: 2 step: 352, loss is 1.985832691192627 epoch: 2 step: 353, loss is 1.8105260133743286 epoch: 2 step: 354, loss is 1.835160255432129 epoch: 2 step: 355, loss is 1.7800557613372803 epoch: 2 step: 356, loss is 1.7522625923156738 epoch: 2 step: 357, loss is 1.7488387823104858 epoch: 2 step: 358, loss is 1.7606761455535889 epoch: 2 step: 359, loss is 1.8526567220687866 epoch: 2 step: 360, loss is 1.7670416831970215 epoch: 2 step: 361, loss is 1.8308148384094238 epoch: 2 step: 362, loss is 1.7068452835083008 epoch: 2 step: 363, loss is 1.8467777967453003 epoch: 2 step: 364, loss is 1.7617813348770142 epoch: 2 step: 365, loss is 1.8768242597579956 epoch: 2 step: 366, loss is 1.7613787651062012 epoch: 2 step: 367, loss is 1.670498251914978 epoch: 2 step: 368, loss is 1.785229206085205 epoch: 2 step: 369, loss is 1.794966459274292 epoch: 2 step: 370, loss is 1.78444504737854 epoch: 2 step: 371, loss is 1.8249762058258057 epoch: 2 step: 372, loss is 1.7419517040252686 epoch: 2 step: 373, loss is 1.7378820180892944 epoch: 2 step: 374, loss is 1.8318363428115845 epoch: 2 step: 375, loss is 1.8721061944961548 epoch: 2 step: 376, loss is 1.6173243522644043 epoch: 2 step: 377, loss is 1.7758114337921143 epoch: 2 step: 378, loss is 1.756319522857666 epoch: 2 step: 379, loss is 1.6763895750045776 epoch: 2 step: 380, loss is 1.7565263509750366 epoch: 2 step: 381, loss is 1.705394983291626 epoch: 2 step: 382, loss is 1.7143198251724243 epoch: 2 step: 383, loss is 1.6517314910888672 epoch: 2 step: 384, loss is 1.722870111465454 epoch: 2 step: 385, loss is 1.7791318893432617 epoch: 2 step: 386, loss is 1.7326841354370117 epoch: 2 step: 387, loss is 1.7197530269622803 epoch: 2 step: 388, loss is 1.6298965215682983 epoch: 2 step: 389, loss is 1.8236207962036133 epoch: 2 step: 390, loss is 1.7148630619049072 Train epoch time: 155993.727 ms, per step time: 399.984 ms epoch: 3 step: 1, loss is 1.7641422748565674 epoch: 3 step: 2, loss is 1.7547181844711304 epoch: 3 step: 3, loss is 1.793548345565796 epoch: 3 step: 4, loss is 1.8427743911743164 epoch: 3 step: 5, loss is 1.6967132091522217 epoch: 3 step: 6, loss is 1.750718355178833 epoch: 3 step: 7, loss is 1.75998055934906 epoch: 3 step: 8, loss is 1.760133981704712 epoch: 3 step: 9, loss is 1.7812654972076416 epoch: 3 step: 10, loss is 1.716094732284546 epoch: 3 step: 11, loss is 1.7356353998184204 epoch: 3 step: 12, loss is 1.729407787322998 epoch: 3 step: 13, loss is 1.7473516464233398 epoch: 3 step: 14, loss is 1.7516013383865356 epoch: 3 step: 15, loss is 1.7336506843566895 epoch: 3 step: 16, loss is 1.7298648357391357 epoch: 3 step: 17, loss is 1.8397022485733032 epoch: 3 step: 18, loss is 1.824225664138794 epoch: 3 step: 19, loss is 1.7612080574035645 epoch: 3 step: 20, loss is 1.8021020889282227 epoch: 3 step: 21, loss is 1.7924782037734985 epoch: 3 step: 22, loss is 1.820109248161316 epoch: 3 step: 23, loss is 1.8755810260772705 epoch: 3 step: 24, loss is 1.8197954893112183 epoch: 3 step: 25, loss is 1.761940360069275 epoch: 3 step: 26, loss is 1.8842095136642456 epoch: 3 step: 27, loss is 1.7360103130340576 epoch: 3 step: 28, loss is 1.8140599727630615 epoch: 3 step: 29, loss is 1.7043712139129639 epoch: 3 step: 30, loss is 1.7635387182235718 epoch: 3 step: 31, loss is 1.8017815351486206 epoch: 3 step: 32, loss is 1.720243215560913 epoch: 3 step: 33, loss is 1.8202407360076904 epoch: 3 step: 34, loss is 1.775327444076538 epoch: 3 step: 35, loss is 1.674959659576416 epoch: 3 step: 36, loss is 1.562670111656189 epoch: 3 step: 37, loss is 1.7781317234039307 epoch: 3 step: 38, loss is 1.7011034488677979 epoch: 3 step: 39, loss is 1.8195388317108154 epoch: 3 step: 40, loss is 1.6667931079864502 epoch: 3 step: 41, loss is 1.7732675075531006 epoch: 3 step: 42, loss is 1.8425345420837402 epoch: 3 step: 43, loss is 1.6280186176300049 epoch: 3 step: 44, loss is 1.7534148693084717 epoch: 3 step: 45, loss is 1.749954104423523 epoch: 3 step: 46, loss is 1.7335509061813354 epoch: 3 step: 47, loss is 1.8067073822021484 epoch: 3 step: 48, loss is 1.6626200675964355 epoch: 3 step: 49, loss is 1.715445637702942 epoch: 3 step: 50, loss is 1.762415885925293 epoch: 3 step: 51, loss is 1.7296688556671143 epoch: 3 step: 52, loss is 1.7437942028045654 epoch: 3 step: 53, loss is 1.731560468673706 epoch: 3 step: 54, loss is 1.8204193115234375 epoch: 3 step: 55, loss is 1.7309167385101318 epoch: 3 step: 56, loss is 1.7726114988327026 epoch: 3 step: 57, loss is 1.681417465209961 epoch: 3 step: 58, loss is 1.7564997673034668 epoch: 3 step: 59, loss is 1.6605130434036255 epoch: 3 step: 60, loss is 1.7276595830917358 epoch: 3 step: 61, loss is 1.7239662408828735 epoch: 3 step: 62, loss is 1.796515703201294 epoch: 3 step: 63, loss is 1.6727538108825684 epoch: 3 step: 64, loss is 1.7312177419662476 epoch: 3 step: 65, loss is 1.7125099897384644 epoch: 3 step: 66, loss is 1.6296565532684326 epoch: 3 step: 67, loss is 1.7298835515975952 epoch: 3 step: 68, loss is 1.6555956602096558 epoch: 3 step: 69, loss is 1.7559605836868286 epoch: 3 step: 70, loss is 1.802220344543457 epoch: 3 step: 71, loss is 1.8298285007476807 epoch: 3 step: 72, loss is 1.6363513469696045 epoch: 3 step: 73, loss is 1.7824245691299438 epoch: 3 step: 74, loss is 1.7274856567382812 epoch: 3 step: 75, loss is 1.6231114864349365 epoch: 3 step: 76, loss is 1.6169097423553467 epoch: 3 step: 77, loss is 1.7331278324127197 epoch: 3 step: 78, loss is 1.9007054567337036 epoch: 3 step: 79, loss is 1.5682278871536255 epoch: 3 step: 80, loss is 1.6923885345458984 epoch: 3 step: 81, loss is 1.7956039905548096 epoch: 3 step: 82, loss is 1.7601070404052734 epoch: 3 step: 83, loss is 1.7837741374969482 epoch: 3 step: 84, loss is 1.7951622009277344 epoch: 3 step: 85, loss is 1.8333876132965088 epoch: 3 step: 86, loss is 1.819688320159912 epoch: 3 step: 87, loss is 1.6919232606887817 epoch: 3 step: 88, loss is 1.621659278869629 epoch: 3 step: 89, loss is 1.7886335849761963 epoch: 3 step: 90, loss is 1.751739263534546 epoch: 3 step: 91, loss is 1.6526401042938232 epoch: 3 step: 92, loss is 1.7481526136398315 epoch: 3 step: 93, loss is 1.8013017177581787 epoch: 3 step: 94, loss is 1.6293811798095703 epoch: 3 step: 95, loss is 1.578917384147644 epoch: 3 step: 96, loss is 1.7730318307876587 epoch: 3 step: 97, loss is 1.7906396389007568 epoch: 3 step: 98, loss is 1.7818952798843384 epoch: 3 step: 99, loss is 1.8343371152877808 epoch: 3 step: 100, loss is 1.8384268283843994 epoch: 3 step: 101, loss is 1.7740890979766846 epoch: 3 step: 102, loss is 1.693620204925537 epoch: 3 step: 103, loss is 1.6320619583129883 epoch: 3 step: 104, loss is 1.758415699005127 epoch: 3 step: 105, loss is 1.732062578201294 epoch: 3 step: 106, loss is 1.7792495489120483 epoch: 3 step: 107, loss is 1.6865469217300415 epoch: 3 step: 108, loss is 1.830143690109253 epoch: 3 step: 109, loss is 1.7891284227371216 epoch: 3 step: 110, loss is 1.7002507448196411 epoch: 3 step: 111, loss is 1.76425302028656 epoch: 3 step: 112, loss is 1.7251293659210205 epoch: 3 step: 113, loss is 1.7241863012313843 epoch: 3 step: 114, loss is 1.758310079574585 epoch: 3 step: 115, loss is 1.6906688213348389 epoch: 3 step: 116, loss is 1.6078766584396362 epoch: 3 step: 117, loss is 1.7742276191711426 epoch: 3 step: 118, loss is 1.7505218982696533 epoch: 3 step: 119, loss is 1.8065091371536255 epoch: 3 step: 120, loss is 1.77422297000885 epoch: 3 step: 121, loss is 1.6769416332244873 epoch: 3 step: 122, loss is 1.6930099725723267 epoch: 3 step: 123, loss is 1.721684217453003 epoch: 3 step: 124, loss is 1.8611409664154053 epoch: 3 step: 125, loss is 1.7026392221450806 epoch: 3 step: 126, loss is 1.73283851146698 epoch: 3 step: 127, loss is 1.6186188459396362 epoch: 3 step: 128, loss is 1.7966184616088867 epoch: 3 step: 129, loss is 1.65543794631958 epoch: 3 step: 130, loss is 1.7405024766921997 epoch: 3 step: 131, loss is 1.8584294319152832 epoch: 3 step: 132, loss is 1.7835478782653809 epoch: 3 step: 133, loss is 1.8870224952697754 epoch: 3 step: 134, loss is 1.8632557392120361 epoch: 3 step: 135, loss is 1.8035264015197754 epoch: 3 step: 136, loss is 1.7724859714508057 epoch: 3 step: 137, loss is 1.7620857954025269 epoch: 3 step: 138, loss is 1.7977924346923828 epoch: 3 step: 139, loss is 1.7587790489196777 epoch: 3 step: 140, loss is 1.781812310218811 epoch: 3 step: 141, loss is 1.843968391418457 epoch: 3 step: 142, loss is 1.7268784046173096 epoch: 3 step: 143, loss is 1.6952234506607056 epoch: 3 step: 144, loss is 1.6672649383544922 epoch: 3 step: 145, loss is 1.7690939903259277 epoch: 3 step: 146, loss is 1.8385887145996094 epoch: 3 step: 147, loss is 1.705284833908081 epoch: 3 step: 148, loss is 1.669526219367981 epoch: 3 step: 149, loss is 1.7944085597991943 epoch: 3 step: 150, loss is 1.7651945352554321 epoch: 3 step: 151, loss is 1.6766066551208496 epoch: 3 step: 152, loss is 1.5969195365905762 epoch: 3 step: 153, loss is 1.6448216438293457 epoch: 3 step: 154, loss is 1.751434564590454 epoch: 3 step: 155, loss is 1.7247257232666016 epoch: 3 step: 156, loss is 1.7154661417007446 epoch: 3 step: 157, loss is 1.7714577913284302 epoch: 3 step: 158, loss is 1.7265665531158447 epoch: 3 step: 159, loss is 1.7543656826019287 epoch: 3 step: 160, loss is 1.8084063529968262 epoch: 3 step: 161, loss is 1.705984115600586 epoch: 3 step: 162, loss is 1.7250216007232666 epoch: 3 step: 163, loss is 1.6781470775604248 epoch: 3 step: 164, loss is 1.7092280387878418 epoch: 3 step: 165, loss is 1.6585453748703003 epoch: 3 step: 166, loss is 1.8353822231292725 epoch: 3 step: 167, loss is 1.827893853187561 epoch: 3 step: 168, loss is 1.672469139099121 epoch: 3 step: 169, loss is 1.7827669382095337 epoch: 3 step: 170, loss is 1.6130626201629639 epoch: 3 step: 171, loss is 1.6939573287963867 epoch: 3 step: 172, loss is 1.724724531173706 epoch: 3 step: 173, loss is 1.8338401317596436 epoch: 3 step: 174, loss is 1.7745299339294434 epoch: 3 step: 175, loss is 1.8342199325561523 epoch: 3 step: 176, loss is 1.8572815656661987 epoch: 3 step: 177, loss is 1.6647486686706543 epoch: 3 step: 178, loss is 1.7664051055908203 epoch: 3 step: 179, loss is 1.7424143552780151 epoch: 3 step: 180, loss is 1.7544529438018799 epoch: 3 step: 181, loss is 1.7419337034225464 epoch: 3 step: 182, loss is 1.7064985036849976 epoch: 3 step: 183, loss is 1.7526941299438477 epoch: 3 step: 184, loss is 1.6765990257263184 epoch: 3 step: 185, loss is 1.7984998226165771 epoch: 3 step: 186, loss is 1.8013088703155518 epoch: 3 step: 187, loss is 1.8446683883666992 epoch: 3 step: 188, loss is 1.695939302444458 epoch: 3 step: 189, loss is 1.5769917964935303 epoch: 3 step: 190, loss is 1.765929937362671 epoch: 3 step: 191, loss is 1.760894775390625 epoch: 3 step: 192, loss is 1.765155553817749 epoch: 3 step: 193, loss is 1.6571260690689087 epoch: 3 step: 194, loss is 1.782891035079956 epoch: 3 step: 195, loss is 1.822501301765442 epoch: 3 step: 196, loss is 1.685598611831665 epoch: 3 step: 197, loss is 1.7144808769226074 epoch: 3 step: 198, loss is 1.7803773880004883 epoch: 3 step: 199, loss is 1.6685271263122559 epoch: 3 step: 200, loss is 1.7109404802322388 epoch: 3 step: 201, loss is 1.7236040830612183 epoch: 3 step: 202, loss is 1.6281933784484863 epoch: 3 step: 203, loss is 1.6821147203445435 epoch: 3 step: 204, loss is 1.6687819957733154 epoch: 3 step: 205, loss is 1.7789862155914307 epoch: 3 step: 206, loss is 1.769896149635315 epoch: 3 step: 207, loss is 1.823211908340454 epoch: 3 step: 208, loss is 1.6506959199905396 epoch: 3 step: 209, loss is 1.5910999774932861 epoch: 3 step: 210, loss is 1.636285662651062 epoch: 3 step: 211, loss is 1.734756588935852 epoch: 3 step: 212, loss is 1.6787893772125244 epoch: 3 step: 213, loss is 1.8897600173950195 epoch: 3 step: 214, loss is 1.7878351211547852 epoch: 3 step: 215, loss is 1.809739589691162 epoch: 3 step: 216, loss is 1.618238925933838 epoch: 3 step: 217, loss is 1.835951328277588 epoch: 3 step: 218, loss is 1.7431846857070923 epoch: 3 step: 219, loss is 1.777294635772705 epoch: 3 step: 220, loss is 1.7508474588394165 epoch: 3 step: 221, loss is 1.7766541242599487 epoch: 3 step: 222, loss is 1.7049247026443481 epoch: 3 step: 223, loss is 1.681959867477417 epoch: 3 step: 224, loss is 1.8347787857055664 epoch: 3 step: 225, loss is 1.7920269966125488 epoch: 3 step: 226, loss is 1.6991766691207886 epoch: 3 step: 227, loss is 1.6482125520706177 epoch: 3 step: 228, loss is 1.7189157009124756 epoch: 3 step: 229, loss is 1.6653929948806763 epoch: 3 step: 230, loss is 1.6467382907867432 epoch: 3 step: 231, loss is 1.7463375329971313 epoch: 3 step: 232, loss is 1.7395448684692383 epoch: 3 step: 233, loss is 1.5935211181640625 epoch: 3 step: 234, loss is 1.6529710292816162 epoch: 3 step: 235, loss is 1.6924574375152588 epoch: 3 step: 236, loss is 1.782355785369873 epoch: 3 step: 237, loss is 1.7127463817596436 epoch: 3 step: 238, loss is 1.688645362854004 epoch: 3 step: 239, loss is 1.7146425247192383 epoch: 3 step: 240, loss is 1.7599706649780273 epoch: 3 step: 241, loss is 1.804896593093872 epoch: 3 step: 242, loss is 1.704947829246521 epoch: 3 step: 243, loss is 1.6239776611328125 epoch: 3 step: 244, loss is 1.700961947441101 epoch: 3 step: 245, loss is 1.6744734048843384 epoch: 3 step: 246, loss is 1.7533576488494873 epoch: 3 step: 247, loss is 1.7620048522949219 epoch: 3 step: 248, loss is 1.6602070331573486 epoch: 3 step: 249, loss is 1.7579090595245361 epoch: 3 step: 250, loss is 1.8472994565963745 epoch: 3 step: 251, loss is 1.7926963567733765 epoch: 3 step: 252, loss is 1.6725696325302124 epoch: 3 step: 253, loss is 1.7283315658569336 epoch: 3 step: 254, loss is 1.6876342296600342 epoch: 3 step: 255, loss is 1.685721755027771 epoch: 3 step: 256, loss is 1.781528115272522 epoch: 3 step: 257, loss is 1.6120579242706299 epoch: 3 step: 258, loss is 1.741850733757019 epoch: 3 step: 259, loss is 1.7349295616149902 epoch: 3 step: 260, loss is 1.6147620677947998 epoch: 3 step: 261, loss is 1.7662999629974365 epoch: 3 step: 262, loss is 1.570710301399231 epoch: 3 step: 263, loss is 1.7080087661743164 epoch: 3 step: 264, loss is 1.7875734567642212 epoch: 3 step: 265, loss is 1.7477959394454956 epoch: 3 step: 266, loss is 1.6720731258392334 epoch: 3 step: 267, loss is 1.6743179559707642 epoch: 3 step: 268, loss is 1.7317537069320679 epoch: 3 step: 269, loss is 1.6166419982910156 epoch: 3 step: 270, loss is 1.703937292098999 epoch: 3 step: 271, loss is 1.7867095470428467 epoch: 3 step: 272, loss is 1.6708333492279053 epoch: 3 step: 273, loss is 1.6910622119903564 epoch: 3 step: 274, loss is 1.8313789367675781 epoch: 3 step: 275, loss is 1.734154462814331 epoch: 3 step: 276, loss is 1.5862051248550415 epoch: 3 step: 277, loss is 1.842326283454895 epoch: 3 step: 278, loss is 1.7333091497421265 epoch: 3 step: 279, loss is 1.6914715766906738 epoch: 3 step: 280, loss is 1.7590086460113525 epoch: 3 step: 281, loss is 1.782589316368103 epoch: 3 step: 282, loss is 1.671002745628357 epoch: 3 step: 283, loss is 1.7889219522476196 epoch: 3 step: 284, loss is 1.7865937948226929 epoch: 3 step: 285, loss is 1.840965986251831 epoch: 3 step: 286, loss is 1.6704823970794678 epoch: 3 step: 287, loss is 1.7697992324829102 epoch: 3 step: 288, loss is 1.796462893486023 epoch: 3 step: 289, loss is 1.6941657066345215 epoch: 3 step: 290, loss is 1.7072052955627441 epoch: 3 step: 291, loss is 1.7631559371948242 epoch: 3 step: 292, loss is 1.766723394393921 epoch: 3 step: 293, loss is 1.8078240156173706 epoch: 3 step: 294, loss is 1.7114737033843994 epoch: 3 step: 295, loss is 1.6542181968688965 epoch: 3 step: 296, loss is 1.7234742641448975 epoch: 3 step: 297, loss is 1.753818154335022 epoch: 3 step: 298, loss is 1.6865930557250977 epoch: 3 step: 299, loss is 1.7363752126693726 epoch: 3 step: 300, loss is 1.5910722017288208 epoch: 3 step: 301, loss is 1.6719822883605957 epoch: 3 step: 302, loss is 1.6721827983856201 epoch: 3 step: 303, loss is 1.6900347471237183 epoch: 3 step: 304, loss is 1.716599702835083 epoch: 3 step: 305, loss is 1.6665544509887695 epoch: 3 step: 306, loss is 1.6840133666992188 epoch: 3 step: 307, loss is 1.6914595365524292 epoch: 3 step: 308, loss is 1.6977345943450928 epoch: 3 step: 309, loss is 1.7174962759017944 epoch: 3 step: 310, loss is 1.7119249105453491 epoch: 3 step: 311, loss is 1.7550396919250488 epoch: 3 step: 312, loss is 1.6406378746032715 epoch: 3 step: 313, loss is 1.6643996238708496 epoch: 3 step: 314, loss is 1.6515495777130127 epoch: 3 step: 315, loss is 1.7749066352844238 epoch: 3 step: 316, loss is 1.6713601350784302 epoch: 3 step: 317, loss is 1.6499664783477783 epoch: 3 step: 318, loss is 1.6136360168457031 epoch: 3 step: 319, loss is 1.635528564453125 epoch: 3 step: 320, loss is 1.7853888273239136 epoch: 3 step: 321, loss is 1.6467822790145874 epoch: 3 step: 322, loss is 1.61776864528656 epoch: 3 step: 323, loss is 1.6731338500976562 epoch: 3 step: 324, loss is 1.6838327646255493 epoch: 3 step: 325, loss is 1.5910794734954834 epoch: 3 step: 326, loss is 1.6860685348510742 epoch: 3 step: 327, loss is 1.6163591146469116 epoch: 3 step: 328, loss is 1.8347625732421875 epoch: 3 step: 329, loss is 1.6064834594726562 epoch: 3 step: 330, loss is 1.7510273456573486 epoch: 3 step: 331, loss is 1.7274161577224731 epoch: 3 step: 332, loss is 1.8178198337554932 epoch: 3 step: 333, loss is 1.7871026992797852 epoch: 3 step: 334, loss is 1.6706526279449463 epoch: 3 step: 335, loss is 1.6998460292816162 epoch: 3 step: 336, loss is 1.7630598545074463 epoch: 3 step: 337, loss is 1.6557632684707642 epoch: 3 step: 338, loss is 1.7528998851776123 epoch: 3 step: 339, loss is 1.7049301862716675 epoch: 3 step: 340, loss is 1.594982385635376 epoch: 3 step: 341, loss is 1.772371530532837 epoch: 3 step: 342, loss is 1.685744047164917 epoch: 3 step: 343, loss is 1.731490969657898 epoch: 3 step: 344, loss is 1.6554176807403564 epoch: 3 step: 345, loss is 1.8010971546173096 epoch: 3 step: 346, loss is 1.5836352109909058 epoch: 3 step: 347, loss is 1.7499160766601562 epoch: 3 step: 348, loss is 1.6829849481582642 epoch: 3 step: 349, loss is 1.5817031860351562 epoch: 3 step: 350, loss is 1.703901767730713 epoch: 3 step: 351, loss is 1.8326001167297363 epoch: 3 step: 352, loss is 1.7085936069488525 epoch: 3 step: 353, loss is 1.6891402006149292 epoch: 3 step: 354, loss is 1.7434598207473755 epoch: 3 step: 355, loss is 1.677656888961792 epoch: 3 step: 356, loss is 1.8194599151611328 epoch: 3 step: 357, loss is 1.6406285762786865 epoch: 3 step: 358, loss is 1.6195244789123535 epoch: 3 step: 359, loss is 1.5961867570877075 epoch: 3 step: 360, loss is 1.5857040882110596 epoch: 3 step: 361, loss is 1.6756489276885986 epoch: 3 step: 362, loss is 1.6826366186141968 epoch: 3 step: 363, loss is 1.695976734161377 epoch: 3 step: 364, loss is 1.7368282079696655 epoch: 3 step: 365, loss is 1.6196650266647339 epoch: 3 step: 366, loss is 1.6286401748657227 epoch: 3 step: 367, loss is 1.5430071353912354 epoch: 3 step: 368, loss is 1.5972182750701904 epoch: 3 step: 369, loss is 1.7685972452163696 epoch: 3 step: 370, loss is 1.7089072465896606 epoch: 3 step: 371, loss is 1.6556273698806763 epoch: 3 step: 372, loss is 1.637276291847229 epoch: 3 step: 373, loss is 1.7345547676086426 epoch: 3 step: 374, loss is 1.701845407485962 epoch: 3 step: 375, loss is 1.7591440677642822 epoch: 3 step: 376, loss is 1.6914732456207275 epoch: 3 step: 377, loss is 1.7411625385284424 epoch: 3 step: 378, loss is 1.6531426906585693 epoch: 3 step: 379, loss is 1.688938856124878 epoch: 3 step: 380, loss is 1.7634522914886475 epoch: 3 step: 381, loss is 1.7102078199386597 epoch: 3 step: 382, loss is 1.653367280960083 epoch: 3 step: 383, loss is 1.7889350652694702 epoch: 3 step: 384, loss is 1.77799391746521 epoch: 3 step: 385, loss is 1.6914746761322021 epoch: 3 step: 386, loss is 1.7278087139129639 epoch: 3 step: 387, loss is 1.7500312328338623 epoch: 3 step: 388, loss is 1.5245214700698853 epoch: 3 step: 389, loss is 1.574200987815857 epoch: 3 step: 390, loss is 1.750159502029419 Train epoch time: 145058.088 ms, per step time: 371.944 ms epoch: 4 step: 1, loss is 1.6695165634155273 epoch: 4 step: 2, loss is 1.7273000478744507 epoch: 4 step: 3, loss is 1.7329519987106323 epoch: 4 step: 4, loss is 1.7230921983718872 epoch: 4 step: 5, loss is 1.5585060119628906 epoch: 4 step: 6, loss is 1.645129680633545 epoch: 4 step: 7, loss is 1.8048557043075562 epoch: 4 step: 8, loss is 1.669471263885498 epoch: 4 step: 9, loss is 1.7026921510696411 epoch: 4 step: 10, loss is 1.7005460262298584 epoch: 4 step: 11, loss is 1.570026159286499 epoch: 4 step: 12, loss is 1.6120418310165405 epoch: 4 step: 13, loss is 1.7042216062545776 epoch: 4 step: 14, loss is 1.6466180086135864 epoch: 4 step: 15, loss is 1.656440019607544 epoch: 4 step: 16, loss is 1.638288974761963 epoch: 4 step: 17, loss is 1.7356566190719604 epoch: 4 step: 18, loss is 1.6732922792434692 epoch: 4 step: 19, loss is 1.8225958347320557 epoch: 4 step: 20, loss is 1.6666510105133057 epoch: 4 step: 21, loss is 1.7109061479568481 epoch: 4 step: 22, loss is 1.6872361898422241 epoch: 4 step: 23, loss is 1.656557559967041 epoch: 4 step: 24, loss is 1.7234022617340088 epoch: 4 step: 25, loss is 1.6205995082855225 epoch: 4 step: 26, loss is 1.791795015335083 epoch: 4 step: 27, loss is 1.8158366680145264 epoch: 4 step: 28, loss is 1.72850501537323 epoch: 4 step: 29, loss is 1.7563163042068481 epoch: 4 step: 30, loss is 1.7978169918060303 epoch: 4 step: 31, loss is 1.642251968383789 epoch: 4 step: 32, loss is 1.7088783979415894 epoch: 4 step: 33, loss is 1.7903128862380981 epoch: 4 step: 34, loss is 1.6428248882293701 epoch: 4 step: 35, loss is 1.5747734308242798 epoch: 4 step: 36, loss is 1.730666160583496 epoch: 4 step: 37, loss is 1.752526879310608 epoch: 4 step: 38, loss is 1.5840187072753906 epoch: 4 step: 39, loss is 1.6418838500976562 epoch: 4 step: 40, loss is 1.6454088687896729 epoch: 4 step: 41, loss is 1.519988775253296 epoch: 4 step: 42, loss is 1.667872667312622 epoch: 4 step: 43, loss is 1.6414804458618164 epoch: 4 step: 44, loss is 1.5913159847259521 epoch: 4 step: 45, loss is 1.6626425981521606 epoch: 4 step: 46, loss is 1.6450990438461304 epoch: 4 step: 47, loss is 1.647599697113037 epoch: 4 step: 48, loss is 1.5927698612213135 epoch: 4 step: 49, loss is 1.7867192029953003 epoch: 4 step: 50, loss is 1.539243459701538 epoch: 4 step: 51, loss is 1.6675819158554077 epoch: 4 step: 52, loss is 1.6236523389816284 epoch: 4 step: 53, loss is 1.6589174270629883 epoch: 4 step: 54, loss is 1.6060881614685059 epoch: 4 step: 55, loss is 1.6542298793792725 epoch: 4 step: 56, loss is 1.771117925643921 epoch: 4 step: 57, loss is 1.736961007118225 epoch: 4 step: 58, loss is 1.6171984672546387 epoch: 4 step: 59, loss is 1.8743512630462646 epoch: 4 step: 60, loss is 1.6800825595855713 epoch: 4 step: 61, loss is 1.666163682937622 epoch: 4 step: 62, loss is 1.6979151964187622 epoch: 4 step: 63, loss is 1.6055777072906494 epoch: 4 step: 64, loss is 1.5906095504760742 epoch: 4 step: 65, loss is 1.757987380027771 epoch: 4 step: 66, loss is 1.6081726551055908 epoch: 4 step: 67, loss is 1.676893949508667 epoch: 4 step: 68, loss is 1.7651326656341553 epoch: 4 step: 69, loss is 1.8206675052642822 epoch: 4 step: 70, loss is 1.661590337753296 epoch: 4 step: 71, loss is 1.6335906982421875 epoch: 4 step: 72, loss is 1.764451026916504 epoch: 4 step: 73, loss is 1.7256431579589844 epoch: 4 step: 74, loss is 1.6364463567733765 epoch: 4 step: 75, loss is 1.7188787460327148 epoch: 4 step: 76, loss is 1.6673758029937744 epoch: 4 step: 77, loss is 1.664868950843811 epoch: 4 step: 78, loss is 1.6314384937286377 epoch: 4 step: 79, loss is 1.763289451599121 epoch: 4 step: 80, loss is 1.6337754726409912 epoch: 4 step: 81, loss is 1.6575185060501099 epoch: 4 step: 82, loss is 1.622018575668335 epoch: 4 step: 83, loss is 1.6691434383392334 epoch: 4 step: 84, loss is 1.6801186800003052 epoch: 4 step: 85, loss is 1.7004921436309814 epoch: 4 step: 86, loss is 1.5824577808380127 epoch: 4 step: 87, loss is 1.6731836795806885 epoch: 4 step: 88, loss is 1.7173842191696167 epoch: 4 step: 89, loss is 1.6667479276657104 epoch: 4 step: 90, loss is 1.6506376266479492 epoch: 4 step: 91, loss is 1.6161320209503174 epoch: 4 step: 92, loss is 1.72096848487854 epoch: 4 step: 93, loss is 1.6653797626495361 epoch: 4 step: 94, loss is 1.5721209049224854 epoch: 4 step: 95, loss is 1.8106515407562256 epoch: 4 step: 96, loss is 1.631089448928833 epoch: 4 step: 97, loss is 1.6779348850250244 epoch: 4 step: 98, loss is 1.738546371459961 epoch: 4 step: 99, loss is 1.5996187925338745 epoch: 4 step: 100, loss is 1.6185498237609863 epoch: 4 step: 101, loss is 1.7036750316619873 epoch: 4 step: 102, loss is 1.6072015762329102 epoch: 4 step: 103, loss is 1.5683672428131104 epoch: 4 step: 104, loss is 1.573211431503296 epoch: 4 step: 105, loss is 1.6568917036056519 epoch: 4 step: 106, loss is 1.6942692995071411 epoch: 4 step: 107, loss is 1.701817274093628 epoch: 4 step: 108, loss is 1.6250083446502686 epoch: 4 step: 109, loss is 1.6087796688079834 epoch: 4 step: 110, loss is 1.7560656070709229 epoch: 4 step: 111, loss is 1.6171553134918213 epoch: 4 step: 112, loss is 1.53173828125 epoch: 4 step: 113, loss is 1.6244738101959229 epoch: 4 step: 114, loss is 1.6011207103729248 epoch: 4 step: 115, loss is 1.5669540166854858 epoch: 4 step: 116, loss is 1.7150381803512573 epoch: 4 step: 117, loss is 1.6521196365356445 epoch: 4 step: 118, loss is 1.6225037574768066 epoch: 4 step: 119, loss is 1.683433175086975 epoch: 4 step: 120, loss is 1.6779825687408447 epoch: 4 step: 121, loss is 1.7159409523010254 epoch: 4 step: 122, loss is 1.6259791851043701 epoch: 4 step: 123, loss is 1.7155523300170898 epoch: 4 step: 124, loss is 1.60838782787323 epoch: 4 step: 125, loss is 1.6821777820587158 epoch: 4 step: 126, loss is 1.6594120264053345 epoch: 4 step: 127, loss is 1.6180691719055176 epoch: 4 step: 128, loss is 1.6326568126678467 epoch: 4 step: 129, loss is 1.5710145235061646 epoch: 4 step: 130, loss is 1.600823163986206 epoch: 4 step: 131, loss is 1.630587100982666 epoch: 4 step: 132, loss is 1.767482042312622 epoch: 4 step: 133, loss is 1.540154218673706 epoch: 4 step: 134, loss is 1.6062562465667725 epoch: 4 step: 135, loss is 1.6783620119094849 epoch: 4 step: 136, loss is 1.7135610580444336 epoch: 4 step: 137, loss is 1.4579838514328003 epoch: 4 step: 138, loss is 1.6518511772155762 epoch: 4 step: 139, loss is 1.7175908088684082 epoch: 4 step: 140, loss is 1.7394475936889648 epoch: 4 step: 141, loss is 1.5693387985229492 epoch: 4 step: 142, loss is 1.6008765697479248 epoch: 4 step: 143, loss is 1.5412213802337646 epoch: 4 step: 144, loss is 1.6418453454971313 epoch: 4 step: 145, loss is 1.4967046976089478 epoch: 4 step: 146, loss is 1.7385708093643188 epoch: 4 step: 147, loss is 1.7062017917633057 epoch: 4 step: 148, loss is 1.617221474647522 epoch: 4 step: 149, loss is 1.6853370666503906 epoch: 4 step: 150, loss is 1.5452866554260254 epoch: 4 step: 151, loss is 1.755713939666748 epoch: 4 step: 152, loss is 1.7040354013442993 epoch: 4 step: 153, loss is 1.617229700088501 epoch: 4 step: 154, loss is 1.7074425220489502 epoch: 4 step: 155, loss is 1.7260022163391113 epoch: 4 step: 156, loss is 1.6384129524230957 epoch: 4 step: 157, loss is 1.6073106527328491 epoch: 4 step: 158, loss is 1.5754611492156982 epoch: 4 step: 159, loss is 1.715348243713379 epoch: 4 step: 160, loss is 1.7685743570327759 epoch: 4 step: 161, loss is 1.6414381265640259 epoch: 4 step: 162, loss is 1.7249082326889038 epoch: 4 step: 163, loss is 1.6838865280151367 epoch: 4 step: 164, loss is 1.7254281044006348 epoch: 4 step: 165, loss is 1.6781275272369385 epoch: 4 step: 166, loss is 1.702733039855957 epoch: 4 step: 167, loss is 1.6596628427505493 epoch: 4 step: 168, loss is 1.610282301902771 epoch: 4 step: 169, loss is 1.6470028162002563 epoch: 4 step: 170, loss is 1.611755132675171 epoch: 4 step: 171, loss is 1.6413168907165527 epoch: 4 step: 172, loss is 1.8116881847381592 epoch: 4 step: 173, loss is 1.6114356517791748 epoch: 4 step: 174, loss is 1.7080440521240234 epoch: 4 step: 175, loss is 1.7400500774383545 epoch: 4 step: 176, loss is 1.6361223459243774 epoch: 4 step: 177, loss is 1.6080799102783203 epoch: 4 step: 178, loss is 1.7389817237854004 epoch: 4 step: 179, loss is 1.6394613981246948 epoch: 4 step: 180, loss is 1.6609821319580078 epoch: 4 step: 181, loss is 1.7083775997161865 epoch: 4 step: 182, loss is 1.6260814666748047 epoch: 4 step: 183, loss is 1.670709490776062 epoch: 4 step: 184, loss is 1.733322024345398 epoch: 4 step: 185, loss is 1.6730746030807495 epoch: 4 step: 186, loss is 1.7290054559707642 epoch: 4 step: 187, loss is 1.6578032970428467 epoch: 4 step: 188, loss is 1.607759714126587 epoch: 4 step: 189, loss is 1.6292657852172852 epoch: 4 step: 190, loss is 1.6871564388275146 epoch: 4 step: 191, loss is 1.6059832572937012 epoch: 4 step: 192, loss is 1.673574686050415 epoch: 4 step: 193, loss is 1.6916306018829346 epoch: 4 step: 194, loss is 1.6836265325546265 epoch: 4 step: 195, loss is 1.7154841423034668 epoch: 4 step: 196, loss is 1.7032926082611084 epoch: 4 step: 197, loss is 1.5874526500701904 epoch: 4 step: 198, loss is 1.5883681774139404 epoch: 4 step: 199, loss is 1.636406660079956 epoch: 4 step: 200, loss is 1.6678447723388672 epoch: 4 step: 201, loss is 1.506903886795044 epoch: 4 step: 202, loss is 1.694869041442871 epoch: 4 step: 203, loss is 1.7244055271148682 epoch: 4 step: 204, loss is 1.6359894275665283 epoch: 4 step: 205, loss is 1.63210928440094 epoch: 4 step: 206, loss is 1.6406002044677734 epoch: 4 step: 207, loss is 1.6306772232055664 epoch: 4 step: 208, loss is 1.6416488885879517 epoch: 4 step: 209, loss is 1.7348030805587769 epoch: 4 step: 210, loss is 1.7382646799087524 epoch: 4 step: 211, loss is 1.6398556232452393 epoch: 4 step: 212, loss is 1.7287929058074951 epoch: 4 step: 213, loss is 1.5922458171844482 epoch: 4 step: 214, loss is 1.7324360609054565 epoch: 4 step: 215, loss is 1.6540369987487793 epoch: 4 step: 216, loss is 1.6112148761749268 epoch: 4 step: 217, loss is 1.6772197484970093 epoch: 4 step: 218, loss is 1.848122477531433 epoch: 4 step: 219, loss is 1.764024257659912 epoch: 4 step: 220, loss is 1.563981533050537 epoch: 4 step: 221, loss is 1.5943589210510254 epoch: 4 step: 222, loss is 1.6494523286819458 epoch: 4 step: 223, loss is 1.6289910078048706 epoch: 4 step: 224, loss is 1.7430896759033203 epoch: 4 step: 225, loss is 1.6236622333526611 epoch: 4 step: 226, loss is 1.6899158954620361 epoch: 4 step: 227, loss is 1.6302130222320557 epoch: 4 step: 228, loss is 1.6347051858901978 epoch: 4 step: 229, loss is 1.5936694145202637 epoch: 4 step: 230, loss is 1.699882984161377 epoch: 4 step: 231, loss is 1.6149802207946777 epoch: 4 step: 232, loss is 1.6307373046875 epoch: 4 step: 233, loss is 1.660753846168518 epoch: 4 step: 234, loss is 1.6779448986053467 epoch: 4 step: 235, loss is 1.7419378757476807 epoch: 4 step: 236, loss is 1.646225094795227 epoch: 4 step: 237, loss is 1.566084623336792 epoch: 4 step: 238, loss is 1.611794114112854 epoch: 4 step: 239, loss is 1.6340889930725098 epoch: 4 step: 240, loss is 1.735528826713562 epoch: 4 step: 241, loss is 1.6432706117630005 epoch: 4 step: 242, loss is 1.6555874347686768 epoch: 4 step: 243, loss is 1.700577974319458 epoch: 4 step: 244, loss is 1.5956780910491943 epoch: 4 step: 245, loss is 1.6119542121887207 epoch: 4 step: 246, loss is 1.6775035858154297 epoch: 4 step: 247, loss is 1.6030528545379639 epoch: 4 step: 248, loss is 1.5934855937957764 epoch: 4 step: 249, loss is 1.6230729818344116 epoch: 4 step: 250, loss is 1.7012053728103638 epoch: 4 step: 251, loss is 1.5713801383972168 epoch: 4 step: 252, loss is 1.5452080965042114 epoch: 4 step: 253, loss is 1.6255402565002441 epoch: 4 step: 254, loss is 1.6258832216262817 epoch: 4 step: 255, loss is 1.6188931465148926 epoch: 4 step: 256, loss is 1.6053047180175781 epoch: 4 step: 257, loss is 1.6065144538879395 epoch: 4 step: 258, loss is 1.7241655588150024 epoch: 4 step: 259, loss is 1.6181094646453857 epoch: 4 step: 260, loss is 1.5777652263641357 epoch: 4 step: 261, loss is 1.6306815147399902 epoch: 4 step: 262, loss is 1.5533638000488281 epoch: 4 step: 263, loss is 1.6355679035186768 epoch: 4 step: 264, loss is 1.631014108657837 epoch: 4 step: 265, loss is 1.5684889554977417 epoch: 4 step: 266, loss is 1.6852974891662598 epoch: 4 step: 267, loss is 1.7343311309814453 epoch: 4 step: 268, loss is 1.6117439270019531 epoch: 4 step: 269, loss is 1.6637423038482666 epoch: 4 step: 270, loss is 1.6423864364624023 epoch: 4 step: 271, loss is 1.6112496852874756 epoch: 4 step: 272, loss is 1.6339552402496338 epoch: 4 step: 273, loss is 1.605186939239502 epoch: 4 step: 274, loss is 1.6171804666519165 epoch: 4 step: 275, loss is 1.6028566360473633 epoch: 4 step: 276, loss is 1.665071964263916 epoch: 4 step: 277, loss is 1.6269690990447998 epoch: 4 step: 278, loss is 1.5817041397094727 epoch: 4 step: 279, loss is 1.5939949750900269 epoch: 4 step: 280, loss is 1.6965526342391968 epoch: 4 step: 281, loss is 1.5738329887390137 epoch: 4 step: 282, loss is 1.6287593841552734 epoch: 4 step: 283, loss is 1.6668312549591064 epoch: 4 step: 284, loss is 1.6443724632263184 epoch: 4 step: 285, loss is 1.6746225357055664 epoch: 4 step: 286, loss is 1.5106184482574463 epoch: 4 step: 287, loss is 1.761210322380066 epoch: 4 step: 288, loss is 1.6763477325439453 epoch: 4 step: 289, loss is 1.6645381450653076 epoch: 4 step: 290, loss is 1.5973329544067383 epoch: 4 step: 291, loss is 1.5937137603759766 epoch: 4 step: 292, loss is 1.6388037204742432 epoch: 4 step: 293, loss is 1.687715768814087 epoch: 4 step: 294, loss is 1.7647030353546143 epoch: 4 step: 295, loss is 1.6563464403152466 epoch: 4 step: 296, loss is 1.6363365650177002 epoch: 4 step: 297, loss is 1.6472853422164917 epoch: 4 step: 298, loss is 1.6601572036743164 epoch: 4 step: 299, loss is 1.6924667358398438 epoch: 4 step: 300, loss is 1.6505777835845947 epoch: 4 step: 301, loss is 1.6166727542877197 epoch: 4 step: 302, loss is 1.7392905950546265 epoch: 4 step: 303, loss is 1.7075598239898682 epoch: 4 step: 304, loss is 1.6641656160354614 epoch: 4 step: 305, loss is 1.6933059692382812 epoch: 4 step: 306, loss is 1.5921763181686401 epoch: 4 step: 307, loss is 1.6138687133789062 epoch: 4 step: 308, loss is 1.711918592453003 epoch: 4 step: 309, loss is 1.5096725225448608 epoch: 4 step: 310, loss is 1.6413471698760986 epoch: 4 step: 311, loss is 1.6407179832458496 epoch: 4 step: 312, loss is 1.7054457664489746 epoch: 4 step: 313, loss is 1.6631145477294922 epoch: 4 step: 314, loss is 1.5111362934112549 epoch: 4 step: 315, loss is 1.536847472190857 epoch: 4 step: 316, loss is 1.5711177587509155 epoch: 4 step: 317, loss is 1.7044873237609863 epoch: 4 step: 318, loss is 1.621565580368042 epoch: 4 step: 319, loss is 1.5832188129425049 epoch: 4 step: 320, loss is 1.56040358543396 epoch: 4 step: 321, loss is 1.5922833681106567 epoch: 4 step: 322, loss is 1.6988639831542969 epoch: 4 step: 323, loss is 1.6180733442306519 epoch: 4 step: 324, loss is 1.682058334350586 epoch: 4 step: 325, loss is 1.5848283767700195 epoch: 4 step: 326, loss is 1.5157688856124878 epoch: 4 step: 327, loss is 1.7158801555633545 epoch: 4 step: 328, loss is 1.6353027820587158 epoch: 4 step: 329, loss is 1.879155158996582 epoch: 4 step: 330, loss is 1.5287706851959229 epoch: 4 step: 331, loss is 1.648926019668579 epoch: 4 step: 332, loss is 1.6658390760421753 epoch: 4 step: 333, loss is 1.6448297500610352 epoch: 4 step: 334, loss is 1.652221918106079 epoch: 4 step: 335, loss is 1.5456767082214355 epoch: 4 step: 336, loss is 1.5358079671859741 epoch: 4 step: 337, loss is 1.6559476852416992 epoch: 4 step: 338, loss is 1.6299281120300293 epoch: 4 step: 339, loss is 1.745874285697937 epoch: 4 step: 340, loss is 1.610398292541504 epoch: 4 step: 341, loss is 1.6859097480773926 epoch: 4 step: 342, loss is 1.6728047132492065 epoch: 4 step: 343, loss is 1.5981897115707397 epoch: 4 step: 344, loss is 1.7159134149551392 epoch: 4 step: 345, loss is 1.605245590209961 epoch: 4 step: 346, loss is 1.6806387901306152 epoch: 4 step: 347, loss is 1.538119912147522 epoch: 4 step: 348, loss is 1.586168646812439 epoch: 4 step: 349, loss is 1.6935168504714966 epoch: 4 step: 350, loss is 1.5582554340362549 epoch: 4 step: 351, loss is 1.6914149522781372 epoch: 4 step: 352, loss is 1.5586116313934326 epoch: 4 step: 353, loss is 1.6963200569152832 epoch: 4 step: 354, loss is 1.6159112453460693 epoch: 4 step: 355, loss is 1.6362673044204712 epoch: 4 step: 356, loss is 1.5766600370407104 epoch: 4 step: 357, loss is 1.6794461011886597 epoch: 4 step: 358, loss is 1.5064743757247925 epoch: 4 step: 359, loss is 1.7019294500350952 epoch: 4 step: 360, loss is 1.5504231452941895 epoch: 4 step: 361, loss is 1.6210225820541382 epoch: 4 step: 362, loss is 1.7284070253372192 epoch: 4 step: 363, loss is 1.6535985469818115 epoch: 4 step: 364, loss is 1.5663567781448364 epoch: 4 step: 365, loss is 1.7884958982467651 epoch: 4 step: 366, loss is 1.5132598876953125 epoch: 4 step: 367, loss is 1.7188857793807983 epoch: 4 step: 368, loss is 1.565704345703125 epoch: 4 step: 369, loss is 1.724658489227295 epoch: 4 step: 370, loss is 1.6204599142074585 epoch: 4 step: 371, loss is 1.6468414068222046 epoch: 4 step: 372, loss is 1.565652847290039 epoch: 4 step: 373, loss is 1.6791701316833496 epoch: 4 step: 374, loss is 1.5494015216827393 epoch: 4 step: 375, loss is 1.5870392322540283 epoch: 4 step: 376, loss is 1.688801884651184 epoch: 4 step: 377, loss is 1.6855888366699219 epoch: 4 step: 378, loss is 1.6413899660110474 epoch: 4 step: 379, loss is 1.5933438539505005 epoch: 4 step: 380, loss is 1.7512990236282349 epoch: 4 step: 381, loss is 1.5685606002807617 epoch: 4 step: 382, loss is 1.6456449031829834 epoch: 4 step: 383, loss is 1.5981110334396362 epoch: 4 step: 384, loss is 1.6993358135223389 epoch: 4 step: 385, loss is 1.6400423049926758 epoch: 4 step: 386, loss is 1.6877739429473877 epoch: 4 step: 387, loss is 1.7578630447387695 epoch: 4 step: 388, loss is 1.6703307628631592 epoch: 4 step: 389, loss is 1.64776611328125 epoch: 4 step: 390, loss is 1.6677515506744385 Train epoch time: 135002.970 ms, per step time: 346.161 ms epoch: 5 step: 1, loss is 1.6414284706115723 epoch: 5 step: 2, loss is 1.6133499145507812 epoch: 5 step: 3, loss is 1.6073421239852905 epoch: 5 step: 4, loss is 1.5657422542572021 epoch: 5 step: 5, loss is 1.5712566375732422 epoch: 5 step: 6, loss is 1.7336235046386719 epoch: 5 step: 7, loss is 1.6461542844772339 epoch: 5 step: 8, loss is 1.5539560317993164 epoch: 5 step: 9, loss is 1.6430338621139526 epoch: 5 step: 10, loss is 1.5809056758880615 epoch: 5 step: 11, loss is 1.5767440795898438 epoch: 5 step: 12, loss is 1.4443682432174683 epoch: 5 step: 13, loss is 1.7491919994354248 epoch: 5 step: 14, loss is 1.5007736682891846 epoch: 5 step: 15, loss is 1.6383237838745117 epoch: 5 step: 16, loss is 1.595729947090149 epoch: 5 step: 17, loss is 1.6138263940811157 epoch: 5 step: 18, loss is 1.582689642906189 epoch: 5 step: 19, loss is 1.5651130676269531 epoch: 5 step: 20, loss is 1.591292381286621 epoch: 5 step: 21, loss is 1.576625943183899 epoch: 5 step: 22, loss is 1.6917927265167236 epoch: 5 step: 23, loss is 1.6040232181549072 epoch: 5 step: 24, loss is 1.546478271484375 epoch: 5 step: 25, loss is 1.6349976062774658 epoch: 5 step: 26, loss is 1.6656715869903564 epoch: 5 step: 27, loss is 1.6220475435256958 epoch: 5 step: 28, loss is 1.661820650100708 epoch: 5 step: 29, loss is 1.543447494506836 epoch: 5 step: 30, loss is 1.5551376342773438 epoch: 5 step: 31, loss is 1.5874927043914795 epoch: 5 step: 32, loss is 1.66179621219635 epoch: 5 step: 33, loss is 1.6630723476409912 epoch: 5 step: 34, loss is 1.6434706449508667 epoch: 5 step: 35, loss is 1.6164604425430298 epoch: 5 step: 36, loss is 1.5964670181274414 epoch: 5 step: 37, loss is 1.5660440921783447 epoch: 5 step: 38, loss is 1.6367616653442383 epoch: 5 step: 39, loss is 1.6313118934631348 epoch: 5 step: 40, loss is 1.5402803421020508 epoch: 5 step: 41, loss is 1.6043792963027954 epoch: 5 step: 42, loss is 1.5902783870697021 epoch: 5 step: 43, loss is 1.6030004024505615 epoch: 5 step: 44, loss is 1.51835298538208 epoch: 5 step: 45, loss is 1.6403353214263916 epoch: 5 step: 46, loss is 1.5426907539367676 epoch: 5 step: 47, loss is 1.6529593467712402 epoch: 5 step: 48, loss is 1.7222962379455566 epoch: 5 step: 49, loss is 1.5422757863998413 epoch: 5 step: 50, loss is 1.6748955249786377 epoch: 5 step: 51, loss is 1.7007254362106323 epoch: 5 step: 52, loss is 1.5466275215148926 epoch: 5 step: 53, loss is 1.600034236907959 epoch: 5 step: 54, loss is 1.6911506652832031 epoch: 5 step: 55, loss is 1.5752044916152954 epoch: 5 step: 56, loss is 1.5590535402297974 epoch: 5 step: 57, loss is 1.675789713859558 epoch: 5 step: 58, loss is 1.4998986721038818 epoch: 5 step: 59, loss is 1.5317859649658203 epoch: 5 step: 60, loss is 1.6225993633270264 epoch: 5 step: 61, loss is 1.5759437084197998 epoch: 5 step: 62, loss is 1.5726115703582764 epoch: 5 step: 63, loss is 1.6666594743728638 epoch: 5 step: 64, loss is 1.636668086051941 epoch: 5 step: 65, loss is 1.46066415309906 epoch: 5 step: 66, loss is 1.6098400354385376 epoch: 5 step: 67, loss is 1.494192361831665 epoch: 5 step: 68, loss is 1.6731780767440796 epoch: 5 step: 69, loss is 1.5911043882369995 epoch: 5 step: 70, loss is 1.640868067741394 epoch: 5 step: 71, loss is 1.6505470275878906 epoch: 5 step: 72, loss is 1.5993077754974365 epoch: 5 step: 73, loss is 1.58303701877594 epoch: 5 step: 74, loss is 1.567945957183838 epoch: 5 step: 75, loss is 1.6461838483810425 epoch: 5 step: 76, loss is 1.5701122283935547 epoch: 5 step: 77, loss is 1.578521490097046 epoch: 5 step: 78, loss is 1.5445590019226074 epoch: 5 step: 79, loss is 1.6084973812103271 epoch: 5 step: 80, loss is 1.5336830615997314 epoch: 5 step: 81, loss is 1.5514973402023315 epoch: 5 step: 82, loss is 1.5627862215042114 epoch: 5 step: 83, loss is 1.4522783756256104 epoch: 5 step: 84, loss is 1.6608734130859375 epoch: 5 step: 85, loss is 1.5586647987365723 epoch: 5 step: 86, loss is 1.4971017837524414 epoch: 5 step: 87, loss is 1.6389094591140747 epoch: 5 step: 88, loss is 1.649534821510315 epoch: 5 step: 89, loss is 1.7511341571807861 epoch: 5 step: 90, loss is 1.7455153465270996 epoch: 5 step: 91, loss is 1.7708572149276733 epoch: 5 step: 92, loss is 1.633859395980835 epoch: 5 step: 93, loss is 1.5422446727752686 epoch: 5 step: 94, loss is 1.6506986618041992 epoch: 5 step: 95, loss is 1.6532657146453857 epoch: 5 step: 96, loss is 1.6974544525146484 epoch: 5 step: 97, loss is 1.5629090070724487 epoch: 5 step: 98, loss is 1.6426180601119995 epoch: 5 step: 99, loss is 1.6129649877548218 epoch: 5 step: 100, loss is 1.6500146389007568 epoch: 5 step: 101, loss is 1.6136314868927002 epoch: 5 step: 102, loss is 1.6230109930038452 epoch: 5 step: 103, loss is 1.5851149559020996 epoch: 5 step: 104, loss is 1.5509576797485352 epoch: 5 step: 105, loss is 1.648830771446228 epoch: 5 step: 106, loss is 1.5736055374145508 epoch: 5 step: 107, loss is 1.563952088356018 epoch: 5 step: 108, loss is 1.6155402660369873 epoch: 5 step: 109, loss is 1.601898431777954 epoch: 5 step: 110, loss is 1.6058074235916138 epoch: 5 step: 111, loss is 1.6439316272735596 epoch: 5 step: 112, loss is 1.7084676027297974 epoch: 5 step: 113, loss is 1.5610415935516357 epoch: 5 step: 114, loss is 1.6334338188171387 epoch: 5 step: 115, loss is 1.5861258506774902 epoch: 5 step: 116, loss is 1.4557653665542603 epoch: 5 step: 117, loss is 1.6719934940338135 epoch: 5 step: 118, loss is 1.7527364492416382 epoch: 5 step: 119, loss is 1.5504150390625 epoch: 5 step: 120, loss is 1.5088813304901123 epoch: 5 step: 121, loss is 1.5114814043045044 epoch: 5 step: 122, loss is 1.564155101776123 epoch: 5 step: 123, loss is 1.6420687437057495 epoch: 5 step: 124, loss is 1.721785306930542 epoch: 5 step: 125, loss is 1.6231032609939575 epoch: 5 step: 126, loss is 1.6007492542266846 epoch: 5 step: 127, loss is 1.5860202312469482 epoch: 5 step: 128, loss is 1.6081591844558716 epoch: 5 step: 129, loss is 1.4897911548614502 epoch: 5 step: 130, loss is 1.6453657150268555 epoch: 5 step: 131, loss is 1.7042666673660278 epoch: 5 step: 132, loss is 1.6355501413345337 epoch: 5 step: 133, loss is 1.6283111572265625 epoch: 5 step: 134, loss is 1.5827581882476807 epoch: 5 step: 135, loss is 1.590122938156128 epoch: 5 step: 136, loss is 1.5546517372131348 epoch: 5 step: 137, loss is 1.451050877571106 epoch: 5 step: 138, loss is 1.8012257814407349 epoch: 5 step: 139, loss is 1.6254920959472656 epoch: 5 step: 140, loss is 1.5159741640090942 epoch: 5 step: 141, loss is 1.5933189392089844 epoch: 5 step: 142, loss is 1.728009819984436 epoch: 5 step: 143, loss is 1.5668749809265137 epoch: 5 step: 144, loss is 1.7010149955749512 epoch: 5 step: 145, loss is 1.6198683977127075 epoch: 5 step: 146, loss is 1.6176905632019043 epoch: 5 step: 147, loss is 1.5649861097335815 epoch: 5 step: 148, loss is 1.627281665802002 epoch: 5 step: 149, loss is 1.417607069015503 epoch: 5 step: 150, loss is 1.7324036359786987 epoch: 5 step: 151, loss is 1.648249864578247 epoch: 5 step: 152, loss is 1.5773850679397583 epoch: 5 step: 153, loss is 1.5680890083312988 epoch: 5 step: 154, loss is 1.5867016315460205 epoch: 5 step: 155, loss is 1.630353569984436 epoch: 5 step: 156, loss is 1.6103355884552002 epoch: 5 step: 157, loss is 1.620666265487671 epoch: 5 step: 158, loss is 1.5707900524139404 epoch: 5 step: 159, loss is 1.5001070499420166 epoch: 5 step: 160, loss is 1.52289617061615 epoch: 5 step: 161, loss is 1.5643471479415894 epoch: 5 step: 162, loss is 1.5430119037628174 epoch: 5 step: 163, loss is 1.579246163368225 epoch: 5 step: 164, loss is 1.6190059185028076 epoch: 5 step: 165, loss is 1.5879192352294922 epoch: 5 step: 166, loss is 1.6042240858078003 epoch: 5 step: 167, loss is 1.5793931484222412 epoch: 5 step: 168, loss is 1.5850214958190918 epoch: 5 step: 169, loss is 1.4722135066986084 epoch: 5 step: 170, loss is 1.4814708232879639 epoch: 5 step: 171, loss is 1.6623531579971313 epoch: 5 step: 172, loss is 1.7225098609924316 epoch: 5 step: 173, loss is 1.5727636814117432 epoch: 5 step: 174, loss is 1.5415968894958496 epoch: 5 step: 175, loss is 1.5758564472198486 epoch: 5 step: 176, loss is 1.7106969356536865 epoch: 5 step: 177, loss is 1.4958528280258179 epoch: 5 step: 178, loss is 1.6774400472640991 epoch: 5 step: 179, loss is 1.69835364818573 epoch: 5 step: 180, loss is 1.6368367671966553 epoch: 5 step: 181, loss is 1.691902756690979 epoch: 5 step: 182, loss is 1.621715784072876 epoch: 5 step: 183, loss is 1.668091058731079 epoch: 5 step: 184, loss is 1.5842854976654053 epoch: 5 step: 185, loss is 1.6971570253372192 epoch: 5 step: 186, loss is 1.6501247882843018 epoch: 5 step: 187, loss is 1.6283658742904663 epoch: 5 step: 188, loss is 1.61997652053833 epoch: 5 step: 189, loss is 1.706433653831482 epoch: 5 step: 190, loss is 1.5875580310821533 epoch: 5 step: 191, loss is 1.585848093032837 epoch: 5 step: 192, loss is 1.6266297101974487 epoch: 5 step: 193, loss is 1.6387460231781006 epoch: 5 step: 194, loss is 1.648606538772583 epoch: 5 step: 195, loss is 1.5681453943252563 epoch: 5 step: 196, loss is 1.6784400939941406 epoch: 5 step: 197, loss is 1.6807732582092285 epoch: 5 step: 198, loss is 1.5594044923782349 epoch: 5 step: 199, loss is 1.6462832689285278 epoch: 5 step: 200, loss is 1.5839227437973022 epoch: 5 step: 201, loss is 1.704714298248291 epoch: 5 step: 202, loss is 1.539453387260437 epoch: 5 step: 203, loss is 1.6192526817321777 epoch: 5 step: 204, loss is 1.714280366897583 epoch: 5 step: 205, loss is 1.6327862739562988 epoch: 5 step: 206, loss is 1.5994987487792969 epoch: 5 step: 207, loss is 1.6471309661865234 epoch: 5 step: 208, loss is 1.5992558002471924 epoch: 5 step: 209, loss is 1.5258523225784302 epoch: 5 step: 210, loss is 1.589914321899414 epoch: 5 step: 211, loss is 1.569895625114441 epoch: 5 step: 212, loss is 1.5933986902236938 epoch: 5 step: 213, loss is 1.467947244644165 epoch: 5 step: 214, loss is 1.5026377439498901 epoch: 5 step: 215, loss is 1.5750608444213867 epoch: 5 step: 216, loss is 1.5915688276290894 epoch: 5 step: 217, loss is 1.6002682447433472 epoch: 5 step: 218, loss is 1.6803622245788574 epoch: 5 step: 219, loss is 1.577143907546997 epoch: 5 step: 220, loss is 1.6161741018295288 epoch: 5 step: 221, loss is 1.5676504373550415 epoch: 5 step: 222, loss is 1.5366853475570679 epoch: 5 step: 223, loss is 1.6090028285980225 epoch: 5 step: 224, loss is 1.5079410076141357 epoch: 5 step: 225, loss is 1.5618683099746704 epoch: 5 step: 226, loss is 1.638017177581787 epoch: 5 step: 227, loss is 1.5741792917251587 epoch: 5 step: 228, loss is 1.5870435237884521 epoch: 5 step: 229, loss is 1.6679387092590332 epoch: 5 step: 230, loss is 1.4960081577301025 epoch: 5 step: 231, loss is 1.538419246673584 epoch: 5 step: 232, loss is 1.5862116813659668 epoch: 5 step: 233, loss is 1.5973320007324219 epoch: 5 step: 234, loss is 1.5232126712799072 epoch: 5 step: 235, loss is 1.5877140760421753 epoch: 5 step: 236, loss is 1.6911592483520508 epoch: 5 step: 237, loss is 1.6973609924316406 epoch: 5 step: 238, loss is 1.5384098291397095 epoch: 5 step: 239, loss is 1.564143180847168 epoch: 5 step: 240, loss is 1.513045072555542 epoch: 5 step: 241, loss is 1.6131844520568848 epoch: 5 step: 242, loss is 1.494818925857544 epoch: 5 step: 243, loss is 1.523600697517395 epoch: 5 step: 244, loss is 1.5096313953399658 epoch: 5 step: 245, loss is 1.6016907691955566 epoch: 5 step: 246, loss is 1.4984893798828125 epoch: 5 step: 247, loss is 1.5625672340393066 epoch: 5 step: 248, loss is 1.5749375820159912 epoch: 5 step: 249, loss is 1.5304640531539917 epoch: 5 step: 250, loss is 1.5693752765655518 epoch: 5 step: 251, loss is 1.705450415611267 epoch: 5 step: 252, loss is 1.5074079036712646 epoch: 5 step: 253, loss is 1.5789763927459717 epoch: 5 step: 254, loss is 1.530361294746399 epoch: 5 step: 255, loss is 1.7976694107055664 epoch: 5 step: 256, loss is 1.6102001667022705 epoch: 5 step: 257, loss is 1.4735286235809326 epoch: 5 step: 258, loss is 1.6302123069763184 epoch: 5 step: 259, loss is 1.5966647863388062 epoch: 5 step: 260, loss is 1.6233352422714233 epoch: 5 step: 261, loss is 1.5608391761779785 epoch: 5 step: 262, loss is 1.606345534324646 epoch: 5 step: 263, loss is 1.5095794200897217 epoch: 5 step: 264, loss is 1.509196162223816 epoch: 5 step: 265, loss is 1.563521385192871 epoch: 5 step: 266, loss is 1.5619696378707886 epoch: 5 step: 267, loss is 1.6058316230773926 epoch: 5 step: 268, loss is 1.6006145477294922 epoch: 5 step: 269, loss is 1.53438401222229 epoch: 5 step: 270, loss is 1.671415090560913 epoch: 5 step: 271, loss is 1.5893216133117676 epoch: 5 step: 272, loss is 1.501842975616455 epoch: 5 step: 273, loss is 1.7832317352294922 epoch: 5 step: 274, loss is 1.4561469554901123 epoch: 5 step: 275, loss is 1.5942699909210205 epoch: 5 step: 276, loss is 1.4835069179534912 epoch: 5 step: 277, loss is 1.5552664995193481 epoch: 5 step: 278, loss is 1.538780689239502 epoch: 5 step: 279, loss is 1.523700475692749 epoch: 5 step: 280, loss is 1.5699076652526855 epoch: 5 step: 281, loss is 1.566076397895813 epoch: 5 step: 282, loss is 1.595410943031311 epoch: 5 step: 283, loss is 1.5802452564239502 epoch: 5 step: 284, loss is 1.5255502462387085 epoch: 5 step: 285, loss is 1.6844873428344727 epoch: 5 step: 286, loss is 1.5715992450714111 epoch: 5 step: 287, loss is 1.529794454574585 epoch: 5 step: 288, loss is 1.5810072422027588 epoch: 5 step: 289, loss is 1.7997422218322754 epoch: 5 step: 290, loss is 1.5457688570022583 epoch: 5 step: 291, loss is 1.592775583267212 epoch: 5 step: 292, loss is 1.6729099750518799 epoch: 5 step: 293, loss is 1.4456005096435547 epoch: 5 step: 294, loss is 1.682685375213623 epoch: 5 step: 295, loss is 1.559996485710144 epoch: 5 step: 296, loss is 1.587217092514038 epoch: 5 step: 297, loss is 1.4631487131118774 epoch: 5 step: 298, loss is 1.6209064722061157 epoch: 5 step: 299, loss is 1.6023503541946411 epoch: 5 step: 300, loss is 1.5702555179595947 epoch: 5 step: 301, loss is 1.656314730644226 epoch: 5 step: 302, loss is 1.6821871995925903 epoch: 5 step: 303, loss is 1.6096503734588623 epoch: 5 step: 304, loss is 1.6399579048156738 epoch: 5 step: 305, loss is 1.6309127807617188 epoch: 5 step: 306, loss is 1.5221701860427856 epoch: 5 step: 307, loss is 1.5615696907043457 epoch: 5 step: 308, loss is 1.5472203493118286 epoch: 5 step: 309, loss is 1.521003007888794 epoch: 5 step: 310, loss is 1.612450361251831 epoch: 5 step: 311, loss is 1.5359975099563599 epoch: 5 step: 312, loss is 1.631341814994812 epoch: 5 step: 313, loss is 1.575690746307373 epoch: 5 step: 314, loss is 1.5513726472854614 epoch: 5 step: 315, loss is 1.5994372367858887 epoch: 5 step: 316, loss is 1.533798098564148 epoch: 5 step: 317, loss is 1.5343968868255615 epoch: 5 step: 318, loss is 1.4302661418914795 epoch: 5 step: 319, loss is 1.5225865840911865 epoch: 5 step: 320, loss is 1.624767541885376 epoch: 5 step: 321, loss is 1.501556396484375 epoch: 5 step: 322, loss is 1.5757759809494019 epoch: 5 step: 324, loss is 1.578230857849121 epoch: 5 step: 325, loss is 1.729386568069458 epoch: 5 step: 326, loss is 1.5106805562973022 epoch: 5 step: 327, loss is 1.6114463806152344 epoch: 5 step: 328, loss is 1.5462558269500732 epoch: 5 step: 329, loss is 1.5644663572311401 epoch: 5 step: 330, loss is 1.6486071348190308 epoch: 5 step: 331, loss is 1.6583094596862793 epoch: 5 step: 332, loss is 1.4188274145126343 epoch: 5 step: 333, loss is 1.485639214515686 epoch: 5 step: 334, loss is 1.558194637298584 epoch: 5 step: 335, loss is 1.7039393186569214 epoch: 5 step: 336, loss is 1.7015631198883057 epoch: 5 step: 337, loss is 1.5780913829803467 epoch: 5 step: 338, loss is 1.5033587217330933 epoch: 5 step: 339, loss is 1.537819743156433 epoch: 5 step: 340, loss is 1.8121535778045654 epoch: 5 step: 341, loss is 1.5806516408920288 epoch: 5 step: 342, loss is 1.5796537399291992 epoch: 5 step: 343, loss is 1.6168782711029053 epoch: 5 step: 344, loss is 1.641409158706665 epoch: 5 step: 345, loss is 1.5133163928985596 epoch: 5 step: 346, loss is 1.6856012344360352 epoch: 5 step: 347, loss is 1.7320266962051392 epoch: 5 step: 348, loss is 1.5903704166412354 epoch: 5 step: 349, loss is 1.7494301795959473 epoch: 5 step: 350, loss is 1.6118680238723755 epoch: 5 step: 351, loss is 1.5314308404922485 epoch: 5 step: 352, loss is 1.556494951248169 epoch: 5 step: 353, loss is 1.714756727218628 epoch: 5 step: 354, loss is 1.5310161113739014 epoch: 5 step: 355, loss is 1.7872475385665894 epoch: 5 step: 356, loss is 1.687711238861084 epoch: 5 step: 357, loss is 1.7892705202102661 epoch: 5 step: 358, loss is 1.5611436367034912 epoch: 5 step: 359, loss is 1.5671555995941162 epoch: 5 step: 360, loss is 1.6102842092514038 epoch: 5 step: 361, loss is 1.5782215595245361 epoch: 5 step: 362, loss is 1.6032049655914307 epoch: 5 step: 363, loss is 1.6011167764663696 epoch: 5 step: 364, loss is 1.4326987266540527 epoch: 5 step: 365, loss is 1.5786182880401611 epoch: 5 step: 366, loss is 1.61308753490448 epoch: 5 step: 367, loss is 1.5704137086868286 epoch: 5 step: 368, loss is 1.5738422870635986 epoch: 5 step: 369, loss is 1.6481573581695557 epoch: 5 step: 370, loss is 1.5465118885040283 epoch: 5 step: 371, loss is 1.5927613973617554 epoch: 5 step: 372, loss is 1.5779811143875122 epoch: 5 step: 373, loss is 1.6706527471542358 epoch: 5 step: 374, loss is 1.658896565437317 epoch: 5 step: 375, loss is 1.6150617599487305 epoch: 5 step: 376, loss is 1.706315517425537 epoch: 5 step: 377, loss is 1.5464962720870972 epoch: 5 step: 378, loss is 1.5608888864517212 epoch: 5 step: 379, loss is 1.610548973083496 epoch: 5 step: 380, loss is 1.610349416732788 epoch: 5 step: 381, loss is 1.6361488103866577 epoch: 5 step: 382, loss is 1.5933778285980225 epoch: 5 step: 383, loss is 1.5234771966934204 epoch: 5 step: 384, loss is 1.636739730834961 epoch: 5 step: 385, loss is 1.6566827297210693 epoch: 5 step: 386, loss is 1.5916155576705933 epoch: 5 step: 387, loss is 1.6105828285217285 epoch: 5 step: 388, loss is 1.483259916305542 epoch: 5 step: 389, loss is 1.6239800453186035 epoch: 5 step: 390, loss is 1.6295044422149658 Train epoch time: 141315.200 ms, per step time: 362.347 ms total time:0h 14m 38s ============== Train Success ============== 2024-07-26 13:10:06 littlewoodhu
训练好的模型保存在当前目录的shufflenetv1-5_390.ckpt中,用作评估。
模型评估
在CIFAR-10的测试集上对模型进行评估。
设置好评估模型的路径后加载数据集,并设置Top 1, Top 5的评估标准,最后用model.eval()接口对模型进行评估。
2024-07-26 13:12:47 littlewoodhu
model size is 2.0x
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.175.441 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.176.033 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.178.767 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.182.675 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.184.922 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.186.321 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.188.563 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.190.344 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.191.313 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.193.074 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.194.901 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.197.086 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.198.914 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.203.316 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.203.821 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.209.014 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.209.532 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.210.503 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.213.997 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.214.081 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.216.718 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.218.101 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.221.238 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.223.003 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.226.538 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.228.291 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.229.274 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.234.036 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.235.840 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.236.370 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.238.174 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:12:56.240.785 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
result:{'Loss': 1.57918219994276, 'Top_1_Acc': 0.5193309294871795, 'Top_5_Acc': 0.9412059294871795}, ckpt:'./shufflenetv1-5_390.ckpt', time: 0h 0m 59s
2024-07-26 13:13:48 littlewoodhu
模型预测
在CIFAR-10的测试集上对模型进行预测,并将预测结果可视化。
2024-07-26 13:17:47 littlewoodhu
model size is 2.0x
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.143.404 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/1681751341.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.143.520 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/1681751341.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.147.209 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.147.744 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.149.133 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/778396864.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.149.664 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.153.573 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.155.817 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.157.214 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.159.441 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.161.215 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.162.186 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.163.952 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.165.782 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.167.964 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.169.789 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.172.813 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/778396864.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.173.357 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.173.857 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.179.055 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.179.561 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.180.587 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.184.078 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.184.164 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.186.834 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.188.188 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.189.530 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/778396864.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.190.495 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.192.257 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.195.771 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.197.553 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.198.532 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.203.338 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.205.223 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.205.756 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.207.547 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
[ERROR] CORE(15409,ffff8e187930,python):2024-07-26-13:17:57.210.215 [mindspore/core/utils/file_utils.cc:253] GetRealPath] Get realpath failed, path[/tmp/ipykernel_15409/4100419229.py]
2024-07-26 13:18:28 littlewoodhu