Number of GFLOPs: 15.571533312
Number of million parameters: 88.673166
Start training for 30 epochs
Epoch: [0/30] [ 0/1251] eta: 1:59:58 lr: 0.000000 loss: 6.907769 (6.907769) time: 5.753918 data: 2.032651 max mem: 17831
Epoch: [0/30] [ 50/1251] eta: 0:21:28 lr: 0.000001 loss: 6.907279 (6.907505) time: 1.004004 data: 0.000173 max mem: 18813
Epoch: [0/30] [ 100/1251] eta: 0:19:07 lr: 0.000001 loss: 6.905505 (6.906830) time: 0.914264 data: 0.000169 max mem: 18813
Epoch: [0/30] [ 150/1251] eta: 0:18:04 lr: 0.000002 loss: 6.902154 (6.905675) time: 0.927384 data: 0.000164 max mem: 18813
Epoch: [0/30] [ 200/1251] eta: 0:17:02 lr: 0.000003 loss: 6.896926 (6.903918) time: 0.915076 data: 0.000159 max mem: 18813
Epoch: [0/30] [ 250/1251] eta: 0:16:15 lr: 0.000003 loss: 6.888228 (6.901438) time: 0.974258 data: 0.000175 max mem: 18813
Epoch: [0/30] [ 300/1251] eta: 0:15:24 lr: 0.000004 loss: 6.876930 (6.898049) time: 0.985533 data: 0.000148 max mem: 18813
Epoch: [0/30] [ 350/1251] eta: 0:14:29 lr: 0.000005 loss: 6.860416 (6.893497) time: 0.913502 data: 0.000140 max mem: 18813
Epoch: [0/30] [ 400/1251] eta: 0:13:41 lr: 0.000005 loss: 6.839359 (6.887606) time: 0.923989 data: 0.000159 max mem: 18813
Epoch: [0/30] [ 450/1251] eta: 0:12:53 lr: 0.000006 loss: 6.811071 (6.880047) time: 0.922484 data: 0.000150 max mem: 18813
Epoch: [0/30] [ 500/1251] eta: 0:12:03 lr: 0.000006 loss: 6.775376 (6.870621) time: 0.954371 data: 0.000168 max mem: 18813
Epoch: [0/30] [ 550/1251] eta: 0:11:15 lr: 0.000007 loss: 6.729095 (6.859184) time: 0.995157 data: 0.000174 max mem: 18813
Epoch: [0/30] [ 600/1251] eta: 0:10:24 lr: 0.000008 loss: 6.672769 (6.845325) time: 0.911196 data: 0.000150 max mem: 18813
Epoch: [0/30] [ 650/1251] eta: 0:09:37 lr: 0.000008 loss: 6.617243 (6.829399) time: 0.931651 data: 0.000136 max mem: 18813
Epoch: [0/30] [ 700/1251] eta: 0:08:50 lr: 0.000009 loss: 6.554528 (6.810939) time: 0.916377 data: 0.000140 max mem: 18813
Epoch: [0/30] [ 750/1251] eta: 0:08:01 lr: 0.000010 loss: 6.475686 (6.790411) time: 1.000564 data: 0.000165 max mem: 18813
Epoch: [0/30] [ 800/1251] eta: 0:07:13 lr: 0.000010 loss: 6.394699 (6.767452) time: 0.972682 data: 0.000174 max mem: 18813
Epoch: [0/30] [ 850/1251] eta: 0:06:24 lr: 0.000011 loss: 6.320889 (6.742600) time: 0.911281 data: 0.000172 max mem: 18813
Epoch: [0/30] [ 900/1251] eta: 0:05:36 lr: 0.000012 loss: 6.222483 (6.715599) time: 0.927846 data: 0.000169 max mem: 18813
Epoch: [0/30] [ 950/1251] eta: 0:04:48 lr: 0.000012 loss: 6.142892 (6.686553) time: 0.961660 data: 0.000178 max mem: 18813
Epoch: [0/30] [1000/1251] eta: 0:04:00 lr: 0.000013 loss: 6.043197 (6.656102) time: 1.024278 data: 0.000172 max mem: 18813
Epoch: [0/30] [1050/1251] eta: 0:03:12 lr: 0.000013 loss: 5.933037 (6.624304) time: 0.992375 data: 0.000156 max mem: 18813
Epoch: [0/30] [1100/1251] eta: 0:02:24 lr: 0.000014 loss: 5.845234 (6.590228) time: 0.939977 data: 0.000154 max mem: 18813
Epoch: [0/30] [1150/1251] eta: 0:01:36 lr: 0.000015 loss: 5.761835 (6.554999) time: 0.929816 data: 0.000170 max mem: 18813
Epoch: [0/30] [1200/1251] eta: 0:00:48 lr: 0.000015 loss: 5.645113 (6.518176) time: 0.962107 data: 0.000144 max mem: 18813
Epoch: [0/30] [1250/1251] eta: 0:00:00 lr: 0.000016 loss: 5.534029 (6.480035) time: 1.001614 data: 0.000703 max mem: 18813
Epoch: [0/30] Total time: 0:20:00 (0.959699 s / it)
Averaged stats: lr: 0.000016 loss: 5.534029 (6.479627)
Test: [ 0/49] eta: 0:01:35 loss: 5.120972 (5.120972) acc1: 48.437500 (48.437500) acc5: 85.937500 (85.937500) time: 1.950763 data: 1.549305 max mem: 18813
Test: [10/49] eta: 0:00:19 loss: 5.161194 (5.155257) acc1: 48.437500 (48.295455) acc5: 81.250000 (81.250000) time: 0.508665 data: 0.140980 max mem: 18813
Test: [20/49] eta: 0:00:12 loss: 5.170288 (5.172878) acc1: 48.437500 (47.916667) acc5: 79.687500 (80.580357) time: 0.363348 data: 0.000154 max mem: 18813
Test: [30/49] eta: 0:00:07 loss: 5.165039 (5.160627) acc1: 45.312500 (47.530242) acc5: 81.250000 (80.796371) time: 0.361733 data: 0.000162 max mem: 18813
Test: [40/49] eta: 0:00:03 loss: 5.162354 (5.169319) acc1: 46.875000 (47.332317) acc5: 79.687500 (79.954268) time: 0.359716 data: 0.000169 max mem: 18813
Test: [48/49] eta: 0:00:00 loss: 5.165039 (5.165702) acc1: 48.437500 (48.192000) acc5: 79.687500 (80.320000) time: 0.359953 data: 0.000144 max mem: 18813
Test: Total time: 0:00:19 (0.397619 s / it)
* Acc@1 47.522 Acc@5 80.188 loss 5.178
Max accuracy: 47.52%
Epoch: [1/30] [ 0/1251] eta: 0:44:42 lr: 0.000016 loss: 5.342369 (5.342369) time: 2.144481 data: 1.209252 max mem: 18813
Epoch: [1/30] [ 50/1251] eta: 0:19:41 lr: 0.000017 loss: 5.398561 (5.433383) time: 0.947995 data: 0.000449 max mem: 18815
Epoch: [1/30] [ 100/1251] eta: 0:18:30 lr: 0.000017 loss: 5.294445 (5.385332) time: 0.920365 data: 0.000182 max mem: 18815
Epoch: [1/30] [ 150/1251] eta: 0:17:43 lr: 0.000018 loss: 5.200863 (5.332964) time: 0.939633 data: 0.000167 max mem: 18815
Epoch: [1/30] [ 200/1251] eta: 0:16:56 lr: 0.000019 loss: 5.069255 (5.278503) time: 0.978079 data: 0.000192 max mem: 18815
Epoch: [1/30] [ 250/1251] eta: 0:16:03 lr: 0.000019 loss: 4.974529 (5.224460) time: 0.991067 data: 0.000173 max mem: 18815
Epoch: [1/30] [ 300/1251] eta: 0:15:13 lr: 0.000020 loss: 4.802037 (5.164992) time: 0.929451 data: 0.000155 max mem: 18815
Epoch: [1/30] [ 350/1251] eta: 0:14:25 lr: 0.000021 loss: 4.736430 (5.106482) time: 0.905852 data: 0.000154 max mem: 18815
Epoch: [1/30] [ 400/1251] eta: 0:13:39 lr: 0.000021 loss: 4.585870 (5.050001) time: 1.015032 data: 0.000163 max mem: 18815
Epoch: [1/30] [ 450/1251] eta: 0:12:50 lr: 0.000022 loss: 4.528177 (4.992104) time: 0.987965 data: 0.000153 max mem: 18815
Epoch: [1/30] [ 500/1251] eta: 0:12:01 lr: 0.000022 loss: 4.325572 (4.931572) time: 0.933089 data: 0.000147 max mem: 18815
Epoch: [1/30] [ 550/1251] eta: 0:11:13 lr: 0.000023 loss: 4.276558 (4.873658) time: 0.929154 data: 0.000141 max mem: 18815
Epoch: [1/30] [ 600/1251] eta: 0:10:26 lr: 0.000024 loss: 4.132976 (4.816234) time: 0.924124 data: 0.000154 max mem: 18815
Epoch: [1/30] [ 650/1251] eta: 0:09:38 lr: 0.000024 loss: 4.040769 (4.760690) time: 1.005866 data: 0.000165 max mem: 18815
Epoch: [1/30] [ 700/1251] eta: 0:08:50 lr: 0.000025 loss: 3.895721 (4.703592) time: 1.020725 data: 0.000144 max mem: 18815
Epoch: [1/30] [ 750/1251] eta: 0:08:01 lr: 0.000026 loss: 3.750602 (4.646671) time: 0.919406 data: 0.000170 max mem: 18815
Epoch: [1/30] [ 800/1251] eta: 0:07:13 lr: 0.000026 loss: 3.705172 (4.590001) time: 0.927382 data: 0.000164 max mem: 18815
Epoch: [1/30] [ 850/1251] eta: 0:06:25 lr: 0.000027 loss: 3.554118 (4.533204) time: 0.921614 data: 0.000178 max mem: 18815
Epoch: [1/30] [ 900/1251] eta: 0:05:37 lr: 0.000028 loss: 3.518379 (4.477905) time: 0.979525 data: 0.000157 max mem: 18815
Epoch: [1/30] [ 950/1251] eta: 0:04:49 lr: 0.000028 loss: 3.409413 (4.424141) time: 0.977659 data: 0.000158 max mem: 18815
Epoch: [1/30] [1000/1251] eta: 0:04:01 lr: 0.000029 loss: 3.358865 (4.370425) time: 0.982971 data: 0.000137 max mem: 18815
Epoch: [1/30] [1050/1251] eta: 0:03:12 lr: 0.000029 loss: 3.279218 (4.317772) time: 0.908248 data: 0.000155 max mem: 18815
Epoch: [1/30] [1100/1251] eta: 0:02:25 lr: 0.000030 loss: 3.090842 (4.265965) time: 0.921398 data: 0.000169 max mem: 18815
Epoch: [1/30] [1150/1251] eta: 0:01:37 lr: 0.000031 loss: 3.064238 (4.216055) time: 0.980600 data: 0.000152 max mem: 18815
Epoch: [1/30] [1200/1251] eta: 0:00:48 lr: 0.000031 loss: 2.981287 (4.165706) time: 1.012517 data: 0.000151 max mem: 18815
Epoch: [1/30] [1250/1251] eta: 0:00:00 lr: 0.000032 loss: 2.869059 (4.115613) time: 0.927814 data: 0.000762 max mem: 18815
Epoch: [1/30] Total time: 0:20:00 (0.959955 s / it)
Averaged stats: lr: 0.000032 loss: 2.869059 (4.110146)
Test: [ 0/49] eta: 0:01:20 loss: 1.794739 (1.794739) acc1: 65.625000 (65.625000) acc5: 95.312500 (95.312500) time: 1.648818 data: 1.224955 max mem: 18815
Test: [10/49] eta: 0:00:18 loss: 1.838058 (1.840876) acc1: 67.187500 (69.744318) acc5: 90.625000 (91.477273) time: 0.486946 data: 0.111511 max mem: 18815
Test: [20/49] eta: 0:00:12 loss: 1.848785 (1.840974) acc1: 68.750000 (69.270833) acc5: 90.625000 (92.113095) time: 0.365655 data: 0.000159 max mem: 18815
Test: [30/49] eta: 0:00:07 loss: 1.829212 (1.827243) acc1: 71.875000 (70.211694) acc5: 92.187500 (92.389113) time: 0.361064 data: 0.000157 max mem: 18815
Test: [40/49] eta: 0:00:03 loss: 1.828815 (1.836450) acc1: 71.875000 (70.350610) acc5: 92.187500 (92.187500) time: 0.431129 data: 0.000164 max mem: 18815
Test: [48/49] eta: 0:00:00 loss: 1.860443 (1.835775) acc1: 70.312500 (70.400000) acc5: 92.187500 (92.192000) time: 0.425537 data: 0.000127 max mem: 18815
Test: Total time: 0:00:20 (0.418199 s / it)
* Acc@1 69.804 Acc@5 92.670 loss 1.847
Max accuracy: 69.80%
Epoch: [2/30] [ 0/1251] eta: 2:37:42 lr: 0.000032 loss: 3.117259 (3.117259) time: 7.563788 data: 2.038023 max mem: 18508
Epoch: [2/30] [ 50/1251] eta: 0:22:03 lr: 0.000033 loss: 2.823927 (2.882034) time: 0.989222 data: 0.000191 max mem: 18811
Epoch: [2/30] [ 100/1251] eta: 0:19:51 lr: 0.000033 loss: 2.749346 (2.851519) time: 0.970854 data: 0.000176 max mem: 18811
Epoch: [2/30] [ 150/1251] eta: 0:18:24 lr: 0.000034 loss: 2.729421 (2.816434) time: 0.949202 data: 0.000173 max mem: 18811
Epoch: [2/30] [ 200/1251] eta: 0:17:18 lr: 0.000035 loss: 2.702589 (2.781354) time: 0.917322 data: 0.000182 max mem: 18811
Epoch: [2/30] [ 250/1251] eta: 0:16:27 lr: 0.000035 loss: 2.679373 (2.751496) time: 0.919550 data: 0.000179 max mem: 18811
Epoch: [2/30] [ 300/1251] eta: 0:15:35 lr: 0.000036 loss: 2.548131 (2.721130) time: 1.002336 data: 0.000169 max mem: 18811
Epoch: [2/30] [ 350/1251] eta: 0:14:44 lr: 0.000037 loss: 2.503245 (2.696706) time: 0.981758 data: 0.000162 max mem: 18811
Epoch: [2/30] [ 400/1251] eta: 0:13:51 lr: 0.000037 loss: 2.475694 (2.666958) time: 0.974602 data: 0.000151 max mem: 18811
Epoch: [2/30] [ 450/1251] eta: 0:13:00 lr: 0.000038 loss: 2.386580 (2.646074) time: 0.926414 data: 0.000167 max mem: 18811
Epoch: [2/30] [ 500/1251] eta: 0:12:10 lr: 0.000038 loss: 2.408618 (2.625968) time: 0.923288 data: 0.000169 max mem: 18811
Epoch: [2/30] [ 550/1251] eta: 0:11:20 lr: 0.000039 loss: 2.445055 (2.612654) time: 0.971602 data: 0.000173 max mem: 18811
Epoch: [2/30] [ 600/1251] eta: 0:10:31 lr: 0.000040 loss: 2.472404 (2.594114) time: 0.978623 data: 0.000165 max mem: 18811
Epoch: [2/30] [ 650/1251] eta: 0:09:41 lr: 0.000040 loss: 2.333077 (2.577416) time: 0.957133 data: 0.000158 max mem: 18811
Epoch: [2/30] [ 700/1251] eta: 0:08:52 lr: 0.000041 loss: 2.345978 (2.558095) time: 0.938147 data: 0.000177 max mem: 18811
Epoch: [2/30] [ 750/1251] eta: 0:08:04 lr: 0.000042 loss: 2.117947 (2.539319) time: 0.921189 data: 0.000186 max mem: 18811
Epoch: [2/30] [ 800/1251] eta: 0:07:15 lr: 0.000042 loss: 2.311274 (2.523961) time: 0.987141 data: 0.000181 max mem: 18811
Epoch: [2/30] [ 850/1251] eta: 0:06:27 lr: 0.000043 loss: 2.192958 (2.508633) time: 0.995674 data: 0.000174 max mem: 18811
Epoch: [2/30] [ 900/1251] eta: 0:05:39 lr: 0.000044 loss: 2.184378 (2.493338) time: 0.988767 data: 0.000185 max mem: 18811
Epoch: [2/30] [ 950/1251] eta: 0:04:50 lr: 0.000044 loss: 2.148633 (2.479272) time: 0.928081 data: 0.000182 max mem: 18811
Epoch: [2/30] [1000/1251] eta: 0:04:02 lr: 0.000045 loss: 2.224178 (2.468308) time: 0.917606 data: 0.000153 max mem: 18811
Epoch: [2/30] [1050/1251] eta: 0:03:14 lr: 0.000045 loss: 2.176979 (2.456314) time: 0.981390 data: 0.000167 max mem: 18811
Epoch: [2/30] [1100/1251] eta: 0:02:25 lr: 0.000046 loss: 2.122762 (2.444699) time: 0.978532 data: 0.000162 max mem: 18811
Epoch: [2/30] [1150/1251] eta: 0:01:37 lr: 0.000047 loss: 2.234807 (2.433034) time: 0.950007 data: 0.000158 max mem: 18811
Epoch: [2/30] [1200/1251] eta: 0:00:49 lr: 0.000047 loss: 2.101430 (2.419012) time: 0.914246 data: 0.000171 max mem: 18811
Epoch: [2/30] [1250/1251] eta: 0:00:00 lr: 0.000048 loss: 2.141081 (2.408438) time: 0.909848 data: 0.000749 max mem: 18811
Epoch: [2/30] Total time: 0:20:06 (0.964195 s / it)
Averaged stats: lr: 0.000048 loss: 2.141081 (2.411216)
Test: [ 0/49] eta: 0:01:12 loss: 0.819277 (0.819277) acc1: 82.812500 (82.812500) acc5: 95.312500 (95.312500) time: 1.488688 data: 1.060550 max mem: 18811
Test: [10/49] eta: 0:00:18 loss: 0.819277 (0.873148) acc1: 82.812500 (80.823864) acc5: 95.312500 (95.028409) time: 0.479735 data: 0.096547 max mem: 18811
Test: [20/49] eta: 0:00:12 loss: 0.886889 (0.873632) acc1: 79.687500 (79.836310) acc5: 95.312500 (95.461310) time: 0.370680 data: 0.000146 max mem: 18811
Test: [30/49] eta: 0:00:07 loss: 0.896453 (0.868674) acc1: 79.687500 (80.191532) acc5: 95.312500 (95.362903) time: 0.364471 data: 0.000147 max mem: 18811
Test: [40/49] eta: 0:00:03 loss: 0.876935 (0.873630) acc1: 79.687500 (80.030488) acc5: 96.875000 (95.769817) time: 0.363941 data: 0.000144 max mem: 18811
Test: [48/49] eta: 0:00:00 loss: 0.877183 (0.874001) acc1: 79.687500 (79.968000) acc5: 96.875000 (95.776000) time: 0.364124 data: 0.000114 max mem: 18811
Test: Total time: 0:00:19 (0.391470 s / it)
* Acc@1 79.852 Acc@5 96.026 loss 0.866
Max accuracy: 79.85%
Epoch: [3/30] [ 0/1251] eta: 0:44:29 lr: 0.000048 loss: 2.093296 (2.093296) time: 2.133612 data: 1.211761 max mem: 18811
Epoch: [3/30] [ 50/1251] eta: 0:19:38 lr: 0.000049 loss: 2.161170 (2.116903) time: 0.976229 data: 0.000198 max mem: 18812
Epoch: [3/30] [ 100/1251] eta: 0:18:31 lr: 0.000049 loss: 1.968759 (2.124200) time: 0.981426 data: 0.000172 max mem: 18812
Epoch: [3/30] [ 150/1251] eta: 0:17:31 lr: 0.000050 loss: 2.138469 (2.119772) time: 0.912619 data: 0.000198 max mem: 18812
Epoch: [3/30] [ 200/1251] eta: 0:16:50 lr: 0.000051 loss: 2.112669 (2.118256) time: 0.940744 data: 0.000181 max mem: 18812
Epoch: [3/30] [ 250/1251] eta: 0:16:02 lr: 0.000051 loss: 2.047241 (2.108643) time: 0.983221 data: 0.000170 max mem: 18812
Epoch: [3/30] [ 300/1251] eta: 0:15:15 lr: 0.000052 loss: 2.155700 (2.110632) time: 0.972091 data: 0.000163 max mem: 18812
Epoch: [3/30] [ 350/1251] eta: 0:14:23 lr: 0.000053 loss: 2.185365 (2.113258) time: 0.940752 data: 0.000159 max mem: 18812
Epoch: [3/30] [ 400/1251] eta: 0:13:34 lr: 0.000053 loss: 1.957350 (2.110682) time: 0.920554 data: 0.000156 max mem: 18812
Epoch: [3/30] [ 450/1251] eta: 0:12:46 lr: 0.000054 loss: 2.043673 (2.105016) time: 0.909549 data: 0.000175 max mem: 18812
Epoch: [3/30] [ 500/1251] eta: 0:11:59 lr: 0.000054 loss: 2.044163 (2.100855) time: 1.003967 data: 0.000164 max mem: 18812
Epoch: [3/30] [ 550/1251] eta: 0:11:12 lr: 0.000055 loss: 2.013303 (2.096932) time: 0.983365 data: 0.000178 max mem: 18812
Epoch: [3/30] [ 600/1251] eta: 0:10:23 lr: 0.000056 loss: 2.030689 (2.094262) time: 0.972647 data: 0.000146 max mem: 18812
Epoch: [3/30] [ 650/1251] eta: 0:09:34 lr: 0.000056 loss: 2.063792 (2.091872) time: 0.916057 data: 0.000151 max mem: 18812
Epoch: [3/30] [ 700/1251] eta: 0:08:47 lr: 0.000057 loss: 2.096756 (2.092382) time: 0.927922 data: 0.000159 max mem: 18812
Epoch: [3/30] [ 750/1251] eta: 0:07:59 lr: 0.000058 loss: 2.050256 (2.092355) time: 0.964038 data: 0.000184 max mem: 18812
Epoch: [3/30] [ 800/1251] eta: 0:07:11 lr: 0.000058 loss: 2.070899 (2.087696) time: 0.993038 data: 0.000179 max mem: 18812
Epoch: [3/30] [ 850/1251] eta: 0:06:23 lr: 0.000059 loss: 1.955710 (2.082950) time: 0.988669 data: 0.000186 max mem: 18812
Epoch: [3/30] [ 900/1251] eta: 0:05:35 lr: 0.000060 loss: 2.070636 (2.080995) time: 0.904785 data: 0.000171 max mem: 18812
Epoch: [3/30] [ 950/1251] eta: 0:04:47 lr: 0.000060 loss: 1.996132 (2.078922) time: 0.925930 data: 0.000186 max mem: 18812
Epoch: [3/30] [1000/1251] eta: 0:04:00 lr: 0.000061 loss: 2.003647 (2.077625) time: 0.996475 data: 0.000172 max mem: 18812
Epoch: [3/30] [1050/1251] eta: 0:03:12 lr: 0.000061 loss: 2.020558 (2.074844) time: 0.995397 data: 0.000173 max mem: 18812
Epoch: [3/30] [1100/1251] eta: 0:02:24 lr: 0.000062 loss: 1.929363 (2.070751) time: 0.983087 data: 0.000163 max mem: 18812
Epoch: [3/30] [1150/1251] eta: 0:01:36 lr: 0.000063 loss: 2.067504 (2.068354) time: 0.914764 data: 0.000184 max mem: 18812
Epoch: [3/30] [1200/1251] eta: 0:00:48 lr: 0.000063 loss: 1.995960 (2.065156) time: 0.907111 data: 0.000170 max mem: 18812
Epoch: [3/30] [1250/1251] eta: 0:00:00 lr: 0.000064 loss: 2.059585 (2.062525) time: 0.969865 data: 0.000782 max mem: 18812
Epoch: [3/30] Total time: 0:19:58 (0.958131 s / it)
Averaged stats: lr: 0.000064 loss: 2.059585 (2.060936)
Test: [ 0/49] eta: 0:01:16 loss: 0.553213 (0.553213) acc1: 89.062500 (89.062500) acc5: 98.437500 (98.437500) time: 1.569177 data: 1.123201 max mem: 18812
Test: [10/49] eta: 0:00:18 loss: 0.665076 (0.710930) acc1: 84.375000 (83.806818) acc5: 96.875000 (96.306818) time: 0.477183 data: 0.102264 max mem: 18812
Test: [20/49] eta: 0:00:12 loss: 0.729310 (0.728863) acc1: 82.812500 (82.589286) acc5: 96.875000 (96.428571) time: 0.364525 data: 0.000154 max mem: 18812
Test: [30/49] eta: 0:00:07 loss: 0.743747 (0.727426) acc1: 82.812500 (82.711694) acc5: 96.875000 (96.522177) time: 0.371103 data: 0.000132 max mem: 18812
Test: [40/49] eta: 0:00:03 loss: 0.721907 (0.732621) acc1: 81.250000 (82.202744) acc5: 96.875000 (96.722561) time: 0.376360 data: 0.000120 max mem: 18812
Test: [48/49] eta: 0:00:00 loss: 0.726166 (0.733993) acc1: 81.250000 (82.304000) acc5: 96.875000 (96.672000) time: 0.374233 data: 0.000101 max mem: 18812
Test: Total time: 0:00:19 (0.396993 s / it)
* Acc@1 82.110 Acc@5 96.888 loss 0.727
Max accuracy: 82.11%
Epoch: [4/30] [ 0/1251] eta: 0:41:51 lr: 0.000064 loss: 2.160758 (2.160758) time: 2.007854 data: 1.113592 max mem: 18812
Epoch: [4/30] [ 50/1251] eta: 0:19:49 lr: 0.000065 loss: 1.921715 (1.970887) time: 0.976855 data: 0.000188 max mem: 18812
Epoch: [4/30] [ 100/1251] eta: 0:18:34 lr: 0.000065 loss: 1.963769 (1.981809) time: 0.916847 data: 0.000201 max mem: 18812
Epoch: [4/30] [ 150/1251] eta: 0:17:43 lr: 0.000066 loss: 2.004008 (1.989794) time: 0.941835 data: 0.000192 max mem: 18812
Epoch: [4/30] [ 200/1251] eta: 0:16:55 lr: 0.000067 loss: 2.024620 (1.988152) time: 0.976843 data: 0.000193 max mem: 18812
Epoch: [4/30] [ 250/1251] eta: 0:16:01 lr: 0.000067 loss: 2.002316 (1.988869) time: 0.983996 data: 0.000188 max mem: 18812
Epoch: [4/30] [ 300/1251] eta: 0:15:14 lr: 0.000068 loss: 1.978310 (1.986969) time: 0.984047 data: 0.000173 max mem: 18812
Epoch: [4/30] [ 350/1251] eta: 0:14:24 lr: 0.000069 loss: 1.968387 (1.983862) time: 0.916790 data: 0.000159 max mem: 18812
Epoch: [4/30] [ 400/1251] eta: 0:13:36 lr: 0.000069 loss: 1.898435 (1.976150) time: 0.965119 data: 0.000176 max mem: 18812
Epoch: [4/30] [ 450/1251] eta: 0:12:49 lr: 0.000070 loss: 1.921138 (1.977248) time: 0.966287 data: 0.000179 max mem: 18812
Epoch: [4/30] [ 500/1251] eta: 0:12:00 lr: 0.000070 loss: 1.945817 (1.974590) time: 0.983934 data: 0.000169 max mem: 18812
Epoch: [4/30] [ 550/1251] eta: 0:11:12 lr: 0.000071 loss: 2.011374 (1.977236) time: 0.956899 data: 0.000157 max mem: 18812
Epoch: [4/30] [ 600/1251] eta: 0:10:23 lr: 0.000072 loss: 1.888811 (1.980192) time: 0.926140 data: 0.000162 max mem: 18812
Epoch: [4/30] [ 650/1251] eta: 0:09:36 lr: 0.000072 loss: 1.939773 (1.980285) time: 0.976516 data: 0.000175 max mem: 18812
Epoch: [4/30] [ 700/1251] eta: 0:08:48 lr: 0.000073 loss: 1.914326 (1.977802) time: 0.963142 data: 0.000156 max mem: 18812
Epoch: [4/30] [ 750/1251] eta: 0:07:59 lr: 0.000074 loss: 1.980073 (1.976506) time: 0.964793 data: 0.000180 max mem: 18812
Epoch: [4/30] [ 800/1251] eta: 0:07:11 lr: 0.000074 loss: 1.949038 (1.975322) time: 0.924085 data: 0.000178 max mem: 18812
Epoch: [4/30] [ 850/1251] eta: 0:06:23 lr: 0.000075 loss: 1.955564 (1.973173) time: 0.919495 data: 0.000196 max mem: 18812
Epoch: [4/30] [ 900/1251] eta: 0:05:35 lr: 0.000076 loss: 1.910495 (1.971710) time: 0.980969 data: 0.000177 max mem: 18812
Epoch: [4/30] [ 950/1251] eta: 0:04:48 lr: 0.000076 loss: 1.916286 (1.970237) time: 0.991974 data: 0.000188 max mem: 18812
Epoch: [4/30] [1000/1251] eta: 0:04:00 lr: 0.000077 loss: 1.901226 (1.967417) time: 0.968497 data: 0.000157 max mem: 18812
Epoch: [4/30] [1050/1251] eta: 0:03:12 lr: 0.000077 loss: 1.960696 (1.966882) time: 0.921725 data: 0.000162 max mem: 18812
Epoch: [4/30] [1100/1251] eta: 0:02:24 lr: 0.000078 loss: 1.890753 (1.963948) time: 0.906941 data: 0.000175 max mem: 18812
Epoch: [4/30] [1150/1251] eta: 0:01:36 lr: 0.000079 loss: 1.954876 (1.963607) time: 0.981185 data: 0.000174 max mem: 18812
Epoch: [4/30] [1200/1251] eta: 0:00:48 lr: 0.000079 loss: 1.912019 (1.962697) time: 0.973780 data: 0.000169 max mem: 18812
Epoch: [4/30] [1250/1251] eta: 0:00:00 lr: 0.000075 loss: 1.904804 (1.962699) time: 0.979933 data: 0.000777 max mem: 18812
Epoch: [4/30] Total time: 0:19:56 (0.956739 s / it)
Averaged stats: lr: 0.000075 loss: 1.904804 (1.965014)
Test: [ 0/49] eta: 0:01:25 loss: 0.524498 (0.524498) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.750638 data: 1.341911 max mem: 18812
Test: [10/49] eta: 0:00:19 loss: 0.603143 (0.673801) acc1: 84.375000 (84.943182) acc5: 96.875000 (96.875000) time: 0.493820 data: 0.122140 max mem: 18812
Test: [20/49] eta: 0:00:12 loss: 0.703104 (0.693057) acc1: 82.812500 (83.630952) acc5: 96.875000 (96.875000) time: 0.375726 data: 0.000144 max mem: 18812
Test: [30/49] eta: 0:00:09 loss: 0.710021 (0.690330) acc1: 84.375000 (83.568548) acc5: 96.875000 (97.026210) time: 0.479815 data: 0.000138 max mem: 18812
Test: [40/49] eta: 0:00:04 loss: 0.696287 (0.696782) acc1: 82.812500 (83.269817) acc5: 96.875000 (97.141768) time: 0.466116 data: 0.000131 max mem: 18812
Test: [48/49] eta: 0:00:00 loss: 0.696287 (0.697040) acc1: 82.812500 (83.456000) acc5: 98.437500 (97.088000) time: 0.353604 data: 0.000099 max mem: 18812
Test: Total time: 0:00:21 (0.439599 s / it)
* Acc@1 83.270 Acc@5 97.224 loss 0.682
Max accuracy: 83.27%
Epoch: [5/30] [ 0/1251] eta: 0:39:38 lr: 0.000075 loss: 2.209451 (2.209451) time: 1.900977 data: 1.014769 max mem: 18812
Epoch: [5/30] [ 50/1251] eta: 0:19:42 lr: 0.000075 loss: 1.905142 (1.917064) time: 0.920362 data: 0.000166 max mem: 18812
Epoch: [5/30] [ 100/1251] eta: 0:18:41 lr: 0.000075 loss: 1.932738 (1.901703) time: 0.980790 data: 0.000174 max mem: 18812
Epoch: [5/30] [ 150/1251] eta: 0:17:47 lr: 0.000074 loss: 1.893975 (1.901860) time: 0.987697 data: 0.000184 max mem: 18812
Epoch: [5/30] [ 200/1251] eta: 0:16:51 lr: 0.000074 loss: 1.947010 (1.917580) time: 0.963803 data: 0.000197 max mem: 18812
Epoch: [5/30] [ 250/1251] eta: 0:15:58 lr: 0.000074 loss: 1.870482 (1.921539) time: 0.917345 data: 0.000194 max mem: 18812
Epoch: [5/30] [ 300/1251] eta: 0:15:11 lr: 0.000074 loss: 1.866921 (1.921628) time: 0.911576 data: 0.000170 max mem: 18812
Epoch: [5/30] [ 350/1251] eta: 0:14:24 lr: 0.000074 loss: 1.953143 (1.921888) time: 0.985249 data: 0.000171 max mem: 18812
Epoch: [5/30] [ 400/1251] eta: 0:13:36 lr: 0.000074 loss: 1.921026 (1.921441) time: 1.016263 data: 0.000180 max mem: 18812
Epoch: [5/30] [ 450/1251] eta: 0:12:49 lr: 0.000074 loss: 1.903014 (1.921728) time: 0.978920 data: 0.000165 max mem: 18812
Epoch: [5/30] [ 500/1251] eta: 0:12:00 lr: 0.000074 loss: 1.858516 (1.920903) time: 0.927472 data: 0.000178 max mem: 18812
Epoch: [5/30] [ 550/1251] eta: 0:11:11 lr: 0.000074 loss: 1.824472 (1.920795) time: 0.921937 data: 0.000163 max mem: 18812
Epoch: [5/30] [ 600/1251] eta: 0:10:24 lr: 0.000074 loss: 1.880072 (1.921171) time: 0.963613 data: 0.000182 max mem: 18812
Epoch: [5/30] [ 650/1251] eta: 0:09:35 lr: 0.000074 loss: 1.919482 (1.920251) time: 0.977795 data: 0.000149 max mem: 18812
Epoch: [5/30] [ 700/1251] eta: 0:08:48 lr: 0.000073 loss: 1.879211 (1.918780) time: 0.984189 data: 0.000158 max mem: 18812
Epoch: [5/30] [ 750/1251] eta: 0:07:59 lr: 0.000073 loss: 1.888605 (1.916457) time: 0.927460 data: 0.000178 max mem: 18812
Epoch: [5/30] [ 800/1251] eta: 0:07:12 lr: 0.000073 loss: 1.871942 (1.914772) time: 0.917653 data: 0.000165 max mem: 18812
Epoch: [5/30] [ 850/1251] eta: 0:06:25 lr: 0.000073 loss: 1.971064 (1.917035) time: 0.985981 data: 0.000199 max mem: 18812
Epoch: [5/30] [ 900/1251] eta: 0:05:37 lr: 0.000073 loss: 1.834193 (1.916646) time: 0.988450 data: 0.000172 max mem: 18812
Epoch: [5/30] [ 950/1251] eta: 0:04:48 lr: 0.000073 loss: 1.839175 (1.917754) time: 0.975249 data: 0.000180 max mem: 18812
Epoch: [5/30] [1000/1251] eta: 0:04:00 lr: 0.000073 loss: 1.831096 (1.918259) time: 0.911295 data: 0.000161 max mem: 18812
Epoch: [5/30] [1050/1251] eta: 0:03:12 lr: 0.000073 loss: 1.834446 (1.915930) time: 0.921430 data: 0.000176 max mem: 18812
Epoch: [5/30] [1100/1251] eta: 0:02:24 lr: 0.000073 loss: 1.863348 (1.915434) time: 0.980570 data: 0.000188 max mem: 18812
Epoch: [5/30] [1150/1251] eta: 0:01:37 lr: 0.000073 loss: 1.774477 (1.913591) time: 1.008331 data: 0.000169 max mem: 18812
Epoch: [5/30] [1200/1251] eta: 0:00:48 lr: 0.000073 loss: 1.930065 (1.915164) time: 0.967118 data: 0.000175 max mem: 18812
Epoch: [5/30] [1250/1251] eta: 0:00:00 lr: 0.000072 loss: 1.948004 (1.915004) time: 0.910531 data: 0.000758 max mem: 18812
Epoch: [5/30] Total time: 0:20:00 (0.959921 s / it)
Averaged stats: lr: 0.000072 loss: 1.948004 (1.908953)
Test: [ 0/49] eta: 0:01:22 loss: 0.495361 (0.495361) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.688453 data: 1.203045 max mem: 18812
Test: [10/49] eta: 0:00:19 loss: 0.596660 (0.649006) acc1: 84.375000 (84.659091) acc5: 95.312500 (96.875000) time: 0.500709 data: 0.109510 max mem: 18812
Test: [20/49] eta: 0:00:12 loss: 0.702886 (0.669918) acc1: 82.812500 (83.333333) acc5: 96.875000 (97.172619) time: 0.371829 data: 0.000149 max mem: 18812
Test: [30/49] eta: 0:00:07 loss: 0.702886 (0.670160) acc1: 82.812500 (83.316532) acc5: 96.875000 (97.177419) time: 0.361714 data: 0.000150 max mem: 18812
Test: [40/49] eta: 0:00:03 loss: 0.656528 (0.674092) acc1: 82.812500 (83.193598) acc5: 96.875000 (97.141768) time: 0.359572 data: 0.000147 max mem: 18812
Test: [48/49] eta: 0:00:00 loss: 0.662994 (0.675792) acc1: 82.812500 (83.360000) acc5: 96.875000 (97.088000) time: 0.354583 data: 0.000119 max mem: 18812
Test: Total time: 0:00:19 (0.392108 s / it)
* Acc@1 83.776 Acc@5 97.394 loss 0.661
Max accuracy: 83.78%
uploading checkpoint experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0005.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0005.pth
Epoch: [6/30] [ 0/1251] eta: 0:42:06 lr: 0.000072 loss: 2.256146 (2.256146) time: 2.019937 data: 1.114167 max mem: 18812
Epoch: [6/30] [ 50/1251] eta: 0:19:28 lr: 0.000072 loss: 1.814839 (1.910421) time: 0.907120 data: 0.000198 max mem: 18812
Epoch: [6/30] [ 100/1251] eta: 0:18:39 lr: 0.000072 loss: 1.836862 (1.900827) time: 1.009867 data: 0.000192 max mem: 18812
Epoch: [6/30] [ 150/1251] eta: 0:17:36 lr: 0.000072 loss: 1.837834 (1.893245) time: 0.957120 data: 0.000191 max mem: 18812
Epoch: [6/30] [ 200/1251] eta: 0:16:49 lr: 0.000072 loss: 1.950390 (1.899342) time: 0.974899 data: 0.000201 max mem: 18812
Epoch: [6/30] [ 250/1251] eta: 0:15:56 lr: 0.000072 loss: 1.975465 (1.902351) time: 0.912932 data: 0.000176 max mem: 18812
Epoch: [6/30] [ 300/1251] eta: 0:15:07 lr: 0.000072 loss: 1.930150 (1.898405) time: 0.906647 data: 0.000168 max mem: 18812
Epoch: [6/30] [ 350/1251] eta: 0:14:21 lr: 0.000072 loss: 1.868215 (1.898499) time: 0.965484 data: 0.000157 max mem: 18812
Epoch: [6/30] [ 400/1251] eta: 0:13:31 lr: 0.000072 loss: 1.878245 (1.893229) time: 0.968198 data: 0.000183 max mem: 18812
Epoch: [6/30] [ 450/1251] eta: 0:12:46 lr: 0.000072 loss: 1.874498 (1.891897) time: 1.004254 data: 0.000193 max mem: 18812
Epoch: [6/30] [ 500/1251] eta: 0:11:57 lr: 0.000071 loss: 1.852112 (1.889789) time: 0.937716 data: 0.000173 max mem: 18812
Epoch: [6/30] [ 550/1251] eta: 0:11:10 lr: 0.000071 loss: 1.916459 (1.886574) time: 0.922044 data: 0.000166 max mem: 18812
Epoch: [6/30] [ 600/1251] eta: 0:10:23 lr: 0.000071 loss: 1.976836 (1.886190) time: 0.989770 data: 0.000157 max mem: 18812
Epoch: [6/30] [ 650/1251] eta: 0:09:34 lr: 0.000071 loss: 1.822263 (1.883937) time: 0.972389 data: 0.000164 max mem: 18812
Epoch: [6/30] [ 700/1251] eta: 0:08:47 lr: 0.000071 loss: 1.887238 (1.881142) time: 0.976707 data: 0.000164 max mem: 18812
Epoch: [6/30] [ 750/1251] eta: 0:07:58 lr: 0.000071 loss: 1.895592 (1.882614) time: 0.928687 data: 0.000185 max mem: 18812
Epoch: [6/30] [ 800/1251] eta: 0:07:11 lr: 0.000071 loss: 1.866830 (1.882707) time: 0.961178 data: 0.000178 max mem: 18812
Epoch: [6/30] [ 850/1251] eta: 0:06:23 lr: 0.000071 loss: 1.815309 (1.881441) time: 0.967870 data: 0.000181 max mem: 18812
Epoch: [6/30] [ 900/1251] eta: 0:05:35 lr: 0.000071 loss: 1.792257 (1.881032) time: 0.960536 data: 0.000179 max mem: 18812
Epoch: [6/30] [ 950/1251] eta: 0:04:47 lr: 0.000070 loss: 1.799257 (1.879653) time: 0.965216 data: 0.000187 max mem: 18812
Epoch: [6/30] [1000/1251] eta: 0:03:59 lr: 0.000070 loss: 1.850610 (1.880246) time: 0.933749 data: 0.000154 max mem: 18812
Epoch: [6/30] [1050/1251] eta: 0:03:12 lr: 0.000070 loss: 1.883662 (1.878953) time: 0.995367 data: 0.000158 max mem: 18812
Epoch: [6/30] [1100/1251] eta: 0:02:24 lr: 0.000070 loss: 1.725171 (1.877425) time: 0.959723 data: 0.000178 max mem: 18812
Epoch: [6/30] [1150/1251] eta: 0:01:36 lr: 0.000070 loss: 1.803506 (1.875147) time: 0.954237 data: 0.000173 max mem: 18812
Epoch: [6/30] [1200/1251] eta: 0:00:48 lr: 0.000070 loss: 1.794795 (1.873223) time: 0.908556 data: 0.000174 max mem: 18812
Epoch: [6/30] [1250/1251] eta: 0:00:00 lr: 0.000070 loss: 1.869519 (1.873383) time: 0.910989 data: 0.000738 max mem: 18812
Epoch: [6/30] Total time: 0:19:54 (0.954660 s / it)
Averaged stats: lr: 0.000070 loss: 1.869519 (1.874207)
Test: [ 0/49] eta: 0:01:23 loss: 0.467843 (0.467843) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.712952 data: 1.300659 max mem: 18812
Test: [10/49] eta: 0:00:19 loss: 0.562363 (0.616298) acc1: 85.937500 (84.375000) acc5: 96.875000 (97.017045) time: 0.490982 data: 0.118395 max mem: 18812
Test: [20/49] eta: 0:00:12 loss: 0.681767 (0.643998) acc1: 82.812500 (83.556548) acc5: 96.875000 (97.247024) time: 0.365281 data: 0.000151 max mem: 18812
Test: [30/49] eta: 0:00:07 loss: 0.676632 (0.645184) acc1: 82.812500 (83.618952) acc5: 96.875000 (97.227823) time: 0.367272 data: 0.000144 max mem: 18812
Test: [40/49] eta: 0:00:03 loss: 0.654451 (0.654161) acc1: 84.375000 (83.727134) acc5: 96.875000 (97.103659) time: 0.367614 data: 0.000153 max mem: 18812
Test: [48/49] eta: 0:00:00 loss: 0.654451 (0.655517) acc1: 84.375000 (83.808000) acc5: 96.875000 (97.216000) time: 0.359822 data: 0.000123 max mem: 18812
Test: Total time: 0:00:19 (0.395189 s / it)
* Acc@1 84.042 Acc@5 97.468 loss 0.647
Max accuracy: 84.04%
Epoch: [7/30] [ 0/1251] eta: 0:42:05 lr: 0.000070 loss: 1.779595 (1.779595) time: 2.018712 data: 1.116346 max mem: 18812
Epoch: [7/30] [ 50/1251] eta: 0:19:36 lr: 0.000070 loss: 1.883872 (1.866050) time: 0.979339 data: 0.000178 max mem: 18812
Epoch: [7/30] [ 100/1251] eta: 0:18:38 lr: 0.000070 loss: 1.808625 (1.840765) time: 0.965782 data: 0.000183 max mem: 18812
Epoch: [7/30] [ 150/1251] eta: 0:17:40 lr: 0.000069 loss: 1.749591 (1.840650) time: 0.977369 data: 0.000175 max mem: 18812
Epoch: [7/30] [ 200/1251] eta: 0:16:50 lr: 0.000069 loss: 1.878559 (1.843975) time: 0.948848 data: 0.000185 max mem: 18812
Epoch: [7/30] [ 250/1251] eta: 0:15:59 lr: 0.000069 loss: 1.907720 (1.849435) time: 0.911855 data: 0.000186 max mem: 18812
Epoch: [7/30] [ 300/1251] eta: 0:15:13 lr: 0.000069 loss: 1.825781 (1.846732) time: 0.983979 data: 0.000170 max mem: 18812
Epoch: [7/30] [ 350/1251] eta: 0:14:24 lr: 0.000069 loss: 1.848579 (1.847570) time: 0.969997 data: 0.000161 max mem: 18812
Epoch: [7/30] [ 400/1251] eta: 0:13:34 lr: 0.000069 loss: 1.778763 (1.842124) time: 0.978028 data: 0.000166 max mem: 18812
Epoch: [7/30] [ 450/1251] eta: 0:12:45 lr: 0.000069 loss: 1.827060 (1.843205) time: 0.927645 data: 0.000182 max mem: 18812
Epoch: [7/30] [ 500/1251] eta: 0:11:58 lr: 0.000069 loss: 1.797400 (1.839637) time: 0.921528 data: 0.000180 max mem: 18812
Epoch: [7/30] [ 550/1251] eta: 0:11:11 lr: 0.000069 loss: 1.816780 (1.839389) time: 1.006770 data: 0.000171 max mem: 18812
Epoch: [7/30] [ 600/1251] eta: 0:10:24 lr: 0.000068 loss: 1.844169 (1.839469) time: 0.987966 data: 0.000163 max mem: 18812
Epoch: [7/30] [ 650/1251] eta: 0:09:35 lr: 0.000068 loss: 1.827510 (1.843036) time: 0.966906 data: 0.000169 max mem: 18812
Epoch: [7/30] [ 700/1251] eta: 0:08:47 lr: 0.000068 loss: 1.900440 (1.846435) time: 0.915585 data: 0.000164 max mem: 18812
Epoch: [7/30] [ 750/1251] eta: 0:07:59 lr: 0.000068 loss: 1.867763 (1.845455) time: 0.915976 data: 0.000193 max mem: 18812
Epoch: [7/30] [ 800/1251] eta: 0:07:12 lr: 0.000068 loss: 1.879420 (1.848077) time: 0.982716 data: 0.000174 max mem: 18812
Epoch: [7/30] [ 850/1251] eta: 0:06:24 lr: 0.000068 loss: 1.771608 (1.846991) time: 0.977121 data: 0.000202 max mem: 18812
Epoch: [7/30] [ 900/1251] eta: 0:05:36 lr: 0.000068 loss: 1.894188 (1.847259) time: 0.978867 data: 0.000184 max mem: 18812
Epoch: [7/30] [ 950/1251] eta: 0:04:48 lr: 0.000068 loss: 1.794217 (1.848682) time: 0.921566 data: 0.000181 max mem: 18812
Epoch: [7/30] [1000/1251] eta: 0:04:00 lr: 0.000067 loss: 1.826646 (1.846241) time: 0.918980 data: 0.000161 max mem: 18812
Epoch: [7/30] [1050/1251] eta: 0:03:12 lr: 0.000067 loss: 1.876930 (1.846889) time: 0.967100 data: 0.000178 max mem: 18812
Epoch: [7/30] [1100/1251] eta: 0:02:24 lr: 0.000067 loss: 1.738198 (1.846129) time: 0.982864 data: 0.000198 max mem: 18812
Epoch: [7/30] [1150/1251] eta: 0:01:36 lr: 0.000067 loss: 1.797922 (1.844263) time: 0.962484 data: 0.000164 max mem: 18812
Epoch: [7/30] [1200/1251] eta: 0:00:48 lr: 0.000067 loss: 1.831975 (1.844344) time: 0.910105 data: 0.000173 max mem: 18812
Epoch: [7/30] [1250/1251] eta: 0:00:00 lr: 0.000067 loss: 1.797986 (1.844356) time: 0.906929 data: 0.000797 max mem: 18812
Epoch: [7/30] Total time: 0:19:55 (0.955472 s / it)
Averaged stats: lr: 0.000067 loss: 1.797986 (1.847148) Test: [ 0/49] eta: 0:01:15 loss: 0.435593 (0.435593) acc1: 89.062500 (89.062500) acc5: 98.437500 (98.437500) time: 1.535293 data: 1.087272 max mem: 18812 Test: [10/49] eta: 0:00:19 loss: 0.560252 (0.600161) acc1: 82.812500 (84.943182) acc5: 98.437500 (97.443182) time: 0.492593 data: 0.099003 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.661968 (0.628444) acc1: 82.812500 (84.523810) acc5: 96.875000 (97.470238) time: 0.377545 data: 0.000149 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.661968 (0.631338) acc1: 82.812500 (84.324597) acc5: 96.875000 (97.580645) time: 0.364446 data: 0.000131 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.639653 (0.639994) acc1: 84.375000 (84.375000) acc5: 96.875000 (97.370427) time: 0.363160 data: 0.000125 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.641627 (0.641506) acc1: 84.375000 (84.512000) acc5: 98.437500 (97.472000) time: 0.360468 data: 0.000098 max mem: 18812 Test: Total time: 0:00:19 (0.394204 s / it) * Acc@1 84.414 Acc@5 97.568 loss 0.631 Max accuracy: 84.41% Epoch: [8/30] [ 0/1251] eta: 0:39:02 lr: 0.000067 loss: 2.278732 (2.278732) time: 1.872609 data: 0.978264 max mem: 18812 Epoch: [8/30] [ 50/1251] eta: 0:19:05 lr: 0.000067 loss: 1.837615 (1.862578) time: 0.955880 data: 0.000204 max mem: 18812 Epoch: [8/30] [ 100/1251] eta: 0:18:05 lr: 0.000067 loss: 1.782380 (1.832593) time: 0.913699 data: 0.000201 max mem: 18812 Epoch: [8/30] [ 150/1251] eta: 0:17:26 lr: 0.000067 loss: 1.833732 (1.838804) time: 0.915864 data: 0.000190 max mem: 18812 Epoch: [8/30] [ 200/1251] eta: 0:16:38 lr: 0.000066 loss: 1.821814 (1.840462) time: 0.965137 data: 0.000193 max mem: 18812 Epoch: [8/30] [ 250/1251] eta: 0:15:50 lr: 0.000066 loss: 1.822512 (1.838280) time: 0.982232 data: 0.000166 max mem: 18812 Epoch: [8/30] [ 300/1251] eta: 0:15:03 lr: 0.000066 loss: 1.872082 (1.835882) time: 0.956943 data: 0.000173 max mem: 18812 Epoch: [8/30] [ 350/1251] eta: 0:14:15 lr: 0.000066 loss: 
1.781383 (1.827364) time: 0.928398 data: 0.000169 max mem: 18812 Epoch: [8/30] [ 400/1251] eta: 0:13:28 lr: 0.000066 loss: 1.819593 (1.825362) time: 0.909979 data: 0.000179 max mem: 18812 Epoch: [8/30] [ 450/1251] eta: 0:12:43 lr: 0.000066 loss: 1.819228 (1.823005) time: 0.991457 data: 0.000175 max mem: 18812 Epoch: [8/30] [ 500/1251] eta: 0:11:55 lr: 0.000066 loss: 1.873376 (1.826838) time: 0.968816 data: 0.000181 max mem: 18812 Epoch: [8/30] [ 550/1251] eta: 0:11:08 lr: 0.000065 loss: 1.787760 (1.824584) time: 0.982891 data: 0.000179 max mem: 18812 Epoch: [8/30] [ 600/1251] eta: 0:10:19 lr: 0.000065 loss: 1.824075 (1.827164) time: 0.912751 data: 0.000178 max mem: 18812 Epoch: [8/30] [ 650/1251] eta: 0:09:31 lr: 0.000065 loss: 1.757083 (1.825754) time: 0.924617 data: 0.000157 max mem: 18812 Epoch: [8/30] [ 700/1251] eta: 0:08:46 lr: 0.000065 loss: 1.815807 (1.827021) time: 0.967581 data: 0.000176 max mem: 18812 Epoch: [8/30] [ 750/1251] eta: 0:07:57 lr: 0.000065 loss: 1.800824 (1.826980) time: 0.970112 data: 0.000181 max mem: 18812 Epoch: [8/30] [ 800/1251] eta: 0:07:10 lr: 0.000065 loss: 1.813806 (1.827718) time: 0.969368 data: 0.000200 max mem: 18812 Epoch: [8/30] [ 850/1251] eta: 0:06:22 lr: 0.000065 loss: 1.758800 (1.826298) time: 0.918235 data: 0.000178 max mem: 18812 Epoch: [8/30] [ 900/1251] eta: 0:05:34 lr: 0.000065 loss: 1.851745 (1.827945) time: 0.919808 data: 0.000198 max mem: 18812 Epoch: [8/30] [ 950/1251] eta: 0:04:47 lr: 0.000064 loss: 1.836256 (1.829967) time: 0.984151 data: 0.000187 max mem: 18812 Epoch: [8/30] [1000/1251] eta: 0:03:59 lr: 0.000064 loss: 1.838976 (1.829319) time: 0.980888 data: 0.000154 max mem: 18812 Epoch: [8/30] [1050/1251] eta: 0:03:11 lr: 0.000064 loss: 1.871866 (1.831278) time: 0.967550 data: 0.000162 max mem: 18812 Epoch: [8/30] [1100/1251] eta: 0:02:23 lr: 0.000064 loss: 1.872787 (1.830536) time: 0.915898 data: 0.000186 max mem: 18812 Epoch: [8/30] [1150/1251] eta: 0:01:36 lr: 0.000064 loss: 1.755787 (1.828655) time: 
0.933319 data: 0.000179 max mem: 18812 Epoch: [8/30] [1200/1251] eta: 0:00:48 lr: 0.000064 loss: 1.878102 (1.829924) time: 0.958388 data: 0.000175 max mem: 18812 Epoch: [8/30] [1250/1251] eta: 0:00:00 lr: 0.000064 loss: 1.771639 (1.830442) time: 0.966101 data: 0.000760 max mem: 18812 Epoch: [8/30] Total time: 0:19:54 (0.954829 s / it) Averaged stats: lr: 0.000064 loss: 1.771639 (1.830775) Test: [ 0/49] eta: 0:01:15 loss: 0.382962 (0.382962) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.540995 data: 1.109775 max mem: 18812 Test: [10/49] eta: 0:00:19 loss: 0.554015 (0.583609) acc1: 84.375000 (85.795455) acc5: 98.437500 (97.443182) time: 0.498620 data: 0.101040 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.672994 (0.612777) acc1: 82.812500 (84.895833) acc5: 96.875000 (97.619048) time: 0.377575 data: 0.000147 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.672994 (0.617474) acc1: 84.375000 (84.929435) acc5: 96.875000 (97.782258) time: 0.360491 data: 0.000139 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.632356 (0.625562) acc1: 85.937500 (84.794207) acc5: 96.875000 (97.675305) time: 0.362843 data: 0.000149 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.635358 (0.629469) acc1: 84.375000 (84.864000) acc5: 98.437500 (97.696000) time: 0.358578 data: 0.000125 max mem: 18812 Test: Total time: 0:00:19 (0.393537 s / it) * Acc@1 84.760 Acc@5 97.672 loss 0.622 Max accuracy: 84.76% Epoch: [9/30] [ 0/1251] eta: 0:40:33 lr: 0.000064 loss: 1.944867 (1.944867) time: 1.945437 data: 1.060756 max mem: 18812 Epoch: [9/30] [ 50/1251] eta: 0:19:15 lr: 0.000064 loss: 1.783577 (1.799547) time: 0.907094 data: 0.000169 max mem: 18812 Epoch: [9/30] [ 100/1251] eta: 0:18:31 lr: 0.000063 loss: 1.860990 (1.815781) time: 0.940380 data: 0.000196 max mem: 18812 Epoch: [9/30] [ 150/1251] eta: 0:17:43 lr: 0.000063 loss: 1.758959 (1.819963) time: 0.937975 data: 0.000190 max mem: 18812 Epoch: [9/30] [ 200/1251] eta: 0:16:55 lr: 0.000063 loss: 1.796161 (1.808801) time: 
0.976634 data: 0.000177 max mem: 18812 Epoch: [9/30] [ 250/1251] eta: 0:16:03 lr: 0.000063 loss: 1.834486 (1.803020) time: 0.974104 data: 0.000483 max mem: 18812 Epoch: [9/30] [ 300/1251] eta: 0:15:11 lr: 0.000063 loss: 1.728493 (1.805808) time: 0.905589 data: 0.000151 max mem: 18812 Epoch: [9/30] [ 350/1251] eta: 0:14:23 lr: 0.000063 loss: 1.730683 (1.800324) time: 0.925737 data: 0.000173 max mem: 18812 Epoch: [9/30] [ 400/1251] eta: 0:13:35 lr: 0.000063 loss: 1.815730 (1.803007) time: 0.928184 data: 0.000159 max mem: 18812 Epoch: [9/30] [ 450/1251] eta: 0:12:47 lr: 0.000062 loss: 1.809693 (1.804973) time: 1.026757 data: 0.000169 max mem: 18812 Epoch: [9/30] [ 500/1251] eta: 0:11:59 lr: 0.000062 loss: 1.781762 (1.804799) time: 0.987336 data: 0.000200 max mem: 18812 Epoch: [9/30] [ 550/1251] eta: 0:11:10 lr: 0.000062 loss: 1.758025 (1.804904) time: 0.914310 data: 0.000193 max mem: 18812 Epoch: [9/30] [ 600/1251] eta: 0:10:23 lr: 0.000062 loss: 1.775693 (1.805819) time: 0.924592 data: 0.000199 max mem: 18812 Epoch: [9/30] [ 650/1251] eta: 0:09:35 lr: 0.000062 loss: 1.813233 (1.805330) time: 0.959064 data: 0.000166 max mem: 18812 Epoch: [9/30] [ 700/1251] eta: 0:08:48 lr: 0.000062 loss: 1.779301 (1.806601) time: 1.030171 data: 0.000153 max mem: 18812 Epoch: [9/30] [ 750/1251] eta: 0:08:00 lr: 0.000062 loss: 1.890613 (1.810662) time: 0.987439 data: 0.000185 max mem: 18812 Epoch: [9/30] [ 800/1251] eta: 0:07:11 lr: 0.000061 loss: 1.734972 (1.808342) time: 0.913873 data: 0.000186 max mem: 18812 Epoch: [9/30] [ 850/1251] eta: 0:06:23 lr: 0.000061 loss: 1.802484 (1.810302) time: 0.935054 data: 0.000175 max mem: 18812 Epoch: [9/30] [ 900/1251] eta: 0:05:36 lr: 0.000061 loss: 1.855392 (1.810814) time: 1.009262 data: 0.000221 max mem: 18812 Epoch: [9/30] [ 950/1251] eta: 0:04:48 lr: 0.000061 loss: 1.788210 (1.809850) time: 1.056885 data: 0.000176 max mem: 18812 Epoch: [9/30] [1000/1251] eta: 0:04:00 lr: 0.000061 loss: 1.816767 (1.810689) time: 0.964520 data: 0.000163 max 
mem: 18812 Epoch: [9/30] [1050/1251] eta: 0:03:12 lr: 0.000061 loss: 1.780681 (1.809665) time: 0.909057 data: 0.000173 max mem: 18812 Epoch: [9/30] [1100/1251] eta: 0:02:24 lr: 0.000061 loss: 1.880471 (1.810270) time: 0.927340 data: 0.000163 max mem: 18812 Epoch: [9/30] [1150/1251] eta: 0:01:36 lr: 0.000060 loss: 1.739758 (1.808993) time: 0.965909 data: 0.000171 max mem: 18812 Epoch: [9/30] [1200/1251] eta: 0:00:48 lr: 0.000060 loss: 1.835641 (1.808544) time: 1.022069 data: 0.000172 max mem: 18812 Epoch: [9/30] [1250/1251] eta: 0:00:00 lr: 0.000060 loss: 1.790626 (1.807763) time: 0.978072 data: 0.000743 max mem: 18812 Epoch: [9/30] Total time: 0:19:58 (0.958159 s / it) Averaged stats: lr: 0.000060 loss: 1.790626 (1.811611) Test: [ 0/49] eta: 0:01:14 loss: 0.417019 (0.417019) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.525340 data: 1.098037 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.558769 (0.580052) acc1: 84.375000 (84.943182) acc5: 98.437500 (97.585227) time: 0.474472 data: 0.099953 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.628082 (0.609892) acc1: 82.812500 (84.002976) acc5: 96.875000 (97.693452) time: 0.364413 data: 0.000140 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.635185 (0.614742) acc1: 84.375000 (84.122984) acc5: 96.875000 (97.782258) time: 0.360617 data: 0.000135 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.632385 (0.623902) acc1: 84.375000 (84.070122) acc5: 96.875000 (97.560976) time: 0.359418 data: 0.000128 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.632385 (0.624864) acc1: 84.375000 (84.416000) acc5: 96.875000 (97.664000) time: 0.354011 data: 0.000102 max mem: 18812 Test: Total time: 0:00:18 (0.386004 s / it) * Acc@1 84.878 Acc@5 97.684 loss 0.615 Max accuracy: 84.88% Epoch: [10/30] [ 0/1251] eta: 0:42:10 lr: 0.000060 loss: 1.947424 (1.947424) time: 2.022711 data: 1.138835 max mem: 18812 Epoch: [10/30] [ 50/1251] eta: 0:19:28 lr: 0.000060 loss: 1.812691 (1.828199) time: 0.917713 data: 0.000173 max 
mem: 18812 Epoch: [10/30] [ 100/1251] eta: 0:18:40 lr: 0.000060 loss: 1.819594 (1.814202) time: 0.939556 data: 0.000189 max mem: 18812 Epoch: [10/30] [ 150/1251] eta: 0:17:49 lr: 0.000060 loss: 1.803063 (1.798251) time: 0.979080 data: 0.000183 max mem: 18812 Epoch: [10/30] [ 200/1251] eta: 0:16:55 lr: 0.000060 loss: 1.830863 (1.799756) time: 1.007169 data: 0.000182 max mem: 18812 Epoch: [10/30] [ 250/1251] eta: 0:16:01 lr: 0.000059 loss: 1.781184 (1.799319) time: 0.970007 data: 0.000175 max mem: 18812 Epoch: [10/30] [ 300/1251] eta: 0:15:10 lr: 0.000059 loss: 1.859955 (1.802404) time: 0.916236 data: 0.000150 max mem: 18812 Epoch: [10/30] [ 350/1251] eta: 0:14:24 lr: 0.000059 loss: 1.838077 (1.805196) time: 0.953055 data: 0.000169 max mem: 18812 Epoch: [10/30] [ 400/1251] eta: 0:13:37 lr: 0.000059 loss: 1.825118 (1.803604) time: 0.967650 data: 0.000172 max mem: 18812 Epoch: [10/30] [ 450/1251] eta: 0:12:49 lr: 0.000059 loss: 1.734564 (1.806753) time: 1.030101 data: 0.000166 max mem: 18812 Epoch: [10/30] [ 500/1251] eta: 0:12:00 lr: 0.000059 loss: 1.714654 (1.800688) time: 0.987611 data: 0.000152 max mem: 18812 Epoch: [10/30] [ 550/1251] eta: 0:11:10 lr: 0.000059 loss: 1.808621 (1.801928) time: 0.911597 data: 0.000162 max mem: 18812 Epoch: [10/30] [ 600/1251] eta: 0:10:23 lr: 0.000058 loss: 1.790093 (1.800743) time: 0.931985 data: 0.000162 max mem: 18812 Epoch: [10/30] [ 650/1251] eta: 0:09:36 lr: 0.000058 loss: 1.814965 (1.803377) time: 0.981554 data: 0.000165 max mem: 18812 Epoch: [10/30] [ 700/1251] eta: 0:08:48 lr: 0.000058 loss: 1.772759 (1.800370) time: 1.023503 data: 0.000158 max mem: 18812 Epoch: [10/30] [ 750/1251] eta: 0:08:00 lr: 0.000058 loss: 1.766604 (1.796488) time: 0.971178 data: 0.000187 max mem: 18812 Epoch: [10/30] [ 800/1251] eta: 0:07:11 lr: 0.000058 loss: 1.728859 (1.795182) time: 0.930150 data: 0.000188 max mem: 18812 Epoch: [10/30] [ 850/1251] eta: 0:06:24 lr: 0.000058 loss: 1.800125 (1.795028) time: 0.942865 data: 0.000205 max mem: 18812 
Epoch: [10/30] [ 900/1251] eta: 0:05:36 lr: 0.000058 loss: 1.775710 (1.795155) time: 0.978935 data: 0.000190 max mem: 18812 Epoch: [10/30] [ 950/1251] eta: 0:04:48 lr: 0.000057 loss: 1.732173 (1.795935) time: 1.029482 data: 0.000179 max mem: 18812 Epoch: [10/30] [1000/1251] eta: 0:04:00 lr: 0.000057 loss: 1.753081 (1.797212) time: 0.943958 data: 0.000157 max mem: 18812 Epoch: [10/30] [1050/1251] eta: 0:03:12 lr: 0.000057 loss: 1.743012 (1.796324) time: 0.924558 data: 0.000166 max mem: 18812 Epoch: [10/30] [1100/1251] eta: 0:02:24 lr: 0.000057 loss: 1.821608 (1.796947) time: 0.925094 data: 0.000170 max mem: 18812 Epoch: [10/30] [1150/1251] eta: 0:01:36 lr: 0.000057 loss: 1.691934 (1.796580) time: 0.974770 data: 0.000163 max mem: 18812 Epoch: [10/30] [1200/1251] eta: 0:00:48 lr: 0.000057 loss: 1.812716 (1.796938) time: 1.015968 data: 0.000156 max mem: 18812 Epoch: [10/30] [1250/1251] eta: 0:00:00 lr: 0.000056 loss: 1.756043 (1.796584) time: 0.913278 data: 0.000762 max mem: 18812 Epoch: [10/30] Total time: 0:19:56 (0.956775 s / it) Averaged stats: lr: 0.000056 loss: 1.756043 (1.797633) Test: [ 0/49] eta: 0:01:50 loss: 0.384241 (0.384241) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 2.250666 data: 1.441274 max mem: 18812 Test: [10/49] eta: 0:00:20 loss: 0.550690 (0.576986) acc1: 84.375000 (86.079545) acc5: 98.437500 (97.585227) time: 0.535571 data: 0.131155 max mem: 18812 Test: [20/49] eta: 0:00:13 loss: 0.645569 (0.607402) acc1: 82.812500 (84.747024) acc5: 98.437500 (97.693452) time: 0.364301 data: 0.000136 max mem: 18812 Test: [30/49] eta: 0:00:08 loss: 0.645569 (0.608842) acc1: 82.812500 (84.526210) acc5: 98.437500 (97.832661) time: 0.363743 data: 0.000142 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.621702 (0.616430) acc1: 84.375000 (84.679878) acc5: 96.875000 (97.675305) time: 0.359722 data: 0.000140 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.621702 (0.618117) acc1: 84.375000 (84.864000) acc5: 98.437500 (97.728000) time: 0.354182 
data: 0.000115 max mem: 18812 Test: Total time: 0:00:19 (0.401557 s / it) * Acc@1 84.998 Acc@5 97.750 loss 0.610 Max accuracy: 85.00% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0010.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0010.pth Epoch: [11/30] [ 0/1251] eta: 0:44:55 lr: 0.000056 loss: 1.585870 (1.585870) time: 2.154571 data: 1.246629 max mem: 18812 Epoch: [11/30] [ 50/1251] eta: 0:19:36 lr: 0.000056 loss: 1.803415 (1.769002) time: 0.934130 data: 0.000238 max mem: 18812 Epoch: [11/30] [ 100/1251] eta: 0:18:40 lr: 0.000056 loss: 1.739824 (1.783141) time: 0.932610 data: 0.000199 max mem: 18812 Epoch: [11/30] [ 150/1251] eta: 0:17:48 lr: 0.000056 loss: 1.830393 (1.791663) time: 0.974256 data: 0.000182 max mem: 18812 Epoch: [11/30] [ 200/1251] eta: 0:16:57 lr: 0.000056 loss: 1.758318 (1.783742) time: 1.016521 data: 0.000185 max mem: 18812 Epoch: [11/30] [ 250/1251] eta: 0:16:03 lr: 0.000056 loss: 1.739699 (1.780403) time: 0.964672 data: 0.000230 max mem: 18812 Epoch: [11/30] [ 300/1251] eta: 0:15:13 lr: 0.000056 loss: 1.749462 (1.775432) time: 0.910925 data: 0.000193 max mem: 18812 Epoch: [11/30] [ 350/1251] eta: 0:14:25 lr: 0.000055 loss: 1.790305 (1.774335) time: 0.936759 data: 0.000203 max mem: 18812 Epoch: [11/30] [ 400/1251] eta: 0:13:38 lr: 0.000055 loss: 1.715571 (1.774264) time: 0.984669 data: 0.000239 max mem: 18812 Epoch: [11/30] [ 450/1251] eta: 0:12:50 lr: 0.000055 loss: 1.808415 (1.780526) time: 1.030403 data: 0.000250 max mem: 18812 Epoch: [11/30] [ 500/1251] eta: 0:12:01 lr: 0.000055 loss: 1.776036 (1.784324) time: 0.977779 data: 0.000245 max mem: 18812 Epoch: [11/30] [ 550/1251] eta: 0:11:14 lr: 0.000055 loss: 1.708061 (1.783457) time: 0.921505 data: 0.000209 max mem: 18812 Epoch: [11/30] [ 600/1251] eta: 0:10:26 lr: 0.000055 loss: 1.761101 (1.782704) time: 0.946184 data: 0.000217 
max mem: 18812 Epoch: [11/30] [ 650/1251] eta: 0:09:38 lr: 0.000054 loss: 1.764183 (1.783000) time: 0.987385 data: 0.000156 max mem: 18812 Epoch: [11/30] [ 700/1251] eta: 0:08:49 lr: 0.000054 loss: 1.719653 (1.783106) time: 0.995628 data: 0.000155 max mem: 18812 Epoch: [11/30] [ 750/1251] eta: 0:08:01 lr: 0.000054 loss: 1.788435 (1.783162) time: 0.976634 data: 0.000201 max mem: 18812 Epoch: [11/30] [ 800/1251] eta: 0:07:12 lr: 0.000054 loss: 1.883777 (1.786632) time: 0.919294 data: 0.000182 max mem: 18812 Epoch: [11/30] [ 850/1251] eta: 0:06:25 lr: 0.000054 loss: 1.745838 (1.785887) time: 0.963787 data: 0.000187 max mem: 18812 Epoch: [11/30] [ 900/1251] eta: 0:05:37 lr: 0.000054 loss: 1.749315 (1.788516) time: 0.959392 data: 0.000196 max mem: 18812 Epoch: [11/30] [ 950/1251] eta: 0:04:49 lr: 0.000054 loss: 1.829687 (1.789492) time: 0.978041 data: 0.000180 max mem: 18812 Epoch: [11/30] [1000/1251] eta: 0:04:00 lr: 0.000053 loss: 1.750024 (1.789593) time: 0.924542 data: 0.000153 max mem: 18812 Epoch: [11/30] [1050/1251] eta: 0:03:12 lr: 0.000053 loss: 1.798124 (1.790179) time: 0.920999 data: 0.000194 max mem: 18812 Epoch: [11/30] [1100/1251] eta: 0:02:24 lr: 0.000053 loss: 1.732297 (1.791065) time: 0.977094 data: 0.000201 max mem: 18812 Epoch: [11/30] [1150/1251] eta: 0:01:36 lr: 0.000053 loss: 1.823976 (1.792204) time: 0.960438 data: 0.000199 max mem: 18812 Epoch: [11/30] [1200/1251] eta: 0:00:48 lr: 0.000053 loss: 1.647201 (1.790728) time: 0.972585 data: 0.000216 max mem: 18812 Epoch: [11/30] [1250/1251] eta: 0:00:00 lr: 0.000053 loss: 1.683604 (1.788092) time: 0.908721 data: 0.000777 max mem: 18812 Epoch: [11/30] Total time: 0:19:59 (0.958489 s / it) Averaged stats: lr: 0.000053 loss: 1.683604 (1.786280) Test: [ 0/49] eta: 0:01:29 loss: 0.374121 (0.374121) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.826945 data: 1.416971 max mem: 18812 Test: [10/49] eta: 0:00:19 loss: 0.544050 (0.557762) acc1: 85.937500 (85.795455) acc5: 98.437500 (97.443182) 
time: 0.498987 data: 0.128959 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.627999 (0.592346) acc1: 84.375000 (84.895833) acc5: 96.875000 (97.395833) time: 0.369679 data: 0.000146 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.627999 (0.594132) acc1: 82.812500 (84.879032) acc5: 98.437500 (97.681452) time: 0.367741 data: 0.000130 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.605059 (0.605683) acc1: 84.375000 (84.756098) acc5: 98.437500 (97.560976) time: 0.359006 data: 0.000120 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.614107 (0.608441) acc1: 84.375000 (84.960000) acc5: 96.875000 (97.600000) time: 0.353879 data: 0.000101 max mem: 18812 Test: Total time: 0:00:19 (0.394556 s / it) * Acc@1 85.066 Acc@5 97.756 loss 0.606 Max accuracy: 85.07% Epoch: [12/30] [ 0/1251] eta: 0:43:05 lr: 0.000053 loss: 1.690533 (1.690533) time: 2.066912 data: 1.154934 max mem: 18812 Epoch: [12/30] [ 50/1251] eta: 0:20:02 lr: 0.000052 loss: 1.804166 (1.769934) time: 0.987684 data: 0.000202 max mem: 18812 Epoch: [12/30] [ 100/1251] eta: 0:18:36 lr: 0.000052 loss: 1.842350 (1.790599) time: 0.967123 data: 0.000186 max mem: 18812 Epoch: [12/30] [ 150/1251] eta: 0:17:42 lr: 0.000052 loss: 1.857965 (1.786854) time: 0.956152 data: 0.000188 max mem: 18812 Epoch: [12/30] [ 200/1251] eta: 0:16:47 lr: 0.000052 loss: 1.748422 (1.777368) time: 0.918813 data: 0.000171 max mem: 18812 Epoch: [12/30] [ 250/1251] eta: 0:16:01 lr: 0.000052 loss: 1.791651 (1.779217) time: 0.929659 data: 0.000185 max mem: 18812 Epoch: [12/30] [ 300/1251] eta: 0:15:14 lr: 0.000052 loss: 1.787573 (1.780091) time: 0.983771 data: 0.000172 max mem: 18812 Epoch: [12/30] [ 350/1251] eta: 0:14:23 lr: 0.000051 loss: 1.778688 (1.776487) time: 0.972906 data: 0.000151 max mem: 18812 Epoch: [12/30] [ 400/1251] eta: 0:13:34 lr: 0.000051 loss: 1.787815 (1.778149) time: 0.943516 data: 0.000203 max mem: 18812 Epoch: [12/30] [ 450/1251] eta: 0:12:45 lr: 0.000051 loss: 1.782403 (1.780407) time: 0.916707 data: 0.000170 max mem: 
18812 Epoch: [12/30] [ 500/1251] eta: 0:11:57 lr: 0.000051 loss: 1.753670 (1.779463) time: 0.983501 data: 0.000187 max mem: 18812 Epoch: [12/30] [ 550/1251] eta: 0:11:09 lr: 0.000051 loss: 1.767211 (1.779418) time: 0.966435 data: 0.000168 max mem: 18812 Epoch: [12/30] [ 600/1251] eta: 0:10:21 lr: 0.000051 loss: 1.682500 (1.776121) time: 0.969220 data: 0.000202 max mem: 18812 Epoch: [12/30] [ 650/1251] eta: 0:09:33 lr: 0.000051 loss: 1.797923 (1.777011) time: 0.923353 data: 0.000159 max mem: 18812 Epoch: [12/30] [ 700/1251] eta: 0:08:46 lr: 0.000050 loss: 1.732703 (1.774815) time: 0.916068 data: 0.000170 max mem: 18812 Epoch: [12/30] [ 750/1251] eta: 0:07:58 lr: 0.000050 loss: 1.745026 (1.774076) time: 0.966533 data: 0.000191 max mem: 18812 Epoch: [12/30] [ 800/1251] eta: 0:07:10 lr: 0.000050 loss: 1.711819 (1.773857) time: 0.968300 data: 0.000195 max mem: 18812 Epoch: [12/30] [ 850/1251] eta: 0:06:22 lr: 0.000050 loss: 1.715222 (1.772865) time: 0.956483 data: 0.000189 max mem: 18812 Epoch: [12/30] [ 900/1251] eta: 0:05:34 lr: 0.000050 loss: 1.736004 (1.772683) time: 0.928643 data: 0.000194 max mem: 18812 Epoch: [12/30] [ 950/1251] eta: 0:04:47 lr: 0.000050 loss: 1.683893 (1.771506) time: 0.921910 data: 0.000175 max mem: 18812 Epoch: [12/30] [1000/1251] eta: 0:03:59 lr: 0.000049 loss: 1.713495 (1.772098) time: 0.966636 data: 0.000163 max mem: 18812 Epoch: [12/30] [1050/1251] eta: 0:03:11 lr: 0.000049 loss: 1.792511 (1.772514) time: 0.963066 data: 0.000172 max mem: 18812 Epoch: [12/30] [1100/1251] eta: 0:02:24 lr: 0.000049 loss: 1.705688 (1.771494) time: 0.973216 data: 0.000159 max mem: 18812 Epoch: [12/30] [1150/1251] eta: 0:01:36 lr: 0.000049 loss: 1.771270 (1.771122) time: 0.907426 data: 0.000156 max mem: 18812 Epoch: [12/30] [1200/1251] eta: 0:00:48 lr: 0.000049 loss: 1.743741 (1.770164) time: 0.913393 data: 0.000155 max mem: 18812 Epoch: [12/30] [1250/1251] eta: 0:00:00 lr: 0.000049 loss: 1.796166 (1.770378) time: 0.986074 data: 0.000769 max mem: 18812 Epoch: 
[12/30] Total time: 0:19:54 (0.954750 s / it) Averaged stats: lr: 0.000049 loss: 1.796166 (1.775423) Test: [ 0/49] eta: 0:01:29 loss: 0.362884 (0.362884) acc1: 90.625000 (90.625000) acc5: 100.000000 (100.000000) time: 1.825308 data: 1.415446 max mem: 18812 Test: [10/49] eta: 0:00:19 loss: 0.552410 (0.556036) acc1: 85.937500 (85.795455) acc5: 98.437500 (97.869318) time: 0.500668 data: 0.128832 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.616076 (0.588397) acc1: 84.375000 (84.970238) acc5: 96.875000 (97.619048) time: 0.365463 data: 0.000163 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.616076 (0.592740) acc1: 84.375000 (84.778226) acc5: 96.875000 (97.782258) time: 0.362916 data: 0.000157 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.606983 (0.604198) acc1: 84.375000 (84.870427) acc5: 96.875000 (97.637195) time: 0.379212 data: 0.000152 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.606983 (0.604578) acc1: 85.937500 (85.120000) acc5: 96.875000 (97.632000) time: 0.376306 data: 0.000125 max mem: 18812 Test: Total time: 0:00:19 (0.402026 s / it) * Acc@1 85.318 Acc@5 97.788 loss 0.601 Max accuracy: 85.32% Epoch: [13/30] [ 0/1251] eta: 0:41:47 lr: 0.000049 loss: 1.567338 (1.567338) time: 2.004309 data: 1.104059 max mem: 18812 Epoch: [13/30] [ 50/1251] eta: 0:19:48 lr: 0.000048 loss: 1.681407 (1.762277) time: 0.998556 data: 0.000181 max mem: 18812 Epoch: [13/30] [ 100/1251] eta: 0:18:32 lr: 0.000048 loss: 1.751907 (1.765504) time: 0.922253 data: 0.000196 max mem: 18812 Epoch: [13/30] [ 150/1251] eta: 0:17:38 lr: 0.000048 loss: 1.710547 (1.765519) time: 0.963127 data: 0.000189 max mem: 18812 Epoch: [13/30] [ 200/1251] eta: 0:16:50 lr: 0.000048 loss: 1.776474 (1.768635) time: 0.987799 data: 0.000189 max mem: 18812 Epoch: [13/30] [ 250/1251] eta: 0:15:56 lr: 0.000048 loss: 1.763463 (1.765112) time: 0.962954 data: 0.000181 max mem: 18812 Epoch: [13/30] [ 300/1251] eta: 0:15:06 lr: 0.000048 loss: 1.685082 (1.762051) time: 0.932518 data: 0.000175 max mem: 18812 
Epoch: [13/30] [ 350/1251] eta: 0:14:19 lr: 0.000047 loss: 1.719089 (1.760950) time: 0.904339 data: 0.000169 max mem: 18812 Epoch: [13/30] [ 400/1251] eta: 0:13:32 lr: 0.000047 loss: 1.733432 (1.760284) time: 0.984131 data: 0.000190 max mem: 18812 Epoch: [13/30] [ 450/1251] eta: 0:12:45 lr: 0.000047 loss: 1.762834 (1.763243) time: 0.969380 data: 0.000168 max mem: 18812 Epoch: [13/30] [ 500/1251] eta: 0:11:57 lr: 0.000047 loss: 1.679172 (1.762145) time: 0.986159 data: 0.000179 max mem: 18812 Epoch: [13/30] [ 550/1251] eta: 0:11:09 lr: 0.000047 loss: 1.787206 (1.761817) time: 0.912233 data: 0.000189 max mem: 18812 Epoch: [13/30] [ 600/1251] eta: 0:10:21 lr: 0.000047 loss: 1.745861 (1.762197) time: 0.913308 data: 0.000160 max mem: 18812 Epoch: [13/30] [ 650/1251] eta: 0:09:33 lr: 0.000046 loss: 1.807314 (1.764785) time: 0.967846 data: 0.000172 max mem: 18812 Epoch: [13/30] [ 700/1251] eta: 0:08:46 lr: 0.000046 loss: 1.688389 (1.764849) time: 0.959638 data: 0.000177 max mem: 18812 Epoch: [13/30] [ 750/1251] eta: 0:07:58 lr: 0.000046 loss: 1.738119 (1.765586) time: 0.973530 data: 0.000178 max mem: 18812 Epoch: [13/30] [ 800/1251] eta: 0:07:10 lr: 0.000046 loss: 1.672349 (1.764329) time: 0.919341 data: 0.000172 max mem: 18812 Epoch: [13/30] [ 850/1251] eta: 0:06:22 lr: 0.000046 loss: 1.734828 (1.764737) time: 0.924859 data: 0.000214 max mem: 18812 Epoch: [13/30] [ 900/1251] eta: 0:05:35 lr: 0.000046 loss: 1.669457 (1.764162) time: 0.983414 data: 0.000186 max mem: 18812 Epoch: [13/30] [ 950/1251] eta: 0:04:47 lr: 0.000045 loss: 1.661307 (1.763124) time: 0.983335 data: 0.000212 max mem: 18812 Epoch: [13/30] [1000/1251] eta: 0:03:59 lr: 0.000045 loss: 1.747259 (1.762642) time: 0.971802 data: 0.000168 max mem: 18812 Epoch: [13/30] [1050/1251] eta: 0:03:11 lr: 0.000045 loss: 1.741934 (1.762285) time: 0.911844 data: 0.000179 max mem: 18812 Epoch: [13/30] [1100/1251] eta: 0:02:24 lr: 0.000045 loss: 1.734775 (1.764348) time: 0.928401 data: 0.000195 max mem: 18812 Epoch: [13/30] 
[1150/1251] eta: 0:01:36 lr: 0.000045 loss: 1.788469 (1.764334) time: 0.961281 data: 0.000177 max mem: 18812 Epoch: [13/30] [1200/1251] eta: 0:00:48 lr: 0.000045 loss: 1.820967 (1.764593) time: 0.987689 data: 0.000164 max mem: 18812 Epoch: [13/30] [1250/1251] eta: 0:00:00 lr: 0.000044 loss: 1.733539 (1.763889) time: 0.989936 data: 0.000758 max mem: 18812 Epoch: [13/30] Total time: 0:19:55 (0.955638 s / it) Averaged stats: lr: 0.000044 loss: 1.733539 (1.766910) Test: [ 0/49] eta: 0:01:20 loss: 0.371555 (0.371555) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.648961 data: 1.220623 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.534734 (0.559179) acc1: 85.937500 (85.795455) acc5: 98.437500 (97.727273) time: 0.484040 data: 0.111125 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.623157 (0.588205) acc1: 84.375000 (85.119048) acc5: 98.437500 (97.842262) time: 0.364110 data: 0.000162 max mem: 18812 Test: [30/49] eta: 0:00:09 loss: 0.618018 (0.591600) acc1: 84.375000 (85.181452) acc5: 98.437500 (97.883065) time: 0.472334 data: 0.000142 max mem: 18812 Test: [40/49] eta: 0:00:04 loss: 0.612675 (0.602686) acc1: 84.375000 (85.022866) acc5: 96.875000 (97.751524) time: 0.470131 data: 0.000122 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.612675 (0.602632) acc1: 84.375000 (85.152000) acc5: 98.437500 (97.760000) time: 0.366527 data: 0.000101 max mem: 18812 Test: Total time: 0:00:21 (0.433190 s / it) * Acc@1 85.284 Acc@5 97.794 loss 0.599 Max accuracy: 85.32% Epoch: [14/30] [ 0/1251] eta: 0:42:41 lr: 0.000044 loss: 1.634925 (1.634925) time: 2.047743 data: 1.163482 max mem: 18812 Epoch: [14/30] [ 50/1251] eta: 0:19:43 lr: 0.000044 loss: 1.743064 (1.794205) time: 0.937626 data: 0.000190 max mem: 18812 Epoch: [14/30] [ 100/1251] eta: 0:18:38 lr: 0.000044 loss: 1.719532 (1.771160) time: 0.971811 data: 0.000189 max mem: 18812 Epoch: [14/30] [ 150/1251] eta: 0:17:38 lr: 0.000044 loss: 1.730282 (1.763111) time: 0.975560 data: 0.000183 max mem: 18812 Epoch: 
[14/30] [ 200/1251] eta: 0:16:48 lr: 0.000044 loss: 1.769789 (1.755802) time: 0.962545 data: 0.000188 max mem: 18812 Epoch: [14/30] [ 250/1251] eta: 0:15:57 lr: 0.000044 loss: 1.644609 (1.748820) time: 0.912829 data: 0.000177 max mem: 18812 Epoch: [14/30] [ 300/1251] eta: 0:15:12 lr: 0.000044 loss: 1.780897 (1.753280) time: 0.985921 data: 0.000169 max mem: 18812 Epoch: [14/30] [ 350/1251] eta: 0:14:22 lr: 0.000043 loss: 1.726714 (1.754379) time: 0.956388 data: 0.000163 max mem: 18812 Epoch: [14/30] [ 400/1251] eta: 0:13:35 lr: 0.000043 loss: 1.769570 (1.754750) time: 1.041631 data: 0.000175 max mem: 18812 Epoch: [14/30] [ 450/1251] eta: 0:12:48 lr: 0.000043 loss: 1.736716 (1.752378) time: 0.967132 data: 0.000160 max mem: 18812 Epoch: [14/30] [ 500/1251] eta: 0:11:58 lr: 0.000043 loss: 1.689574 (1.755454) time: 0.910783 data: 0.000154 max mem: 18812 Epoch: [14/30] [ 550/1251] eta: 0:11:10 lr: 0.000043 loss: 1.731620 (1.755565) time: 0.952303 data: 0.000169 max mem: 18812 Epoch: [14/30] [ 600/1251] eta: 0:10:23 lr: 0.000043 loss: 1.741685 (1.753653) time: 0.988703 data: 0.000171 max mem: 18812 Epoch: [14/30] [ 650/1251] eta: 0:09:34 lr: 0.000042 loss: 1.743330 (1.753393) time: 0.992894 data: 0.000159 max mem: 18812 Epoch: [14/30] [ 700/1251] eta: 0:08:46 lr: 0.000042 loss: 1.743790 (1.752140) time: 0.921544 data: 0.000184 max mem: 18812 Epoch: [14/30] [ 750/1251] eta: 0:07:58 lr: 0.000042 loss: 1.711019 (1.754224) time: 0.907734 data: 0.000205 max mem: 18812 Epoch: [14/30] [ 800/1251] eta: 0:07:10 lr: 0.000042 loss: 1.659930 (1.753540) time: 0.930886 data: 0.000210 max mem: 18812 Epoch: [14/30] [ 850/1251] eta: 0:06:23 lr: 0.000042 loss: 1.714998 (1.753235) time: 0.933668 data: 0.000195 max mem: 18812 Epoch: [14/30] [ 900/1251] eta: 0:05:35 lr: 0.000042 loss: 1.703956 (1.752990) time: 1.047437 data: 0.000192 max mem: 18812 Epoch: [14/30] [ 950/1251] eta: 0:04:47 lr: 0.000041 loss: 1.674679 (1.751572) time: 0.957595 data: 0.000185 max mem: 18812 Epoch: [14/30] 
[1000/1251] eta: 0:03:59 lr: 0.000041 loss: 1.693200 (1.751774) time: 0.911203 data: 0.000154 max mem: 18812 Epoch: [14/30] [1050/1251] eta: 0:03:12 lr: 0.000041 loss: 1.778643 (1.753084) time: 0.924559 data: 0.000192 max mem: 18812 Epoch: [14/30] [1100/1251] eta: 0:02:24 lr: 0.000041 loss: 1.715780 (1.752923) time: 0.953927 data: 0.000166 max mem: 18812 Epoch: [14/30] [1150/1251] eta: 0:01:36 lr: 0.000041 loss: 1.679526 (1.750168) time: 0.995170 data: 0.000166 max mem: 18812 Epoch: [14/30] [1200/1251] eta: 0:00:48 lr: 0.000041 loss: 1.790815 (1.750530) time: 0.964868 data: 0.000170 max mem: 18812 Epoch: [14/30] [1250/1251] eta: 0:00:00 lr: 0.000040 loss: 1.814040 (1.751179) time: 0.917180 data: 0.000756 max mem: 18812 Epoch: [14/30] Total time: 0:19:57 (0.957134 s / it) Averaged stats: lr: 0.000040 loss: 1.814040 (1.757520) Test: [ 0/49] eta: 0:01:15 loss: 0.376775 (0.376775) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.532212 data: 1.103921 max mem: 18812 Test: [10/49] eta: 0:00:25 loss: 0.544994 (0.555387) acc1: 84.375000 (85.369318) acc5: 98.437500 (97.727273) time: 0.645170 data: 0.100493 max mem: 18812 Test: [20/49] eta: 0:00:14 loss: 0.605585 (0.582408) acc1: 84.375000 (85.119048) acc5: 98.437500 (97.767857) time: 0.458742 data: 0.000136 max mem: 18812 Test: [30/49] eta: 0:00:08 loss: 0.605585 (0.587896) acc1: 84.375000 (84.727823) acc5: 98.437500 (97.933468) time: 0.361024 data: 0.000121 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.607942 (0.599316) acc1: 82.812500 (84.717988) acc5: 96.875000 (97.675305) time: 0.359114 data: 0.000116 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.607942 (0.601014) acc1: 84.375000 (84.960000) acc5: 96.875000 (97.664000) time: 0.354079 data: 0.000098 max mem: 18812 Test: Total time: 0:00:20 (0.424132 s / it) * Acc@1 85.228 Acc@5 97.760 loss 0.598 Max accuracy: 85.32% Epoch: [15/30] [ 0/1251] eta: 0:43:05 lr: 0.000040 loss: 1.629013 (1.629013) time: 2.067141 data: 1.162652 max mem: 18812 Epoch: 
[15/30] [ 50/1251] eta: 0:19:39 lr: 0.000040 loss: 1.755796 (1.776729) time: 0.928653 data: 0.000192 max mem: 18812 Epoch: [15/30] [ 100/1251] eta: 0:18:39 lr: 0.000040 loss: 1.776956 (1.737039) time: 0.968871 data: 0.000171 max mem: 18812 Epoch: [15/30] [ 150/1251] eta: 0:17:42 lr: 0.000040 loss: 1.731659 (1.739568) time: 0.985665 data: 0.000182 max mem: 18812 Epoch: [15/30] [ 200/1251] eta: 0:16:49 lr: 0.000040 loss: 1.741859 (1.733421) time: 0.912833 data: 0.000183 max mem: 18812 Epoch: [15/30] [ 250/1251] eta: 0:16:01 lr: 0.000040 loss: 1.756697 (1.736213) time: 0.919608 data: 0.000190 max mem: 18812 Epoch: [15/30] [ 300/1251] eta: 0:15:15 lr: 0.000039 loss: 1.730416 (1.737444) time: 0.928874 data: 0.000173 max mem: 18812 Epoch: [15/30] [ 350/1251] eta: 0:14:26 lr: 0.000039 loss: 1.760191 (1.739170) time: 0.962383 data: 0.000166 max mem: 18812 Epoch: [15/30] [ 400/1251] eta: 0:13:36 lr: 0.000039 loss: 1.649814 (1.737281) time: 0.972622 data: 0.000168 max mem: 18812 Epoch: [15/30] [ 450/1251] eta: 0:12:47 lr: 0.000039 loss: 1.735478 (1.738472) time: 0.917004 data: 0.000163 max mem: 18812 Epoch: [15/30] [ 500/1251] eta: 0:11:58 lr: 0.000039 loss: 1.758040 (1.739545) time: 0.913103 data: 0.000178 max mem: 18812 Epoch: [15/30] [ 550/1251] eta: 0:11:11 lr: 0.000039 loss: 1.742140 (1.735678) time: 0.932419 data: 0.000177 max mem: 18812 Epoch: [15/30] [ 600/1251] eta: 0:10:23 lr: 0.000038 loss: 1.697648 (1.735667) time: 1.038712 data: 0.000180 max mem: 18812 Epoch: [15/30] [ 650/1251] eta: 0:09:35 lr: 0.000038 loss: 1.746334 (1.737044) time: 0.972246 data: 0.000177 max mem: 18812 Epoch: [15/30] [ 700/1251] eta: 0:08:47 lr: 0.000038 loss: 1.763330 (1.740056) time: 0.913694 data: 0.000166 max mem: 18812 Epoch: [15/30] [ 750/1251] eta: 0:08:00 lr: 0.000038 loss: 1.729458 (1.738319) time: 0.938847 data: 0.000190 max mem: 18812 Epoch: [15/30] [ 800/1251] eta: 0:07:12 lr: 0.000038 loss: 1.712738 (1.737811) time: 0.953561 data: 0.000190 max mem: 18812 Epoch: [15/30] [ 
850/1251] eta: 0:06:24 lr: 0.000038 loss: 1.779667 (1.740749) time: 0.984068 data: 0.000209 max mem: 18812 Epoch: [15/30] [ 900/1251] eta: 0:05:36 lr: 0.000037 loss: 1.785777 (1.741540) time: 0.973434 data: 0.000205 max mem: 18812 Epoch: [15/30] [ 950/1251] eta: 0:04:48 lr: 0.000037 loss: 1.763334 (1.742933) time: 0.911046 data: 0.000194 max mem: 18812 Epoch: [15/30] [1000/1251] eta: 0:04:00 lr: 0.000037 loss: 1.761111 (1.745459) time: 0.928906 data: 0.000169 max mem: 18812 Epoch: [15/30] [1050/1251] eta: 0:03:12 lr: 0.000037 loss: 1.701011 (1.744855) time: 0.925408 data: 0.000171 max mem: 18812 Epoch: [15/30] [1100/1251] eta: 0:02:24 lr: 0.000037 loss: 1.784865 (1.747315) time: 0.991778 data: 0.000161 max mem: 18812 Epoch: [15/30] [1150/1251] eta: 0:01:36 lr: 0.000037 loss: 1.722616 (1.748949) time: 0.948070 data: 0.000189 max mem: 18812 Epoch: [15/30] [1200/1251] eta: 0:00:48 lr: 0.000036 loss: 1.715170 (1.749953) time: 0.929516 data: 0.000174 max mem: 18812 Epoch: [15/30] [1250/1251] eta: 0:00:00 lr: 0.000036 loss: 1.785124 (1.749703) time: 0.944522 data: 0.000756 max mem: 18812 Epoch: [15/30] Total time: 0:19:58 (0.957972 s / it) Averaged stats: lr: 0.000036 loss: 1.785124 (1.748121) Test: [ 0/49] eta: 0:01:29 loss: 0.373645 (0.373645) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.828904 data: 1.448304 max mem: 18812 Test: [10/49] eta: 0:00:19 loss: 0.536277 (0.548028) acc1: 85.937500 (85.653409) acc5: 98.437500 (98.011364) time: 0.510604 data: 0.131792 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.611888 (0.579648) acc1: 84.375000 (84.895833) acc5: 96.875000 (97.842262) time: 0.370463 data: 0.000139 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.611888 (0.583920) acc1: 84.375000 (84.677419) acc5: 98.437500 (98.034274) time: 0.361788 data: 0.000131 max mem: 18812 Test: [40/49] eta: 0:00:04 loss: 0.607924 (0.595819) acc1: 84.375000 (84.946646) acc5: 96.875000 (97.751524) time: 0.454759 data: 0.000119 max mem: 18812 Test: [48/49] eta: 
0:00:00 loss: 0.607924 (0.598482) acc1: 84.375000 (85.152000) acc5: 96.875000 (97.664000) time: 0.449465 data: 0.000100 max mem: 18812 Test: Total time: 0:00:21 (0.434238 s / it) * Acc@1 85.362 Acc@5 97.794 loss 0.595 Max accuracy: 85.36% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0015.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0015.pth Epoch: [16/30] [ 0/1251] eta: 0:47:56 lr: 0.000036 loss: 1.582695 (1.582695) time: 2.299187 data: 1.394363 max mem: 18812 Epoch: [16/30] [ 50/1251] eta: 0:20:00 lr: 0.000036 loss: 1.726024 (1.735101) time: 0.934356 data: 0.000175 max mem: 18812 Epoch: [16/30] [ 100/1251] eta: 0:18:51 lr: 0.000036 loss: 1.704473 (1.732103) time: 0.985125 data: 0.000175 max mem: 18812 Epoch: [16/30] [ 150/1251] eta: 0:17:52 lr: 0.000036 loss: 1.713967 (1.726363) time: 0.995250 data: 0.000177 max mem: 18812 Epoch: [16/30] [ 200/1251] eta: 0:16:50 lr: 0.000036 loss: 1.794364 (1.734188) time: 0.907406 data: 0.000165 max mem: 18812 Epoch: [16/30] [ 250/1251] eta: 0:16:04 lr: 0.000035 loss: 1.754582 (1.736585) time: 0.939423 data: 0.000179 max mem: 18812 Epoch: [16/30] [ 300/1251] eta: 0:15:18 lr: 0.000035 loss: 1.689234 (1.733215) time: 0.937926 data: 0.000156 max mem: 18812 Epoch: [16/30] [ 350/1251] eta: 0:14:28 lr: 0.000035 loss: 1.703331 (1.736891) time: 0.962157 data: 0.000149 max mem: 18812 Epoch: [16/30] [ 400/1251] eta: 0:13:39 lr: 0.000035 loss: 1.761457 (1.741026) time: 0.981613 data: 0.000175 max mem: 18812 Epoch: [16/30] [ 450/1251] eta: 0:12:48 lr: 0.000035 loss: 1.724724 (1.745725) time: 0.916449 data: 0.000171 max mem: 18812 Epoch: [16/30] [ 500/1251] eta: 0:12:01 lr: 0.000035 loss: 1.786344 (1.748878) time: 0.929924 data: 0.000168 max mem: 18812 Epoch: [16/30] [ 550/1251] eta: 0:11:12 lr: 0.000034 loss: 1.685905 (1.747657) time: 0.920568 data: 0.000179 max mem: 18812 
Epoch: [16/30] [ 600/1251] eta: 0:10:25 lr: 0.000034 loss: 1.667354 (1.745531) time: 0.978295 data: 0.000179 max mem: 18812 Epoch: [16/30] [ 650/1251] eta: 0:09:36 lr: 0.000034 loss: 1.823143 (1.747901) time: 0.960940 data: 0.000175 max mem: 18812 Epoch: [16/30] [ 700/1251] eta: 0:08:47 lr: 0.000034 loss: 1.693023 (1.744938) time: 0.912530 data: 0.000159 max mem: 18812 Epoch: [16/30] [ 750/1251] eta: 0:07:59 lr: 0.000034 loss: 1.686521 (1.744534) time: 0.919980 data: 0.000176 max mem: 18812 Epoch: [16/30] [ 800/1251] eta: 0:07:12 lr: 0.000034 loss: 1.659893 (1.743751) time: 0.922881 data: 0.000177 max mem: 18812 Epoch: [16/30] [ 850/1251] eta: 0:06:24 lr: 0.000033 loss: 1.653609 (1.744196) time: 0.968797 data: 0.000177 max mem: 18812 Epoch: [16/30] [ 900/1251] eta: 0:05:35 lr: 0.000033 loss: 1.740914 (1.745687) time: 0.958388 data: 0.000179 max mem: 18812 Epoch: [16/30] [ 950/1251] eta: 0:04:47 lr: 0.000033 loss: 1.813010 (1.745934) time: 0.909975 data: 0.000188 max mem: 18812 Epoch: [16/30] [1000/1251] eta: 0:04:00 lr: 0.000033 loss: 1.808637 (1.747255) time: 0.954129 data: 0.000174 max mem: 18812 Epoch: [16/30] [1050/1251] eta: 0:03:12 lr: 0.000033 loss: 1.698074 (1.746642) time: 0.915098 data: 0.000164 max mem: 18812 Epoch: [16/30] [1100/1251] eta: 0:02:24 lr: 0.000033 loss: 1.712190 (1.745549) time: 1.041545 data: 0.000162 max mem: 18812 Epoch: [16/30] [1150/1251] eta: 0:01:36 lr: 0.000032 loss: 1.651682 (1.745277) time: 0.987495 data: 0.000163 max mem: 18812 Epoch: [16/30] [1200/1251] eta: 0:00:48 lr: 0.000032 loss: 1.764604 (1.745263) time: 0.913746 data: 0.000174 max mem: 18812 Epoch: [16/30] [1250/1251] eta: 0:00:00 lr: 0.000032 loss: 1.747824 (1.745583) time: 0.935432 data: 0.000761 max mem: 18812 Epoch: [16/30] Total time: 0:19:57 (0.957586 s / it) Averaged stats: lr: 0.000032 loss: 1.747824 (1.743766) Test: [ 0/49] eta: 0:01:28 loss: 0.383320 (0.383320) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.803406 data: 1.400303 max mem: 18812 
Test: [10/49] eta: 0:00:19 loss: 0.528935 (0.553825) acc1: 85.937500 (85.795455) acc5: 98.437500 (98.011364) time: 0.497661 data: 0.127454 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.597470 (0.577044) acc1: 84.375000 (85.416667) acc5: 98.437500 (97.916667) time: 0.364352 data: 0.000163 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.597470 (0.579602) acc1: 84.375000 (85.030242) acc5: 98.437500 (98.034274) time: 0.362278 data: 0.000154 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.598059 (0.595010) acc1: 82.812500 (84.908537) acc5: 96.875000 (97.751524) time: 0.360458 data: 0.000148 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.624869 (0.597079) acc1: 82.812500 (85.152000) acc5: 96.875000 (97.696000) time: 0.448067 data: 0.000122 max mem: 18812 Test: Total time: 0:00:21 (0.429898 s / it) * Acc@1 85.400 Acc@5 97.832 loss 0.596 Max accuracy: 85.40% Epoch: [17/30] [ 0/1251] eta: 0:43:34 lr: 0.000032 loss: 1.587610 (1.587610) time: 2.089875 data: 1.123466 max mem: 18812 Epoch: [17/30] [ 50/1251] eta: 0:20:02 lr: 0.000032 loss: 1.782680 (1.735598) time: 1.044265 data: 0.000182 max mem: 18812 Epoch: [17/30] [ 100/1251] eta: 0:18:43 lr: 0.000032 loss: 1.742444 (1.730475) time: 0.974476 data: 0.000173 max mem: 18812 Epoch: [17/30] [ 150/1251] eta: 0:17:42 lr: 0.000032 loss: 1.710018 (1.723210) time: 0.910793 data: 0.000172 max mem: 18812 Epoch: [17/30] [ 200/1251] eta: 0:16:54 lr: 0.000031 loss: 1.691485 (1.730145) time: 0.934769 data: 0.000198 max mem: 18812 Epoch: [17/30] [ 250/1251] eta: 0:16:07 lr: 0.000031 loss: 1.756372 (1.734928) time: 0.989263 data: 0.000189 max mem: 18812 Epoch: [17/30] [ 300/1251] eta: 0:15:19 lr: 0.000031 loss: 1.754730 (1.740707) time: 1.025811 data: 0.000160 max mem: 18812 Epoch: [17/30] [ 350/1251] eta: 0:14:27 lr: 0.000031 loss: 1.663962 (1.735662) time: 0.966268 data: 0.000158 max mem: 18812 Epoch: [17/30] [ 400/1251] eta: 0:13:37 lr: 0.000031 loss: 1.646496 (1.733202) time: 0.920598 data: 0.000157 max mem: 18812 Epoch: 
[17/30] [ 450/1251] eta: 0:12:49 lr: 0.000031 loss: 1.693566 (1.730619) time: 0.923810 data: 0.000171 max mem: 18812 Epoch: [17/30] [ 500/1251] eta: 0:12:01 lr: 0.000031 loss: 1.743041 (1.728057) time: 0.975988 data: 0.000179 max mem: 18812 Epoch: [17/30] [ 550/1251] eta: 0:11:13 lr: 0.000030 loss: 1.689659 (1.732192) time: 1.023671 data: 0.000164 max mem: 18812 Epoch: [17/30] [ 600/1251] eta: 0:10:22 lr: 0.000030 loss: 1.728415 (1.735665) time: 0.903059 data: 0.000177 max mem: 18812 Epoch: [17/30] [ 650/1251] eta: 0:09:35 lr: 0.000030 loss: 1.745252 (1.737525) time: 0.927255 data: 0.000160 max mem: 18812 Epoch: [17/30] [ 700/1251] eta: 0:08:47 lr: 0.000030 loss: 1.659012 (1.737187) time: 0.924742 data: 0.000173 max mem: 18812 Epoch: [17/30] [ 750/1251] eta: 0:07:59 lr: 0.000030 loss: 1.743121 (1.737884) time: 0.984424 data: 0.000180 max mem: 18812 Epoch: [17/30] [ 800/1251] eta: 0:07:12 lr: 0.000030 loss: 1.707643 (1.737579) time: 1.047737 data: 0.000182 max mem: 18812 Epoch: [17/30] [ 850/1251] eta: 0:06:23 lr: 0.000029 loss: 1.729763 (1.738467) time: 0.925360 data: 0.000190 max mem: 18812 Epoch: [17/30] [ 900/1251] eta: 0:05:36 lr: 0.000029 loss: 1.720826 (1.738848) time: 0.926963 data: 0.000210 max mem: 18812 Epoch: [17/30] [ 950/1251] eta: 0:04:48 lr: 0.000029 loss: 1.647875 (1.738137) time: 0.945083 data: 0.000181 max mem: 18812 Epoch: [17/30] [1000/1251] eta: 0:04:00 lr: 0.000029 loss: 1.746311 (1.737439) time: 0.976544 data: 0.000154 max mem: 18812 Epoch: [17/30] [1050/1251] eta: 0:03:12 lr: 0.000029 loss: 1.737347 (1.735674) time: 1.025388 data: 0.000173 max mem: 18812 Epoch: [17/30] [1100/1251] eta: 0:02:24 lr: 0.000029 loss: 1.713539 (1.736030) time: 0.912337 data: 0.000173 max mem: 18812 Epoch: [17/30] [1150/1251] eta: 0:01:36 lr: 0.000028 loss: 1.659840 (1.734438) time: 0.921549 data: 0.000179 max mem: 18812 Epoch: [17/30] [1200/1251] eta: 0:00:48 lr: 0.000028 loss: 1.794591 (1.734964) time: 0.907021 data: 0.000170 max mem: 18812 Epoch: [17/30] 
[1250/1251] eta: 0:00:00 lr: 0.000028 loss: 1.811027 (1.735369) time: 0.990904 data: 0.000734 max mem: 18812 Epoch: [17/30] Total time: 0:19:59 (0.959201 s / it) Averaged stats: lr: 0.000028 loss: 1.811027 (1.735531) Test: [ 0/49] eta: 0:01:27 loss: 0.358385 (0.358385) acc1: 90.625000 (90.625000) acc5: 100.000000 (100.000000) time: 1.775738 data: 1.359391 max mem: 18812 Test: [10/49] eta: 0:00:19 loss: 0.539403 (0.550142) acc1: 85.937500 (86.505682) acc5: 98.437500 (97.727273) time: 0.493046 data: 0.123708 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.601492 (0.579673) acc1: 85.937500 (86.011905) acc5: 98.437500 (97.842262) time: 0.362687 data: 0.000139 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.600692 (0.582891) acc1: 84.375000 (85.685484) acc5: 98.437500 (98.084677) time: 0.362501 data: 0.000142 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.600692 (0.597310) acc1: 84.375000 (85.670732) acc5: 98.437500 (97.789634) time: 0.370084 data: 0.000140 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.626329 (0.598082) acc1: 84.375000 (85.792000) acc5: 96.875000 (97.728000) time: 0.364875 data: 0.000112 max mem: 18812 Test: Total time: 0:00:19 (0.395421 s / it) * Acc@1 85.480 Acc@5 97.792 loss 0.595 Max accuracy: 85.48% Epoch: [18/30] [ 0/1251] eta: 0:39:43 lr: 0.000028 loss: 1.712152 (1.712152) time: 1.905149 data: 0.997955 max mem: 18812 Epoch: [18/30] [ 50/1251] eta: 0:19:23 lr: 0.000028 loss: 1.734906 (1.725716) time: 0.923193 data: 0.000205 max mem: 18812 Epoch: [18/30] [ 100/1251] eta: 0:18:37 lr: 0.000028 loss: 1.728961 (1.709758) time: 0.936925 data: 0.000195 max mem: 18812 Epoch: [18/30] [ 150/1251] eta: 0:17:46 lr: 0.000028 loss: 1.732016 (1.723468) time: 0.961274 data: 0.000175 max mem: 18812 Epoch: [18/30] [ 200/1251] eta: 0:16:54 lr: 0.000027 loss: 1.742008 (1.729604) time: 1.011279 data: 0.000206 max mem: 18812 Epoch: [18/30] [ 250/1251] eta: 0:16:02 lr: 0.000027 loss: 1.716956 (1.731001) time: 0.964452 data: 0.000189 max mem: 18812 Epoch: 
[18/30] [ 300/1251] eta: 0:15:11 lr: 0.000027 loss: 1.730488 (1.731369) time: 0.934422 data: 0.000160 max mem: 18812 Epoch: [18/30] [ 350/1251] eta: 0:14:25 lr: 0.000027 loss: 1.658267 (1.725857) time: 0.935146 data: 0.000180 max mem: 18812 Epoch: [18/30] [ 400/1251] eta: 0:13:35 lr: 0.000027 loss: 1.697788 (1.724465) time: 0.966684 data: 0.000202 max mem: 18812 Epoch: [18/30] [ 450/1251] eta: 0:12:48 lr: 0.000027 loss: 1.734428 (1.724498) time: 1.036673 data: 0.000158 max mem: 18812 Epoch: [18/30] [ 500/1251] eta: 0:12:00 lr: 0.000027 loss: 1.807396 (1.729161) time: 0.982789 data: 0.000161 max mem: 18812 Epoch: [18/30] [ 550/1251] eta: 0:11:11 lr: 0.000026 loss: 1.697870 (1.726792) time: 0.911961 data: 0.000161 max mem: 18812 Epoch: [18/30] [ 600/1251] eta: 0:10:23 lr: 0.000026 loss: 1.727084 (1.722839) time: 0.935476 data: 0.000165 max mem: 18812 Epoch: [18/30] [ 650/1251] eta: 0:09:36 lr: 0.000026 loss: 1.802219 (1.725304) time: 1.001289 data: 0.000170 max mem: 18812 Epoch: [18/30] [ 700/1251] eta: 0:08:48 lr: 0.000026 loss: 1.668446 (1.722130) time: 1.027159 data: 0.000166 max mem: 18812 Epoch: [18/30] [ 750/1251] eta: 0:08:00 lr: 0.000026 loss: 1.711350 (1.723978) time: 0.959012 data: 0.000183 max mem: 18812 Epoch: [18/30] [ 800/1251] eta: 0:07:11 lr: 0.000026 loss: 1.645809 (1.724186) time: 0.923245 data: 0.000192 max mem: 18812 Epoch: [18/30] [ 850/1251] eta: 0:06:24 lr: 0.000025 loss: 1.653204 (1.722492) time: 0.942842 data: 0.000197 max mem: 18812 Epoch: [18/30] [ 900/1251] eta: 0:05:36 lr: 0.000025 loss: 1.724745 (1.724366) time: 0.972424 data: 0.000183 max mem: 18812 Epoch: [18/30] [ 950/1251] eta: 0:04:48 lr: 0.000025 loss: 1.693393 (1.723721) time: 1.026318 data: 0.000199 max mem: 18812 Epoch: [18/30] [1000/1251] eta: 0:04:00 lr: 0.000025 loss: 1.732962 (1.723710) time: 0.966914 data: 0.000170 max mem: 18812 Epoch: [18/30] [1050/1251] eta: 0:03:12 lr: 0.000025 loss: 1.794536 (1.725282) time: 0.916304 data: 0.000165 max mem: 18812 Epoch: [18/30] 
[1100/1251] eta: 0:02:24 lr: 0.000025 loss: 1.667027 (1.724133) time: 0.933206 data: 0.000175 max mem: 18812 Epoch: [18/30] [1150/1251] eta: 0:01:36 lr: 0.000025 loss: 1.776127 (1.725855) time: 0.972888 data: 0.000177 max mem: 18812 Epoch: [18/30] [1200/1251] eta: 0:00:48 lr: 0.000024 loss: 1.692149 (1.727137) time: 1.030331 data: 0.000177 max mem: 18812 Epoch: [18/30] [1250/1251] eta: 0:00:00 lr: 0.000024 loss: 1.705695 (1.726698) time: 0.936837 data: 0.000749 max mem: 18812 Epoch: [18/30] Total time: 0:19:57 (0.956931 s / it) Averaged stats: lr: 0.000024 loss: 1.705695 (1.728174) Test: [ 0/49] eta: 0:01:17 loss: 0.360131 (0.360131) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.591639 data: 1.185440 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.532138 (0.543507) acc1: 85.937500 (86.505682) acc5: 98.437500 (97.585227) time: 0.479019 data: 0.107925 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.603169 (0.572552) acc1: 84.375000 (85.788690) acc5: 96.875000 (97.619048) time: 0.364639 data: 0.000153 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.603169 (0.577750) acc1: 84.375000 (85.383065) acc5: 98.437500 (97.832661) time: 0.361889 data: 0.000131 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.626729 (0.592958) acc1: 84.375000 (85.137195) acc5: 96.875000 (97.599085) time: 0.358820 data: 0.000123 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.626799 (0.594962) acc1: 82.812500 (85.216000) acc5: 96.875000 (97.568000) time: 0.353118 data: 0.000104 max mem: 18812 Test: Total time: 0:00:19 (0.388206 s / it) * Acc@1 85.490 Acc@5 97.822 loss 0.592 Max accuracy: 85.49% Epoch: [19/30] [ 0/1251] eta: 0:40:11 lr: 0.000024 loss: 1.568820 (1.568820) time: 1.927818 data: 1.033013 max mem: 18812 Epoch: [19/30] [ 50/1251] eta: 0:19:36 lr: 0.000024 loss: 1.792273 (1.734114) time: 0.931835 data: 0.000177 max mem: 18812 Epoch: [19/30] [ 100/1251] eta: 0:18:39 lr: 0.000024 loss: 1.634716 (1.727958) time: 0.994340 data: 0.000188 max mem: 18812 Epoch: 
[19/30] [ 150/1251] eta: 0:17:52 lr: 0.000024 loss: 1.692297 (1.724462) time: 1.045634 data: 0.000183 max mem: 18812 Epoch: [19/30] [ 200/1251] eta: 0:16:49 lr: 0.000024 loss: 1.702482 (1.728958) time: 0.913942 data: 0.000178 max mem: 18812 Epoch: [19/30] [ 250/1251] eta: 0:16:02 lr: 0.000023 loss: 1.707488 (1.735476) time: 0.930878 data: 0.000179 max mem: 18812 Epoch: [19/30] [ 300/1251] eta: 0:15:15 lr: 0.000023 loss: 1.686273 (1.732537) time: 0.926520 data: 0.000163 max mem: 18812 Epoch: [19/30] [ 350/1251] eta: 0:14:27 lr: 0.000023 loss: 1.715178 (1.730385) time: 0.966465 data: 0.000166 max mem: 18812 Epoch: [19/30] [ 400/1251] eta: 0:13:39 lr: 0.000023 loss: 1.694107 (1.728963) time: 1.013300 data: 0.000158 max mem: 18812 Epoch: [19/30] [ 450/1251] eta: 0:12:47 lr: 0.000023 loss: 1.705663 (1.727646) time: 0.903727 data: 0.000173 max mem: 18812 Epoch: [19/30] [ 500/1251] eta: 0:12:01 lr: 0.000023 loss: 1.730580 (1.732364) time: 0.939799 data: 0.000169 max mem: 18812 Epoch: [19/30] [ 550/1251] eta: 0:11:13 lr: 0.000023 loss: 1.756773 (1.733760) time: 0.929926 data: 0.000164 max mem: 18812 Epoch: [19/30] [ 600/1251] eta: 0:10:25 lr: 0.000022 loss: 1.711662 (1.735373) time: 0.963551 data: 0.000156 max mem: 18812 Epoch: [19/30] [ 650/1251] eta: 0:09:36 lr: 0.000022 loss: 1.734860 (1.735599) time: 0.954898 data: 0.000165 max mem: 18812 Epoch: [19/30] [ 700/1251] eta: 0:08:47 lr: 0.000022 loss: 1.741485 (1.733772) time: 0.924754 data: 0.000176 max mem: 18812 Epoch: [19/30] [ 750/1251] eta: 0:08:00 lr: 0.000022 loss: 1.710081 (1.735290) time: 0.923421 data: 0.000190 max mem: 18812 Epoch: [19/30] [ 800/1251] eta: 0:07:12 lr: 0.000022 loss: 1.637212 (1.734532) time: 0.939589 data: 0.000190 max mem: 18812 Epoch: [19/30] [ 850/1251] eta: 0:06:24 lr: 0.000022 loss: 1.728407 (1.735598) time: 0.980275 data: 0.000204 max mem: 18812 Epoch: [19/30] [ 900/1251] eta: 0:05:36 lr: 0.000022 loss: 1.670158 (1.734487) time: 0.972700 data: 0.000189 max mem: 18812 Epoch: [19/30] [ 
950/1251] eta: 0:04:48 lr: 0.000021 loss: 1.736431 (1.734062) time: 0.905073 data: 0.000188 max mem: 18812 Epoch: [19/30] [1000/1251] eta: 0:04:00 lr: 0.000021 loss: 1.757944 (1.735631) time: 0.933170 data: 0.000178 max mem: 18812 Epoch: [19/30] [1050/1251] eta: 0:03:12 lr: 0.000021 loss: 1.654005 (1.733333) time: 0.937873 data: 0.000169 max mem: 18812 Epoch: [19/30] [1100/1251] eta: 0:02:24 lr: 0.000021 loss: 1.665610 (1.731202) time: 1.025633 data: 0.000192 max mem: 18812 Epoch: [19/30] [1150/1251] eta: 0:01:36 lr: 0.000021 loss: 1.770297 (1.730898) time: 0.969754 data: 0.000162 max mem: 18812 Epoch: [19/30] [1200/1251] eta: 0:00:48 lr: 0.000021 loss: 1.700556 (1.728151) time: 0.920372 data: 0.000157 max mem: 18812 Epoch: [19/30] [1250/1251] eta: 0:00:00 lr: 0.000021 loss: 1.700640 (1.728486) time: 0.941512 data: 0.000754 max mem: 18812 Epoch: [19/30] Total time: 0:19:58 (0.958418 s / it) Averaged stats: lr: 0.000021 loss: 1.700640 (1.725103) Test: [ 0/49] eta: 0:01:14 loss: 0.347148 (0.347148) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.523434 data: 1.082807 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.524053 (0.542621) acc1: 85.937500 (86.505682) acc5: 98.437500 (97.443182) time: 0.471429 data: 0.098583 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.597908 (0.573310) acc1: 85.937500 (85.937500) acc5: 96.875000 (97.544643) time: 0.363751 data: 0.000142 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.597908 (0.575824) acc1: 85.937500 (85.584677) acc5: 98.437500 (97.883065) time: 0.362687 data: 0.000141 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.605704 (0.591298) acc1: 84.375000 (85.442073) acc5: 98.437500 (97.713415) time: 0.368729 data: 0.000140 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.609435 (0.593970) acc1: 84.375000 (85.472000) acc5: 96.875000 (97.664000) time: 0.461320 data: 0.000109 max mem: 18812 Test: Total time: 0:00:21 (0.430378 s / it) * Acc@1 85.506 Acc@5 97.812 loss 0.591 Max accuracy: 85.51% Epoch: 
[20/30] [ 0/1251] eta: 0:40:26 lr: 0.000021 loss: 1.639448 (1.639448) time: 1.939995 data: 1.049074 max mem: 18812 Epoch: [20/30] [ 50/1251] eta: 0:19:31 lr: 0.000020 loss: 1.700135 (1.738005) time: 1.022182 data: 0.000201 max mem: 18812 Epoch: [20/30] [ 100/1251] eta: 0:18:26 lr: 0.000020 loss: 1.789981 (1.730135) time: 0.981105 data: 0.000198 max mem: 18812 Epoch: [20/30] [ 150/1251] eta: 0:17:42 lr: 0.000020 loss: 1.725372 (1.725316) time: 0.916817 data: 0.000188 max mem: 18812 Epoch: [20/30] [ 200/1251] eta: 0:16:57 lr: 0.000020 loss: 1.703311 (1.718329) time: 0.947458 data: 0.000188 max mem: 18812 Epoch: [20/30] [ 250/1251] eta: 0:16:09 lr: 0.000020 loss: 1.666571 (1.717286) time: 0.941249 data: 0.000180 max mem: 18812 Epoch: [20/30] [ 300/1251] eta: 0:15:18 lr: 0.000020 loss: 1.709194 (1.719343) time: 1.000236 data: 0.000175 max mem: 18812 Epoch: [20/30] [ 350/1251] eta: 0:14:27 lr: 0.000020 loss: 1.792702 (1.721130) time: 0.966985 data: 0.000163 max mem: 18812 Epoch: [20/30] [ 400/1251] eta: 0:13:38 lr: 0.000019 loss: 1.756247 (1.727511) time: 0.921952 data: 0.000175 max mem: 18812 Epoch: [20/30] [ 450/1251] eta: 0:12:49 lr: 0.000019 loss: 1.713487 (1.724902) time: 0.925998 data: 0.000166 max mem: 18812 Epoch: [20/30] [ 500/1251] eta: 0:12:01 lr: 0.000019 loss: 1.660882 (1.722401) time: 0.973303 data: 0.000164 max mem: 18812 Epoch: [20/30] [ 550/1251] eta: 0:11:12 lr: 0.000019 loss: 1.683687 (1.724410) time: 0.958365 data: 0.000180 max mem: 18812 Epoch: [20/30] [ 600/1251] eta: 0:10:23 lr: 0.000019 loss: 1.718542 (1.722367) time: 0.908387 data: 0.000172 max mem: 18812 Epoch: [20/30] [ 650/1251] eta: 0:09:35 lr: 0.000019 loss: 1.689715 (1.721134) time: 0.906397 data: 0.000148 max mem: 18812 Epoch: [20/30] [ 700/1251] eta: 0:08:48 lr: 0.000019 loss: 1.664315 (1.721974) time: 0.968124 data: 0.000166 max mem: 18812 Epoch: [20/30] [ 750/1251] eta: 0:07:59 lr: 0.000018 loss: 1.688684 (1.720528) time: 0.950194 data: 0.000194 max mem: 18812 Epoch: [20/30] [ 
800/1251] eta: 0:07:11 lr: 0.000018 loss: 1.712211 (1.720908) time: 0.975435 data: 0.000196 max mem: 18812 Epoch: [20/30] [ 850/1251] eta: 0:06:23 lr: 0.000018 loss: 1.652575 (1.719456) time: 0.913620 data: 0.000175 max mem: 18812 Epoch: [20/30] [ 900/1251] eta: 0:05:35 lr: 0.000018 loss: 1.671695 (1.716833) time: 0.982242 data: 0.000189 max mem: 18812 Epoch: [20/30] [ 950/1251] eta: 0:04:47 lr: 0.000018 loss: 1.642860 (1.717285) time: 0.957819 data: 0.000205 max mem: 18812 Epoch: [20/30] [1000/1251] eta: 0:03:59 lr: 0.000018 loss: 1.775203 (1.717242) time: 0.973824 data: 0.000163 max mem: 18812 Epoch: [20/30] [1050/1251] eta: 0:03:11 lr: 0.000018 loss: 1.719810 (1.717187) time: 0.916861 data: 0.000184 max mem: 18812 Epoch: [20/30] [1100/1251] eta: 0:02:24 lr: 0.000017 loss: 1.661499 (1.717070) time: 0.909699 data: 0.000185 max mem: 18812 Epoch: [20/30] [1150/1251] eta: 0:01:36 lr: 0.000017 loss: 1.791123 (1.718845) time: 0.976148 data: 0.000174 max mem: 18812 Epoch: [20/30] [1200/1251] eta: 0:00:48 lr: 0.000017 loss: 1.668604 (1.719757) time: 0.976099 data: 0.000185 max mem: 18812 Epoch: [20/30] [1250/1251] eta: 0:00:00 lr: 0.000017 loss: 1.751268 (1.720459) time: 0.977249 data: 0.000753 max mem: 18812 Epoch: [20/30] Total time: 0:19:54 (0.955094 s / it) Averaged stats: lr: 0.000017 loss: 1.751268 (1.721079) Test: [ 0/49] eta: 0:01:16 loss: 0.343274 (0.343274) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.563083 data: 1.128516 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.534922 (0.545761) acc1: 85.937500 (86.079545) acc5: 98.437500 (97.869318) time: 0.479757 data: 0.102757 max mem: 18812 Test: [20/49] eta: 0:00:14 loss: 0.596569 (0.573200) acc1: 84.375000 (85.639881) acc5: 98.437500 (97.916667) time: 0.460722 data: 0.000151 max mem: 18812 Test: [30/49] eta: 0:00:08 loss: 0.594989 (0.576241) acc1: 84.375000 (85.181452) acc5: 98.437500 (98.084677) time: 0.455295 data: 0.000129 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.587017 
(0.590250) acc1: 84.375000 (85.213415) acc5: 96.875000 (97.903963) time: 0.358418 data: 0.000124 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.599122 (0.591479) acc1: 84.375000 (85.344000) acc5: 96.875000 (97.856000) time: 0.353328 data: 0.000096 max mem: 18812 Test: Total time: 0:00:20 (0.425972 s / it) * Acc@1 85.488 Acc@5 97.866 loss 0.591 Max accuracy: 85.51% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0020.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0020.pth Epoch: [21/30] [ 0/1251] eta: 0:55:01 lr: 0.000017 loss: 1.725681 (1.725681) time: 2.639255 data: 1.073334 max mem: 18812 Epoch: [21/30] [ 50/1251] eta: 0:19:53 lr: 0.000017 loss: 1.741234 (1.704777) time: 0.995021 data: 0.000190 max mem: 18812 Epoch: [21/30] [ 100/1251] eta: 0:18:38 lr: 0.000017 loss: 1.682104 (1.711729) time: 0.920420 data: 0.000177 max mem: 18812 Epoch: [21/30] [ 150/1251] eta: 0:17:47 lr: 0.000017 loss: 1.667497 (1.726605) time: 0.917753 data: 0.000185 max mem: 18812 Epoch: [21/30] [ 200/1251] eta: 0:16:58 lr: 0.000017 loss: 1.732460 (1.718334) time: 0.978812 data: 0.000185 max mem: 18812 Epoch: [21/30] [ 250/1251] eta: 0:16:03 lr: 0.000016 loss: 1.721237 (1.718312) time: 0.960328 data: 0.000168 max mem: 18812 Epoch: [21/30] [ 300/1251] eta: 0:15:17 lr: 0.000016 loss: 1.736606 (1.714369) time: 0.995166 data: 0.000157 max mem: 18812 Epoch: [21/30] [ 350/1251] eta: 0:14:25 lr: 0.000016 loss: 1.727548 (1.714364) time: 0.928584 data: 0.000172 max mem: 18812 Epoch: [21/30] [ 400/1251] eta: 0:13:38 lr: 0.000016 loss: 1.693542 (1.712166) time: 0.916346 data: 0.000164 max mem: 18812 Epoch: [21/30] [ 450/1251] eta: 0:12:51 lr: 0.000016 loss: 1.748498 (1.712271) time: 1.011895 data: 0.000168 max mem: 18812 Epoch: [21/30] [ 500/1251] eta: 0:12:01 lr: 0.000016 loss: 1.744147 (1.712047) time: 0.979787 data: 0.000170 max mem: 
18812 Epoch: [21/30] [ 550/1251] eta: 0:11:14 lr: 0.000016 loss: 1.675792 (1.710777) time: 0.977533 data: 0.000178 max mem: 18812 Epoch: [21/30] [ 600/1251] eta: 0:10:25 lr: 0.000015 loss: 1.723370 (1.708864) time: 0.914228 data: 0.000173 max mem: 18812 Epoch: [21/30] [ 650/1251] eta: 0:09:37 lr: 0.000015 loss: 1.635196 (1.709177) time: 0.920633 data: 0.000148 max mem: 18812 Epoch: [21/30] [ 700/1251] eta: 0:08:50 lr: 0.000015 loss: 1.682433 (1.711881) time: 0.969404 data: 0.000156 max mem: 18812 Epoch: [21/30] [ 750/1251] eta: 0:08:01 lr: 0.000015 loss: 1.719092 (1.709721) time: 0.976761 data: 0.000195 max mem: 18812 Epoch: [21/30] [ 800/1251] eta: 0:07:13 lr: 0.000015 loss: 1.643188 (1.707888) time: 0.964683 data: 0.000179 max mem: 18812 Epoch: [21/30] [ 850/1251] eta: 0:06:24 lr: 0.000015 loss: 1.649026 (1.706046) time: 0.908095 data: 0.000187 max mem: 18812 Epoch: [21/30] [ 900/1251] eta: 0:05:36 lr: 0.000015 loss: 1.695044 (1.706430) time: 0.911585 data: 0.000181 max mem: 18812 Epoch: [21/30] [ 950/1251] eta: 0:04:48 lr: 0.000015 loss: 1.732001 (1.706969) time: 0.968496 data: 0.000187 max mem: 18812 Epoch: [21/30] [1000/1251] eta: 0:04:00 lr: 0.000014 loss: 1.742151 (1.709129) time: 0.967360 data: 0.000162 max mem: 18812 Epoch: [21/30] [1050/1251] eta: 0:03:12 lr: 0.000014 loss: 1.693296 (1.710430) time: 0.984582 data: 0.000172 max mem: 18812 Epoch: [21/30] [1100/1251] eta: 0:02:24 lr: 0.000014 loss: 1.728060 (1.710615) time: 0.912079 data: 0.000169 max mem: 18812 Epoch: [21/30] [1150/1251] eta: 0:01:36 lr: 0.000014 loss: 1.668685 (1.711342) time: 0.917827 data: 0.000162 max mem: 18812 Epoch: [21/30] [1200/1251] eta: 0:00:48 lr: 0.000014 loss: 1.701019 (1.711124) time: 0.985624 data: 0.000183 max mem: 18812 Epoch: [21/30] [1250/1251] eta: 0:00:00 lr: 0.000014 loss: 1.706857 (1.710100) time: 0.982832 data: 0.000764 max mem: 18812 Epoch: [21/30] Total time: 0:20:00 (0.959271 s / it) Averaged stats: lr: 0.000014 loss: 1.706857 (1.718771) Test: [ 0/49] eta: 
0:01:27 loss: 0.331654 (0.331654) acc1: 90.625000 (90.625000) acc5: 100.000000 (100.000000) time: 1.790415 data: 1.367682 max mem: 18812 Test: [10/49] eta: 0:00:25 loss: 0.525531 (0.539207) acc1: 85.937500 (86.221591) acc5: 98.437500 (97.869318) time: 0.658064 data: 0.124470 max mem: 18812 Test: [20/49] eta: 0:00:14 loss: 0.596051 (0.570233) acc1: 84.375000 (85.565476) acc5: 98.437500 (97.916667) time: 0.452832 data: 0.000140 max mem: 18812 Test: [30/49] eta: 0:00:08 loss: 0.589969 (0.573319) acc1: 84.375000 (85.282258) acc5: 98.437500 (98.034274) time: 0.360645 data: 0.000131 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.588577 (0.588346) acc1: 84.375000 (85.327744) acc5: 96.875000 (97.865854) time: 0.358868 data: 0.000123 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.610656 (0.590989) acc1: 84.375000 (85.440000) acc5: 96.875000 (97.792000) time: 0.354213 data: 0.000100 max mem: 18812 Test: Total time: 0:00:20 (0.428404 s / it) * Acc@1 85.618 Acc@5 97.828 loss 0.589 Max accuracy: 85.62% Epoch: [22/30] [ 0/1251] eta: 0:41:39 lr: 0.000014 loss: 1.826995 (1.826995) time: 1.997633 data: 1.111887 max mem: 18812 Epoch: [22/30] [ 50/1251] eta: 0:19:46 lr: 0.000014 loss: 1.679108 (1.698171) time: 0.915263 data: 0.000215 max mem: 18812 Epoch: [22/30] [ 100/1251] eta: 0:18:41 lr: 0.000014 loss: 1.700381 (1.700411) time: 0.996300 data: 0.000197 max mem: 18812 Epoch: [22/30] [ 150/1251] eta: 0:17:47 lr: 0.000013 loss: 1.710433 (1.709404) time: 0.967946 data: 0.000207 max mem: 18812 Epoch: [22/30] [ 200/1251] eta: 0:16:48 lr: 0.000013 loss: 1.757009 (1.716275) time: 0.952305 data: 0.000194 max mem: 18812 Epoch: [22/30] [ 250/1251] eta: 0:15:58 lr: 0.000013 loss: 1.727069 (1.718765) time: 0.922171 data: 0.000187 max mem: 18812 Epoch: [22/30] [ 300/1251] eta: 0:15:11 lr: 0.000013 loss: 1.736590 (1.721835) time: 0.918038 data: 0.000157 max mem: 18812 Epoch: [22/30] [ 350/1251] eta: 0:14:26 lr: 0.000013 loss: 1.728330 (1.724255) time: 0.989197 data: 0.000169 max mem: 18812 
Epoch: [22/30] [ 400/1251] eta: 0:13:37 lr: 0.000013 loss: 1.710452 (1.724489) time: 0.979889 data: 0.000176 max mem: 18812 Epoch: [22/30] [ 450/1251] eta: 0:12:48 lr: 0.000013 loss: 1.653856 (1.722447) time: 0.974539 data: 0.000190 max mem: 18812 Epoch: [22/30] [ 500/1251] eta: 0:11:59 lr: 0.000013 loss: 1.682916 (1.723604) time: 0.922615 data: 0.000177 max mem: 18812 Epoch: [22/30] [ 550/1251] eta: 0:11:12 lr: 0.000013 loss: 1.647693 (1.720731) time: 0.921186 data: 0.000175 max mem: 18812 Epoch: [22/30] [ 600/1251] eta: 0:10:24 lr: 0.000012 loss: 1.824356 (1.722973) time: 0.982931 data: 0.000174 max mem: 18812 Epoch: [22/30] [ 650/1251] eta: 0:09:36 lr: 0.000012 loss: 1.713780 (1.722818) time: 0.962531 data: 0.000179 max mem: 18812 Epoch: [22/30] [ 700/1251] eta: 0:08:47 lr: 0.000012 loss: 1.741136 (1.723021) time: 0.960812 data: 0.000160 max mem: 18812 Epoch: [22/30] [ 750/1251] eta: 0:07:59 lr: 0.000012 loss: 1.668992 (1.724232) time: 0.917710 data: 0.000204 max mem: 18812 Epoch: [22/30] [ 800/1251] eta: 0:07:12 lr: 0.000012 loss: 1.708058 (1.724399) time: 0.930064 data: 0.000192 max mem: 18812 Epoch: [22/30] [ 850/1251] eta: 0:06:24 lr: 0.000012 loss: 1.686606 (1.722621) time: 0.967161 data: 0.000195 max mem: 18812 Epoch: [22/30] [ 900/1251] eta: 0:05:35 lr: 0.000012 loss: 1.732469 (1.721568) time: 0.948233 data: 0.000183 max mem: 18812 Epoch: [22/30] [ 950/1251] eta: 0:04:48 lr: 0.000012 loss: 1.799495 (1.723800) time: 0.971742 data: 0.000183 max mem: 18812 Epoch: [22/30] [1000/1251] eta: 0:03:59 lr: 0.000011 loss: 1.645990 (1.722445) time: 0.920707 data: 0.000170 max mem: 18812 Epoch: [22/30] [1050/1251] eta: 0:03:12 lr: 0.000011 loss: 1.704033 (1.722857) time: 0.934057 data: 0.000180 max mem: 18812 Epoch: [22/30] [1100/1251] eta: 0:02:24 lr: 0.000011 loss: 1.711836 (1.722783) time: 0.973201 data: 0.000179 max mem: 18812 Epoch: [22/30] [1150/1251] eta: 0:01:36 lr: 0.000011 loss: 1.737943 (1.722052) time: 0.965591 data: 0.000173 max mem: 18812 Epoch: [22/30] 
[1200/1251] eta: 0:00:48 lr: 0.000011 loss: 1.691854 (1.722576) time: 0.953434 data: 0.000164 max mem: 18812 Epoch: [22/30] [1250/1251] eta: 0:00:00 lr: 0.000011 loss: 1.643645 (1.720964) time: 0.924074 data: 0.000792 max mem: 18812 Epoch: [22/30] Total time: 0:19:56 (0.956548 s / it) Averaged stats: lr: 0.000011 loss: 1.643645 (1.717237) Test: [ 0/49] eta: 0:01:15 loss: 0.342112 (0.342112) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.544604 data: 1.109595 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.521951 (0.544267) acc1: 85.937500 (86.221591) acc5: 98.437500 (98.011364) time: 0.481145 data: 0.101036 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.592641 (0.571373) acc1: 84.375000 (85.565476) acc5: 96.875000 (97.842262) time: 0.378094 data: 0.000156 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.590415 (0.574533) acc1: 84.375000 (85.282258) acc5: 98.437500 (98.034274) time: 0.371295 data: 0.000142 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.590415 (0.588618) acc1: 84.375000 (85.327744) acc5: 96.875000 (97.789634) time: 0.358638 data: 0.000132 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.598543 (0.591258) acc1: 84.375000 (85.472000) acc5: 96.875000 (97.760000) time: 0.353652 data: 0.000104 max mem: 18812 Test: Total time: 0:00:19 (0.391391 s / it) * Acc@1 85.612 Acc@5 97.822 loss 0.589 Max accuracy: 85.62% Epoch: [23/30] [ 0/1251] eta: 0:42:07 lr: 0.000011 loss: 1.786645 (1.786645) time: 2.020348 data: 1.116016 max mem: 18812 Epoch: [23/30] [ 50/1251] eta: 0:19:29 lr: 0.000011 loss: 1.710497 (1.718724) time: 0.977257 data: 0.000162 max mem: 18812 Epoch: [23/30] [ 100/1251] eta: 0:18:32 lr: 0.000011 loss: 1.659574 (1.710675) time: 0.978427 data: 0.000181 max mem: 18812 Epoch: [23/30] [ 150/1251] eta: 0:17:33 lr: 0.000011 loss: 1.708647 (1.707133) time: 0.964568 data: 0.000182 max mem: 18812 Epoch: [23/30] [ 200/1251] eta: 0:16:43 lr: 0.000010 loss: 1.646135 (1.716065) time: 0.925768 data: 0.000190 max mem: 18812 Epoch: 
[23/30] [ 250/1251] eta: 0:15:59 lr: 0.000010 loss: 1.688911 (1.720339) time: 0.916402 data: 0.000193 max mem: 18812 Epoch: [23/30] [ 300/1251] eta: 0:15:11 lr: 0.000010 loss: 1.706838 (1.720633) time: 0.981502 data: 0.000188 max mem: 18812 Epoch: [23/30] [ 350/1251] eta: 0:14:21 lr: 0.000010 loss: 1.772202 (1.725239) time: 0.979924 data: 0.000171 max mem: 18812 Epoch: [23/30] [ 400/1251] eta: 0:13:34 lr: 0.000010 loss: 1.616986 (1.726111) time: 1.021550 data: 0.000183 max mem: 18812 Epoch: [23/30] [ 450/1251] eta: 0:12:45 lr: 0.000010 loss: 1.618611 (1.720633) time: 0.970786 data: 0.000176 max mem: 18812 Epoch: [23/30] [ 500/1251] eta: 0:11:56 lr: 0.000010 loss: 1.784245 (1.724029) time: 0.908605 data: 0.000170 max mem: 18812 Epoch: [23/30] [ 550/1251] eta: 0:11:10 lr: 0.000010 loss: 1.726896 (1.724807) time: 0.951325 data: 0.000180 max mem: 18812 Epoch: [23/30] [ 600/1251] eta: 0:10:22 lr: 0.000010 loss: 1.629350 (1.725615) time: 0.984482 data: 0.000174 max mem: 18812 Epoch: [23/30] [ 650/1251] eta: 0:09:35 lr: 0.000010 loss: 1.837631 (1.728127) time: 1.020352 data: 0.000163 max mem: 18812 Epoch: [23/30] [ 700/1251] eta: 0:08:46 lr: 0.000009 loss: 1.688645 (1.725753) time: 0.981304 data: 0.000168 max mem: 18812 Epoch: [23/30] [ 750/1251] eta: 0:07:58 lr: 0.000009 loss: 1.709640 (1.725817) time: 0.931901 data: 0.000185 max mem: 18812 Epoch: [23/30] [ 800/1251] eta: 0:07:11 lr: 0.000009 loss: 1.723961 (1.723757) time: 0.919457 data: 0.000187 max mem: 18812 Epoch: [23/30] [ 850/1251] eta: 0:06:23 lr: 0.000009 loss: 1.688346 (1.724044) time: 0.952130 data: 0.000177 max mem: 18812 Epoch: [23/30] [ 900/1251] eta: 0:05:35 lr: 0.000009 loss: 1.739145 (1.723593) time: 1.035365 data: 0.000175 max mem: 18812 Epoch: [23/30] [ 950/1251] eta: 0:04:47 lr: 0.000009 loss: 1.608363 (1.722322) time: 0.971384 data: 0.000182 max mem: 18812 Epoch: [23/30] [1000/1251] eta: 0:04:00 lr: 0.000009 loss: 1.697251 (1.719804) time: 0.933332 data: 0.000167 max mem: 18812 Epoch: [23/30] 
[1050/1251] eta: 0:03:12 lr: 0.000009 loss: 1.730965 (1.720538) time: 0.942913 data: 0.000171 max mem: 18812 Epoch: [23/30] [1100/1251] eta: 0:02:24 lr: 0.000009 loss: 1.695127 (1.719991) time: 0.981160 data: 0.000174 max mem: 18812 Epoch: [23/30] [1150/1251] eta: 0:01:36 lr: 0.000009 loss: 1.628804 (1.719850) time: 1.021422 data: 0.000176 max mem: 18812 Epoch: [23/30] [1200/1251] eta: 0:00:48 lr: 0.000008 loss: 1.717237 (1.719161) time: 0.988673 data: 0.000171 max mem: 18812 Epoch: [23/30] [1250/1251] eta: 0:00:00 lr: 0.000008 loss: 1.719431 (1.719961) time: 0.921679 data: 0.000914 max mem: 18812 Epoch: [23/30] Total time: 0:19:57 (0.957236 s / it) Averaged stats: lr: 0.000008 loss: 1.719431 (1.712951) Test: [ 0/49] eta: 0:01:15 loss: 0.343414 (0.343414) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.549233 data: 1.139113 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.522627 (0.543613) acc1: 85.937500 (86.363636) acc5: 98.437500 (98.011364) time: 0.476613 data: 0.103713 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.595375 (0.571892) acc1: 84.375000 (85.639881) acc5: 98.437500 (97.916667) time: 0.367917 data: 0.000151 max mem: 18812 Test: [30/49] eta: 0:00:08 loss: 0.592236 (0.573661) acc1: 84.375000 (85.332661) acc5: 98.437500 (98.034274) time: 0.452888 data: 0.000130 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.589384 (0.586895) acc1: 84.375000 (85.327744) acc5: 96.875000 (97.827744) time: 0.448203 data: 0.000122 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.589384 (0.588945) acc1: 84.375000 (85.504000) acc5: 96.875000 (97.760000) time: 0.354088 data: 0.000101 max mem: 18812 Test: Total time: 0:00:20 (0.424563 s / it) * Acc@1 85.608 Acc@5 97.818 loss 0.588 Max accuracy: 85.62% Epoch: [24/30] [ 0/1251] eta: 0:42:29 lr: 0.000008 loss: 1.851905 (1.851905) time: 2.037903 data: 1.139083 max mem: 18812 Epoch: [24/30] [ 50/1251] eta: 0:19:43 lr: 0.000008 loss: 1.760676 (1.729805) time: 0.966020 data: 0.000206 max mem: 18812 Epoch: 
[24/30] [ 100/1251] eta: 0:18:45 lr: 0.000008 loss: 1.716962 (1.727900) time: 1.030236 data: 0.000186 max mem: 18812 Epoch: [24/30] [ 150/1251] eta: 0:17:35 lr: 0.000008 loss: 1.626923 (1.720191) time: 0.909539 data: 0.000197 max mem: 18812 Epoch: [24/30] [ 200/1251] eta: 0:16:48 lr: 0.000008 loss: 1.647194 (1.712454) time: 0.913787 data: 0.000189 max mem: 18812 Epoch: [24/30] [ 250/1251] eta: 0:16:02 lr: 0.000008 loss: 1.759315 (1.709681) time: 0.930317 data: 0.000202 max mem: 18812 Epoch: [24/30] [ 300/1251] eta: 0:15:13 lr: 0.000008 loss: 1.700958 (1.711666) time: 0.976110 data: 0.000184 max mem: 18812 Epoch: [24/30] [ 350/1251] eta: 0:14:26 lr: 0.000008 loss: 1.683725 (1.709959) time: 1.023806 data: 0.000190 max mem: 18812 Epoch: [24/30] [ 400/1251] eta: 0:13:33 lr: 0.000008 loss: 1.667649 (1.708112) time: 0.906742 data: 0.000180 max mem: 18812 Epoch: [24/30] [ 450/1251] eta: 0:12:46 lr: 0.000007 loss: 1.645607 (1.711640) time: 0.921900 data: 0.000177 max mem: 18812 Epoch: [24/30] [ 500/1251] eta: 0:11:59 lr: 0.000007 loss: 1.686567 (1.713913) time: 0.930276 data: 0.000159 max mem: 18812 Epoch: [24/30] [ 550/1251] eta: 0:11:10 lr: 0.000007 loss: 1.707220 (1.717299) time: 0.974527 data: 0.000172 max mem: 18812 Epoch: [24/30] [ 600/1251] eta: 0:10:22 lr: 0.000007 loss: 1.718188 (1.717184) time: 0.953992 data: 0.000188 max mem: 18812 Epoch: [24/30] [ 650/1251] eta: 0:09:34 lr: 0.000007 loss: 1.756866 (1.716851) time: 0.922407 data: 0.000172 max mem: 18812 Epoch: [24/30] [ 700/1251] eta: 0:08:47 lr: 0.000007 loss: 1.668591 (1.715607) time: 0.943738 data: 0.000165 max mem: 18812 Epoch: [24/30] [ 750/1251] eta: 0:07:59 lr: 0.000007 loss: 1.701852 (1.713353) time: 0.926698 data: 0.000199 max mem: 18812 Epoch: [24/30] [ 800/1251] eta: 0:07:11 lr: 0.000007 loss: 1.702851 (1.713523) time: 0.973278 data: 0.000194 max mem: 18812 Epoch: [24/30] [ 850/1251] eta: 0:06:23 lr: 0.000007 loss: 1.705127 (1.713889) time: 0.961910 data: 0.000162 max mem: 18812 Epoch: [24/30] [ 
900/1251] eta: 0:05:35 lr: 0.000007 loss: 1.630045 (1.713570) time: 0.905123 data: 0.000188 max mem: 18812 Epoch: [24/30] [ 950/1251] eta: 0:04:47 lr: 0.000007 loss: 1.648326 (1.713910) time: 0.939325 data: 0.000181 max mem: 18812 Epoch: [24/30] [1000/1251] eta: 0:04:00 lr: 0.000006 loss: 1.628403 (1.712238) time: 0.932797 data: 0.000177 max mem: 18812 Epoch: [24/30] [1050/1251] eta: 0:03:12 lr: 0.000006 loss: 1.643457 (1.712545) time: 1.030507 data: 0.000162 max mem: 18812 Epoch: [24/30] [1100/1251] eta: 0:02:24 lr: 0.000006 loss: 1.649410 (1.712960) time: 0.951345 data: 0.000183 max mem: 18812 Epoch: [24/30] [1150/1251] eta: 0:01:36 lr: 0.000006 loss: 1.689739 (1.712256) time: 0.915553 data: 0.000159 max mem: 18812 Epoch: [24/30] [1200/1251] eta: 0:00:48 lr: 0.000006 loss: 1.682417 (1.712907) time: 0.926658 data: 0.000164 max mem: 18812 Epoch: [24/30] [1250/1251] eta: 0:00:00 lr: 0.000006 loss: 1.675373 (1.712838) time: 0.965974 data: 0.000788 max mem: 18812 Epoch: [24/30] Total time: 0:19:54 (0.954934 s / it) Averaged stats: lr: 0.000006 loss: 1.675373 (1.709825) Test: [ 0/49] eta: 0:01:24 loss: 0.334919 (0.334919) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.720886 data: 1.308816 max mem: 18812 Test: [10/49] eta: 0:00:19 loss: 0.529734 (0.538746) acc1: 85.937500 (86.505682) acc5: 98.437500 (97.727273) time: 0.492008 data: 0.119136 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.597628 (0.568815) acc1: 84.375000 (85.788690) acc5: 98.437500 (97.767857) time: 0.365558 data: 0.000151 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.593320 (0.572337) acc1: 84.375000 (85.332661) acc5: 98.437500 (97.883065) time: 0.368188 data: 0.000144 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.593320 (0.586189) acc1: 84.375000 (85.289634) acc5: 96.875000 (97.713415) time: 0.365428 data: 0.000150 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.600097 (0.588415) acc1: 84.375000 (85.440000) acc5: 96.875000 (97.664000) time: 0.353944 data: 0.000123 max 
mem: 18812 Test: Total time: 0:00:19 (0.393491 s / it) * Acc@1 85.634 Acc@5 97.818 loss 0.587 Max accuracy: 85.63% Epoch: [25/30] [ 0/1251] eta: 0:42:51 lr: 0.000006 loss: 1.796982 (1.796982) time: 2.055805 data: 1.167197 max mem: 18812 Epoch: [25/30] [ 50/1251] eta: 0:19:24 lr: 0.000006 loss: 1.705197 (1.733410) time: 0.962009 data: 0.000185 max mem: 18812 Epoch: [25/30] [ 100/1251] eta: 0:18:20 lr: 0.000006 loss: 1.685781 (1.726177) time: 0.929369 data: 0.000181 max mem: 18812 Epoch: [25/30] [ 150/1251] eta: 0:17:36 lr: 0.000006 loss: 1.707385 (1.716587) time: 0.937525 data: 0.000182 max mem: 18812 Epoch: [25/30] [ 200/1251] eta: 0:16:53 lr: 0.000006 loss: 1.709246 (1.719409) time: 0.987257 data: 0.000171 max mem: 18812 Epoch: [25/30] [ 250/1251] eta: 0:16:03 lr: 0.000006 loss: 1.723381 (1.718142) time: 1.010769 data: 0.000188 max mem: 18812 Epoch: [25/30] [ 300/1251] eta: 0:15:12 lr: 0.000006 loss: 1.704921 (1.714964) time: 0.973527 data: 0.000153 max mem: 18812 Epoch: [25/30] [ 350/1251] eta: 0:14:21 lr: 0.000006 loss: 1.712058 (1.712348) time: 0.921778 data: 0.000172 max mem: 18812 Epoch: [25/30] [ 400/1251] eta: 0:13:35 lr: 0.000005 loss: 1.754629 (1.710478) time: 0.932975 data: 0.000185 max mem: 18812 Epoch: [25/30] [ 450/1251] eta: 0:12:47 lr: 0.000005 loss: 1.710636 (1.706412) time: 0.996383 data: 0.000165 max mem: 18812 Epoch: [25/30] [ 500/1251] eta: 0:12:01 lr: 0.000005 loss: 1.685908 (1.708252) time: 1.027446 data: 0.000176 max mem: 18812 Epoch: [25/30] [ 550/1251] eta: 0:11:11 lr: 0.000005 loss: 1.705683 (1.709365) time: 0.948877 data: 0.000180 max mem: 18812 Epoch: [25/30] [ 600/1251] eta: 0:10:23 lr: 0.000005 loss: 1.713332 (1.711064) time: 0.922046 data: 0.000166 max mem: 18812 Epoch: [25/30] [ 650/1251] eta: 0:09:35 lr: 0.000005 loss: 1.642387 (1.711263) time: 0.934439 data: 0.000164 max mem: 18812 Epoch: [25/30] [ 700/1251] eta: 0:08:48 lr: 0.000005 loss: 1.700153 (1.711625) time: 0.973663 data: 0.000170 max mem: 18812 Epoch: [25/30] [ 750/1251] 
eta: 0:08:00 lr: 0.000005 loss: 1.629091 (1.709471) time: 1.063357 data: 0.000183 max mem: 18812 Epoch: [25/30] [ 800/1251] eta: 0:07:12 lr: 0.000005 loss: 1.705673 (1.708902) time: 0.975897 data: 0.000194 max mem: 18812 Epoch: [25/30] [ 850/1251] eta: 0:06:24 lr: 0.000005 loss: 1.725399 (1.710595) time: 0.919946 data: 0.000169 max mem: 18812 Epoch: [25/30] [ 900/1251] eta: 0:05:36 lr: 0.000005 loss: 1.720020 (1.709662) time: 0.919503 data: 0.000200 max mem: 18812 Epoch: [25/30] [ 950/1251] eta: 0:04:48 lr: 0.000005 loss: 1.696533 (1.708857) time: 0.958224 data: 0.000194 max mem: 18812 Epoch: [25/30] [1000/1251] eta: 0:04:00 lr: 0.000005 loss: 1.709342 (1.710664) time: 1.029966 data: 0.000163 max mem: 18812 Epoch: [25/30] [1050/1251] eta: 0:03:12 lr: 0.000004 loss: 1.688883 (1.709185) time: 0.922718 data: 0.000172 max mem: 18812 Epoch: [25/30] [1100/1251] eta: 0:02:24 lr: 0.000004 loss: 1.680670 (1.707982) time: 0.932244 data: 0.000167 max mem: 18812 Epoch: [25/30] [1150/1251] eta: 0:01:36 lr: 0.000004 loss: 1.671634 (1.707982) time: 0.937951 data: 0.000166 max mem: 18812 Epoch: [25/30] [1200/1251] eta: 0:00:48 lr: 0.000004 loss: 1.621385 (1.707251) time: 0.961322 data: 0.000181 max mem: 18812 Epoch: [25/30] [1250/1251] eta: 0:00:00 lr: 0.000004 loss: 1.649875 (1.706433) time: 1.000117 data: 0.000752 max mem: 18812 Epoch: [25/30] Total time: 0:20:00 (0.959292 s / it) Averaged stats: lr: 0.000004 loss: 1.649875 (1.708344) Test: [ 0/49] eta: 0:01:17 loss: 0.338944 (0.338944) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.585847 data: 1.129093 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.522907 (0.539107) acc1: 85.937500 (86.647727) acc5: 98.437500 (98.011364) time: 0.480084 data: 0.102814 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.594834 (0.569748) acc1: 85.937500 (85.863095) acc5: 96.875000 (97.842262) time: 0.364627 data: 0.000162 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.592423 (0.572795) acc1: 84.375000 (85.383065) acc5: 
98.437500 (97.983871) time: 0.363272 data: 0.000136 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.586037 (0.585958) acc1: 84.375000 (85.365854) acc5: 96.875000 (97.789634) time: 0.382347 data: 0.000123 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.605633 (0.587762) acc1: 84.375000 (85.568000) acc5: 96.875000 (97.760000) time: 0.376329 data: 0.000098 max mem: 18812 Test: Total time: 0:00:19 (0.397532 s / it) * Acc@1 85.648 Acc@5 97.838 loss 0.586 Max accuracy: 85.65% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0025.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_22kto1k_224_fp_30eps_re2/checkpoint_0025.pth Epoch: [26/30] [ 0/1251] eta: 0:47:18 lr: 0.000004 loss: 1.734438 (1.734438) time: 2.269037 data: 1.373828 max mem: 18812 Epoch: [26/30] [ 50/1251] eta: 0:19:13 lr: 0.000004 loss: 1.659621 (1.707327) time: 0.909288 data: 0.000211 max mem: 18812 Epoch: [26/30] [ 100/1251] eta: 0:18:20 lr: 0.000004 loss: 1.781090 (1.732906) time: 0.962012 data: 0.000203 max mem: 18812 Epoch: [26/30] [ 150/1251] eta: 0:17:33 lr: 0.000004 loss: 1.682000 (1.721015) time: 0.967029 data: 0.000181 max mem: 18812 Epoch: [26/30] [ 200/1251] eta: 0:16:40 lr: 0.000004 loss: 1.637964 (1.708290) time: 0.974646 data: 0.000211 max mem: 18812 Epoch: [26/30] [ 250/1251] eta: 0:15:52 lr: 0.000004 loss: 1.716527 (1.713497) time: 0.988714 data: 0.000185 max mem: 18812 Epoch: [26/30] [ 300/1251] eta: 0:15:03 lr: 0.000004 loss: 1.640757 (1.710326) time: 0.914495 data: 0.000183 max mem: 18812 Epoch: [26/30] [ 350/1251] eta: 0:14:18 lr: 0.000004 loss: 1.695570 (1.708419) time: 0.934666 data: 0.000167 max mem: 18812 Epoch: [26/30] [ 400/1251] eta: 0:13:31 lr: 0.000004 loss: 1.697181 (1.710832) time: 0.926298 data: 0.000215 max mem: 18812 Epoch: [26/30] [ 450/1251] eta: 0:12:44 lr: 0.000004 loss: 1.623442 (1.707774) time: 1.026006 data: 0.000200 max mem: 18812 Epoch: 
[26/30] [ 500/1251] eta: 0:11:55 lr: 0.000004 loss: 1.714599 (1.707357) time: 0.971069 data: 0.000227 max mem: 18812 Epoch: [26/30] [ 550/1251] eta: 0:11:08 lr: 0.000003 loss: 1.717962 (1.706068) time: 0.926143 data: 0.000210 max mem: 18812 Epoch: [26/30] [ 600/1251] eta: 0:10:21 lr: 0.000003 loss: 1.783836 (1.708837) time: 0.930148 data: 0.000203 max mem: 18812 Epoch: [26/30] [ 650/1251] eta: 0:09:34 lr: 0.000003 loss: 1.700233 (1.707005) time: 0.930092 data: 0.000208 max mem: 18812 Epoch: [26/30] [ 700/1251] eta: 0:08:46 lr: 0.000003 loss: 1.729394 (1.708885) time: 1.039135 data: 0.000214 max mem: 18812 Epoch: [26/30] [ 750/1251] eta: 0:07:58 lr: 0.000003 loss: 1.673200 (1.708304) time: 0.976434 data: 0.000233 max mem: 18812 Epoch: [26/30] [ 800/1251] eta: 0:07:10 lr: 0.000003 loss: 1.615154 (1.706600) time: 0.910099 data: 0.000225 max mem: 18812 Epoch: [26/30] [ 850/1251] eta: 0:06:23 lr: 0.000003 loss: 1.714345 (1.706266) time: 0.932523 data: 0.000215 max mem: 18812 Epoch: [26/30] [ 900/1251] eta: 0:05:35 lr: 0.000003 loss: 1.703850 (1.707977) time: 0.924354 data: 0.000238 max mem: 18812 Epoch: [26/30] [ 950/1251] eta: 0:04:47 lr: 0.000003 loss: 1.690216 (1.707746) time: 1.020622 data: 0.000223 max mem: 18812 Epoch: [26/30] [1000/1251] eta: 0:03:59 lr: 0.000003 loss: 1.688014 (1.708196) time: 0.991121 data: 0.000205 max mem: 18812 Epoch: [26/30] [1050/1251] eta: 0:03:12 lr: 0.000003 loss: 1.707736 (1.709893) time: 0.924256 data: 0.000203 max mem: 18812 Epoch: [26/30] [1100/1251] eta: 0:02:24 lr: 0.000003 loss: 1.687302 (1.708839) time: 0.933238 data: 0.000205 max mem: 18812 Epoch: [26/30] [1150/1251] eta: 0:01:36 lr: 0.000003 loss: 1.741980 (1.709103) time: 0.955529 data: 0.000212 max mem: 18812 Epoch: [26/30] [1200/1251] eta: 0:00:48 lr: 0.000003 loss: 1.662614 (1.708201) time: 1.015108 data: 0.000208 max mem: 18812 Epoch: [26/30] [1250/1251] eta: 0:00:00 lr: 0.000003 loss: 1.740089 (1.709805) time: 0.968149 data: 0.000860 max mem: 18812 Epoch: [26/30] Total 
time: 0:19:56 (0.956506 s / it) Averaged stats: lr: 0.000003 loss: 1.740089 (1.706568) Test: [ 0/49] eta: 0:01:16 loss: 0.339082 (0.339082) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.564917 data: 1.115362 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.524004 (0.538648) acc1: 85.937500 (86.647727) acc5: 98.437500 (98.011364) time: 0.474168 data: 0.101534 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.591808 (0.569445) acc1: 85.937500 (85.714286) acc5: 98.437500 (97.916667) time: 0.362955 data: 0.000143 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.589390 (0.572656) acc1: 84.375000 (85.383065) acc5: 98.437500 (97.983871) time: 0.360691 data: 0.000134 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.589390 (0.585980) acc1: 84.375000 (85.365854) acc5: 96.875000 (97.827744) time: 0.358565 data: 0.000124 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.603338 (0.588034) acc1: 84.375000 (85.536000) acc5: 96.875000 (97.760000) time: 0.374920 data: 0.000101 max mem: 18812 Test: Total time: 0:00:19 (0.394535 s / it) * Acc@1 85.638 Acc@5 97.840 loss 0.587 Max accuracy: 85.65% Epoch: [27/30] [ 0/1251] eta: 0:40:20 lr: 0.000003 loss: 1.762790 (1.762790) time: 1.934904 data: 1.045201 max mem: 18812 Epoch: [27/30] [ 50/1251] eta: 0:19:50 lr: 0.000003 loss: 1.665527 (1.700779) time: 0.938742 data: 0.000192 max mem: 18812 Epoch: [27/30] [ 100/1251] eta: 0:18:40 lr: 0.000003 loss: 1.677129 (1.692109) time: 0.970864 data: 0.000176 max mem: 18812 Epoch: [27/30] [ 150/1251] eta: 0:17:43 lr: 0.000003 loss: 1.648490 (1.688668) time: 1.000306 data: 0.000181 max mem: 18812 Epoch: [27/30] [ 200/1251] eta: 0:16:46 lr: 0.000003 loss: 1.654902 (1.687839) time: 0.912811 data: 0.000175 max mem: 18812 Epoch: [27/30] [ 250/1251] eta: 0:15:58 lr: 0.000002 loss: 1.663283 (1.689269) time: 0.931006 data: 0.000188 max mem: 18812 Epoch: [27/30] [ 300/1251] eta: 0:15:12 lr: 0.000002 loss: 1.759680 (1.698632) time: 0.932768 data: 0.000161 max mem: 18812 Epoch: [27/30] [ 
350/1251] eta: 0:14:25 lr: 0.000002 loss: 1.761722 (1.697701) time: 0.981832 data: 0.000184 max mem: 18812 Epoch: [27/30] [ 400/1251] eta: 0:13:35 lr: 0.000002 loss: 1.727931 (1.702082) time: 0.966680 data: 0.000169 max mem: 18812 Epoch: [27/30] [ 450/1251] eta: 0:12:46 lr: 0.000002 loss: 1.707126 (1.706830) time: 0.913556 data: 0.000169 max mem: 18812 Epoch: [27/30] [ 500/1251] eta: 0:11:58 lr: 0.000002 loss: 1.696238 (1.705389) time: 0.927077 data: 0.000174 max mem: 18812 Epoch: [27/30] [ 550/1251] eta: 0:11:11 lr: 0.000002 loss: 1.665804 (1.706490) time: 0.932338 data: 0.000180 max mem: 18812 Epoch: [27/30] [ 600/1251] eta: 0:10:23 lr: 0.000002 loss: 1.625869 (1.703503) time: 0.984204 data: 0.000168 max mem: 18812 Epoch: [27/30] [ 650/1251] eta: 0:09:34 lr: 0.000002 loss: 1.761680 (1.703286) time: 0.958074 data: 0.000161 max mem: 18812 Epoch: [27/30] [ 700/1251] eta: 0:08:46 lr: 0.000002 loss: 1.745652 (1.703281) time: 0.909127 data: 0.000168 max mem: 18812 Epoch: [27/30] [ 750/1251] eta: 0:07:59 lr: 0.000002 loss: 1.715998 (1.704091) time: 0.945934 data: 0.000175 max mem: 18812 Epoch: [27/30] [ 800/1251] eta: 0:07:11 lr: 0.000002 loss: 1.723595 (1.705790) time: 0.925629 data: 0.000192 max mem: 18812 Epoch: [27/30] [ 850/1251] eta: 0:06:23 lr: 0.000002 loss: 1.706996 (1.706418) time: 1.016430 data: 0.000197 max mem: 18812 Epoch: [27/30] [ 900/1251] eta: 0:05:35 lr: 0.000002 loss: 1.629060 (1.704213) time: 0.960542 data: 0.000197 max mem: 18812 Epoch: [27/30] [ 950/1251] eta: 0:04:47 lr: 0.000002 loss: 1.651234 (1.703974) time: 0.910913 data: 0.000187 max mem: 18812 Epoch: [27/30] [1000/1251] eta: 0:03:59 lr: 0.000002 loss: 1.623693 (1.702857) time: 0.935957 data: 0.000174 max mem: 18812 Epoch: [27/30] [1050/1251] eta: 0:03:12 lr: 0.000002 loss: 1.683679 (1.705848) time: 0.979747 data: 0.000171 max mem: 18812 Epoch: [27/30] [1100/1251] eta: 0:02:24 lr: 0.000002 loss: 1.662323 (1.705792) time: 1.040989 data: 0.000165 max mem: 18812 Epoch: [27/30] [1150/1251] eta: 
0:01:36 lr: 0.000002 loss: 1.667479 (1.706250) time: 0.971551 data: 0.000171 max mem: 18812 Epoch: [27/30] [1200/1251] eta: 0:00:48 lr: 0.000002 loss: 1.703604 (1.706286) time: 0.920181 data: 0.000173 max mem: 18812 Epoch: [27/30] [1250/1251] eta: 0:00:00 lr: 0.000002 loss: 1.629372 (1.705417) time: 0.933432 data: 0.000802 max mem: 18812 Epoch: [27/30] Total time: 0:19:57 (0.957461 s / it) Averaged stats: lr: 0.000002 loss: 1.629372 (1.706583) Test: [ 0/49] eta: 0:01:18 loss: 0.335474 (0.335474) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.606500 data: 1.184648 max mem: 18812 Test: [10/49] eta: 0:00:18 loss: 0.530632 (0.539723) acc1: 85.937500 (86.789773) acc5: 98.437500 (97.869318) time: 0.481134 data: 0.107842 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.597103 (0.569576) acc1: 85.937500 (85.863095) acc5: 98.437500 (97.842262) time: 0.364708 data: 0.000141 max mem: 18812 Test: [30/49] eta: 0:00:07 loss: 0.589198 (0.572691) acc1: 84.375000 (85.383065) acc5: 98.437500 (97.983871) time: 0.360739 data: 0.000133 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.589198 (0.585925) acc1: 84.375000 (85.365854) acc5: 96.875000 (97.789634) time: 0.367272 data: 0.000132 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.602265 (0.587761) acc1: 84.375000 (85.536000) acc5: 96.875000 (97.760000) time: 0.467673 data: 0.000103 max mem: 18812 Test: Total time: 0:00:21 (0.434815 s / it) * Acc@1 85.650 Acc@5 97.826 loss 0.586 Max accuracy: 85.65% Epoch: [28/30] [ 0/1251] eta: 0:43:55 lr: 0.000002 loss: 1.459154 (1.459154) time: 2.106955 data: 1.139308 max mem: 18812 Epoch: [28/30] [ 50/1251] eta: 0:19:54 lr: 0.000002 loss: 1.713498 (1.730267) time: 1.025668 data: 0.000203 max mem: 18812 Epoch: [28/30] [ 100/1251] eta: 0:18:31 lr: 0.000002 loss: 1.686409 (1.713216) time: 0.956538 data: 0.000205 max mem: 18812 Epoch: [28/30] [ 150/1251] eta: 0:17:33 lr: 0.000002 loss: 1.765357 (1.718123) time: 0.937853 data: 0.000199 max mem: 18812 Epoch: [28/30] [ 200/1251] 
eta: 0:16:48 lr: 0.000002 loss: 1.683870 (1.711956) time: 0.944517 data: 0.000182 max mem: 18812 Epoch: [28/30] [ 250/1251] eta: 0:15:59 lr: 0.000001 loss: 1.729153 (1.709968) time: 0.955511 data: 0.000190 max mem: 18812 Epoch: [28/30] [ 300/1251] eta: 0:15:12 lr: 0.000001 loss: 1.739377 (1.714914) time: 1.020921 data: 0.000161 max mem: 18812 Epoch: [28/30] [ 350/1251] eta: 0:14:21 lr: 0.000001 loss: 1.710580 (1.716010) time: 0.920510 data: 0.000169 max mem: 18812 Epoch: [28/30] [ 400/1251] eta: 0:13:34 lr: 0.000001 loss: 1.682553 (1.708826) time: 0.928307 data: 0.000190 max mem: 18812 Epoch: [28/30] [ 450/1251] eta: 0:12:48 lr: 0.000001 loss: 1.680127 (1.704697) time: 0.941867 data: 0.000170 max mem: 18812 Epoch: [28/30] [ 500/1251] eta: 0:12:00 lr: 0.000001 loss: 1.676027 (1.702510) time: 0.964458 data: 0.000177 max mem: 18812 Epoch: [28/30] [ 550/1251] eta: 0:11:13 lr: 0.000001 loss: 1.675440 (1.702251) time: 1.039173 data: 0.000178 max mem: 18812 Epoch: [28/30] [ 600/1251] eta: 0:10:23 lr: 0.000001 loss: 1.691373 (1.700937) time: 0.929983 data: 0.000172 max mem: 18812 Epoch: [28/30] [ 650/1251] eta: 0:09:35 lr: 0.000001 loss: 1.691321 (1.700928) time: 0.914082 data: 0.000159 max mem: 18812 Epoch: [28/30] [ 700/1251] eta: 0:08:47 lr: 0.000001 loss: 1.750878 (1.700353) time: 0.926554 data: 0.000153 max mem: 18812 Epoch: [28/30] [ 750/1251] eta: 0:07:59 lr: 0.000001 loss: 1.672876 (1.700218) time: 0.973147 data: 0.000194 max mem: 18812 Epoch: [28/30] [ 800/1251] eta: 0:07:11 lr: 0.000001 loss: 1.622330 (1.699492) time: 0.995197 data: 0.000169 max mem: 18812 Epoch: [28/30] [ 850/1251] eta: 0:06:23 lr: 0.000001 loss: 1.608969 (1.698093) time: 0.909335 data: 0.000173 max mem: 18812 Epoch: [28/30] [ 900/1251] eta: 0:05:35 lr: 0.000001 loss: 1.683781 (1.699757) time: 0.929681 data: 0.000187 max mem: 18812 Epoch: [28/30] [ 950/1251] eta: 0:04:47 lr: 0.000001 loss: 1.718425 (1.701949) time: 0.928190 data: 0.000190 max mem: 18812 Epoch: [28/30] [1000/1251] eta: 0:04:00 
lr: 0.000001 loss: 1.665921 (1.701414) time: 1.029601 data: 0.000165 max mem: 18812 Epoch: [28/30] [1050/1251] eta: 0:03:12 lr: 0.000001 loss: 1.688461 (1.701797) time: 1.019826 data: 0.000171 max mem: 18812 Epoch: [28/30] [1100/1251] eta: 0:02:24 lr: 0.000001 loss: 1.716565 (1.702659) time: 0.912386 data: 0.000155 max mem: 18812 Epoch: [28/30] [1150/1251] eta: 0:01:36 lr: 0.000001 loss: 1.627299 (1.701474) time: 0.935858 data: 0.000167 max mem: 18812 Epoch: [28/30] [1200/1251] eta: 0:00:48 lr: 0.000001 loss: 1.698768 (1.699832) time: 0.936241 data: 0.000161 max mem: 18812 Epoch: [28/30] [1250/1251] eta: 0:00:00 lr: 0.000001 loss: 1.669420 (1.700766) time: 0.987558 data: 0.000756 max mem: 18812 Epoch: [28/30] Total time: 0:19:59 (0.958836 s / it) Averaged stats: lr: 0.000001 loss: 1.669420 (1.701891) Test: [ 0/49] eta: 0:01:31 loss: 0.334000 (0.334000) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.876382 data: 1.411882 max mem: 18812 Test: [10/49] eta: 0:00:20 loss: 0.528347 (0.538228) acc1: 85.937500 (86.789773) acc5: 98.437500 (98.011364) time: 0.533537 data: 0.128489 max mem: 18812 Test: [20/49] eta: 0:00:13 loss: 0.597841 (0.568894) acc1: 84.375000 (85.863095) acc5: 98.437500 (97.916667) time: 0.382755 data: 0.000141 max mem: 18812 Test: [30/49] eta: 0:00:08 loss: 0.590128 (0.571843) acc1: 84.375000 (85.433468) acc5: 98.437500 (98.034274) time: 0.364018 data: 0.000129 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.590128 (0.585135) acc1: 84.375000 (85.403963) acc5: 96.875000 (97.827744) time: 0.362445 data: 0.000120 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.600563 (0.587077) acc1: 84.375000 (85.568000) acc5: 96.875000 (97.792000) time: 0.356523 data: 0.000100 max mem: 18812 Test: Total time: 0:00:19 (0.402765 s / it) * Acc@1 85.656 Acc@5 97.834 loss 0.586 Max accuracy: 85.66% Epoch: [29/30] [ 0/1251] eta: 0:41:35 lr: 0.000001 loss: 1.768033 (1.768033) time: 1.994500 data: 1.111808 max mem: 18812 Epoch: [29/30] [ 50/1251] eta: 
0:19:18 lr: 0.000001 loss: 1.774810 (1.724498) time: 0.905398 data: 0.000170 max mem: 18812 Epoch: [29/30] [ 100/1251] eta: 0:18:30 lr: 0.000001 loss: 1.760143 (1.731375) time: 0.932088 data: 0.000200 max mem: 18812 Epoch: [29/30] [ 150/1251] eta: 0:17:37 lr: 0.000001 loss: 1.739172 (1.731250) time: 0.962916 data: 0.000194 max mem: 18812 Epoch: [29/30] [ 200/1251] eta: 0:16:44 lr: 0.000001 loss: 1.660673 (1.723658) time: 0.975021 data: 0.000194 max mem: 18812 Epoch: [29/30] [ 250/1251] eta: 0:15:55 lr: 0.000001 loss: 1.638941 (1.719053) time: 0.926212 data: 0.000186 max mem: 18812 Epoch: [29/30] [ 300/1251] eta: 0:15:09 lr: 0.000001 loss: 1.646325 (1.719006) time: 0.916274 data: 0.000173 max mem: 18812 Epoch: [29/30] [ 350/1251] eta: 0:14:23 lr: 0.000001 loss: 1.664639 (1.715932) time: 0.980419 data: 0.000168 max mem: 18812 Epoch: [29/30] [ 400/1251] eta: 0:13:36 lr: 0.000001 loss: 1.637712 (1.720423) time: 1.005418 data: 0.000164 max mem: 18812 Epoch: [29/30] [ 450/1251] eta: 0:12:46 lr: 0.000001 loss: 1.733576 (1.723019) time: 0.963992 data: 0.000171 max mem: 18812 Epoch: [29/30] [ 500/1251] eta: 0:11:58 lr: 0.000001 loss: 1.684912 (1.723109) time: 0.920689 data: 0.000167 max mem: 18812 Epoch: [29/30] [ 550/1251] eta: 0:11:10 lr: 0.000001 loss: 1.776633 (1.720794) time: 0.901770 data: 0.000173 max mem: 18812 Epoch: [29/30] [ 600/1251] eta: 0:10:23 lr: 0.000001 loss: 1.631222 (1.716631) time: 0.977641 data: 0.000178 max mem: 18812 Epoch: [29/30] [ 650/1251] eta: 0:09:35 lr: 0.000001 loss: 1.676758 (1.717477) time: 0.998894 data: 0.000176 max mem: 18812 Epoch: [29/30] [ 700/1251] eta: 0:08:47 lr: 0.000001 loss: 1.770023 (1.716150) time: 0.956545 data: 0.000159 max mem: 18812 Epoch: [29/30] [ 750/1251] eta: 0:07:58 lr: 0.000001 loss: 1.729371 (1.713376) time: 0.928006 data: 0.000188 max mem: 18812 Epoch: [29/30] [ 800/1251] eta: 0:07:10 lr: 0.000001 loss: 1.620276 (1.711515) time: 0.913706 data: 0.000187 max mem: 18812 Epoch: [29/30] [ 850/1251] eta: 0:06:23 lr: 
0.000001 loss: 1.635838 (1.711388) time: 0.968346 data: 0.000186 max mem: 18812 Epoch: [29/30] [ 900/1251] eta: 0:05:35 lr: 0.000001 loss: 1.670895 (1.711056) time: 0.953673 data: 0.000184 max mem: 18812 Epoch: [29/30] [ 950/1251] eta: 0:04:47 lr: 0.000001 loss: 1.664864 (1.709395) time: 0.971885 data: 0.000187 max mem: 18812 Epoch: [29/30] [1000/1251] eta: 0:03:59 lr: 0.000001 loss: 1.726192 (1.710081) time: 0.910388 data: 0.000157 max mem: 18812 Epoch: [29/30] [1050/1251] eta: 0:03:12 lr: 0.000001 loss: 1.682515 (1.710389) time: 0.932505 data: 0.000167 max mem: 18812 Epoch: [29/30] [1100/1251] eta: 0:02:24 lr: 0.000001 loss: 1.619525 (1.708633) time: 0.970118 data: 0.000173 max mem: 18812 Epoch: [29/30] [1150/1251] eta: 0:01:36 lr: 0.000001 loss: 1.708485 (1.708428) time: 0.994211 data: 0.000160 max mem: 18812 Epoch: [29/30] [1200/1251] eta: 0:00:48 lr: 0.000001 loss: 1.690055 (1.708197) time: 0.967928 data: 0.000173 max mem: 18812 Epoch: [29/30] [1250/1251] eta: 0:00:00 lr: 0.000001 loss: 1.658445 (1.708959) time: 0.924267 data: 0.000761 max mem: 18812 Epoch: [29/30] Total time: 0:19:56 (0.956044 s / it) Averaged stats: lr: 0.000001 loss: 1.658445 (1.705704) Test: [ 0/49] eta: 0:01:29 loss: 0.331710 (0.331710) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.818547 data: 1.400391 max mem: 18812 Test: [10/49] eta: 0:00:20 loss: 0.526171 (0.538607) acc1: 85.937500 (86.647727) acc5: 98.437500 (98.011364) time: 0.513806 data: 0.127463 max mem: 18812 Test: [20/49] eta: 0:00:12 loss: 0.597193 (0.568948) acc1: 84.375000 (85.714286) acc5: 98.437500 (97.916667) time: 0.378249 data: 0.000145 max mem: 18812 Test: [30/49] eta: 0:00:08 loss: 0.588530 (0.571918) acc1: 84.375000 (85.332661) acc5: 98.437500 (97.983871) time: 0.370183 data: 0.000141 max mem: 18812 Test: [40/49] eta: 0:00:03 loss: 0.588530 (0.584994) acc1: 84.375000 (85.327744) acc5: 96.875000 (97.789634) time: 0.361425 data: 0.000136 max mem: 18812 Test: [48/49] eta: 0:00:00 loss: 0.601651 
(0.586979) acc1: 84.375000 (85.504000) acc5: 96.875000 (97.792000) time: 0.353406 data: 0.000107 max mem: 18812 Test: Total time: 0:00:19 (0.398713 s / it) * Acc@1 85.672 Acc@5 97.830 loss 0.586 Max accuracy: 85.67%