Number of GFLOPs: 15.571533312 Number of million parameters: 88.673166 Start training for 300 epochs Epoch: [0/300] [ 0/1251] eta: 1:55:40 lr: 0.000002 loss: 7.036375 (7.036375) time: 5.547611 data: 2.038958 max mem: 17833 Epoch: [0/300] [ 50/1251] eta: 0:21:27 lr: 0.000006 loss: 7.056331 (7.081092) time: 0.946209 data: 0.000153 max mem: 18817 Epoch: [0/300] [ 100/1251] eta: 0:19:31 lr: 0.000010 loss: 6.986148 (7.054519) time: 0.976334 data: 0.000169 max mem: 18817 Epoch: [0/300] [ 150/1251] eta: 0:18:23 lr: 0.000014 loss: 6.955162 (7.030137) time: 0.986805 data: 0.000171 max mem: 18817 Epoch: [0/300] [ 200/1251] eta: 0:17:19 lr: 0.000018 loss: 6.931687 (7.008941) time: 0.972635 data: 0.000159 max mem: 18817 Epoch: [0/300] [ 250/1251] eta: 0:16:23 lr: 0.000022 loss: 6.894423 (6.991696) time: 0.926643 data: 0.000202 max mem: 18817 Epoch: [0/300] [ 300/1251] eta: 0:15:34 lr: 0.000026 loss: 6.906753 (6.977478) time: 0.938511 data: 0.000163 max mem: 18817 Epoch: [0/300] [ 350/1251] eta: 0:14:43 lr: 0.000030 loss: 6.884349 (6.965278) time: 0.936545 data: 0.000162 max mem: 18817 Epoch: [0/300] [ 400/1251] eta: 0:13:52 lr: 0.000034 loss: 6.894765 (6.955428) time: 0.979677 data: 0.000181 max mem: 18817 Epoch: [0/300] [ 450/1251] eta: 0:13:00 lr: 0.000038 loss: 6.865134 (6.946372) time: 0.968390 data: 0.000172 max mem: 18817 Epoch: [0/300] [ 500/1251] eta: 0:12:11 lr: 0.000042 loss: 6.863826 (6.938046) time: 0.954194 data: 0.000177 max mem: 18817 Epoch: [0/300] [ 550/1251] eta: 0:11:22 lr: 0.000046 loss: 6.866854 (6.931312) time: 0.947404 data: 0.000199 max mem: 18817 Epoch: [0/300] [ 600/1251] eta: 0:10:33 lr: 0.000050 loss: 6.842896 (6.924275) time: 0.934252 data: 0.000186 max mem: 18817 Epoch: [0/300] [ 650/1251] eta: 0:09:44 lr: 0.000054 loss: 6.803355 (6.918465) time: 0.985098 data: 0.000164 max mem: 18817 Epoch: [0/300] [ 700/1251] eta: 0:08:54 lr: 0.000058 loss: 6.804402 (6.911852) time: 0.970378 data: 0.000172 max mem: 18817 Epoch: [0/300] [ 750/1251] eta: 0:08:06 lr: 0.000062 loss: 6.832589 (6.905943) time: 0.957178 data: 0.000148 max mem: 18817 Epoch: [0/300] [ 800/1251] eta: 0:07:16 lr: 0.000066 loss: 6.805838 (6.901089) time: 0.925060 data: 0.000165 max mem: 18817 Epoch: [0/300] [ 850/1251] eta: 0:06:28 lr: 0.000070 loss: 6.787982 (6.893580) time: 0.959596 data: 0.000261 max mem: 18817 Epoch: [0/300] [ 900/1251] eta: 0:05:39 lr: 0.000074 loss: 6.739362 (6.887362) time: 0.965969 data: 0.000166 max mem: 18817 Epoch: [0/300] [ 950/1251] eta: 0:04:51 lr: 0.000078 loss: 6.690752 (6.880587) time: 0.974560 data: 0.000170 max mem: 18817 Epoch: [0/300] [1000/1251] eta: 0:04:02 lr: 0.000082 loss: 6.789065 (6.875219) time: 0.966928 data: 0.000150 max mem: 18817 Epoch: [0/300] [1050/1251] eta: 0:03:14 lr: 0.000086 loss: 6.732517 (6.869471) time: 0.923193 data: 0.000179 max mem: 18817 Epoch: [0/300] [1100/1251] eta: 0:02:26 lr: 0.000090 loss: 6.719057 (6.863610) time: 0.937976 data: 0.000169 max mem: 18817 Epoch: [0/300] [1150/1251] eta: 0:01:37 lr: 0.000094 loss: 6.695404 (6.857845) time: 0.987002 data: 0.000173 max mem: 18817 Epoch: [0/300] [1200/1251] eta: 0:00:49 lr: 0.000098 loss: 6.754743 (6.852203) time: 0.953576 data: 0.000169 max mem: 18817 Epoch: [0/300] [1250/1251] eta: 0:00:00 lr: 0.000102 loss: 6.661842 (6.846202) time: 0.968746 data: 0.000717 max mem: 18817 Epoch: [0/300] Total time: 0:20:09 (0.967001 s / it) Averaged stats: lr: 0.000102 loss: 6.661842 (6.846860) Test: [ 0/49] eta: 0:01:27 loss: 6.172607 (6.172607) acc1: 1.562500 (1.562500) acc5: 7.812500 (7.812500) time: 1.783924 data: 1.385748 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 6.244751 (6.224086) acc1: 1.562500 (2.272727) acc5: 7.812500 (7.812500) time: 0.494223 data: 0.126123 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 6.261658 (6.243870) acc1: 1.562500 (2.008929) acc5: 7.812500 (7.440476) time: 0.363128 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 6.207916 (6.223543) acc1: 1.562500 (2.368952) acc5: 7.812500 (7.610887) time: 0.368225 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 6.214630 (6.236816) acc1: 3.125000 (2.439024) acc5: 6.250000 (7.355183) time: 0.447014 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 6.239624 (6.245136) acc1: 1.562500 (2.272000) acc5: 6.250000 (7.360000) time: 0.449182 data: 0.000105 max mem: 18817 Test: Total time: 0:00:21 (0.430071 s / it) * Acc@1 2.302 Acc@5 7.582 loss 6.238 Max accuracy: 2.30% Epoch: [1/300] [ 0/1251] eta: 0:41:14 lr: 0.000102 loss: 6.588513 (6.588513) time: 1.978348 data: 1.062110 max mem: 18817 Epoch: [1/300] [ 50/1251] eta: 0:19:45 lr: 0.000106 loss: 6.689775 (6.683794) time: 0.941148 data: 0.000174 max mem: 18817 Epoch: [1/300] [ 100/1251] eta: 0:18:44 lr: 0.000110 loss: 6.642652 (6.665487) time: 0.989713 data: 0.000158 max mem: 18817 Epoch: [1/300] [ 150/1251] eta: 0:17:49 lr: 0.000114 loss: 6.672263 (6.666143) time: 1.012462 data: 0.000172 max mem: 18817 Epoch: [1/300] [ 200/1251] eta: 0:17:00 lr: 0.000118 loss: 6.646901 (6.664597) time: 1.002742 data: 0.000164 max mem: 18817 Epoch: [1/300] [ 250/1251] eta: 0:16:08 lr: 0.000122 loss: 6.619387 (6.653460) time: 0.930761 data: 0.000175 max mem: 18817 Epoch: [1/300] [ 300/1251] eta: 0:15:21 lr: 0.000126 loss: 6.622009 (6.648337) time: 0.933474 data: 0.000171 max mem: 18817 Epoch: [1/300] [ 350/1251] eta: 0:14:32 lr: 0.000130 loss: 6.610332 (6.640390) time: 0.988860 data: 0.000182 max mem: 18817 Epoch: [1/300] [ 400/1251] eta: 0:13:45 lr: 0.000134 loss: 6.467791 (6.630942) time: 1.031376 data: 0.000187 max mem: 18817 Epoch: [1/300] [ 450/1251] eta: 0:12:55 lr: 0.000138 loss: 6.589191 (6.622450) time: 0.981490 data: 0.000173 max mem: 18817 Epoch: [1/300] [ 500/1251] eta: 0:12:06 lr: 0.000142 loss: 6.513719 (6.614518) time: 0.926334 data: 0.000176 max mem: 18817 Epoch: [1/300] [ 550/1251] eta: 0:11:17 lr: 0.000146 loss: 6.563430 (6.609263) time: 0.935483 data: 0.000175 max mem: 18817 Epoch: [1/300] [ 600/1251] eta: 0:10:29 lr: 0.000150 loss: 6.498859 (6.600625) time: 1.000579 data: 0.000174 max mem: 18817 Epoch: [1/300] [ 650/1251] eta: 0:09:41 lr: 0.000154 loss: 6.488575 (6.593075) time: 1.044934 data: 0.000160 max mem: 18817 Epoch: [1/300] [ 700/1251] eta: 0:08:52 lr: 0.000158 loss: 6.445538 (6.585172) time: 0.977348 data: 0.000155 max mem: 18817 Epoch: [1/300] [ 750/1251] eta: 0:08:03 lr: 0.000162 loss: 6.473807 (6.575279) time: 0.930567 data: 0.000178 max mem: 18817 Epoch: [1/300] [ 800/1251] eta: 0:07:15 lr: 0.000166 loss: 6.465275 (6.570651) time: 0.925647 data: 0.000175 max mem: 18817 Epoch: [1/300] [ 850/1251] eta: 0:06:27 lr: 0.000170 loss: 6.344679 (6.562184) time: 1.010529 data: 0.000166 max mem: 18817 Epoch: [1/300] [ 900/1251] eta: 0:05:39 lr: 0.000174 loss: 6.537063 (6.557608) time: 0.965343 data: 0.000174 max mem: 18817 Epoch: [1/300] [ 950/1251] eta: 0:04:50 lr: 0.000178 loss: 6.422471 (6.552463) time: 0.966331 data: 0.000177 max mem: 18817 Epoch: [1/300] [1000/1251] eta: 0:04:02 lr: 0.000182 loss: 6.331683 (6.542575) time: 0.939640 data: 0.000157 max mem: 18817 Epoch: [1/300] [1050/1251] eta: 0:03:14 lr: 0.000186 loss: 6.395842 (6.535122) time: 0.928946 data: 0.000180 max mem: 18817 Epoch: [1/300] [1100/1251] eta: 0:02:25 lr: 0.000190 loss: 6.327641 (6.530443) time: 1.014364 data: 0.000196 max mem: 18817 Epoch: [1/300] [1150/1251] eta: 0:01:37 lr: 0.000194 loss: 6.345323 (6.523005) time: 0.967567 data: 0.000168 max mem: 18817 Epoch: [1/300] [1200/1251] eta: 0:00:49 lr: 0.000198 loss: 6.488414 (6.518742) time: 0.969608 data: 0.000172 max mem: 18817 Epoch: [1/300] [1250/1251] eta: 0:00:00 lr: 0.000202 loss: 6.316538 (6.511197) time: 0.929628 data: 0.000753 max mem: 18817 Epoch: [1/300] Total time: 0:20:07 (0.965187 s / it) Averaged stats: lr: 0.000202 loss: 6.316538 (6.510880) Test: [ 0/49] eta: 0:01:18 loss: 5.238892 (5.238892) acc1: 7.812500 (7.812500) acc5: 23.437500 (23.437500) time: 1.598421 data: 1.187154 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 5.206131 (5.236242) acc1: 7.812500 (9.232955) acc5: 23.437500 (23.863636) time: 0.479826 data: 0.108051 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 5.248749 (5.308620) acc1: 7.812500 (8.407738) acc5: 21.875000 (22.842262) time: 0.364944 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 5.243011 (5.278266) acc1: 7.812500 (8.770161) acc5: 23.437500 (23.840726) time: 0.361079 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 5.254181 (5.291029) acc1: 7.812500 (8.803354) acc5: 23.437500 (23.399390) time: 0.359534 data: 0.000139 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 5.284576 (5.292247) acc1: 7.812500 (8.672000) acc5: 21.875000 (22.880000) time: 0.466403 data: 0.000117 max mem: 18817 Test: Total time: 0:00:21 (0.434144 s / it) * Acc@1 8.762 Acc@5 22.686 loss 5.274 Max accuracy: 8.76% Epoch: [2/300] [ 0/1251] eta: 0:41:30 lr: 0.000202 loss: 6.089674 (6.089674) time: 1.990829 data: 1.082603 max mem: 18817 Epoch: [2/300] [ 50/1251] eta: 0:19:55 lr: 0.000206 loss: 6.368994 (6.350705) time: 0.940707 data: 0.000163 max mem: 18817 Epoch: [2/300] [ 100/1251] eta: 0:18:48 lr: 0.000210 loss: 6.300272 (6.363552) time: 0.983793 data: 0.000174 max mem: 18817 Epoch: [2/300] [ 150/1251] eta: 0:17:52 lr: 0.000214 loss: 6.474376 (6.347586) time: 1.027818 data: 0.000169 max mem: 18817 Epoch: [2/300] [ 200/1251] eta: 0:16:59 lr: 0.000218 loss: 6.421201 (6.353800) time: 0.995703 data: 0.000165 max mem: 18817 Epoch: [2/300] [ 250/1251] eta: 0:16:05 lr: 0.000222 loss: 6.211859 (6.340772) time: 0.918242 data: 0.000165 max mem: 18817 Epoch: [2/300] [ 300/1251] eta: 0:15:18 lr: 0.000226 loss: 6.217921 (6.327489) time: 0.927558 data: 0.000169 max mem: 18817 Epoch: [2/300] [ 350/1251] eta: 0:14:29 lr: 0.000230 loss: 6.180796 (6.313263) time: 0.979643 data: 0.000175 max mem: 18817 Epoch: [2/300] [ 400/1251] eta: 0:13:42 lr: 0.000234 loss: 6.201005 (6.296158) time: 1.038022 data: 0.000169 max mem: 18817 Epoch: [2/300] [ 450/1251] eta: 0:12:52 lr: 0.000238 loss: 6.235687 (6.289844) time: 0.967904 data: 0.000199 max mem: 18817 Epoch: [2/300] [ 500/1251] eta: 0:12:02 lr: 0.000242 loss: 6.369238 (6.284503) time: 0.927699 data: 0.000170 max mem: 18817 Epoch: [2/300] [ 550/1251] eta: 0:11:15 lr: 0.000246 loss: 6.367862 (6.278116) time: 0.929354 data: 0.000208 max mem: 18817 Epoch: [2/300] [ 600/1251] eta: 0:10:27 lr: 0.000250 loss: 6.378255 (6.271038) time: 0.967282 data: 0.000176 max mem: 18817 Epoch: [2/300] [ 650/1251] eta: 0:09:40 lr: 0.000254 loss: 6.258655 (6.264558) time: 1.042222 data: 0.000166 max mem: 18817 Epoch: [2/300] [ 700/1251] eta: 0:08:52 lr: 0.000258 loss: 6.318496 (6.260676) time: 0.990889 data: 0.000168 max mem: 18817 Epoch: [2/300] [ 750/1251] eta: 0:08:03 lr: 0.000262 loss: 6.311958 (6.257875) time: 0.918548 data: 0.000186 max mem: 18817 Epoch: [2/300] [ 800/1251] eta: 0:07:15 lr: 0.000266 loss: 6.269173 (6.252478) time: 0.927759 data: 0.000181 max mem: 18817 Epoch: [2/300] [ 850/1251] eta: 0:06:26 lr: 0.000270 loss: 6.224512 (6.246138) time: 0.989446 data: 0.000171 max mem: 18817 Epoch: [2/300] [ 900/1251] eta: 0:05:38 lr: 0.000274 loss: 6.149778 (6.238574) time: 0.973495 data: 0.000183 max mem: 18817 Epoch: [2/300] [ 950/1251] eta: 0:04:50 lr: 0.000278 loss: 6.103992 (6.233317) time: 0.988464 data: 0.000174 max mem: 18817 Epoch: [2/300] [1000/1251] eta: 0:04:01 lr: 0.000282 loss: 6.104916 (6.225448) time: 0.916521 data: 0.000162 max mem: 18817 Epoch: [2/300] [1050/1251] eta: 0:03:13 lr: 0.000286 loss: 6.016006 (6.218242) time: 0.941317 data: 0.000196 max mem: 18817 Epoch: [2/300] [1100/1251] eta: 0:02:25 lr: 0.000290 loss: 6.179663 (6.213697) time: 0.981412 data: 0.000176 max mem: 18817 Epoch: [2/300] [1150/1251] eta: 0:01:37 lr: 0.000294 loss: 6.172150 (6.210308) time: 0.950492 data: 0.000177 max mem: 18817 Epoch: [2/300] [1200/1251] eta: 0:00:49 lr: 0.000298 loss: 6.040691 (6.204103) time: 0.981605 data: 0.000177 max mem: 18817 Epoch: [2/300] [1250/1251] eta: 0:00:00 lr: 0.000302 loss: 6.125205 (6.197365) time: 0.919926 data: 0.000755 max mem: 18817 Epoch: [2/300] Total time: 0:20:05 (0.963966 s / it) Averaged stats: lr: 0.000302 loss: 6.125205 (6.195266) Test: [ 0/49] eta: 0:01:20 loss: 4.247444 (4.247444) acc1: 18.750000 (18.750000) acc5: 31.250000 (31.250000) time: 1.650409 data: 1.198769 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 4.317692 (4.372128) acc1: 18.750000 (16.619318) acc5: 37.500000 (37.500000) time: 0.487371 data: 0.109139 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 4.446503 (4.458847) acc1: 18.750000 (17.187500) acc5: 37.500000 (37.127976) time: 0.366830 data: 0.000163 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 4.432492 (4.411307) acc1: 18.750000 (18.245968) acc5: 37.500000 (38.256048) time: 0.363249 data: 0.000142 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 4.383125 (4.410973) acc1: 17.187500 (18.292683) acc5: 40.625000 (38.262195) time: 0.365705 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 4.383713 (4.412019) acc1: 18.750000 (18.432000) acc5: 40.625000 (38.368000) time: 0.463739 data: 0.000106 max mem: 18817 Test: Total time: 0:00:21 (0.435421 s / it) * Acc@1 17.726 Acc@5 38.210 loss 4.387 Max accuracy: 17.73% Epoch: [3/300] [ 0/1251] eta: 0:44:24 lr: 0.000302 loss: 5.545636 (5.545636) time: 2.130053 data: 1.222149 max mem: 18817 Epoch: [3/300] [ 50/1251] eta: 0:20:04 lr: 0.000306 loss: 6.100317 (6.009167) time: 0.917384 data: 0.000156 max mem: 18817 Epoch: [3/300] [ 100/1251] eta: 0:18:52 lr: 0.000310 loss: 6.200934 (6.022875) time: 0.980721 data: 0.000161 max mem: 18817 Epoch: [3/300] [ 150/1251] eta: 0:17:54 lr: 0.000314 loss: 6.069191 (5.989395) time: 0.962651 data: 0.000163 max mem: 18817 Epoch: [3/300] [ 200/1251] eta: 0:16:59 lr: 0.000318 loss: 5.798213 (5.975116) time: 0.989723 data: 0.000181 max mem: 18817 Epoch: [3/300] [ 250/1251] eta: 0:16:07 lr: 0.000322 loss: 5.937340 (5.984918) time: 0.936412 data: 0.000168 max mem: 18817 Epoch: [3/300] [ 300/1251] eta: 0:15:19 lr: 0.000326 loss: 6.232008 (5.994251) time: 0.932668 data: 0.000163 max mem: 18817 Epoch: [3/300] [ 350/1251] eta: 0:14:32 lr: 0.000330 loss: 5.930003 (5.975913) time: 1.022245 data: 0.000177 max mem: 18817 Epoch: [3/300] [ 400/1251] eta: 0:13:44 lr: 0.000334 loss: 5.992131 (5.973250) time: 0.967867 data: 0.000186 max mem: 18817 Epoch: [3/300] [ 450/1251] eta: 0:12:53 lr: 0.000338 loss: 5.784190 (5.971660) time: 0.972318 data: 0.000172 max mem: 18817 Epoch: [3/300] [ 500/1251] eta: 0:12:03 lr: 0.000342 loss: 6.045605 (5.968507) time: 0.920357 data: 0.000169 max mem: 18817 Epoch: [3/300] [ 550/1251] eta: 0:11:16 lr: 0.000346 loss: 5.970082 (5.960481) time: 0.934237 data: 0.000165 max mem: 18817 Epoch: [3/300] [ 600/1251] eta: 0:10:28 lr: 0.000350 loss: 5.898058 (5.956081) time: 0.989108 data: 0.000169 max mem: 18817 Epoch: [3/300] [ 650/1251] eta: 0:09:39 lr: 0.000354 loss: 6.055794 (5.956589) time: 0.953848 data: 0.000168 max mem: 18817 Epoch: [3/300] [ 700/1251] eta: 0:08:50 lr: 0.000358 loss: 6.065936 (5.951980) time: 0.954413 data: 0.000171 max mem: 18817 Epoch: [3/300] [ 750/1251] eta: 0:08:02 lr: 0.000362 loss: 5.871759 (5.948303) time: 0.919197 data: 0.000164 max mem: 18817 Epoch: [3/300] [ 800/1251] eta: 0:07:14 lr: 0.000366 loss: 5.945711 (5.946155) time: 0.930356 data: 0.000171 max mem: 18817 Epoch: [3/300] [ 850/1251] eta: 0:06:26 lr: 0.000370 loss: 5.884489 (5.947310) time: 0.971619 data: 0.000160 max mem: 18817 Epoch: [3/300] [ 900/1251] eta: 0:05:37 lr: 0.000374 loss: 6.006259 (5.943203) time: 0.963732 data: 0.000174 max mem: 18817 Epoch: [3/300] [ 950/1251] eta: 0:04:49 lr: 0.000378 loss: 6.041401 (5.944135) time: 0.984386 data: 0.000166 max mem: 18817 Epoch: [3/300] [1000/1251] eta: 0:04:01 lr: 0.000382 loss: 5.948679 (5.939655) time: 0.914570 data: 0.000155 max mem: 18817 Epoch: [3/300] [1050/1251] eta: 0:03:13 lr: 0.000386 loss: 5.985926 (5.933377) time: 0.925842 data: 0.000164 max mem: 18817 Epoch: [3/300] [1100/1251] eta: 0:02:25 lr: 0.000390 loss: 6.011533 (5.926690) time: 0.995923 data: 0.000175 max mem: 18817 Epoch: [3/300] [1150/1251] eta: 0:01:37 lr: 0.000394 loss: 5.648766 (5.915095) time: 0.974981 data: 0.000173 max mem: 18817 Epoch: [3/300] [1200/1251] eta: 0:00:49 lr: 0.000398 loss: 5.831659 (5.912111) time: 0.987548 data: 0.000177 max mem: 18817 Epoch: [3/300] [1250/1251] eta: 0:00:00 lr: 0.000402 loss: 5.886444 (5.909292) time: 0.924835 data: 0.000745 max mem: 18817 Epoch: [3/300] Total time: 0:20:04 (0.962438 s / it) Averaged stats: lr: 0.000402 loss: 5.886444 (5.908455) Test: [ 0/49] eta: 0:01:19 loss: 3.685247 (3.685247) acc1: 28.125000 (28.125000) acc5: 51.562500 (51.562500) time: 1.622668 data: 1.186368 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 3.698172 (3.754427) acc1: 26.562500 (24.857955) acc5: 48.437500 (49.147727) time: 0.483336 data: 0.108003 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 3.896671 (3.839066) acc1: 25.000000 (25.223214) acc5: 45.312500 (47.172619) time: 0.365075 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 3.811268 (3.798585) acc1: 26.562500 (25.806452) acc5: 48.437500 (48.437500) time: 0.361409 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 3.791145 (3.789254) acc1: 28.125000 (26.448171) acc5: 51.562500 (49.542683) time: 0.368979 data: 0.000123 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 3.791145 (3.786191) acc1: 26.562500 (26.304000) acc5: 51.562500 (49.824000) time: 0.462281 data: 0.000106 max mem: 18817 Test: Total time: 0:00:21 (0.432639 s / it) * Acc@1 26.188 Acc@5 50.408 loss 3.770 Max accuracy: 26.19% Epoch: [4/300] [ 0/1251] eta: 0:42:34 lr: 0.000402 loss: 6.207644 (6.207644) time: 2.041708 data: 1.045974 max mem: 18817 Epoch: [4/300] [ 50/1251] eta: 0:19:34 lr: 0.000406 loss: 5.791006 (5.725608) time: 0.974329 data: 0.000450 max mem: 18817 Epoch: [4/300] [ 100/1251] eta: 0:18:28 lr: 0.000410 loss: 5.863018 (5.731234) time: 0.978278 data: 0.000169 max mem: 18817 Epoch: [4/300] [ 150/1251] eta: 0:17:42 lr: 0.000414 loss: 5.723568 (5.739489) time: 0.988088 data: 0.000190 max mem: 18817 Epoch: [4/300] [ 200/1251] eta: 0:16:51 lr: 0.000418 loss: 5.883533 (5.748046) time: 0.941629 data: 0.000195 max mem: 18817 Epoch: [4/300] [ 250/1251] eta: 0:16:04 lr: 0.000422 loss: 5.965678 (5.761404) time: 0.933887 data: 0.000181 max mem: 18817 Epoch: [4/300] [ 300/1251] eta: 0:15:18 lr: 0.000426 loss: 5.925031 (5.765616) time: 0.988624 data: 0.000184 max mem: 18817 Epoch: [4/300] [ 350/1251] eta: 0:14:27 lr: 0.000430 loss: 5.972918 (5.759008) time: 0.970773 data: 0.000172 max mem: 18817 Epoch: [4/300] [ 400/1251] eta: 0:13:40 lr: 0.000434 loss: 5.947526 (5.747664) time: 0.980762 data: 0.000169 max mem: 18817 Epoch: [4/300] [ 450/1251] eta: 0:12:51 lr: 0.000438 loss: 5.800569 (5.740716) time: 0.944896 data: 0.000192 max mem: 18817 Epoch: [4/300] [ 500/1251] eta: 0:12:03 lr: 0.000442 loss: 5.737844 (5.735846) time: 0.931608 data: 0.000184 max mem: 18817 Epoch: [4/300] [ 550/1251] eta: 0:11:15 lr: 0.000446 loss: 5.715750 (5.725806) time: 0.998363 data: 0.000180 max mem: 18817 Epoch: [4/300] [ 600/1251] eta: 0:10:28 lr: 0.000450 loss: 5.797471 (5.722478) time: 1.021292 data: 0.000189 max mem: 18817 Epoch: [4/300] [ 650/1251] eta: 0:09:39 lr: 0.000454 loss: 5.784964 (5.721887) time: 0.986987 data: 0.000188 max mem: 18817 Epoch: [4/300] [ 700/1251] eta: 0:08:51 lr: 0.000458 loss: 5.784506 (5.717544) time: 0.938510 data: 0.000172 max mem: 18817 Epoch: [4/300] [ 750/1251] eta: 0:08:03 lr: 0.000462 loss: 5.590340 (5.709827) time: 0.935153 data: 0.000187 max mem: 18817 Epoch: [4/300] [ 800/1251] eta: 0:07:14 lr: 0.000466 loss: 5.663821 (5.705334) time: 0.964847 data: 0.000181 max mem: 18817 Epoch: [4/300] [ 850/1251] eta: 0:06:26 lr: 0.000470 loss: 5.587225 (5.701918) time: 1.018138 data: 0.000181 max mem: 18817 Epoch: [4/300] [ 900/1251] eta: 0:05:37 lr: 0.000474 loss: 5.550653 (5.705008) time: 0.955263 data: 0.000176 max mem: 18817 Epoch: [4/300] [ 950/1251] eta: 0:04:49 lr: 0.000478 loss: 5.754186 (5.705040) time: 0.942324 data: 0.000188 max mem: 18817 Epoch: [4/300] [1000/1251] eta: 0:04:01 lr: 0.000482 loss: 5.651780 (5.699811) time: 0.927793 data: 0.000176 max mem: 18817 Epoch: [4/300] [1050/1251] eta: 0:03:13 lr: 0.000486 loss: 5.539279 (5.691956) time: 0.979096 data: 0.000181 max mem: 18817 Epoch: [4/300] [1100/1251] eta: 0:02:25 lr: 0.000490 loss: 5.528618 (5.689824) time: 1.042889 data: 0.000180 max mem: 18817 Epoch: [4/300] [1150/1251] eta: 0:01:37 lr: 0.000494 loss: 5.332361 (5.681020) time: 0.975248 data: 0.000177 max mem: 18817 Epoch: [4/300] [1200/1251] eta: 0:00:49 lr: 0.000498 loss: 5.809560 (5.681022) time: 0.925290 data: 0.000173 max mem: 18817 Epoch: [4/300] [1250/1251] eta: 0:00:00 lr: 0.000502 loss: 5.776066 (5.675797) time: 0.928915 data: 0.000761 max mem: 18817 Epoch: [4/300] Total time: 0:20:05 (0.963261 s / it) Averaged stats: lr: 0.000502 loss: 5.776066 (5.673813) Test: [ 0/49] eta: 0:01:27 loss: 3.420202 (3.420202) acc1: 32.812500 (32.812500) acc5: 57.812500 (57.812500) time: 1.776981 data: 1.379112 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 3.333954 (3.310765) acc1: 32.812500 (32.386364) acc5: 57.812500 (57.670455) time: 0.496777 data: 0.125528 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 3.345182 (3.386261) acc1: 31.250000 (31.994048) acc5: 56.250000 (56.770833) time: 0.382747 data: 0.000155 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 3.345182 (3.349177) acc1: 31.250000 (32.711694) acc5: 57.812500 (57.711694) time: 0.379987 data: 0.000145 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 3.328353 (3.347778) acc1: 32.812500 (33.117378) acc5: 57.812500 (58.307927) time: 0.364416 data: 0.000145 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 3.380131 (3.341031) acc1: 32.812500 (33.088000) acc5: 57.812500 (58.592000) time: 0.359220 data: 0.000123 max mem: 18817 Test: Total time: 0:00:19 (0.401773 s / it) * Acc@1 33.376 Acc@5 58.884 loss 3.325 Max accuracy: 33.38% Epoch: [5/300] [ 0/1251] eta: 0:41:53 lr: 0.000502 loss: 5.825212 (5.825212) time: 2.009079 data: 1.116722 max mem: 18817 Epoch: [5/300] [ 50/1251] eta: 0:19:20 lr: 0.000506 loss: 5.846598 (5.725856) time: 0.973001 data: 0.000157 max mem: 18817 Epoch: [5/300] [ 100/1251] eta: 0:18:32 lr: 0.000510 loss: 5.577931 (5.633069) time: 0.956991 data: 0.000178 max mem: 18817 Epoch: [5/300] [ 150/1251] eta: 0:17:39 lr: 0.000514 loss: 5.560874 (5.625708) time: 0.928447 data: 0.000159 max mem: 18817 Epoch: [5/300] [ 200/1251] eta: 0:16:56 lr: 0.000518 loss: 5.517985 (5.587242) time: 0.946490 data: 0.000172 max mem: 18817 Epoch: [5/300] [ 250/1251] eta: 0:16:08 lr: 0.000522 loss: 5.620955 (5.563108) time: 0.988100 data: 0.000190 max mem: 18817 Epoch: [5/300] [ 300/1251] eta: 0:15:17 lr: 0.000526 loss: 5.732875 (5.565027) time: 0.978905 data: 0.000192 max mem: 18817 Epoch: [5/300] [ 350/1251] eta: 0:14:30 lr: 0.000530 loss: 5.540889 (5.558687) time: 0.973315 data: 0.000180 max mem: 18817 Epoch: [5/300] [ 400/1251] eta: 0:13:40 lr: 0.000534 loss: 5.511111 (5.551208) time: 0.925584 data: 0.000176 max mem: 18817 Epoch: [5/300] [ 450/1251] eta: 0:12:52 lr: 0.000538 loss: 5.691003 (5.540679) time: 0.935454 data: 0.000169 max mem: 18817 Epoch: [5/300] [ 500/1251] eta: 0:12:05 lr: 0.000542 loss: 5.245438 (5.539040) time: 0.982164 data: 0.000179 max mem: 18817 Epoch: [5/300] [ 550/1251] eta: 0:11:16 lr: 0.000546 loss: 5.692153 (5.532619) time: 0.993575 data: 0.000173 max mem: 18817 Epoch: [5/300] [ 600/1251] eta: 0:10:28 lr: 0.000550 loss: 5.315618 (5.522528) time: 0.977608 data: 0.000175 max mem: 18817 Epoch: [5/300] [ 650/1251] eta: 0:09:39 lr: 0.000554 loss: 5.280528 (5.513105) time: 0.933942 data: 0.000185 max mem: 18817 Epoch: [5/300] [ 700/1251] eta: 0:08:51 lr: 0.000558 loss: 5.312055 (5.503047) time: 0.925513 data: 0.000165 max mem: 18817 Epoch: [5/300] [ 750/1251] eta: 0:08:03 lr: 0.000562 loss: 5.025198 (5.489049) time: 0.983652 data: 0.000174 max mem: 18817 Epoch: [5/300] [ 800/1251] eta: 0:07:14 lr: 0.000566 loss: 5.553009 (5.484251) time: 0.962269 data: 0.000164 max mem: 18817 Epoch: [5/300] [ 850/1251] eta: 0:06:26 lr: 0.000570 loss: 5.555139 (5.482889) time: 0.992028 data: 0.000171 max mem: 18817 Epoch: [5/300] [ 900/1251] eta: 0:05:38 lr: 0.000574 loss: 5.737312 (5.484991) time: 0.931553 data: 0.000155 max mem: 18817 Epoch: [5/300] [ 950/1251] eta: 0:04:50 lr: 0.000578 loss: 5.296133 (5.480257) time: 0.923526 data: 0.000157 max mem: 18817 Epoch: [5/300] [1000/1251] eta: 0:04:01 lr: 0.000582 loss: 5.566458 (5.478629) time: 0.965970 data: 0.000184 max mem: 18817 Epoch: [5/300] [1050/1251] eta: 0:03:13 lr: 0.000586 loss: 5.437805 (5.476505) time: 1.011320 data: 0.000181 max mem: 18817 Epoch: [5/300] [1100/1251] eta: 0:02:25 lr: 0.000590 loss: 5.622564 (5.473805) time: 0.967066 data: 0.000169 max mem: 18817 Epoch: [5/300] [1150/1251] eta: 0:01:37 lr: 0.000594 loss: 5.166223 (5.467062) time: 0.928782 data: 0.000169 max mem: 18817 Epoch: [5/300] [1200/1251] eta: 0:00:49 lr: 0.000598 loss: 5.378938 (5.463715) time: 0.942074 data: 0.000169 max mem: 18817 Epoch: [5/300] [1250/1251] eta: 0:00:00 lr: 0.000602 loss: 5.369455 (5.460087) time: 0.990380 data: 0.000962 max mem: 18817 Epoch: [5/300] Total time: 0:20:05 (0.963902 s / it) Averaged stats: lr: 0.000602 loss: 5.369455 (5.460836) Test: [ 0/49] eta: 0:01:31 loss: 2.922135 (2.922135) acc1: 43.750000 (43.750000) acc5: 60.937500 (60.937500) time: 1.872391 data: 1.366646 max mem: 18817 Test: [10/49] eta: 0:00:21 loss: 2.922135 (2.855969) acc1: 40.625000 (41.335227) acc5: 62.500000 (64.062500) time: 0.552996 data: 0.124402 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 2.962648 (2.966154) acc1: 39.062500 (38.616071) acc5: 62.500000 (63.169643) time: 0.410629 data: 0.000147 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 3.002999 (2.945964) acc1: 34.375000 (38.306452) acc5: 62.500000 (63.558468) time: 0.388208 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 3.002999 (2.945163) acc1: 39.062500 (38.795732) acc5: 64.062500 (63.833841) time: 0.367110 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 3.026263 (2.953055) acc1: 40.625000 (39.136000) acc5: 62.500000 (63.776000) time: 0.360485 data: 0.000111 max mem: 18817 Test: Total time: 0:00:20 (0.418491 s / it) * Acc@1 39.030 Acc@5 65.020 loss 2.920 Max accuracy: 39.03% Epoch: [6/300] [ 0/1251] eta: 0:39:54 lr: 0.000602 loss: 4.939949 (4.939949) time: 1.914152 data: 1.024324 max mem: 18817 Epoch: [6/300] [ 50/1251] eta: 0:19:15 lr: 0.000606 loss: 5.370317 (5.274172) time: 0.917921 data: 0.000184 max mem: 18817 Epoch: [6/300] [ 100/1251] eta: 0:18:38 lr: 0.000610 loss: 5.486599 (5.312858) time: 0.940798 data: 0.000176 max mem: 18817 Epoch: [6/300] [ 150/1251] eta: 0:17:47 lr: 0.000614 loss: 5.406305 (5.331848) time: 0.949909 data: 0.000179 max mem: 18817 Epoch: [6/300] [ 200/1251] eta: 0:17:01 lr: 0.000618 loss: 5.591793 (5.354424) time: 1.005698 data: 0.000189 max mem: 18817 Epoch: [6/300] [ 250/1251] eta: 0:16:09 lr: 0.000622 loss: 5.269733 (5.330006) time: 0.989994 data: 0.000176 max mem: 18817 Epoch: [6/300] [ 300/1251] eta: 0:15:20 lr: 0.000626 loss: 5.454434 (5.329839) time: 0.939521 data: 0.000171 max mem: 18817 Epoch: [6/300] [ 350/1251] eta: 0:14:29 lr: 0.000630 loss: 5.306414 (5.320119) time: 0.919185 data: 0.000164 max mem: 18817 Epoch: [6/300] [ 400/1251] eta: 0:13:40 lr: 0.000634 loss: 5.614416 (5.338712) time: 0.931830 data: 0.000156 max mem: 18817 Epoch: [6/300] [ 450/1251] eta: 0:12:53 lr: 0.000638 loss: 5.317900 (5.327576) time: 0.981473 data: 0.000176 max mem: 18817 Epoch: [6/300] [ 500/1251] eta: 0:12:03 lr: 0.000642 loss: 5.544040 (5.318859) time: 0.992387 data: 0.000186 max mem: 18817 Epoch: [6/300] [ 550/1251] eta: 0:11:15 lr: 0.000646 loss: 4.985876 (5.303834) time: 0.953702 data: 0.000172 max mem: 18817 Epoch: [6/300] [ 600/1251] eta: 0:10:26 lr: 0.000650 loss: 5.434696 (5.311824) time: 0.918812 data: 0.000158 max mem: 18817 Epoch: [6/300] [ 650/1251] eta: 0:09:38 lr: 0.000654 loss: 5.436084 (5.308010) time: 0.961610 data: 0.000161 max mem: 18817 Epoch: [6/300] [ 700/1251] eta: 0:08:50 lr: 0.000658 loss: 5.452706 (5.305288) time: 0.990342 data: 0.000165 max mem: 18817 Epoch: [6/300] [ 750/1251] eta: 0:08:01 lr: 0.000662 loss: 5.535057 (5.299600) time: 0.986307 data: 0.000165 max mem: 18817 Epoch: [6/300] [ 800/1251] eta: 0:07:13 lr: 0.000666 loss: 5.255094 (5.301982) time: 0.908697 data: 0.000206 max mem: 18817 Epoch: [6/300] [ 850/1251] eta: 0:06:25 lr: 0.000670 loss: 5.159045 (5.299456) time: 0.931171 data: 0.000174 max mem: 18817 Epoch: [6/300] [ 900/1251] eta: 0:05:37 lr: 0.000674 loss: 5.216790 (5.301006) time: 0.949620 data: 0.000178 max mem: 18817 Epoch: [6/300] [ 950/1251] eta: 0:04:49 lr: 0.000678 loss: 5.626678 (5.300670) time: 0.987601 data: 0.000172 max mem: 18817 Epoch: [6/300] [1000/1251] eta: 0:04:01 lr: 0.000682 loss: 5.643181 (5.301722) time: 0.984481 data: 0.000180 max mem: 18817 Epoch: [6/300] [1050/1251] eta: 0:03:13 lr: 0.000686 loss: 5.397301 (5.303294) time: 0.961699 data: 0.000187 max mem: 18817 Epoch: [6/300] [1100/1251] eta: 0:02:25 lr: 0.000690 loss: 5.513777 (5.299282) time: 0.934169 data: 0.000183 max mem: 18817 Epoch: [6/300] [1150/1251] eta: 0:01:37 lr: 0.000694 loss: 5.297903 (5.298141) time: 0.940377 data: 0.000176 max mem: 18817 Epoch: [6/300] [1200/1251] eta: 0:00:49 lr: 0.000698 loss: 5.270936 (5.292601) time: 0.994873 data: 0.000179 max mem: 18817 Epoch: [6/300] [1250/1251] eta: 0:00:00 lr: 0.000702 loss: 5.467337 (5.293814) time: 0.969847 data: 0.000753 max mem: 18817 Epoch: [6/300] Total time: 0:20:03 (0.962366 s / it) Averaged stats: lr: 0.000702 loss: 5.467337 (5.297604) Test: [ 0/49] eta: 0:01:21 loss: 2.625289 (2.625289) acc1: 48.437500 (48.437500) acc5: 71.875000 (71.875000) time: 1.663033 data: 1.186304 max mem: 18817 Test: [10/49] eta: 0:00:25 loss: 2.603971 (2.616596) acc1: 43.750000 (43.750000) acc5: 68.750000 (69.034091) time: 0.643487 data: 0.108001 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 2.645786 (2.718698) acc1: 43.750000 (42.633929) acc5: 65.625000 (67.782738) time: 0.452316 data: 0.000152 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 2.716791 (2.698870) acc1: 42.187500 (43.094758) acc5: 65.625000 (67.741935) time: 0.362483 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 2.764888 (2.698316) acc1: 43.750000 (43.368902) acc5: 68.750000 (68.102134) time: 0.360193 data: 0.000133 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 2.769671 (2.703067) acc1: 45.312500 (43.648000) acc5: 68.750000 (68.288000) time: 0.362407 data: 0.000106 max mem: 18817 Test: Total time: 0:00:20 (0.427711 s / it) * Acc@1 43.070 Acc@5 69.152 loss 2.700 Max accuracy: 43.07% Epoch: [7/300] [ 0/1251] eta: 0:39:15 lr: 0.000702 loss: 5.268206 (5.268206) time: 1.882776 data: 0.986011 max mem: 18817 Epoch: [7/300] [ 50/1251] eta: 0:20:01 lr: 0.000706 loss: 5.248019 (5.196568) time: 0.945119 data: 0.000153 max mem: 18817 Epoch: [7/300] [ 100/1251] eta: 0:18:47 lr: 0.000710 loss: 5.229328 (5.219967) time: 0.975038 data: 0.000187 max mem: 18817 Epoch: [7/300] [ 150/1251] eta: 0:17:52 lr: 0.000714 loss: 5.048426 (5.184258) time: 0.989140 data: 0.000174 max mem: 18817 Epoch: [7/300] [ 200/1251] eta: 0:16:55 lr: 0.000718 loss: 4.990496 (5.201026) time: 0.959646 data: 0.000165 max mem: 18817 Epoch: [7/300] [ 250/1251] eta: 0:16:03 lr: 0.000722 loss: 4.943028 (5.200431) time: 0.915866 data: 0.000170 max mem: 18817 Epoch: [7/300] [ 300/1251] eta: 0:15:15 lr: 0.000726 loss: 5.010655 (5.194260) time: 0.924033 data: 0.000169 max mem: 18817 Epoch: [7/300] [ 350/1251] eta: 0:14:28 lr: 0.000730 loss: 5.355244 (5.197586) time: 0.993959 data: 0.000175 max mem: 18817 Epoch: [7/300] [ 400/1251] eta: 0:13:40 lr: 0.000734 loss: 5.084472 (5.184776) time: 0.985663 data: 0.000199 max mem: 18817 Epoch: [7/300] [ 450/1251] eta: 0:12:49 lr: 0.000738 loss: 5.280194 (5.177431) time: 0.963423 data: 0.000158 max mem: 18817 Epoch: [7/300] [ 500/1251] eta: 0:12:01 lr: 0.000742 loss: 5.199473 (5.175353) time: 0.917349 data: 0.000171 max mem: 18817 Epoch: [7/300] [ 550/1251] eta: 0:11:13 lr: 0.000746 loss: 5.211868 (5.176642) time: 0.924862 data: 0.000170 max mem: 18817 Epoch: [7/300] [ 600/1251] eta: 0:10:25 lr: 0.000750 loss: 4.911556 (5.168413) time: 0.977514 data: 0.000176 max mem: 18817 Epoch: [7/300] [ 650/1251] eta: 0:09:38 lr: 0.000754 loss: 5.385544 (5.175845) time: 1.031453 data: 0.000172 max mem: 18817 Epoch: [7/300] [ 700/1251] eta: 0:08:49 lr: 0.000758 loss: 5.118258 (5.180591) time: 0.992274 data: 0.000159 max mem: 18817 Epoch: [7/300] [ 750/1251] eta: 0:08:01 lr: 0.000762 loss: 4.690910 (5.179050) time: 0.925346 data: 0.000173 max mem: 18817 Epoch: [7/300] [ 800/1251] eta: 0:07:13 lr: 0.000766 loss: 5.076880 (5.178664) time: 0.941002 data: 0.000184 max mem: 18817 Epoch: [7/300] [ 850/1251] eta: 0:06:25 lr: 0.000770 loss: 5.249208 (5.173576) time: 0.975969 data: 0.000175 max mem: 18817 Epoch: [7/300] [ 900/1251] eta: 0:05:37 lr: 0.000774 loss: 5.051578 (5.169697) time: 0.981321 data: 0.000179 max mem: 18817 Epoch: [7/300] [ 950/1251] eta: 0:04:49 lr: 0.000778 loss: 5.338950 (5.178231) time: 0.987459 data: 0.000166 max mem: 18817 Epoch: [7/300] [1000/1251] eta: 0:04:01 lr: 0.000782 loss: 4.993852 (5.170956) time: 0.914664 data: 0.000184 max mem: 18817 Epoch: [7/300] [1050/1251] eta: 0:03:13 lr: 0.000786 loss: 5.226583 (5.165598) time: 0.930652 data: 0.000171 max mem: 18817 Epoch: [7/300] [1100/1251] eta: 0:02:25 lr: 0.000790 loss: 4.987442 (5.159192) time: 0.990222 data: 0.000183 max mem: 18817 Epoch: [7/300] [1150/1251] eta: 0:01:37 lr: 0.000794 loss: 5.094070 (5.156077) time: 0.952676 data: 0.000173 max mem: 18817 Epoch: [7/300] [1200/1251] eta: 0:00:49 lr: 0.000798 loss: 4.961161 (5.152743) time: 0.959280 data: 0.000162 max mem: 18817 Epoch: [7/300] [1250/1251] eta: 0:00:00 lr: 0.000802 loss: 5.083284 (5.149031) time: 0.914317 data: 0.000747 max mem: 18817 Epoch: [7/300] Total time: 0:20:02 (0.961213 s / it) Averaged stats: lr: 0.000802 loss: 5.083284 (5.155048) Test: [ 0/49] eta: 0:01:15 loss: 2.516507 (2.516507) acc1: 51.562500 (51.562500) acc5: 73.437500 (73.437500) time: 1.547760 data: 1.108747 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 2.466596 (2.441710) acc1: 48.437500 (47.869318) acc5: 70.312500 (71.875000) time: 0.479533 data: 0.100942 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 2.495944 (2.509767) acc1: 45.312500 (47.693452) acc5: 68.750000 (70.833333) time: 0.367035 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 2.576206 (2.500997) acc1: 45.312500 (47.278226) acc5: 68.750000 (71.068548) time: 0.362117 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 2.503909 (2.478546) acc1: 46.875000 (47.751524) acc5: 70.312500 (71.417683) time: 0.360123 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 2.541263 (2.495436) acc1: 48.437500 (47.488000) acc5: 70.312500 (71.296000) time: 0.455385 data: 0.000099 max mem: 18817 Test: Total time: 0:00:21 (0.429461 s / it) * Acc@1 47.234 Acc@5 73.038 loss 2.471 Max accuracy: 47.23% Epoch: [8/300] [ 0/1251] eta: 0:40:16 lr: 0.000802 loss: 5.409663 (5.409663) time: 1.931645 data: 1.027194 max mem: 18817 Epoch: [8/300] [ 50/1251] eta: 0:19:33 lr: 0.000806 loss: 5.425980 (5.288619) time: 0.972628 data: 0.000182 max mem: 18817 Epoch: [8/300] [ 100/1251] eta: 0:18:32 lr: 0.000810 loss: 5.202898 (5.215370) time: 0.992876 data: 0.000173 max mem: 18817 Epoch: [8/300] [ 150/1251] eta: 0:17:44 lr: 0.000814 loss: 4.801965 (5.176030) time: 0.974068 data: 0.000178 max mem: 18817 Epoch: [8/300] [ 200/1251] eta: 0:16:49 lr: 0.000818 loss: 5.256915 (5.171597) time: 0.924188 data: 0.000162 max mem: 18817 Epoch: [8/300] [ 250/1251] eta: 0:16:02 lr: 0.000822 loss: 5.438452 (5.158037) time: 0.927395 data: 0.000185 max mem: 18817 Epoch: [8/300] [ 300/1251] eta: 0:15:15 lr: 0.000826 loss: 5.201970 (5.150848) time: 0.977025 data: 0.000173 max mem: 18817 Epoch: [8/300] [ 350/1251] eta: 0:14:24 lr: 0.000830 loss: 5.278131 (5.135746) time: 0.978214 data: 0.000172 max mem: 18817 Epoch: [8/300] [ 400/1251] eta: 0:13:37 lr: 0.000834 loss: 4.816087 (5.113961) time: 0.982685 data: 0.000186 max mem: 18817 Epoch: [8/300] [ 450/1251] eta: 0:12:47 lr: 0.000838 loss: 5.324336 (5.103843) time: 0.937232 data: 0.000171 max mem: 18817 Epoch: [8/300] [ 500/1251] eta: 0:12:00 lr: 0.000842 loss: 5.231999 (5.106601) time: 0.924088 data: 0.000187 max mem: 18817 Epoch: [8/300] [ 550/1251] eta: 0:11:12 lr: 0.000846 loss: 5.239823 (5.099703) time: 0.973867 data: 0.000185 max mem: 18817 Epoch: [8/300] [ 600/1251] eta: 0:10:23 lr: 0.000850 loss: 4.774966 (5.084593) time: 0.979193 data: 0.000169 max mem: 18817 Epoch: [8/300] [ 650/1251] eta: 0:09:35 lr: 0.000854 loss: 4.923618 (5.078004) time: 0.954178 data: 0.000167 max mem: 18817 Epoch: [8/300] [ 700/1251] eta: 0:08:47 lr: 0.000858 loss: 5.301833 (5.080929) time: 0.924641 data: 0.000188 max mem: 18817 Epoch: [8/300] [ 750/1251] eta: 0:08:00 lr: 0.000862 loss: 5.108052 (5.077859) time: 0.933757 data: 0.000188 max mem: 18817 Epoch: [8/300] [ 800/1251] eta: 0:07:12 lr: 0.000866 loss: 4.803946 (5.069476) time: 0.965329 data: 0.000179 max mem: 18817 Epoch: [8/300] [ 850/1251] eta: 0:06:23 lr: 0.000870 loss: 5.307666 (5.068727) time: 0.977258 data: 0.000182 max mem: 18817 Epoch: [8/300] [ 900/1251] eta: 0:05:36 lr: 0.000874 loss: 5.101882 (5.069712) time: 0.979160 data: 0.000171 max mem: 18817 Epoch: [8/300] [ 950/1251] eta: 0:04:48 lr: 0.000878 loss: 4.975578 (5.061864) time: 0.926639 data: 0.000168 max mem: 18817 Epoch: [8/300] [1000/1251] eta: 0:04:00 lr: 0.000882 loss: 5.214561 (5.063235) time: 0.954206 data: 0.000171 max mem: 18817 Epoch: [8/300] [1050/1251] eta: 0:03:12 lr: 0.000886 loss: 5.083644 (5.063405) time: 0.982297 data: 0.000179 max mem: 18817 Epoch: [8/300] [1100/1251] eta: 0:02:24 lr: 0.000890 loss: 5.089398 (5.060581) time: 0.990355 data: 0.000191 max mem: 18817 Epoch: [8/300] [1150/1251] eta: 0:01:36 lr: 0.000894 loss: 5.129432 (5.060278) time: 0.969307 data: 0.000182 max mem: 18817 Epoch: [8/300] [1200/1251] eta: 0:00:48 lr: 0.000898 loss: 5.107373 (5.054207) time: 0.945205 data: 0.000175 max mem: 18817 Epoch: [8/300] [1250/1251] eta: 0:00:00 lr: 0.000902 loss: 5.193985 (5.051185) time: 0.916205 data: 0.000749 max mem: 18817 Epoch: [8/300] Total time: 0:19:59 (0.959173 s / it) Averaged stats: lr: 0.000902 loss: 5.193985 (5.046524) Test: [ 0/49] eta: 0:01:24 loss: 2.424008 (2.424008) acc1: 45.312500 (45.312500) acc5: 71.875000 (71.875000) time: 1.723084 data: 1.297222 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 2.302963 (2.251557) acc1: 51.562500 (50.710227) acc5: 76.562500 (74.857955) time: 0.505926 data: 0.118085 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 2.303952 (2.324750) acc1: 50.000000 (50.074405) acc5: 76.562500 (74.479167) time: 0.373598 data: 0.000149 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 2.356843 (2.322189) acc1: 48.437500 (50.050403) acc5: 73.437500 (74.344758) time: 0.362840 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 2.361681 (2.334836) acc1: 48.437500 (49.923780) acc5: 75.000000 (74.961890) time: 0.368135 data: 0.000175 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 2.416822 (2.346859) acc1: 46.875000 (49.664000) acc5: 73.437500 (74.656000) time: 0.363033 data: 0.000150 max mem: 18817 Test: Total time: 0:00:19 (0.397317 s / it) * Acc@1 49.848 Acc@5 75.342 loss 2.316 Max accuracy: 49.85% Epoch: [9/300] [ 0/1251] eta: 0:41:02 lr: 0.000902 loss: 4.591849 (4.591849) time: 1.968711 data: 1.077978 max mem: 18817 Epoch: [9/300] [ 50/1251] eta: 0:19:08 lr: 0.000906 loss: 5.166427 (4.974887) time: 0.966771 data: 0.000173 max mem: 18817 Epoch: [9/300] [ 100/1251] eta: 0:18:15 lr: 0.000910 loss: 5.228731 (5.022746) time: 0.925366 data: 0.000177 max mem: 18817 Epoch: [9/300] [ 150/1251] eta: 0:17:37 lr: 0.000914 loss: 4.811389 (4.984284) time: 0.933403 data: 0.000175 max mem: 18817 Epoch: [9/300] [ 200/1251] eta: 0:16:55 lr: 0.000918 loss: 5.116689 (5.002206) time: 1.010812 data: 0.000161 max mem: 18817 Epoch: [9/300] [ 250/1251] eta: 0:16:08 lr: 0.000922 loss: 5.220985 (5.032273) time: 0.970002 data: 0.000166 max mem: 18817 Epoch: [9/300] [ 300/1251] eta: 0:15:15 lr: 0.000926 loss: 4.868523 (5.008470) time: 0.945116 data: 0.000168 max mem: 18817 Epoch: [9/300] [ 350/1251] eta: 0:14:24 lr: 0.000930 loss: 5.054220 (4.986164) time: 0.908696 data: 0.000176 max mem: 18817 Epoch: [9/300] [ 400/1251] eta: 0:13:38 lr: 0.000934 loss: 5.015669 (4.967516) time: 0.937767 data: 0.000183 max mem: 18817 Epoch: [9/300] [ 450/1251] eta: 0:12:50 lr: 0.000938 loss: 5.134059 (4.972978) time: 0.992809 data: 0.000179 max mem: 18817 Epoch: [9/300] [ 500/1251] eta: 0:12:03 lr: 0.000942 loss: 4.848994 (4.954669) time: 0.976137 data: 0.000168 max mem: 18817 Epoch: [9/300] [ 550/1251] eta: 0:11:14 lr: 0.000946 loss: 5.027534 (4.955012) time: 0.983276 data: 0.000164 max mem: 18817 Epoch: [9/300] [ 600/1251] eta: 0:10:25 lr: 0.000950 loss: 4.924536 (4.949332) time: 0.930003 data: 0.000173 max mem: 18817 Epoch: [9/300] [ 650/1251] eta: 0:09:37 lr: 0.000954 loss: 5.351028 (4.951843) time: 0.923403 data: 0.000176 max mem: 18817 Epoch: [9/300] [ 700/1251] eta: 0:08:49 lr: 0.000958 loss: 5.006649 (4.951806) time: 0.987514 data: 0.000183 max mem: 18817 Epoch: [9/300] [ 750/1251] eta: 0:08:01 lr: 0.000962 loss: 5.225226 (4.950051) time: 0.979869 data: 0.000188 max mem: 18817 Epoch: [9/300] [ 800/1251] eta: 0:07:13 lr: 0.000966 loss: 5.055652 (4.946989) time: 0.998394 data: 0.000173 max mem: 18817 Epoch: [9/300] [ 850/1251] eta: 0:06:25 lr: 0.000970 loss: 4.720319 (4.942376) time: 0.932267 data: 0.000170 max mem: 18817 Epoch: [9/300] [ 900/1251] eta: 0:05:37 lr: 0.000974 loss: 5.048200 (4.933764) time: 0.926450 data: 0.000183 max mem: 18817 Epoch: [9/300] [ 950/1251] eta: 0:04:49 lr: 0.000978 loss: 4.493628 (4.923200) time: 1.019990 data: 0.000173 max mem: 18817 Epoch: [9/300] [1000/1251] eta: 0:04:01 lr: 0.000982 loss: 4.814469 (4.923481) time: 0.978507 data: 0.000176 max mem: 18817 Epoch: [9/300] [1050/1251] eta: 0:03:13 lr: 0.000986 loss: 5.051342 (4.922019) time: 0.975513 data: 0.000183 max mem: 18817 Epoch: [9/300] [1100/1251] eta: 0:02:25 lr: 0.000990 loss: 4.826861 (4.919480) time: 0.908142 data: 0.000175 max mem: 18817 Epoch: [9/300] [1150/1251] eta: 0:01:37 lr: 0.000994 loss: 4.894779 (4.912915) time: 0.934503 data: 0.000161 max mem: 18817 Epoch: [9/300] [1200/1251] eta: 0:00:49 lr: 0.000998 loss: 4.853276 (4.904533) time: 0.980960 data: 0.000177 max mem: 18817 Epoch: [9/300] [1250/1251] eta: 0:00:00 lr: 0.001002 loss: 4.954458 (4.902789) time: 0.975871 data: 0.000748 max mem: 18817 Epoch: [9/300] Total time: 0:20:04 (0.963130 s / it) Averaged stats: lr: 0.001002 loss: 4.954458 (4.911384) Test: [ 0/49] eta: 0:01:27 loss: 2.076070 (2.076070) acc1: 53.125000 (53.125000) acc5: 81.250000 (81.250000) time: 1.786212 data: 1.384589 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 2.071701 (2.062067) acc1: 54.687500 (55.397727) acc5: 79.687500 (78.551136) time: 0.496026 data: 0.126018 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 2.167477 (2.175793) acc1: 53.125000 (53.497024) acc5: 76.562500 (76.934524) time: 0.364334 data: 0.000148 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 2.231596 (2.175789) acc1: 50.000000 (52.872984) acc5: 75.000000 (76.764113) time: 0.404854 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 2.176702 (2.161850) acc1: 51.562500 (53.048780) acc5: 78.125000 (77.248476) time: 0.423399 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 2.176702 (2.170719) acc1: 51.562500 (53.088000) acc5: 76.562500 (77.152000) time: 0.384199 data: 0.000101 max mem: 18817 Test: Total time: 0:00:20 (0.420355 s / it) * Acc@1 52.592 Acc@5 77.538 loss 2.147 Max accuracy: 52.59% Epoch: [10/300] [ 0/1251] eta: 0:42:17 lr: 0.001002 loss: 5.521834 (5.521834) time: 2.028153 data: 1.134875 max mem: 18817 Epoch: [10/300] [ 50/1251] eta: 0:19:13 lr: 0.001006 loss: 5.019877 (4.927017) time: 0.931620 data: 0.000177 max mem: 18817 Epoch: [10/300] [ 100/1251] eta: 0:18:32 lr: 0.001010 loss: 4.872917 (4.921436) time: 0.936663 data: 0.000175 max mem: 18817 Epoch: [10/300] [ 150/1251] eta: 0:17:42 lr: 0.001014 loss: 5.103694 (4.902044) time: 0.986021 data: 0.000166 max mem: 18817 Epoch: [10/300] [ 200/1251] eta: 0:16:55 lr: 0.001018 loss: 4.963879 (4.915766) time: 0.997654 data: 0.000164 max mem: 18817 Epoch: [10/300] [ 250/1251] eta: 0:16:04 lr: 0.001022 loss: 4.444208 (4.902338) time: 0.976882 data: 0.000182 max mem: 18817 Epoch: [10/300] [ 300/1251] eta: 0:15:12 lr: 0.001026 loss: 4.642780 (4.887103) time: 0.923597 data: 0.000172 max mem: 18817 Epoch: [10/300] [ 350/1251] eta: 0:14:26 lr: 0.001030 loss: 4.869114 (4.886510) time: 0.939310 data: 0.000182 max mem: 18817 Epoch: [10/300] [ 400/1251] eta: 0:13:38 lr: 0.001034 loss: 5.297383 (4.897080) time: 0.968078 data: 0.000164 max mem: 18817 Epoch: [10/300] [ 450/1251] eta: 0:12:51 lr: 0.001038 loss: 5.004879 (4.897801) time: 1.034302 data: 0.000195 max mem: 18817 Epoch: [10/300] [ 500/1251] eta: 0:12:02 lr: 0.001042 loss: 4.893233 (4.881852) time: 0.979720 data: 0.000168 max mem: 18817 Epoch: [10/300] [ 550/1251] eta: 0:11:13 lr: 0.001046 loss: 5.005615 (4.888916) time: 0.920241 data: 0.000183 max mem: 18817 Epoch: [10/300] [ 600/1251] eta: 0:10:25 lr: 0.001050 loss: 4.990455 (4.885277) time: 0.935948 data: 0.000177 max mem: 18817 Epoch: [10/300] [ 650/1251] eta: 0:09:38 lr: 0.001054 loss: 5.139645 (4.887980) time: 0.997198 data: 0.000171 max mem: 18817 Epoch: [10/300] [ 700/1251] eta: 0:08:50 lr: 0.001058 loss: 5.068894 (4.886018) time: 0.987839 data: 0.000163 max mem: 18817 Epoch: [10/300] [ 750/1251] eta: 0:08:01 lr: 0.001062 loss: 4.552231 (4.878351) time: 0.969943 data: 0.000165 max mem: 18817 Epoch: [10/300] [ 800/1251] eta: 0:07:13 lr: 0.001066 loss: 4.568921 (4.867626) time: 0.921264 data: 0.000170 max mem: 18817 Epoch: [10/300] [ 850/1251] eta: 0:06:25 lr: 0.001070 loss: 5.024161 (4.863346) time: 0.950132 data: 0.000177 max mem: 18817 Epoch: [10/300] [ 900/1251] eta: 0:05:37 lr: 0.001074 loss: 4.112504 (4.857765) time: 0.974656 data: 0.000170 max mem: 18817 Epoch: [10/300] [ 950/1251] eta: 0:04:49 lr: 0.001078 loss: 4.859551 (4.862328) time: 0.981862 data: 0.000185 max mem: 18817 Epoch: [10/300] [1000/1251] eta: 0:04:01 lr: 0.001082 loss: 4.752161 (4.861337) time: 0.984135 data: 0.000161 max mem: 18817 Epoch: [10/300] [1050/1251] eta: 0:03:13 lr: 0.001086 loss: 4.904796 (4.850453) time: 0.926178 data: 0.000180 max mem: 18817 Epoch: [10/300] [1100/1251] eta: 0:02:25 lr: 0.001090 loss: 4.651231 (4.841487) time: 0.925796 data: 0.000176 max mem: 18817 Epoch: [10/300] [1150/1251] eta: 0:01:37 lr: 0.001094 loss: 4.399361 (4.830452) time: 0.976066 data: 0.000169 max mem: 18817 Epoch: [10/300] [1200/1251] eta: 0:00:49 lr: 0.001098 loss: 4.952246 (4.832047) time: 0.991409 data: 0.000183 max mem: 18817 Epoch: [10/300] [1250/1251] eta: 0:00:00 lr: 0.001102 loss: 5.044218 (4.832967) time: 0.976449 data: 0.000743 max mem: 18817 Epoch: [10/300] Total time: 0:20:02 (0.961552 s / it) Averaged stats: lr: 0.001102 loss: 5.044218 (4.833570) Test: [ 0/49] eta: 0:01:17 loss: 2.108027 (2.108027) acc1: 54.687500 (54.687500) acc5: 78.125000 (78.125000) time: 1.573989 data: 1.135913 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 2.065920 (2.037916) acc1: 56.250000 (57.386364) acc5: 79.687500 (79.403409) time: 0.482478 data: 0.103424 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 2.065920 (2.089648) acc1: 56.250000 (55.877976) acc5: 76.562500 (78.497024) time: 0.465244 data: 0.000164 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 2.068014 (2.087993) acc1: 53.125000 (55.090726) acc5: 78.125000 (78.729839) time: 0.460083 data: 0.000146 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 2.096431 (2.078803) acc1: 54.687500 (55.297256) acc5: 79.687500 (79.115854) time: 0.361240 data: 0.000139 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 2.096431 (2.078236) acc1: 54.687500 (55.488000) acc5: 79.687500 (79.136000) time: 0.356310 data: 0.000113 max mem: 18817 Test: Total time: 0:00:21 (0.429689 s / it) * Acc@1 55.510 Acc@5 80.072 loss 2.059 Max accuracy: 55.51% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0010.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0010.pth Epoch: [11/300] [ 0/1251] eta: 0:43:27 lr: 0.001102 loss: 5.554416 (5.554416) time: 2.084256 data: 1.087630 max mem: 18817 Epoch: [11/300] [ 50/1251] eta: 0:19:43 lr: 0.001106 loss: 4.546911 (4.772864) time: 1.006859 data: 0.000176 max mem: 18817 Epoch: [11/300] [ 100/1251] eta: 0:18:34 lr: 0.001110 loss: 4.830966 (4.781912) time: 0.974164 data: 0.000150 max mem: 18817 Epoch: [11/300] [ 150/1251] eta: 0:17:39 lr: 0.001114 loss: 4.930822 (4.778720) time: 0.930868 data: 0.000166 max mem: 18817 Epoch: [11/300] [ 200/1251] eta: 0:16:54 lr: 0.001118 loss: 4.531874 (4.735799) time: 0.939676 data: 0.000168 max mem: 18817 Epoch: [11/300] [ 250/1251] eta: 0:16:06 lr: 0.001122 loss: 5.112228 (4.769983) time: 0.994395 data: 0.000169 max mem: 18817 Epoch: [11/300] [ 300/1251] eta: 0:15:19 lr: 0.001126 loss: 4.712565 (4.763571) time: 0.997101 data: 0.000164 max mem: 18817 Epoch: [11/300] [ 350/1251] eta: 0:14:29 lr: 0.001130 loss: 4.416642 (4.774820) time: 0.986099 data: 0.000181 max mem: 18817 Epoch: [11/300] [ 400/1251] eta: 0:13:38 lr: 0.001134 loss: 4.936211 (4.780030) time: 0.921152 data: 0.000185 max mem: 18817 Epoch: [11/300] [ 450/1251] eta: 0:12:51 lr: 0.001138 loss: 4.609922 (4.772954) time: 0.920066 data: 0.000171 max mem: 18817 Epoch: [11/300] [ 500/1251] eta: 0:12:03 lr: 0.001142 loss: 4.762304 (4.766628) time: 0.964843 data: 0.000181 max mem: 18817 Epoch: [11/300] [ 550/1251] eta: 0:11:14 lr: 0.001146 loss: 4.532199 (4.756960) time: 0.954653 data: 0.000181 max mem: 18817 Epoch: [11/300] [ 600/1251] eta: 0:10:26 lr: 0.001150 loss: 4.686060 (4.758443) time: 0.980395 data: 0.000171 max mem: 18817 Epoch: [11/300] [ 650/1251] eta: 0:09:37 lr: 0.001154 loss: 5.052931 (4.771032) time: 0.908592 data: 0.000157 max mem: 18817 Epoch: [11/300] [ 700/1251] eta: 0:08:49 lr: 0.001158 loss: 4.960431 (4.775636) time: 0.927720 data: 0.000160 max mem: 18817 Epoch: [11/300] [ 750/1251] eta: 0:08:01 lr: 0.001162 loss: 4.815722 (4.778448) time: 0.980835 data: 0.000174 max mem: 18817 Epoch: [11/300] [ 800/1251] eta: 0:07:14 lr: 0.001166 loss: 4.938022 (4.783770) time: 0.984311 data: 0.000170 max mem: 18817 Epoch: [11/300] [ 850/1251] eta: 0:06:25 lr: 0.001170 loss: 5.092377 (4.788750) time: 0.962588 data: 0.000173 max mem: 18817 Epoch: [11/300] [ 900/1251] eta: 0:05:37 lr: 0.001174 loss: 4.746301 (4.786369) time: 0.917935 data: 0.000186 max mem: 18817 Epoch: [11/300] [ 950/1251] eta: 0:04:49 lr: 0.001178 loss: 4.488658 (4.773455) time: 0.918941 data: 0.000171 max mem: 18817 Epoch: [11/300] [1000/1251] eta: 0:04:01 lr: 0.001182 loss: 4.718465 (4.768987) time: 0.985721 data: 0.000174 max mem: 18817 Epoch: [11/300] [1050/1251] eta: 0:03:13 lr: 0.001186 loss: 4.721005 (4.767173) time: 0.967398 data: 0.000170 max mem: 18817 Epoch: [11/300] [1100/1251] eta: 0:02:25 lr: 0.001190 loss: 4.827584 (4.764773) time: 0.979779 data: 0.000172 max mem: 18817 Epoch: [11/300] [1150/1251] eta: 0:01:36 lr: 0.001194 loss: 5.059476 (4.766163) time: 0.910574 data: 0.000179 max mem: 18817 Epoch: [11/300] [1200/1251] eta: 0:00:48 lr: 0.001198 loss: 4.699868 (4.765849) time: 0.927866 data: 0.000184 max mem: 18817 Epoch: [11/300] [1250/1251] eta: 0:00:00 lr: 0.001202 loss: 5.033626 (4.762747) time: 0.964383 data: 0.000738 max mem: 18817 Epoch: [11/300] Total time: 0:20:02 (0.960916 s / it) Averaged stats: lr: 0.001202 loss: 5.033626 (4.765946) Test: [ 0/49] eta: 0:01:16 loss: 1.885322 (1.885322) acc1: 60.937500 (60.937500) acc5: 84.375000 (84.375000) time: 1.571033 data: 1.112617 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 1.885322 (1.888114) acc1: 57.812500 (58.948864) acc5: 84.375000 (82.528409) time: 0.517932 data: 0.101304 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 1.916177 (1.956175) acc1: 56.250000 (57.440476) acc5: 81.250000 (80.654762) time: 0.406014 data: 0.000148 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 2.015473 (1.954317) acc1: 54.687500 (57.106855) acc5: 79.687500 (80.897177) time: 0.382240 data: 0.000142 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.985191 (1.946869) acc1: 57.812500 (57.393293) acc5: 81.250000 (81.250000) time: 0.366988 data: 0.000159 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 2.017222 (1.952729) acc1: 57.812500 (57.536000) acc5: 79.687500 (81.152000) time: 0.371484 data: 0.000125 max mem: 18817 Test: Total time: 0:00:20 (0.412718 s / it) * Acc@1 56.912 Acc@5 81.124 loss 1.955 Max accuracy: 56.91% Epoch: [12/300] [ 0/1251] eta: 0:41:53 lr: 0.001202 loss: 5.629884 (5.629884) time: 2.009432 data: 1.086604 max mem: 18817 Epoch: [12/300] [ 50/1251] eta: 0:19:29 lr: 0.001206 loss: 4.596365 (4.584961) time: 0.974076 data: 0.000174 max mem: 18817 Epoch: [12/300] [ 100/1251] eta: 0:18:26 lr: 0.001210 loss: 4.866535 (4.635526) time: 0.911816 data: 0.000166 max mem: 18817 Epoch: [12/300] [ 150/1251] eta: 0:17:36 lr: 0.001214 loss: 4.877631 (4.677682) time: 0.920221 data: 0.000182 max mem: 18817 Epoch: [12/300] [ 200/1251] eta: 0:16:48 lr: 0.001218 loss: 4.878820 (4.693430) time: 0.975369 data: 0.000178 max mem: 18817 Epoch: [12/300] [ 250/1251] eta: 0:16:04 lr: 0.001222 loss: 4.679358 (4.694092) time: 1.014801 data: 0.000182 max mem: 18817 Epoch: [12/300] [ 300/1251] eta: 0:15:14 lr: 0.001226 loss: 4.742786 (4.710042) time: 0.976518 data: 0.000197 max mem: 18817 Epoch: [12/300] [ 350/1251] eta: 0:14:23 lr: 0.001230 loss: 4.834941 (4.690241) time: 0.916373 data: 0.000179 max mem: 18817 Epoch: [12/300] [ 400/1251] eta: 0:13:36 lr: 0.001234 loss: 4.869154 (4.687926) time: 0.930204 data: 0.000171 max mem: 18817 Epoch: [12/300] [ 450/1251] eta: 0:12:49 lr: 0.001238 loss: 4.974645 (4.683453) time: 0.981355 data: 0.000175 max mem: 18817 Epoch: [12/300] [ 500/1251] eta: 0:12:01 lr: 0.001242 loss: 4.835422 (4.679762) time: 0.982011 data: 0.000175 max mem: 18817 Epoch: [12/300] [ 550/1251] eta: 0:11:12 lr: 0.001246 loss: 4.843342 (4.684725) time: 0.972285 data: 0.000176 max mem: 18817 Epoch: [12/300] [ 600/1251] eta: 0:10:24 lr: 0.001250 loss: 4.758451 (4.683036) time: 0.924068 data: 0.000170 max mem: 18817 Epoch: [12/300] [ 650/1251] eta: 0:09:37 lr: 0.001254 loss: 4.853833 (4.686302) time: 0.925888 data: 0.000171 max mem: 18817 Epoch: [12/300] [ 700/1251] eta: 0:08:49 lr: 0.001258 loss: 4.857919 (4.688792) time: 0.980453 data: 0.000169 max mem: 18817 Epoch: [12/300] [ 750/1251] eta: 0:08:02 lr: 0.001262 loss: 4.936497 (4.688722) time: 0.983914 data: 0.000181 max mem: 18817 Epoch: [12/300] [ 800/1251] eta: 0:07:13 lr: 0.001266 loss: 4.729685 (4.685295) time: 0.966113 data: 0.000170 max mem: 18817 Epoch: [12/300] [ 850/1251] eta: 0:06:25 lr: 0.001270 loss: 4.840721 (4.692593) time: 0.915749 data: 0.000164 max mem: 18817 Epoch: [12/300] [ 900/1251] eta: 0:05:37 lr: 0.001274 loss: 4.600990 (4.696249) time: 0.919785 data: 0.000167 max mem: 18817 Epoch: [12/300] [ 950/1251] eta: 0:04:49 lr: 0.001278 loss: 4.683802 (4.684941) time: 0.990671 data: 0.000177 max mem: 18817 Epoch: [12/300] [1000/1251] eta: 0:04:01 lr: 0.001282 loss: 5.015681 (4.682422) time: 1.033526 data: 0.000168 max mem: 18817 Epoch: [12/300] [1050/1251] eta: 0:03:13 lr: 0.001286 loss: 4.647225 (4.679628) time: 0.982693 data: 0.000193 max mem: 18817 Epoch: [12/300] [1100/1251] eta: 0:02:24 lr: 0.001290 loss: 4.632113 (4.679097) time: 0.913996 data: 0.000179 max mem: 18817 Epoch: [12/300] [1150/1251] eta: 0:01:36 lr: 0.001294 loss: 4.706456 (4.682230) time: 0.927783 data: 0.000173 max mem: 18817 Epoch: [12/300] [1200/1251] eta: 0:00:49 lr: 0.001298 loss: 5.044185 (4.682729) time: 0.995117 data: 0.000176 max mem: 18817 Epoch: [12/300] [1250/1251] eta: 0:00:00 lr: 0.001301 loss: 4.363373 (4.679986) time: 0.984902 data: 0.000755 max mem: 18817 Epoch: [12/300] Total time: 0:20:03 (0.961978 s / it) Averaged stats: lr: 0.001301 loss: 4.363373 (4.678754) Test: [ 0/49] eta: 0:01:26 loss: 1.775773 (1.775773) acc1: 65.625000 (65.625000) acc5: 85.937500 (85.937500) time: 1.767000 data: 1.353913 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.772344 (1.830883) acc1: 57.812500 (59.801136) acc5: 85.937500 (82.670455) time: 0.492373 data: 0.123217 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.803781 (1.893606) acc1: 57.812500 (57.961310) acc5: 79.687500 (81.250000) time: 0.363609 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.919159 (1.883962) acc1: 56.250000 (57.862903) acc5: 79.687500 (81.451613) time: 0.380314 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.919159 (1.877896) acc1: 57.812500 (58.346037) acc5: 82.812500 (81.669207) time: 0.401405 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.932426 (1.886810) acc1: 57.812500 (58.144000) acc5: 79.687500 (81.696000) time: 0.395068 data: 0.000101 max mem: 18817 Test: Total time: 0:00:20 (0.413523 s / it) * Acc@1 58.498 Acc@5 82.420 loss 1.869 Max accuracy: 58.50% Epoch: [13/300] [ 0/1251] eta: 0:42:06 lr: 0.001301 loss: 3.343889 (3.343889) time: 2.019929 data: 1.126507 max mem: 18817 Epoch: [13/300] [ 50/1251] eta: 0:19:19 lr: 0.001306 loss: 4.481347 (4.542559) time: 0.926271 data: 0.000179 max mem: 18817 Epoch: [13/300] [ 100/1251] eta: 0:18:35 lr: 0.001310 loss: 4.217663 (4.553570) time: 0.936463 data: 0.000180 max mem: 18817 Epoch: [13/300] [ 150/1251] eta: 0:17:44 lr: 0.001314 loss: 4.604748 (4.567977) time: 0.959271 data: 0.000193 max mem: 18817 Epoch: [13/300] [ 200/1251] eta: 0:16:59 lr: 0.001318 loss: 4.492499 (4.585265) time: 1.037674 data: 0.000170 max mem: 18817 Epoch: [13/300] [ 250/1251] eta: 0:16:08 lr: 0.001322 loss: 4.967039 (4.612784) time: 0.983908 data: 0.000160 max mem: 18817 Epoch: [13/300] [ 300/1251] eta: 0:15:16 lr: 0.001326 loss: 4.413824 (4.584174) time: 0.924658 data: 0.000160 max mem: 18817 Epoch: [13/300] [ 350/1251] eta: 0:14:29 lr: 0.001330 loss: 4.411228 (4.586040) time: 0.931854 data: 0.000168 max mem: 18817 Epoch: [13/300] [ 400/1251] eta: 0:13:41 lr: 0.001334 loss: 4.930503 (4.594108) time: 0.984259 data: 0.000185 max mem: 18817 Epoch: [13/300] [ 450/1251] eta: 0:12:54 lr: 0.001338 loss: 4.893378 (4.603708) time: 1.049314 data: 0.000177 max mem: 18817 Epoch: [13/300] [ 500/1251] eta: 0:12:04 lr: 0.001342 loss: 4.762980 (4.607697) time: 0.961365 data: 0.000180 max mem: 18817 Epoch: [13/300] [ 550/1251] eta: 0:11:15 lr: 0.001346 loss: 4.553517 (4.600743) time: 0.936384 data: 0.000188 max mem: 18817 Epoch: [13/300] [ 600/1251] eta: 0:10:27 lr: 0.001350 loss: 4.536491 (4.602621) time: 0.928737 data: 0.000178 max mem: 18817 Epoch: [13/300] [ 650/1251] eta: 0:09:39 lr: 0.001354 loss: 4.392879 (4.599028) time: 0.988723 data: 0.000176 max mem: 18817 Epoch: [13/300] [ 700/1251] eta: 0:08:51 lr: 0.001358 loss: 4.892548 (4.604938) time: 1.022919 data: 0.000176 max mem: 18817 Epoch: [13/300] [ 750/1251] eta: 0:08:02 lr: 0.001362 loss: 4.716233 (4.603118) time: 0.978597 data: 0.000180 max mem: 18817 Epoch: [13/300] [ 800/1251] eta: 0:07:14 lr: 0.001366 loss: 4.632205 (4.595037) time: 0.922586 data: 0.000177 max mem: 18817 Epoch: [13/300] [ 850/1251] eta: 0:06:26 lr: 0.001370 loss: 4.607289 (4.596333) time: 0.929946 data: 0.000164 max mem: 18817 Epoch: [13/300] [ 900/1251] eta: 0:05:38 lr: 0.001374 loss: 4.932175 (4.600140) time: 0.989468 data: 0.000178 max mem: 18817 Epoch: [13/300] [ 950/1251] eta: 0:04:50 lr: 0.001378 loss: 4.694573 (4.596531) time: 1.029727 data: 0.000165 max mem: 18817 Epoch: [13/300] [1000/1251] eta: 0:04:01 lr: 0.001382 loss: 4.758835 (4.597894) time: 0.967307 data: 0.000164 max mem: 18817 Epoch: [13/300] [1050/1251] eta: 0:03:13 lr: 0.001386 loss: 4.799298 (4.601782) time: 0.919018 data: 0.000162 max mem: 18817 Epoch: [13/300] [1100/1251] eta: 0:02:25 lr: 0.001390 loss: 4.507124 (4.600281) time: 0.916347 data: 0.000248 max mem: 18817 Epoch: [13/300] [1150/1251] eta: 0:01:37 lr: 0.001394 loss: 4.686891 (4.598377) time: 0.972987 data: 0.000182 max mem: 18817 Epoch: [13/300] [1200/1251] eta: 0:00:49 lr: 0.001398 loss: 4.395834 (4.595236) time: 1.043273 data: 0.000174 max mem: 18817 Epoch: [13/300] [1250/1251] eta: 0:00:00 lr: 0.001402 loss: 4.583477 (4.591806) time: 0.969943 data: 0.000756 max mem: 18817 Epoch: [13/300] Total time: 0:20:03 (0.961833 s / it) Averaged stats: lr: 0.001402 loss: 4.583477 (4.597315) Test: [ 0/49] eta: 0:01:27 loss: 1.739451 (1.739451) acc1: 59.375000 (59.375000) acc5: 85.937500 (85.937500) time: 1.788290 data: 1.386078 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.739451 (1.787673) acc1: 62.500000 (61.363636) acc5: 85.937500 (83.806818) time: 0.498398 data: 0.126146 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.743672 (1.854444) acc1: 60.937500 (59.672619) acc5: 82.812500 (82.440476) time: 0.373325 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 1.895612 (1.851460) acc1: 54.687500 (59.526210) acc5: 82.812500 (83.014113) time: 0.467784 data: 0.000125 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 1.885146 (1.852323) acc1: 59.375000 (59.908537) acc5: 84.375000 (83.231707) time: 0.458044 data: 0.000118 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.908917 (1.863383) acc1: 60.937500 (59.712000) acc5: 82.812500 (83.072000) time: 0.355270 data: 0.000101 max mem: 18817 Test: Total time: 0:00:21 (0.435980 s / it) * Acc@1 60.446 Acc@5 83.608 loss 1.848 Max accuracy: 60.45% Epoch: [14/300] [ 0/1251] eta: 0:46:12 lr: 0.001402 loss: 5.007876 (5.007876) time: 2.216078 data: 1.319212 max mem: 18817 Epoch: [14/300] [ 50/1251] eta: 0:19:44 lr: 0.001406 loss: 4.354576 (4.563111) time: 0.932920 data: 0.000246 max mem: 18817 Epoch: [14/300] [ 100/1251] eta: 0:18:45 lr: 0.001410 loss: 4.699367 (4.613493) time: 0.999511 data: 0.000211 max mem: 18817 Epoch: [14/300] [ 150/1251] eta: 0:17:38 lr: 0.001414 loss: 4.582821 (4.575434) time: 0.954255 data: 0.000207 max mem: 18817 Epoch: [14/300] [ 200/1251] eta: 0:16:46 lr: 0.001418 loss: 4.818513 (4.546661) time: 0.920135 data: 0.000203 max mem: 18817 Epoch: [14/300] [ 250/1251] eta: 0:16:03 lr: 0.001422 loss: 4.804913 (4.553289) time: 0.942876 data: 0.000221 max mem: 18817 Epoch: [14/300] [ 300/1251] eta: 0:15:16 lr: 0.001426 loss: 4.520995 (4.556771) time: 0.949278 data: 0.000202 max mem: 18817 Epoch: [14/300] [ 350/1251] eta: 0:14:29 lr: 0.001430 loss: 4.465185 (4.545816) time: 0.970257 data: 0.000221 max mem: 18817 Epoch: [14/300] [ 400/1251] eta: 0:13:38 lr: 0.001434 loss: 4.757494 (4.551773) time: 0.970648 data: 0.000218 max mem: 18817 Epoch: [14/300] [ 450/1251] eta: 0:12:49 lr: 0.001438 loss: 4.470181 (4.541855) time: 0.916249 data: 0.000217 max mem: 18817 Epoch: [14/300] [ 500/1251] eta: 0:12:02 lr: 0.001442 loss: 4.793230 (4.540367) time: 0.928263 data: 0.000205 max mem: 18817 Epoch: [14/300] [ 550/1251] eta: 0:11:14 lr: 0.001446 loss: 4.650566 (4.533448) time: 0.940144 data: 0.000241 max mem: 18817 Epoch: [14/300] [ 600/1251] eta: 0:10:26 lr: 0.001450 loss: 4.856229 (4.548960) time: 0.970733 data: 0.000211 max mem: 18817 Epoch: [14/300] [ 650/1251] eta: 0:09:38 lr: 0.001454 loss: 4.455912 (4.543719) time: 0.983490 data: 0.000203 max mem: 18817 Epoch: [14/300] [ 700/1251] eta: 0:08:50 lr: 0.001458 loss: 4.534584 (4.548371) time: 0.944133 data: 0.000201 max mem: 18817 Epoch: [14/300] [ 750/1251] eta: 0:08:01 lr: 0.001461 loss: 4.793640 (4.564673) time: 0.914038 data: 0.000209 max mem: 18817 Epoch: [14/300] [ 800/1251] eta: 0:07:14 lr: 0.001465 loss: 4.523877 (4.566419) time: 0.947815 data: 0.000230 max mem: 18817 Epoch: [14/300] [ 850/1251] eta: 0:06:26 lr: 0.001469 loss: 4.433493 (4.562492) time: 0.982249 data: 0.000220 max mem: 18817 Epoch: [14/300] [ 900/1251] eta: 0:05:37 lr: 0.001473 loss: 4.620319 (4.566683) time: 0.968112 data: 0.000234 max mem: 18817 Epoch: [14/300] [ 950/1251] eta: 0:04:49 lr: 0.001477 loss: 4.162834 (4.554895) time: 0.941507 data: 0.000220 max mem: 18817 Epoch: [14/300] [1000/1251] eta: 0:04:01 lr: 0.001481 loss: 4.659983 (4.559006) time: 0.922907 data: 0.000222 max mem: 18817 Epoch: [14/300] [1050/1251] eta: 0:03:13 lr: 0.001485 loss: 4.707692 (4.555964) time: 0.932329 data: 0.000217 max mem: 18817 Epoch: [14/300] [1100/1251] eta: 0:02:25 lr: 0.001489 loss: 4.672017 (4.554990) time: 0.963819 data: 0.000223 max mem: 18817 Epoch: [14/300] [1150/1251] eta: 0:01:37 lr: 0.001493 loss: 4.529872 (4.549641) time: 0.961277 data: 0.000209 max mem: 18817 Epoch: [14/300] [1200/1251] eta: 0:00:48 lr: 0.001497 loss: 4.306796 (4.547581) time: 0.917058 data: 0.000226 max mem: 18817 Epoch: [14/300] [1250/1251] eta: 0:00:00 lr: 0.001501 loss: 4.461432 (4.542935) time: 0.931513 data: 0.000827 max mem: 18817 Epoch: [14/300] Total time: 0:20:02 (0.960922 s / it) Averaged stats: lr: 0.001501 loss: 4.461432 (4.547594) Test: [ 0/49] eta: 0:01:19 loss: 1.692676 (1.692676) acc1: 67.187500 (67.187500) acc5: 81.250000 (81.250000) time: 1.623788 data: 1.179146 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.741313 (1.772742) acc1: 62.500000 (61.931818) acc5: 84.375000 (84.943182) time: 0.493882 data: 0.107354 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.766822 (1.818838) acc1: 59.375000 (59.821429) acc5: 82.812500 (84.151786) time: 0.371354 data: 0.000153 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.826580 (1.790639) acc1: 59.375000 (61.340726) acc5: 82.812500 (84.223790) time: 0.362118 data: 0.000151 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.821773 (1.784026) acc1: 62.500000 (61.737805) acc5: 84.375000 (84.336890) time: 0.360480 data: 0.000143 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.831209 (1.792001) acc1: 62.500000 (61.760000) acc5: 84.375000 (84.256000) time: 0.356111 data: 0.000113 max mem: 18817 Test: Total time: 0:00:19 (0.392539 s / it) * Acc@1 61.396 Acc@5 84.332 loss 1.773 Max accuracy: 61.40% Epoch: [15/300] [ 0/1251] eta: 0:42:01 lr: 0.001501 loss: 4.691036 (4.691036) time: 2.015685 data: 1.104051 max mem: 18817 Epoch: [15/300] [ 50/1251] eta: 0:19:53 lr: 0.001506 loss: 4.688931 (4.643542) time: 1.050313 data: 0.000168 max mem: 18817 Epoch: [15/300] [ 100/1251] eta: 0:18:34 lr: 0.001510 loss: 4.520234 (4.511393) time: 0.971056 data: 0.000177 max mem: 18817 Epoch: [15/300] [ 150/1251] eta: 0:17:40 lr: 0.001514 loss: 4.565665 (4.538933) time: 0.932178 data: 0.000170 max mem: 18817 Epoch: [15/300] [ 200/1251] eta: 0:16:54 lr: 0.001518 loss: 4.602307 (4.505914) time: 0.942549 data: 0.000181 max mem: 18817 Epoch: [15/300] [ 250/1251] eta: 0:16:05 lr: 0.001522 loss: 4.591780 (4.512150) time: 0.965322 data: 0.000158 max mem: 18817 Epoch: [15/300] [ 300/1251] eta: 0:15:17 lr: 0.001526 loss: 4.403457 (4.503998) time: 1.025951 data: 0.000163 max mem: 18817 Epoch: [15/300] [ 350/1251] eta: 0:14:25 lr: 0.001530 loss: 4.753699 (4.523811) time: 0.958238 data: 0.000179 max mem: 18817 Epoch: [15/300] [ 400/1251] eta: 0:13:35 lr: 0.001534 loss: 4.540626 (4.509216) time: 0.929102 data: 0.000177 max mem: 18817 Epoch: [15/300] [ 450/1251] eta: 0:12:49 lr: 0.001538 loss: 4.814554 (4.532153) time: 0.946371 data: 0.000223 max mem: 18817 Epoch: [15/300] [ 500/1251] eta: 0:12:01 lr: 0.001542 loss: 4.431262 (4.528603) time: 0.967533 data: 0.000184 max mem: 18817 Epoch: [15/300] [ 550/1251] eta: 0:11:14 lr: 0.001546 loss: 4.416277 (4.515383) time: 1.036736 data: 0.000187 max mem: 18817 Epoch: [15/300] [ 600/1251] eta: 0:10:26 lr: 0.001550 loss: 4.642188 (4.517626) time: 0.990566 data: 0.000179 max mem: 18817 Epoch: [15/300] [ 650/1251] eta: 0:09:37 lr: 0.001554 loss: 4.734952 (4.509363) time: 0.914364 data: 0.000159 max mem: 18817 Epoch: [15/300] [ 700/1251] eta: 0:08:49 lr: 0.001558 loss: 4.796807 (4.521324) time: 0.928714 data: 0.000180 max mem: 18817 Epoch: [15/300] [ 750/1251] eta: 0:08:01 lr: 0.001562 loss: 4.461870 (4.519536) time: 0.969607 data: 0.000173 max mem: 18817 Epoch: [15/300] [ 800/1251] eta: 0:07:13 lr: 0.001566 loss: 4.593091 (4.521056) time: 1.032692 data: 0.000178 max mem: 18817 Epoch: [15/300] [ 850/1251] eta: 0:06:25 lr: 0.001570 loss: 4.713139 (4.525166) time: 0.985088 data: 0.000164 max mem: 18817 Epoch: [15/300] [ 900/1251] eta: 0:05:37 lr: 0.001574 loss: 4.405989 (4.520539) time: 0.929199 data: 0.000183 max mem: 18817 Epoch: [15/300] [ 950/1251] eta: 0:04:49 lr: 0.001578 loss: 4.586104 (4.517215) time: 0.942637 data: 0.000162 max mem: 18817 Epoch: [15/300] [1000/1251] eta: 0:04:01 lr: 0.001582 loss: 4.858743 (4.522120) time: 0.977564 data: 0.000164 max mem: 18817 Epoch: [15/300] [1050/1251] eta: 0:03:13 lr: 0.001586 loss: 4.355724 (4.516001) time: 1.026466 data: 0.000192 max mem: 18817 Epoch: [15/300] [1100/1251] eta: 0:02:25 lr: 0.001590 loss: 4.564292 (4.512560) time: 0.983609 data: 0.000167 max mem: 18817 Epoch: [15/300] [1150/1251] eta: 0:01:37 lr: 0.001594 loss: 4.645802 (4.513802) time: 0.930044 data: 0.000192 max mem: 18817 Epoch: [15/300] [1200/1251] eta: 0:00:49 lr: 0.001598 loss: 4.701621 (4.512530) time: 0.925309 data: 0.000172 max mem: 18817 Epoch: [15/300] [1250/1251] eta: 0:00:00 lr: 0.001602 loss: 4.352133 (4.508557) time: 0.982891 data: 0.000816 max mem: 18817 Epoch: [15/300] Total time: 0:20:02 (0.961516 s / it) Averaged stats: lr: 0.001602 loss: 4.352133 (4.501089) Test: [ 0/49] eta: 0:01:32 loss: 1.435622 (1.435622) acc1: 75.000000 (75.000000) acc5: 84.375000 (84.375000) time: 1.897328 data: 1.504462 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 1.666756 (1.622560) acc1: 64.062500 (65.909091) acc5: 84.375000 (84.232955) time: 0.520070 data: 0.136915 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 1.676627 (1.676564) acc1: 64.062500 (63.988095) acc5: 82.812500 (83.556548) time: 0.385429 data: 0.000139 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.719204 (1.681309) acc1: 62.500000 (63.860887) acc5: 82.812500 (83.870968) time: 0.375149 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.697365 (1.666499) acc1: 62.500000 (63.871951) acc5: 84.375000 (84.489329) time: 0.370065 data: 0.000133 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.689294 (1.666322) acc1: 64.062500 (63.776000) acc5: 84.375000 (84.544000) time: 0.365907 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.407345 s / it) * Acc@1 62.674 Acc@5 85.106 loss 1.666 Max accuracy: 62.67% Epoch: [16/300] [ 0/1251] eta: 0:46:30 lr: 0.001602 loss: 4.629332 (4.629332) time: 2.230872 data: 1.325704 max mem: 18817 Epoch: [16/300] [ 50/1251] eta: 0:19:59 lr: 0.001606 loss: 4.645107 (4.538282) time: 0.979240 data: 0.000173 max mem: 18817 Epoch: [16/300] [ 100/1251] eta: 0:18:42 lr: 0.001610 loss: 4.359744 (4.511169) time: 0.930415 data: 0.000190 max mem: 18817 Epoch: [16/300] [ 150/1251] eta: 0:17:48 lr: 0.001614 loss: 4.579390 (4.473589) time: 0.936938 data: 0.000159 max mem: 18817 Epoch: [16/300] [ 200/1251] eta: 0:16:57 lr: 0.001618 loss: 4.386834 (4.443395) time: 0.968395 data: 0.000196 max mem: 18817 Epoch: [16/300] [ 250/1251] eta: 0:16:03 lr: 0.001621 loss: 4.267190 (4.440654) time: 0.965187 data: 0.000163 max mem: 18817 Epoch: [16/300] [ 300/1251] eta: 0:15:14 lr: 0.001625 loss: 4.347179 (4.456731) time: 0.937477 data: 0.000157 max mem: 18817 Epoch: [16/300] [ 350/1251] eta: 0:14:25 lr: 0.001629 loss: 4.175235 (4.448993) time: 0.928922 data: 0.000158 max mem: 18817 Epoch: [16/300] [ 400/1251] eta: 0:13:37 lr: 0.001633 loss: 4.336621 (4.452598) time: 0.922410 data: 0.000173 max mem: 18817 Epoch: [16/300] [ 450/1251] eta: 0:12:50 lr: 0.001637 loss: 4.482149 (4.463291) time: 0.992857 data: 0.000176 max mem: 18817 Epoch: [16/300] [ 500/1251] eta: 0:12:02 lr: 0.001641 loss: 4.551424 (4.466763) time: 0.986322 data: 0.000162 max mem: 18817 Epoch: [16/300] [ 550/1251] eta: 0:11:13 lr: 0.001645 loss: 4.517185 (4.471069) time: 0.958974 data: 0.000172 max mem: 18817 Epoch: [16/300] [ 600/1251] eta: 0:10:24 lr: 0.001649 loss: 4.631408 (4.481478) time: 0.931623 data: 0.000181 max mem: 18817 Epoch: [16/300] [ 650/1251] eta: 0:09:36 lr: 0.001653 loss: 4.492917 (4.479193) time: 0.934874 data: 0.000164 max mem: 18817 Epoch: [16/300] [ 700/1251] eta: 0:08:48 lr: 0.001657 loss: 4.540360 (4.479908) time: 0.975707 data: 0.000167 max mem: 18817 Epoch: [16/300] [ 750/1251] eta: 0:08:00 lr: 0.001661 loss: 4.655313 (4.479722) time: 0.972050 data: 0.000171 max mem: 18817 Epoch: [16/300] [ 800/1251] eta: 0:07:11 lr: 0.001665 loss: 4.528744 (4.481602) time: 0.914086 data: 0.000175 max mem: 18817 Epoch: [16/300] [ 850/1251] eta: 0:06:24 lr: 0.001669 loss: 4.367990 (4.481000) time: 0.931598 data: 0.000165 max mem: 18817 Epoch: [16/300] [ 900/1251] eta: 0:05:36 lr: 0.001673 loss: 4.510084 (4.477571) time: 0.952108 data: 0.000169 max mem: 18817 Epoch: [16/300] [ 950/1251] eta: 0:04:48 lr: 0.001677 loss: 4.665898 (4.472053) time: 0.975620 data: 0.000187 max mem: 18817 Epoch: [16/300] [1000/1251] eta: 0:04:00 lr: 0.001681 loss: 4.425539 (4.467007) time: 0.966988 data: 0.000164 max mem: 18817 Epoch: [16/300] [1050/1251] eta: 0:03:12 lr: 0.001685 loss: 4.531284 (4.462744) time: 0.908532 data: 0.000172 max mem: 18817 Epoch: [16/300] [1100/1251] eta: 0:02:24 lr: 0.001689 loss: 4.219500 (4.456990) time: 0.925364 data: 0.000186 max mem: 18817 Epoch: [16/300] [1150/1251] eta: 0:01:36 lr: 0.001693 loss: 4.374728 (4.452404) time: 0.955923 data: 0.000178 max mem: 18817 Epoch: [16/300] [1200/1251] eta: 0:00:48 lr: 0.001697 loss: 4.813978 (4.456233) time: 0.962046 data: 0.000182 max mem: 18817 Epoch: [16/300] [1250/1251] eta: 0:00:00 lr: 0.001701 loss: 4.526840 (4.456827) time: 0.976496 data: 0.000738 max mem: 18817 Epoch: [16/300] Total time: 0:19:59 (0.958492 s / it) Averaged stats: lr: 0.001701 loss: 4.526840 (4.449998) Test: [ 0/49] eta: 0:01:31 loss: 1.601901 (1.601901) acc1: 70.312500 (70.312500) acc5: 85.937500 (85.937500) time: 1.874490 data: 1.474218 max mem: 18817 Test: [10/49] eta: 0:00:27 loss: 1.601901 (1.623996) acc1: 64.062500 (63.778409) acc5: 84.375000 (85.795455) time: 0.699911 data: 0.134168 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 1.736882 (1.684986) acc1: 62.500000 (63.318452) acc5: 84.375000 (84.523810) time: 0.473341 data: 0.000154 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 1.723623 (1.681038) acc1: 64.062500 (63.760081) acc5: 84.375000 (84.677419) time: 0.363632 data: 0.000146 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 1.688287 (1.672902) acc1: 64.062500 (63.910061) acc5: 84.375000 (84.984756) time: 0.360725 data: 0.000146 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.686890 (1.678399) acc1: 62.500000 (64.032000) acc5: 84.375000 (85.184000) time: 0.355676 data: 0.000120 max mem: 18817 Test: Total time: 0:00:21 (0.439168 s / it) * Acc@1 63.168 Acc@5 85.754 loss 1.678 Max accuracy: 63.17% Epoch: [17/300] [ 0/1251] eta: 0:43:01 lr: 0.001701 loss: 4.232922 (4.232922) time: 2.063851 data: 1.173897 max mem: 18817 Epoch: [17/300] [ 50/1251] eta: 0:19:42 lr: 0.001706 loss: 3.842193 (4.251409) time: 0.938393 data: 0.000183 max mem: 18817 Epoch: [17/300] [ 100/1251] eta: 0:18:38 lr: 0.001710 loss: 4.640407 (4.407454) time: 0.962357 data: 0.000191 max mem: 18817 Epoch: [17/300] [ 150/1251] eta: 0:17:49 lr: 0.001714 loss: 3.947158 (4.358720) time: 0.982651 data: 0.000170 max mem: 18817 Epoch: [17/300] [ 200/1251] eta: 0:16:55 lr: 0.001718 loss: 4.438936 (4.350187) time: 0.987473 data: 0.000160 max mem: 18817 Epoch: [17/300] [ 250/1251] eta: 0:16:01 lr: 0.001722 loss: 3.967884 (4.340780) time: 0.915614 data: 0.000168 max mem: 18817 Epoch: [17/300] [ 300/1251] eta: 0:15:14 lr: 0.001726 loss: 4.422297 (4.342850) time: 0.921849 data: 0.000172 max mem: 18817 Epoch: [17/300] [ 350/1251] eta: 0:14:27 lr: 0.001730 loss: 3.983367 (4.327536) time: 0.993883 data: 0.000163 max mem: 18817 Epoch: [17/300] [ 400/1251] eta: 0:13:40 lr: 0.001734 loss: 4.270162 (4.349916) time: 0.985167 data: 0.000187 max mem: 18817 Epoch: [17/300] [ 450/1251] eta: 0:12:50 lr: 0.001738 loss: 4.402121 (4.368161) time: 0.984546 data: 0.000179 max mem: 18817 Epoch: [17/300] [ 500/1251] eta: 0:12:01 lr: 0.001742 loss: 4.688963 (4.375449) time: 0.915969 data: 0.000167 max mem: 18817 Epoch: [17/300] [ 550/1251] eta: 0:11:14 lr: 0.001746 loss: 4.643146 (4.377810) time: 0.927673 data: 0.000168 max mem: 18817 Epoch: [17/300] [ 600/1251] eta: 0:10:26 lr: 0.001750 loss: 4.438003 (4.384363) time: 0.993355 data: 0.000174 max mem: 18817 Epoch: [17/300] [ 650/1251] eta: 0:09:38 lr: 0.001754 loss: 4.235264 (4.388832) time: 0.987870 data: 0.000178 max mem: 18817 Epoch: [17/300] [ 700/1251] eta: 0:08:50 lr: 0.001758 loss: 4.142475 (4.386582) time: 0.978964 data: 0.000168 max mem: 18817 Epoch: [17/300] [ 750/1251] eta: 0:08:01 lr: 0.001762 loss: 4.565901 (4.394319) time: 0.911700 data: 0.000162 max mem: 18817 Epoch: [17/300] [ 800/1251] eta: 0:07:13 lr: 0.001766 loss: 4.600295 (4.393720) time: 0.931282 data: 0.000164 max mem: 18817 Epoch: [17/300] [ 850/1251] eta: 0:06:25 lr: 0.001770 loss: 4.489902 (4.391809) time: 0.964758 data: 0.000179 max mem: 18817 Epoch: [17/300] [ 900/1251] eta: 0:05:37 lr: 0.001774 loss: 4.423527 (4.391596) time: 0.990495 data: 0.000174 max mem: 18817 Epoch: [17/300] [ 950/1251] eta: 0:04:49 lr: 0.001778 loss: 4.554546 (4.393597) time: 0.980875 data: 0.000182 max mem: 18817 Epoch: [17/300] [1000/1251] eta: 0:04:01 lr: 0.001781 loss: 4.763080 (4.398794) time: 0.908983 data: 0.000175 max mem: 18817 Epoch: [17/300] [1050/1251] eta: 0:03:13 lr: 0.001785 loss: 4.373975 (4.401620) time: 0.922362 data: 0.000164 max mem: 18817 Epoch: [17/300] [1100/1251] eta: 0:02:25 lr: 0.001789 loss: 4.510303 (4.409999) time: 0.982798 data: 0.000172 max mem: 18817 Epoch: [17/300] [1150/1251] eta: 0:01:37 lr: 0.001793 loss: 4.545554 (4.409878) time: 0.974020 data: 0.000177 max mem: 18817 Epoch: [17/300] [1200/1251] eta: 0:00:48 lr: 0.001797 loss: 4.644239 (4.413516) time: 0.979489 data: 0.000204 max mem: 18817 Epoch: [17/300] [1250/1251] eta: 0:00:00 lr: 0.001801 loss: 4.855886 (4.415964) time: 0.913439 data: 0.000751 max mem: 18817 Epoch: [17/300] Total time: 0:20:01 (0.960413 s / it) Averaged stats: lr: 0.001801 loss: 4.855886 (4.413952) Test: [ 0/49] eta: 0:01:19 loss: 1.428672 (1.428672) acc1: 67.187500 (67.187500) acc5: 90.625000 (90.625000) time: 1.622753 data: 1.204242 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.589196 (1.598978) acc1: 64.062500 (65.198864) acc5: 85.937500 (85.227273) time: 0.484363 data: 0.109623 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.649146 (1.685298) acc1: 62.500000 (62.946429) acc5: 82.812500 (84.598214) time: 0.367569 data: 0.000150 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.643624 (1.667910) acc1: 60.937500 (63.306452) acc5: 85.937500 (85.383065) time: 0.364130 data: 0.000149 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.651800 (1.663653) acc1: 62.500000 (63.452744) acc5: 85.937500 (85.480183) time: 0.367181 data: 0.000152 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.668695 (1.673921) acc1: 64.062500 (63.584000) acc5: 82.812500 (85.376000) time: 0.455169 data: 0.000122 max mem: 18817 Test: Total time: 0:00:21 (0.431692 s / it) * Acc@1 63.580 Acc@5 85.990 loss 1.663 Max accuracy: 63.58% Epoch: [18/300] [ 0/1251] eta: 0:44:25 lr: 0.001801 loss: 4.349717 (4.349717) time: 2.130572 data: 1.125828 max mem: 18817 Epoch: [18/300] [ 50/1251] eta: 0:19:18 lr: 0.001805 loss: 4.253119 (4.240935) time: 0.932710 data: 0.000161 max mem: 18817 Epoch: [18/300] [ 100/1251] eta: 0:18:39 lr: 0.001809 loss: 4.445236 (4.320220) time: 0.945256 data: 0.000166 max mem: 18817 Epoch: [18/300] [ 150/1251] eta: 0:17:48 lr: 0.001813 loss: 4.687062 (4.386262) time: 0.989396 data: 0.000176 max mem: 18817 Epoch: [18/300] [ 200/1251] eta: 0:17:00 lr: 0.001817 loss: 4.397109 (4.328699) time: 1.037293 data: 0.000153 max mem: 18817 Epoch: [18/300] [ 250/1251] eta: 0:16:05 lr: 0.001821 loss: 4.686829 (4.334699) time: 0.956882 data: 0.000162 max mem: 18817 Epoch: [18/300] [ 300/1251] eta: 0:15:13 lr: 0.001825 loss: 4.640443 (4.349690) time: 0.928280 data: 0.000184 max mem: 18817 Epoch: [18/300] [ 350/1251] eta: 0:14:28 lr: 0.001829 loss: 4.418715 (4.340023) time: 0.939065 data: 0.000178 max mem: 18817 Epoch: [18/300] [ 400/1251] eta: 0:13:41 lr: 0.001833 loss: 4.599555 (4.335219) time: 1.004833 data: 0.000199 max mem: 18817 Epoch: [18/300] [ 450/1251] eta: 0:12:53 lr: 0.001837 loss: 4.694471 (4.339375) time: 1.030209 data: 0.000190 max mem: 18817 Epoch: [18/300] [ 500/1251] eta: 0:12:04 lr: 0.001841 loss: 4.679447 (4.345937) time: 0.984512 data: 0.000171 max mem: 18817 Epoch: [18/300] [ 550/1251] eta: 0:11:14 lr: 0.001845 loss: 4.565101 (4.333534) time: 0.933552 data: 0.000174 max mem: 18817 Epoch: [18/300] [ 600/1251] eta: 0:10:27 lr: 0.001849 loss: 4.509675 (4.332824) time: 0.934826 data: 0.000190 max mem: 18817 Epoch: [18/300] [ 650/1251] eta: 0:09:40 lr: 0.001853 loss: 4.403844 (4.334727) time: 1.006404 data: 0.000151 max mem: 18817 Epoch: [18/300] [ 700/1251] eta: 0:08:51 lr: 0.001857 loss: 4.343949 (4.336244) time: 1.016419 data: 0.000160 max mem: 18817 Epoch: [18/300] [ 750/1251] eta: 0:08:03 lr: 0.001861 loss: 4.578107 (4.344566) time: 0.977095 data: 0.000179 max mem: 18817 Epoch: [18/300] [ 800/1251] eta: 0:07:14 lr: 0.001865 loss: 4.718336 (4.345489) time: 0.909040 data: 0.000162 max mem: 18817 Epoch: [18/300] [ 850/1251] eta: 0:06:26 lr: 0.001869 loss: 4.520523 (4.346307) time: 0.933518 data: 0.000181 max mem: 18817 Epoch: [18/300] [ 900/1251] eta: 0:05:38 lr: 0.001873 loss: 4.118190 (4.337401) time: 1.014563 data: 0.000162 max mem: 18817 Epoch: [18/300] [ 950/1251] eta: 0:04:50 lr: 0.001877 loss: 4.636527 (4.341145) time: 1.022468 data: 0.000177 max mem: 18817 Epoch: [18/300] [1000/1251] eta: 0:04:01 lr: 0.001881 loss: 4.494984 (4.339974) time: 0.967912 data: 0.000177 max mem: 18817 Epoch: [18/300] [1050/1251] eta: 0:03:13 lr: 0.001885 loss: 4.511902 (4.341268) time: 0.914527 data: 0.000169 max mem: 18817 Epoch: [18/300] [1100/1251] eta: 0:02:25 lr: 0.001889 loss: 4.668001 (4.348979) time: 0.942583 data: 0.000187 max mem: 18817 Epoch: [18/300] [1150/1251] eta: 0:01:37 lr: 0.001893 loss: 4.387407 (4.349047) time: 0.985465 data: 0.000164 max mem: 18817 Epoch: [18/300] [1200/1251] eta: 0:00:49 lr: 0.001897 loss: 4.415985 (4.351819) time: 1.036039 data: 0.000177 max mem: 18817 Epoch: [18/300] [1250/1251] eta: 0:00:00 lr: 0.001901 loss: 4.331642 (4.349221) time: 0.970826 data: 0.000767 max mem: 18817 Epoch: [18/300] Total time: 0:20:05 (0.963649 s / it) Averaged stats: lr: 0.001901 loss: 4.331642 (4.360386) Test: [ 0/49] eta: 0:01:20 loss: 1.577868 (1.577868) acc1: 68.750000 (68.750000) acc5: 85.937500 (85.937500) time: 1.636680 data: 1.211423 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.577868 (1.591088) acc1: 65.625000 (66.193182) acc5: 87.500000 (87.073864) time: 0.484442 data: 0.110298 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.670073 (1.641494) acc1: 65.625000 (64.732143) acc5: 87.500000 (86.904762) time: 0.365816 data: 0.000151 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.690961 (1.630969) acc1: 64.062500 (65.221774) acc5: 87.500000 (86.945565) time: 0.381433 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.644793 (1.626274) acc1: 65.625000 (65.548780) acc5: 87.500000 (86.966463) time: 0.388083 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.668266 (1.639885) acc1: 65.625000 (65.024000) acc5: 85.937500 (86.752000) time: 0.387520 data: 0.000107 max mem: 18817 Test: Total time: 0:00:20 (0.408199 s / it) * Acc@1 64.470 Acc@5 86.654 loss 1.647 Max accuracy: 64.47% Epoch: [19/300] [ 0/1251] eta: 0:42:54 lr: 0.001901 loss: 4.268517 (4.268517) time: 2.057794 data: 1.161597 max mem: 18817 Epoch: [19/300] [ 50/1251] eta: 0:19:54 lr: 0.001906 loss: 4.107862 (4.163947) time: 0.954185 data: 0.000176 max mem: 18817 Epoch: [19/300] [ 100/1251] eta: 0:18:51 lr: 0.001910 loss: 4.643059 (4.270838) time: 0.998285 data: 0.000193 max mem: 18817 Epoch: [19/300] [ 150/1251] eta: 0:17:49 lr: 0.001914 loss: 4.608280 (4.315012) time: 0.967842 data: 0.000182 max mem: 18817 Epoch: [19/300] [ 200/1251] eta: 0:17:03 lr: 0.001918 loss: 4.437435 (4.338185) time: 0.975217 data: 0.000164 max mem: 18817 Epoch: [19/300] [ 250/1251] eta: 0:16:06 lr: 0.001922 loss: 4.608202 (4.355376) time: 0.915671 data: 0.000166 max mem: 18817 Epoch: [19/300] [ 300/1251] eta: 0:15:18 lr: 0.001926 loss: 4.332340 (4.334266) time: 0.916221 data: 0.000166 max mem: 18817 Epoch: [19/300] [ 350/1251] eta: 0:14:29 lr: 0.001930 loss: 4.502846 (4.345213) time: 0.988398 data: 0.000163 max mem: 18817 Epoch: [19/300] [ 400/1251] eta: 0:13:37 lr: 0.001934 loss: 4.618928 (4.358157) time: 0.963120 data: 0.000181 max mem: 18817 Epoch: [19/300] [ 450/1251] eta: 0:12:49 lr: 0.001938 loss: 4.412062 (4.355727) time: 0.921955 data: 0.000178 max mem: 18817 Epoch: [19/300] [ 500/1251] eta: 0:12:01 lr: 0.001941 loss: 4.520787 (4.358220) time: 0.928590 data: 0.000171 max mem: 18817 Epoch: [19/300] [ 550/1251] eta: 0:11:13 lr: 0.001945 loss: 4.550580 (4.358893) time: 0.931276 data: 0.000177 max mem: 18817 Epoch: [19/300] [ 600/1251] eta: 0:10:26 lr: 0.001949 loss: 4.086541 (4.356915) time: 1.008849 data: 0.000181 max mem: 18817 Epoch: [19/300] [ 650/1251] eta: 0:09:37 lr: 0.001953 loss: 4.442756 (4.353083) time: 0.966803 data: 0.000162 max mem: 18817 Epoch: [19/300] [ 700/1251] eta: 0:08:49 lr: 0.001957 loss: 4.472715 (4.336040) time: 0.916535 data: 0.000163 max mem: 18817 Epoch: [19/300] [ 750/1251] eta: 0:08:01 lr: 0.001961 loss: 4.421626 (4.336914) time: 0.923415 data: 0.000169 max mem: 18817 Epoch: [19/300] [ 800/1251] eta: 0:07:13 lr: 0.001965 loss: 4.183831 (4.340190) time: 0.925131 data: 0.000172 max mem: 18817 Epoch: [19/300] [ 850/1251] eta: 0:06:25 lr: 0.001969 loss: 4.256966 (4.339259) time: 0.972229 data: 0.000162 max mem: 18817 Epoch: [19/300] [ 900/1251] eta: 0:05:36 lr: 0.001973 loss: 4.519255 (4.345735) time: 0.949644 data: 0.000171 max mem: 18817 Epoch: [19/300] [ 950/1251] eta: 0:04:48 lr: 0.001977 loss: 4.422403 (4.348598) time: 0.916653 data: 0.000173 max mem: 18817 Epoch: [19/300] [1000/1251] eta: 0:04:00 lr: 0.001981 loss: 4.594659 (4.359927) time: 0.930503 data: 0.000171 max mem: 18817 Epoch: [19/300] [1050/1251] eta: 0:03:13 lr: 0.001985 loss: 4.461404 (4.353409) time: 0.941015 data: 0.000176 max mem: 18817 Epoch: [19/300] [1100/1251] eta: 0:02:25 lr: 0.001989 loss: 4.587803 (4.353488) time: 0.988519 data: 0.000174 max mem: 18817 Epoch: [19/300] [1150/1251] eta: 0:01:37 lr: 0.001993 loss: 4.175706 (4.350069) time: 0.976117 data: 0.000162 max mem: 18817 Epoch: [19/300] [1200/1251] eta: 0:00:48 lr: 0.001997 loss: 4.580001 (4.352952) time: 0.919013 data: 0.000183 max mem: 18817 Epoch: [19/300] [1250/1251] eta: 0:00:00 lr: 0.001978 loss: 4.493227 (4.353701) time: 0.921588 data: 0.000749 max mem: 18817 Epoch: [19/300] Total time: 0:20:01 (0.960741 s / it) Averaged stats: lr: 0.001978 loss: 4.493227 (4.348969) Test: [ 0/49] eta: 0:01:18 loss: 1.466273 (1.466273) acc1: 62.500000 (62.500000) acc5: 90.625000 (90.625000) time: 1.594362 data: 1.142557 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.418635 (1.419353) acc1: 68.750000 (69.318182) acc5: 85.937500 (86.789773) time: 0.480765 data: 0.104031 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.508785 (1.487989) acc1: 68.750000 (66.592262) acc5: 85.937500 (86.383929) time: 0.365679 data: 0.000169 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 1.558170 (1.496448) acc1: 64.062500 (66.129032) acc5: 87.500000 (86.643145) time: 0.472122 data: 0.000147 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 1.480295 (1.498081) acc1: 65.625000 (66.044207) acc5: 87.500000 (86.814024) time: 0.470217 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.559409 (1.506925) acc1: 62.500000 (65.728000) acc5: 85.937500 (86.656000) time: 0.421431 data: 0.000099 max mem: 18817 Test: Total time: 0:00:21 (0.434449 s / it) * Acc@1 65.302 Acc@5 87.158 loss 1.514 Max accuracy: 65.30% Epoch: [20/300] [ 0/1251] eta: 0:39:50 lr: 0.001978 loss: 4.066475 (4.066475) time: 1.910510 data: 1.023899 max mem: 18817 Epoch: [20/300] [ 50/1251] eta: 0:19:48 lr: 0.001978 loss: 4.055418 (4.197467) time: 1.030712 data: 0.000169 max mem: 18817 Epoch: [20/300] [ 100/1251] eta: 0:18:36 lr: 0.001978 loss: 3.816449 (4.209768) time: 0.970140 data: 0.000166 max mem: 18817 Epoch: [20/300] [ 150/1251] eta: 0:17:42 lr: 0.001978 loss: 4.481092 (4.235212) time: 0.923472 data: 0.000163 max mem: 18817 Epoch: [20/300] [ 200/1251] eta: 0:16:53 lr: 0.001978 loss: 4.667202 (4.280527) time: 0.927206 data: 0.000178 max mem: 18817 Epoch: [20/300] [ 250/1251] eta: 0:16:05 lr: 0.001978 loss: 4.254550 (4.288208) time: 0.987143 data: 0.000175 max mem: 18817 Epoch: [20/300] [ 300/1251] eta: 0:15:16 lr: 0.001978 loss: 4.287267 (4.293050) time: 1.018636 data: 0.000161 max mem: 18817 Epoch: [20/300] [ 350/1251] eta: 0:14:25 lr: 0.001978 loss: 4.473417 (4.297591) time: 0.961533 data: 0.000154 max mem: 18817 Epoch: [20/300] [ 400/1251] eta: 0:13:36 lr: 0.001978 loss: 4.106559 (4.281734) time: 0.923337 data: 0.000170 max mem: 18817 Epoch: [20/300] [ 450/1251] eta: 0:12:48 lr: 0.001978 loss: 3.741347 (4.279567) time: 0.932211 data: 0.000180 max mem: 18817 Epoch: [20/300] [ 500/1251] eta: 0:12:01 lr: 0.001977 loss: 4.319625 (4.266751) time: 0.971883 data: 0.000171 max mem: 18817 Epoch: [20/300] [ 550/1251] eta: 0:11:13 lr: 0.001977 loss: 4.559837 (4.261662) time: 1.025184 data: 0.000171 max mem: 18817 Epoch: [20/300] [ 600/1251] eta: 0:10:23 lr: 0.001977 loss: 4.651808 (4.269160) time: 0.967594 data: 0.000171 max mem: 18817 Epoch: [20/300] [ 650/1251] eta: 0:09:35 lr: 0.001977 loss: 4.384905 (4.265114) time: 0.926841 data: 0.000158 max mem: 18817 Epoch: [20/300] [ 700/1251] eta: 0:08:47 lr: 0.001977 loss: 4.424110 (4.263407) time: 0.944420 data: 0.000168 max mem: 18817 Epoch: [20/300] [ 750/1251] eta: 0:08:00 lr: 0.001977 loss: 4.520906 (4.264565) time: 1.001589 data: 0.000185 max mem: 18817 Epoch: [20/300] [ 800/1251] eta: 0:07:12 lr: 0.001977 loss: 4.342828 (4.267984) time: 1.039294 data: 0.000174 max mem: 18817 Epoch: [20/300] [ 850/1251] eta: 0:06:24 lr: 0.001977 loss: 4.637253 (4.276319) time: 0.955684 data: 0.000174 max mem: 18817 Epoch: [20/300] [ 900/1251] eta: 0:05:36 lr: 0.001977 loss: 4.210192 (4.263355) time: 0.909555 data: 0.000165 max mem: 18817 Epoch: [20/300] [ 950/1251] eta: 0:04:48 lr: 0.001977 loss: 4.378781 (4.262327) time: 0.931028 data: 0.000192 max mem: 18817 Epoch: [20/300] [1000/1251] eta: 0:04:00 lr: 0.001977 loss: 4.352766 (4.253798) time: 0.996426 data: 0.000161 max mem: 18817 Epoch: [20/300] [1050/1251] eta: 0:03:12 lr: 0.001976 loss: 4.188661 (4.250304) time: 0.997732 data: 0.000175 max mem: 18817 Epoch: [20/300] [1100/1251] eta: 0:02:24 lr: 0.001976 loss: 4.223498 (4.252582) time: 0.981636 data: 0.000163 max mem: 18817 Epoch: [20/300] [1150/1251] eta: 0:01:36 lr: 0.001976 loss: 4.556788 (4.252643) time: 0.936725 data: 0.000173 max mem: 18817 Epoch: [20/300] [1200/1251] eta: 0:00:48 lr: 0.001976 loss: 4.253445 (4.254504) time: 0.943140 data: 0.000168 max mem: 18817 Epoch: [20/300] [1250/1251] eta: 0:00:00 lr: 0.001976 loss: 4.639829 (4.259645) time: 0.970579 data: 0.000762 max mem: 18817 Epoch: [20/300] Total time: 0:20:01 (0.960370 s / it) Averaged stats: lr: 0.001976 loss: 4.639829 (4.252653) Test: [ 0/49] eta: 0:01:25 loss: 1.316848 (1.316848) acc1: 71.875000 (71.875000) acc5: 90.625000 (90.625000) time: 1.743042 data: 1.348467 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.310282 (1.380357) acc1: 70.312500 (70.312500) acc5: 89.062500 (88.210227) time: 0.499224 data: 0.122737 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.379972 (1.464187) acc1: 67.187500 (67.410714) acc5: 87.500000 (86.755952) time: 0.368769 data: 0.000159 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.487527 (1.452459) acc1: 65.625000 (67.187500) acc5: 85.937500 (87.147177) time: 0.367296 data: 0.000163 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.451027 (1.457698) acc1: 67.187500 (67.111280) acc5: 87.500000 (87.271341) time: 0.365153 data: 0.000153 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.472979 (1.465179) acc1: 67.187500 (67.040000) acc5: 85.937500 (87.424000) time: 0.355961 data: 0.000117 max mem: 18817 Test: Total time: 0:00:19 (0.394918 s / it) * Acc@1 66.238 Acc@5 87.726 loss 1.465 Max accuracy: 66.24% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0020.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0020.pth Epoch: [21/300] [ 0/1251] eta: 0:39:51 lr: 0.001976 loss: 3.775282 (3.775282) time: 1.911757 data: 1.025867 max mem: 18817 Epoch: [21/300] [ 50/1251] eta: 0:19:28 lr: 0.001976 loss: 4.329800 (4.321133) time: 0.973798 data: 0.000169 max mem: 18817 Epoch: [21/300] [ 100/1251] eta: 0:18:34 lr: 0.001976 loss: 4.595988 (4.333930) time: 1.023374 data: 0.000170 max mem: 18817 Epoch: [21/300] [ 150/1251] eta: 0:17:40 lr: 0.001976 loss: 4.363496 (4.329126) time: 0.984023 data: 0.000161 max mem: 18817 Epoch: [21/300] [ 200/1251] eta: 0:16:48 lr: 0.001976 loss: 4.412799 (4.314469) time: 0.929182 data: 0.000162 max mem: 18817 Epoch: [21/300] [ 250/1251] eta: 0:16:04 lr: 0.001976 loss: 4.321609 (4.281321) time: 0.938478 data: 0.000181 max mem: 18817 Epoch: [21/300] [ 300/1251] eta: 0:15:15 lr: 0.001976 loss: 4.512537 (4.295108) time: 0.978080 data: 0.000161 max mem: 18817 Epoch: [21/300] [ 350/1251] eta: 0:14:27 lr: 0.001975 loss: 4.185944 (4.273498) time: 1.046453 data: 0.000166 max mem: 18817 Epoch: [21/300] [ 400/1251] eta: 0:13:37 lr: 0.001975 loss: 4.527695 (4.266651) time: 0.975186 data: 0.000179 max mem: 18817 Epoch: [21/300] [ 450/1251] eta: 0:12:48 lr: 0.001975 loss: 4.294322 (4.273229) time: 0.922874 data: 0.000173 max mem: 18817 Epoch: [21/300] [ 500/1251] eta: 0:12:00 lr: 0.001975 loss: 4.205183 (4.259462) time: 0.937309 data: 0.000181 max mem: 18817 Epoch: [21/300] [ 550/1251] eta: 0:11:12 lr: 0.001975 loss: 4.362905 (4.260992) time: 0.975647 data: 0.000174 max mem: 18817 Epoch: [21/300] [ 600/1251] eta: 0:10:25 lr: 0.001975 loss: 4.410956 (4.261350) time: 1.025144 data: 0.000189 max mem: 18817 Epoch: [21/300] [ 650/1251] eta: 0:09:36 lr: 0.001975 loss: 4.359779 (4.261811) time: 0.968279 data: 0.000173 max mem: 18817 Epoch: [21/300] [ 700/1251] eta: 0:08:48 lr: 0.001975 loss: 4.364347 (4.262908) time: 0.922410 data: 0.000165 max mem: 18817 Epoch: [21/300] [ 750/1251] eta: 0:08:01 lr: 0.001975 loss: 4.168863 (4.255995) time: 0.925605 data: 0.000189 max mem: 18817 Epoch: [21/300] [ 800/1251] eta: 0:07:13 lr: 0.001975 loss: 4.307220 (4.252426) time: 0.980571 data: 0.000174 max mem: 18817 Epoch: [21/300] [ 850/1251] eta: 0:06:25 lr: 0.001975 loss: 4.347001 (4.259864) time: 1.042609 data: 0.000173 max mem: 18817 Epoch: [21/300] [ 900/1251] eta: 0:05:36 lr: 0.001974 loss: 4.306456 (4.260574) time: 0.949827 data: 0.000178 max mem: 18817 Epoch: [21/300] [ 950/1251] eta: 0:04:48 lr: 0.001974 loss: 3.990655 (4.256314) time: 0.929493 data: 0.000181 max mem: 18817 Epoch: [21/300] [1000/1251] eta: 0:04:01 lr: 0.001974 loss: 4.471421 (4.255300) time: 0.942063 data: 0.000173 max mem: 18817 Epoch: [21/300] [1050/1251] eta: 0:03:13 lr: 0.001974 loss: 4.451185 (4.253355) time: 0.969816 data: 0.000189 max mem: 18817 Epoch: [21/300] [1100/1251] eta: 0:02:25 lr: 0.001974 loss: 4.108306 (4.250122) time: 1.022115 data: 0.000180 max mem: 18817 Epoch: [21/300] [1150/1251] eta: 0:01:36 lr: 0.001974 loss: 4.417651 (4.252378) time: 0.970001 data: 0.000179 max mem: 18817 Epoch: [21/300] [1200/1251] eta: 0:00:48 lr: 0.001974 loss: 3.977734 (4.248104) time: 0.922739 data: 0.000213 max mem: 18817 Epoch: [21/300] [1250/1251] eta: 0:00:00 lr: 0.001974 loss: 3.939117 (4.246414) time: 0.938785 data: 0.000753 max mem: 18817 Epoch: [21/300] Total time: 0:20:01 (0.960270 s / it) Averaged stats: lr: 0.001974 loss: 3.939117 (4.254057) Test: [ 0/49] eta: 0:01:31 loss: 1.342907 (1.342907) acc1: 70.312500 (70.312500) acc5: 89.062500 (89.062500) time: 1.871612 data: 1.481631 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.342907 (1.350508) acc1: 67.187500 (69.034091) acc5: 89.062500 (88.920455) time: 0.505774 data: 0.134818 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.387973 (1.428909) acc1: 67.187500 (68.005952) acc5: 87.500000 (87.872024) time: 0.365454 data: 0.000124 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.487666 (1.426831) acc1: 64.062500 (67.338710) acc5: 89.062500 (88.256048) time: 0.362694 data: 0.000122 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.476038 (1.426465) acc1: 65.625000 (67.492378) acc5: 89.062500 (88.109756) time: 0.360182 data: 0.000123 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.476972 (1.434111) acc1: 67.187500 (67.552000) acc5: 89.062500 (88.288000) time: 0.457228 data: 0.000101 max mem: 18817 Test: Total time: 0:00:21 (0.435969 s / it) * Acc@1 67.318 Acc@5 88.400 loss 1.427 Max accuracy: 67.32% Epoch: [22/300] [ 0/1251] eta: 0:41:54 lr: 0.001974 loss: 5.119713 (5.119713) time: 2.010145 data: 1.124901 max mem: 18817 Epoch: [22/300] [ 50/1251] eta: 0:19:54 lr: 0.001974 loss: 4.059851 (4.257096) time: 1.004534 data: 0.000149 max mem: 18817 Epoch: [22/300] [ 100/1251] eta: 0:18:30 lr: 0.001974 loss: 3.796785 (4.195003) time: 0.955272 data: 0.000168 max mem: 18817 Epoch: [22/300] [ 150/1251] eta: 0:17:36 lr: 0.001974 loss: 4.381697 (4.206689) time: 0.920308 data: 0.000172 max mem: 18817 Epoch: [22/300] [ 200/1251] eta: 0:16:48 lr: 0.001973 loss: 4.286884 (4.202134) time: 0.927171 data: 0.000163 max mem: 18817 Epoch: [22/300] [ 250/1251] eta: 0:16:00 lr: 0.001973 loss: 4.260142 (4.192423) time: 0.990289 data: 0.000208 max mem: 18817 Epoch: [22/300] [ 300/1251] eta: 0:15:13 lr: 0.001973 loss: 4.440772 (4.213198) time: 1.025467 data: 0.000163 max mem: 18817 Epoch: [22/300] [ 350/1251] eta: 0:14:22 lr: 0.001973 loss: 4.370767 (4.231220) time: 0.968664 data: 0.000177 max mem: 18817 Epoch: [22/300] [ 400/1251] eta: 0:13:33 lr: 0.001973 loss: 4.171752 (4.225436) time: 0.919581 data: 0.000218 max mem: 18817 Epoch: [22/300] [ 450/1251] eta: 0:12:47 lr: 0.001973 loss: 4.448080 (4.226737) time: 0.925373 data: 0.000191 max mem: 18817 Epoch: [22/300] [ 500/1251] eta: 0:12:00 lr: 0.001973 loss: 4.195262 (4.221848) time: 0.981219 data: 0.000173 max mem: 18817 Epoch: [22/300] [ 550/1251] eta: 0:11:13 lr: 0.001973 loss: 4.014089 (4.212000) time: 1.030205 data: 0.000179 max mem: 18817 Epoch: [22/300] [ 600/1251] eta: 0:10:24 lr: 0.001973 loss: 4.471974 (4.218082) time: 0.973001 data: 0.000180 max mem: 18817 Epoch: [22/300] [ 650/1251] eta: 0:09:35 lr: 0.001973 loss: 4.465582 (4.214618) time: 0.925422 data: 0.000170 max mem: 18817 Epoch: [22/300] [ 700/1251] eta: 0:08:48 lr: 0.001972 loss: 3.931600 (4.205530) time: 0.926192 data: 0.000179 max mem: 18817 Epoch: [22/300] [ 750/1251] eta: 0:08:00 lr: 0.001972 loss: 4.307492 (4.207248) time: 0.990050 data: 0.000187 max mem: 18817 Epoch: [22/300] [ 800/1251] eta: 0:07:12 lr: 0.001972 loss: 4.326677 (4.206367) time: 1.016888 data: 0.000192 max mem: 18817 Epoch: [22/300] [ 850/1251] eta: 0:06:24 lr: 0.001972 loss: 4.381680 (4.200857) time: 0.961817 data: 0.000171 max mem: 18817 Epoch: [22/300] [ 900/1251] eta: 0:05:36 lr: 0.001972 loss: 4.010015 (4.194591) time: 0.911153 data: 0.000184 max mem: 18817 Epoch: [22/300] [ 950/1251] eta: 0:04:48 lr: 0.001972 loss: 4.412856 (4.191776) time: 0.929440 data: 0.000165 max mem: 18817 Epoch: [22/300] [1000/1251] eta: 0:04:00 lr: 0.001972 loss: 4.246591 (4.190526) time: 0.985836 data: 0.000181 max mem: 18817 Epoch: [22/300] [1050/1251] eta: 0:03:12 lr: 0.001972 loss: 4.430448 (4.192875) time: 1.041452 data: 0.000179 max mem: 18817 Epoch: [22/300] [1100/1251] eta: 0:02:24 lr: 0.001972 loss: 4.514985 (4.199611) time: 0.970915 data: 0.000178 max mem: 18817 Epoch: [22/300] [1150/1251] eta: 0:01:36 lr: 0.001972 loss: 4.221507 (4.197509) time: 0.920507 data: 0.000171 max mem: 18817 Epoch: [22/300] [1200/1251] eta: 0:00:48 lr: 0.001971 loss: 4.415988 (4.202561) time: 0.922398 data: 0.000167 max mem: 18817 Epoch: [22/300] [1250/1251] eta: 0:00:00 lr: 0.001971 loss: 4.134251 (4.199193) time: 0.995868 data: 0.000747 max mem: 18817 Epoch: [22/300] Total time: 0:19:59 (0.958917 s / it) Averaged stats: lr: 0.001971 loss: 4.134251 (4.194239) Test: [ 0/49] eta: 0:01:14 loss: 1.155078 (1.155078) acc1: 76.562500 (76.562500) acc5: 93.750000 (93.750000) time: 1.524138 data: 1.077332 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.192973 (1.275040) acc1: 70.312500 (71.306818) acc5: 92.187500 (89.062500) time: 0.480817 data: 0.098082 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.349329 (1.357429) acc1: 68.750000 (68.675595) acc5: 89.062500 (88.690476) time: 0.368915 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.432721 (1.364749) acc1: 65.625000 (68.397177) acc5: 89.062500 (88.760081) time: 0.368110 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.408782 (1.363044) acc1: 67.187500 (68.635671) acc5: 87.500000 (88.605183) time: 0.366873 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.432721 (1.373015) acc1: 67.187500 (68.832000) acc5: 87.500000 (88.608000) time: 0.355791 data: 0.000103 max mem: 18817 Test: Total time: 0:00:19 (0.391096 s / it) * Acc@1 68.116 Acc@5 88.832 loss 1.374 Max accuracy: 68.12% Epoch: [23/300] [ 0/1251] eta: 0:42:07 lr: 0.001971 loss: 3.463358 (3.463358) time: 2.020205 data: 1.104951 max mem: 18817 Epoch: [23/300] [ 50/1251] eta: 0:19:22 lr: 0.001971 loss: 4.215511 (4.092038) time: 0.913621 data: 0.000159 max mem: 18817 Epoch: [23/300] [ 100/1251] eta: 0:18:34 lr: 0.001971 loss: 4.470028 (4.173858) time: 0.930055 data: 0.000163 max mem: 18817 Epoch: [23/300] [ 150/1251] eta: 0:17:42 lr: 0.001971 loss: 3.858850 (4.113321) time: 0.933842 data: 0.000182 max mem: 18817 Epoch: [23/300] [ 200/1251] eta: 0:16:52 lr: 0.001971 loss: 4.457312 (4.129326) time: 0.978875 data: 0.000167 max mem: 18817 Epoch: [23/300] [ 250/1251] eta: 0:16:00 lr: 0.001971 loss: 4.209168 (4.129580) time: 0.965418 data: 0.000156 max mem: 18817 Epoch: [23/300] [ 300/1251] eta: 0:15:09 lr: 0.001971 loss: 4.355472 (4.130834) time: 0.909733 data: 0.000166 max mem: 18817 Epoch: [23/300] [ 350/1251] eta: 0:14:23 lr: 0.001971 loss: 4.152915 (4.142284) time: 0.928649 data: 0.000166 max mem: 18817 Epoch: [23/300] [ 400/1251] eta: 0:13:39 lr: 0.001971 loss: 4.283764 (4.153484) time: 0.972263 data: 0.000181 max mem: 18817 Epoch: [23/300] [ 450/1251] eta: 0:12:51 lr: 0.001970 loss: 4.349258 (4.146061) time: 0.971471 data: 0.000187 max mem: 18817 Epoch: [23/300] [ 500/1251] eta: 0:12:01 lr: 0.001970 loss: 4.185530 (4.146973) time: 0.949021 data: 0.000185 max mem: 18817 Epoch: [23/300] [ 550/1251] eta: 0:11:12 lr: 0.001970 loss: 4.217535 (4.160378) time: 0.909097 data: 0.000179 max mem: 18817 Epoch: [23/300] [ 600/1251] eta: 0:10:25 lr: 0.001970 loss: 4.213943 (4.173077) time: 0.948466 data: 0.000175 max mem: 18817 Epoch: [23/300] [ 650/1251] eta: 0:09:38 lr: 0.001970 loss: 4.306056 (4.170635) time: 0.936104 data: 0.000171 max mem: 18817 Epoch: [23/300] [ 700/1251] eta: 0:08:50 lr: 0.001970 loss: 4.522101 (4.167528) time: 0.999915 data: 0.000160 max mem: 18817 Epoch: [23/300] [ 750/1251] eta: 0:08:02 lr: 0.001970 loss: 4.124975 (4.172287) time: 0.964606 data: 0.000157 max mem: 18817 Epoch: [23/300] [ 800/1251] eta: 0:07:14 lr: 0.001970 loss: 4.382053 (4.171954) time: 0.954623 data: 0.000177 max mem: 18817 Epoch: [23/300] [ 850/1251] eta: 0:06:25 lr: 0.001970 loss: 3.935601 (4.163199) time: 0.926288 data: 0.000165 max mem: 18817 Epoch: [23/300] [ 900/1251] eta: 0:05:37 lr: 0.001970 loss: 4.401510 (4.167554) time: 0.931601 data: 0.000187 max mem: 18817 Epoch: [23/300] [ 950/1251] eta: 0:04:49 lr: 0.001969 loss: 4.270208 (4.171060) time: 1.001186 data: 0.000174 max mem: 18817 Epoch: [23/300] [1000/1251] eta: 0:04:01 lr: 0.001969 loss: 4.003245 (4.162266) time: 0.968202 data: 0.000163 max mem: 18817 Epoch: [23/300] [1050/1251] eta: 0:03:13 lr: 0.001969 loss: 4.318038 (4.162689) time: 0.956057 data: 0.000191 max mem: 18817 Epoch: [23/300] [1100/1251] eta: 0:02:25 lr: 0.001969 loss: 4.231430 (4.162593) time: 0.930123 data: 0.000176 max mem: 18817 Epoch: [23/300] [1150/1251] eta: 0:01:37 lr: 0.001969 loss: 4.052800 (4.155879) time: 0.925409 data: 0.000167 max mem: 18817 Epoch: [23/300] [1200/1251] eta: 0:00:49 lr: 0.001969 loss: 4.490297 (4.160713) time: 0.990924 data: 0.000180 max mem: 18817 Epoch: [23/300] [1250/1251] eta: 0:00:00 lr: 0.001969 loss: 4.112908 (4.160493) time: 0.995467 data: 0.000744 max mem: 18817 Epoch: [23/300] Total time: 0:20:05 (0.963526 s / it) Averaged stats: lr: 0.001969 loss: 4.112908 (4.159551) Test: [ 0/49] eta: 0:01:25 loss: 1.128167 (1.128167) acc1: 76.562500 (76.562500) acc5: 95.312500 (95.312500) time: 1.748761 data: 1.340640 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 1.256914 (1.300591) acc1: 70.312500 (69.602273) acc5: 90.625000 (89.772727) time: 0.520805 data: 0.122034 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 1.344943 (1.350160) acc1: 67.187500 (68.675595) acc5: 89.062500 (89.508929) time: 0.394053 data: 0.000165 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.404474 (1.359410) acc1: 68.750000 (69.153226) acc5: 89.062500 (89.264113) time: 0.376048 data: 0.000160 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.367925 (1.357753) acc1: 71.875000 (69.359756) acc5: 87.500000 (88.948171) time: 0.364081 data: 0.000146 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.426015 (1.364017) acc1: 68.750000 (69.344000) acc5: 87.500000 (88.992000) time: 0.369451 data: 0.000111 max mem: 18817 Test: Total time: 0:00:20 (0.408871 s / it) * Acc@1 68.278 Acc@5 89.102 loss 1.374 Max accuracy: 68.28% Epoch: [24/300] [ 0/1251] eta: 0:43:15 lr: 0.001969 loss: 4.545010 (4.545010) time: 2.074955 data: 1.158792 max mem: 18817 Epoch: [24/300] [ 50/1251] eta: 0:19:47 lr: 0.001969 loss: 4.254833 (4.200863) time: 0.914568 data: 0.000174 max mem: 18817 Epoch: [24/300] [ 100/1251] eta: 0:18:44 lr: 0.001969 loss: 4.493511 (4.214669) time: 0.999271 data: 0.000178 max mem: 18817 Epoch: [24/300] [ 150/1251] eta: 0:17:51 lr: 0.001969 loss: 4.276215 (4.221826) time: 0.980948 data: 0.000170 max mem: 18817 Epoch: [24/300] [ 200/1251] eta: 0:16:53 lr: 0.001968 loss: 3.713927 (4.190458) time: 0.961466 data: 0.000169 max mem: 18817 Epoch: [24/300] [ 250/1251] eta: 0:16:01 lr: 0.001968 loss: 4.303607 (4.196232) time: 0.906307 data: 0.000175 max mem: 18817 Epoch: [24/300] [ 300/1251] eta: 0:15:13 lr: 0.001968 loss: 4.071939 (4.184564) time: 0.908591 data: 0.000167 max mem: 18817 Epoch: [24/300] [ 350/1251] eta: 0:14:26 lr: 0.001968 loss: 3.949853 (4.178095) time: 0.983784 data: 0.000168 max mem: 18817 Epoch: [24/300] [ 400/1251] eta: 0:13:38 lr: 0.001968 loss: 4.095041 (4.164362) time: 1.010814 data: 0.000166 max mem: 18817 Epoch: [24/300] [ 450/1251] eta: 0:12:48 lr: 0.001968 loss: 4.275681 (4.163480) time: 0.963007 data: 0.000177 max mem: 18817 Epoch: [24/300] [ 500/1251] eta: 0:11:59 lr: 0.001968 loss: 3.559048 (4.150891) time: 0.922884 data: 0.000176 max mem: 18817 Epoch: [24/300] [ 550/1251] eta: 0:11:12 lr: 0.001968 loss: 3.893384 (4.136626) time: 0.920509 data: 0.000179 max mem: 18817 Epoch: [24/300] [ 600/1251] eta: 0:10:25 lr: 0.001968 loss: 4.397469 (4.138876) time: 1.037434 data: 0.000172 max mem: 18817 Epoch: [24/300] [ 650/1251] eta: 0:09:37 lr: 0.001967 loss: 4.354017 (4.140770) time: 1.006645 data: 0.000154 max mem: 18817 Epoch: [24/300] [ 700/1251] eta: 0:08:49 lr: 0.001967 loss: 4.206681 (4.134709) time: 0.995407 data: 0.000171 max mem: 18817 Epoch: [24/300] [ 750/1251] eta: 0:08:01 lr: 0.001967 loss: 4.436749 (4.140592) time: 0.919973 data: 0.000172 max mem: 18817 Epoch: [24/300] [ 800/1251] eta: 0:07:13 lr: 0.001967 loss: 4.445100 (4.154186) time: 0.913699 data: 0.000183 max mem: 18817 Epoch: [24/300] [ 850/1251] eta: 0:06:24 lr: 0.001967 loss: 4.451323 (4.161644) time: 0.955907 data: 0.000170 max mem: 18817 Epoch: [24/300] [ 900/1251] eta: 0:05:37 lr: 0.001967 loss: 4.280695 (4.158965) time: 1.032078 data: 0.000176 max mem: 18817 Epoch: [24/300] [ 950/1251] eta: 0:04:49 lr: 0.001967 loss: 4.214870 (4.153659) time: 0.977324 data: 0.000182 max mem: 18817 Epoch: [24/300] [1000/1251] eta: 0:04:00 lr: 0.001967 loss: 4.189254 (4.149508) time: 0.922232 data: 0.000184 max mem: 18817 Epoch: [24/300] [1050/1251] eta: 0:03:13 lr: 0.001967 loss: 4.463884 (4.159454) time: 0.935073 data: 0.000181 max mem: 18817 Epoch: [24/300] [1100/1251] eta: 0:02:25 lr: 0.001967 loss: 4.306000 (4.164548) time: 0.978788 data: 0.000176 max mem: 18817 Epoch: [24/300] [1150/1251] eta: 0:01:37 lr: 0.001966 loss: 4.030526 (4.167684) time: 1.032897 data: 0.000184 max mem: 18817 Epoch: [24/300] [1200/1251] eta: 0:00:48 lr: 0.001966 loss: 4.057441 (4.158773) time: 0.974752 data: 0.000216 max mem: 18817 Epoch: [24/300] [1250/1251] eta: 0:00:00 lr: 0.001966 loss: 4.298994 (4.158621) time: 0.910515 data: 0.000738 max mem: 18817 Epoch: [24/300] Total time: 0:20:01 (0.960363 s / it) Averaged stats: lr: 0.001966 loss: 4.298994 (4.153088) Test: [ 0/49] eta: 0:01:15 loss: 1.170548 (1.170548) acc1: 71.875000 (71.875000) acc5: 90.625000 (90.625000) time: 1.544966 data: 1.118943 max mem: 18817 Test: [10/49] eta: 0:00:24 loss: 1.170548 (1.235951) acc1: 71.875000 (72.443182) acc5: 90.625000 (89.772727) time: 0.632305 data: 0.101864 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 1.293177 (1.305398) acc1: 70.312500 (69.419643) acc5: 89.062500 (89.508929) time: 0.450806 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.345374 (1.300095) acc1: 70.312500 (69.858871) acc5: 89.062500 (89.364919) time: 0.362275 data: 0.000145 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.290418 (1.298159) acc1: 70.312500 (70.045732) acc5: 89.062500 (89.443598) time: 0.361577 data: 0.000156 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.401957 (1.302226) acc1: 70.312500 (69.984000) acc5: 89.062500 (89.408000) time: 0.355993 data: 0.000123 max mem: 18817 Test: Total time: 0:00:20 (0.422513 s / it) * Acc@1 69.084 Acc@5 89.500 loss 1.310 Max accuracy: 69.08% Epoch: [25/300] [ 0/1251] eta: 0:43:53 lr: 0.001966 loss: 4.953749 (4.953749) time: 2.104731 data: 1.205284 max mem: 18817 Epoch: [25/300] [ 50/1251] eta: 0:19:55 lr: 0.001966 loss: 4.012212 (3.960385) time: 0.995733 data: 0.000153 max mem: 18817 Epoch: [25/300] [ 100/1251] eta: 0:18:48 lr: 0.001966 loss: 4.194425 (4.087354) time: 1.033600 data: 0.000176 max mem: 18817 Epoch: [25/300] [ 150/1251] eta: 0:17:47 lr: 0.001966 loss: 4.289976 (4.096574) time: 0.969225 data: 0.000174 max mem: 18817 Epoch: [25/300] [ 200/1251] eta: 0:16:49 lr: 0.001966 loss: 4.378085 (4.116649) time: 0.918850 data: 0.000163 max mem: 18817 Epoch: [25/300] [ 250/1251] eta: 0:16:02 lr: 0.001966 loss: 4.269836 (4.117990) time: 0.916899 data: 0.000174 max mem: 18817 Epoch: [25/300] [ 300/1251] eta: 0:15:15 lr: 0.001966 loss: 4.323483 (4.091698) time: 0.991438 data: 0.000183 max mem: 18817 Epoch: [25/300] [ 350/1251] eta: 0:14:29 lr: 0.001965 loss: 4.008525 (4.090761) time: 1.052656 data: 0.000158 max mem: 18817 Epoch: [25/300] [ 400/1251] eta: 0:13:39 lr: 0.001965 loss: 4.308331 (4.097804) time: 0.976193 data: 0.000184 max mem: 18817 Epoch: [25/300] [ 450/1251] eta: 0:12:50 lr: 0.001965 loss: 4.402642 (4.105052) time: 0.929817 data: 0.000165 max mem: 18817 Epoch: [25/300] [ 500/1251] eta: 0:12:03 lr: 0.001965 loss: 4.413647 (4.109464) time: 0.923204 data: 0.000181 max mem: 18817 Epoch: [25/300] [ 550/1251] eta: 0:11:15 lr: 0.001965 loss: 4.029467 (4.115226) time: 0.974569 data: 0.000171 max mem: 18817 Epoch: [25/300] [ 600/1251] eta: 0:10:28 lr: 0.001965 loss: 4.204017 (4.108157) time: 1.026602 data: 0.000167 max mem: 18817 Epoch: [25/300] [ 650/1251] eta: 0:09:38 lr: 0.001965 loss: 4.226407 (4.105193) time: 0.952879 data: 0.000166 max mem: 18817 Epoch: [25/300] [ 700/1251] eta: 0:08:49 lr: 0.001965 loss: 3.915895 (4.102196) time: 0.919143 data: 0.000162 max mem: 18817 Epoch: [25/300] [ 750/1251] eta: 0:08:02 lr: 0.001965 loss: 4.068923 (4.095804) time: 0.923371 data: 0.000176 max mem: 18817 Epoch: [25/300] [ 800/1251] eta: 0:07:14 lr: 0.001964 loss: 4.203052 (4.093980) time: 0.975032 data: 0.000181 max mem: 18817 Epoch: [25/300] [ 850/1251] eta: 0:06:26 lr: 0.001964 loss: 4.255575 (4.100689) time: 1.031331 data: 0.000181 max mem: 18817 Epoch: [25/300] [ 900/1251] eta: 0:05:37 lr: 0.001964 loss: 4.416997 (4.104460) time: 0.982202 data: 0.000178 max mem: 18817 Epoch: [25/300] [ 950/1251] eta: 0:04:49 lr: 0.001964 loss: 4.262954 (4.107585) time: 0.916251 data: 0.000179 max mem: 18817 Epoch: [25/300] [1000/1251] eta: 0:04:01 lr: 0.001964 loss: 4.321949 (4.108261) time: 0.918910 data: 0.000171 max mem: 18817 Epoch: [25/300] [1050/1251] eta: 0:03:13 lr: 0.001964 loss: 4.408642 (4.112706) time: 0.983330 data: 0.000183 max mem: 18817 Epoch: [25/300] [1100/1251] eta: 0:02:25 lr: 0.001964 loss: 3.814085 (4.111471) time: 1.049840 data: 0.000169 max mem: 18817 Epoch: [25/300] [1150/1251] eta: 0:01:37 lr: 0.001964 loss: 4.095294 (4.112205) time: 0.960907 data: 0.000190 max mem: 18817 Epoch: [25/300] [1200/1251] eta: 0:00:48 lr: 0.001964 loss: 4.261015 (4.111896) time: 0.925086 data: 0.000181 max mem: 18817 Epoch: [25/300] [1250/1251] eta: 0:00:00 lr: 0.001963 loss: 3.882133 (4.107804) time: 0.926042 data: 0.000749 max mem: 18817 Epoch: [25/300] Total time: 0:20:02 (0.961299 s / it) Averaged stats: lr: 0.001963 loss: 3.882133 (4.112853) Test: [ 0/49] eta: 0:01:26 loss: 1.203001 (1.203001) acc1: 79.687500 (79.687500) acc5: 92.187500 (92.187500) time: 1.757905 data: 1.331692 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.169812 (1.221278) acc1: 75.000000 (72.727273) acc5: 90.625000 (91.193182) time: 0.495289 data: 0.121223 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.259129 (1.283988) acc1: 68.750000 (70.461310) acc5: 90.625000 (90.178571) time: 0.364592 data: 0.000153 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.345362 (1.283072) acc1: 68.750000 (70.514113) acc5: 89.062500 (89.919355) time: 0.362065 data: 0.000139 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.261034 (1.291039) acc1: 70.312500 (70.579268) acc5: 89.062500 (89.634146) time: 0.370625 data: 0.000135 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.276743 (1.295265) acc1: 70.312500 (70.432000) acc5: 87.500000 (89.664000) time: 0.460508 data: 0.000111 max mem: 18817 Test: Total time: 0:00:21 (0.434810 s / it) * Acc@1 69.338 Acc@5 89.840 loss 1.315 Max accuracy: 69.34% Epoch: [26/300] [ 0/1251] eta: 0:43:06 lr: 0.001963 loss: 4.258037 (4.258037) time: 2.067883 data: 1.169885 max mem: 18817 Epoch: [26/300] [ 50/1251] eta: 0:19:13 lr: 0.001963 loss: 4.443892 (4.032063) time: 0.958302 data: 0.000193 max mem: 18817 Epoch: [26/300] [ 100/1251] eta: 0:18:18 lr: 0.001963 loss: 3.906702 (4.024454) time: 0.912980 data: 0.000178 max mem: 18817 Epoch: [26/300] [ 150/1251] eta: 0:17:40 lr: 0.001963 loss: 4.144665 (4.038547) time: 0.921993 data: 0.000160 max mem: 18817 Epoch: [26/300] [ 200/1251] eta: 0:16:52 lr: 0.001963 loss: 4.337150 (4.039773) time: 0.931089 data: 0.000181 max mem: 18817 Epoch: [26/300] [ 250/1251] eta: 0:16:05 lr: 0.001963 loss: 4.205911 (4.052217) time: 0.991115 data: 0.000164 max mem: 18817 Epoch: [26/300] [ 300/1251] eta: 0:15:14 lr: 0.001963 loss: 4.483372 (4.076841) time: 0.989860 data: 0.000180 max mem: 18817 Epoch: [26/300] [ 350/1251] eta: 0:14:24 lr: 0.001963 loss: 4.122808 (4.086042) time: 0.909184 data: 0.000178 max mem: 18817 Epoch: [26/300] [ 400/1251] eta: 0:13:37 lr: 0.001963 loss: 4.247746 (4.078806) time: 0.917154 data: 0.000164 max mem: 18817 Epoch: [26/300] [ 450/1251] eta: 0:12:50 lr: 0.001962 loss: 4.238574 (4.079787) time: 0.935321 data: 0.000175 max mem: 18817 Epoch: [26/300] [ 500/1251] eta: 0:12:03 lr: 0.001962 loss: 4.139685 (4.080159) time: 0.989906 data: 0.000179 max mem: 18817 Epoch: [26/300] [ 550/1251] eta: 0:11:13 lr: 0.001962 loss: 4.275931 (4.084603) time: 0.966516 data: 0.000182 max mem: 18817 Epoch: [26/300] [ 600/1251] eta: 0:10:25 lr: 0.001962 loss: 4.266687 (4.087759) time: 0.915009 data: 0.000172 max mem: 18817 Epoch: [26/300] [ 650/1251] eta: 0:09:37 lr: 0.001962 loss: 4.087367 (4.077282) time: 0.921257 data: 0.000173 max mem: 18817 Epoch: [26/300] [ 700/1251] eta: 0:08:49 lr: 0.001962 loss: 4.109108 (4.078401) time: 0.925601 data: 0.000164 max mem: 18817 Epoch: [26/300] [ 750/1251] eta: 0:08:01 lr: 0.001962 loss: 4.068635 (4.076369) time: 0.971477 data: 0.000191 max mem: 18817 Epoch: [26/300] [ 800/1251] eta: 0:07:12 lr: 0.001962 loss: 4.335042 (4.083801) time: 0.964706 data: 0.000191 max mem: 18817 Epoch: [26/300] [ 850/1251] eta: 0:06:24 lr: 0.001962 loss: 3.994381 (4.084979) time: 0.908997 data: 0.000161 max mem: 18817 Epoch: [26/300] [ 900/1251] eta: 0:05:37 lr: 0.001961 loss: 4.213182 (4.079398) time: 0.931145 data: 0.000182 max mem: 18817 Epoch: [26/300] [ 950/1251] eta: 0:04:49 lr: 0.001961 loss: 4.401341 (4.082742) time: 0.935989 data: 0.000176 max mem: 18817 Epoch: [26/300] [1000/1251] eta: 0:04:01 lr: 0.001961 loss: 4.357724 (4.089098) time: 0.967962 data: 0.000170 max mem: 18817 Epoch: [26/300] [1050/1251] eta: 0:03:12 lr: 0.001961 loss: 4.082687 (4.093221) time: 0.977263 data: 0.000176 max mem: 18817 Epoch: [26/300] [1100/1251] eta: 0:02:24 lr: 0.001961 loss: 4.410644 (4.097008) time: 0.910331 data: 0.000175 max mem: 18817 Epoch: [26/300] [1150/1251] eta: 0:01:36 lr: 0.001961 loss: 4.059975 (4.089063) time: 0.926786 data: 0.000166 max mem: 18817 Epoch: [26/300] [1200/1251] eta: 0:00:48 lr: 0.001961 loss: 3.716343 (4.081522) time: 0.937452 data: 0.000180 max mem: 18817 Epoch: [26/300] [1250/1251] eta: 0:00:00 lr: 0.001961 loss: 3.978102 (4.079188) time: 1.003259 data: 0.000803 max mem: 18817 Epoch: [26/300] Total time: 0:20:02 (0.961513 s / it) Averaged stats: lr: 0.001961 loss: 3.978102 (4.080083) Test: [ 0/49] eta: 0:01:27 loss: 1.086617 (1.086617) acc1: 73.437500 (73.437500) acc5: 92.187500 (92.187500) time: 1.793434 data: 1.382916 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.164904 (1.180784) acc1: 73.437500 (72.017045) acc5: 92.187500 (91.335227) time: 0.501161 data: 0.125883 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.247883 (1.258408) acc1: 70.312500 (70.461310) acc5: 92.187500 (90.327381) time: 0.374054 data: 0.000156 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.295871 (1.263607) acc1: 70.312500 (70.564516) acc5: 90.625000 (90.322581) time: 0.377351 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.294153 (1.272697) acc1: 70.312500 (70.617378) acc5: 89.062500 (90.205793) time: 0.367814 data: 0.000116 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.294153 (1.282960) acc1: 68.750000 (70.336000) acc5: 89.062500 (90.272000) time: 0.355009 data: 0.000098 max mem: 18817 Test: Total time: 0:00:19 (0.399000 s / it) * Acc@1 69.616 Acc@5 90.102 loss 1.294 Max accuracy: 69.62% Epoch: [27/300] [ 0/1251] eta: 0:41:51 lr: 0.001961 loss: 4.446847 (4.446847) time: 2.007794 data: 1.106354 max mem: 18817 Epoch: [27/300] [ 50/1251] eta: 0:19:25 lr: 0.001961 loss: 3.859874 (3.973203) time: 0.919798 data: 0.000162 max mem: 18817 Epoch: [27/300] [ 100/1251] eta: 0:18:45 lr: 0.001960 loss: 4.408089 (4.084834) time: 0.946924 data: 0.000175 max mem: 18817 Epoch: [27/300] [ 150/1251] eta: 0:17:52 lr: 0.001960 loss: 4.009620 (4.040403) time: 0.995907 data: 0.000170 max mem: 18817 Epoch: [27/300] [ 200/1251] eta: 0:17:03 lr: 0.001960 loss: 4.256551 (4.005714) time: 1.030290 data: 0.000164 max mem: 18817 Epoch: [27/300] [ 250/1251] eta: 0:16:07 lr: 0.001960 loss: 3.979925 (4.000512) time: 0.951872 data: 0.000167 max mem: 18817 Epoch: [27/300] [ 300/1251] eta: 0:15:16 lr: 0.001960 loss: 4.212864 (4.015666) time: 0.929647 data: 0.000171 max mem: 18817 Epoch: [27/300] [ 350/1251] eta: 0:14:30 lr: 0.001960 loss: 3.850406 (4.001433) time: 0.932418 data: 0.000171 max mem: 18817 Epoch: [27/300] [ 400/1251] eta: 0:13:42 lr: 0.001960 loss: 3.818303 (3.999037) time: 0.987803 data: 0.000173 max mem: 18817 Epoch: [27/300] [ 450/1251] eta: 0:12:53 lr: 0.001960 loss: 3.579249 (3.985154) time: 1.026855 data: 0.000174 max mem: 18817 Epoch: [27/300] [ 500/1251] eta: 0:12:03 lr: 0.001959 loss: 4.112889 (3.994514) time: 0.975136 data: 0.000172 max mem: 18817 Epoch: [27/300] [ 550/1251] eta: 0:11:14 lr: 0.001959 loss: 4.266153 (4.007828) time: 0.931861 data: 0.000161 max mem: 18817 Epoch: [27/300] [ 600/1251] eta: 0:10:26 lr: 0.001959 loss: 4.462183 (4.013445) time: 0.927292 data: 0.000180 max mem: 18817 Epoch: [27/300] [ 650/1251] eta: 0:09:38 lr: 0.001959 loss: 3.801059 (4.012931) time: 0.992103 data: 0.000162 max mem: 18817 Epoch: [27/300] [ 700/1251] eta: 0:08:51 lr: 0.001959 loss: 3.919387 (4.011811) time: 1.040986 data: 0.000179 max mem: 18817 Epoch: [27/300] [ 750/1251] eta: 0:08:02 lr: 0.001959 loss: 4.022046 (4.018127) time: 0.977657 data: 0.000181 max mem: 18817 Epoch: [27/300] [ 800/1251] eta: 0:07:14 lr: 0.001959 loss: 4.107199 (4.019182) time: 0.921953 data: 0.000159 max mem: 18817 Epoch: [27/300] [ 850/1251] eta: 0:06:26 lr: 0.001959 loss: 3.936688 (4.011079) time: 0.934594 data: 0.000182 max mem: 18817 Epoch: [27/300] [ 900/1251] eta: 0:05:37 lr: 0.001959 loss: 4.054533 (4.006098) time: 0.959987 data: 0.000167 max mem: 18817 Epoch: [27/300] [ 950/1251] eta: 0:04:49 lr: 0.001958 loss: 4.170550 (4.019786) time: 1.023496 data: 0.000188 max mem: 18817 Epoch: [27/300] [1000/1251] eta: 0:04:01 lr: 0.001958 loss: 4.312130 (4.020354) time: 0.938321 data: 0.000174 max mem: 18817 Epoch: [27/300] [1050/1251] eta: 0:03:12 lr: 0.001958 loss: 4.094161 (4.016866) time: 0.914339 data: 0.000186 max mem: 18817 Epoch: [27/300] [1100/1251] eta: 0:02:24 lr: 0.001958 loss: 4.310195 (4.023347) time: 0.927961 data: 0.000180 max mem: 18817 Epoch: [27/300] [1150/1251] eta: 0:01:37 lr: 0.001958 loss: 3.934937 (4.022109) time: 0.987594 data: 0.000189 max mem: 18817 Epoch: [27/300] [1200/1251] eta: 0:00:48 lr: 0.001958 loss: 4.136943 (4.027149) time: 0.983139 data: 0.000173 max mem: 18817 Epoch: [27/300] [1250/1251] eta: 0:00:00 lr: 0.001958 loss: 3.785568 (4.021162) time: 0.974148 data: 0.000767 max mem: 18817 Epoch: [27/300] Total time: 0:20:00 (0.959848 s / it) Averaged stats: lr: 0.001958 loss: 3.785568 (4.029950) Test: [ 0/49] eta: 0:01:25 loss: 1.173100 (1.173100) acc1: 75.000000 (75.000000) acc5: 90.625000 (90.625000) time: 1.746153 data: 1.338343 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.191355 (1.245993) acc1: 71.875000 (72.159091) acc5: 90.625000 (91.193182) time: 0.493093 data: 0.121816 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.228914 (1.300545) acc1: 70.312500 (70.982143) acc5: 89.062500 (90.327381) time: 0.364616 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.348341 (1.301936) acc1: 70.312500 (70.362903) acc5: 89.062500 (90.473790) time: 0.362426 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.330420 (1.307657) acc1: 70.312500 (70.426829) acc5: 90.625000 (90.510671) time: 0.377480 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.340145 (1.310846) acc1: 70.312500 (70.400000) acc5: 90.625000 (90.592000) time: 0.394291 data: 0.000103 max mem: 18817 Test: Total time: 0:00:20 (0.408430 s / it) * Acc@1 70.092 Acc@5 90.166 loss 1.328 Max accuracy: 70.09% Epoch: [28/300] [ 0/1251] eta: 0:42:07 lr: 0.001958 loss: 4.021392 (4.021392) time: 2.020488 data: 1.098433 max mem: 18817 Epoch: [28/300] [ 50/1251] eta: 0:19:41 lr: 0.001958 loss: 4.184695 (3.934420) time: 0.912462 data: 0.000200 max mem: 18817 Epoch: [28/300] [ 100/1251] eta: 0:18:37 lr: 0.001957 loss: 4.441556 (4.024624) time: 0.975668 data: 0.000163 max mem: 18817 Epoch: [28/300] [ 150/1251] eta: 0:17:48 lr: 0.001957 loss: 4.168881 (4.063158) time: 1.039750 data: 0.000185 max mem: 18817 Epoch: [28/300] [ 200/1251] eta: 0:16:54 lr: 0.001957 loss: 4.084450 (4.089536) time: 0.983470 data: 0.000169 max mem: 18817 Epoch: [28/300] [ 250/1251] eta: 0:16:01 lr: 0.001957 loss: 3.841142 (4.082287) time: 0.910418 data: 0.000176 max mem: 18817 Epoch: [28/300] [ 300/1251] eta: 0:15:13 lr: 0.001957 loss: 4.045019 (4.052672) time: 0.920324 data: 0.000170 max mem: 18817 Epoch: [28/300] [ 350/1251] eta: 0:14:24 lr: 0.001957 loss: 3.829807 (4.024768) time: 0.969492 data: 0.000172 max mem: 18817 Epoch: [28/300] [ 400/1251] eta: 0:13:38 lr: 0.001957 loss: 4.171292 (4.012748) time: 1.036514 data: 0.000191 max mem: 18817 Epoch: [28/300] [ 450/1251] eta: 0:12:49 lr: 0.001957 loss: 4.151612 (4.014858) time: 0.977192 data: 0.000192 max mem: 18817 Epoch: [28/300] [ 500/1251] eta: 0:11:59 lr: 0.001956 loss: 4.281248 (4.016116) time: 0.924307 data: 0.000167 max mem: 18817 Epoch: [28/300] [ 550/1251] eta: 0:11:11 lr: 0.001956 loss: 4.321551 (4.015406) time: 0.914781 data: 0.000168 max mem: 18817 Epoch: [28/300] [ 600/1251] eta: 0:10:23 lr: 0.001956 loss: 4.286833 (4.029440) time: 0.970015 data: 0.000200 max mem: 18817 Epoch: [28/300] [ 650/1251] eta: 0:09:36 lr: 0.001956 loss: 3.698368 (4.017346) time: 1.054215 data: 0.000164 max mem: 18817 Epoch: [28/300] [ 700/1251] eta: 0:08:48 lr: 0.001956 loss: 3.812155 (4.010114) time: 0.976560 data: 0.000161 max mem: 18817 Epoch: [28/300] [ 750/1251] eta: 0:07:59 lr: 0.001956 loss: 3.546754 (4.001129) time: 0.911463 data: 0.000179 max mem: 18817 Epoch: [28/300] [ 800/1251] eta: 0:07:11 lr: 0.001956 loss: 3.988416 (4.005830) time: 0.925845 data: 0.000173 max mem: 18817 Epoch: [28/300] [ 850/1251] eta: 0:06:24 lr: 0.001956 loss: 3.883066 (3.999734) time: 1.016819 data: 0.000168 max mem: 18817 Epoch: [28/300] [ 900/1251] eta: 0:05:36 lr: 0.001955 loss: 4.027386 (3.994464) time: 1.022057 data: 0.000182 max mem: 18817 Epoch: [28/300] [ 950/1251] eta: 0:04:48 lr: 0.001955 loss: 3.573427 (3.990449) time: 0.982338 data: 0.000176 max mem: 18817 Epoch: [28/300] [1000/1251] eta: 0:04:00 lr: 0.001955 loss: 4.029836 (3.987486) time: 0.911158 data: 0.000170 max mem: 18817 Epoch: [28/300] [1050/1251] eta: 0:03:12 lr: 0.001955 loss: 4.108458 (3.987662) time: 0.922652 data: 0.000193 max mem: 18817 Epoch: [28/300] [1100/1251] eta: 0:02:24 lr: 0.001955 loss: 4.103076 (3.984168) time: 0.985777 data: 0.000171 max mem: 18817 Epoch: [28/300] [1150/1251] eta: 0:01:36 lr: 0.001955 loss: 4.139897 (3.988298) time: 1.028502 data: 0.000192 max mem: 18817 Epoch: [28/300] [1200/1251] eta: 0:00:48 lr: 0.001955 loss: 3.970174 (3.985729) time: 0.975975 data: 0.000161 max mem: 18817 Epoch: [28/300] [1250/1251] eta: 0:00:00 lr: 0.001955 loss: 4.272001 (3.992591) time: 0.915000 data: 0.000748 max mem: 18817 Epoch: [28/300] Total time: 0:19:59 (0.958512 s / it) Averaged stats: lr: 0.001955 loss: 4.272001 (3.992507) Test: [ 0/49] eta: 0:01:27 loss: 1.162028 (1.162028) acc1: 75.000000 (75.000000) acc5: 92.187500 (92.187500) time: 1.788893 data: 1.400040 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.259370 (1.265364) acc1: 73.437500 (71.732955) acc5: 90.625000 (90.625000) time: 0.502311 data: 0.127421 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 1.338989 (1.309918) acc1: 68.750000 (70.907738) acc5: 89.062500 (90.029762) time: 0.462011 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.375517 (1.305970) acc1: 70.312500 (71.068548) acc5: 89.062500 (90.322581) time: 0.456910 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 1.332952 (1.311891) acc1: 70.312500 (71.189024) acc5: 89.062500 (90.243902) time: 0.360685 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.395851 (1.316778) acc1: 68.750000 (71.072000) acc5: 89.062500 (90.400000) time: 0.355341 data: 0.000101 max mem: 18817 Test: Total time: 0:00:21 (0.433331 s / it) * Acc@1 70.594 Acc@5 90.472 loss 1.339 Max accuracy: 70.59% Epoch: [29/300] [ 0/1251] eta: 0:40:06 lr: 0.001955 loss: 4.227406 (4.227406) time: 1.923850 data: 1.045802 max mem: 18817 Epoch: [29/300] [ 50/1251] eta: 0:19:41 lr: 0.001955 loss: 4.001124 (3.899841) time: 0.976538 data: 0.000164 max mem: 18817 Epoch: [29/300] [ 100/1251] eta: 0:18:31 lr: 0.001954 loss: 4.113604 (3.915277) time: 0.972223 data: 0.000167 max mem: 18817 Epoch: [29/300] [ 150/1251] eta: 0:17:45 lr: 0.001954 loss: 3.873775 (3.918886) time: 0.973464 data: 0.000170 max mem: 18817 Epoch: [29/300] [ 200/1251] eta: 0:16:51 lr: 0.001954 loss: 3.850152 (3.923271) time: 0.925535 data: 0.000175 max mem: 18817 Epoch: [29/300] [ 250/1251] eta: 0:16:05 lr: 0.001954 loss: 4.209341 (3.961067) time: 0.935885 data: 0.000179 max mem: 18817 Epoch: [29/300] [ 300/1251] eta: 0:15:15 lr: 0.001954 loss: 3.956270 (3.953991) time: 0.967219 data: 0.000170 max mem: 18817 Epoch: [29/300] [ 350/1251] eta: 0:14:24 lr: 0.001954 loss: 4.077573 (3.961739) time: 0.967177 data: 0.000159 max mem: 18817 Epoch: [29/300] [ 400/1251] eta: 0:13:36 lr: 0.001954 loss: 3.882597 (3.969062) time: 0.944170 data: 0.000174 max mem: 18817 Epoch: [29/300] [ 450/1251] eta: 0:12:47 lr: 0.001954 loss: 4.005337 (3.963956) time: 0.916333 data: 0.000177 max mem: 18817 Epoch: [29/300] [ 500/1251] eta: 0:12:00 lr: 0.001953 loss: 3.986173 (3.971670) time: 0.930614 data: 0.000171 max mem: 18817 Epoch: [29/300] [ 550/1251] eta: 0:11:13 lr: 0.001953 loss: 4.031562 (3.963013) time: 0.972573 data: 0.000193 max mem: 18817 Epoch: [29/300] [ 600/1251] eta: 0:10:23 lr: 0.001953 loss: 4.147346 (3.962925) time: 0.970361 data: 0.000166 max mem: 18817 Epoch: [29/300] [ 650/1251] eta: 0:09:36 lr: 0.001953 loss: 4.192164 (3.968836) time: 0.949580 data: 0.000165 max mem: 18817 Epoch: [29/300] [ 700/1251] eta: 0:08:48 lr: 0.001953 loss: 4.045664 (3.973917) time: 0.936866 data: 0.000174 max mem: 18817 Epoch: [29/300] [ 750/1251] eta: 0:08:00 lr: 0.001953 loss: 3.954245 (3.973336) time: 0.941075 data: 0.000172 max mem: 18817 Epoch: [29/300] [ 800/1251] eta: 0:07:12 lr: 0.001953 loss: 4.107706 (3.971076) time: 0.971863 data: 0.000165 max mem: 18817 Epoch: [29/300] [ 850/1251] eta: 0:06:24 lr: 0.001952 loss: 3.946769 (3.968645) time: 0.953934 data: 0.000170 max mem: 18817 Epoch: [29/300] [ 900/1251] eta: 0:05:36 lr: 0.001952 loss: 4.216878 (3.975436) time: 0.943651 data: 0.000163 max mem: 18817 Epoch: [29/300] [ 950/1251] eta: 0:04:48 lr: 0.001952 loss: 4.120205 (3.968333) time: 0.924157 data: 0.000169 max mem: 18817 Epoch: [29/300] [1000/1251] eta: 0:04:00 lr: 0.001952 loss: 4.131460 (3.970779) time: 0.950343 data: 0.000158 max mem: 18817 Epoch: [29/300] [1050/1251] eta: 0:03:12 lr: 0.001952 loss: 4.177037 (3.976484) time: 0.973047 data: 0.000171 max mem: 18817 Epoch: [29/300] [1100/1251] eta: 0:02:24 lr: 0.001952 loss: 3.634819 (3.972797) time: 0.977612 data: 0.000167 max mem: 18817 Epoch: [29/300] [1150/1251] eta: 0:01:37 lr: 0.001952 loss: 4.005366 (3.976246) time: 0.992261 data: 0.000215 max mem: 18817 Epoch: [29/300] [1200/1251] eta: 0:00:48 lr: 0.001952 loss: 3.863735 (3.972901) time: 0.913947 data: 0.000174 max mem: 18817 Epoch: [29/300] [1250/1251] eta: 0:00:00 lr: 0.001951 loss: 3.802048 (3.968575) time: 0.924412 data: 0.000751 max mem: 18817 Epoch: [29/300] Total time: 0:20:02 (0.960995 s / it) Averaged stats: lr: 0.001951 loss: 3.802048 (3.971395) Test: [ 0/49] eta: 0:01:27 loss: 1.115670 (1.115670) acc1: 76.562500 (76.562500) acc5: 93.750000 (93.750000) time: 1.791431 data: 1.338408 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.140810 (1.144886) acc1: 71.875000 (73.011364) acc5: 92.187500 (91.619318) time: 0.498208 data: 0.121809 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.250467 (1.215868) acc1: 71.875000 (72.321429) acc5: 90.625000 (90.550595) time: 0.366577 data: 0.000154 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.232629 (1.212405) acc1: 68.750000 (72.076613) acc5: 90.625000 (90.776210) time: 0.364057 data: 0.000155 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.215246 (1.209924) acc1: 71.875000 (72.065549) acc5: 92.187500 (90.967988) time: 0.361327 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.215246 (1.216692) acc1: 71.875000 (72.032000) acc5: 92.187500 (90.944000) time: 0.355848 data: 0.000107 max mem: 18817 Test: Total time: 0:00:19 (0.394322 s / it) * Acc@1 71.064 Acc@5 90.680 loss 1.243 Max accuracy: 71.06% Epoch: [30/300] [ 0/1251] eta: 0:42:01 lr: 0.001951 loss: 4.658386 (4.658386) time: 2.015964 data: 1.073048 max mem: 18817 Epoch: [30/300] [ 50/1251] eta: 0:19:21 lr: 0.001951 loss: 3.998498 (3.982202) time: 0.978378 data: 0.000162 max mem: 18817 Epoch: [30/300] [ 100/1251] eta: 0:18:23 lr: 0.001951 loss: 3.921397 (3.910660) time: 0.914297 data: 0.000174 max mem: 18817 Epoch: [30/300] [ 150/1251] eta: 0:17:40 lr: 0.001951 loss: 3.597261 (3.953875) time: 0.941719 data: 0.000161 max mem: 18817 Epoch: [30/300] [ 200/1251] eta: 0:16:53 lr: 0.001951 loss: 4.161588 (3.985721) time: 0.988899 data: 0.000174 max mem: 18817 Epoch: [30/300] [ 250/1251] eta: 0:16:03 lr: 0.001951 loss: 3.847404 (3.952142) time: 1.000885 data: 0.000171 max mem: 18817 Epoch: [30/300] [ 300/1251] eta: 0:15:14 lr: 0.001951 loss: 3.980056 (3.941756) time: 0.970343 data: 0.000174 max mem: 18817 Epoch: [30/300] [ 350/1251] eta: 0:14:24 lr: 0.001951 loss: 4.126136 (3.954724) time: 0.910853 data: 0.000170 max mem: 18817 Epoch: [30/300] [ 400/1251] eta: 0:13:37 lr: 0.001950 loss: 4.125407 (3.960913) time: 0.929069 data: 0.000180 max mem: 18817 Epoch: [30/300] [ 450/1251] eta: 0:12:49 lr: 0.001950 loss: 4.041536 (3.947259) time: 0.962465 data: 0.000172 max mem: 18817 Epoch: [30/300] [ 500/1251] eta: 0:12:00 lr: 0.001950 loss: 3.670858 (3.942977) time: 0.985525 data: 0.000179 max mem: 18817 Epoch: [30/300] [ 550/1251] eta: 0:11:12 lr: 0.001950 loss: 3.812243 (3.933684) time: 0.961930 data: 0.000160 max mem: 18817 Epoch: [30/300] [ 600/1251] eta: 0:10:23 lr: 0.001950 loss: 4.297384 (3.947773) time: 0.914434 data: 0.000169 max mem: 18817 Epoch: [30/300] [ 650/1251] eta: 0:09:35 lr: 0.001950 loss: 4.047704 (3.950116) time: 0.934133 data: 0.000174 max mem: 18817 Epoch: [30/300] [ 700/1251] eta: 0:08:48 lr: 0.001950 loss: 4.056821 (3.953346) time: 0.976206 data: 0.000183 max mem: 18817 Epoch: [30/300] [ 750/1251] eta: 0:07:59 lr: 0.001950 loss: 4.128751 (3.964513) time: 0.972924 data: 0.000167 max mem: 18817 Epoch: [30/300] [ 800/1251] eta: 0:07:12 lr: 0.001949 loss: 3.819807 (3.957444) time: 0.948300 data: 0.000169 max mem: 18817 Epoch: [30/300] [ 850/1251] eta: 0:06:23 lr: 0.001949 loss: 3.940275 (3.958963) time: 0.928463 data: 0.000174 max mem: 18817 Epoch: [30/300] [ 900/1251] eta: 0:05:36 lr: 0.001949 loss: 4.127394 (3.961935) time: 0.942513 data: 0.000182 max mem: 18817 Epoch: [30/300] [ 950/1251] eta: 0:04:48 lr: 0.001949 loss: 4.155458 (3.959866) time: 0.981098 data: 0.000168 max mem: 18817 Epoch: [30/300] [1000/1251] eta: 0:04:00 lr: 0.001949 loss: 4.364303 (3.962950) time: 0.982398 data: 0.000170 max mem: 18817 Epoch: [30/300] [1050/1251] eta: 0:03:12 lr: 0.001949 loss: 4.098411 (3.958954) time: 0.968953 data: 0.000171 max mem: 18817 Epoch: [30/300] [1100/1251] eta: 0:02:24 lr: 0.001949 loss: 4.056648 (3.958022) time: 0.930007 data: 0.000180 max mem: 18817 Epoch: [30/300] [1150/1251] eta: 0:01:36 lr: 0.001948 loss: 4.222579 (3.956871) time: 0.928817 data: 0.000206 max mem: 18817 Epoch: [30/300] [1200/1251] eta: 0:00:48 lr: 0.001948 loss: 4.243062 (3.960459) time: 0.990935 data: 0.000201 max mem: 18817 Epoch: [30/300] [1250/1251] eta: 0:00:00 lr: 0.001948 loss: 3.911145 (3.959275) time: 1.002239 data: 0.000763 max mem: 18817 Epoch: [30/300] Total time: 0:20:00 (0.959855 s / it) Averaged stats: lr: 0.001948 loss: 3.911145 (3.957638) Test: [ 0/49] eta: 0:01:26 loss: 1.096471 (1.096471) acc1: 76.562500 (76.562500) acc5: 89.062500 (89.062500) time: 1.769888 data: 1.340535 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 1.153862 (1.177841) acc1: 73.437500 (73.863636) acc5: 89.062500 (90.056818) time: 0.521979 data: 0.122004 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 1.194946 (1.243923) acc1: 70.312500 (71.800595) acc5: 90.625000 (90.178571) time: 0.400216 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.271660 (1.238577) acc1: 70.312500 (72.026210) acc5: 90.625000 (90.574597) time: 0.382691 data: 0.000141 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.212947 (1.250976) acc1: 71.875000 (71.189024) acc5: 90.625000 (90.625000) time: 0.361147 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.205331 (1.249830) acc1: 68.750000 (71.168000) acc5: 90.625000 (90.720000) time: 0.371470 data: 0.000101 max mem: 18817 Test: Total time: 0:00:20 (0.413637 s / it) * Acc@1 71.154 Acc@5 90.812 loss 1.260 Max accuracy: 71.15% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0030.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0030.pth Epoch: [31/300] [ 0/1251] eta: 0:43:48 lr: 0.001948 loss: 2.950869 (2.950869) time: 2.101356 data: 1.197284 max mem: 18817 Epoch: [31/300] [ 50/1251] eta: 0:19:23 lr: 0.001948 loss: 3.464817 (3.789825) time: 0.919624 data: 0.000163 max mem: 18817 Epoch: [31/300] [ 100/1251] eta: 0:18:31 lr: 0.001948 loss: 4.191515 (3.842627) time: 0.923200 data: 0.000164 max mem: 18817 Epoch: [31/300] [ 150/1251] eta: 0:17:43 lr: 0.001948 loss: 3.903212 (3.896466) time: 0.990186 data: 0.000153 max mem: 18817 Epoch: [31/300] [ 200/1251] eta: 0:16:54 lr: 0.001948 loss: 4.066929 (3.931001) time: 0.965970 data: 0.000163 max mem: 18817 Epoch: [31/300] [ 250/1251] eta: 0:16:02 lr: 0.001948 loss: 4.091984 (3.922578) time: 0.972731 data: 0.000151 max mem: 18817 Epoch: [31/300] [ 300/1251] eta: 0:15:10 lr: 0.001947 loss: 4.118518 (3.946792) time: 0.913211 data: 0.000150 max mem: 18817 Epoch: [31/300] [ 350/1251] eta: 0:14:24 lr: 0.001947 loss: 4.037554 (3.959105) time: 0.927807 data: 0.000163 max mem: 18817 Epoch: [31/300] [ 400/1251] eta: 0:13:37 lr: 0.001947 loss: 4.159690 (3.952677) time: 0.970651 data: 0.000170 max mem: 18817 Epoch: [31/300] [ 450/1251] eta: 0:12:49 lr: 0.001947 loss: 4.207959 (3.955846) time: 0.980162 data: 0.000164 max mem: 18817 Epoch: [31/300] [ 500/1251] eta: 0:12:01 lr: 0.001947 loss: 3.761422 (3.950140) time: 0.997503 data: 0.000178 max mem: 18817 Epoch: [31/300] [ 550/1251] eta: 0:11:13 lr: 0.001947 loss: 3.791359 (3.941568) time: 0.928621 data: 0.000158 max mem: 18817 Epoch: [31/300] [ 600/1251] eta: 0:10:25 lr: 0.001947 loss: 4.002613 (3.940105) time: 0.920223 data: 0.000154 max mem: 18817 Epoch: [31/300] [ 650/1251] eta: 0:09:37 lr: 0.001946 loss: 3.967424 (3.949160) time: 0.950178 data: 0.000156 max mem: 18817 Epoch: [31/300] [ 700/1251] eta: 0:08:49 lr: 0.001946 loss: 4.146384 (3.958030) time: 0.975481 data: 0.000160 max mem: 18817 Epoch: [31/300] [ 750/1251] eta: 0:08:01 lr: 0.001946 loss: 4.264444 (3.966874) time: 0.982935 data: 0.000160 max mem: 18817 Epoch: [31/300] [ 800/1251] eta: 0:07:12 lr: 0.001946 loss: 4.091638 (3.962067) time: 0.919839 data: 0.000164 max mem: 18817 Epoch: [31/300] [ 850/1251] eta: 0:06:24 lr: 0.001946 loss: 4.022561 (3.967100) time: 0.925156 data: 0.000165 max mem: 18817 Epoch: [31/300] [ 900/1251] eta: 0:05:37 lr: 0.001946 loss: 4.179676 (3.967554) time: 0.973866 data: 0.000175 max mem: 18817 Epoch: [31/300] [ 950/1251] eta: 0:04:49 lr: 0.001946 loss: 3.947577 (3.968592) time: 0.975536 data: 0.000172 max mem: 18817 Epoch: [31/300] [1000/1251] eta: 0:04:01 lr: 0.001946 loss: 3.790438 (3.963776) time: 0.972713 data: 0.000158 max mem: 18817 Epoch: [31/300] [1050/1251] eta: 0:03:12 lr: 0.001945 loss: 4.107858 (3.960625) time: 0.927617 data: 0.000171 max mem: 18817 Epoch: [31/300] [1100/1251] eta: 0:02:24 lr: 0.001945 loss: 3.881515 (3.956709) time: 0.914631 data: 0.000172 max mem: 18817 Epoch: [31/300] [1150/1251] eta: 0:01:37 lr: 0.001945 loss: 3.966181 (3.953644) time: 0.964097 data: 0.000175 max mem: 18817 Epoch: [31/300] [1200/1251] eta: 0:00:48 lr: 0.001945 loss: 4.056510 (3.955252) time: 0.980334 data: 0.000189 max mem: 18817 Epoch: [31/300] [1250/1251] eta: 0:00:00 lr: 0.001945 loss: 3.656078 (3.956232) time: 0.963651 data: 0.000745 max mem: 18817 Epoch: [31/300] Total time: 0:20:01 (0.960570 s / it) Averaged stats: lr: 0.001945 loss: 3.656078 (3.948359) Test: [ 0/49] eta: 0:01:16 loss: 0.941325 (0.941325) acc1: 81.250000 (81.250000) acc5: 90.625000 (90.625000) time: 1.556715 data: 1.121646 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.082010 (1.112343) acc1: 75.000000 (75.142045) acc5: 92.187500 (91.761364) time: 0.477883 data: 0.102132 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.175388 (1.202640) acc1: 70.312500 (72.321429) acc5: 92.187500 (91.071429) time: 0.376992 data: 0.000158 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.239017 (1.206895) acc1: 71.875000 (72.429435) acc5: 90.625000 (91.129032) time: 0.382712 data: 0.000139 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.234993 (1.207839) acc1: 71.875000 (72.560976) acc5: 90.625000 (91.006098) time: 0.378013 data: 0.000143 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.248548 (1.211513) acc1: 73.437500 (72.896000) acc5: 90.625000 (90.944000) time: 0.363741 data: 0.000117 max mem: 18817 Test: Total time: 0:00:19 (0.399226 s / it) * Acc@1 71.834 Acc@5 90.982 loss 1.224 Max accuracy: 71.83% Epoch: [32/300] [ 0/1251] eta: 0:42:52 lr: 0.001945 loss: 3.536115 (3.536115) time: 2.056492 data: 1.149349 max mem: 18817 Epoch: [32/300] [ 50/1251] eta: 0:19:51 lr: 0.001945 loss: 3.972738 (3.893415) time: 0.950557 data: 0.000169 max mem: 18817 Epoch: [32/300] [ 100/1251] eta: 0:18:44 lr: 0.001945 loss: 3.939321 (3.873341) time: 0.982467 data: 0.000169 max mem: 18817 Epoch: [32/300] [ 150/1251] eta: 0:17:44 lr: 0.001944 loss: 3.973062 (3.903120) time: 0.968373 data: 0.000159 max mem: 18817 Epoch: [32/300] [ 200/1251] eta: 0:16:59 lr: 0.001944 loss: 4.089046 (3.908137) time: 0.966672 data: 0.000166 max mem: 18817 Epoch: [32/300] [ 250/1251] eta: 0:16:05 lr: 0.001944 loss: 4.140251 (3.919171) time: 0.915738 data: 0.000153 max mem: 18817 Epoch: [32/300] [ 300/1251] eta: 0:15:17 lr: 0.001944 loss: 3.868344 (3.934738) time: 0.939024 data: 0.000171 max mem: 18817 Epoch: [32/300] [ 350/1251] eta: 0:14:29 lr: 0.001944 loss: 4.165802 (3.932142) time: 0.997408 data: 0.000171 max mem: 18817 Epoch: [32/300] [ 400/1251] eta: 0:13:40 lr: 0.001944 loss: 3.799359 (3.921960) time: 0.991990 data: 0.000169 max mem: 18817 Epoch: [32/300] [ 450/1251] eta: 0:12:52 lr: 0.001944 loss: 4.038445 (3.935136) time: 0.971640 data: 0.000217 max mem: 18817 Epoch: [32/300] [ 500/1251] eta: 0:12:02 lr: 0.001943 loss: 4.181055 (3.949990) time: 0.935970 data: 0.000156 max mem: 18817 Epoch: [32/300] [ 550/1251] eta: 0:11:14 lr: 0.001943 loss: 4.164164 (3.937516) time: 0.927948 data: 0.000216 max mem: 18817 Epoch: [32/300] [ 600/1251] eta: 0:10:26 lr: 0.001943 loss: 3.950419 (3.927515) time: 0.998442 data: 0.000171 max mem: 18817 Epoch: [32/300] [ 650/1251] eta: 0:09:39 lr: 0.001943 loss: 3.786463 (3.930874) time: 1.034911 data: 0.000173 max mem: 18817 Epoch: [32/300] [ 700/1251] eta: 0:08:51 lr: 0.001943 loss: 4.226653 (3.930086) time: 0.984881 data: 0.000164 max mem: 18817 Epoch: [32/300] [ 750/1251] eta: 0:08:02 lr: 0.001943 loss: 3.666611 (3.921260) time: 0.935465 data: 0.000175 max mem: 18817 Epoch: [32/300] [ 800/1251] eta: 0:07:14 lr: 0.001943 loss: 3.581807 (3.912293) time: 0.939026 data: 0.000180 max mem: 18817 Epoch: [32/300] [ 850/1251] eta: 0:06:26 lr: 0.001943 loss: 4.148972 (3.912678) time: 0.989393 data: 0.000175 max mem: 18817 Epoch: [32/300] [ 900/1251] eta: 0:05:38 lr: 0.001942 loss: 3.713969 (3.909706) time: 1.020790 data: 0.000164 max mem: 18817 Epoch: [32/300] [ 950/1251] eta: 0:04:49 lr: 0.001942 loss: 4.053590 (3.909869) time: 0.966718 data: 0.000182 max mem: 18817 Epoch: [32/300] [1000/1251] eta: 0:04:01 lr: 0.001942 loss: 4.094183 (3.914018) time: 0.917856 data: 0.000154 max mem: 18817 Epoch: [32/300] [1050/1251] eta: 0:03:13 lr: 0.001942 loss: 4.080535 (3.915928) time: 0.932841 data: 0.000161 max mem: 18817 Epoch: [32/300] [1100/1251] eta: 0:02:25 lr: 0.001942 loss: 4.185533 (3.922637) time: 0.969701 data: 0.000157 max mem: 18817 Epoch: [32/300] [1150/1251] eta: 0:01:37 lr: 0.001942 loss: 3.940174 (3.928560) time: 0.996744 data: 0.000160 max mem: 18817 Epoch: [32/300] [1200/1251] eta: 0:00:49 lr: 0.001942 loss: 3.762944 (3.926130) time: 0.975663 data: 0.000154 max mem: 18817 Epoch: [32/300] [1250/1251] eta: 0:00:00 lr: 0.001941 loss: 4.106666 (3.924783) time: 0.919401 data: 0.000730 max mem: 18817 Epoch: [32/300] Total time: 0:20:03 (0.961755 s / it) Averaged stats: lr: 0.001941 loss: 4.106666 (3.928122) Test: [ 0/49] eta: 0:01:15 loss: 0.958538 (0.958538) acc1: 79.687500 (79.687500) acc5: 92.187500 (92.187500) time: 1.531729 data: 1.100598 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.074371 (1.110136) acc1: 70.312500 (72.869318) acc5: 92.187500 (92.471591) time: 0.475761 data: 0.100203 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.175457 (1.176881) acc1: 70.312500 (71.949405) acc5: 90.625000 (91.666667) time: 0.388167 data: 0.000147 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 1.200694 (1.170721) acc1: 71.875000 (72.379032) acc5: 90.625000 (91.683468) time: 0.477656 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 1.199392 (1.187220) acc1: 71.875000 (71.798780) acc5: 90.625000 (91.310976) time: 0.453522 data: 0.000137 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.219416 (1.192149) acc1: 68.750000 (71.648000) acc5: 89.062500 (91.296000) time: 0.355378 data: 0.000115 max mem: 18817 Test: Total time: 0:00:21 (0.435479 s / it) * Acc@1 72.154 Acc@5 91.302 loss 1.198 Max accuracy: 72.15% Epoch: [33/300] [ 0/1251] eta: 0:43:00 lr: 0.001941 loss: 4.153521 (4.153521) time: 2.062684 data: 1.087264 max mem: 18817 Epoch: [33/300] [ 50/1251] eta: 0:19:44 lr: 0.001941 loss: 4.038969 (3.913923) time: 0.969602 data: 0.000191 max mem: 18817 Epoch: [33/300] [ 100/1251] eta: 0:18:25 lr: 0.001941 loss: 4.037984 (3.932245) time: 0.959286 data: 0.000172 max mem: 18817 Epoch: [33/300] [ 150/1251] eta: 0:17:33 lr: 0.001941 loss: 3.687471 (3.904558) time: 0.929274 data: 0.000165 max mem: 18817 Epoch: [33/300] [ 200/1251] eta: 0:16:47 lr: 0.001941 loss: 3.879283 (3.886585) time: 0.932153 data: 0.000179 max mem: 18817 Epoch: [33/300] [ 250/1251] eta: 0:15:59 lr: 0.001941 loss: 4.089593 (3.881625) time: 0.962836 data: 0.000171 max mem: 18817 Epoch: [33/300] [ 300/1251] eta: 0:15:12 lr: 0.001941 loss: 4.019976 (3.905639) time: 0.985311 data: 0.000162 max mem: 18817 Epoch: [33/300] [ 350/1251] eta: 0:14:21 lr: 0.001940 loss: 3.900207 (3.887902) time: 0.973449 data: 0.000174 max mem: 18817 Epoch: [33/300] [ 400/1251] eta: 0:13:34 lr: 0.001940 loss: 4.024417 (3.878180) time: 0.913954 data: 0.000172 max mem: 18817 Epoch: [33/300] [ 450/1251] eta: 0:12:47 lr: 0.001940 loss: 3.638963 (3.872700) time: 0.925410 data: 0.000167 max mem: 18817 Epoch: [33/300] [ 500/1251] eta: 0:12:00 lr: 0.001940 loss: 3.943303 (3.862946) time: 1.000956 data: 0.000181 max mem: 18817 Epoch: [33/300] [ 550/1251] eta: 0:11:13 lr: 0.001940 loss: 3.520923 (3.860695) time: 1.006864 data: 0.000180 max mem: 18817 Epoch: [33/300] [ 600/1251] eta: 0:10:24 lr: 0.001940 loss: 3.768480 (3.856436) time: 0.969086 data: 0.000172 max mem: 18817 Epoch: [33/300] [ 650/1251] eta: 0:09:36 lr: 0.001940 loss: 3.614493 (3.860313) time: 0.929916 data: 0.000188 max mem: 18817 Epoch: [33/300] [ 700/1251] eta: 0:08:48 lr: 0.001939 loss: 4.047756 (3.872266) time: 0.933181 data: 0.000161 max mem: 18817 Epoch: [33/300] [ 750/1251] eta: 0:08:00 lr: 0.001939 loss: 4.026495 (3.876805) time: 1.002896 data: 0.000172 max mem: 18817 Epoch: [33/300] [ 800/1251] eta: 0:07:12 lr: 0.001939 loss: 3.882609 (3.881990) time: 0.991307 data: 0.000176 max mem: 18817 Epoch: [33/300] [ 850/1251] eta: 0:06:24 lr: 0.001939 loss: 3.994349 (3.883626) time: 0.986572 data: 0.000180 max mem: 18817 Epoch: [33/300] [ 900/1251] eta: 0:05:36 lr: 0.001939 loss: 4.162898 (3.882476) time: 0.913532 data: 0.000177 max mem: 18817 Epoch: [33/300] [ 950/1251] eta: 0:04:48 lr: 0.001939 loss: 4.150637 (3.888797) time: 0.921676 data: 0.000175 max mem: 18817 Epoch: [33/300] [1000/1251] eta: 0:04:00 lr: 0.001939 loss: 3.605841 (3.885643) time: 0.979623 data: 0.000162 max mem: 18817 Epoch: [33/300] [1050/1251] eta: 0:03:12 lr: 0.001938 loss: 4.121672 (3.887685) time: 1.029253 data: 0.000187 max mem: 18817 Epoch: [33/300] [1100/1251] eta: 0:02:24 lr: 0.001938 loss: 3.632148 (3.883466) time: 0.974162 data: 0.000206 max mem: 18817 Epoch: [33/300] [1150/1251] eta: 0:01:36 lr: 0.001938 loss: 3.969274 (3.887144) time: 0.914934 data: 0.000175 max mem: 18817 Epoch: [33/300] [1200/1251] eta: 0:00:48 lr: 0.001938 loss: 3.840873 (3.888433) time: 0.932721 data: 0.000171 max mem: 18817 Epoch: [33/300] [1250/1251] eta: 0:00:00 lr: 0.001938 loss: 3.888363 (3.891707) time: 0.984444 data: 0.000791 max mem: 18817 Epoch: [33/300] Total time: 0:19:59 (0.958812 s / it) Averaged stats: lr: 0.001938 loss: 3.888363 (3.895314) Test: [ 0/49] eta: 0:01:21 loss: 1.016052 (1.016052) acc1: 78.125000 (78.125000) acc5: 95.312500 (95.312500) time: 1.663597 data: 1.186292 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.110030 (1.100567) acc1: 73.437500 (74.289773) acc5: 93.750000 (91.903409) time: 0.487541 data: 0.107982 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.176519 (1.162589) acc1: 71.875000 (72.395833) acc5: 92.187500 (91.592262) time: 0.365996 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.211740 (1.159489) acc1: 71.875000 (72.429435) acc5: 90.625000 (91.633065) time: 0.362399 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.157395 (1.166684) acc1: 73.437500 (72.408537) acc5: 90.625000 (91.615854) time: 0.360046 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.228302 (1.175173) acc1: 70.312500 (72.416000) acc5: 90.625000 (91.584000) time: 0.355000 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.390657 s / it) * Acc@1 72.158 Acc@5 91.366 loss 1.187 Max accuracy: 72.16% Epoch: [34/300] [ 0/1251] eta: 0:42:02 lr: 0.001938 loss: 3.878267 (3.878267) time: 2.016509 data: 1.114756 max mem: 18817 Epoch: [34/300] [ 50/1251] eta: 0:19:15 lr: 0.001938 loss: 3.966899 (3.762966) time: 0.957976 data: 0.000157 max mem: 18817 Epoch: [34/300] [ 100/1251] eta: 0:18:20 lr: 0.001938 loss: 4.210967 (3.787275) time: 0.920802 data: 0.000179 max mem: 18817 Epoch: [34/300] [ 150/1251] eta: 0:17:41 lr: 0.001937 loss: 3.892146 (3.811726) time: 0.929361 data: 0.000177 max mem: 18817 Epoch: [34/300] [ 200/1251] eta: 0:16:56 lr: 0.001937 loss: 4.002230 (3.817985) time: 1.000778 data: 0.000203 max mem: 18817 Epoch: [34/300] [ 250/1251] eta: 0:16:09 lr: 0.001937 loss: 3.982300 (3.823346) time: 0.999612 data: 0.000181 max mem: 18817 Epoch: [34/300] [ 300/1251] eta: 0:15:17 lr: 0.001937 loss: 4.146628 (3.845184) time: 0.964026 data: 0.000178 max mem: 18817 Epoch: [34/300] [ 350/1251] eta: 0:14:24 lr: 0.001937 loss: 4.056727 (3.855255) time: 0.908165 data: 0.000171 max mem: 18817 Epoch: [34/300] [ 400/1251] eta: 0:13:37 lr: 0.001937 loss: 3.686169 (3.847555) time: 0.917559 data: 0.000141 max mem: 18817 Epoch: [34/300] [ 450/1251] eta: 0:12:49 lr: 0.001937 loss: 3.882650 (3.837172) time: 0.989750 data: 0.000161 max mem: 18817 Epoch: [34/300] [ 500/1251] eta: 0:12:02 lr: 0.001936 loss: 4.048523 (3.841002) time: 0.992124 data: 0.000192 max mem: 18817 Epoch: [34/300] [ 550/1251] eta: 0:11:12 lr: 0.001936 loss: 4.154008 (3.855671) time: 0.947586 data: 0.000179 max mem: 18817 Epoch: [34/300] [ 600/1251] eta: 0:10:24 lr: 0.001936 loss: 3.486238 (3.855620) time: 0.932417 data: 0.000171 max mem: 18817 Epoch: [34/300] [ 650/1251] eta: 0:09:37 lr: 0.001936 loss: 4.294616 (3.861663) time: 0.925216 data: 0.000157 max mem: 18817 Epoch: [34/300] [ 700/1251] eta: 0:08:49 lr: 0.001936 loss: 3.923743 (3.867218) time: 0.997976 data: 0.000176 max mem: 18817 Epoch: [34/300] [ 750/1251] eta: 0:08:01 lr: 0.001936 loss: 4.003209 (3.873689) time: 0.971425 data: 0.000182 max mem: 18817 Epoch: [34/300] [ 800/1251] eta: 0:07:13 lr: 0.001935 loss: 3.925542 (3.875354) time: 0.984882 data: 0.000184 max mem: 18817 Epoch: [34/300] [ 850/1251] eta: 0:06:25 lr: 0.001935 loss: 3.931453 (3.875477) time: 0.924170 data: 0.000179 max mem: 18817 Epoch: [34/300] [ 900/1251] eta: 0:05:37 lr: 0.001935 loss: 4.046945 (3.877750) time: 0.936306 data: 0.000174 max mem: 18817 Epoch: [34/300] [ 950/1251] eta: 0:04:49 lr: 0.001935 loss: 3.813696 (3.877625) time: 1.002084 data: 0.000177 max mem: 18817 Epoch: [34/300] [1000/1251] eta: 0:04:01 lr: 0.001935 loss: 3.968179 (3.870303) time: 0.963703 data: 0.000163 max mem: 18817 Epoch: [34/300] [1050/1251] eta: 0:03:13 lr: 0.001935 loss: 3.496857 (3.867534) time: 0.966742 data: 0.000187 max mem: 18817 Epoch: [34/300] [1100/1251] eta: 0:02:25 lr: 0.001935 loss: 3.631866 (3.867276) time: 0.919196 data: 0.000174 max mem: 18817 Epoch: [34/300] [1150/1251] eta: 0:01:37 lr: 0.001934 loss: 3.987093 (3.868538) time: 0.924025 data: 0.000166 max mem: 18817 Epoch: [34/300] [1200/1251] eta: 0:00:49 lr: 0.001934 loss: 3.998516 (3.867579) time: 0.980697 data: 0.000166 max mem: 18817 Epoch: [34/300] [1250/1251] eta: 0:00:00 lr: 0.001934 loss: 3.941224 (3.866278) time: 0.974601 data: 0.000743 max mem: 18817 Epoch: [34/300] Total time: 0:20:02 (0.961186 s / it) Averaged stats: lr: 0.001934 loss: 3.941224 (3.872961) Test: [ 0/49] eta: 0:01:20 loss: 0.976594 (0.976594) acc1: 76.562500 (76.562500) acc5: 95.312500 (95.312500) time: 1.640024 data: 1.112493 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.103758 (1.092184) acc1: 75.000000 (75.994318) acc5: 93.750000 (91.903409) time: 0.491129 data: 0.101281 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.186595 (1.140590) acc1: 73.437500 (74.032738) acc5: 92.187500 (91.517857) time: 0.372209 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.187576 (1.151151) acc1: 71.875000 (73.336694) acc5: 92.187500 (91.935484) time: 0.382393 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.187503 (1.153561) acc1: 71.875000 (73.399390) acc5: 92.187500 (91.996951) time: 0.379671 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.191108 (1.166697) acc1: 71.875000 (72.896000) acc5: 92.187500 (91.872000) time: 0.365402 data: 0.000111 max mem: 18817 Test: Total time: 0:00:19 (0.400977 s / it) * Acc@1 72.526 Acc@5 91.582 loss 1.186 Max accuracy: 72.53% Epoch: [35/300] [ 0/1251] eta: 0:41:30 lr: 0.001934 loss: 4.239990 (4.239990) time: 1.991087 data: 1.088932 max mem: 18817 Epoch: [35/300] [ 50/1251] eta: 0:19:22 lr: 0.001934 loss: 4.063448 (3.848074) time: 0.930995 data: 0.000178 max mem: 18817 Epoch: [35/300] [ 100/1251] eta: 0:18:34 lr: 0.001934 loss: 3.917799 (3.857200) time: 0.934284 data: 0.000176 max mem: 18817 Epoch: [35/300] [ 150/1251] eta: 0:17:43 lr: 0.001934 loss: 4.097044 (3.877846) time: 0.986443 data: 0.000162 max mem: 18817 Epoch: [35/300] [ 200/1251] eta: 0:16:57 lr: 0.001934 loss: 4.086575 (3.867646) time: 1.032437 data: 0.000176 max mem: 18817 Epoch: [35/300] [ 250/1251] eta: 0:16:06 lr: 0.001933 loss: 3.887904 (3.875528) time: 0.982822 data: 0.000180 max mem: 18817 Epoch: [35/300] [ 300/1251] eta: 0:15:15 lr: 0.001933 loss: 4.119198 (3.883310) time: 0.926248 data: 0.000157 max mem: 18817 Epoch: [35/300] [ 350/1251] eta: 0:14:28 lr: 0.001933 loss: 4.116596 (3.892078) time: 0.924171 data: 0.000159 max mem: 18817 Epoch: [35/300] [ 400/1251] eta: 0:13:40 lr: 0.001933 loss: 4.263127 (3.910862) time: 0.987806 data: 0.000173 max mem: 18817 Epoch: [35/300] [ 450/1251] eta: 0:12:52 lr: 0.001933 loss: 4.107887 (3.913312) time: 1.034053 data: 0.000163 max mem: 18817 Epoch: [35/300] [ 500/1251] eta: 0:12:03 lr: 0.001933 loss: 4.039252 (3.911348) time: 0.981110 data: 0.000180 max mem: 18817 Epoch: [35/300] [ 550/1251] eta: 0:11:14 lr: 0.001932 loss: 3.891313 (3.906628) time: 0.920335 data: 0.000180 max mem: 18817 Epoch: [35/300] [ 600/1251] eta: 0:10:26 lr: 0.001932 loss: 3.972653 (3.912929) time: 0.937129 data: 0.000166 max mem: 18817 Epoch: [35/300] [ 650/1251] eta: 0:09:39 lr: 0.001932 loss: 3.989106 (3.901689) time: 0.989397 data: 0.000164 max mem: 18817 Epoch: [35/300] [ 700/1251] eta: 0:08:51 lr: 0.001932 loss: 3.802481 (3.893047) time: 1.026642 data: 0.000181 max mem: 18817 Epoch: [35/300] [ 750/1251] eta: 0:08:02 lr: 0.001932 loss: 3.332835 (3.880786) time: 0.979701 data: 0.000187 max mem: 18817 Epoch: [35/300] [ 800/1251] eta: 0:07:13 lr: 0.001932 loss: 4.129034 (3.885987) time: 0.914188 data: 0.000166 max mem: 18817 Epoch: [35/300] [ 850/1251] eta: 0:06:25 lr: 0.001932 loss: 4.090084 (3.887126) time: 0.930158 data: 0.000178 max mem: 18817 Epoch: [35/300] [ 900/1251] eta: 0:05:37 lr: 0.001931 loss: 3.649332 (3.888219) time: 0.989764 data: 0.000176 max mem: 18817 Epoch: [35/300] [ 950/1251] eta: 0:04:49 lr: 0.001931 loss: 4.161153 (3.893922) time: 1.043933 data: 0.000161 max mem: 18817 Epoch: [35/300] [1000/1251] eta: 0:04:01 lr: 0.001931 loss: 4.042955 (3.894029) time: 0.958823 data: 0.000166 max mem: 18817 Epoch: [35/300] [1050/1251] eta: 0:03:13 lr: 0.001931 loss: 3.845173 (3.890813) time: 0.923701 data: 0.000166 max mem: 18817 Epoch: [35/300] [1100/1251] eta: 0:02:25 lr: 0.001931 loss: 3.915327 (3.885349) time: 0.924297 data: 0.000159 max mem: 18817 Epoch: [35/300] [1150/1251] eta: 0:01:37 lr: 0.001931 loss: 3.870389 (3.881928) time: 0.987678 data: 0.000169 max mem: 18817 Epoch: [35/300] [1200/1251] eta: 0:00:49 lr: 0.001931 loss: 3.854811 (3.880082) time: 1.037514 data: 0.000164 max mem: 18817 Epoch: [35/300] [1250/1251] eta: 0:00:00 lr: 0.001930 loss: 3.829784 (3.882807) time: 0.957352 data: 0.000751 max mem: 18817 Epoch: [35/300] Total time: 0:20:02 (0.961626 s / it) Averaged stats: lr: 0.001930 loss: 3.829784 (3.883895) Test: [ 0/49] eta: 0:01:17 loss: 0.880644 (0.880644) acc1: 79.687500 (79.687500) acc5: 93.750000 (93.750000) time: 1.590828 data: 1.157866 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.998662 (1.036835) acc1: 73.437500 (75.852273) acc5: 93.750000 (92.187500) time: 0.480885 data: 0.105413 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.157917 (1.118761) acc1: 73.437500 (73.363095) acc5: 92.187500 (91.592262) time: 0.365815 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.140530 (1.109361) acc1: 73.437500 (73.387097) acc5: 90.625000 (91.784274) time: 0.371208 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.097531 (1.116240) acc1: 73.437500 (73.246951) acc5: 92.187500 (91.577744) time: 0.395275 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.120243 (1.122432) acc1: 73.437500 (73.568000) acc5: 90.625000 (91.648000) time: 0.394145 data: 0.000107 max mem: 18817 Test: Total time: 0:00:20 (0.408170 s / it) * Acc@1 73.138 Acc@5 91.844 loss 1.137 Max accuracy: 73.14% Epoch: [36/300] [ 0/1251] eta: 0:43:06 lr: 0.001930 loss: 4.045049 (4.045049) time: 2.067856 data: 1.150424 max mem: 18817 Epoch: [36/300] [ 50/1251] eta: 0:20:09 lr: 0.001930 loss: 4.211463 (3.974243) time: 0.942570 data: 0.000189 max mem: 18817 Epoch: [36/300] [ 100/1251] eta: 0:18:57 lr: 0.001930 loss: 4.028357 (3.960411) time: 0.979321 data: 0.000154 max mem: 18817 Epoch: [36/300] [ 150/1251] eta: 0:17:52 lr: 0.001930 loss: 3.886197 (3.932749) time: 0.972194 data: 0.000177 max mem: 18817 Epoch: [36/300] [ 200/1251] eta: 0:16:54 lr: 0.001930 loss: 4.002992 (3.891765) time: 0.915450 data: 0.000164 max mem: 18817 Epoch: [36/300] [ 250/1251] eta: 0:16:06 lr: 0.001930 loss: 4.052570 (3.888125) time: 0.924356 data: 0.000173 max mem: 18817 Epoch: [36/300] [ 300/1251] eta: 0:15:17 lr: 0.001929 loss: 3.682170 (3.887628) time: 0.943089 data: 0.000180 max mem: 18817 Epoch: [36/300] [ 350/1251] eta: 0:14:30 lr: 0.001929 loss: 3.737545 (3.869719) time: 0.996042 data: 0.000164 max mem: 18817 Epoch: [36/300] [ 400/1251] eta: 0:13:39 lr: 0.001929 loss: 4.040020 (3.875934) time: 0.951036 data: 0.000192 max mem: 18817 Epoch: [36/300] [ 450/1251] eta: 0:12:49 lr: 0.001929 loss: 3.855341 (3.875266) time: 0.920151 data: 0.000171 max mem: 18817 Epoch: [36/300] [ 500/1251] eta: 0:12:03 lr: 0.001929 loss: 3.466582 (3.854722) time: 0.936442 data: 0.000167 max mem: 18817 Epoch: [36/300] [ 550/1251] eta: 0:11:14 lr: 0.001929 loss: 3.675376 (3.848555) time: 0.942558 data: 0.000168 max mem: 18817 Epoch: [36/300] [ 600/1251] eta: 0:10:27 lr: 0.001929 loss: 3.732784 (3.848448) time: 1.002752 data: 0.000169 max mem: 18817 Epoch: [36/300] [ 650/1251] eta: 0:09:38 lr: 0.001928 loss: 4.055305 (3.851150) time: 0.959034 data: 0.000176 max mem: 18817 Epoch: [36/300] [ 700/1251] eta: 0:08:49 lr: 0.001928 loss: 4.000719 (3.852145) time: 0.920363 data: 0.000182 max mem: 18817 Epoch: [36/300] [ 750/1251] eta: 0:08:02 lr: 0.001928 loss: 4.040640 (3.852549) time: 0.927596 data: 0.000163 max mem: 18817 Epoch: [36/300] [ 800/1251] eta: 0:07:14 lr: 0.001928 loss: 4.121474 (3.851217) time: 0.957651 data: 0.000192 max mem: 18817 Epoch: [36/300] [ 850/1251] eta: 0:06:26 lr: 0.001928 loss: 3.918058 (3.840254) time: 0.991637 data: 0.000182 max mem: 18817 Epoch: [36/300] [ 900/1251] eta: 0:05:37 lr: 0.001928 loss: 4.031002 (3.841372) time: 0.987019 data: 0.000179 max mem: 18817 Epoch: [36/300] [ 950/1251] eta: 0:04:49 lr: 0.001927 loss: 4.113808 (3.848797) time: 0.918018 data: 0.000166 max mem: 18817 Epoch: [36/300] [1000/1251] eta: 0:04:01 lr: 0.001927 loss: 3.907230 (3.848050) time: 0.941775 data: 0.000167 max mem: 18817 Epoch: [36/300] [1050/1251] eta: 0:03:13 lr: 0.001927 loss: 3.769744 (3.847280) time: 0.917848 data: 0.000197 max mem: 18817 Epoch: [36/300] [1100/1251] eta: 0:02:25 lr: 0.001927 loss: 4.078788 (3.848655) time: 0.985236 data: 0.000181 max mem: 18817 Epoch: [36/300] [1150/1251] eta: 0:01:37 lr: 0.001927 loss: 3.938754 (3.852377) time: 0.967733 data: 0.000195 max mem: 18817 Epoch: [36/300] [1200/1251] eta: 0:00:49 lr: 0.001927 loss: 3.915970 (3.853962) time: 0.966990 data: 0.000181 max mem: 18817 Epoch: [36/300] [1250/1251] eta: 0:00:00 lr: 0.001927 loss: 4.029055 (3.852625) time: 0.931516 data: 0.000751 max mem: 18817 Epoch: [36/300] Total time: 0:20:04 (0.962896 s / it) Averaged stats: lr: 0.001927 loss: 4.029055 (3.845808) Test: [ 0/49] eta: 0:01:20 loss: 1.024969 (1.024969) acc1: 75.000000 (75.000000) acc5: 90.625000 (90.625000) time: 1.652300 data: 1.223991 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.094473 (1.104806) acc1: 75.000000 (73.437500) acc5: 92.187500 (91.761364) time: 0.484014 data: 0.111411 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.111343 (1.129824) acc1: 73.437500 (73.065476) acc5: 90.625000 (91.889881) time: 0.364376 data: 0.000150 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.133370 (1.130117) acc1: 73.437500 (73.487903) acc5: 92.187500 (91.935484) time: 0.462679 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.157376 (1.143593) acc1: 73.437500 (73.170732) acc5: 90.625000 (91.730183) time: 0.460699 data: 0.000120 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.226179 (1.146975) acc1: 71.875000 (73.408000) acc5: 90.625000 (91.712000) time: 0.354865 data: 0.000098 max mem: 18817 Test: Total time: 0:00:21 (0.429634 s / it) * Acc@1 73.308 Acc@5 91.850 loss 1.167 Max accuracy: 73.31% Epoch: [37/300] [ 0/1251] eta: 0:40:32 lr: 0.001927 loss: 3.675940 (3.675940) time: 1.944151 data: 1.045336 max mem: 18817 Epoch: [37/300] [ 50/1251] eta: 0:19:41 lr: 0.001926 loss: 4.073298 (3.908293) time: 1.013968 data: 0.000173 max mem: 18817 Epoch: [37/300] [ 100/1251] eta: 0:18:27 lr: 0.001926 loss: 4.059568 (3.902595) time: 0.970919 data: 0.000166 max mem: 18817 Epoch: [37/300] [ 150/1251] eta: 0:17:37 lr: 0.001926 loss: 3.616238 (3.882836) time: 0.912206 data: 0.000164 max mem: 18817 Epoch: [37/300] [ 200/1251] eta: 0:16:52 lr: 0.001926 loss: 3.784813 (3.864701) time: 0.930593 data: 0.000166 max mem: 18817 Epoch: [37/300] [ 250/1251] eta: 0:16:05 lr: 0.001926 loss: 4.074975 (3.845610) time: 0.992893 data: 0.000165 max mem: 18817 Epoch: [37/300] [ 300/1251] eta: 0:15:19 lr: 0.001926 loss: 4.002438 (3.850359) time: 0.992082 data: 0.000160 max mem: 18817 Epoch: [37/300] [ 350/1251] eta: 0:14:27 lr: 0.001925 loss: 3.916524 (3.857298) time: 0.970510 data: 0.000152 max mem: 18817 Epoch: [37/300] [ 400/1251] eta: 0:13:36 lr: 0.001925 loss: 3.923024 (3.846968) time: 0.910545 data: 0.000176 max mem: 18817 Epoch: [37/300] [ 450/1251] eta: 0:12:49 lr: 0.001925 loss: 3.695488 (3.835044) time: 0.928685 data: 0.000171 max mem: 18817 Epoch: [37/300] [ 500/1251] eta: 0:12:01 lr: 0.001925 loss: 4.176273 (3.835446) time: 0.964704 data: 0.000187 max mem: 18817 Epoch: [37/300] [ 550/1251] eta: 0:11:14 lr: 0.001925 loss: 3.970578 (3.843161) time: 1.004671 data: 0.000189 max mem: 18817 Epoch: [37/300] [ 600/1251] eta: 0:10:25 lr: 0.001925 loss: 3.886484 (3.838355) time: 0.995306 data: 0.000176 max mem: 18817 Epoch: [37/300] [ 650/1251] eta: 0:09:37 lr: 0.001924 loss: 4.150120 (3.838325) time: 0.926826 data: 0.000161 max mem: 18817 Epoch: [37/300] [ 700/1251] eta: 0:08:49 lr: 0.001924 loss: 3.461134 (3.841354) time: 0.925045 data: 0.000149 max mem: 18817 Epoch: [37/300] [ 750/1251] eta: 0:08:01 lr: 0.001924 loss: 4.174268 (3.851555) time: 0.978767 data: 0.000171 max mem: 18817 Epoch: [37/300] [ 800/1251] eta: 0:07:13 lr: 0.001924 loss: 3.909994 (3.848464) time: 1.015205 data: 0.000173 max mem: 18817 Epoch: [37/300] [ 850/1251] eta: 0:06:25 lr: 0.001924 loss: 3.881676 (3.841748) time: 0.981588 data: 0.000183 max mem: 18817 Epoch: [37/300] [ 900/1251] eta: 0:05:36 lr: 0.001924 loss: 3.913779 (3.843331) time: 0.909101 data: 0.000169 max mem: 18817 Epoch: [37/300] [ 950/1251] eta: 0:04:48 lr: 0.001923 loss: 3.962573 (3.840946) time: 0.928151 data: 0.000266 max mem: 18817 Epoch: [37/300] [1000/1251] eta: 0:04:01 lr: 0.001923 loss: 3.893393 (3.842281) time: 0.997762 data: 0.000178 max mem: 18817 Epoch: [37/300] [1050/1251] eta: 0:03:13 lr: 0.001923 loss: 3.898574 (3.840024) time: 1.001175 data: 0.000178 max mem: 18817 Epoch: [37/300] [1100/1251] eta: 0:02:24 lr: 0.001923 loss: 3.800946 (3.842870) time: 0.974570 data: 0.000171 max mem: 18817 Epoch: [37/300] [1150/1251] eta: 0:01:36 lr: 0.001923 loss: 3.966080 (3.848063) time: 0.919437 data: 0.000182 max mem: 18817 Epoch: [37/300] [1200/1251] eta: 0:00:48 lr: 0.001923 loss: 4.354999 (3.850942) time: 0.926357 data: 0.000180 max mem: 18817 Epoch: [37/300] [1250/1251] eta: 0:00:00 lr: 0.001923 loss: 4.054318 (3.850967) time: 0.989595 data: 0.000749 max mem: 18817 Epoch: [37/300] Total time: 0:20:01 (0.960723 s / it) Averaged stats: lr: 0.001923 loss: 4.054318 (3.848742) Test: [ 0/49] eta: 0:01:14 loss: 1.020486 (1.020486) acc1: 75.000000 (75.000000) acc5: 93.750000 (93.750000) time: 1.529386 data: 1.112894 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.020486 (1.043715) acc1: 76.562500 (75.710227) acc5: 93.750000 (92.329545) time: 0.476208 data: 0.101336 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.042799 (1.095083) acc1: 75.000000 (73.883929) acc5: 92.187500 (92.485119) time: 0.370385 data: 0.000151 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.155633 (1.094993) acc1: 71.875000 (73.588710) acc5: 92.187500 (92.691532) time: 0.366938 data: 0.000122 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.155633 (1.109784) acc1: 71.875000 (73.361280) acc5: 92.187500 (92.339939) time: 0.361009 data: 0.000119 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.173627 (1.112598) acc1: 71.875000 (73.600000) acc5: 92.187500 (92.288000) time: 0.355605 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.389559 s / it) * Acc@1 73.106 Acc@5 91.972 loss 1.125 Max accuracy: 73.31% Epoch: [38/300] [ 0/1251] eta: 0:39:30 lr: 0.001923 loss: 4.116141 (4.116141) time: 1.895253 data: 1.011829 max mem: 18817 Epoch: [38/300] [ 50/1251] eta: 0:19:05 lr: 0.001922 loss: 3.897040 (3.755720) time: 0.947942 data: 0.000155 max mem: 18817 Epoch: [38/300] [ 100/1251] eta: 0:18:14 lr: 0.001922 loss: 3.761565 (3.783301) time: 0.935385 data: 0.000172 max mem: 18817 Epoch: [38/300] [ 150/1251] eta: 0:17:34 lr: 0.001922 loss: 3.946544 (3.810910) time: 0.928316 data: 0.000158 max mem: 18817 Epoch: [38/300] [ 200/1251] eta: 0:16:50 lr: 0.001922 loss: 3.882970 (3.834297) time: 0.999082 data: 0.000170 max mem: 18817 Epoch: [38/300] [ 250/1251] eta: 0:16:01 lr: 0.001922 loss: 4.104425 (3.870622) time: 1.003163 data: 0.000187 max mem: 18817 Epoch: [38/300] [ 300/1251] eta: 0:15:12 lr: 0.001922 loss: 3.755046 (3.864250) time: 0.991079 data: 0.000171 max mem: 18817 Epoch: [38/300] [ 350/1251] eta: 0:14:23 lr: 0.001921 loss: 3.861368 (3.870777) time: 0.931491 data: 0.000159 max mem: 18817 Epoch: [38/300] [ 400/1251] eta: 0:13:36 lr: 0.001921 loss: 3.720293 (3.870587) time: 0.931053 data: 0.000173 max mem: 18817 Epoch: [38/300] [ 450/1251] eta: 0:12:48 lr: 0.001921 loss: 3.759585 (3.862608) time: 0.980293 data: 0.000181 max mem: 18817 Epoch: [38/300] [ 500/1251] eta: 0:11:58 lr: 0.001921 loss: 3.802889 (3.873316) time: 0.952812 data: 0.000171 max mem: 18817 Epoch: [38/300] [ 550/1251] eta: 0:11:10 lr: 0.001921 loss: 3.613471 (3.865329) time: 0.956074 data: 0.000164 max mem: 18817 Epoch: [38/300] [ 600/1251] eta: 0:10:22 lr: 0.001921 loss: 4.044708 (3.859399) time: 0.936561 data: 0.000217 max mem: 18817 Epoch: [38/300] [ 650/1251] eta: 0:09:35 lr: 0.001920 loss: 3.904568 (3.859895) time: 0.936611 data: 0.000185 max mem: 18817 Epoch: [38/300] [ 700/1251] eta: 0:08:48 lr: 0.001920 loss: 4.183524 (3.866887) time: 0.987422 data: 0.000160 max mem: 18817 Epoch: [38/300] [ 750/1251] eta: 0:08:00 lr: 0.001920 loss: 4.028264 (3.865615) time: 0.987706 data: 0.000189 max mem: 18817 Epoch: [38/300] [ 800/1251] eta: 0:07:12 lr: 0.001920 loss: 4.036681 (3.860199) time: 0.980541 data: 0.000182 max mem: 18817 Epoch: [38/300] [ 850/1251] eta: 0:06:24 lr: 0.001920 loss: 3.964271 (3.860401) time: 0.934610 data: 0.000190 max mem: 18817 Epoch: [38/300] [ 900/1251] eta: 0:05:36 lr: 0.001920 loss: 4.174554 (3.867723) time: 0.922338 data: 0.000185 max mem: 18817 Epoch: [38/300] [ 950/1251] eta: 0:04:49 lr: 0.001919 loss: 4.030010 (3.868401) time: 1.000869 data: 0.000168 max mem: 18817 Epoch: [38/300] [1000/1251] eta: 0:04:01 lr: 0.001919 loss: 3.500855 (3.864508) time: 1.028808 data: 0.000175 max mem: 18817 Epoch: [38/300] [1050/1251] eta: 0:03:13 lr: 0.001919 loss: 3.925612 (3.861683) time: 0.979671 data: 0.000203 max mem: 18817 Epoch: [38/300] [1100/1251] eta: 0:02:24 lr: 0.001919 loss: 3.801207 (3.858580) time: 0.926869 data: 0.000180 max mem: 18817 Epoch: [38/300] [1150/1251] eta: 0:01:37 lr: 0.001919 loss: 3.794470 (3.857116) time: 0.932033 data: 0.000196 max mem: 18817 Epoch: [38/300] [1200/1251] eta: 0:00:49 lr: 0.001919 loss: 3.808498 (3.850922) time: 0.992425 data: 0.000182 max mem: 18817 Epoch: [38/300] [1250/1251] eta: 0:00:00 lr: 0.001918 loss: 3.872091 (3.849749) time: 1.035926 data: 0.000798 max mem: 18817 Epoch: [38/300] Total time: 0:20:03 (0.961778 s / it) Averaged stats: lr: 0.001918 loss: 3.872091 (3.846857) Test: [ 0/49] eta: 0:01:18 loss: 0.838874 (0.838874) acc1: 79.687500 (79.687500) acc5: 95.312500 (95.312500) time: 1.608624 data: 1.181250 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.943809 (0.978416) acc1: 73.437500 (74.147727) acc5: 93.750000 (93.039773) time: 0.500916 data: 0.107531 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.102253 (1.061902) acc1: 71.875000 (73.139881) acc5: 92.187500 (92.038690) time: 0.387879 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.112960 (1.066308) acc1: 73.437500 (73.336694) acc5: 92.187500 (92.187500) time: 0.373439 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.146774 (1.079141) acc1: 73.437500 (73.513720) acc5: 92.187500 (92.035061) time: 0.364723 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.185606 (1.086922) acc1: 73.437500 (73.408000) acc5: 92.187500 (92.192000) time: 0.360707 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.399647 s / it) * Acc@1 73.676 Acc@5 92.108 loss 1.082 Max accuracy: 73.68% Epoch: [39/300] [ 0/1251] eta: 0:39:52 lr: 0.001918 loss: 3.065034 (3.065034) time: 1.912666 data: 1.017453 max mem: 18817 Epoch: [39/300] [ 50/1251] eta: 0:19:20 lr: 0.001918 loss: 3.908722 (3.824045) time: 0.928142 data: 0.000176 max mem: 18817 Epoch: [39/300] [ 100/1251] eta: 0:18:33 lr: 0.001918 loss: 4.164108 (3.864471) time: 0.933711 data: 0.000176 max mem: 18817 Epoch: [39/300] [ 150/1251] eta: 0:17:42 lr: 0.001918 loss: 3.838084 (3.904654) time: 0.974052 data: 0.000177 max mem: 18817 Epoch: [39/300] [ 200/1251] eta: 0:16:54 lr: 0.001918 loss: 4.198786 (3.896187) time: 1.032768 data: 0.000176 max mem: 18817 Epoch: [39/300] [ 250/1251] eta: 0:15:59 lr: 0.001918 loss: 3.650432 (3.882363) time: 0.959350 data: 0.000186 max mem: 18817 Epoch: [39/300] [ 300/1251] eta: 0:15:10 lr: 0.001917 loss: 3.716882 (3.872926) time: 0.925313 data: 0.000162 max mem: 18817 Epoch: [39/300] [ 350/1251] eta: 0:14:25 lr: 0.001917 loss: 4.002732 (3.867075) time: 0.939853 data: 0.000161 max mem: 18817 Epoch: [39/300] [ 400/1251] eta: 0:13:38 lr: 0.001917 loss: 4.018225 (3.860960) time: 1.005938 data: 0.000181 max mem: 18817 Epoch: [39/300] [ 450/1251] eta: 0:12:50 lr: 0.001917 loss: 3.722732 (3.860870) time: 1.023068 data: 0.000189 max mem: 18817 Epoch: [39/300] [ 500/1251] eta: 0:12:00 lr: 0.001917 loss: 3.866088 (3.854231) time: 0.963963 data: 0.000174 max mem: 18817 Epoch: [39/300] [ 550/1251] eta: 0:11:11 lr: 0.001917 loss: 3.824655 (3.847812) time: 0.922506 data: 0.000177 max mem: 18817 Epoch: [39/300] [ 600/1251] eta: 0:10:24 lr: 0.001916 loss: 4.241840 (3.861671) time: 0.944169 data: 0.000183 max mem: 18817 Epoch: [39/300] [ 650/1251] eta: 0:09:37 lr: 0.001916 loss: 3.606693 (3.862592) time: 0.987546 data: 0.000166 max mem: 18817 Epoch: [39/300] [ 700/1251] eta: 0:08:49 lr: 0.001916 loss: 3.780028 (3.848212) time: 1.032379 data: 0.000166 max mem: 18817 Epoch: [39/300] [ 750/1251] eta: 0:08:01 lr: 0.001916 loss: 3.651293 (3.843447) time: 0.979574 data: 0.000180 max mem: 18817 Epoch: [39/300] [ 800/1251] eta: 0:07:12 lr: 0.001916 loss: 3.880976 (3.839325) time: 0.923960 data: 0.000183 max mem: 18817 Epoch: [39/300] [ 850/1251] eta: 0:06:24 lr: 0.001916 loss: 3.537935 (3.834021) time: 0.920739 data: 0.000179 max mem: 18817 Epoch: [39/300] [ 900/1251] eta: 0:05:37 lr: 0.001915 loss: 3.835236 (3.834830) time: 1.010032 data: 0.000188 max mem: 18817 Epoch: [39/300] [ 950/1251] eta: 0:04:49 lr: 0.001915 loss: 4.049170 (3.832844) time: 1.047061 data: 0.000185 max mem: 18817 Epoch: [39/300] [1000/1251] eta: 0:04:01 lr: 0.001915 loss: 3.884620 (3.829912) time: 0.980874 data: 0.000176 max mem: 18817 Epoch: [39/300] [1050/1251] eta: 0:03:13 lr: 0.001915 loss: 3.752572 (3.833190) time: 0.930138 data: 0.000172 max mem: 18817 Epoch: [39/300] [1100/1251] eta: 0:02:25 lr: 0.001915 loss: 3.877872 (3.835142) time: 0.925074 data: 0.000190 max mem: 18817 Epoch: [39/300] [1150/1251] eta: 0:01:37 lr: 0.001915 loss: 3.786807 (3.838165) time: 0.977221 data: 0.000176 max mem: 18817 Epoch: [39/300] [1200/1251] eta: 0:00:49 lr: 0.001914 loss: 4.087454 (3.839918) time: 1.032983 data: 0.000169 max mem: 18817 Epoch: [39/300] [1250/1251] eta: 0:00:00 lr: 0.001914 loss: 3.861195 (3.830342) time: 0.960112 data: 0.000755 max mem: 18817 Epoch: [39/300] Total time: 0:20:01 (0.960657 s / it) Averaged stats: lr: 0.001914 loss: 3.861195 (3.827291) Test: [ 0/49] eta: 0:01:26 loss: 0.895037 (0.895037) acc1: 81.250000 (81.250000) acc5: 93.750000 (93.750000) time: 1.765344 data: 1.368608 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.000863 (1.021613) acc1: 76.562500 (76.420455) acc5: 93.750000 (92.755682) time: 0.497454 data: 0.124577 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.011724 (1.074662) acc1: 75.000000 (74.553571) acc5: 92.187500 (92.410714) time: 0.367926 data: 0.000166 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.072906 (1.068031) acc1: 71.875000 (74.294355) acc5: 92.187500 (92.590726) time: 0.363930 data: 0.000159 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.075131 (1.078717) acc1: 71.875000 (73.971037) acc5: 92.187500 (92.492378) time: 0.376948 data: 0.000146 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.096583 (1.083547) acc1: 71.875000 (73.856000) acc5: 92.187500 (92.320000) time: 0.393985 data: 0.000116 max mem: 18817 Test: Total time: 0:00:20 (0.409818 s / it) * Acc@1 73.604 Acc@5 92.154 loss 1.092 Max accuracy: 73.68% Epoch: [40/300] [ 0/1251] eta: 0:40:44 lr: 0.001914 loss: 3.001587 (3.001587) time: 1.954192 data: 1.042487 max mem: 18817 Epoch: [40/300] [ 50/1251] eta: 0:19:52 lr: 0.001914 loss: 3.832533 (3.848500) time: 0.943799 data: 0.000196 max mem: 18817 Epoch: [40/300] [ 100/1251] eta: 0:18:52 lr: 0.001914 loss: 3.554176 (3.753895) time: 0.992710 data: 0.000186 max mem: 18817 Epoch: [40/300] [ 150/1251] eta: 0:17:59 lr: 0.001914 loss: 3.784561 (3.762168) time: 1.029808 data: 0.000159 max mem: 18817 Epoch: [40/300] [ 200/1251] eta: 0:17:00 lr: 0.001914 loss: 3.600667 (3.754579) time: 0.966389 data: 0.000163 max mem: 18817 Epoch: [40/300] [ 250/1251] eta: 0:16:08 lr: 0.001913 loss: 4.097034 (3.775577) time: 0.914941 data: 0.000181 max mem: 18817 Epoch: [40/300] [ 300/1251] eta: 0:15:18 lr: 0.001913 loss: 4.070961 (3.768836) time: 0.923863 data: 0.000171 max mem: 18817 Epoch: [40/300] [ 350/1251] eta: 0:14:30 lr: 0.001913 loss: 3.935980 (3.772240) time: 0.982036 data: 0.000173 max mem: 18817 Epoch: [40/300] [ 400/1251] eta: 0:13:42 lr: 0.001913 loss: 3.925732 (3.806642) time: 1.042424 data: 0.000170 max mem: 18817 Epoch: [40/300] [ 450/1251] eta: 0:12:52 lr: 0.001913 loss: 3.634933 (3.805524) time: 0.980857 data: 0.000165 max mem: 18817 Epoch: [40/300] [ 500/1251] eta: 0:12:03 lr: 0.001913 loss: 3.996516 (3.813046) time: 0.924399 data: 0.000182 max mem: 18817 Epoch: [40/300] [ 550/1251] eta: 0:11:15 lr: 0.001912 loss: 3.893729 (3.815456) time: 0.936263 data: 0.000172 max mem: 18817 Epoch: [40/300] [ 600/1251] eta: 0:10:27 lr: 0.001912 loss: 3.419333 (3.795136) time: 0.992409 data: 0.000172 max mem: 18817 Epoch: [40/300] [ 650/1251] eta: 0:09:39 lr: 0.001912 loss: 4.004626 (3.797486) time: 1.035835 data: 0.000167 max mem: 18817 Epoch: [40/300] [ 700/1251] eta: 0:08:50 lr: 0.001912 loss: 4.030230 (3.810495) time: 0.965341 data: 0.000169 max mem: 18817 Epoch: [40/300] [ 750/1251] eta: 0:08:02 lr: 0.001912 loss: 3.329937 (3.799545) time: 0.922460 data: 0.000167 max mem: 18817 Epoch: [40/300] [ 800/1251] eta: 0:07:14 lr: 0.001912 loss: 4.023977 (3.799943) time: 0.920721 data: 0.000168 max mem: 18817 Epoch: [40/300] [ 850/1251] eta: 0:06:26 lr: 0.001911 loss: 3.950786 (3.808120) time: 0.982026 data: 0.000171 max mem: 18817 Epoch: [40/300] [ 900/1251] eta: 0:05:38 lr: 0.001911 loss: 3.762433 (3.811854) time: 1.043371 data: 0.000174 max mem: 18817 Epoch: [40/300] [ 950/1251] eta: 0:04:49 lr: 0.001911 loss: 3.760584 (3.809582) time: 0.964066 data: 0.000185 max mem: 18817 Epoch: [40/300] [1000/1251] eta: 0:04:01 lr: 0.001911 loss: 4.001929 (3.812592) time: 0.916934 data: 0.000171 max mem: 18817 Epoch: [40/300] [1050/1251] eta: 0:03:13 lr: 0.001911 loss: 4.068254 (3.814823) time: 0.936692 data: 0.000180 max mem: 18817 Epoch: [40/300] [1100/1251] eta: 0:02:25 lr: 0.001911 loss: 3.684260 (3.807828) time: 0.968895 data: 0.000160 max mem: 18817 Epoch: [40/300] [1150/1251] eta: 0:01:37 lr: 0.001910 loss: 3.741400 (3.808499) time: 1.022450 data: 0.000156 max mem: 18817 Epoch: [40/300] [1200/1251] eta: 0:00:48 lr: 0.001910 loss: 3.569870 (3.800921) time: 0.958986 data: 0.000194 max mem: 18817 Epoch: [40/300] [1250/1251] eta: 0:00:00 lr: 0.001910 loss: 3.945566 (3.798551) time: 0.919897 data: 0.000761 max mem: 18817 Epoch: [40/300] Total time: 0:20:01 (0.960706 s / it) Averaged stats: lr: 0.001910 loss: 3.945566 (3.805244) Test: [ 0/49] eta: 0:01:17 loss: 0.947408 (0.947408) acc1: 81.250000 (81.250000) acc5: 96.875000 (96.875000) time: 1.577668 data: 1.139090 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.037040 (1.024446) acc1: 76.562500 (75.710227) acc5: 93.750000 (92.755682) time: 0.481334 data: 0.103713 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 1.128783 (1.093183) acc1: 71.875000 (73.958333) acc5: 92.187500 (91.964286) time: 0.458929 data: 0.000148 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.133786 (1.091085) acc1: 73.437500 (74.193548) acc5: 92.187500 (92.489919) time: 0.454437 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.093413 (1.102862) acc1: 75.000000 (74.123476) acc5: 92.187500 (92.187500) time: 0.360198 data: 0.000136 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.133786 (1.111609) acc1: 75.000000 (74.016000) acc5: 92.187500 (92.256000) time: 0.355296 data: 0.000101 max mem: 18817 Test: Total time: 0:00:20 (0.426454 s / it) * Acc@1 73.818 Acc@5 92.230 loss 1.125 Max accuracy: 73.82% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0040.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0040.pth Epoch: [41/300] [ 0/1251] eta: 0:42:32 lr: 0.001910 loss: 4.187539 (4.187539) time: 2.040221 data: 1.118497 max mem: 18817 Epoch: [41/300] [ 50/1251] eta: 0:19:12 lr: 0.001910 loss: 4.087333 (3.765407) time: 0.911737 data: 0.000148 max mem: 18817 Epoch: [41/300] [ 100/1251] eta: 0:18:33 lr: 0.001910 loss: 3.714128 (3.790112) time: 0.944424 data: 0.000154 max mem: 18817 Epoch: [41/300] [ 150/1251] eta: 0:17:45 lr: 0.001909 loss: 3.982274 (3.820896) time: 0.998175 data: 0.000173 max mem: 18817 Epoch: [41/300] [ 200/1251] eta: 0:16:57 lr: 0.001909 loss: 3.506539 (3.773319) time: 1.024633 data: 0.000152 max mem: 18817 Epoch: [41/300] [ 250/1251] eta: 0:16:06 lr: 0.001909 loss: 3.965700 (3.797646) time: 0.984313 data: 0.000176 max mem: 18817 Epoch: [41/300] [ 300/1251] eta: 0:15:14 lr: 0.001909 loss: 3.916162 (3.786756) time: 0.926840 data: 0.000161 max mem: 18817 Epoch: [41/300] [ 350/1251] eta: 0:14:26 lr: 0.001909 loss: 4.098003 (3.805362) time: 0.921315 data: 0.000177 max mem: 18817 Epoch: [41/300] [ 400/1251] eta: 0:13:38 lr: 0.001909 loss: 3.825013 (3.798946) time: 0.983304 data: 0.000223 max mem: 18817 Epoch: [41/300] [ 450/1251] eta: 0:12:49 lr: 0.001908 loss: 3.741677 (3.784671) time: 0.992116 data: 0.000167 max mem: 18817 Epoch: [41/300] [ 500/1251] eta: 0:12:00 lr: 0.001908 loss: 3.694021 (3.770454) time: 0.979084 data: 0.000183 max mem: 18817 Epoch: [41/300] [ 550/1251] eta: 0:11:11 lr: 0.001908 loss: 3.786943 (3.766912) time: 0.928261 data: 0.000198 max mem: 18817 Epoch: [41/300] [ 600/1251] eta: 0:10:24 lr: 0.001908 loss: 4.016775 (3.770560) time: 0.939338 data: 0.000169 max mem: 18817 Epoch: [41/300] [ 650/1251] eta: 0:09:36 lr: 0.001908 loss: 3.931707 (3.779974) time: 0.971037 data: 0.000174 max mem: 18817 Epoch: [41/300] [ 700/1251] eta: 0:08:48 lr: 0.001908 loss: 3.675721 (3.779358) time: 0.987397 data: 0.000170 max mem: 18817 Epoch: [41/300] [ 750/1251] eta: 0:08:00 lr: 0.001907 loss: 3.954583 (3.783065) time: 0.989294 data: 0.000170 max mem: 18817 Epoch: [41/300] [ 800/1251] eta: 0:07:12 lr: 0.001907 loss: 3.919591 (3.775580) time: 0.922378 data: 0.000167 max mem: 18817 Epoch: [41/300] [ 850/1251] eta: 0:06:24 lr: 0.001907 loss: 4.044311 (3.787302) time: 0.947548 data: 0.000164 max mem: 18817 Epoch: [41/300] [ 900/1251] eta: 0:05:36 lr: 0.001907 loss: 3.900783 (3.789422) time: 0.992368 data: 0.000174 max mem: 18817 Epoch: [41/300] [ 950/1251] eta: 0:04:49 lr: 0.001907 loss: 3.864661 (3.791704) time: 1.017861 data: 0.000169 max mem: 18817 Epoch: [41/300] [1000/1251] eta: 0:04:00 lr: 0.001907 loss: 3.676253 (3.788417) time: 0.977279 data: 0.000163 max mem: 18817 Epoch: [41/300] [1050/1251] eta: 0:03:12 lr: 0.001906 loss: 3.763812 (3.788256) time: 0.934575 data: 0.000167 max mem: 18817 Epoch: [41/300] [1100/1251] eta: 0:02:24 lr: 0.001906 loss: 4.076179 (3.791797) time: 0.936485 data: 0.000161 max mem: 18817 Epoch: [41/300] [1150/1251] eta: 0:01:37 lr: 0.001906 loss: 3.847788 (3.793972) time: 0.984742 data: 0.000168 max mem: 18817 Epoch: [41/300] [1200/1251] eta: 0:00:48 lr: 0.001906 loss: 3.785348 (3.793500) time: 0.981205 data: 0.000214 max mem: 18817 Epoch: [41/300] [1250/1251] eta: 0:00:00 lr: 0.001906 loss: 3.817340 (3.794025) time: 1.003191 data: 0.000752 max mem: 18817 Epoch: [41/300] Total time: 0:20:01 (0.960532 s / it) Averaged stats: lr: 0.001906 loss: 3.817340 (3.798463) Test: [ 0/49] eta: 0:01:15 loss: 0.885313 (0.885313) acc1: 79.687500 (79.687500) acc5: 92.187500 (92.187500) time: 1.537412 data: 1.119191 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 1.000869 (1.030051) acc1: 76.562500 (76.420455) acc5: 92.187500 (93.039773) time: 0.475667 data: 0.101878 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.134928 (1.092899) acc1: 75.000000 (74.851190) acc5: 92.187500 (92.633929) time: 0.366761 data: 0.000130 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.120673 (1.083821) acc1: 73.437500 (75.151210) acc5: 92.187500 (92.842742) time: 0.363544 data: 0.000124 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.102972 (1.098848) acc1: 73.437500 (74.809451) acc5: 92.187500 (92.644817) time: 0.368162 data: 0.000146 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.148657 (1.094424) acc1: 73.437500 (74.912000) acc5: 92.187500 (92.704000) time: 0.377850 data: 0.000126 max mem: 18817 Test: Total time: 0:00:19 (0.398780 s / it) * Acc@1 74.268 Acc@5 92.560 loss 1.105 Max accuracy: 74.27% Epoch: [42/300] [ 0/1251] eta: 0:42:24 lr: 0.001906 loss: 3.687242 (3.687242) time: 2.034331 data: 1.135196 max mem: 18817 Epoch: [42/300] [ 50/1251] eta: 0:19:43 lr: 0.001905 loss: 3.965533 (3.926814) time: 0.949604 data: 0.000191 max mem: 18817 Epoch: [42/300] [ 100/1251] eta: 0:18:42 lr: 0.001905 loss: 3.593980 (3.848327) time: 0.970314 data: 0.000166 max mem: 18817 Epoch: [42/300] [ 150/1251] eta: 0:17:40 lr: 0.001905 loss: 3.901558 (3.754631) time: 0.967415 data: 0.000198 max mem: 18817 Epoch: [42/300] [ 200/1251] eta: 0:16:47 lr: 0.001905 loss: 3.922015 (3.762278) time: 0.912306 data: 0.000177 max mem: 18817 Epoch: [42/300] [ 250/1251] eta: 0:16:00 lr: 0.001905 loss: 3.574401 (3.752041) time: 0.916515 data: 0.000168 max mem: 18817 Epoch: [42/300] [ 300/1251] eta: 0:15:13 lr: 0.001905 loss: 3.609729 (3.740794) time: 0.989264 data: 0.000182 max mem: 18817 Epoch: [42/300] [ 350/1251] eta: 0:14:25 lr: 0.001904 loss: 3.951010 (3.749186) time: 1.011624 data: 0.000188 max mem: 18817 Epoch: [42/300] [ 400/1251] eta: 0:13:36 lr: 0.001904 loss: 3.912628 (3.739856) time: 0.969540 data: 0.000172 max mem: 18817 Epoch: [42/300] [ 450/1251] eta: 0:12:47 lr: 0.001904 loss: 3.934825 (3.734563) time: 0.911358 data: 0.000189 max mem: 18817 Epoch: [42/300] [ 500/1251] eta: 0:12:00 lr: 0.001904 loss: 4.004578 (3.743806) time: 0.913880 data: 0.000165 max mem: 18817 Epoch: [42/300] [ 550/1251] eta: 0:11:12 lr: 0.001904 loss: 3.685002 (3.742947) time: 0.980472 data: 0.000178 max mem: 18817 Epoch: [42/300] [ 600/1251] eta: 0:10:25 lr: 0.001904 loss: 3.666301 (3.749526) time: 1.022037 data: 0.000178 max mem: 18817 Epoch: [42/300] [ 650/1251] eta: 0:09:36 lr: 0.001903 loss: 3.945758 (3.752608) time: 0.955053 data: 0.000165 max mem: 18817 Epoch: [42/300] [ 700/1251] eta: 0:08:48 lr: 0.001903 loss: 3.670984 (3.743111) time: 0.912484 data: 0.000166 max mem: 18817 Epoch: [42/300] [ 750/1251] eta: 0:08:00 lr: 0.001903 loss: 4.029204 (3.741957) time: 0.924248 data: 0.000183 max mem: 18817 Epoch: [42/300] [ 800/1251] eta: 0:07:12 lr: 0.001903 loss: 3.819382 (3.737611) time: 0.989686 data: 0.000164 max mem: 18817 Epoch: [42/300] [ 850/1251] eta: 0:06:25 lr: 0.001903 loss: 3.314683 (3.737307) time: 1.054040 data: 0.000167 max mem: 18817 Epoch: [42/300] [ 900/1251] eta: 0:05:36 lr: 0.001902 loss: 3.895118 (3.738688) time: 0.960872 data: 0.000163 max mem: 18817 Epoch: [42/300] [ 950/1251] eta: 0:04:48 lr: 0.001902 loss: 3.951517 (3.741078) time: 0.910014 data: 0.000163 max mem: 18817 Epoch: [42/300] [1000/1251] eta: 0:04:00 lr: 0.001902 loss: 3.878560 (3.743847) time: 0.931477 data: 0.000182 max mem: 18817 Epoch: [42/300] [1050/1251] eta: 0:03:12 lr: 0.001902 loss: 3.911788 (3.748085) time: 0.971125 data: 0.000186 max mem: 18817 Epoch: [42/300] [1100/1251] eta: 0:02:25 lr: 0.001902 loss: 3.553396 (3.743236) time: 1.046409 data: 0.000157 max mem: 18817 Epoch: [42/300] [1150/1251] eta: 0:01:37 lr: 0.001902 loss: 4.023157 (3.741744) time: 0.977725 data: 0.000181 max mem: 18817 Epoch: [42/300] [1200/1251] eta: 0:00:48 lr: 0.001901 loss: 3.977386 (3.744417) time: 0.924978 data: 0.000163 max mem: 18817 Epoch: [42/300] [1250/1251] eta: 0:00:00 lr: 0.001901 loss: 3.708358 (3.742452) time: 0.923985 data: 0.000755 max mem: 18817 Epoch: [42/300] Total time: 0:20:02 (0.961504 s / it) Averaged stats: lr: 0.001901 loss: 3.708358 (3.737778) Test: [ 0/49] eta: 0:01:30 loss: 0.881518 (0.881518) acc1: 79.687500 (79.687500) acc5: 93.750000 (93.750000) time: 1.852949 data: 1.443896 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.944177 (1.026083) acc1: 76.562500 (75.852273) acc5: 93.750000 (92.755682) time: 0.504318 data: 0.131423 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.059481 (1.059845) acc1: 76.562500 (75.074405) acc5: 92.187500 (92.559524) time: 0.366866 data: 0.000163 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.040390 (1.050854) acc1: 75.000000 (75.100806) acc5: 92.187500 (92.489919) time: 0.365257 data: 0.000152 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.030788 (1.066664) acc1: 75.000000 (74.809451) acc5: 92.187500 (92.644817) time: 0.449397 data: 0.000152 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.134914 (1.069272) acc1: 73.437500 (75.008000) acc5: 93.750000 (92.672000) time: 0.443193 data: 0.000122 max mem: 18817 Test: Total time: 0:00:21 (0.431702 s / it) * Acc@1 74.248 Acc@5 92.500 loss 1.079 Max accuracy: 74.27% Epoch: [43/300] [ 0/1251] eta: 0:43:32 lr: 0.001901 loss: 4.212551 (4.212551) time: 2.088341 data: 1.196985 max mem: 18817 Epoch: [43/300] [ 50/1251] eta: 0:19:54 lr: 0.001901 loss: 3.948607 (3.843252) time: 1.040423 data: 0.000182 max mem: 18817 Epoch: [43/300] [ 100/1251] eta: 0:18:42 lr: 0.001901 loss: 3.823377 (3.829141) time: 0.989390 data: 0.000169 max mem: 18817 Epoch: [43/300] [ 150/1251] eta: 0:17:44 lr: 0.001901 loss: 4.023411 (3.828780) time: 0.922739 data: 0.000168 max mem: 18817 Epoch: [43/300] [ 200/1251] eta: 0:16:57 lr: 0.001900 loss: 3.917398 (3.791830) time: 0.936596 data: 0.000175 max mem: 18817 Epoch: [43/300] [ 250/1251] eta: 0:16:09 lr: 0.001900 loss: 3.963051 (3.788486) time: 0.992324 data: 0.000184 max mem: 18817 Epoch: [43/300] [ 300/1251] eta: 0:15:19 lr: 0.001900 loss: 4.011434 (3.808614) time: 1.018735 data: 0.000163 max mem: 18817 Epoch: [43/300] [ 350/1251] eta: 0:14:29 lr: 0.001900 loss: 3.833526 (3.777726) time: 0.984257 data: 0.000157 max mem: 18817 Epoch: [43/300] [ 400/1251] eta: 0:13:38 lr: 0.001900 loss: 3.881990 (3.775996) time: 0.922139 data: 0.000157 max mem: 18817 Epoch: [43/300] [ 450/1251] eta: 0:12:50 lr: 0.001900 loss: 3.878113 (3.781162) time: 0.917194 data: 0.000168 max mem: 18817 Epoch: [43/300] [ 500/1251] eta: 0:12:01 lr: 0.001899 loss: 3.829674 (3.786452) time: 0.957359 data: 0.000165 max mem: 18817 Epoch: [43/300] [ 550/1251] eta: 0:11:13 lr: 0.001899 loss: 3.616101 (3.784415) time: 1.011067 data: 0.000172 max mem: 18817 Epoch: [43/300] [ 600/1251] eta: 0:10:24 lr: 0.001899 loss: 4.096614 (3.776741) time: 0.978964 data: 0.000172 max mem: 18817 Epoch: [43/300] [ 650/1251] eta: 0:09:35 lr: 0.001899 loss: 3.632243 (3.767611) time: 0.910839 data: 0.000166 max mem: 18817 Epoch: [43/300] [ 700/1251] eta: 0:08:48 lr: 0.001899 loss: 3.769430 (3.768196) time: 0.919618 data: 0.000167 max mem: 18817 Epoch: [43/300] [ 750/1251] eta: 0:08:00 lr: 0.001898 loss: 3.781538 (3.764643) time: 0.965004 data: 0.000163 max mem: 18817 Epoch: [43/300] [ 800/1251] eta: 0:07:12 lr: 0.001898 loss: 3.972330 (3.772624) time: 1.003222 data: 0.000176 max mem: 18817 Epoch: [43/300] [ 850/1251] eta: 0:06:24 lr: 0.001898 loss: 3.932526 (3.771959) time: 0.982998 data: 0.000187 max mem: 18817 Epoch: [43/300] [ 900/1251] eta: 0:05:36 lr: 0.001898 loss: 3.829732 (3.766956) time: 0.924485 data: 0.000172 max mem: 18817 Epoch: [43/300] [ 950/1251] eta: 0:04:48 lr: 0.001898 loss: 3.370166 (3.755567) time: 0.930285 data: 0.000172 max mem: 18817 Epoch: [43/300] [1000/1251] eta: 0:04:00 lr: 0.001898 loss: 3.696072 (3.753623) time: 0.971495 data: 0.000167 max mem: 18817 Epoch: [43/300] [1050/1251] eta: 0:03:12 lr: 0.001897 loss: 3.836010 (3.753019) time: 0.990540 data: 0.000175 max mem: 18817 Epoch: [43/300] [1100/1251] eta: 0:02:24 lr: 0.001897 loss: 3.963340 (3.757407) time: 0.959477 data: 0.000165 max mem: 18817 Epoch: [43/300] [1150/1251] eta: 0:01:36 lr: 0.001897 loss: 3.974078 (3.762506) time: 0.950610 data: 0.000167 max mem: 18817 Epoch: [43/300] [1200/1251] eta: 0:00:48 lr: 0.001897 loss: 3.929598 (3.756772) time: 0.933527 data: 0.000180 max mem: 18817 Epoch: [43/300] [1250/1251] eta: 0:00:00 lr: 0.001897 loss: 3.887346 (3.755645) time: 0.986353 data: 0.000753 max mem: 18817 Epoch: [43/300] Total time: 0:20:00 (0.959279 s / it) Averaged stats: lr: 0.001897 loss: 3.887346 (3.745940) Test: [ 0/49] eta: 0:01:21 loss: 0.902101 (0.902101) acc1: 78.125000 (78.125000) acc5: 96.875000 (96.875000) time: 1.654461 data: 1.260832 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.971325 (0.967066) acc1: 78.125000 (77.698864) acc5: 93.750000 (93.607955) time: 0.488643 data: 0.114750 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.036320 (1.016473) acc1: 75.000000 (76.041667) acc5: 92.187500 (93.303571) time: 0.371657 data: 0.000131 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.083516 (1.029622) acc1: 75.000000 (75.705645) acc5: 92.187500 (93.497984) time: 0.367863 data: 0.000120 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.081120 (1.048239) acc1: 75.000000 (75.304878) acc5: 92.187500 (93.064024) time: 0.361361 data: 0.000117 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.075435 (1.050676) acc1: 75.000000 (75.232000) acc5: 92.187500 (93.024000) time: 0.355428 data: 0.000098 max mem: 18817 Test: Total time: 0:00:19 (0.394064 s / it) * Acc@1 74.426 Acc@5 92.578 loss 1.066 Max accuracy: 74.43% Epoch: [44/300] [ 0/1251] eta: 0:43:31 lr: 0.001897 loss: 3.421007 (3.421007) time: 2.087830 data: 1.186556 max mem: 18817 Epoch: [44/300] [ 50/1251] eta: 0:19:36 lr: 0.001896 loss: 3.573836 (3.771626) time: 0.939363 data: 0.000154 max mem: 18817 Epoch: [44/300] [ 100/1251] eta: 0:18:29 lr: 0.001896 loss: 3.708140 (3.748301) time: 0.919213 data: 0.000188 max mem: 18817 Epoch: [44/300] [ 150/1251] eta: 0:17:41 lr: 0.001896 loss: 3.887058 (3.730857) time: 0.929786 data: 0.000168 max mem: 18817 Epoch: [44/300] [ 200/1251] eta: 0:16:55 lr: 0.001896 loss: 3.647337 (3.739588) time: 0.985410 data: 0.000166 max mem: 18817 Epoch: [44/300] [ 250/1251] eta: 0:16:03 lr: 0.001896 loss: 3.394455 (3.725259) time: 0.975621 data: 0.000178 max mem: 18817 Epoch: [44/300] [ 300/1251] eta: 0:15:16 lr: 0.001895 loss: 3.970518 (3.735182) time: 0.948944 data: 0.000158 max mem: 18817 Epoch: [44/300] [ 350/1251] eta: 0:14:26 lr: 0.001895 loss: 3.563017 (3.739744) time: 0.937089 data: 0.000171 max mem: 18817 Epoch: [44/300] [ 400/1251] eta: 0:13:39 lr: 0.001895 loss: 3.904999 (3.746890) time: 0.920354 data: 0.000194 max mem: 18817 Epoch: [44/300] [ 450/1251] eta: 0:12:50 lr: 0.001895 loss: 3.856172 (3.750679) time: 0.985038 data: 0.000186 max mem: 18817 Epoch: [44/300] [ 500/1251] eta: 0:12:02 lr: 0.001895 loss: 4.061193 (3.744811) time: 0.987237 data: 0.000173 max mem: 18817 Epoch: [44/300] [ 550/1251] eta: 0:11:14 lr: 0.001895 loss: 3.789279 (3.750380) time: 0.965141 data: 0.000175 max mem: 18817 Epoch: [44/300] [ 600/1251] eta: 0:10:25 lr: 0.001894 loss: 3.640956 (3.752112) time: 0.925762 data: 0.000166 max mem: 18817 Epoch: [44/300] [ 650/1251] eta: 0:09:37 lr: 0.001894 loss: 3.742481 (3.745682) time: 0.931640 data: 0.000167 max mem: 18817 Epoch: [44/300] [ 700/1251] eta: 0:08:49 lr: 0.001894 loss: 3.697395 (3.740249) time: 0.971484 data: 0.000164 max mem: 18817 Epoch: [44/300] [ 750/1251] eta: 0:08:01 lr: 0.001894 loss: 3.754175 (3.737309) time: 0.982626 data: 0.000171 max mem: 18817 Epoch: [44/300] [ 800/1251] eta: 0:07:13 lr: 0.001894 loss: 3.730808 (3.733145) time: 0.953124 data: 0.000175 max mem: 18817 Epoch: [44/300] [ 850/1251] eta: 0:06:25 lr: 0.001893 loss: 3.697715 (3.731708) time: 0.933406 data: 0.000180 max mem: 18817 Epoch: [44/300] [ 900/1251] eta: 0:05:37 lr: 0.001893 loss: 3.782197 (3.733640) time: 0.924355 data: 0.000173 max mem: 18817 Epoch: [44/300] [ 950/1251] eta: 0:04:49 lr: 0.001893 loss: 3.783030 (3.737222) time: 1.009339 data: 0.000175 max mem: 18817 Epoch: [44/300] [1000/1251] eta: 0:04:01 lr: 0.001893 loss: 3.908180 (3.738305) time: 0.981477 data: 0.000163 max mem: 18817 Epoch: [44/300] [1050/1251] eta: 0:03:13 lr: 0.001893 loss: 3.582657 (3.736953) time: 0.959393 data: 0.000174 max mem: 18817 Epoch: [44/300] [1100/1251] eta: 0:02:25 lr: 0.001893 loss: 3.539000 (3.736423) time: 0.925454 data: 0.000185 max mem: 18817 Epoch: [44/300] [1150/1251] eta: 0:01:37 lr: 0.001892 loss: 3.953192 (3.733476) time: 0.932213 data: 0.000164 max mem: 18817 Epoch: [44/300] [1200/1251] eta: 0:00:49 lr: 0.001892 loss: 3.943928 (3.733538) time: 0.989538 data: 0.000184 max mem: 18817 Epoch: [44/300] [1250/1251] eta: 0:00:00 lr: 0.001892 loss: 3.767128 (3.735255) time: 0.977700 data: 0.000764 max mem: 18817 Epoch: [44/300] Total time: 0:20:02 (0.961306 s / it) Averaged stats: lr: 0.001892 loss: 3.767128 (3.745608) Test: [ 0/49] eta: 0:01:17 loss: 0.906015 (0.906015) acc1: 81.250000 (81.250000) acc5: 92.187500 (92.187500) time: 1.575216 data: 1.172928 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.954761 (0.987336) acc1: 75.000000 (76.562500) acc5: 93.750000 (92.897727) time: 0.515559 data: 0.106792 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.977806 (1.010763) acc1: 75.000000 (76.413690) acc5: 93.750000 (93.080357) time: 0.407245 data: 0.000148 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.044615 (1.015799) acc1: 75.000000 (75.957661) acc5: 93.750000 (93.296371) time: 0.384778 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.047562 (1.037799) acc1: 75.000000 (75.076220) acc5: 93.750000 (93.292683) time: 0.361607 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.109912 (1.048939) acc1: 71.875000 (75.200000) acc5: 93.750000 (93.280000) time: 0.358034 data: 0.000106 max mem: 18817 Test: Total time: 0:00:19 (0.407784 s / it) * Acc@1 74.748 Acc@5 92.832 loss 1.061 Max accuracy: 74.75% Epoch: [45/300] [ 0/1251] eta: 0:41:34 lr: 0.001892 loss: 4.409186 (4.409186) time: 1.994379 data: 1.099119 max mem: 18817 Epoch: [45/300] [ 50/1251] eta: 0:19:56 lr: 0.001892 loss: 3.775946 (3.868943) time: 0.926686 data: 0.000150 max mem: 18817 Epoch: [45/300] [ 100/1251] eta: 0:18:49 lr: 0.001892 loss: 4.057892 (3.860892) time: 0.982563 data: 0.000155 max mem: 18817 Epoch: [45/300] [ 150/1251] eta: 0:17:53 lr: 0.001891 loss: 3.440915 (3.835142) time: 0.981424 data: 0.000176 max mem: 18817 Epoch: [45/300] [ 200/1251] eta: 0:16:55 lr: 0.001891 loss: 3.923018 (3.787157) time: 0.960912 data: 0.000162 max mem: 18817 Epoch: [45/300] [ 250/1251] eta: 0:16:03 lr: 0.001891 loss: 3.724721 (3.770620) time: 0.913617 data: 0.000171 max mem: 18817 Epoch: [45/300] [ 300/1251] eta: 0:15:16 lr: 0.001891 loss: 3.848838 (3.756665) time: 0.926934 data: 0.000174 max mem: 18817 Epoch: [45/300] [ 350/1251] eta: 0:14:29 lr: 0.001891 loss: 3.816015 (3.757980) time: 0.992593 data: 0.000170 max mem: 18817 Epoch: [45/300] [ 400/1251] eta: 0:13:42 lr: 0.001890 loss: 3.943577 (3.764080) time: 0.992564 data: 0.000169 max mem: 18817 Epoch: [45/300] [ 450/1251] eta: 0:12:53 lr: 0.001890 loss: 3.719480 (3.760180) time: 0.969457 data: 0.000188 max mem: 18817 Epoch: [45/300] [ 500/1251] eta: 0:12:04 lr: 0.001890 loss: 3.852237 (3.760614) time: 0.915946 data: 0.000192 max mem: 18817 Epoch: [45/300] [ 550/1251] eta: 0:11:15 lr: 0.001890 loss: 4.008782 (3.766013) time: 0.921448 data: 0.000186 max mem: 18817 Epoch: [45/300] [ 600/1251] eta: 0:10:27 lr: 0.001890 loss: 3.720541 (3.758933) time: 0.987411 data: 0.000166 max mem: 18817 Epoch: [45/300] [ 650/1251] eta: 0:09:39 lr: 0.001889 loss: 3.830424 (3.759713) time: 0.964671 data: 0.000166 max mem: 18817 Epoch: [45/300] [ 700/1251] eta: 0:08:50 lr: 0.001889 loss: 3.796440 (3.757658) time: 0.964233 data: 0.000152 max mem: 18817 Epoch: [45/300] [ 750/1251] eta: 0:08:01 lr: 0.001889 loss: 3.745153 (3.748982) time: 0.914991 data: 0.000158 max mem: 18817 Epoch: [45/300] [ 800/1251] eta: 0:07:13 lr: 0.001889 loss: 3.665180 (3.750190) time: 0.910962 data: 0.000183 max mem: 18817 Epoch: [45/300] [ 850/1251] eta: 0:06:25 lr: 0.001889 loss: 3.668902 (3.750526) time: 1.004174 data: 0.000172 max mem: 18817 Epoch: [45/300] [ 900/1251] eta: 0:05:37 lr: 0.001889 loss: 3.478767 (3.746461) time: 0.981390 data: 0.000167 max mem: 18817 Epoch: [45/300] [ 950/1251] eta: 0:04:49 lr: 0.001888 loss: 3.712937 (3.739413) time: 0.969590 data: 0.000176 max mem: 18817 Epoch: [45/300] [1000/1251] eta: 0:04:01 lr: 0.001888 loss: 3.788614 (3.738556) time: 0.914805 data: 0.000153 max mem: 18817 Epoch: [45/300] [1050/1251] eta: 0:03:13 lr: 0.001888 loss: 3.657820 (3.743590) time: 0.928480 data: 0.000167 max mem: 18817 Epoch: [45/300] [1100/1251] eta: 0:02:25 lr: 0.001888 loss: 3.715534 (3.737638) time: 1.004967 data: 0.000189 max mem: 18817 Epoch: [45/300] [1150/1251] eta: 0:01:37 lr: 0.001888 loss: 3.512337 (3.736049) time: 0.969511 data: 0.000183 max mem: 18817 Epoch: [45/300] [1200/1251] eta: 0:00:48 lr: 0.001887 loss: 3.611311 (3.729454) time: 0.973881 data: 0.000166 max mem: 18817 Epoch: [45/300] [1250/1251] eta: 0:00:00 lr: 0.001887 loss: 3.801137 (3.731456) time: 0.913157 data: 0.000743 max mem: 18817 Epoch: [45/300] Total time: 0:20:01 (0.960432 s / it) Averaged stats: lr: 0.001887 loss: 3.801137 (3.729266) Test: [ 0/49] eta: 0:01:11 loss: 0.945152 (0.945152) acc1: 81.250000 (81.250000) acc5: 92.187500 (92.187500) time: 1.460936 data: 1.063388 max mem: 18817 Test: [10/49] eta: 0:00:25 loss: 0.951356 (0.971851) acc1: 79.687500 (78.693182) acc5: 93.750000 (93.181818) time: 0.655280 data: 0.096809 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 1.081242 (1.028054) acc1: 75.000000 (77.306548) acc5: 93.750000 (92.782738) time: 0.468515 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.097855 (1.040345) acc1: 73.437500 (76.008065) acc5: 93.750000 (92.842742) time: 0.362710 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.073515 (1.061254) acc1: 73.437500 (75.342988) acc5: 93.750000 (93.140244) time: 0.360830 data: 0.000133 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.102722 (1.058423) acc1: 73.437500 (75.424000) acc5: 93.750000 (93.312000) time: 0.355771 data: 0.000111 max mem: 18817 Test: Total time: 0:00:20 (0.427631 s / it) * Acc@1 74.802 Acc@5 92.766 loss 1.074 Max accuracy: 74.80% Epoch: [46/300] [ 0/1251] eta: 0:40:37 lr: 0.001887 loss: 2.586673 (2.586673) time: 1.948717 data: 1.057184 max mem: 18817 Epoch: [46/300] [ 50/1251] eta: 0:19:42 lr: 0.001887 loss: 4.016838 (3.752911) time: 0.973003 data: 0.000182 max mem: 18817 Epoch: [46/300] [ 100/1251] eta: 0:18:29 lr: 0.001887 loss: 3.842060 (3.822881) time: 0.970822 data: 0.000179 max mem: 18817 Epoch: [46/300] [ 150/1251] eta: 0:17:43 lr: 0.001887 loss: 3.902935 (3.784130) time: 0.969502 data: 0.000170 max mem: 18817 Epoch: [46/300] [ 200/1251] eta: 0:16:50 lr: 0.001886 loss: 3.867884 (3.757324) time: 0.931227 data: 0.000165 max mem: 18817 Epoch: [46/300] [ 250/1251] eta: 0:16:00 lr: 0.001886 loss: 3.661211 (3.751826) time: 0.925176 data: 0.000174 max mem: 18817 Epoch: [46/300] [ 300/1251] eta: 0:15:14 lr: 0.001886 loss: 3.835503 (3.740490) time: 0.995272 data: 0.000167 max mem: 18817 Epoch: [46/300] [ 350/1251] eta: 0:14:24 lr: 0.001886 loss: 3.825590 (3.703715) time: 0.977100 data: 0.000164 max mem: 18817 Epoch: [46/300] [ 400/1251] eta: 0:13:37 lr: 0.001886 loss: 3.915379 (3.702129) time: 0.935438 data: 0.000189 max mem: 18817 Epoch: [46/300] [ 450/1251] eta: 0:12:48 lr: 0.001885 loss: 3.533995 (3.698699) time: 0.923780 data: 0.000165 max mem: 18817 Epoch: [46/300] [ 500/1251] eta: 0:12:01 lr: 0.001885 loss: 3.996472 (3.709827) time: 0.945457 data: 0.000188 max mem: 18817 Epoch: [46/300] [ 550/1251] eta: 0:11:14 lr: 0.001885 loss: 3.531062 (3.713752) time: 1.007088 data: 0.000174 max mem: 18817 Epoch: [46/300] [ 600/1251] eta: 0:10:24 lr: 0.001885 loss: 3.346249 (3.717073) time: 0.967835 data: 0.000172 max mem: 18817 Epoch: [46/300] [ 650/1251] eta: 0:09:36 lr: 0.001885 loss: 4.008889 (3.711272) time: 0.958712 data: 0.000172 max mem: 18817 Epoch: [46/300] [ 700/1251] eta: 0:08:48 lr: 0.001884 loss: 3.699054 (3.709723) time: 0.926230 data: 0.000170 max mem: 18817 Epoch: [46/300] [ 750/1251] eta: 0:08:00 lr: 0.001884 loss: 3.590837 (3.708781) time: 0.939043 data: 0.000165 max mem: 18817 Epoch: [46/300] [ 800/1251] eta: 0:07:13 lr: 0.001884 loss: 3.828913 (3.713706) time: 0.986419 data: 0.000188 max mem: 18817 Epoch: [46/300] [ 850/1251] eta: 0:06:25 lr: 0.001884 loss: 4.011740 (3.704769) time: 0.976945 data: 0.000181 max mem: 18817 Epoch: [46/300] [ 900/1251] eta: 0:05:37 lr: 0.001884 loss: 4.093102 (3.707507) time: 0.984829 data: 0.000166 max mem: 18817 Epoch: [46/300] [ 950/1251] eta: 0:04:49 lr: 0.001883 loss: 3.969953 (3.711793) time: 0.930845 data: 0.000176 max mem: 18817 Epoch: [46/300] [1000/1251] eta: 0:04:01 lr: 0.001883 loss: 3.428227 (3.708363) time: 0.959808 data: 0.000172 max mem: 18817 Epoch: [46/300] [1050/1251] eta: 0:03:13 lr: 0.001883 loss: 3.807588 (3.704648) time: 0.976097 data: 0.000177 max mem: 18817 Epoch: [46/300] [1100/1251] eta: 0:02:25 lr: 0.001883 loss: 3.752518 (3.699710) time: 0.983717 data: 0.000168 max mem: 18817 Epoch: [46/300] [1150/1251] eta: 0:01:36 lr: 0.001883 loss: 3.959552 (3.705561) time: 0.926959 data: 0.000159 max mem: 18817 Epoch: [46/300] [1200/1251] eta: 0:00:48 lr: 0.001883 loss: 3.784580 (3.707498) time: 0.923071 data: 0.000170 max mem: 18817 Epoch: [46/300] [1250/1251] eta: 0:00:00 lr: 0.001882 loss: 3.846136 (3.713279) time: 0.924351 data: 0.000741 max mem: 18817 Epoch: [46/300] Total time: 0:20:01 (0.960071 s / it) Averaged stats: lr: 0.001882 loss: 3.846136 (3.714444) Test: [ 0/49] eta: 0:01:16 loss: 0.828924 (0.828924) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.556948 data: 1.100725 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.999016 (1.000598) acc1: 76.562500 (77.272727) acc5: 93.750000 (93.892045) time: 0.476657 data: 0.100213 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.040481 (1.035234) acc1: 76.562500 (75.967262) acc5: 93.750000 (93.601190) time: 0.364986 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.093437 (1.049777) acc1: 73.437500 (75.504032) acc5: 93.750000 (93.447581) time: 0.361237 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.093437 (1.063727) acc1: 73.437500 (75.038110) acc5: 92.187500 (93.254573) time: 0.359343 data: 0.000131 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.104642 (1.075138) acc1: 73.437500 (75.008000) acc5: 92.187500 (93.024000) time: 0.354707 data: 0.000103 max mem: 18817 Test: Total time: 0:00:18 (0.386637 s / it) * Acc@1 74.806 Acc@5 92.904 loss 1.089 Max accuracy: 74.81% Epoch: [47/300] [ 0/1251] eta: 0:43:53 lr: 0.001882 loss: 4.063732 (4.063732) time: 2.105432 data: 1.147189 max mem: 18817 Epoch: [47/300] [ 50/1251] eta: 0:19:15 lr: 0.001882 loss: 3.686098 (3.654784) time: 0.961489 data: 0.000182 max mem: 18817 Epoch: [47/300] [ 100/1251] eta: 0:18:18 lr: 0.001882 loss: 3.769976 (3.697199) time: 0.907471 data: 0.000188 max mem: 18817 Epoch: [47/300] [ 150/1251] eta: 0:17:39 lr: 0.001882 loss: 3.576323 (3.667576) time: 0.941270 data: 0.000177 max mem: 18817 Epoch: [47/300] [ 200/1251] eta: 0:16:55 lr: 0.001882 loss: 3.673843 (3.695656) time: 0.940157 data: 0.000186 max mem: 18817 Epoch: [47/300] [ 250/1251] eta: 0:16:09 lr: 0.001881 loss: 3.824274 (3.685763) time: 0.982090 data: 0.000181 max mem: 18817 Epoch: [47/300] [ 300/1251] eta: 0:15:18 lr: 0.001881 loss: 3.704884 (3.682989) time: 0.984595 data: 0.000178 max mem: 18817 Epoch: [47/300] [ 350/1251] eta: 0:14:29 lr: 0.001881 loss: 3.696462 (3.686270) time: 0.947335 data: 0.000161 max mem: 18817 Epoch: [47/300] [ 400/1251] eta: 0:13:40 lr: 0.001881 loss: 3.638823 (3.685340) time: 0.941182 data: 0.000172 max mem: 18817 Epoch: [47/300] [ 450/1251] eta: 0:12:52 lr: 0.001881 loss: 3.733822 (3.697947) time: 0.936691 data: 0.000170 max mem: 18817 Epoch: [47/300] [ 500/1251] eta: 0:12:04 lr: 0.001880 loss: 3.772433 (3.684604) time: 0.984406 data: 0.000177 max mem: 18817 Epoch: [47/300] [ 550/1251] eta: 0:11:14 lr: 0.001880 loss: 3.815350 (3.692536) time: 0.964426 data: 0.000185 max mem: 18817 Epoch: [47/300] [ 600/1251] eta: 0:10:26 lr: 0.001880 loss: 4.019083 (3.702822) time: 0.957762 data: 0.000181 max mem: 18817 Epoch: [47/300] [ 650/1251] eta: 0:09:38 lr: 0.001880 loss: 3.778600 (3.707731) time: 0.917375 data: 0.000166 max mem: 18817 Epoch: [47/300] [ 700/1251] eta: 0:08:50 lr: 0.001880 loss: 3.912955 (3.698890) time: 0.927731 data: 0.000188 max mem: 18817 Epoch: [47/300] [ 750/1251] eta: 0:08:02 lr: 0.001879 loss: 3.592540 (3.704241) time: 1.008425 data: 0.000182 max mem: 18817 Epoch: [47/300] [ 800/1251] eta: 0:07:13 lr: 0.001879 loss: 3.695630 (3.709197) time: 0.966002 data: 0.000176 max mem: 18817 Epoch: [47/300] [ 850/1251] eta: 0:06:25 lr: 0.001879 loss: 3.637042 (3.711138) time: 0.944067 data: 0.000175 max mem: 18817 Epoch: [47/300] [ 900/1251] eta: 0:05:37 lr: 0.001879 loss: 3.291818 (3.697405) time: 0.914644 data: 0.000164 max mem: 18817 Epoch: [47/300] [ 950/1251] eta: 0:04:49 lr: 0.001879 loss: 3.889087 (3.694994) time: 0.934590 data: 0.000174 max mem: 18817 Epoch: [47/300] [1000/1251] eta: 0:04:01 lr: 0.001878 loss: 3.762035 (3.692184) time: 0.981231 data: 0.000426 max mem: 18817 Epoch: [47/300] [1050/1251] eta: 0:03:13 lr: 0.001878 loss: 3.492541 (3.695218) time: 0.982969 data: 0.000185 max mem: 18817 Epoch: [47/300] [1100/1251] eta: 0:02:25 lr: 0.001878 loss: 3.778927 (3.697309) time: 0.936516 data: 0.000178 max mem: 18817 Epoch: [47/300] [1150/1251] eta: 0:01:37 lr: 0.001878 loss: 3.634707 (3.697236) time: 0.931882 data: 0.000178 max mem: 18817 Epoch: [47/300] [1200/1251] eta: 0:00:49 lr: 0.001878 loss: 3.910425 (3.701641) time: 0.930610 data: 0.000170 max mem: 18817 Epoch: [47/300] [1250/1251] eta: 0:00:00 lr: 0.001877 loss: 3.863872 (3.704551) time: 0.990425 data: 0.000760 max mem: 18817 Epoch: [47/300] Total time: 0:20:03 (0.962404 s / it) Averaged stats: lr: 0.001877 loss: 3.863872 (3.708455) Test: [ 0/49] eta: 0:01:17 loss: 0.911587 (0.911587) acc1: 82.812500 (82.812500) acc5: 93.750000 (93.750000) time: 1.578082 data: 1.109409 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 1.045947 (1.031269) acc1: 78.125000 (77.982955) acc5: 93.750000 (93.892045) time: 0.491208 data: 0.101035 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.116257 (1.084930) acc1: 75.000000 (75.967262) acc5: 92.187500 (93.303571) time: 0.381329 data: 0.000171 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.103837 (1.075007) acc1: 75.000000 (75.604839) acc5: 92.187500 (93.598790) time: 0.375000 data: 0.000150 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.079238 (1.094871) acc1: 75.000000 (75.076220) acc5: 93.750000 (93.216463) time: 0.370449 data: 0.000145 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.103837 (1.096999) acc1: 75.000000 (75.232000) acc5: 93.750000 (93.216000) time: 0.364611 data: 0.000118 max mem: 18817 Test: Total time: 0:00:19 (0.400170 s / it) * Acc@1 74.632 Acc@5 92.802 loss 1.116 Max accuracy: 74.81% Epoch: [48/300] [ 0/1251] eta: 0:41:47 lr: 0.001877 loss: 3.962004 (3.962004) time: 2.004132 data: 1.089010 max mem: 18817 Epoch: [48/300] [ 50/1251] eta: 0:19:21 lr: 0.001877 loss: 3.863905 (3.733447) time: 0.930351 data: 0.000169 max mem: 18817 Epoch: [48/300] [ 100/1251] eta: 0:18:30 lr: 0.001877 loss: 3.697659 (3.739569) time: 0.922487 data: 0.000174 max mem: 18817 Epoch: [48/300] [ 150/1251] eta: 0:17:37 lr: 0.001877 loss: 3.694484 (3.721886) time: 0.983446 data: 0.000176 max mem: 18817 Epoch: [48/300] [ 200/1251] eta: 0:16:49 lr: 0.001877 loss: 3.725252 (3.750782) time: 1.020586 data: 0.000190 max mem: 18817 Epoch: [48/300] [ 250/1251] eta: 0:15:58 lr: 0.001876 loss: 3.724906 (3.722677) time: 0.967258 data: 0.000163 max mem: 18817 Epoch: [48/300] [ 300/1251] eta: 0:15:11 lr: 0.001876 loss: 3.786551 (3.709040) time: 0.930910 data: 0.000162 max mem: 18817 Epoch: [48/300] [ 350/1251] eta: 0:14:25 lr: 0.001876 loss: 3.996993 (3.709685) time: 0.942266 data: 0.000178 max mem: 18817 Epoch: [48/300] [ 400/1251] eta: 0:13:38 lr: 0.001876 loss: 3.888051 (3.709900) time: 1.013078 data: 0.000166 max mem: 18817 Epoch: [48/300] [ 450/1251] eta: 0:12:52 lr: 0.001876 loss: 3.501931 (3.705614) time: 1.051911 data: 0.000167 max mem: 18817 Epoch: [48/300] [ 500/1251] eta: 0:12:02 lr: 0.001875 loss: 4.168846 (3.718993) time: 0.963385 data: 0.000160 max mem: 18817 Epoch: [48/300] [ 550/1251] eta: 0:11:12 lr: 0.001875 loss: 3.961214 (3.717038) time: 0.906488 data: 0.000175 max mem: 18817 Epoch: [48/300] [ 600/1251] eta: 0:10:24 lr: 0.001875 loss: 4.041558 (3.722715) time: 0.925723 data: 0.000164 max mem: 18817 Epoch: [48/300] [ 650/1251] eta: 0:09:36 lr: 0.001875 loss: 3.824287 (3.724687) time: 0.972275 data: 0.000171 max mem: 18817 Epoch: [48/300] [ 700/1251] eta: 0:08:49 lr: 0.001875 loss: 3.924098 (3.725489) time: 1.034483 data: 0.000160 max mem: 18817 Epoch: [48/300] [ 750/1251] eta: 0:08:00 lr: 0.001874 loss: 3.960268 (3.725250) time: 0.970016 data: 0.000156 max mem: 18817 Epoch: [48/300] [ 800/1251] eta: 0:07:12 lr: 0.001874 loss: 3.895293 (3.731725) time: 0.924276 data: 0.000165 max mem: 18817 Epoch: [48/300] [ 850/1251] eta: 0:06:24 lr: 0.001874 loss: 3.753448 (3.721272) time: 0.923397 data: 0.000177 max mem: 18817 Epoch: [48/300] [ 900/1251] eta: 0:05:36 lr: 0.001874 loss: 3.455312 (3.722526) time: 0.975808 data: 0.000178 max mem: 18817 Epoch: [48/300] [ 950/1251] eta: 0:04:48 lr: 0.001874 loss: 3.703756 (3.725085) time: 1.029119 data: 0.000186 max mem: 18817 Epoch: [48/300] [1000/1251] eta: 0:04:00 lr: 0.001873 loss: 3.911016 (3.722585) time: 0.967560 data: 0.000162 max mem: 18817 Epoch: [48/300] [1050/1251] eta: 0:03:12 lr: 0.001873 loss: 3.524684 (3.718713) time: 0.924341 data: 0.000161 max mem: 18817 Epoch: [48/300] [1100/1251] eta: 0:02:24 lr: 0.001873 loss: 3.899601 (3.721120) time: 0.923365 data: 0.000161 max mem: 18817 Epoch: [48/300] [1150/1251] eta: 0:01:36 lr: 0.001873 loss: 3.472377 (3.713321) time: 0.992472 data: 0.000167 max mem: 18817 Epoch: [48/300] [1200/1251] eta: 0:00:48 lr: 0.001873 loss: 3.712430 (3.716562) time: 1.017360 data: 0.000162 max mem: 18817 Epoch: [48/300] [1250/1251] eta: 0:00:00 lr: 0.001872 loss: 3.487283 (3.715436) time: 0.967345 data: 0.000727 max mem: 18817 Epoch: [48/300] Total time: 0:19:59 (0.958486 s / it) Averaged stats: lr: 0.001872 loss: 3.487283 (3.712535) Test: [ 0/49] eta: 0:01:20 loss: 0.855298 (0.855298) acc1: 82.812500 (82.812500) acc5: 93.750000 (93.750000) time: 1.643871 data: 1.189124 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.983181 (0.993438) acc1: 76.562500 (77.272727) acc5: 93.750000 (93.181818) time: 0.487172 data: 0.108289 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.042178 (1.043485) acc1: 76.562500 (75.967262) acc5: 92.187500 (92.931548) time: 0.366984 data: 0.000165 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.048061 (1.045740) acc1: 76.562500 (75.907258) acc5: 93.750000 (93.296371) time: 0.362375 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.074758 (1.070400) acc1: 75.000000 (75.762195) acc5: 93.750000 (93.102134) time: 0.389763 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.074758 (1.075858) acc1: 75.000000 (75.872000) acc5: 92.187500 (92.896000) time: 0.402653 data: 0.000102 max mem: 18817 Test: Total time: 0:00:20 (0.410915 s / it) * Acc@1 74.904 Acc@5 92.832 loss 1.090 Max accuracy: 74.90% Epoch: [49/300] [ 0/1251] eta: 0:39:33 lr: 0.001872 loss: 3.953253 (3.953253) time: 1.897496 data: 1.009441 max mem: 18817 Epoch: [49/300] [ 50/1251] eta: 0:19:39 lr: 0.001872 loss: 3.876951 (3.709118) time: 0.932943 data: 0.000193 max mem: 18817 Epoch: [49/300] [ 100/1251] eta: 0:18:38 lr: 0.001872 loss: 3.680260 (3.628957) time: 0.984085 data: 0.000177 max mem: 18817 Epoch: [49/300] [ 150/1251] eta: 0:17:37 lr: 0.001872 loss: 3.700267 (3.628561) time: 0.969107 data: 0.000164 max mem: 18817 Epoch: [49/300] [ 200/1251] eta: 0:16:43 lr: 0.001871 loss: 3.340368 (3.616958) time: 0.921120 data: 0.000167 max mem: 18817 Epoch: [49/300] [ 250/1251] eta: 0:15:59 lr: 0.001871 loss: 3.912601 (3.626707) time: 0.931716 data: 0.000166 max mem: 18817 Epoch: [49/300] [ 300/1251] eta: 0:15:11 lr: 0.001871 loss: 3.634420 (3.627126) time: 0.970502 data: 0.000174 max mem: 18817 Epoch: [49/300] [ 350/1251] eta: 0:14:24 lr: 0.001871 loss: 3.777296 (3.627929) time: 0.995397 data: 0.000162 max mem: 18817 Epoch: [49/300] [ 400/1251] eta: 0:13:34 lr: 0.001871 loss: 3.722155 (3.647840) time: 0.967908 data: 0.000169 max mem: 18817 Epoch: [49/300] [ 450/1251] eta: 0:12:46 lr: 0.001870 loss: 4.019351 (3.661620) time: 0.908813 data: 0.000164 max mem: 18817 Epoch: [49/300] [ 500/1251] eta: 0:11:59 lr: 0.001870 loss: 3.729750 (3.653698) time: 0.932403 data: 0.000187 max mem: 18817 Epoch: [49/300] [ 550/1251] eta: 0:11:13 lr: 0.001870 loss: 3.618871 (3.651783) time: 0.980212 data: 0.000171 max mem: 18817 Epoch: [49/300] [ 600/1251] eta: 0:10:25 lr: 0.001870 loss: 3.879468 (3.660556) time: 0.987514 data: 0.000178 max mem: 18817 Epoch: [49/300] [ 650/1251] eta: 0:09:36 lr: 0.001870 loss: 3.894154 (3.675429) time: 0.968308 data: 0.000163 max mem: 18817 Epoch: [49/300] [ 700/1251] eta: 0:08:47 lr: 0.001869 loss: 3.665293 (3.668640) time: 0.915883 data: 0.000171 max mem: 18817 Epoch: [49/300] [ 750/1251] eta: 0:08:00 lr: 0.001869 loss: 3.864298 (3.672886) time: 0.931535 data: 0.000170 max mem: 18817 Epoch: [49/300] [ 800/1251] eta: 0:07:12 lr: 0.001869 loss: 3.847458 (3.669238) time: 0.966655 data: 0.000176 max mem: 18817 Epoch: [49/300] [ 850/1251] eta: 0:06:24 lr: 0.001869 loss: 3.711573 (3.665144) time: 0.986496 data: 0.000163 max mem: 18817 Epoch: [49/300] [ 900/1251] eta: 0:05:36 lr: 0.001869 loss: 3.727378 (3.668626) time: 0.973660 data: 0.000158 max mem: 18817 Epoch: [49/300] [ 950/1251] eta: 0:04:48 lr: 0.001868 loss: 3.456737 (3.665750) time: 0.914424 data: 0.000176 max mem: 18817 Epoch: [49/300] [1000/1251] eta: 0:04:00 lr: 0.001868 loss: 3.874430 (3.665537) time: 0.930175 data: 0.000161 max mem: 18817 Epoch: [49/300] [1050/1251] eta: 0:03:12 lr: 0.001868 loss: 3.758709 (3.664802) time: 0.950946 data: 0.000182 max mem: 18817 Epoch: [49/300] [1100/1251] eta: 0:02:24 lr: 0.001868 loss: 3.788656 (3.665674) time: 0.976612 data: 0.000170 max mem: 18817 Epoch: [49/300] [1150/1251] eta: 0:01:36 lr: 0.001868 loss: 3.883824 (3.666773) time: 0.985860 data: 0.000167 max mem: 18817 Epoch: [49/300] [1200/1251] eta: 0:00:48 lr: 0.001867 loss: 3.584074 (3.665966) time: 0.908018 data: 0.000177 max mem: 18817 Epoch: [49/300] [1250/1251] eta: 0:00:00 lr: 0.001867 loss: 3.923756 (3.668465) time: 0.918617 data: 0.000764 max mem: 18817 Epoch: [49/300] Total time: 0:20:00 (0.959538 s / it) Averaged stats: lr: 0.001867 loss: 3.923756 (3.666478) Test: [ 0/49] eta: 0:01:16 loss: 0.833165 (0.833165) acc1: 79.687500 (79.687500) acc5: 95.312500 (95.312500) time: 1.552280 data: 1.102065 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.946859 (0.979082) acc1: 76.562500 (75.994318) acc5: 92.187500 (92.755682) time: 0.475607 data: 0.100336 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.027255 (1.022813) acc1: 75.000000 (75.818452) acc5: 92.187500 (92.633929) time: 0.365702 data: 0.000151 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.051233 (1.023363) acc1: 75.000000 (75.201613) acc5: 93.750000 (93.296371) time: 0.365702 data: 0.000135 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.051233 (1.041193) acc1: 75.000000 (74.847561) acc5: 93.750000 (92.911585) time: 0.460201 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.093191 (1.047665) acc1: 73.437500 (74.752000) acc5: 93.750000 (92.896000) time: 0.454433 data: 0.000099 max mem: 18817 Test: Total time: 0:00:21 (0.429465 s / it) * Acc@1 75.310 Acc@5 93.096 loss 1.049 Max accuracy: 75.31% Epoch: [50/300] [ 0/1251] eta: 0:41:12 lr: 0.001867 loss: 3.149966 (3.149966) time: 1.976482 data: 1.092349 max mem: 18817 Epoch: [50/300] [ 50/1251] eta: 0:19:46 lr: 0.001867 loss: 3.540777 (3.629148) time: 1.053140 data: 0.000175 max mem: 18817 Epoch: [50/300] [ 100/1251] eta: 0:18:33 lr: 0.001867 loss: 3.575337 (3.618942) time: 0.964925 data: 0.000163 max mem: 18817 Epoch: [50/300] [ 150/1251] eta: 0:17:34 lr: 0.001867 loss: 3.918371 (3.675511) time: 0.912015 data: 0.000173 max mem: 18817 Epoch: [50/300] [ 200/1251] eta: 0:16:49 lr: 0.001866 loss: 3.779009 (3.672252) time: 0.927001 data: 0.000170 max mem: 18817 Epoch: [50/300] [ 250/1251] eta: 0:16:04 lr: 0.001866 loss: 3.805220 (3.662062) time: 1.003985 data: 0.000158 max mem: 18817 Epoch: [50/300] [ 300/1251] eta: 0:15:16 lr: 0.001866 loss: 3.331708 (3.660717) time: 1.019903 data: 0.000175 max mem: 18817 Epoch: [50/300] [ 350/1251] eta: 0:14:26 lr: 0.001866 loss: 3.894927 (3.695098) time: 0.967876 data: 0.000173 max mem: 18817 Epoch: [50/300] [ 400/1251] eta: 0:13:37 lr: 0.001866 loss: 3.761419 (3.704627) time: 0.912921 data: 0.000174 max mem: 18817 Epoch: [50/300] [ 450/1251] eta: 0:12:49 lr: 0.001865 loss: 3.753267 (3.706937) time: 0.925045 data: 0.000179 max mem: 18817 Epoch: [50/300] [ 500/1251] eta: 0:12:02 lr: 0.001865 loss: 3.626913 (3.701215) time: 0.999829 data: 0.000189 max mem: 18817 Epoch: [50/300] [ 550/1251] eta: 0:11:15 lr: 0.001865 loss: 3.972521 (3.705457) time: 0.990684 data: 0.000183 max mem: 18817 Epoch: [50/300] [ 600/1251] eta: 0:10:25 lr: 0.001865 loss: 3.450314 (3.700914) time: 0.969077 data: 0.000172 max mem: 18817 Epoch: [50/300] [ 650/1251] eta: 0:09:37 lr: 0.001864 loss: 3.346873 (3.695408) time: 0.908997 data: 0.000183 max mem: 18817 Epoch: [50/300] [ 700/1251] eta: 0:08:49 lr: 0.001864 loss: 3.606519 (3.695603) time: 0.915714 data: 0.000166 max mem: 18817 Epoch: [50/300] [ 750/1251] eta: 0:08:00 lr: 0.001864 loss: 3.568958 (3.693732) time: 0.976282 data: 0.000184 max mem: 18817 Epoch: [50/300] [ 800/1251] eta: 0:07:13 lr: 0.001864 loss: 3.969279 (3.705896) time: 1.047758 data: 0.000169 max mem: 18817 Epoch: [50/300] [ 850/1251] eta: 0:06:24 lr: 0.001864 loss: 3.753238 (3.706666) time: 0.965281 data: 0.000162 max mem: 18817 Epoch: [50/300] [ 900/1251] eta: 0:05:36 lr: 0.001863 loss: 4.026524 (3.709453) time: 0.915836 data: 0.000179 max mem: 18817 Epoch: [50/300] [ 950/1251] eta: 0:04:48 lr: 0.001863 loss: 3.837919 (3.705100) time: 0.930741 data: 0.000171 max mem: 18817 Epoch: [50/300] [1000/1251] eta: 0:04:00 lr: 0.001863 loss: 3.857214 (3.703443) time: 0.989614 data: 0.000167 max mem: 18817 Epoch: [50/300] [1050/1251] eta: 0:03:12 lr: 0.001863 loss: 3.567796 (3.700394) time: 1.027716 data: 0.000181 max mem: 18817 Epoch: [50/300] [1100/1251] eta: 0:02:24 lr: 0.001863 loss: 3.770481 (3.699246) time: 0.981289 data: 0.000181 max mem: 18817 Epoch: [50/300] [1150/1251] eta: 0:01:36 lr: 0.001862 loss: 3.730559 (3.702424) time: 0.922903 data: 0.000186 max mem: 18817 Epoch: [50/300] [1200/1251] eta: 0:00:48 lr: 0.001862 loss: 3.952672 (3.702158) time: 0.926540 data: 0.000180 max mem: 18817 Epoch: [50/300] [1250/1251] eta: 0:00:00 lr: 0.001862 loss: 4.037559 (3.702952) time: 0.989248 data: 0.000759 max mem: 18817 Epoch: [50/300] Total time: 0:20:00 (0.959892 s / it) Averaged stats: lr: 0.001862 loss: 4.037559 (3.701377) Test: [ 0/49] eta: 0:01:26 loss: 0.770976 (0.770976) acc1: 78.125000 (78.125000) acc5: 95.312500 (95.312500) time: 1.767631 data: 1.365450 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.942252 (0.960214) acc1: 78.125000 (75.852273) acc5: 93.750000 (93.465909) time: 0.517483 data: 0.124273 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.022371 (1.006021) acc1: 75.000000 (75.297619) acc5: 93.750000 (93.080357) time: 0.381891 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.043541 (1.003612) acc1: 73.437500 (75.252016) acc5: 93.750000 (93.346774) time: 0.367081 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.030457 (1.019314) acc1: 73.437500 (75.000000) acc5: 93.750000 (93.216463) time: 0.360636 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.017071 (1.019965) acc1: 75.000000 (75.232000) acc5: 93.750000 (93.088000) time: 0.355772 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.399077 s / it) * Acc@1 75.446 Acc@5 93.088 loss 1.019 Max accuracy: 75.45% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0050.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0050.pth Epoch: [51/300] [ 0/1251] eta: 0:44:47 lr: 0.001862 loss: 2.532893 (2.532893) time: 2.148485 data: 1.254415 max mem: 18817 Epoch: [51/300] [ 50/1251] eta: 0:19:42 lr: 0.001862 loss: 3.700749 (3.671920) time: 0.987652 data: 0.000264 max mem: 18817 Epoch: [51/300] [ 100/1251] eta: 0:18:47 lr: 0.001862 loss: 3.709430 (3.711119) time: 1.036554 data: 0.000211 max mem: 18817 Epoch: [51/300] [ 150/1251] eta: 0:17:41 lr: 0.001861 loss: 3.369730 (3.669998) time: 0.954713 data: 0.000210 max mem: 18817 Epoch: [51/300] [ 200/1251] eta: 0:16:46 lr: 0.001861 loss: 3.590448 (3.696088) time: 0.916955 data: 0.000211 max mem: 18817 Epoch: [51/300] [ 250/1251] eta: 0:16:00 lr: 0.001861 loss: 3.461050 (3.685119) time: 0.927631 data: 0.000158 max mem: 18817 Epoch: [51/300] [ 300/1251] eta: 0:15:13 lr: 0.001861 loss: 3.712436 (3.689814) time: 0.979157 data: 0.000170 max mem: 18817 Epoch: [51/300] [ 350/1251] eta: 0:14:26 lr: 0.001860 loss: 3.621047 (3.669590) time: 1.045133 data: 0.000176 max mem: 18817 Epoch: [51/300] [ 400/1251] eta: 0:13:37 lr: 0.001860 loss: 3.581413 (3.665154) time: 0.998624 data: 0.000162 max mem: 18817 Epoch: [51/300] [ 450/1251] eta: 0:12:49 lr: 0.001860 loss: 3.833779 (3.672190) time: 0.933156 data: 0.000191 max mem: 18817 Epoch: [51/300] [ 500/1251] eta: 0:12:01 lr: 0.001860 loss: 3.917085 (3.670802) time: 0.938997 data: 0.000183 max mem: 18817 Epoch: [51/300] [ 550/1251] eta: 0:11:14 lr: 0.001860 loss: 3.612451 (3.670855) time: 0.975658 data: 0.000175 max mem: 18817 Epoch: [51/300] [ 600/1251] eta: 0:10:25 lr: 0.001859 loss: 3.533281 (3.670475) time: 1.016422 data: 0.000181 max mem: 18817 Epoch: [51/300] [ 650/1251] eta: 0:09:37 lr: 0.001859 loss: 3.753718 (3.667890) time: 0.981449 data: 0.000186 max mem: 18817 Epoch: [51/300] [ 700/1251] eta: 0:08:48 lr: 0.001859 loss: 3.517573 (3.670634) time: 0.925475 data: 0.000183 max mem: 18817 Epoch: [51/300] [ 750/1251] eta: 0:08:01 lr: 0.001859 loss: 3.561155 (3.660766) time: 0.919814 data: 0.000179 max mem: 18817 Epoch: [51/300] [ 800/1251] eta: 0:07:13 lr: 0.001859 loss: 3.577181 (3.665282) time: 0.971227 data: 0.000187 max mem: 18817 Epoch: [51/300] [ 850/1251] eta: 0:06:25 lr: 0.001858 loss: 3.573209 (3.667714) time: 1.039447 data: 0.000182 max mem: 18817 Epoch: [51/300] [ 900/1251] eta: 0:05:36 lr: 0.001858 loss: 3.943840 (3.663169) time: 0.960738 data: 0.000178 max mem: 18817 Epoch: [51/300] [ 950/1251] eta: 0:04:48 lr: 0.001858 loss: 3.871499 (3.662403) time: 0.922596 data: 0.000181 max mem: 18817 Epoch: [51/300] [1000/1251] eta: 0:04:00 lr: 0.001858 loss: 3.620409 (3.656779) time: 0.923989 data: 0.000170 max mem: 18817 Epoch: [51/300] [1050/1251] eta: 0:03:12 lr: 0.001857 loss: 3.538546 (3.652573) time: 0.963170 data: 0.000166 max mem: 18817 Epoch: [51/300] [1100/1251] eta: 0:02:24 lr: 0.001857 loss: 3.842845 (3.657175) time: 0.991096 data: 0.000166 max mem: 18817 Epoch: [51/300] [1150/1251] eta: 0:01:36 lr: 0.001857 loss: 3.670817 (3.658144) time: 0.972898 data: 0.000169 max mem: 18817 Epoch: [51/300] [1200/1251] eta: 0:00:48 lr: 0.001857 loss: 3.680335 (3.657152) time: 0.933138 data: 0.000174 max mem: 18817 Epoch: [51/300] [1250/1251] eta: 0:00:00 lr: 0.001857 loss: 3.877904 (3.658774) time: 0.923658 data: 0.000742 max mem: 18817 Epoch: [51/300] Total time: 0:19:59 (0.958873 s / it) Averaged stats: lr: 0.001857 loss: 3.877904 (3.660008) Test: [ 0/49] eta: 0:01:16 loss: 0.911441 (0.911441) acc1: 78.125000 (78.125000) acc5: 95.312500 (95.312500) time: 1.568202 data: 1.115785 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.989770 (1.019148) acc1: 79.687500 (77.698864) acc5: 93.750000 (93.323864) time: 0.478538 data: 0.101608 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.017100 (1.049350) acc1: 75.000000 (76.711310) acc5: 92.187500 (93.229167) time: 0.365782 data: 0.000162 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.040378 (1.049805) acc1: 73.437500 (76.209677) acc5: 93.750000 (93.346774) time: 0.363540 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.072502 (1.069455) acc1: 75.000000 (75.685976) acc5: 93.750000 (93.178354) time: 0.361819 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.140931 (1.067988) acc1: 75.000000 (75.648000) acc5: 93.750000 (93.312000) time: 0.356297 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.390616 s / it) * Acc@1 75.312 Acc@5 93.068 loss 1.072 Max accuracy: 75.45% Epoch: [52/300] [ 0/1251] eta: 0:42:51 lr: 0.001857 loss: 3.169152 (3.169152) time: 2.055537 data: 1.174931 max mem: 18817 Epoch: [52/300] [ 50/1251] eta: 0:19:31 lr: 0.001856 loss: 3.711919 (3.647610) time: 1.016293 data: 0.000171 max mem: 18817 Epoch: [52/300] [ 100/1251] eta: 0:18:27 lr: 0.001856 loss: 3.570138 (3.622565) time: 0.977364 data: 0.000164 max mem: 18817 Epoch: [52/300] [ 150/1251] eta: 0:17:38 lr: 0.001856 loss: 3.882269 (3.704444) time: 0.933573 data: 0.000169 max mem: 18817 Epoch: [52/300] [ 200/1251] eta: 0:16:52 lr: 0.001856 loss: 3.553271 (3.702288) time: 0.922611 data: 0.000174 max mem: 18817 Epoch: [52/300] [ 250/1251] eta: 0:16:04 lr: 0.001856 loss: 3.708815 (3.693449) time: 0.983765 data: 0.000182 max mem: 18817 Epoch: [52/300] [ 300/1251] eta: 0:15:14 lr: 0.001855 loss: 3.859268 (3.677957) time: 0.996875 data: 0.000178 max mem: 18817 Epoch: [52/300] [ 350/1251] eta: 0:14:25 lr: 0.001855 loss: 3.706787 (3.671304) time: 0.971783 data: 0.000174 max mem: 18817 Epoch: [52/300] [ 400/1251] eta: 0:13:36 lr: 0.001855 loss: 3.863369 (3.692866) time: 0.922679 data: 0.000186 max mem: 18817 Epoch: [52/300] [ 450/1251] eta: 0:12:48 lr: 0.001855 loss: 3.877413 (3.692328) time: 0.928244 data: 0.000178 max mem: 18817 Epoch: [52/300] [ 500/1251] eta: 0:12:00 lr: 0.001854 loss: 3.854782 (3.706119) time: 0.966980 data: 0.000166 max mem: 18817 Epoch: [52/300] [ 550/1251] eta: 0:11:11 lr: 0.001854 loss: 3.517273 (3.701443) time: 0.971401 data: 0.000172 max mem: 18817 Epoch: [52/300] [ 600/1251] eta: 0:10:24 lr: 0.001854 loss: 3.676838 (3.694007) time: 0.985590 data: 0.000204 max mem: 18817 Epoch: [52/300] [ 650/1251] eta: 0:09:36 lr: 0.001854 loss: 3.654843 (3.688545) time: 0.923005 data: 0.000169 max mem: 18817 Epoch: [52/300] [ 700/1251] eta: 0:08:49 lr: 0.001854 loss: 3.736991 (3.680067) time: 0.928817 data: 0.000173 max mem: 18817 Epoch: [52/300] [ 750/1251] eta: 0:08:01 lr: 0.001853 loss: 3.902489 (3.681434) time: 0.979966 data: 0.000186 max mem: 18817 Epoch: [52/300] [ 800/1251] eta: 0:07:13 lr: 0.001853 loss: 3.923139 (3.675300) time: 1.009720 data: 0.000164 max mem: 18817 Epoch: [52/300] [ 850/1251] eta: 0:06:24 lr: 0.001853 loss: 3.694410 (3.676640) time: 0.961307 data: 0.000170 max mem: 18817 Epoch: [52/300] [ 900/1251] eta: 0:05:36 lr: 0.001853 loss: 3.973932 (3.680067) time: 0.951441 data: 0.000175 max mem: 18817 Epoch: [52/300] [ 950/1251] eta: 0:04:49 lr: 0.001852 loss: 3.900055 (3.676738) time: 0.932327 data: 0.000179 max mem: 18817 Epoch: [52/300] [1000/1251] eta: 0:04:01 lr: 0.001852 loss: 3.766136 (3.677470) time: 0.972923 data: 0.000172 max mem: 18817 Epoch: [52/300] [1050/1251] eta: 0:03:13 lr: 0.001852 loss: 3.744065 (3.678882) time: 1.002995 data: 0.000189 max mem: 18817 Epoch: [52/300] [1100/1251] eta: 0:02:25 lr: 0.001852 loss: 3.703595 (3.680577) time: 0.991152 data: 0.000164 max mem: 18817 Epoch: [52/300] [1150/1251] eta: 0:01:36 lr: 0.001852 loss: 3.735673 (3.684032) time: 0.923302 data: 0.000175 max mem: 18817 Epoch: [52/300] [1200/1251] eta: 0:00:49 lr: 0.001851 loss: 3.770819 (3.682914) time: 0.933114 data: 0.000182 max mem: 18817 Epoch: [52/300] [1250/1251] eta: 0:00:00 lr: 0.001851 loss: 3.764601 (3.678645) time: 0.991923 data: 0.000756 max mem: 18817 Epoch: [52/300] Total time: 0:20:03 (0.961655 s / it) Averaged stats: lr: 0.001851 loss: 3.764601 (3.679279) Test: [ 0/49] eta: 0:01:26 loss: 0.923064 (0.923064) acc1: 76.562500 (76.562500) acc5: 93.750000 (93.750000) time: 1.771443 data: 1.355068 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.923064 (0.960460) acc1: 78.125000 (77.982955) acc5: 93.750000 (93.607955) time: 0.518961 data: 0.123329 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.025336 (1.027412) acc1: 75.000000 (75.892857) acc5: 93.750000 (93.452381) time: 0.378537 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.013535 (1.007341) acc1: 75.000000 (76.159274) acc5: 93.750000 (93.951613) time: 0.363190 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.003634 (1.019080) acc1: 76.562500 (75.838415) acc5: 93.750000 (93.597561) time: 0.360470 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.023118 (1.020349) acc1: 75.000000 (75.776000) acc5: 92.187500 (93.568000) time: 0.355731 data: 0.000103 max mem: 18817 Test: Total time: 0:00:19 (0.398242 s / it) * Acc@1 75.652 Acc@5 93.170 loss 1.034 Max accuracy: 75.65% Epoch: [53/300] [ 0/1251] eta: 0:40:12 lr: 0.001851 loss: 4.042874 (4.042874) time: 1.928737 data: 1.026174 max mem: 18817 Epoch: [53/300] [ 50/1251] eta: 0:19:17 lr: 0.001851 loss: 3.699009 (3.511060) time: 0.947056 data: 0.000183 max mem: 18817 Epoch: [53/300] [ 100/1251] eta: 0:18:23 lr: 0.001851 loss: 3.951627 (3.620080) time: 0.934056 data: 0.000185 max mem: 18817 Epoch: [53/300] [ 150/1251] eta: 0:17:43 lr: 0.001851 loss: 3.380233 (3.591652) time: 0.944810 data: 0.000168 max mem: 18817 Epoch: [53/300] [ 200/1251] eta: 0:16:56 lr: 0.001850 loss: 3.621611 (3.620617) time: 0.997744 data: 0.000196 max mem: 18817 Epoch: [53/300] [ 250/1251] eta: 0:16:09 lr: 0.001850 loss: 3.477878 (3.631097) time: 1.024927 data: 0.000162 max mem: 18817 Epoch: [53/300] [ 300/1251] eta: 0:15:18 lr: 0.001850 loss: 3.682028 (3.640232) time: 0.968633 data: 0.000180 max mem: 18817 Epoch: [53/300] [ 350/1251] eta: 0:14:27 lr: 0.001850 loss: 3.345688 (3.616353) time: 0.937382 data: 0.000174 max mem: 18817 Epoch: [53/300] [ 400/1251] eta: 0:13:39 lr: 0.001849 loss: 3.781813 (3.621836) time: 0.938881 data: 0.000183 max mem: 18817 Epoch: [53/300] [ 450/1251] eta: 0:12:50 lr: 0.001849 loss: 3.746639 (3.627422) time: 0.976128 data: 0.000180 max mem: 18817 Epoch: [53/300] [ 500/1251] eta: 0:12:01 lr: 0.001849 loss: 3.688974 (3.631956) time: 0.978144 data: 0.000176 max mem: 18817 Epoch: [53/300] [ 550/1251] eta: 0:11:13 lr: 0.001849 loss: 4.134858 (3.655433) time: 0.965220 data: 0.000180 max mem: 18817 Epoch: [53/300] [ 600/1251] eta: 0:10:24 lr: 0.001849 loss: 3.786127 (3.646936) time: 0.928054 data: 0.000174 max mem: 18817 Epoch: [53/300] [ 650/1251] eta: 0:09:37 lr: 0.001848 loss: 3.936728 (3.653227) time: 0.919527 data: 0.000163 max mem: 18817 Epoch: [53/300] [ 700/1251] eta: 0:08:49 lr: 0.001848 loss: 3.691959 (3.647709) time: 0.993946 data: 0.000176 max mem: 18817 Epoch: [53/300] [ 750/1251] eta: 0:08:00 lr: 0.001848 loss: 3.769188 (3.645170) time: 0.960346 data: 0.000189 max mem: 18817 Epoch: [53/300] [ 800/1251] eta: 0:07:13 lr: 0.001848 loss: 3.589802 (3.643287) time: 0.985579 data: 0.000197 max mem: 18817 Epoch: [53/300] [ 850/1251] eta: 0:06:25 lr: 0.001847 loss: 3.393747 (3.636768) time: 0.940908 data: 0.000171 max mem: 18817 Epoch: [53/300] [ 900/1251] eta: 0:05:37 lr: 0.001847 loss: 3.546718 (3.638312) time: 0.929894 data: 0.000168 max mem: 18817 Epoch: [53/300] [ 950/1251] eta: 0:04:49 lr: 0.001847 loss: 3.629687 (3.633809) time: 0.984049 data: 0.000189 max mem: 18817 Epoch: [53/300] [1000/1251] eta: 0:04:00 lr: 0.001847 loss: 3.754548 (3.634946) time: 0.953939 data: 0.000181 max mem: 18817 Epoch: [53/300] [1050/1251] eta: 0:03:12 lr: 0.001847 loss: 3.845231 (3.633778) time: 0.950637 data: 0.000163 max mem: 18817 Epoch: [53/300] [1100/1251] eta: 0:02:24 lr: 0.001846 loss: 3.663159 (3.631841) time: 0.939112 data: 0.000155 max mem: 18817 Epoch: [53/300] [1150/1251] eta: 0:01:36 lr: 0.001846 loss: 3.763628 (3.632769) time: 0.933024 data: 0.000183 max mem: 18817 Epoch: [53/300] [1200/1251] eta: 0:00:48 lr: 0.001846 loss: 3.765182 (3.634657) time: 0.992843 data: 0.000161 max mem: 18817 Epoch: [53/300] [1250/1251] eta: 0:00:00 lr: 0.001846 loss: 3.504129 (3.628888) time: 0.989505 data: 0.000726 max mem: 18817 Epoch: [53/300] Total time: 0:20:01 (0.960395 s / it) Averaged stats: lr: 0.001846 loss: 3.504129 (3.633839) Test: [ 0/49] eta: 0:01:28 loss: 0.815191 (0.815191) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.804205 data: 1.410471 max mem: 18817 Test: [10/49] eta: 0:00:21 loss: 0.840113 (0.879822) acc1: 78.125000 (78.125000) acc5: 95.312500 (94.744318) time: 0.557757 data: 0.128395 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.964118 (0.931530) acc1: 76.562500 (77.157738) acc5: 93.750000 (94.122024) time: 0.417274 data: 0.000168 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.988394 (0.943983) acc1: 75.000000 (76.411290) acc5: 93.750000 (94.304435) time: 0.388956 data: 0.000141 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.046309 (0.964232) acc1: 75.000000 (75.990854) acc5: 93.750000 (94.245427) time: 0.367263 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.046309 (0.972177) acc1: 75.000000 (76.000000) acc5: 95.312500 (94.016000) time: 0.355620 data: 0.000107 max mem: 18817 Test: Total time: 0:00:20 (0.417682 s / it) * Acc@1 75.750 Acc@5 93.332 loss 1.000 Max accuracy: 75.75% Epoch: [54/300] [ 0/1251] eta: 0:57:03 lr: 0.001846 loss: 2.662882 (2.662882) time: 2.736363 data: 1.181201 max mem: 18817 Epoch: [54/300] [ 50/1251] eta: 0:19:43 lr: 0.001845 loss: 3.935095 (3.717222) time: 0.936487 data: 0.000187 max mem: 18817 Epoch: [54/300] [ 100/1251] eta: 0:18:54 lr: 0.001845 loss: 3.913239 (3.693786) time: 0.955667 data: 0.000168 max mem: 18817 Epoch: [54/300] [ 150/1251] eta: 0:18:02 lr: 0.001845 loss: 3.832896 (3.731350) time: 1.013602 data: 0.000168 max mem: 18817 Epoch: [54/300] [ 200/1251] eta: 0:17:08 lr: 0.001845 loss: 3.520050 (3.700418) time: 1.020911 data: 0.000174 max mem: 18817 Epoch: [54/300] [ 250/1251] eta: 0:16:16 lr: 0.001845 loss: 3.966815 (3.688755) time: 0.987605 data: 0.000171 max mem: 18817 Epoch: [54/300] [ 300/1251] eta: 0:15:22 lr: 0.001844 loss: 3.351985 (3.671047) time: 0.924792 data: 0.000171 max mem: 18817 Epoch: [54/300] [ 350/1251] eta: 0:14:35 lr: 0.001844 loss: 3.845071 (3.673688) time: 0.934635 data: 0.000196 max mem: 18817 Epoch: [54/300] [ 400/1251] eta: 0:13:46 lr: 0.001844 loss: 3.854969 (3.661492) time: 0.982314 data: 0.000173 max mem: 18817 Epoch: [54/300] [ 450/1251] eta: 0:12:58 lr: 0.001844 loss: 3.614459 (3.658664) time: 1.029222 data: 0.000169 max mem: 18817 Epoch: [54/300] [ 500/1251] eta: 0:12:07 lr: 0.001843 loss: 3.456575 (3.652485) time: 0.972346 data: 0.000175 max mem: 18817 Epoch: [54/300] [ 550/1251] eta: 0:11:18 lr: 0.001843 loss: 3.794197 (3.655426) time: 0.924637 data: 0.000168 max mem: 18817 Epoch: [54/300] [ 600/1251] eta: 0:10:29 lr: 0.001843 loss: 3.563528 (3.654297) time: 0.944333 data: 0.000188 max mem: 18817 Epoch: [54/300] [ 650/1251] eta: 0:09:41 lr: 0.001843 loss: 3.614191 (3.660488) time: 0.969212 data: 0.000166 max mem: 18817 Epoch: [54/300] [ 700/1251] eta: 0:08:52 lr: 0.001843 loss: 3.889456 (3.669671) time: 1.000369 data: 0.000160 max mem: 18817 Epoch: [54/300] [ 750/1251] eta: 0:08:03 lr: 0.001842 loss: 3.962665 (3.668387) time: 0.983292 data: 0.000165 max mem: 18817 Epoch: [54/300] [ 800/1251] eta: 0:07:14 lr: 0.001842 loss: 3.784838 (3.674382) time: 0.925883 data: 0.000164 max mem: 18817 Epoch: [54/300] [ 850/1251] eta: 0:06:26 lr: 0.001842 loss: 3.690980 (3.677164) time: 0.931470 data: 0.000176 max mem: 18817 Epoch: [54/300] [ 900/1251] eta: 0:05:38 lr: 0.001842 loss: 3.850180 (3.680945) time: 0.984477 data: 0.000171 max mem: 18817 Epoch: [54/300] [ 950/1251] eta: 0:04:50 lr: 0.001841 loss: 3.794007 (3.677652) time: 1.019112 data: 0.000174 max mem: 18817 Epoch: [54/300] [1000/1251] eta: 0:04:01 lr: 0.001841 loss: 3.865931 (3.678706) time: 0.958421 data: 0.000166 max mem: 18817 Epoch: [54/300] [1050/1251] eta: 0:03:13 lr: 0.001841 loss: 3.792452 (3.678680) time: 0.939356 data: 0.000178 max mem: 18817 Epoch: [54/300] [1100/1251] eta: 0:02:25 lr: 0.001841 loss: 3.799318 (3.679410) time: 0.929579 data: 0.000160 max mem: 18817 Epoch: [54/300] [1150/1251] eta: 0:01:37 lr: 0.001841 loss: 3.531384 (3.674308) time: 0.972329 data: 0.000169 max mem: 18817 Epoch: [54/300] [1200/1251] eta: 0:00:49 lr: 0.001840 loss: 3.707444 (3.676862) time: 0.965882 data: 0.000175 max mem: 18817 Epoch: [54/300] [1250/1251] eta: 0:00:00 lr: 0.001840 loss: 3.871704 (3.671289) time: 0.967199 data: 0.000755 max mem: 18817 Epoch: [54/300] Total time: 0:20:04 (0.962461 s / it) Averaged stats: lr: 0.001840 loss: 3.871704 (3.663801) Test: [ 0/49] eta: 0:01:32 loss: 0.877419 (0.877419) acc1: 81.250000 (81.250000) acc5: 93.750000 (93.750000) time: 1.896968 data: 1.484172 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.911433 (0.966806) acc1: 78.125000 (78.267045) acc5: 93.750000 (93.750000) time: 0.508224 data: 0.135061 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.070467 (1.027683) acc1: 75.000000 (76.562500) acc5: 92.187500 (93.154762) time: 0.366080 data: 0.000133 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.071266 (1.036536) acc1: 73.437500 (75.655242) acc5: 92.187500 (93.346774) time: 0.362834 data: 0.000122 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.064246 (1.046455) acc1: 75.000000 (76.028963) acc5: 93.750000 (93.178354) time: 0.362124 data: 0.000123 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.064246 (1.044302) acc1: 76.562500 (76.448000) acc5: 93.750000 (93.088000) time: 0.369595 data: 0.000105 max mem: 18817 Test: Total time: 0:00:19 (0.401722 s / it) * Acc@1 75.706 Acc@5 93.324 loss 1.049 Max accuracy: 75.75% Epoch: [55/300] [ 0/1251] eta: 0:41:51 lr: 0.001840 loss: 3.710407 (3.710407) time: 2.007343 data: 1.093382 max mem: 18817 Epoch: [55/300] [ 50/1251] eta: 0:19:48 lr: 0.001840 loss: 3.538943 (3.647203) time: 0.949197 data: 0.000148 max mem: 18817 Epoch: [55/300] [ 100/1251] eta: 0:18:53 lr: 0.001840 loss: 3.787828 (3.612356) time: 1.000234 data: 0.000181 max mem: 18817 Epoch: [55/300] [ 150/1251] eta: 0:17:50 lr: 0.001839 loss: 3.751244 (3.629117) time: 0.990915 data: 0.000175 max mem: 18817 Epoch: [55/300] [ 200/1251] eta: 0:16:57 lr: 0.001839 loss: 3.844458 (3.647612) time: 0.924415 data: 0.000175 max mem: 18817 Epoch: [55/300] [ 250/1251] eta: 0:16:10 lr: 0.001839 loss: 3.930160 (3.660475) time: 0.917892 data: 0.000167 max mem: 18817 Epoch: [55/300] [ 300/1251] eta: 0:15:23 lr: 0.001839 loss: 3.328838 (3.627276) time: 0.937062 data: 0.000152 max mem: 18817 Epoch: [55/300] [ 350/1251] eta: 0:14:35 lr: 0.001838 loss: 3.378528 (3.612978) time: 0.996963 data: 0.000167 max mem: 18817 Epoch: [55/300] [ 400/1251] eta: 0:13:44 lr: 0.001838 loss: 3.516200 (3.607649) time: 0.986726 data: 0.000174 max mem: 18817 Epoch: [55/300] [ 450/1251] eta: 0:12:56 lr: 0.001838 loss: 3.395464 (3.613748) time: 0.961175 data: 0.000168 max mem: 18817 Epoch: [55/300] [ 500/1251] eta: 0:12:06 lr: 0.001838 loss: 3.794264 (3.614169) time: 0.939808 data: 0.000171 max mem: 18817 Epoch: [55/300] [ 550/1251] eta: 0:11:19 lr: 0.001838 loss: 3.694226 (3.620048) time: 0.939149 data: 0.000161 max mem: 18817 Epoch: [55/300] [ 600/1251] eta: 0:10:30 lr: 0.001837 loss: 3.510909 (3.618953) time: 0.972620 data: 0.000167 max mem: 18817 Epoch: [55/300] [ 650/1251] eta: 0:09:40 lr: 0.001837 loss: 3.831417 (3.630270) time: 0.965272 data: 0.000173 max mem: 18817 Epoch: [55/300] [ 700/1251] eta: 0:08:52 lr: 0.001837 loss: 3.473846 (3.627775) time: 0.980287 data: 0.000164 max mem: 18817 Epoch: [55/300] [ 750/1251] eta: 0:08:03 lr: 0.001837 loss: 3.795249 (3.627184) time: 0.911608 data: 0.000175 max mem: 18817 Epoch: [55/300] [ 800/1251] eta: 0:07:15 lr: 0.001836 loss: 3.658938 (3.617835) time: 0.924233 data: 0.000168 max mem: 18817 Epoch: [55/300] [ 850/1251] eta: 0:06:27 lr: 0.001836 loss: 3.651552 (3.613329) time: 0.990643 data: 0.000191 max mem: 18817 Epoch: [55/300] [ 900/1251] eta: 0:05:38 lr: 0.001836 loss: 3.660280 (3.616986) time: 1.001778 data: 0.000165 max mem: 18817 Epoch: [55/300] [ 950/1251] eta: 0:04:50 lr: 0.001836 loss: 3.729333 (3.623558) time: 0.965458 data: 0.000167 max mem: 18817 Epoch: [55/300] [1000/1251] eta: 0:04:01 lr: 0.001835 loss: 3.884213 (3.625758) time: 0.914619 data: 0.000167 max mem: 18817 Epoch: [55/300] [1050/1251] eta: 0:03:13 lr: 0.001835 loss: 3.525619 (3.621799) time: 0.924493 data: 0.000173 max mem: 18817 Epoch: [55/300] [1100/1251] eta: 0:02:25 lr: 0.001835 loss: 3.843502 (3.617831) time: 0.996152 data: 0.000187 max mem: 18817 Epoch: [55/300] [1150/1251] eta: 0:01:37 lr: 0.001835 loss: 3.851941 (3.622060) time: 1.008159 data: 0.000180 max mem: 18817 Epoch: [55/300] [1200/1251] eta: 0:00:49 lr: 0.001835 loss: 3.241229 (3.613783) time: 0.969192 data: 0.000175 max mem: 18817 Epoch: [55/300] [1250/1251] eta: 0:00:00 lr: 0.001834 loss: 3.620150 (3.613110) time: 0.931745 data: 0.000736 max mem: 18817 Epoch: [55/300] Total time: 0:20:06 (0.964234 s / it) Averaged stats: lr: 0.001834 loss: 3.620150 (3.617212) Test: [ 0/49] eta: 0:01:13 loss: 0.776144 (0.776144) acc1: 81.250000 (81.250000) acc5: 96.875000 (96.875000) time: 1.508939 data: 1.083434 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.955160 (0.964924) acc1: 76.562500 (77.982955) acc5: 95.312500 (94.034091) time: 0.475150 data: 0.098671 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 1.047407 (1.017652) acc1: 75.000000 (76.116071) acc5: 93.750000 (93.601190) time: 0.405462 data: 0.000164 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.030198 (1.014463) acc1: 75.000000 (76.058468) acc5: 93.750000 (93.598790) time: 0.461678 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.006735 (1.022801) acc1: 76.562500 (76.181402) acc5: 93.750000 (93.445122) time: 0.420905 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.006735 (1.021819) acc1: 78.125000 (76.416000) acc5: 92.187500 (93.440000) time: 0.355118 data: 0.000101 max mem: 18817 Test: Total time: 0:00:20 (0.428452 s / it) * Acc@1 75.828 Acc@5 93.376 loss 1.036 Max accuracy: 75.83% Epoch: [56/300] [ 0/1251] eta: 0:41:26 lr: 0.001834 loss: 3.250294 (3.250294) time: 1.987462 data: 1.070902 max mem: 18817 Epoch: [56/300] [ 50/1251] eta: 0:19:46 lr: 0.001834 loss: 3.451754 (3.565920) time: 0.984647 data: 0.000178 max mem: 18817 Epoch: [56/300] [ 100/1251] eta: 0:18:31 lr: 0.001834 loss: 3.744086 (3.604412) time: 0.966905 data: 0.000170 max mem: 18817 Epoch: [56/300] [ 150/1251] eta: 0:17:35 lr: 0.001834 loss: 3.796172 (3.645776) time: 0.905433 data: 0.000178 max mem: 18817 Epoch: [56/300] [ 200/1251] eta: 0:16:53 lr: 0.001833 loss: 3.505044 (3.603589) time: 0.928270 data: 0.000180 max mem: 18817 Epoch: [56/300] [ 250/1251] eta: 0:16:07 lr: 0.001833 loss: 3.589907 (3.613021) time: 0.955399 data: 0.000201 max mem: 18817 Epoch: [56/300] [ 300/1251] eta: 0:15:17 lr: 0.001833 loss: 3.837551 (3.624949) time: 0.973715 data: 0.000173 max mem: 18817 Epoch: [56/300] [ 350/1251] eta: 0:14:26 lr: 0.001833 loss: 3.764473 (3.635451) time: 0.977494 data: 0.000162 max mem: 18817 Epoch: [56/300] [ 400/1251] eta: 0:13:36 lr: 0.001833 loss: 3.698071 (3.639133) time: 0.920604 data: 0.000196 max mem: 18817 Epoch: [56/300] [ 450/1251] eta: 0:12:49 lr: 0.001832 loss: 3.648867 (3.631569) time: 0.928621 data: 0.000167 max mem: 18817 Epoch: [56/300] [ 500/1251] eta: 0:12:01 lr: 0.001832 loss: 3.725565 (3.639346) time: 0.972909 data: 0.000169 max mem: 18817 Epoch: [56/300] [ 550/1251] eta: 0:11:14 lr: 0.001832 loss: 3.878022 (3.645240) time: 0.982697 data: 0.000180 max mem: 18817 Epoch: [56/300] [ 600/1251] eta: 0:10:25 lr: 0.001832 loss: 3.753782 (3.644075) time: 0.964326 data: 0.000181 max mem: 18817 Epoch: [56/300] [ 650/1251] eta: 0:09:37 lr: 0.001831 loss: 3.824965 (3.655372) time: 0.911646 data: 0.000166 max mem: 18817 Epoch: [56/300] [ 700/1251] eta: 0:08:49 lr: 0.001831 loss: 3.479521 (3.649069) time: 0.918378 data: 0.000171 max mem: 18817 Epoch: [56/300] [ 750/1251] eta: 0:08:01 lr: 0.001831 loss: 3.097650 (3.640335) time: 0.965136 data: 0.000171 max mem: 18817 Epoch: [56/300] [ 800/1251] eta: 0:07:13 lr: 0.001831 loss: 3.749599 (3.640251) time: 0.964917 data: 0.000180 max mem: 18817 Epoch: [56/300] [ 850/1251] eta: 0:06:25 lr: 0.001830 loss: 3.828231 (3.638106) time: 0.973721 data: 0.000180 max mem: 18817 Epoch: [56/300] [ 900/1251] eta: 0:05:36 lr: 0.001830 loss: 4.000950 (3.643810) time: 0.917989 data: 0.000178 max mem: 18817 Epoch: [56/300] [ 950/1251] eta: 0:04:48 lr: 0.001830 loss: 3.627399 (3.645179) time: 0.932836 data: 0.000162 max mem: 18817 Epoch: [56/300] [1000/1251] eta: 0:04:01 lr: 0.001830 loss: 3.959873 (3.652860) time: 0.970539 data: 0.000173 max mem: 18817 Epoch: [56/300] [1050/1251] eta: 0:03:13 lr: 0.001829 loss: 3.429401 (3.641974) time: 0.990287 data: 0.000182 max mem: 18817 Epoch: [56/300] [1100/1251] eta: 0:02:25 lr: 0.001829 loss: 3.662055 (3.644882) time: 0.982681 data: 0.000189 max mem: 18817 Epoch: [56/300] [1150/1251] eta: 0:01:36 lr: 0.001829 loss: 3.590268 (3.643403) time: 0.908749 data: 0.000164 max mem: 18817 Epoch: [56/300] [1200/1251] eta: 0:00:48 lr: 0.001829 loss: 3.918641 (3.646971) time: 0.927169 data: 0.000165 max mem: 18817 Epoch: [56/300] [1250/1251] eta: 0:00:00 lr: 0.001829 loss: 3.739285 (3.646613) time: 0.937666 data: 0.000744 max mem: 18817 Epoch: [56/300] Total time: 0:20:02 (0.961226 s / it) Averaged stats: lr: 0.001829 loss: 3.739285 (3.633864) Test: [ 0/49] eta: 0:01:26 loss: 0.690829 (0.690829) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.771454 data: 1.390080 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.840780 (0.925522) acc1: 78.125000 (79.403409) acc5: 95.312500 (93.892045) time: 0.504062 data: 0.126523 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.013569 (0.977460) acc1: 78.125000 (77.976190) acc5: 93.750000 (93.601190) time: 0.369424 data: 0.000162 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.013569 (0.970554) acc1: 76.562500 (77.570565) acc5: 93.750000 (93.850806) time: 0.364418 data: 0.000161 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.974357 (0.983134) acc1: 76.562500 (77.210366) acc5: 93.750000 (93.711890) time: 0.365706 data: 0.000144 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.021533 (0.985027) acc1: 75.000000 (77.216000) acc5: 93.750000 (93.888000) time: 0.361271 data: 0.000112 max mem: 18817 Test: Total time: 0:00:19 (0.396661 s / it) * Acc@1 76.216 Acc@5 93.412 loss 1.010 Max accuracy: 76.22% Epoch: [57/300] [ 0/1251] eta: 0:44:33 lr: 0.001829 loss: 2.267258 (2.267258) time: 2.137186 data: 1.121166 max mem: 18817 Epoch: [57/300] [ 50/1251] eta: 0:19:19 lr: 0.001828 loss: 3.602090 (3.474408) time: 0.961854 data: 0.000197 max mem: 18817 Epoch: [57/300] [ 100/1251] eta: 0:18:23 lr: 0.001828 loss: 3.883649 (3.576021) time: 0.923179 data: 0.000168 max mem: 18817 Epoch: [57/300] [ 150/1251] eta: 0:17:41 lr: 0.001828 loss: 3.799291 (3.608586) time: 0.922577 data: 0.000171 max mem: 18817 Epoch: [57/300] [ 200/1251] eta: 0:16:51 lr: 0.001828 loss: 3.841237 (3.610524) time: 0.950347 data: 0.000183 max mem: 18817 Epoch: [57/300] [ 250/1251] eta: 0:16:01 lr: 0.001827 loss: 3.571478 (3.579873) time: 0.998926 data: 0.000175 max mem: 18817 Epoch: [57/300] [ 300/1251] eta: 0:15:14 lr: 0.001827 loss: 3.470460 (3.578889) time: 0.981178 data: 0.000174 max mem: 18817 Epoch: [57/300] [ 350/1251] eta: 0:14:24 lr: 0.001827 loss: 3.681714 (3.588254) time: 0.928160 data: 0.000166 max mem: 18817 Epoch: [57/300] [ 400/1251] eta: 0:13:38 lr: 0.001827 loss: 3.872372 (3.609510) time: 0.936920 data: 0.000210 max mem: 18817 Epoch: [57/300] [ 450/1251] eta: 0:12:51 lr: 0.001826 loss: 3.843237 (3.608858) time: 0.985184 data: 0.000187 max mem: 18817 Epoch: [57/300] [ 500/1251] eta: 0:12:04 lr: 0.001826 loss: 3.597660 (3.602389) time: 1.038857 data: 0.000194 max mem: 18817 Epoch: [57/300] [ 550/1251] eta: 0:11:15 lr: 0.001826 loss: 3.701236 (3.608897) time: 0.976440 data: 0.000164 max mem: 18817 Epoch: [57/300] [ 600/1251] eta: 0:10:26 lr: 0.001826 loss: 3.477770 (3.596643) time: 0.913888 data: 0.000178 max mem: 18817 Epoch: [57/300] [ 650/1251] eta: 0:09:38 lr: 0.001826 loss: 3.737560 (3.596268) time: 0.935263 data: 0.000164 max mem: 18817 Epoch: [57/300] [ 700/1251] eta: 0:08:50 lr: 0.001825 loss: 3.694495 (3.606291) time: 0.989099 data: 0.000171 max mem: 18817 Epoch: [57/300] [ 750/1251] eta: 0:08:02 lr: 0.001825 loss: 3.524776 (3.605406) time: 1.022565 data: 0.000169 max mem: 18817 Epoch: [57/300] [ 800/1251] eta: 0:07:14 lr: 0.001825 loss: 3.740553 (3.609000) time: 0.982069 data: 0.000185 max mem: 18817 Epoch: [57/300] [ 850/1251] eta: 0:06:25 lr: 0.001825 loss: 3.593960 (3.605790) time: 0.921488 data: 0.000174 max mem: 18817 Epoch: [57/300] [ 900/1251] eta: 0:05:37 lr: 0.001824 loss: 3.409257 (3.598903) time: 0.923548 data: 0.000177 max mem: 18817 Epoch: [57/300] [ 950/1251] eta: 0:04:49 lr: 0.001824 loss: 3.882478 (3.609685) time: 0.995989 data: 0.000179 max mem: 18817 Epoch: [57/300] [1000/1251] eta: 0:04:01 lr: 0.001824 loss: 3.680019 (3.615416) time: 1.023450 data: 0.000179 max mem: 18817 Epoch: [57/300] [1050/1251] eta: 0:03:13 lr: 0.001824 loss: 3.540326 (3.611179) time: 0.979296 data: 0.000187 max mem: 18817 Epoch: [57/300] [1100/1251] eta: 0:02:25 lr: 0.001823 loss: 3.793100 (3.613358) time: 0.921002 data: 0.000173 max mem: 18817 Epoch: [57/300] [1150/1251] eta: 0:01:37 lr: 0.001823 loss: 3.978305 (3.613145) time: 0.925976 data: 0.000185 max mem: 18817 Epoch: [57/300] [1200/1251] eta: 0:00:49 lr: 0.001823 loss: 3.600279 (3.610746) time: 0.983958 data: 0.000178 max mem: 18817 Epoch: [57/300] [1250/1251] eta: 0:00:00 lr: 0.001823 loss: 3.595656 (3.607263) time: 1.033653 data: 0.000746 max mem: 18817 Epoch: [57/300] Total time: 0:20:04 (0.962718 s / it) Averaged stats: lr: 0.001823 loss: 3.595656 (3.603157) Test: [ 0/49] eta: 0:01:23 loss: 0.781772 (0.781772) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.708080 data: 1.291912 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.855163 (0.908394) acc1: 76.562500 (78.977273) acc5: 95.312500 (95.028409) time: 0.510847 data: 0.117654 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.943758 (0.961482) acc1: 76.562500 (77.008929) acc5: 95.312500 (94.568452) time: 0.379371 data: 0.000187 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.977176 (0.963068) acc1: 75.000000 (76.713710) acc5: 95.312500 (94.657258) time: 0.365336 data: 0.000156 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.962989 (0.974468) acc1: 76.562500 (76.714939) acc5: 93.750000 (94.245427) time: 0.371926 data: 0.000154 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.003520 (0.979181) acc1: 76.562500 (76.672000) acc5: 93.750000 (94.272000) time: 0.367036 data: 0.000118 max mem: 18817 Test: Total time: 0:00:19 (0.401603 s / it) * Acc@1 76.166 Acc@5 93.468 loss 1.000 Max accuracy: 76.22% Epoch: [58/300] [ 0/1251] eta: 0:42:04 lr: 0.001823 loss: 3.560748 (3.560748) time: 2.017731 data: 1.123047 max mem: 18817 Epoch: [58/300] [ 50/1251] eta: 0:19:30 lr: 0.001822 loss: 3.656594 (3.465343) time: 0.914300 data: 0.000182 max mem: 18817 Epoch: [58/300] [ 100/1251] eta: 0:18:38 lr: 0.001822 loss: 3.580140 (3.574448) time: 0.933051 data: 0.000186 max mem: 18817 Epoch: [58/300] [ 150/1251] eta: 0:17:51 lr: 0.001822 loss: 3.692529 (3.577157) time: 0.937138 data: 0.000176 max mem: 18817 Epoch: [58/300] [ 200/1251] eta: 0:17:01 lr: 0.001822 loss: 3.499163 (3.598779) time: 0.981258 data: 0.000168 max mem: 18817 Epoch: [58/300] [ 250/1251] eta: 0:16:06 lr: 0.001821 loss: 3.510353 (3.578746) time: 0.963702 data: 0.000177 max mem: 18817 Epoch: [58/300] [ 300/1251] eta: 0:15:16 lr: 0.001821 loss: 3.686028 (3.593081) time: 0.926522 data: 0.000168 max mem: 18817 Epoch: [58/300] [ 350/1251] eta: 0:14:29 lr: 0.001821 loss: 3.959851 (3.609286) time: 0.926591 data: 0.000160 max mem: 18817 Epoch: [58/300] [ 400/1251] eta: 0:13:43 lr: 0.001821 loss: 3.888627 (3.612318) time: 0.934238 data: 0.000174 max mem: 18817 Epoch: [58/300] [ 450/1251] eta: 0:12:54 lr: 0.001821 loss: 3.648467 (3.635365) time: 0.990804 data: 0.000161 max mem: 18817 Epoch: [58/300] [ 500/1251] eta: 0:12:05 lr: 0.001820 loss: 3.378841 (3.628437) time: 0.970450 data: 0.000151 max mem: 18817 Epoch: [58/300] [ 550/1251] eta: 0:11:17 lr: 0.001820 loss: 3.703811 (3.638073) time: 0.934450 data: 0.000176 max mem: 18817 Epoch: [58/300] [ 600/1251] eta: 0:10:28 lr: 0.001820 loss: 3.679066 (3.647601) time: 0.937617 data: 0.000172 max mem: 18817 Epoch: [58/300] [ 650/1251] eta: 0:09:39 lr: 0.001820 loss: 3.793330 (3.647827) time: 0.927891 data: 0.000164 max mem: 18817 Epoch: [58/300] [ 700/1251] eta: 0:08:52 lr: 0.001819 loss: 3.635235 (3.645547) time: 0.970097 data: 0.000180 max mem: 18817 Epoch: [58/300] [ 750/1251] eta: 0:08:03 lr: 0.001819 loss: 3.639773 (3.646562) time: 0.978414 data: 0.000170 max mem: 18817 Epoch: [58/300] [ 800/1251] eta: 0:07:15 lr: 0.001819 loss: 3.872858 (3.641386) time: 0.988803 data: 0.000179 max mem: 18817 Epoch: [58/300] [ 850/1251] eta: 0:06:26 lr: 0.001819 loss: 3.601382 (3.639975) time: 0.923105 data: 0.000161 max mem: 18817 Epoch: [58/300] [ 900/1251] eta: 0:05:38 lr: 0.001818 loss: 3.723996 (3.642161) time: 0.926854 data: 0.000174 max mem: 18817 Epoch: [58/300] [ 950/1251] eta: 0:04:50 lr: 0.001818 loss: 3.743968 (3.639813) time: 0.992290 data: 0.000175 max mem: 18817 Epoch: [58/300] [1000/1251] eta: 0:04:01 lr: 0.001818 loss: 3.465425 (3.632866) time: 1.000535 data: 0.000190 max mem: 18817 Epoch: [58/300] [1050/1251] eta: 0:03:13 lr: 0.001818 loss: 3.795699 (3.634178) time: 0.973855 data: 0.000194 max mem: 18817 Epoch: [58/300] [1100/1251] eta: 0:02:25 lr: 0.001817 loss: 3.441253 (3.630623) time: 0.940315 data: 0.000194 max mem: 18817 Epoch: [58/300] [1150/1251] eta: 0:01:37 lr: 0.001817 loss: 3.366604 (3.625564) time: 0.935073 data: 0.000171 max mem: 18817 Epoch: [58/300] [1200/1251] eta: 0:00:49 lr: 0.001817 loss: 3.380562 (3.624870) time: 0.976685 data: 0.000182 max mem: 18817 Epoch: [58/300] [1250/1251] eta: 0:00:00 lr: 0.001817 loss: 3.616036 (3.624513) time: 1.020960 data: 0.000769 max mem: 18817 Epoch: [58/300] Total time: 0:20:08 (0.965784 s / it) Averaged stats: lr: 0.001817 loss: 3.616036 (3.622005) Test: [ 0/49] eta: 0:01:29 loss: 0.792462 (0.792462) acc1: 79.687500 (79.687500) acc5: 96.875000 (96.875000) time: 1.829273 data: 1.411270 max mem: 18817 Test: [10/49] eta: 0:00:21 loss: 0.823375 (0.889803) acc1: 75.000000 (77.698864) acc5: 93.750000 (94.034091) time: 0.538623 data: 0.128445 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.985989 (0.942347) acc1: 75.000000 (76.562500) acc5: 93.750000 (93.675595) time: 0.393052 data: 0.000159 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.985989 (0.949749) acc1: 75.000000 (76.058468) acc5: 93.750000 (93.901210) time: 0.370173 data: 0.000149 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.960289 (0.969918) acc1: 75.000000 (75.876524) acc5: 93.750000 (93.711890) time: 0.364171 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.055225 (0.973390) acc1: 75.000000 (76.224000) acc5: 93.750000 (93.728000) time: 0.364680 data: 0.000098 max mem: 18817 Test: Total time: 0:00:20 (0.409170 s / it) * Acc@1 76.458 Acc@5 93.554 loss 0.982 Max accuracy: 76.46% Epoch: [59/300] [ 0/1251] eta: 0:40:13 lr: 0.001817 loss: 3.961921 (3.961921) time: 1.929454 data: 1.023179 max mem: 18817 Epoch: [59/300] [ 50/1251] eta: 0:19:50 lr: 0.001816 loss: 3.673916 (3.615780) time: 0.934351 data: 0.000193 max mem: 18817 Epoch: [59/300] [ 100/1251] eta: 0:18:48 lr: 0.001816 loss: 3.785761 (3.544982) time: 0.964581 data: 0.000177 max mem: 18817 Epoch: [59/300] [ 150/1251] eta: 0:17:54 lr: 0.001816 loss: 3.555714 (3.535078) time: 0.975048 data: 0.000161 max mem: 18817 Epoch: [59/300] [ 200/1251] eta: 0:17:00 lr: 0.001816 loss: 3.798489 (3.574068) time: 0.985921 data: 0.000493 max mem: 18817 Epoch: [59/300] [ 250/1251] eta: 0:16:07 lr: 0.001815 loss: 3.626640 (3.581568) time: 0.914124 data: 0.000168 max mem: 18817 Epoch: [59/300] [ 300/1251] eta: 0:15:21 lr: 0.001815 loss: 3.764207 (3.587703) time: 0.945899 data: 0.000186 max mem: 18817 Epoch: [59/300] [ 350/1251] eta: 0:14:32 lr: 0.001815 loss: 3.673681 (3.599864) time: 0.949565 data: 0.000173 max mem: 18817 Epoch: [59/300] [ 400/1251] eta: 0:13:44 lr: 0.001815 loss: 3.529420 (3.591723) time: 1.000976 data: 0.000170 max mem: 18817 Epoch: [59/300] [ 450/1251] eta: 0:12:53 lr: 0.001815 loss: 3.435441 (3.574304) time: 0.968583 data: 0.000186 max mem: 18817 Epoch: [59/300] [ 500/1251] eta: 0:12:03 lr: 0.001814 loss: 3.766512 (3.587372) time: 0.926909 data: 0.000176 max mem: 18817 Epoch: [59/300] [ 550/1251] eta: 0:11:16 lr: 0.001814 loss: 3.794424 (3.601301) time: 0.927692 data: 0.000171 max mem: 18817 Epoch: [59/300] [ 600/1251] eta: 0:10:28 lr: 0.001814 loss: 3.685143 (3.600548) time: 0.940926 data: 0.000166 max mem: 18817 Epoch: [59/300] [ 650/1251] eta: 0:09:40 lr: 0.001814 loss: 3.670109 (3.595338) time: 0.981690 data: 0.000164 max mem: 18817 Epoch: [59/300] [ 700/1251] eta: 0:08:51 lr: 0.001813 loss: 3.563884 (3.601265) time: 0.976701 data: 0.000195 max mem: 18817 Epoch: [59/300] [ 750/1251] eta: 0:08:02 lr: 0.001813 loss: 3.345968 (3.590528) time: 0.926875 data: 0.000181 max mem: 18817 Epoch: [59/300] [ 800/1251] eta: 0:07:14 lr: 0.001813 loss: 3.758476 (3.597193) time: 0.937549 data: 0.000171 max mem: 18817 Epoch: [59/300] [ 850/1251] eta: 0:06:26 lr: 0.001813 loss: 3.905158 (3.604365) time: 0.937845 data: 0.000186 max mem: 18817 Epoch: [59/300] [ 900/1251] eta: 0:05:38 lr: 0.001812 loss: 3.578852 (3.608491) time: 0.986333 data: 0.000176 max mem: 18817 Epoch: [59/300] [ 950/1251] eta: 0:04:49 lr: 0.001812 loss: 3.789785 (3.616690) time: 0.947999 data: 0.000170 max mem: 18817 Epoch: [59/300] [1000/1251] eta: 0:04:01 lr: 0.001812 loss: 3.866654 (3.617251) time: 0.930306 data: 0.000178 max mem: 18817 Epoch: [59/300] [1050/1251] eta: 0:03:13 lr: 0.001812 loss: 3.661931 (3.618069) time: 0.936950 data: 0.000169 max mem: 18817 Epoch: [59/300] [1100/1251] eta: 0:02:25 lr: 0.001811 loss: 3.829479 (3.619173) time: 0.933790 data: 0.000168 max mem: 18817 Epoch: [59/300] [1150/1251] eta: 0:01:37 lr: 0.001811 loss: 3.655154 (3.619010) time: 0.981825 data: 0.000172 max mem: 18817 Epoch: [59/300] [1200/1251] eta: 0:00:49 lr: 0.001811 loss: 3.470176 (3.615435) time: 0.969682 data: 0.000189 max mem: 18817 Epoch: [59/300] [1250/1251] eta: 0:00:00 lr: 0.001811 loss: 3.861439 (3.616206) time: 0.902316 data: 0.000797 max mem: 18817 Epoch: [59/300] Total time: 0:20:03 (0.961771 s / it) Averaged stats: lr: 0.001811 loss: 3.861439 (3.610564) Test: [ 0/49] eta: 0:01:59 loss: 0.842481 (0.842481) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 2.432435 data: 1.206323 max mem: 18817 Test: [10/49] eta: 0:00:23 loss: 0.860206 (0.918227) acc1: 78.125000 (79.545455) acc5: 93.750000 (93.323864) time: 0.613144 data: 0.109814 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.986294 (0.960649) acc1: 78.125000 (78.199405) acc5: 92.187500 (93.005952) time: 0.399777 data: 0.000155 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.007071 (0.960585) acc1: 76.562500 (77.872984) acc5: 93.750000 (93.346774) time: 0.365297 data: 0.000150 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.003745 (0.975935) acc1: 76.562500 (77.477134) acc5: 93.750000 (93.292683) time: 0.360526 data: 0.000154 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.003745 (0.979849) acc1: 75.000000 (76.960000) acc5: 93.750000 (93.440000) time: 0.359717 data: 0.000123 max mem: 18817 Test: Total time: 0:00:20 (0.421089 s / it) * Acc@1 76.412 Acc@5 93.528 loss 0.978 Max accuracy: 76.46% Epoch: [60/300] [ 0/1251] eta: 0:40:40 lr: 0.001811 loss: 3.773545 (3.773545) time: 1.951074 data: 1.056932 max mem: 18817 Epoch: [60/300] [ 50/1251] eta: 0:19:58 lr: 0.001810 loss: 3.574870 (3.601866) time: 1.013605 data: 0.000177 max mem: 18817 Epoch: [60/300] [ 100/1251] eta: 0:18:49 lr: 0.001810 loss: 3.455683 (3.572391) time: 0.991924 data: 0.000184 max mem: 18817 Epoch: [60/300] [ 150/1251] eta: 0:17:49 lr: 0.001810 loss: 3.711011 (3.585864) time: 0.976203 data: 0.000167 max mem: 18817 Epoch: [60/300] [ 200/1251] eta: 0:16:53 lr: 0.001810 loss: 3.611751 (3.606207) time: 0.906137 data: 0.000163 max mem: 18817 Epoch: [60/300] [ 250/1251] eta: 0:16:07 lr: 0.001809 loss: 3.770807 (3.610754) time: 0.934433 data: 0.000154 max mem: 18817 Epoch: [60/300] [ 300/1251] eta: 0:15:20 lr: 0.001809 loss: 3.514071 (3.600337) time: 1.003050 data: 0.000172 max mem: 18817 Epoch: [60/300] [ 350/1251] eta: 0:14:32 lr: 0.001809 loss: 3.929673 (3.595355) time: 0.997213 data: 0.000168 max mem: 18817 Epoch: [60/300] [ 400/1251] eta: 0:13:42 lr: 0.001809 loss: 3.910163 (3.611287) time: 0.976970 data: 0.000165 max mem: 18817 Epoch: [60/300] [ 450/1251] eta: 0:12:52 lr: 0.001808 loss: 3.818881 (3.606299) time: 0.921556 data: 0.000172 max mem: 18817 Epoch: [60/300] [ 500/1251] eta: 0:12:05 lr: 0.001808 loss: 3.524357 (3.610494) time: 0.940093 data: 0.000163 max mem: 18817 Epoch: [60/300] [ 550/1251] eta: 0:11:17 lr: 0.001808 loss: 3.854262 (3.624429) time: 0.953164 data: 0.000172 max mem: 18817 Epoch: [60/300] [ 600/1251] eta: 0:10:29 lr: 0.001808 loss: 3.690384 (3.626711) time: 0.997305 data: 0.000173 max mem: 18817 Epoch: [60/300] [ 650/1251] eta: 0:09:40 lr: 0.001807 loss: 3.704477 (3.631175) time: 0.961033 data: 0.000179 max mem: 18817 Epoch: [60/300] [ 700/1251] eta: 0:08:51 lr: 0.001807 loss: 3.700475 (3.630397) time: 0.919837 data: 0.000162 max mem: 18817 Epoch: [60/300] [ 750/1251] eta: 0:08:03 lr: 0.001807 loss: 3.567083 (3.624988) time: 0.924733 data: 0.000167 max mem: 18817 Epoch: [60/300] [ 800/1251] eta: 0:07:15 lr: 0.001807 loss: 3.698558 (3.621577) time: 0.960502 data: 0.000169 max mem: 18817 Epoch: [60/300] [ 850/1251] eta: 0:06:27 lr: 0.001806 loss: 3.666020 (3.618658) time: 1.002267 data: 0.000182 max mem: 18817 Epoch: [60/300] [ 900/1251] eta: 0:05:38 lr: 0.001806 loss: 3.493893 (3.616147) time: 0.976010 data: 0.000185 max mem: 18817 Epoch: [60/300] [ 950/1251] eta: 0:04:50 lr: 0.001806 loss: 3.463756 (3.612765) time: 0.915867 data: 0.000170 max mem: 18817 Epoch: [60/300] [1000/1251] eta: 0:04:02 lr: 0.001806 loss: 3.580432 (3.609957) time: 0.911709 data: 0.000162 max mem: 18817 Epoch: [60/300] [1050/1251] eta: 0:03:13 lr: 0.001805 loss: 3.663085 (3.615597) time: 0.926028 data: 0.000174 max mem: 18817 Epoch: [60/300] [1100/1251] eta: 0:02:25 lr: 0.001805 loss: 3.424680 (3.616955) time: 0.973976 data: 0.000188 max mem: 18817 Epoch: [60/300] [1150/1251] eta: 0:01:37 lr: 0.001805 loss: 3.614622 (3.614126) time: 0.960290 data: 0.000171 max mem: 18817 Epoch: [60/300] [1200/1251] eta: 0:00:49 lr: 0.001805 loss: 3.778827 (3.615796) time: 0.912048 data: 0.000164 max mem: 18817 Epoch: [60/300] [1250/1251] eta: 0:00:00 lr: 0.001805 loss: 3.584709 (3.615761) time: 0.931369 data: 0.000734 max mem: 18817 Epoch: [60/300] Total time: 0:20:05 (0.963360 s / it) Averaged stats: lr: 0.001805 loss: 3.584709 (3.608095) Test: [ 0/49] eta: 0:01:14 loss: 0.817125 (0.817125) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.526973 data: 1.074431 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.891976 (0.939842) acc1: 76.562500 (78.267045) acc5: 93.750000 (93.892045) time: 0.475582 data: 0.097808 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.947856 (0.981853) acc1: 75.000000 (76.860119) acc5: 93.750000 (93.898810) time: 0.366538 data: 0.000134 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.951819 (0.974842) acc1: 75.000000 (76.562500) acc5: 95.312500 (94.304435) time: 0.382109 data: 0.000120 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.951819 (0.992675) acc1: 76.562500 (76.638720) acc5: 93.750000 (94.016768) time: 0.468546 data: 0.000115 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.992633 (0.994594) acc1: 76.562500 (76.480000) acc5: 93.750000 (93.984000) time: 0.462927 data: 0.000097 max mem: 18817 Test: Total time: 0:00:21 (0.433065 s / it) * Acc@1 76.544 Acc@5 93.612 loss 1.002 Max accuracy: 76.54% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0060.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0060.pth Epoch: [61/300] [ 0/1251] eta: 0:42:52 lr: 0.001805 loss: 4.764658 (4.764658) time: 2.056429 data: 1.131330 max mem: 18817 Epoch: [61/300] [ 50/1251] eta: 0:19:53 lr: 0.001804 loss: 3.634050 (3.520945) time: 0.948935 data: 0.000163 max mem: 18817 Epoch: [61/300] [ 100/1251] eta: 0:19:04 lr: 0.001804 loss: 4.021946 (3.566901) time: 1.011582 data: 0.000169 max mem: 18817 Epoch: [61/300] [ 150/1251] eta: 0:18:00 lr: 0.001804 loss: 3.566554 (3.568800) time: 0.998467 data: 0.000172 max mem: 18817 Epoch: [61/300] [ 200/1251] eta: 0:17:03 lr: 0.001804 loss: 3.746795 (3.577804) time: 0.966196 data: 0.000151 max mem: 18817 Epoch: [61/300] [ 250/1251] eta: 0:16:10 lr: 0.001803 loss: 3.642331 (3.587492) time: 0.927575 data: 0.000175 max mem: 18817 Epoch: [61/300] [ 300/1251] eta: 0:15:20 lr: 0.001803 loss: 3.684197 (3.592715) time: 0.927542 data: 0.000161 max mem: 18817 Epoch: [61/300] [ 350/1251] eta: 0:14:30 lr: 0.001803 loss: 3.720778 (3.594806) time: 0.964784 data: 0.000164 max mem: 18817 Epoch: [61/300] [ 400/1251] eta: 0:13:39 lr: 0.001803 loss: 3.665940 (3.597871) time: 0.967264 data: 0.000162 max mem: 18817 Epoch: [61/300] [ 450/1251] eta: 0:12:48 lr: 0.001802 loss: 3.562695 (3.597361) time: 0.927143 data: 0.000173 max mem: 18817 Epoch: [61/300] [ 500/1251] eta: 0:12:01 lr: 0.001802 loss: 3.705108 (3.599682) time: 0.926017 data: 0.000177 max mem: 18817 Epoch: [61/300] [ 550/1251] eta: 0:11:13 lr: 0.001802 loss: 3.586138 (3.606501) time: 0.928574 data: 0.000167 max mem: 18817 Epoch: [61/300] [ 600/1251] eta: 0:10:25 lr: 0.001802 loss: 3.681952 (3.610330) time: 0.978485 data: 0.000168 max mem: 18817 Epoch: [61/300] [ 650/1251] eta: 0:09:36 lr: 0.001801 loss: 3.448212 (3.607198) time: 0.979699 data: 0.000165 max mem: 18817 Epoch: [61/300] [ 700/1251] eta: 0:08:48 lr: 0.001801 loss: 3.593720 (3.599613) time: 0.908301 data: 0.000174 max mem: 18817 Epoch: [61/300] [ 750/1251] eta: 0:08:00 lr: 0.001801 loss: 3.458630 (3.594389) time: 0.930164 data: 0.000174 max mem: 18817 Epoch: [61/300] [ 800/1251] eta: 0:07:13 lr: 0.001801 loss: 3.748437 (3.602734) time: 0.946340 data: 0.000161 max mem: 18817 Epoch: [61/300] [ 850/1251] eta: 0:06:25 lr: 0.001800 loss: 3.606994 (3.602030) time: 1.004560 data: 0.000172 max mem: 18817 Epoch: [61/300] [ 900/1251] eta: 0:05:37 lr: 0.001800 loss: 3.872211 (3.610547) time: 0.975823 data: 0.000171 max mem: 18817 Epoch: [61/300] [ 950/1251] eta: 0:04:48 lr: 0.001800 loss: 3.673283 (3.611408) time: 0.918389 data: 0.000167 max mem: 18817 Epoch: [61/300] [1000/1251] eta: 0:04:00 lr: 0.001800 loss: 3.720772 (3.611347) time: 0.921760 data: 0.000163 max mem: 18817 Epoch: [61/300] [1050/1251] eta: 0:03:13 lr: 0.001799 loss: 3.789827 (3.614420) time: 0.944293 data: 0.000169 max mem: 18817 Epoch: [61/300] [1100/1251] eta: 0:02:25 lr: 0.001799 loss: 3.897056 (3.616701) time: 0.983139 data: 0.000181 max mem: 18817 Epoch: [61/300] [1150/1251] eta: 0:01:36 lr: 0.001799 loss: 3.596091 (3.612357) time: 0.961001 data: 0.000161 max mem: 18817 Epoch: [61/300] [1200/1251] eta: 0:00:48 lr: 0.001799 loss: 3.462111 (3.613188) time: 0.927886 data: 0.000177 max mem: 18817 Epoch: [61/300] [1250/1251] eta: 0:00:00 lr: 0.001798 loss: 3.563087 (3.609423) time: 0.913881 data: 0.000764 max mem: 18817 Epoch: [61/300] Total time: 0:20:01 (0.960272 s / it) Averaged stats: lr: 0.001798 loss: 3.563087 (3.605184) Test: [ 0/49] eta: 0:01:28 loss: 0.758304 (0.758304) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.800066 data: 1.380636 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.811764 (0.920624) acc1: 76.562500 (78.409091) acc5: 95.312500 (94.602273) time: 0.501773 data: 0.125661 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 1.012607 (0.973594) acc1: 75.000000 (76.488095) acc5: 92.187500 (93.973214) time: 0.367237 data: 0.000161 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.012607 (0.963664) acc1: 75.000000 (76.864919) acc5: 93.750000 (94.304435) time: 0.454077 data: 0.000152 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.930432 (0.977005) acc1: 76.562500 (76.676829) acc5: 95.312500 (94.245427) time: 0.452451 data: 0.000147 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.053563 (0.986187) acc1: 75.000000 (76.544000) acc5: 93.750000 (94.016000) time: 0.356028 data: 0.000118 max mem: 18817 Test: Total time: 0:00:21 (0.432037 s / it) * Acc@1 76.488 Acc@5 93.724 loss 0.994 Max accuracy: 76.54% Epoch: [62/300] [ 0/1251] eta: 0:43:06 lr: 0.001798 loss: 3.538911 (3.538911) time: 2.067878 data: 1.161031 max mem: 18817 Epoch: [62/300] [ 50/1251] eta: 0:19:53 lr: 0.001798 loss: 3.664713 (3.581852) time: 1.044815 data: 0.000171 max mem: 18817 Epoch: [62/300] [ 100/1251] eta: 0:18:39 lr: 0.001798 loss: 3.410976 (3.511082) time: 0.981269 data: 0.000174 max mem: 18817 Epoch: [62/300] [ 150/1251] eta: 0:17:40 lr: 0.001798 loss: 3.569603 (3.485668) time: 0.912105 data: 0.000169 max mem: 18817 Epoch: [62/300] [ 200/1251] eta: 0:16:53 lr: 0.001797 loss: 3.718790 (3.487143) time: 0.944528 data: 0.000187 max mem: 18817 Epoch: [62/300] [ 250/1251] eta: 0:16:04 lr: 0.001797 loss: 3.719311 (3.510157) time: 0.971338 data: 0.000181 max mem: 18817 Epoch: [62/300] [ 300/1251] eta: 0:15:17 lr: 0.001797 loss: 3.696875 (3.518233) time: 1.038018 data: 0.000178 max mem: 18817 Epoch: [62/300] [ 350/1251] eta: 0:14:26 lr: 0.001797 loss: 3.774322 (3.521787) time: 0.962399 data: 0.000178 max mem: 18817 Epoch: [62/300] [ 400/1251] eta: 0:13:36 lr: 0.001796 loss: 3.745997 (3.548422) time: 0.922139 data: 0.000165 max mem: 18817 Epoch: [62/300] [ 450/1251] eta: 0:12:49 lr: 0.001796 loss: 3.185238 (3.537055) time: 0.924333 data: 0.000171 max mem: 18817 Epoch: [62/300] [ 500/1251] eta: 0:12:01 lr: 0.001796 loss: 3.811464 (3.547050) time: 0.971268 data: 0.000169 max mem: 18817 Epoch: [62/300] [ 550/1251] eta: 0:11:14 lr: 0.001795 loss: 3.864733 (3.553196) time: 1.051503 data: 0.000175 max mem: 18817 Epoch: [62/300] [ 600/1251] eta: 0:10:25 lr: 0.001795 loss: 3.738661 (3.562919) time: 0.970683 data: 0.000188 max mem: 18817 Epoch: [62/300] [ 650/1251] eta: 0:09:36 lr: 0.001795 loss: 3.164667 (3.563081) time: 0.943921 data: 0.000179 max mem: 18817 Epoch: [62/300] [ 700/1251] eta: 0:08:49 lr: 0.001795 loss: 3.376522 (3.563890) time: 0.934957 data: 0.000162 max mem: 18817 Epoch: [62/300] [ 750/1251] eta: 0:08:01 lr: 0.001794 loss: 3.738976 (3.560823) time: 0.997044 data: 0.000167 max mem: 18817 Epoch: [62/300] [ 800/1251] eta: 0:07:13 lr: 0.001794 loss: 3.400635 (3.559301) time: 1.018780 data: 0.000175 max mem: 18817 Epoch: [62/300] [ 850/1251] eta: 0:06:25 lr: 0.001794 loss: 3.612713 (3.561894) time: 0.974754 data: 0.000166 max mem: 18817 Epoch: [62/300] [ 900/1251] eta: 0:05:36 lr: 0.001794 loss: 3.435109 (3.553462) time: 0.930635 data: 0.000178 max mem: 18817 Epoch: [62/300] [ 950/1251] eta: 0:04:49 lr: 0.001793 loss: 3.741298 (3.554094) time: 0.933973 data: 0.000169 max mem: 18817 Epoch: [62/300] [1000/1251] eta: 0:04:01 lr: 0.001793 loss: 3.524493 (3.553789) time: 1.002569 data: 0.000173 max mem: 18817 Epoch: [62/300] [1050/1251] eta: 0:03:13 lr: 0.001793 loss: 3.578758 (3.557301) time: 1.019877 data: 0.000172 max mem: 18817 Epoch: [62/300] [1100/1251] eta: 0:02:25 lr: 0.001793 loss: 3.785510 (3.560051) time: 0.992144 data: 0.000166 max mem: 18817 Epoch: [62/300] [1150/1251] eta: 0:01:37 lr: 0.001792 loss: 3.619201 (3.563212) time: 0.917987 data: 0.000187 max mem: 18817 Epoch: [62/300] [1200/1251] eta: 0:00:49 lr: 0.001792 loss: 3.553401 (3.566489) time: 0.928873 data: 0.000175 max mem: 18817 Epoch: [62/300] [1250/1251] eta: 0:00:00 lr: 0.001792 loss: 3.347740 (3.563802) time: 1.006313 data: 0.000775 max mem: 18817 Epoch: [62/300] Total time: 0:20:03 (0.961718 s / it) Averaged stats: lr: 0.001792 loss: 3.347740 (3.566733) Test: [ 0/49] eta: 0:01:26 loss: 0.788819 (0.788819) acc1: 79.687500 (79.687500) acc5: 98.437500 (98.437500) time: 1.769944 data: 1.352418 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.879001 (0.922160) acc1: 76.562500 (77.982955) acc5: 95.312500 (94.602273) time: 0.519726 data: 0.123084 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.945611 (0.968436) acc1: 76.562500 (77.529762) acc5: 95.312500 (94.345238) time: 0.378107 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.034121 (0.986274) acc1: 75.000000 (76.814516) acc5: 95.312500 (94.102823) time: 0.362418 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.018038 (1.001999) acc1: 76.562500 (76.943598) acc5: 93.750000 (93.826220) time: 0.360725 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.020601 (1.011676) acc1: 75.000000 (76.512000) acc5: 93.750000 (93.568000) time: 0.355229 data: 0.000108 max mem: 18817 Test: Total time: 0:00:19 (0.398080 s / it) * Acc@1 76.434 Acc@5 93.660 loss 1.018 Max accuracy: 76.54% Epoch: [63/300] [ 0/1251] eta: 0:43:15 lr: 0.001792 loss: 3.998921 (3.998921) time: 2.074996 data: 1.187671 max mem: 18817 Epoch: [63/300] [ 50/1251] eta: 0:19:28 lr: 0.001792 loss: 3.576924 (3.558804) time: 0.966606 data: 0.000151 max mem: 18817 Epoch: [63/300] [ 100/1251] eta: 0:18:26 lr: 0.001791 loss: 3.768258 (3.554511) time: 0.920408 data: 0.000163 max mem: 18817 Epoch: [63/300] [ 150/1251] eta: 0:17:40 lr: 0.001791 loss: 3.713188 (3.555344) time: 0.925876 data: 0.000169 max mem: 18817 Epoch: [63/300] [ 200/1251] eta: 0:16:52 lr: 0.001791 loss: 3.389009 (3.528249) time: 0.980750 data: 0.000167 max mem: 18817 Epoch: [63/300] [ 250/1251] eta: 0:16:05 lr: 0.001791 loss: 3.532957 (3.533545) time: 1.044245 data: 0.000185 max mem: 18817 Epoch: [63/300] [ 300/1251] eta: 0:15:15 lr: 0.001790 loss: 3.285046 (3.514826) time: 0.980713 data: 0.000161 max mem: 18817 Epoch: [63/300] [ 350/1251] eta: 0:14:26 lr: 0.001790 loss: 3.681783 (3.523048) time: 0.923752 data: 0.000157 max mem: 18817 Epoch: [63/300] [ 400/1251] eta: 0:13:39 lr: 0.001790 loss: 3.705912 (3.534131) time: 0.945064 data: 0.000195 max mem: 18817 Epoch: [63/300] [ 450/1251] eta: 0:12:52 lr: 0.001790 loss: 3.757735 (3.554778) time: 0.992582 data: 0.000190 max mem: 18817 Epoch: [63/300] [ 500/1251] eta: 0:12:06 lr: 0.001789 loss: 3.849312 (3.554786) time: 1.079934 data: 0.000172 max mem: 18817 Epoch: [63/300] [ 550/1251] eta: 0:11:16 lr: 0.001789 loss: 3.879804 (3.568060) time: 0.960321 data: 0.000190 max mem: 18817 Epoch: [63/300] [ 600/1251] eta: 0:10:26 lr: 0.001789 loss: 3.368001 (3.557727) time: 0.925985 data: 0.000177 max mem: 18817 Epoch: [63/300] [ 650/1251] eta: 0:09:38 lr: 0.001789 loss: 3.787723 (3.574466) time: 0.924833 data: 0.000176 max mem: 18817 Epoch: [63/300] [ 700/1251] eta: 0:08:50 lr: 0.001788 loss: 3.574571 (3.570798) time: 0.989699 data: 0.000161 max mem: 18817 Epoch: [63/300] [ 750/1251] eta: 0:08:02 lr: 0.001788 loss: 3.562428 (3.563684) time: 1.043128 data: 0.000169 max mem: 18817 Epoch: [63/300] [ 800/1251] eta: 0:07:14 lr: 0.001788 loss: 3.516511 (3.558982) time: 0.977969 data: 0.000176 max mem: 18817 Epoch: [63/300] [ 850/1251] eta: 0:06:25 lr: 0.001788 loss: 3.685453 (3.557746) time: 0.924721 data: 0.000170 max mem: 18817 Epoch: [63/300] [ 900/1251] eta: 0:05:37 lr: 0.001787 loss: 3.480968 (3.553678) time: 0.937751 data: 0.000205 max mem: 18817 Epoch: [63/300] [ 950/1251] eta: 0:04:49 lr: 0.001787 loss: 3.554694 (3.554183) time: 0.970395 data: 0.000180 max mem: 18817 Epoch: [63/300] [1000/1251] eta: 0:04:01 lr: 0.001787 loss: 4.097482 (3.557572) time: 1.040896 data: 0.000179 max mem: 18817 Epoch: [63/300] [1050/1251] eta: 0:03:13 lr: 0.001787 loss: 3.680248 (3.554372) time: 0.985571 data: 0.000168 max mem: 18817 Epoch: [63/300] [1100/1251] eta: 0:02:25 lr: 0.001786 loss: 3.395977 (3.552023) time: 0.925095 data: 0.000161 max mem: 18817 Epoch: [63/300] [1150/1251] eta: 0:01:37 lr: 0.001786 loss: 3.718060 (3.556326) time: 0.927680 data: 0.000171 max mem: 18817 Epoch: [63/300] [1200/1251] eta: 0:00:49 lr: 0.001786 loss: 3.671693 (3.559712) time: 0.987996 data: 0.000190 max mem: 18817 Epoch: [63/300] [1250/1251] eta: 0:00:00 lr: 0.001786 loss: 3.567870 (3.557422) time: 1.020125 data: 0.000761 max mem: 18817 Epoch: [63/300] Total time: 0:20:04 (0.963199 s / it) Averaged stats: lr: 0.001786 loss: 3.567870 (3.566515) Test: [ 0/49] eta: 0:01:17 loss: 0.699717 (0.699717) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.589768 data: 1.146762 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.827644 (0.871422) acc1: 79.687500 (79.545455) acc5: 95.312500 (93.607955) time: 0.497170 data: 0.104420 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.960316 (0.939216) acc1: 76.562500 (77.604167) acc5: 93.750000 (93.452381) time: 0.375619 data: 0.000150 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.000231 (0.944376) acc1: 75.000000 (77.066532) acc5: 95.312500 (93.750000) time: 0.363234 data: 0.000122 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.972944 (0.952364) acc1: 75.000000 (76.943598) acc5: 95.312500 (93.635671) time: 0.360678 data: 0.000120 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.972944 (0.951180) acc1: 76.562500 (77.184000) acc5: 93.750000 (93.600000) time: 0.358971 data: 0.000098 max mem: 18817 Test: Total time: 0:00:19 (0.393739 s / it) * Acc@1 76.898 Acc@5 93.728 loss 0.961 Max accuracy: 76.90% Epoch: [64/300] [ 0/1251] eta: 0:42:26 lr: 0.001786 loss: 3.698675 (3.698675) time: 2.035683 data: 1.133614 max mem: 18817 Epoch: [64/300] [ 50/1251] eta: 0:19:23 lr: 0.001785 loss: 3.531909 (3.601270) time: 0.929887 data: 0.000182 max mem: 18817 Epoch: [64/300] [ 100/1251] eta: 0:18:27 lr: 0.001785 loss: 3.725210 (3.586961) time: 0.928805 data: 0.000178 max mem: 18817 Epoch: [64/300] [ 150/1251] eta: 0:17:41 lr: 0.001785 loss: 3.698850 (3.527478) time: 0.992512 data: 0.000180 max mem: 18817 Epoch: [64/300] [ 200/1251] eta: 0:16:52 lr: 0.001785 loss: 3.646266 (3.563104) time: 1.016941 data: 0.000161 max mem: 18817 Epoch: [64/300] [ 250/1251] eta: 0:15:59 lr: 0.001784 loss: 3.691252 (3.562002) time: 0.943409 data: 0.000154 max mem: 18817 Epoch: [64/300] [ 300/1251] eta: 0:15:11 lr: 0.001784 loss: 3.411538 (3.563020) time: 0.923365 data: 0.000170 max mem: 18817 Epoch: [64/300] [ 350/1251] eta: 0:14:23 lr: 0.001784 loss: 3.643535 (3.570786) time: 0.917034 data: 0.000156 max mem: 18817 Epoch: [64/300] [ 400/1251] eta: 0:13:37 lr: 0.001783 loss: 3.558865 (3.562997) time: 0.998465 data: 0.000173 max mem: 18817 Epoch: [64/300] [ 450/1251] eta: 0:12:48 lr: 0.001783 loss: 3.626139 (3.558317) time: 0.994259 data: 0.000183 max mem: 18817 Epoch: [64/300] [ 500/1251] eta: 0:12:00 lr: 0.001783 loss: 3.732751 (3.564439) time: 0.975971 data: 0.000168 max mem: 18817 Epoch: [64/300] [ 550/1251] eta: 0:11:11 lr: 0.001783 loss: 3.639033 (3.569869) time: 0.931097 data: 0.000170 max mem: 18817 Epoch: [64/300] [ 600/1251] eta: 0:10:25 lr: 0.001782 loss: 3.868659 (3.570133) time: 0.941883 data: 0.000185 max mem: 18817 Epoch: [64/300] [ 650/1251] eta: 0:09:37 lr: 0.001782 loss: 3.292642 (3.564141) time: 0.984678 data: 0.000157 max mem: 18817 Epoch: [64/300] [ 700/1251] eta: 0:08:49 lr: 0.001782 loss: 3.503656 (3.563987) time: 1.007604 data: 0.000170 max mem: 18817 Epoch: [64/300] [ 750/1251] eta: 0:08:01 lr: 0.001782 loss: 3.775187 (3.563757) time: 0.987655 data: 0.000174 max mem: 18817 Epoch: [64/300] [ 800/1251] eta: 0:07:12 lr: 0.001781 loss: 3.817406 (3.569218) time: 0.931005 data: 0.000165 max mem: 18817 Epoch: [64/300] [ 850/1251] eta: 0:06:24 lr: 0.001781 loss: 3.527672 (3.570266) time: 0.921814 data: 0.000166 max mem: 18817 Epoch: [64/300] [ 900/1251] eta: 0:05:37 lr: 0.001781 loss: 3.848591 (3.575258) time: 0.968035 data: 0.000186 max mem: 18817 Epoch: [64/300] [ 950/1251] eta: 0:04:49 lr: 0.001781 loss: 3.770786 (3.585274) time: 1.004989 data: 0.000175 max mem: 18817 Epoch: [64/300] [1000/1251] eta: 0:04:01 lr: 0.001780 loss: 3.444948 (3.576701) time: 0.971898 data: 0.000166 max mem: 18817 Epoch: [64/300] [1050/1251] eta: 0:03:12 lr: 0.001780 loss: 3.538225 (3.572729) time: 0.934278 data: 0.000177 max mem: 18817 Epoch: [64/300] [1100/1251] eta: 0:02:24 lr: 0.001780 loss: 3.810299 (3.576189) time: 0.934402 data: 0.000172 max mem: 18817 Epoch: [64/300] [1150/1251] eta: 0:01:36 lr: 0.001780 loss: 3.590414 (3.577114) time: 0.990635 data: 0.000188 max mem: 18817 Epoch: [64/300] [1200/1251] eta: 0:00:48 lr: 0.001779 loss: 3.656422 (3.574840) time: 1.002356 data: 0.000171 max mem: 18817 Epoch: [64/300] [1250/1251] eta: 0:00:00 lr: 0.001779 loss: 3.401087 (3.568830) time: 0.967882 data: 0.000769 max mem: 18817 Epoch: [64/300] Total time: 0:20:01 (0.960412 s / it) Averaged stats: lr: 0.001779 loss: 3.401087 (3.567294) Test: [ 0/49] eta: 0:01:15 loss: 0.805841 (0.805841) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.544343 data: 1.113032 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.901700 (0.908097) acc1: 79.687500 (79.971591) acc5: 95.312500 (94.318182) time: 0.475650 data: 0.101350 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.968249 (0.975156) acc1: 76.562500 (77.976190) acc5: 93.750000 (93.898810) time: 0.365791 data: 0.000147 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 1.047419 (0.984377) acc1: 76.562500 (77.570565) acc5: 93.750000 (94.052419) time: 0.363959 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.052879 (1.003734) acc1: 76.562500 (77.400915) acc5: 93.750000 (93.940549) time: 0.378985 data: 0.000142 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.055749 (1.006582) acc1: 76.562500 (77.376000) acc5: 93.750000 (93.760000) time: 0.400156 data: 0.000109 max mem: 18817 Test: Total time: 0:00:19 (0.407220 s / it) * Acc@1 76.902 Acc@5 93.812 loss 1.006 Max accuracy: 76.90% Epoch: [65/300] [ 0/1251] eta: 0:43:26 lr: 0.001779 loss: 3.015837 (3.015837) time: 2.083585 data: 1.156903 max mem: 18817 Epoch: [65/300] [ 50/1251] eta: 0:19:50 lr: 0.001779 loss: 3.627363 (3.481135) time: 0.939999 data: 0.000167 max mem: 18817 Epoch: [65/300] [ 100/1251] eta: 0:18:42 lr: 0.001779 loss: 3.528721 (3.466481) time: 0.988086 data: 0.000175 max mem: 18817 Epoch: [65/300] [ 150/1251] eta: 0:17:43 lr: 0.001778 loss: 3.853323 (3.505668) time: 0.982567 data: 0.000159 max mem: 18817 Epoch: [65/300] [ 200/1251] eta: 0:16:50 lr: 0.001778 loss: 3.511077 (3.492292) time: 0.906666 data: 0.000173 max mem: 18817 Epoch: [65/300] [ 250/1251] eta: 0:16:04 lr: 0.001778 loss: 3.641753 (3.493229) time: 0.937146 data: 0.000171 max mem: 18817 Epoch: [65/300] [ 300/1251] eta: 0:15:18 lr: 0.001777 loss: 3.524955 (3.499933) time: 0.955836 data: 0.000174 max mem: 18817 Epoch: [65/300] [ 350/1251] eta: 0:14:29 lr: 0.001777 loss: 3.582862 (3.513633) time: 0.988671 data: 0.000174 max mem: 18817 Epoch: [65/300] [ 400/1251] eta: 0:13:39 lr: 0.001777 loss: 3.652557 (3.513724) time: 0.973317 data: 0.000193 max mem: 18817 Epoch: [65/300] [ 450/1251] eta: 0:12:51 lr: 0.001777 loss: 3.516312 (3.513043) time: 0.933502 data: 0.000184 max mem: 18817 Epoch: [65/300] [ 500/1251] eta: 0:12:03 lr: 0.001776 loss: 3.730718 (3.525060) time: 0.929592 data: 0.000176 max mem: 18817 Epoch: [65/300] [ 550/1251] eta: 0:11:15 lr: 0.001776 loss: 3.674043 (3.534592) time: 0.952046 data: 0.000177 max mem: 18817 Epoch: [65/300] [ 600/1251] eta: 0:10:28 lr: 0.001776 loss: 3.731965 (3.531847) time: 0.975963 data: 0.000174 max mem: 18817 Epoch: [65/300] [ 650/1251] eta: 0:09:38 lr: 0.001776 loss: 3.836450 (3.532561) time: 0.968103 data: 0.000178 max mem: 18817 Epoch: [65/300] [ 700/1251] eta: 0:08:50 lr: 0.001775 loss: 3.708495 (3.545286) time: 0.907859 data: 0.000162 max mem: 18817 Epoch: [65/300] [ 750/1251] eta: 0:08:02 lr: 0.001775 loss: 3.756965 (3.554028) time: 0.921634 data: 0.000176 max mem: 18817 Epoch: [65/300] [ 800/1251] eta: 0:07:14 lr: 0.001775 loss: 3.517723 (3.557323) time: 0.937807 data: 0.000189 max mem: 18817 Epoch: [65/300] [ 850/1251] eta: 0:06:26 lr: 0.001775 loss: 3.179177 (3.556471) time: 0.992541 data: 0.000163 max mem: 18817 Epoch: [65/300] [ 900/1251] eta: 0:05:37 lr: 0.001774 loss: 3.664261 (3.554023) time: 0.975870 data: 0.000173 max mem: 18817 Epoch: [65/300] [ 950/1251] eta: 0:04:49 lr: 0.001774 loss: 3.679564 (3.553702) time: 0.914630 data: 0.000180 max mem: 18817 Epoch: [65/300] [1000/1251] eta: 0:04:01 lr: 0.001774 loss: 3.675360 (3.553911) time: 0.932130 data: 0.000181 max mem: 18817 Epoch: [65/300] [1050/1251] eta: 0:03:13 lr: 0.001774 loss: 3.668694 (3.552918) time: 0.935863 data: 0.000178 max mem: 18817 Epoch: [65/300] [1100/1251] eta: 0:02:25 lr: 0.001773 loss: 3.650921 (3.552155) time: 0.973287 data: 0.000177 max mem: 18817 Epoch: [65/300] [1150/1251] eta: 0:01:37 lr: 0.001773 loss: 3.814039 (3.557164) time: 0.968300 data: 0.000186 max mem: 18817 Epoch: [65/300] [1200/1251] eta: 0:00:49 lr: 0.001773 loss: 3.844202 (3.561641) time: 0.915268 data: 0.000170 max mem: 18817 Epoch: [65/300] [1250/1251] eta: 0:00:00 lr: 0.001772 loss: 3.715391 (3.562622) time: 0.923936 data: 0.000780 max mem: 18817 Epoch: [65/300] Total time: 0:20:04 (0.962527 s / it) Averaged stats: lr: 0.001772 loss: 3.715391 (3.567866) Test: [ 0/49] eta: 0:01:26 loss: 0.869009 (0.869009) acc1: 76.562500 (76.562500) acc5: 95.312500 (95.312500) time: 1.772616 data: 1.330564 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.869009 (0.891052) acc1: 78.125000 (78.835227) acc5: 95.312500 (94.318182) time: 0.497574 data: 0.121126 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.975622 (0.946190) acc1: 76.562500 (77.455357) acc5: 93.750000 (93.675595) time: 0.367119 data: 0.000165 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.975622 (0.951981) acc1: 75.000000 (76.915323) acc5: 93.750000 (93.750000) time: 0.470436 data: 0.000142 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.973031 (0.971183) acc1: 73.437500 (76.486280) acc5: 93.750000 (93.407012) time: 0.467465 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.017771 (0.968077) acc1: 75.000000 (76.640000) acc5: 92.187500 (93.408000) time: 0.421030 data: 0.000104 max mem: 18817 Test: Total time: 0:00:21 (0.437603 s / it) * Acc@1 76.578 Acc@5 93.740 loss 0.976 Max accuracy: 76.90% Epoch: [66/300] [ 0/1251] eta: 0:41:27 lr: 0.001772 loss: 2.731624 (2.731624) time: 1.988016 data: 1.106001 max mem: 18817 Epoch: [66/300] [ 50/1251] eta: 0:19:47 lr: 0.001772 loss: 3.900370 (3.693709) time: 1.038412 data: 0.000152 max mem: 18817 Epoch: [66/300] [ 100/1251] eta: 0:18:34 lr: 0.001772 loss: 3.381600 (3.574383) time: 0.974561 data: 0.000172 max mem: 18817 Epoch: [66/300] [ 150/1251] eta: 0:17:37 lr: 0.001772 loss: 3.600979 (3.589095) time: 0.911331 data: 0.000166 max mem: 18817 Epoch: [66/300] [ 200/1251] eta: 0:16:53 lr: 0.001771 loss: 3.787611 (3.578897) time: 0.938167 data: 0.000156 max mem: 18817 Epoch: [66/300] [ 250/1251] eta: 0:16:05 lr: 0.001771 loss: 3.621130 (3.557011) time: 0.986557 data: 0.000172 max mem: 18817 Epoch: [66/300] [ 300/1251] eta: 0:15:17 lr: 0.001771 loss: 3.690131 (3.575581) time: 1.018787 data: 0.000161 max mem: 18817 Epoch: [66/300] [ 350/1251] eta: 0:14:26 lr: 0.001771 loss: 3.500230 (3.574818) time: 0.968068 data: 0.000162 max mem: 18817 Epoch: [66/300] [ 400/1251] eta: 0:13:35 lr: 0.001770 loss: 3.680528 (3.591137) time: 0.907536 data: 0.000177 max mem: 18817 Epoch: [66/300] [ 450/1251] eta: 0:12:48 lr: 0.001770 loss: 3.566136 (3.573928) time: 0.947164 data: 0.000155 max mem: 18817 Epoch: [66/300] [ 500/1251] eta: 0:11:59 lr: 0.001770 loss: 3.654135 (3.559355) time: 0.973623 data: 0.000174 max mem: 18817 Epoch: [66/300] [ 550/1251] eta: 0:11:13 lr: 0.001770 loss: 3.852903 (3.570460) time: 1.055658 data: 0.000170 max mem: 18817 Epoch: [66/300] [ 600/1251] eta: 0:10:24 lr: 0.001769 loss: 3.834458 (3.574444) time: 0.971613 data: 0.000169 max mem: 18817 Epoch: [66/300] [ 650/1251] eta: 0:09:36 lr: 0.001769 loss: 3.825965 (3.577430) time: 0.912773 data: 0.000174 max mem: 18817 Epoch: [66/300] [ 700/1251] eta: 0:08:48 lr: 0.001769 loss: 3.761972 (3.577410) time: 0.932752 data: 0.000171 max mem: 18817 Epoch: [66/300] [ 750/1251] eta: 0:08:01 lr: 0.001768 loss: 3.697878 (3.579352) time: 0.997018 data: 0.000176 max mem: 18817 Epoch: [66/300] [ 800/1251] eta: 0:07:13 lr: 0.001768 loss: 3.417821 (3.572430) time: 1.030142 data: 0.000183 max mem: 18817 Epoch: [66/300] [ 850/1251] eta: 0:06:25 lr: 0.001768 loss: 3.554076 (3.569526) time: 0.987640 data: 0.000160 max mem: 18817 Epoch: [66/300] [ 900/1251] eta: 0:05:36 lr: 0.001768 loss: 3.875553 (3.570985) time: 0.912581 data: 0.000163 max mem: 18817 Epoch: [66/300] [ 950/1251] eta: 0:04:49 lr: 0.001767 loss: 3.562513 (3.570735) time: 0.937744 data: 0.000173 max mem: 18817 Epoch: [66/300] [1000/1251] eta: 0:04:01 lr: 0.001767 loss: 3.751004 (3.570445) time: 0.993936 data: 0.000161 max mem: 18817 Epoch: [66/300] [1050/1251] eta: 0:03:13 lr: 0.001767 loss: 3.619006 (3.573276) time: 1.018427 data: 0.000196 max mem: 18817 Epoch: [66/300] [1100/1251] eta: 0:02:25 lr: 0.001767 loss: 3.634886 (3.570980) time: 0.981047 data: 0.000191 max mem: 18817 Epoch: [66/300] [1150/1251] eta: 0:01:36 lr: 0.001766 loss: 3.570031 (3.572616) time: 0.913479 data: 0.000191 max mem: 18817 Epoch: [66/300] [1200/1251] eta: 0:00:49 lr: 0.001766 loss: 3.546998 (3.568558) time: 0.932429 data: 0.000183 max mem: 18817 Epoch: [66/300] [1250/1251] eta: 0:00:00 lr: 0.001766 loss: 3.734216 (3.573685) time: 0.985035 data: 0.000983 max mem: 18817 Epoch: [66/300] Total time: 0:20:03 (0.961739 s / it) Averaged stats: lr: 0.001766 loss: 3.734216 (3.567799) Test: [ 0/49] eta: 0:01:28 loss: 0.721950 (0.721950) acc1: 81.250000 (81.250000) acc5: 96.875000 (96.875000) time: 1.812345 data: 1.365512 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.861440 (0.899253) acc1: 78.125000 (79.119318) acc5: 93.750000 (93.465909) time: 0.498675 data: 0.124292 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.943889 (0.925551) acc1: 78.125000 (78.199405) acc5: 93.750000 (93.750000) time: 0.368487 data: 0.000157 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.943889 (0.931845) acc1: 76.562500 (77.620968) acc5: 95.312500 (93.901210) time: 0.366243 data: 0.000146 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.986252 (0.948729) acc1: 75.000000 (77.248476) acc5: 93.750000 (93.864329) time: 0.360486 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.986252 (0.950651) acc1: 75.000000 (77.376000) acc5: 95.312500 (93.920000) time: 0.355516 data: 0.000108 max mem: 18817 Test: Total time: 0:00:19 (0.395103 s / it) * Acc@1 76.740 Acc@5 93.878 loss 0.957 Max accuracy: 76.90% Epoch: [67/300] [ 0/1251] eta: 0:41:28 lr: 0.001766 loss: 3.758117 (3.758117) time: 1.989409 data: 1.101536 max mem: 18817 Epoch: [67/300] [ 50/1251] eta: 0:19:02 lr: 0.001766 loss: 3.502097 (3.490763) time: 0.905744 data: 0.000153 max mem: 18817 Epoch: [67/300] [ 100/1251] eta: 0:18:30 lr: 0.001765 loss: 3.295890 (3.500939) time: 0.941745 data: 0.000197 max mem: 18817 Epoch: [67/300] [ 150/1251] eta: 0:17:53 lr: 0.001765 loss: 3.531261 (3.482039) time: 0.951028 data: 0.000183 max mem: 18817 Epoch: [67/300] [ 200/1251] eta: 0:17:02 lr: 0.001765 loss: 3.878859 (3.509984) time: 0.984364 data: 0.000190 max mem: 18817 Epoch: [67/300] [ 250/1251] eta: 0:16:09 lr: 0.001764 loss: 3.403873 (3.501823) time: 0.984818 data: 0.000171 max mem: 18817 Epoch: [67/300] [ 300/1251] eta: 0:15:23 lr: 0.001764 loss: 3.504820 (3.522683) time: 0.970340 data: 0.000163 max mem: 18817 Epoch: [67/300] [ 350/1251] eta: 0:14:30 lr: 0.001764 loss: 3.554589 (3.533755) time: 0.913616 data: 0.000170 max mem: 18817 Epoch: [67/300] [ 400/1251] eta: 0:13:41 lr: 0.001764 loss: 3.790076 (3.528606) time: 0.927893 data: 0.000186 max mem: 18817 Epoch: [67/300] [ 450/1251] eta: 0:12:54 lr: 0.001763 loss: 3.607216 (3.530829) time: 0.998586 data: 0.000180 max mem: 18817 Epoch: [67/300] [ 500/1251] eta: 0:12:04 lr: 0.001763 loss: 3.727034 (3.542135) time: 0.968424 data: 0.000163 max mem: 18817 Epoch: [67/300] [ 550/1251] eta: 0:11:16 lr: 0.001763 loss: 3.577468 (3.549888) time: 0.976776 data: 0.000181 max mem: 18817 Epoch: [67/300] [ 600/1251] eta: 0:10:28 lr: 0.001763 loss: 3.768915 (3.547857) time: 0.936870 data: 0.000175 max mem: 18817 Epoch: [67/300] [ 650/1251] eta: 0:09:39 lr: 0.001762 loss: 3.780628 (3.543934) time: 0.937710 data: 0.000169 max mem: 18817 Epoch: [67/300] [ 700/1251] eta: 0:08:51 lr: 0.001762 loss: 3.494813 (3.547190) time: 0.993357 data: 0.000147 max mem: 18817 Epoch: [67/300] [ 750/1251] eta: 0:08:03 lr: 0.001762 loss: 3.558246 (3.543361) time: 1.013886 data: 0.000159 max mem: 18817 Epoch: [67/300] [ 800/1251] eta: 0:07:14 lr: 0.001762 loss: 3.463591 (3.545081) time: 0.958753 data: 0.000171 max mem: 18817 Epoch: [67/300] [ 850/1251] eta: 0:06:26 lr: 0.001761 loss: 3.656293 (3.549912) time: 0.913274 data: 0.000171 max mem: 18817 Epoch: [67/300] [ 900/1251] eta: 0:05:38 lr: 0.001761 loss: 3.633601 (3.558288) time: 0.931014 data: 0.000172 max mem: 18817 Epoch: [67/300] [ 950/1251] eta: 0:04:49 lr: 0.001761 loss: 3.596621 (3.554429) time: 0.985665 data: 0.000169 max mem: 18817 Epoch: [67/300] [1000/1251] eta: 0:04:01 lr: 0.001760 loss: 3.849638 (3.555116) time: 0.970644 data: 0.000166 max mem: 18817 Epoch: [67/300] [1050/1251] eta: 0:03:13 lr: 0.001760 loss: 3.562609 (3.561279) time: 0.975621 data: 0.000197 max mem: 18817 Epoch: [67/300] [1100/1251] eta: 0:02:25 lr: 0.001760 loss: 3.787696 (3.563729) time: 0.932274 data: 0.000183 max mem: 18817 Epoch: [67/300] [1150/1251] eta: 0:01:37 lr: 0.001760 loss: 3.838142 (3.565115) time: 0.918589 data: 0.000181 max mem: 18817 Epoch: [67/300] [1200/1251] eta: 0:00:49 lr: 0.001759 loss: 3.568499 (3.562093) time: 0.991333 data: 0.000183 max mem: 18817 Epoch: [67/300] [1250/1251] eta: 0:00:00 lr: 0.001759 loss: 3.439111 (3.559069) time: 0.993291 data: 0.000741 max mem: 18817 Epoch: [67/300] Total time: 0:20:04 (0.962885 s / it) Averaged stats: lr: 0.001759 loss: 3.439111 (3.560681) Test: [ 0/49] eta: 0:01:21 loss: 0.829966 (0.829966) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.667874 data: 1.172905 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.897144 (0.898303) acc1: 79.687500 (80.255682) acc5: 93.750000 (94.318182) time: 0.526978 data: 0.106783 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.957622 (0.947462) acc1: 78.125000 (78.943452) acc5: 93.750000 (94.196429) time: 0.401808 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.983951 (0.944983) acc1: 78.125000 (78.276210) acc5: 93.750000 (94.455645) time: 0.390626 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.980794 (0.961111) acc1: 76.562500 (77.667683) acc5: 93.750000 (94.283537) time: 0.374300 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.996026 (0.961091) acc1: 75.000000 (77.664000) acc5: 93.750000 (94.272000) time: 0.361255 data: 0.000105 max mem: 18817 Test: Total time: 0:00:20 (0.413509 s / it) * Acc@1 76.888 Acc@5 93.978 loss 0.971 Max accuracy: 76.90% Epoch: [68/300] [ 0/1251] eta: 0:43:11 lr: 0.001759 loss: 4.033710 (4.033710) time: 2.071387 data: 1.153008 max mem: 18817 Epoch: [68/300] [ 50/1251] eta: 0:19:20 lr: 0.001759 loss: 3.640813 (3.565329) time: 0.906061 data: 0.000162 max mem: 18817 Epoch: [68/300] [ 100/1251] eta: 0:18:36 lr: 0.001759 loss: 3.691131 (3.582863) time: 0.932152 data: 0.000182 max mem: 18817 Epoch: [68/300] [ 150/1251] eta: 0:17:52 lr: 0.001758 loss: 3.557294 (3.568316) time: 0.936470 data: 0.000179 max mem: 18817 Epoch: [68/300] [ 200/1251] eta: 0:17:04 lr: 0.001758 loss: 3.486072 (3.555314) time: 0.993985 data: 0.000174 max mem: 18817 Epoch: [68/300] [ 250/1251] eta: 0:16:09 lr: 0.001758 loss: 3.513287 (3.534960) time: 0.972850 data: 0.000175 max mem: 18817 Epoch: [68/300] [ 300/1251] eta: 0:15:22 lr: 0.001757 loss: 3.680552 (3.543664) time: 0.967461 data: 0.000187 max mem: 18817 Epoch: [68/300] [ 350/1251] eta: 0:14:32 lr: 0.001757 loss: 3.550071 (3.525366) time: 0.921719 data: 0.000184 max mem: 18817 Epoch: [68/300] [ 400/1251] eta: 0:13:45 lr: 0.001757 loss: 3.787033 (3.545160) time: 0.913606 data: 0.000201 max mem: 18817 Epoch: [68/300] [ 450/1251] eta: 0:12:57 lr: 0.001757 loss: 3.382397 (3.534996) time: 1.008012 data: 0.000195 max mem: 18817 Epoch: [68/300] [ 500/1251] eta: 0:12:06 lr: 0.001756 loss: 3.902141 (3.553914) time: 0.970411 data: 0.000216 max mem: 18817 Epoch: [68/300] [ 550/1251] eta: 0:11:18 lr: 0.001756 loss: 3.288506 (3.545243) time: 0.962357 data: 0.000188 max mem: 18817 Epoch: [68/300] [ 600/1251] eta: 0:10:28 lr: 0.001756 loss: 3.624352 (3.547885) time: 0.926989 data: 0.000211 max mem: 18817 Epoch: [68/300] [ 650/1251] eta: 0:09:39 lr: 0.001756 loss: 3.712812 (3.551378) time: 0.926287 data: 0.000169 max mem: 18817 Epoch: [68/300] [ 700/1251] eta: 0:08:52 lr: 0.001755 loss: 3.735478 (3.552565) time: 0.993537 data: 0.000186 max mem: 18817 Epoch: [68/300] [ 750/1251] eta: 0:08:03 lr: 0.001755 loss: 3.455387 (3.546817) time: 0.983137 data: 0.000175 max mem: 18817 Epoch: [68/300] [ 800/1251] eta: 0:07:15 lr: 0.001755 loss: 3.733560 (3.552415) time: 0.984377 data: 0.000174 max mem: 18817 Epoch: [68/300] [ 850/1251] eta: 0:06:26 lr: 0.001754 loss: 3.497354 (3.545728) time: 0.931936 data: 0.000169 max mem: 18817 Epoch: [68/300] [ 900/1251] eta: 0:05:38 lr: 0.001754 loss: 3.519408 (3.543335) time: 0.939554 data: 0.000197 max mem: 18817 Epoch: [68/300] [ 950/1251] eta: 0:04:50 lr: 0.001754 loss: 3.679958 (3.546066) time: 0.981766 data: 0.000184 max mem: 18817 Epoch: [68/300] [1000/1251] eta: 0:04:01 lr: 0.001754 loss: 3.528272 (3.538481) time: 0.990313 data: 0.000161 max mem: 18817 Epoch: [68/300] [1050/1251] eta: 0:03:13 lr: 0.001753 loss: 3.638920 (3.536782) time: 0.994210 data: 0.000219 max mem: 18817 Epoch: [68/300] [1100/1251] eta: 0:02:25 lr: 0.001753 loss: 3.479139 (3.533935) time: 0.936281 data: 0.000179 max mem: 18817 Epoch: [68/300] [1150/1251] eta: 0:01:37 lr: 0.001753 loss: 3.666248 (3.534909) time: 0.930069 data: 0.000215 max mem: 18817 Epoch: [68/300] [1200/1251] eta: 0:00:49 lr: 0.001753 loss: 3.662689 (3.531271) time: 0.997622 data: 0.000221 max mem: 18817 Epoch: [68/300] [1250/1251] eta: 0:00:00 lr: 0.001752 loss: 3.672215 (3.530475) time: 1.045684 data: 0.000747 max mem: 18817 Epoch: [68/300] Total time: 0:20:07 (0.965479 s / it) Averaged stats: lr: 0.001752 loss: 3.672215 (3.530699) Test: [ 0/49] eta: 0:01:18 loss: 0.760471 (0.760471) acc1: 87.500000 (87.500000) acc5: 95.312500 (95.312500) time: 1.602538 data: 1.149312 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.911544 (0.923175) acc1: 79.687500 (79.687500) acc5: 93.750000 (93.892045) time: 0.503863 data: 0.104646 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 1.006588 (0.966373) acc1: 78.125000 (78.348214) acc5: 93.750000 (93.898810) time: 0.393031 data: 0.000160 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 1.038844 (0.985327) acc1: 75.000000 (77.772177) acc5: 93.750000 (94.002016) time: 0.377919 data: 0.000143 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.028469 (1.002411) acc1: 75.000000 (77.400915) acc5: 93.750000 (93.864329) time: 0.365350 data: 0.000133 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.028469 (1.007175) acc1: 78.125000 (77.152000) acc5: 93.750000 (93.824000) time: 0.362462 data: 0.000105 max mem: 18817 Test: Total time: 0:00:19 (0.403256 s / it) * Acc@1 76.872 Acc@5 93.808 loss 1.005 Max accuracy: 76.90% Epoch: [69/300] [ 0/1251] eta: 0:40:35 lr: 0.001752 loss: 3.204933 (3.204933) time: 1.946745 data: 1.053457 max mem: 18817 Epoch: [69/300] [ 50/1251] eta: 0:19:59 lr: 0.001752 loss: 3.525745 (3.568057) time: 0.925736 data: 0.000144 max mem: 18817 Epoch: [69/300] [ 100/1251] eta: 0:19:00 lr: 0.001752 loss: 3.630182 (3.580099) time: 0.969858 data: 0.000165 max mem: 18817 Epoch: [69/300] [ 150/1251] eta: 0:18:03 lr: 0.001751 loss: 3.491220 (3.597178) time: 0.994871 data: 0.000148 max mem: 18817 Epoch: [69/300] [ 200/1251] eta: 0:17:06 lr: 0.001751 loss: 3.593313 (3.539598) time: 0.992878 data: 0.000162 max mem: 18817 Epoch: [69/300] [ 250/1251] eta: 0:16:11 lr: 0.001751 loss: 3.256421 (3.521556) time: 0.905286 data: 0.000162 max mem: 18817 Epoch: [69/300] [ 300/1251] eta: 0:15:22 lr: 0.001751 loss: 3.847232 (3.520985) time: 0.920870 data: 0.000159 max mem: 18817 Epoch: [69/300] [ 350/1251] eta: 0:14:34 lr: 0.001750 loss: 3.546960 (3.519734) time: 0.940656 data: 0.000156 max mem: 18817 Epoch: [69/300] [ 400/1251] eta: 0:13:44 lr: 0.001750 loss: 3.730401 (3.515583) time: 0.986098 data: 0.000156 max mem: 18817 Epoch: [69/300] [ 450/1251] eta: 0:12:54 lr: 0.001750 loss: 3.604869 (3.508160) time: 0.974508 data: 0.000165 max mem: 18817 Epoch: [69/300] [ 500/1251] eta: 0:12:05 lr: 0.001749 loss: 3.532943 (3.505652) time: 0.916977 data: 0.000163 max mem: 18817 Epoch: [69/300] [ 550/1251] eta: 0:11:17 lr: 0.001749 loss: 3.504411 (3.513026) time: 0.916781 data: 0.000175 max mem: 18817 Epoch: [69/300] [ 600/1251] eta: 0:10:29 lr: 0.001749 loss: 3.733301 (3.522401) time: 0.926159 data: 0.000174 max mem: 18817 Epoch: [69/300] [ 650/1251] eta: 0:09:40 lr: 0.001749 loss: 3.273677 (3.523936) time: 0.947901 data: 0.000170 max mem: 18817 Epoch: [69/300] [ 700/1251] eta: 0:08:51 lr: 0.001748 loss: 3.238100 (3.524175) time: 0.982918 data: 0.000175 max mem: 18817 Epoch: [69/300] [ 750/1251] eta: 0:08:02 lr: 0.001748 loss: 3.655451 (3.529051) time: 0.926001 data: 0.000170 max mem: 18817 Epoch: [69/300] [ 800/1251] eta: 0:07:15 lr: 0.001748 loss: 3.747518 (3.527231) time: 0.929953 data: 0.000167 max mem: 18817 Epoch: [69/300] [ 850/1251] eta: 0:06:27 lr: 0.001748 loss: 3.514598 (3.519802) time: 0.938877 data: 0.000196 max mem: 18817 Epoch: [69/300] [ 900/1251] eta: 0:05:39 lr: 0.001747 loss: 3.374179 (3.525544) time: 1.030805 data: 0.000175 max mem: 18817 Epoch: [69/300] [ 950/1251] eta: 0:04:50 lr: 0.001747 loss: 3.609173 (3.531936) time: 0.979867 data: 0.000164 max mem: 18817 Epoch: [69/300] [1000/1251] eta: 0:04:02 lr: 0.001747 loss: 3.662152 (3.532881) time: 0.975304 data: 0.000171 max mem: 18817 Epoch: [69/300] [1050/1251] eta: 0:03:13 lr: 0.001746 loss: 3.672400 (3.539382) time: 0.943435 data: 0.000177 max mem: 18817 Epoch: [69/300] [1100/1251] eta: 0:02:25 lr: 0.001746 loss: 3.593334 (3.536788) time: 0.940259 data: 0.000187 max mem: 18817 Epoch: [69/300] [1150/1251] eta: 0:01:37 lr: 0.001746 loss: 3.445513 (3.538481) time: 0.975750 data: 0.000181 max mem: 18817 Epoch: [69/300] [1200/1251] eta: 0:00:49 lr: 0.001746 loss: 3.095945 (3.533200) time: 0.965140 data: 0.000176 max mem: 18817 Epoch: [69/300] [1250/1251] eta: 0:00:00 lr: 0.001745 loss: 3.589586 (3.537534) time: 0.950539 data: 0.000744 max mem: 18817 Epoch: [69/300] Total time: 0:20:06 (0.964828 s / it) Averaged stats: lr: 0.001745 loss: 3.589586 (3.528109) Test: [ 0/49] eta: 0:01:26 loss: 0.754257 (0.754257) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.764318 data: 1.359527 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.897401 (0.918579) acc1: 78.125000 (78.977273) acc5: 93.750000 (93.750000) time: 0.495374 data: 0.123724 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.986719 (0.945876) acc1: 76.562500 (77.976190) acc5: 93.750000 (93.898810) time: 0.365193 data: 0.000130 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.986050 (0.947636) acc1: 76.562500 (77.116935) acc5: 95.312500 (94.304435) time: 0.362705 data: 0.000141 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.979038 (0.965673) acc1: 76.562500 (76.943598) acc5: 93.750000 (94.169207) time: 0.364095 data: 0.000142 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.986793 (0.965065) acc1: 76.562500 (77.120000) acc5: 93.750000 (94.176000) time: 0.378293 data: 0.000106 max mem: 18817 Test: Total time: 0:00:19 (0.401057 s / it) * Acc@1 77.020 Acc@5 93.930 loss 0.985 Max accuracy: 77.02% Epoch: [70/300] [ 0/1251] eta: 0:42:42 lr: 0.001745 loss: 3.821811 (3.821811) time: 2.048006 data: 1.140526 max mem: 18817 Epoch: [70/300] [ 50/1251] eta: 0:19:37 lr: 0.001745 loss: 3.539614 (3.474986) time: 0.976114 data: 0.000157 max mem: 18817 Epoch: [70/300] [ 100/1251] eta: 0:18:42 lr: 0.001745 loss: 3.496167 (3.477023) time: 1.004768 data: 0.000178 max mem: 18817 Epoch: [70/300] [ 150/1251] eta: 0:17:41 lr: 0.001745 loss: 3.710296 (3.482480) time: 0.959464 data: 0.000182 max mem: 18817 Epoch: [70/300] [ 200/1251] eta: 0:16:52 lr: 0.001744 loss: 3.730968 (3.503535) time: 0.913235 data: 0.000178 max mem: 18817 Epoch: [70/300] [ 250/1251] eta: 0:16:06 lr: 0.001744 loss: 3.528696 (3.503943) time: 0.925866 data: 0.000165 max mem: 18817 Epoch: [70/300] [ 300/1251] eta: 0:15:17 lr: 0.001744 loss: 3.431887 (3.506320) time: 0.978069 data: 0.000195 max mem: 18817 Epoch: [70/300] [ 350/1251] eta: 0:14:29 lr: 0.001743 loss: 3.567938 (3.509316) time: 0.994749 data: 0.000182 max mem: 18817 Epoch: [70/300] [ 400/1251] eta: 0:13:38 lr: 0.001743 loss: 3.446198 (3.507296) time: 0.965675 data: 0.000165 max mem: 18817 Epoch: [70/300] [ 450/1251] eta: 0:12:49 lr: 0.001743 loss: 3.399416 (3.510040) time: 0.922084 data: 0.000167 max mem: 18817 Epoch: [70/300] [ 500/1251] eta: 0:12:02 lr: 0.001743 loss: 3.391394 (3.506206) time: 0.923049 data: 0.000180 max mem: 18817 Epoch: [70/300] [ 550/1251] eta: 0:11:14 lr: 0.001742 loss: 3.521123 (3.497667) time: 0.972677 data: 0.000208 max mem: 18817 Epoch: [70/300] [ 600/1251] eta: 0:10:26 lr: 0.001742 loss: 3.533057 (3.492650) time: 0.994201 data: 0.000184 max mem: 18817 Epoch: [70/300] [ 650/1251] eta: 0:09:37 lr: 0.001742 loss: 3.368362 (3.498978) time: 0.966278 data: 0.000169 max mem: 18817 Epoch: [70/300] [ 700/1251] eta: 0:08:48 lr: 0.001741 loss: 3.668625 (3.497397) time: 0.915288 data: 0.000180 max mem: 18817 Epoch: [70/300] [ 750/1251] eta: 0:08:01 lr: 0.001741 loss: 3.455143 (3.497230) time: 0.927912 data: 0.000167 max mem: 18817 Epoch: [70/300] [ 800/1251] eta: 0:07:13 lr: 0.001741 loss: 3.751811 (3.500636) time: 1.000405 data: 0.000167 max mem: 18817 Epoch: [70/300] [ 850/1251] eta: 0:06:25 lr: 0.001741 loss: 3.563696 (3.507460) time: 0.996793 data: 0.000183 max mem: 18817 Epoch: [70/300] [ 900/1251] eta: 0:05:37 lr: 0.001740 loss: 3.554460 (3.514658) time: 0.964412 data: 0.000191 max mem: 18817 Epoch: [70/300] [ 950/1251] eta: 0:04:48 lr: 0.001740 loss: 3.653433 (3.519451) time: 0.918157 data: 0.000172 max mem: 18817 Epoch: [70/300] [1000/1251] eta: 0:04:00 lr: 0.001740 loss: 3.361507 (3.515570) time: 0.925856 data: 0.000172 max mem: 18817 Epoch: [70/300] [1050/1251] eta: 0:03:12 lr: 0.001739 loss: 3.595005 (3.520964) time: 0.980509 data: 0.000164 max mem: 18817 Epoch: [70/300] [1100/1251] eta: 0:02:25 lr: 0.001739 loss: 3.588996 (3.525592) time: 1.019726 data: 0.000181 max mem: 18817 Epoch: [70/300] [1150/1251] eta: 0:01:37 lr: 0.001739 loss: 3.693332 (3.527267) time: 0.984932 data: 0.000182 max mem: 18817 Epoch: [70/300] [1200/1251] eta: 0:00:48 lr: 0.001739 loss: 3.453840 (3.524354) time: 0.910509 data: 0.000200 max mem: 18817 Epoch: [70/300] [1250/1251] eta: 0:00:00 lr: 0.001738 loss: 3.440551 (3.520783) time: 0.929851 data: 0.000759 max mem: 18817 Epoch: [70/300] Total time: 0:20:02 (0.960913 s / it) Averaged stats: lr: 0.001738 loss: 3.440551 (3.531265) Test: [ 0/49] eta: 0:01:28 loss: 0.738212 (0.738212) acc1: 81.250000 (81.250000) acc5: 96.875000 (96.875000) time: 1.800897 data: 1.395526 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.798434 (0.854411) acc1: 79.687500 (79.829545) acc5: 95.312500 (94.034091) time: 0.498862 data: 0.126993 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.896465 (0.904245) acc1: 78.125000 (78.571429) acc5: 93.750000 (94.122024) time: 0.365690 data: 0.000132 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.952421 (0.907484) acc1: 76.562500 (77.973790) acc5: 93.750000 (94.304435) time: 0.363208 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.940940 (0.930826) acc1: 76.562500 (77.553354) acc5: 93.750000 (94.207317) time: 0.460813 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.977351 (0.931478) acc1: 78.125000 (77.984000) acc5: 93.750000 (94.304000) time: 0.455387 data: 0.000111 max mem: 18817 Test: Total time: 0:00:21 (0.434090 s / it) * Acc@1 77.216 Acc@5 94.136 loss 0.948 Max accuracy: 77.22% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0070.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0070.pth Epoch: [71/300] [ 0/1251] eta: 0:40:26 lr: 0.001738 loss: 2.784865 (2.784865) time: 1.939545 data: 1.047314 max mem: 18817 Epoch: [71/300] [ 50/1251] eta: 0:19:46 lr: 0.001738 loss: 3.377157 (3.438586) time: 0.944035 data: 0.000169 max mem: 18817 Epoch: [71/300] [ 100/1251] eta: 0:18:41 lr: 0.001738 loss: 3.608891 (3.490153) time: 0.971749 data: 0.000173 max mem: 18817 Epoch: [71/300] [ 150/1251] eta: 0:17:43 lr: 0.001737 loss: 3.578114 (3.498079) time: 0.986299 data: 0.000172 max mem: 18817 Epoch: [71/300] [ 200/1251] eta: 0:16:53 lr: 0.001737 loss: 3.236450 (3.483957) time: 0.916206 data: 0.000173 max mem: 18817 Epoch: [71/300] [ 250/1251] eta: 0:16:07 lr: 0.001737 loss: 3.514866 (3.482609) time: 0.921240 data: 0.000174 max mem: 18817 Epoch: [71/300] [ 300/1251] eta: 0:15:20 lr: 0.001737 loss: 3.528249 (3.469423) time: 0.934724 data: 0.000181 max mem: 18817 Epoch: [71/300] [ 350/1251] eta: 0:14:33 lr: 0.001736 loss: 3.246786 (3.465089) time: 0.981175 data: 0.000166 max mem: 18817 Epoch: [71/300] [ 400/1251] eta: 0:13:41 lr: 0.001736 loss: 3.486987 (3.466536) time: 0.973275 data: 0.000178 max mem: 18817 Epoch: [71/300] [ 450/1251] eta: 0:12:52 lr: 0.001736 loss: 3.492617 (3.460087) time: 0.929732 data: 0.000163 max mem: 18817 Epoch: [71/300] [ 500/1251] eta: 0:12:04 lr: 0.001736 loss: 3.456606 (3.467096) time: 0.933570 data: 0.000163 max mem: 18817 Epoch: [71/300] [ 550/1251] eta: 0:11:15 lr: 0.001735 loss: 3.520542 (3.469336) time: 0.934533 data: 0.000163 max mem: 18817 Epoch: [71/300] [ 600/1251] eta: 0:10:27 lr: 0.001735 loss: 3.680678 (3.471773) time: 0.975761 data: 0.000144 max mem: 18817 Epoch: [71/300] [ 650/1251] eta: 0:09:38 lr: 0.001735 loss: 3.783697 (3.484420) time: 0.961838 data: 0.000170 max mem: 18817 Epoch: [71/300] [ 700/1251] eta: 0:08:49 lr: 0.001734 loss: 3.620916 (3.494668) time: 0.915418 data: 0.000163 max mem: 18817 Epoch: [71/300] [ 750/1251] eta: 0:08:01 lr: 0.001734 loss: 3.338939 (3.486864) time: 0.928254 data: 0.000176 max mem: 18817 Epoch: [71/300] [ 800/1251] eta: 0:07:14 lr: 0.001734 loss: 3.817108 (3.499535) time: 0.943774 data: 0.000158 max mem: 18817 Epoch: [71/300] [ 850/1251] eta: 0:06:26 lr: 0.001734 loss: 3.505185 (3.499200) time: 0.988272 data: 0.000169 max mem: 18817 Epoch: [71/300] [ 900/1251] eta: 0:05:37 lr: 0.001733 loss: 3.542408 (3.500486) time: 0.991100 data: 0.000176 max mem: 18817 Epoch: [71/300] [ 950/1251] eta: 0:04:50 lr: 0.001733 loss: 3.739846 (3.501349) time: 0.970240 data: 0.000171 max mem: 18817 Epoch: [71/300] [1000/1251] eta: 0:04:01 lr: 0.001733 loss: 3.565381 (3.505778) time: 0.920469 data: 0.000159 max mem: 18817 Epoch: [71/300] [1050/1251] eta: 0:03:13 lr: 0.001732 loss: 3.308349 (3.499227) time: 0.944318 data: 0.000191 max mem: 18817 Epoch: [71/300] [1100/1251] eta: 0:02:25 lr: 0.001732 loss: 3.613153 (3.500608) time: 1.003851 data: 0.000162 max mem: 18817 Epoch: [71/300] [1150/1251] eta: 0:01:37 lr: 0.001732 loss: 3.608078 (3.504150) time: 0.967613 data: 0.000184 max mem: 18817 Epoch: [71/300] [1200/1251] eta: 0:00:49 lr: 0.001732 loss: 3.368886 (3.498350) time: 0.932125 data: 0.000176 max mem: 18817 Epoch: [71/300] [1250/1251] eta: 0:00:00 lr: 0.001731 loss: 3.455987 (3.495789) time: 0.941115 data: 0.000740 max mem: 18817 Epoch: [71/300] Total time: 0:20:03 (0.962340 s / it) Averaged stats: lr: 0.001731 loss: 3.455987 (3.505872) Test: [ 0/49] eta: 0:01:17 loss: 0.779034 (0.779034) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.586270 data: 1.140251 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.868710 (0.883085) acc1: 81.250000 (79.687500) acc5: 95.312500 (94.460227) time: 0.481660 data: 0.103832 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.938823 (0.922395) acc1: 76.562500 (78.645833) acc5: 93.750000 (94.345238) time: 0.366417 data: 0.000158 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.937407 (0.927959) acc1: 76.562500 (78.477823) acc5: 93.750000 (94.102823) time: 0.456492 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.976794 (0.955037) acc1: 78.125000 (77.743902) acc5: 93.750000 (94.054878) time: 0.455015 data: 0.000150 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.045641 (0.958409) acc1: 76.562500 (77.984000) acc5: 93.750000 (94.080000) time: 0.356144 data: 0.000126 max mem: 18817 Test: Total time: 0:00:20 (0.427162 s / it) * Acc@1 77.084 Acc@5 94.042 loss 0.966 Max accuracy: 77.22% Epoch: [72/300] [ 0/1251] eta: 0:49:17 lr: 0.001731 loss: 3.461227 (3.461227) time: 2.364135 data: 1.475454 max mem: 18817 Epoch: [72/300] [ 50/1251] eta: 0:20:01 lr: 0.001731 loss: 3.684899 (3.575891) time: 0.978537 data: 0.000197 max mem: 18817 Epoch: [72/300] [ 100/1251] eta: 0:18:42 lr: 0.001731 loss: 3.144968 (3.464662) time: 0.985250 data: 0.000186 max mem: 18817 Epoch: [72/300] [ 150/1251] eta: 0:17:47 lr: 0.001730 loss: 3.375875 (3.465124) time: 0.915344 data: 0.000177 max mem: 18817 Epoch: [72/300] [ 200/1251] eta: 0:17:00 lr: 0.001730 loss: 3.697424 (3.483452) time: 0.927936 data: 0.000173 max mem: 18817 Epoch: [72/300] [ 250/1251] eta: 0:16:11 lr: 0.001730 loss: 3.654848 (3.471449) time: 0.930892 data: 0.000176 max mem: 18817 Epoch: [72/300] [ 300/1251] eta: 0:15:25 lr: 0.001730 loss: 3.711108 (3.473697) time: 0.992856 data: 0.000168 max mem: 18817 Epoch: [72/300] [ 350/1251] eta: 0:14:33 lr: 0.001729 loss: 3.768142 (3.486319) time: 0.971724 data: 0.000150 max mem: 18817 Epoch: [72/300] [ 400/1251] eta: 0:13:42 lr: 0.001729 loss: 3.312564 (3.486522) time: 0.923167 data: 0.000186 max mem: 18817 Epoch: [72/300] [ 450/1251] eta: 0:12:55 lr: 0.001729 loss: 3.501889 (3.487256) time: 0.943388 data: 0.000171 max mem: 18817 Epoch: [72/300] [ 500/1251] eta: 0:12:07 lr: 0.001728 loss: 3.376092 (3.494313) time: 0.930676 data: 0.000179 max mem: 18817 Epoch: [72/300] [ 550/1251] eta: 0:11:18 lr: 0.001728 loss: 3.404601 (3.486682) time: 1.010987 data: 0.000171 max mem: 18817 Epoch: [72/300] [ 600/1251] eta: 0:10:29 lr: 0.001728 loss: 3.508985 (3.475926) time: 0.967474 data: 0.000184 max mem: 18817 Epoch: [72/300] [ 650/1251] eta: 0:09:40 lr: 0.001728 loss: 3.614631 (3.490905) time: 0.947272 data: 0.000163 max mem: 18817 Epoch: [72/300] [ 700/1251] eta: 0:08:52 lr: 0.001727 loss: 3.515622 (3.489644) time: 0.931514 data: 0.000179 max mem: 18817 Epoch: [72/300] [ 750/1251] eta: 0:08:03 lr: 0.001727 loss: 3.649068 (3.490325) time: 0.936474 data: 0.000176 max mem: 18817 Epoch: [72/300] [ 800/1251] eta: 0:07:15 lr: 0.001727 loss: 3.853013 (3.497462) time: 0.992221 data: 0.000172 max mem: 18817 Epoch: [72/300] [ 850/1251] eta: 0:06:26 lr: 0.001726 loss: 3.685672 (3.502907) time: 0.975994 data: 0.000170 max mem: 18817 Epoch: [72/300] [ 900/1251] eta: 0:05:38 lr: 0.001726 loss: 3.730660 (3.504762) time: 0.951046 data: 0.000172 max mem: 18817 Epoch: [72/300] [ 950/1251] eta: 0:04:50 lr: 0.001726 loss: 3.305640 (3.499793) time: 0.933724 data: 0.000161 max mem: 18817 Epoch: [72/300] [1000/1251] eta: 0:04:02 lr: 0.001726 loss: 3.423769 (3.502441) time: 0.929144 data: 0.000180 max mem: 18817 Epoch: [72/300] [1050/1251] eta: 0:03:14 lr: 0.001725 loss: 3.619071 (3.501832) time: 1.006635 data: 0.000190 max mem: 18817 Epoch: [72/300] [1100/1251] eta: 0:02:25 lr: 0.001725 loss: 3.872739 (3.512937) time: 1.038013 data: 0.000169 max mem: 18817 Epoch: [72/300] [1150/1251] eta: 0:01:37 lr: 0.001725 loss: 3.162298 (3.507419) time: 0.984189 data: 0.000184 max mem: 18817 Epoch: [72/300] [1200/1251] eta: 0:00:49 lr: 0.001724 loss: 3.714722 (3.509193) time: 0.918751 data: 0.000169 max mem: 18817 Epoch: [72/300] [1250/1251] eta: 0:00:00 lr: 0.001724 loss: 3.560108 (3.510583) time: 0.939361 data: 0.000785 max mem: 18817 Epoch: [72/300] Total time: 0:20:08 (0.966370 s / it) Averaged stats: lr: 0.001724 loss: 3.560108 (3.509986) Test: [ 0/49] eta: 0:01:31 loss: 0.729894 (0.729894) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.872069 data: 1.449433 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.812534 (0.859733) acc1: 81.250000 (79.829545) acc5: 95.312500 (94.602273) time: 0.506470 data: 0.131898 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.982174 (0.937717) acc1: 75.000000 (77.827381) acc5: 93.750000 (93.898810) time: 0.366354 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.966889 (0.930093) acc1: 75.000000 (77.469758) acc5: 93.750000 (94.304435) time: 0.363137 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.966712 (0.950060) acc1: 75.000000 (77.172256) acc5: 93.750000 (94.207317) time: 0.360604 data: 0.000133 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.993396 (0.956566) acc1: 73.437500 (76.992000) acc5: 93.750000 (94.176000) time: 0.446733 data: 0.000109 max mem: 18817 Test: Total time: 0:00:21 (0.433239 s / it) * Acc@1 77.158 Acc@5 94.086 loss 0.953 Max accuracy: 77.22% Epoch: [73/300] [ 0/1251] eta: 0:43:56 lr: 0.001724 loss: 3.228401 (3.228401) time: 2.107675 data: 1.156794 max mem: 18817 Epoch: [73/300] [ 50/1251] eta: 0:19:27 lr: 0.001724 loss: 3.369839 (3.323007) time: 0.983150 data: 0.000165 max mem: 18817 Epoch: [73/300] [ 100/1251] eta: 0:18:25 lr: 0.001724 loss: 3.319942 (3.371977) time: 0.917484 data: 0.000164 max mem: 18817 Epoch: [73/300] [ 150/1251] eta: 0:17:39 lr: 0.001723 loss: 3.409220 (3.438566) time: 0.932893 data: 0.000178 max mem: 18817 Epoch: [73/300] [ 200/1251] eta: 0:16:53 lr: 0.001723 loss: 3.342140 (3.431068) time: 0.932118 data: 0.000164 max mem: 18817 Epoch: [73/300] [ 250/1251] eta: 0:16:04 lr: 0.001723 loss: 3.586986 (3.459596) time: 0.965982 data: 0.000156 max mem: 18817 Epoch: [73/300] [ 300/1251] eta: 0:15:13 lr: 0.001722 loss: 3.207804 (3.459468) time: 0.977387 data: 0.000179 max mem: 18817 Epoch: [73/300] [ 350/1251] eta: 0:14:25 lr: 0.001722 loss: 3.478691 (3.462307) time: 0.937766 data: 0.000172 max mem: 18817 Epoch: [73/300] [ 400/1251] eta: 0:13:36 lr: 0.001722 loss: 3.603078 (3.472073) time: 0.927869 data: 0.000171 max mem: 18817 Epoch: [73/300] [ 450/1251] eta: 0:12:49 lr: 0.001722 loss: 3.323887 (3.458010) time: 0.928218 data: 0.000169 max mem: 18817 Epoch: [73/300] [ 500/1251] eta: 0:12:02 lr: 0.001721 loss: 3.458402 (3.453255) time: 0.996029 data: 0.000160 max mem: 18817 Epoch: [73/300] [ 550/1251] eta: 0:11:13 lr: 0.001721 loss: 3.234180 (3.441763) time: 0.969547 data: 0.000169 max mem: 18817 Epoch: [73/300] [ 600/1251] eta: 0:10:24 lr: 0.001721 loss: 3.261622 (3.441285) time: 0.915545 data: 0.000175 max mem: 18817 Epoch: [73/300] [ 650/1251] eta: 0:09:36 lr: 0.001720 loss: 3.641711 (3.445189) time: 0.929681 data: 0.000168 max mem: 18817 Epoch: [73/300] [ 700/1251] eta: 0:08:49 lr: 0.001720 loss: 3.629702 (3.453250) time: 0.929269 data: 0.000174 max mem: 18817 Epoch: [73/300] [ 750/1251] eta: 0:08:01 lr: 0.001720 loss: 3.596159 (3.454252) time: 0.994695 data: 0.000165 max mem: 18817 Epoch: [73/300] [ 800/1251] eta: 0:07:13 lr: 0.001720 loss: 3.413412 (3.453948) time: 0.972622 data: 0.000171 max mem: 18817 Epoch: [73/300] [ 850/1251] eta: 0:06:25 lr: 0.001719 loss: 3.807975 (3.461339) time: 0.935070 data: 0.000179 max mem: 18817 Epoch: [73/300] [ 900/1251] eta: 0:05:37 lr: 0.001719 loss: 3.739666 (3.467937) time: 0.935031 data: 0.000184 max mem: 18817 Epoch: [73/300] [ 950/1251] eta: 0:04:49 lr: 0.001719 loss: 3.484105 (3.471044) time: 0.917205 data: 0.000164 max mem: 18817 Epoch: [73/300] [1000/1251] eta: 0:04:00 lr: 0.001718 loss: 3.686678 (3.468704) time: 0.954497 data: 0.000165 max mem: 18817 Epoch: [73/300] [1050/1251] eta: 0:03:12 lr: 0.001718 loss: 3.391428 (3.467673) time: 0.967850 data: 0.000179 max mem: 18817 Epoch: [73/300] [1100/1251] eta: 0:02:24 lr: 0.001718 loss: 3.400992 (3.465690) time: 0.917050 data: 0.000171 max mem: 18817 Epoch: [73/300] [1150/1251] eta: 0:01:36 lr: 0.001717 loss: 3.592963 (3.469524) time: 0.924547 data: 0.000162 max mem: 18817 Epoch: [73/300] [1200/1251] eta: 0:00:48 lr: 0.001717 loss: 3.600709 (3.466666) time: 0.952260 data: 0.000187 max mem: 18817 Epoch: [73/300] [1250/1251] eta: 0:00:00 lr: 0.001717 loss: 3.489771 (3.465998) time: 0.996554 data: 0.000749 max mem: 18817 Epoch: [73/300] Total time: 0:20:01 (0.960579 s / it) Averaged stats: lr: 0.001717 loss: 3.489771 (3.468949) Test: [ 0/49] eta: 0:01:29 loss: 0.820933 (0.820933) acc1: 78.125000 (78.125000) acc5: 96.875000 (96.875000) time: 1.832633 data: 1.427895 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.875208 (0.879699) acc1: 76.562500 (78.977273) acc5: 93.750000 (94.176136) time: 0.503011 data: 0.129940 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.935739 (0.911338) acc1: 76.562500 (78.273810) acc5: 93.750000 (94.196429) time: 0.375915 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.933473 (0.914931) acc1: 76.562500 (78.175403) acc5: 95.312500 (94.606855) time: 0.374576 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.933473 (0.921592) acc1: 76.562500 (77.934451) acc5: 95.312500 (94.550305) time: 0.368464 data: 0.000148 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.989018 (0.926434) acc1: 76.562500 (77.696000) acc5: 95.312500 (94.528000) time: 0.366067 data: 0.000121 max mem: 18817 Test: Total time: 0:00:19 (0.402045 s / it) * Acc@1 77.522 Acc@5 94.144 loss 0.931 Max accuracy: 77.52% Epoch: [74/300] [ 0/1251] eta: 0:41:12 lr: 0.001717 loss: 3.671171 (3.671171) time: 1.976782 data: 1.089411 max mem: 18817 Epoch: [74/300] [ 50/1251] eta: 0:19:23 lr: 0.001717 loss: 3.373285 (3.349223) time: 0.919222 data: 0.000170 max mem: 18817 Epoch: [74/300] [ 100/1251] eta: 0:18:37 lr: 0.001716 loss: 3.644487 (3.460572) time: 0.926611 data: 0.000160 max mem: 18817 Epoch: [74/300] [ 150/1251] eta: 0:17:49 lr: 0.001716 loss: 3.615132 (3.486390) time: 0.994200 data: 0.000181 max mem: 18817 Epoch: [74/300] [ 200/1251] eta: 0:17:00 lr: 0.001716 loss: 3.670339 (3.489818) time: 1.033366 data: 0.000167 max mem: 18817 Epoch: [74/300] [ 250/1251] eta: 0:16:09 lr: 0.001715 loss: 3.259412 (3.452248) time: 0.986543 data: 0.000177 max mem: 18817 Epoch: [74/300] [ 300/1251] eta: 0:15:17 lr: 0.001715 loss: 3.603302 (3.444179) time: 0.918103 data: 0.000154 max mem: 18817 Epoch: [74/300] [ 350/1251] eta: 0:14:29 lr: 0.001715 loss: 3.626161 (3.463204) time: 0.929142 data: 0.000175 max mem: 18817 Epoch: [74/300] [ 400/1251] eta: 0:13:42 lr: 0.001715 loss: 3.686635 (3.490835) time: 0.982167 data: 0.000223 max mem: 18817 Epoch: [74/300] [ 450/1251] eta: 0:12:54 lr: 0.001714 loss: 3.804600 (3.507426) time: 0.999628 data: 0.000170 max mem: 18817 Epoch: [74/300] [ 500/1251] eta: 0:12:04 lr: 0.001714 loss: 3.269994 (3.496683) time: 0.958376 data: 0.000167 max mem: 18817 Epoch: [74/300] [ 550/1251] eta: 0:11:15 lr: 0.001714 loss: 3.730349 (3.512047) time: 0.931160 data: 0.000181 max mem: 18817 Epoch: [74/300] [ 600/1251] eta: 0:10:28 lr: 0.001713 loss: 3.450356 (3.511093) time: 0.941167 data: 0.000183 max mem: 18817 Epoch: [74/300] [ 650/1251] eta: 0:09:40 lr: 0.001713 loss: 3.168287 (3.502540) time: 0.993143 data: 0.000157 max mem: 18817 Epoch: [74/300] [ 700/1251] eta: 0:08:51 lr: 0.001713 loss: 3.789653 (3.500310) time: 0.987599 data: 0.000189 max mem: 18817 Epoch: [74/300] [ 750/1251] eta: 0:08:02 lr: 0.001713 loss: 3.389118 (3.493513) time: 0.957689 data: 0.000180 max mem: 18817 Epoch: [74/300] [ 800/1251] eta: 0:07:14 lr: 0.001712 loss: 3.198667 (3.494163) time: 0.918665 data: 0.000179 max mem: 18817 Epoch: [74/300] [ 850/1251] eta: 0:06:26 lr: 0.001712 loss: 3.287481 (3.486046) time: 0.933287 data: 0.000189 max mem: 18817 Epoch: [74/300] [ 900/1251] eta: 0:05:37 lr: 0.001712 loss: 3.629749 (3.493504) time: 0.984616 data: 0.000171 max mem: 18817 Epoch: [74/300] [ 950/1251] eta: 0:04:49 lr: 0.001711 loss: 3.546651 (3.498022) time: 0.999651 data: 0.000175 max mem: 18817 Epoch: [74/300] [1000/1251] eta: 0:04:01 lr: 0.001711 loss: 3.398687 (3.493979) time: 0.973676 data: 0.000182 max mem: 18817 Epoch: [74/300] [1050/1251] eta: 0:03:13 lr: 0.001711 loss: 3.371171 (3.490567) time: 0.916125 data: 0.000201 max mem: 18817 Epoch: [74/300] [1100/1251] eta: 0:02:25 lr: 0.001710 loss: 3.725827 (3.499042) time: 0.927874 data: 0.000214 max mem: 18817 Epoch: [74/300] [1150/1251] eta: 0:01:37 lr: 0.001710 loss: 3.715487 (3.501475) time: 0.991514 data: 0.000178 max mem: 18817 Epoch: [74/300] [1200/1251] eta: 0:00:49 lr: 0.001710 loss: 3.345196 (3.497055) time: 0.982921 data: 0.000177 max mem: 18817 Epoch: [74/300] [1250/1251] eta: 0:00:00 lr: 0.001710 loss: 3.736511 (3.501550) time: 0.958430 data: 0.000762 max mem: 18817 Epoch: [74/300] Total time: 0:20:03 (0.962240 s / it) Averaged stats: lr: 0.001710 loss: 3.736511 (3.503356) Test: [ 0/49] eta: 0:01:28 loss: 0.802451 (0.802451) acc1: 82.812500 (82.812500) acc5: 93.750000 (93.750000) time: 1.806609 data: 1.391366 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.801482 (0.851422) acc1: 78.125000 (79.403409) acc5: 93.750000 (94.034091) time: 0.498391 data: 0.126613 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.920058 (0.905094) acc1: 76.562500 (77.827381) acc5: 93.750000 (94.196429) time: 0.372542 data: 0.000128 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.965721 (0.915476) acc1: 75.000000 (77.419355) acc5: 95.312500 (94.354839) time: 0.401514 data: 0.000124 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.965721 (0.933634) acc1: 76.562500 (77.362805) acc5: 93.750000 (94.169207) time: 0.407481 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.967778 (0.937929) acc1: 76.562500 (77.248000) acc5: 93.750000 (94.176000) time: 0.374922 data: 0.000103 max mem: 18817 Test: Total time: 0:00:20 (0.416039 s / it) * Acc@1 77.468 Acc@5 94.186 loss 0.948 Max accuracy: 77.52% Epoch: [75/300] [ 0/1251] eta: 0:39:40 lr: 0.001710 loss: 4.127228 (4.127228) time: 1.902858 data: 1.016911 max mem: 18817 Epoch: [75/300] [ 50/1251] eta: 0:19:53 lr: 0.001709 loss: 3.715409 (3.534128) time: 0.929189 data: 0.000178 max mem: 18817 Epoch: [75/300] [ 100/1251] eta: 0:18:53 lr: 0.001709 loss: 3.677329 (3.505808) time: 0.995692 data: 0.000177 max mem: 18817 Epoch: [75/300] [ 150/1251] eta: 0:17:56 lr: 0.001709 loss: 3.630815 (3.496798) time: 1.012667 data: 0.000181 max mem: 18817 Epoch: [75/300] [ 200/1251] eta: 0:16:56 lr: 0.001708 loss: 3.683146 (3.506068) time: 0.968626 data: 0.000173 max mem: 18817 Epoch: [75/300] [ 250/1251] eta: 0:16:04 lr: 0.001708 loss: 3.624550 (3.493075) time: 0.926046 data: 0.000176 max mem: 18817 Epoch: [75/300] [ 300/1251] eta: 0:15:16 lr: 0.001708 loss: 3.696493 (3.506883) time: 0.927576 data: 0.000172 max mem: 18817 Epoch: [75/300] [ 350/1251] eta: 0:14:27 lr: 0.001708 loss: 3.551670 (3.501522) time: 0.953350 data: 0.000162 max mem: 18817 Epoch: [75/300] [ 400/1251] eta: 0:13:38 lr: 0.001707 loss: 3.611563 (3.501572) time: 1.022109 data: 0.000183 max mem: 18817 Epoch: [75/300] [ 450/1251] eta: 0:12:48 lr: 0.001707 loss: 3.586633 (3.499948) time: 0.964037 data: 0.000159 max mem: 18817 Epoch: [75/300] [ 500/1251] eta: 0:11:59 lr: 0.001707 loss: 3.667567 (3.504505) time: 0.920984 data: 0.000185 max mem: 18817 Epoch: [75/300] [ 550/1251] eta: 0:11:12 lr: 0.001706 loss: 3.541958 (3.505655) time: 0.931556 data: 0.000166 max mem: 18817 Epoch: [75/300] [ 600/1251] eta: 0:10:25 lr: 0.001706 loss: 3.634403 (3.511952) time: 0.996367 data: 0.000174 max mem: 18817 Epoch: [75/300] [ 650/1251] eta: 0:09:37 lr: 0.001706 loss: 3.516407 (3.517830) time: 1.035035 data: 0.000162 max mem: 18817 Epoch: [75/300] [ 700/1251] eta: 0:08:48 lr: 0.001705 loss: 3.612093 (3.519911) time: 0.959305 data: 0.000178 max mem: 18817 Epoch: [75/300] [ 750/1251] eta: 0:08:00 lr: 0.001705 loss: 3.291806 (3.509306) time: 0.917524 data: 0.000172 max mem: 18817 Epoch: [75/300] [ 800/1251] eta: 0:07:12 lr: 0.001705 loss: 3.739028 (3.517078) time: 0.924845 data: 0.000180 max mem: 18817 Epoch: [75/300] [ 850/1251] eta: 0:06:24 lr: 0.001705 loss: 3.583825 (3.518665) time: 0.987277 data: 0.000170 max mem: 18817 Epoch: [75/300] [ 900/1251] eta: 0:05:36 lr: 0.001704 loss: 3.697239 (3.516645) time: 1.000294 data: 0.000180 max mem: 18817 Epoch: [75/300] [ 950/1251] eta: 0:04:48 lr: 0.001704 loss: 3.770991 (3.515156) time: 0.964953 data: 0.000179 max mem: 18817 Epoch: [75/300] [1000/1251] eta: 0:04:00 lr: 0.001704 loss: 3.760340 (3.518180) time: 0.933094 data: 0.000166 max mem: 18817 Epoch: [75/300] [1050/1251] eta: 0:03:12 lr: 0.001703 loss: 3.743759 (3.525075) time: 0.936614 data: 0.000177 max mem: 18817 Epoch: [75/300] [1100/1251] eta: 0:02:25 lr: 0.001703 loss: 3.379972 (3.519281) time: 0.981141 data: 0.000180 max mem: 18817 Epoch: [75/300] [1150/1251] eta: 0:01:36 lr: 0.001703 loss: 3.327958 (3.515574) time: 1.003093 data: 0.000161 max mem: 18817 Epoch: [75/300] [1200/1251] eta: 0:00:48 lr: 0.001703 loss: 3.501942 (3.513827) time: 0.967838 data: 0.000193 max mem: 18817 Epoch: [75/300] [1250/1251] eta: 0:00:00 lr: 0.001702 loss: 3.684829 (3.511992) time: 0.916676 data: 0.000897 max mem: 18817 Epoch: [75/300] Total time: 0:19:59 (0.959158 s / it) Averaged stats: lr: 0.001702 loss: 3.684829 (3.522920) Test: [ 0/49] eta: 0:01:17 loss: 0.700220 (0.700220) acc1: 82.812500 (82.812500) acc5: 95.312500 (95.312500) time: 1.571472 data: 1.094931 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.846268 (0.873812) acc1: 78.125000 (78.409091) acc5: 93.750000 (94.034091) time: 0.482114 data: 0.099714 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.919812 (0.906897) acc1: 78.125000 (78.050595) acc5: 93.750000 (93.750000) time: 0.389748 data: 0.000171 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.913727 (0.899117) acc1: 78.125000 (78.225806) acc5: 93.750000 (94.052419) time: 0.451636 data: 0.000158 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.913727 (0.927640) acc1: 75.000000 (77.553354) acc5: 93.750000 (93.788110) time: 0.428161 data: 0.000154 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.995705 (0.934618) acc1: 75.000000 (77.568000) acc5: 93.750000 (93.792000) time: 0.356043 data: 0.000115 max mem: 18817 Test: Total time: 0:00:20 (0.426663 s / it) * Acc@1 77.228 Acc@5 94.136 loss 0.933 Max accuracy: 77.52% Epoch: [76/300] [ 0/1251] eta: 0:41:06 lr: 0.001702 loss: 4.130037 (4.130037) time: 1.971952 data: 1.059121 max mem: 18817 Epoch: [76/300] [ 50/1251] eta: 0:20:08 lr: 0.001702 loss: 3.515500 (3.558446) time: 0.942027 data: 0.000192 max mem: 18817 Epoch: [76/300] [ 100/1251] eta: 0:18:54 lr: 0.001702 loss: 3.781648 (3.550392) time: 0.987197 data: 0.000157 max mem: 18817 Epoch: [76/300] [ 150/1251] eta: 0:17:58 lr: 0.001701 loss: 3.425323 (3.552356) time: 1.022578 data: 0.000172 max mem: 18817 Epoch: [76/300] [ 200/1251] eta: 0:16:59 lr: 0.001701 loss: 3.700838 (3.539755) time: 0.973415 data: 0.000165 max mem: 18817 Epoch: [76/300] [ 250/1251] eta: 0:16:05 lr: 0.001701 loss: 3.366968 (3.502544) time: 0.915408 data: 0.000169 max mem: 18817 Epoch: [76/300] [ 300/1251] eta: 0:15:19 lr: 0.001700 loss: 3.227491 (3.484627) time: 0.932197 data: 0.000170 max mem: 18817 Epoch: [76/300] [ 350/1251] eta: 0:14:30 lr: 0.001700 loss: 3.445735 (3.482262) time: 0.990319 data: 0.000172 max mem: 18817 Epoch: [76/300] [ 400/1251] eta: 0:13:42 lr: 0.001700 loss: 3.423898 (3.462048) time: 1.039780 data: 0.000164 max mem: 18817 Epoch: [76/300] [ 450/1251] eta: 0:12:52 lr: 0.001700 loss: 3.402076 (3.469718) time: 0.973671 data: 0.000167 max mem: 18817 Epoch: [76/300] [ 500/1251] eta: 0:12:02 lr: 0.001699 loss: 3.719858 (3.474910) time: 0.927167 data: 0.000171 max mem: 18817 Epoch: [76/300] [ 550/1251] eta: 0:11:15 lr: 0.001699 loss: 3.744492 (3.483040) time: 0.920392 data: 0.000169 max mem: 18817 Epoch: [76/300] [ 600/1251] eta: 0:10:27 lr: 0.001699 loss: 3.585490 (3.474643) time: 0.994850 data: 0.000171 max mem: 18817 Epoch: [76/300] [ 650/1251] eta: 0:09:38 lr: 0.001698 loss: 3.389189 (3.477500) time: 1.005073 data: 0.000168 max mem: 18817 Epoch: [76/300] [ 700/1251] eta: 0:08:49 lr: 0.001698 loss: 3.398479 (3.485342) time: 0.967002 data: 0.000162 max mem: 18817 Epoch: [76/300] [ 750/1251] eta: 0:08:01 lr: 0.001698 loss: 3.735545 (3.490664) time: 0.924240 data: 0.000158 max mem: 18817 Epoch: [76/300] [ 800/1251] eta: 0:07:12 lr: 0.001697 loss: 3.478283 (3.491698) time: 0.919362 data: 0.000171 max mem: 18817 Epoch: [76/300] [ 850/1251] eta: 0:06:25 lr: 0.001697 loss: 3.461976 (3.493974) time: 0.974284 data: 0.000169 max mem: 18817 Epoch: [76/300] [ 900/1251] eta: 0:05:37 lr: 0.001697 loss: 3.786551 (3.493615) time: 1.071898 data: 0.000177 max mem: 18817 Epoch: [76/300] [ 950/1251] eta: 0:04:49 lr: 0.001697 loss: 3.431698 (3.484021) time: 0.966083 data: 0.000172 max mem: 18817 Epoch: [76/300] [1000/1251] eta: 0:04:01 lr: 0.001696 loss: 3.504667 (3.486996) time: 0.929873 data: 0.000163 max mem: 18817 Epoch: [76/300] [1050/1251] eta: 0:03:12 lr: 0.001696 loss: 3.691667 (3.489823) time: 0.920720 data: 0.000182 max mem: 18817 Epoch: [76/300] [1100/1251] eta: 0:02:24 lr: 0.001696 loss: 3.558048 (3.492742) time: 0.972699 data: 0.000188 max mem: 18817 Epoch: [76/300] [1150/1251] eta: 0:01:36 lr: 0.001695 loss: 3.203804 (3.488250) time: 1.022937 data: 0.000171 max mem: 18817 Epoch: [76/300] [1200/1251] eta: 0:00:48 lr: 0.001695 loss: 3.655303 (3.487941) time: 0.964208 data: 0.000177 max mem: 18817 Epoch: [76/300] [1250/1251] eta: 0:00:00 lr: 0.001695 loss: 3.324132 (3.486199) time: 0.930300 data: 0.000755 max mem: 18817 Epoch: [76/300] Total time: 0:20:00 (0.959705 s / it) Averaged stats: lr: 0.001695 loss: 3.324132 (3.482392) Test: [ 0/49] eta: 0:01:17 loss: 0.740996 (0.740996) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.590319 data: 1.152203 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.856241 (0.882237) acc1: 78.125000 (79.971591) acc5: 95.312500 (94.744318) time: 0.478052 data: 0.104898 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.906602 (0.917059) acc1: 78.125000 (78.943452) acc5: 93.750000 (94.047619) time: 0.377132 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.927325 (0.918673) acc1: 78.125000 (78.528226) acc5: 95.312500 (94.707661) time: 0.454473 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.935525 (0.943175) acc1: 78.125000 (78.086890) acc5: 95.312500 (94.474085) time: 0.439743 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.007459 (0.945425) acc1: 76.562500 (78.208000) acc5: 95.312500 (94.432000) time: 0.355354 data: 0.000100 max mem: 18817 Test: Total time: 0:00:20 (0.426087 s / it) * Acc@1 77.536 Acc@5 94.280 loss 0.966 Max accuracy: 77.54% Epoch: [77/300] [ 0/1251] eta: 0:41:15 lr: 0.001695 loss: 2.532485 (2.532485) time: 1.978507 data: 1.079031 max mem: 18817 Epoch: [77/300] [ 50/1251] eta: 0:19:45 lr: 0.001694 loss: 3.542902 (3.358231) time: 0.989245 data: 0.000168 max mem: 18817 Epoch: [77/300] [ 100/1251] eta: 0:18:46 lr: 0.001694 loss: 3.743611 (3.514359) time: 1.029564 data: 0.000174 max mem: 18817 Epoch: [77/300] [ 150/1251] eta: 0:17:42 lr: 0.001694 loss: 3.600494 (3.520802) time: 0.951613 data: 0.000180 max mem: 18817 Epoch: [77/300] [ 200/1251] eta: 0:16:50 lr: 0.001694 loss: 3.638216 (3.518869) time: 0.924961 data: 0.000160 max mem: 18817 Epoch: [77/300] [ 250/1251] eta: 0:16:05 lr: 0.001693 loss: 3.589551 (3.513763) time: 0.923660 data: 0.000187 max mem: 18817 Epoch: [77/300] [ 300/1251] eta: 0:15:16 lr: 0.001693 loss: 3.593323 (3.516199) time: 0.993101 data: 0.000177 max mem: 18817 Epoch: [77/300] [ 350/1251] eta: 0:14:26 lr: 0.001693 loss: 3.744745 (3.520500) time: 1.017724 data: 0.000184 max mem: 18817 Epoch: [77/300] [ 400/1251] eta: 0:13:36 lr: 0.001692 loss: 3.411973 (3.523217) time: 0.976546 data: 0.000168 max mem: 18817 Epoch: [77/300] [ 450/1251] eta: 0:12:47 lr: 0.001692 loss: 3.572576 (3.530310) time: 0.929426 data: 0.000177 max mem: 18817 Epoch: [77/300] [ 500/1251] eta: 0:12:00 lr: 0.001692 loss: 3.449574 (3.532583) time: 0.931743 data: 0.000172 max mem: 18817 Epoch: [77/300] [ 550/1251] eta: 0:11:13 lr: 0.001691 loss: 3.660053 (3.520055) time: 1.013833 data: 0.000184 max mem: 18817 Epoch: [77/300] [ 600/1251] eta: 0:10:26 lr: 0.001691 loss: 3.702940 (3.520277) time: 1.031902 data: 0.000183 max mem: 18817 Epoch: [77/300] [ 650/1251] eta: 0:09:37 lr: 0.001691 loss: 3.481836 (3.513599) time: 0.989970 data: 0.000167 max mem: 18817 Epoch: [77/300] [ 700/1251] eta: 0:08:49 lr: 0.001691 loss: 3.777141 (3.512869) time: 0.931716 data: 0.000172 max mem: 18817 Epoch: [77/300] [ 750/1251] eta: 0:08:02 lr: 0.001690 loss: 3.641597 (3.513862) time: 0.935185 data: 0.000172 max mem: 18817 Epoch: [77/300] [ 800/1251] eta: 0:07:14 lr: 0.001690 loss: 3.470644 (3.510513) time: 1.001871 data: 0.000180 max mem: 18817 Epoch: [77/300] [ 850/1251] eta: 0:06:26 lr: 0.001690 loss: 3.485694 (3.507212) time: 1.036958 data: 0.000183 max mem: 18817 Epoch: [77/300] [ 900/1251] eta: 0:05:37 lr: 0.001689 loss: 3.463876 (3.502059) time: 0.975418 data: 0.000179 max mem: 18817 Epoch: [77/300] [ 950/1251] eta: 0:04:49 lr: 0.001689 loss: 3.475863 (3.501316) time: 0.905487 data: 0.000180 max mem: 18817 Epoch: [77/300] [1000/1251] eta: 0:04:01 lr: 0.001689 loss: 3.397145 (3.502800) time: 0.923803 data: 0.000162 max mem: 18817 Epoch: [77/300] [1050/1251] eta: 0:03:13 lr: 0.001688 loss: 3.544379 (3.506253) time: 0.970733 data: 0.000174 max mem: 18817 Epoch: [77/300] [1100/1251] eta: 0:02:25 lr: 0.001688 loss: 3.385229 (3.504254) time: 1.022438 data: 0.000169 max mem: 18817 Epoch: [77/300] [1150/1251] eta: 0:01:37 lr: 0.001688 loss: 3.522355 (3.502670) time: 0.993695 data: 0.000175 max mem: 18817 Epoch: [77/300] [1200/1251] eta: 0:00:49 lr: 0.001688 loss: 3.261073 (3.494043) time: 0.903681 data: 0.000175 max mem: 18817 Epoch: [77/300] [1250/1251] eta: 0:00:00 lr: 0.001687 loss: 3.720044 (3.499232) time: 0.929134 data: 0.000743 max mem: 18817 Epoch: [77/300] Total time: 0:20:04 (0.963002 s / it) Averaged stats: lr: 0.001687 loss: 3.720044 (3.505223) Test: [ 0/49] eta: 0:01:23 loss: 0.755335 (0.755335) acc1: 81.250000 (81.250000) acc5: 96.875000 (96.875000) time: 1.694162 data: 1.277825 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.866085 (0.910174) acc1: 81.250000 (78.551136) acc5: 95.312500 (94.318182) time: 0.488943 data: 0.116327 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.934261 (0.945169) acc1: 76.562500 (77.752976) acc5: 93.750000 (94.047619) time: 0.364350 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.980094 (0.946409) acc1: 76.562500 (77.822581) acc5: 93.750000 (94.254032) time: 0.361440 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 1.004242 (0.967004) acc1: 76.562500 (77.553354) acc5: 93.750000 (94.321646) time: 0.455964 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.020966 (0.964806) acc1: 76.562500 (77.664000) acc5: 95.312500 (94.272000) time: 0.451049 data: 0.000107 max mem: 18817 Test: Total time: 0:00:21 (0.429633 s / it) * Acc@1 77.314 Acc@5 94.078 loss 0.970 Max accuracy: 77.54% Epoch: [78/300] [ 0/1251] eta: 0:43:08 lr: 0.001687 loss: 3.981498 (3.981498) time: 2.068878 data: 1.158638 max mem: 18817 Epoch: [78/300] [ 50/1251] eta: 0:19:38 lr: 0.001687 loss: 3.464801 (3.580312) time: 0.979766 data: 0.000159 max mem: 18817 Epoch: [78/300] [ 100/1251] eta: 0:18:34 lr: 0.001687 loss: 3.675469 (3.522548) time: 1.008042 data: 0.000153 max mem: 18817 Epoch: [78/300] [ 150/1251] eta: 0:17:44 lr: 0.001686 loss: 3.434914 (3.517547) time: 0.972848 data: 0.000170 max mem: 18817 Epoch: [78/300] [ 200/1251] eta: 0:16:51 lr: 0.001686 loss: 3.129986 (3.470035) time: 0.918060 data: 0.000162 max mem: 18817 Epoch: [78/300] [ 250/1251] eta: 0:16:05 lr: 0.001686 loss: 3.389240 (3.459168) time: 0.937080 data: 0.000156 max mem: 18817 Epoch: [78/300] [ 300/1251] eta: 0:15:17 lr: 0.001685 loss: 3.758673 (3.470218) time: 0.996385 data: 0.000162 max mem: 18817 Epoch: [78/300] [ 350/1251] eta: 0:14:28 lr: 0.001685 loss: 3.696244 (3.450117) time: 1.010319 data: 0.000157 max mem: 18817 Epoch: [78/300] [ 400/1251] eta: 0:13:38 lr: 0.001685 loss: 3.517854 (3.460502) time: 0.966713 data: 0.000167 max mem: 18817 Epoch: [78/300] [ 450/1251] eta: 0:12:49 lr: 0.001685 loss: 3.466392 (3.459853) time: 0.921918 data: 0.000175 max mem: 18817 Epoch: [78/300] [ 500/1251] eta: 0:12:02 lr: 0.001684 loss: 3.756343 (3.466203) time: 0.926710 data: 0.000175 max mem: 18817 Epoch: [78/300] [ 550/1251] eta: 0:11:14 lr: 0.001684 loss: 3.200597 (3.461408) time: 0.975804 data: 0.000177 max mem: 18817 Epoch: [78/300] [ 600/1251] eta: 0:10:27 lr: 0.001684 loss: 3.510151 (3.453374) time: 1.059790 data: 0.000177 max mem: 18817 Epoch: [78/300] [ 650/1251] eta: 0:09:38 lr: 0.001683 loss: 3.491742 (3.450617) time: 0.978370 data: 0.000152 max mem: 18817 Epoch: [78/300] [ 700/1251] eta: 0:08:49 lr: 0.001683 loss: 3.687743 (3.456841) time: 0.927684 data: 0.000161 max mem: 18817 Epoch: [78/300] [ 750/1251] eta: 0:08:01 lr: 0.001683 loss: 3.695956 (3.465404) time: 0.930082 data: 0.000180 max mem: 18817 Epoch: [78/300] [ 800/1251] eta: 0:07:13 lr: 0.001682 loss: 3.703897 (3.473134) time: 1.004973 data: 0.000179 max mem: 18817 Epoch: [78/300] [ 850/1251] eta: 0:06:25 lr: 0.001682 loss: 3.432502 (3.471542) time: 1.032293 data: 0.000163 max mem: 18817 Epoch: [78/300] [ 900/1251] eta: 0:05:37 lr: 0.001682 loss: 3.618100 (3.480086) time: 0.984270 data: 0.000169 max mem: 18817 Epoch: [78/300] [ 950/1251] eta: 0:04:49 lr: 0.001681 loss: 3.684048 (3.484049) time: 0.935356 data: 0.000166 max mem: 18817 Epoch: [78/300] [1000/1251] eta: 0:04:01 lr: 0.001681 loss: 3.534992 (3.485045) time: 0.951516 data: 0.000162 max mem: 18817 Epoch: [78/300] [1050/1251] eta: 0:03:13 lr: 0.001681 loss: 3.588600 (3.487476) time: 0.965059 data: 0.000176 max mem: 18817 Epoch: [78/300] [1100/1251] eta: 0:02:25 lr: 0.001681 loss: 3.419052 (3.485979) time: 1.039556 data: 0.000168 max mem: 18817 Epoch: [78/300] [1150/1251] eta: 0:01:37 lr: 0.001680 loss: 3.488043 (3.489219) time: 0.987733 data: 0.000176 max mem: 18817 Epoch: [78/300] [1200/1251] eta: 0:00:49 lr: 0.001680 loss: 3.588318 (3.489744) time: 0.925144 data: 0.000163 max mem: 18817 Epoch: [78/300] [1250/1251] eta: 0:00:00 lr: 0.001680 loss: 3.420088 (3.489864) time: 0.927179 data: 0.000757 max mem: 18817 Epoch: [78/300] Total time: 0:20:04 (0.962871 s / it) Averaged stats: lr: 0.001680 loss: 3.420088 (3.486755) Test: [ 0/49] eta: 0:01:22 loss: 0.679303 (0.679303) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.687690 data: 1.287927 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.848616 (0.878661) acc1: 79.687500 (80.539773) acc5: 93.750000 (94.460227) time: 0.488932 data: 0.117239 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.941040 (0.933260) acc1: 76.562500 (78.422619) acc5: 93.750000 (94.270833) time: 0.366240 data: 0.000154 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.941040 (0.931430) acc1: 76.562500 (78.074597) acc5: 93.750000 (94.354839) time: 0.363125 data: 0.000140 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.958623 (0.957950) acc1: 76.562500 (77.743902) acc5: 93.750000 (94.092988) time: 0.361785 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.976448 (0.957129) acc1: 76.562500 (77.760000) acc5: 95.312500 (94.208000) time: 0.465119 data: 0.000102 max mem: 18817 Test: Total time: 0:00:21 (0.435604 s / it) * Acc@1 77.770 Acc@5 94.186 loss 0.972 Max accuracy: 77.77% Epoch: [79/300] [ 0/1251] eta: 0:44:00 lr: 0.001680 loss: 2.871933 (2.871933) time: 2.110592 data: 1.090268 max mem: 18817 Epoch: [79/300] [ 50/1251] eta: 0:19:30 lr: 0.001679 loss: 3.372602 (3.413213) time: 0.979164 data: 0.000166 max mem: 18817 Epoch: [79/300] [ 100/1251] eta: 0:18:49 lr: 0.001679 loss: 3.404709 (3.398531) time: 0.984518 data: 0.000166 max mem: 18817 Epoch: [79/300] [ 150/1251] eta: 0:17:44 lr: 0.001679 loss: 3.570625 (3.420723) time: 0.926402 data: 0.000169 max mem: 18817 Epoch: [79/300] [ 200/1251] eta: 0:16:55 lr: 0.001678 loss: 3.425344 (3.448093) time: 0.935500 data: 0.000174 max mem: 18817 Epoch: [79/300] [ 250/1251] eta: 0:16:06 lr: 0.001678 loss: 3.399850 (3.432768) time: 0.983227 data: 0.000180 max mem: 18817 Epoch: [79/300] [ 300/1251] eta: 0:15:14 lr: 0.001678 loss: 3.681427 (3.447096) time: 0.973873 data: 0.000161 max mem: 18817 Epoch: [79/300] [ 350/1251] eta: 0:14:26 lr: 0.001677 loss: 3.475832 (3.446622) time: 0.938919 data: 0.000172 max mem: 18817 Epoch: [79/300] [ 400/1251] eta: 0:13:38 lr: 0.001677 loss: 3.223115 (3.437865) time: 0.952132 data: 0.000179 max mem: 18817 Epoch: [79/300] [ 450/1251] eta: 0:12:51 lr: 0.001677 loss: 3.608469 (3.465546) time: 0.932045 data: 0.000186 max mem: 18817 Epoch: [79/300] [ 500/1251] eta: 0:12:03 lr: 0.001677 loss: 3.462706 (3.463601) time: 0.975911 data: 0.000174 max mem: 18817 Epoch: [79/300] [ 550/1251] eta: 0:11:14 lr: 0.001676 loss: 3.741820 (3.472220) time: 0.971575 data: 0.000180 max mem: 18817 Epoch: [79/300] [ 600/1251] eta: 0:10:26 lr: 0.001676 loss: 3.800479 (3.487572) time: 0.965043 data: 0.000177 max mem: 18817 Epoch: [79/300] [ 650/1251] eta: 0:09:37 lr: 0.001676 loss: 3.230455 (3.494814) time: 0.918308 data: 0.000153 max mem: 18817 Epoch: [79/300] [ 700/1251] eta: 0:08:49 lr: 0.001675 loss: 3.402779 (3.497391) time: 0.929962 data: 0.000171 max mem: 18817 Epoch: [79/300] [ 750/1251] eta: 0:08:01 lr: 0.001675 loss: 3.454783 (3.491438) time: 0.971971 data: 0.000179 max mem: 18817 Epoch: [79/300] [ 800/1251] eta: 0:07:13 lr: 0.001675 loss: 3.613226 (3.486144) time: 0.971657 data: 0.000181 max mem: 18817 Epoch: [79/300] [ 850/1251] eta: 0:06:25 lr: 0.001674 loss: 3.682326 (3.489074) time: 0.973677 data: 0.000169 max mem: 18817 Epoch: [79/300] [ 900/1251] eta: 0:05:37 lr: 0.001674 loss: 3.490910 (3.480243) time: 0.928224 data: 0.000165 max mem: 18817 Epoch: [79/300] [ 950/1251] eta: 0:04:49 lr: 0.001674 loss: 3.547127 (3.474431) time: 0.926154 data: 0.000168 max mem: 18817 Epoch: [79/300] [1000/1251] eta: 0:04:01 lr: 0.001673 loss: 3.673279 (3.475293) time: 0.982876 data: 0.000163 max mem: 18817 Epoch: [79/300] [1050/1251] eta: 0:03:12 lr: 0.001673 loss: 3.499156 (3.475498) time: 0.963335 data: 0.000182 max mem: 18817 Epoch: [79/300] [1100/1251] eta: 0:02:24 lr: 0.001673 loss: 3.464349 (3.477464) time: 0.924422 data: 0.000179 max mem: 18817 Epoch: [79/300] [1150/1251] eta: 0:01:36 lr: 0.001673 loss: 3.771549 (3.480138) time: 0.933026 data: 0.000195 max mem: 18817 Epoch: [79/300] [1200/1251] eta: 0:00:48 lr: 0.001672 loss: 3.375244 (3.476584) time: 0.931380 data: 0.000176 max mem: 18817 Epoch: [79/300] [1250/1251] eta: 0:00:00 lr: 0.001672 loss: 3.621526 (3.472789) time: 0.956297 data: 0.000762 max mem: 18817 Epoch: [79/300] Total time: 0:20:01 (0.960182 s / it) Averaged stats: lr: 0.001672 loss: 3.621526 (3.483170) Test: [ 0/49] eta: 0:01:30 loss: 0.810613 (0.810613) acc1: 82.812500 (82.812500) acc5: 95.312500 (95.312500) time: 1.851399 data: 1.452708 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.810613 (0.851778) acc1: 79.687500 (80.113636) acc5: 95.312500 (94.602273) time: 0.505799 data: 0.132200 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.902309 (0.894979) acc1: 78.125000 (78.125000) acc5: 95.312500 (94.717262) time: 0.369345 data: 0.000140 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.914315 (0.908044) acc1: 76.562500 (77.772177) acc5: 95.312500 (94.657258) time: 0.364693 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.963740 (0.928277) acc1: 78.125000 (77.553354) acc5: 95.312500 (94.664634) time: 0.361359 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.990117 (0.931844) acc1: 76.562500 (77.632000) acc5: 93.750000 (94.624000) time: 0.356731 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.396928 s / it) * Acc@1 77.630 Acc@5 94.316 loss 0.933 Max accuracy: 77.77% Epoch: [80/300] [ 0/1251] eta: 0:39:46 lr: 0.001672 loss: 2.520907 (2.520907) time: 1.907436 data: 1.013399 max mem: 18817 Epoch: [80/300] [ 50/1251] eta: 0:19:24 lr: 0.001672 loss: 3.590971 (3.378232) time: 0.925626 data: 0.000188 max mem: 18817 Epoch: [80/300] [ 100/1251] eta: 0:18:34 lr: 0.001671 loss: 3.708229 (3.451908) time: 0.926506 data: 0.000187 max mem: 18817 Epoch: [80/300] [ 150/1251] eta: 0:17:49 lr: 0.001671 loss: 3.297247 (3.458668) time: 1.000164 data: 0.000199 max mem: 18817 Epoch: [80/300] [ 200/1251] eta: 0:16:58 lr: 0.001671 loss: 3.790867 (3.488786) time: 1.025306 data: 0.000184 max mem: 18817 Epoch: [80/300] [ 250/1251] eta: 0:16:04 lr: 0.001670 loss: 3.764946 (3.496651) time: 0.963272 data: 0.000189 max mem: 18817 Epoch: [80/300] [ 300/1251] eta: 0:15:14 lr: 0.001670 loss: 3.386432 (3.486971) time: 0.934377 data: 0.000168 max mem: 18817 Epoch: [80/300] [ 350/1251] eta: 0:14:28 lr: 0.001670 loss: 3.203706 (3.473684) time: 0.924377 data: 0.000148 max mem: 18817 Epoch: [80/300] [ 400/1251] eta: 0:13:40 lr: 0.001669 loss: 3.391473 (3.459008) time: 0.972597 data: 0.000198 max mem: 18817 Epoch: [80/300] [ 450/1251] eta: 0:12:54 lr: 0.001669 loss: 3.531689 (3.449491) time: 1.065033 data: 0.000181 max mem: 18817 Epoch: [80/300] [ 500/1251] eta: 0:12:04 lr: 0.001669 loss: 3.443984 (3.439366) time: 0.980657 data: 0.000172 max mem: 18817 Epoch: [80/300] [ 550/1251] eta: 0:11:15 lr: 0.001669 loss: 3.226085 (3.438307) time: 0.933414 data: 0.000179 max mem: 18817 Epoch: [80/300] [ 600/1251] eta: 0:10:27 lr: 0.001668 loss: 3.594621 (3.439754) time: 0.932592 data: 0.000197 max mem: 18817 Epoch: [80/300] [ 650/1251] eta: 0:09:40 lr: 0.001668 loss: 3.713960 (3.439469) time: 0.997732 data: 0.000184 max mem: 18817 Epoch: [80/300] [ 700/1251] eta: 0:08:52 lr: 0.001668 loss: 3.467275 (3.439283) time: 1.016890 data: 0.000184 max mem: 18817 Epoch: [80/300] [ 750/1251] eta: 0:08:03 lr: 0.001667 loss: 3.572477 (3.445854) time: 0.977960 data: 0.000170 max mem: 18817 Epoch: [80/300] [ 800/1251] eta: 0:07:14 lr: 0.001667 loss: 3.478547 (3.449035) time: 0.920831 data: 0.000187 max mem: 18817 Epoch: [80/300] [ 850/1251] eta: 0:06:26 lr: 0.001667 loss: 3.438073 (3.441572) time: 0.933602 data: 0.000174 max mem: 18817 Epoch: [80/300] [ 900/1251] eta: 0:05:38 lr: 0.001666 loss: 3.618941 (3.438597) time: 0.986869 data: 0.000178 max mem: 18817 Epoch: [80/300] [ 950/1251] eta: 0:04:50 lr: 0.001666 loss: 3.345597 (3.439387) time: 0.994612 data: 0.000166 max mem: 18817 Epoch: [80/300] [1000/1251] eta: 0:04:01 lr: 0.001666 loss: 3.565884 (3.441287) time: 0.975521 data: 0.000190 max mem: 18817 Epoch: [80/300] [1050/1251] eta: 0:03:13 lr: 0.001665 loss: 3.366504 (3.440104) time: 0.913118 data: 0.000193 max mem: 18817 Epoch: [80/300] [1100/1251] eta: 0:02:25 lr: 0.001665 loss: 3.571603 (3.443974) time: 0.920509 data: 0.000177 max mem: 18817 Epoch: [80/300] [1150/1251] eta: 0:01:37 lr: 0.001665 loss: 3.631454 (3.438560) time: 0.974867 data: 0.000160 max mem: 18817 Epoch: [80/300] [1200/1251] eta: 0:00:49 lr: 0.001665 loss: 3.441261 (3.441806) time: 1.001207 data: 0.000166 max mem: 18817 Epoch: [80/300] [1250/1251] eta: 0:00:00 lr: 0.001664 loss: 3.850805 (3.450226) time: 0.980177 data: 0.000748 max mem: 18817 Epoch: [80/300] Total time: 0:20:05 (0.963429 s / it) Averaged stats: lr: 0.001664 loss: 3.850805 (3.465460) Test: [ 0/49] eta: 0:01:27 loss: 0.716621 (0.716621) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.776159 data: 1.354619 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.830546 (0.873056) acc1: 78.125000 (79.403409) acc5: 95.312500 (95.028409) time: 0.495884 data: 0.123306 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.926437 (0.917015) acc1: 76.562500 (78.125000) acc5: 93.750000 (94.717262) time: 0.365159 data: 0.000155 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.955069 (0.922298) acc1: 76.562500 (77.973790) acc5: 93.750000 (94.606855) time: 0.384039 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.952438 (0.933270) acc1: 76.562500 (77.934451) acc5: 93.750000 (94.435976) time: 0.402004 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.955069 (0.932831) acc1: 79.687500 (77.952000) acc5: 93.750000 (94.496000) time: 0.375270 data: 0.000107 max mem: 18817 Test: Total time: 0:00:20 (0.409549 s / it) * Acc@1 77.804 Acc@5 94.284 loss 0.934 Max accuracy: 77.80% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0080.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0080.pth Epoch: [81/300] [ 0/1251] eta: 0:47:23 lr: 0.001664 loss: 3.179083 (3.179083) time: 2.273040 data: 1.305503 max mem: 18817 Epoch: [81/300] [ 50/1251] eta: 0:19:23 lr: 0.001664 loss: 3.250899 (3.460076) time: 0.977103 data: 0.000175 max mem: 18817 Epoch: [81/300] [ 100/1251] eta: 0:18:28 lr: 0.001664 loss: 3.644504 (3.473148) time: 0.929039 data: 0.000169 max mem: 18817 Epoch: [81/300] [ 150/1251] eta: 0:17:50 lr: 0.001663 loss: 3.567111 (3.447476) time: 0.939453 data: 0.000167 max mem: 18817 Epoch: [81/300] [ 200/1251] eta: 0:17:01 lr: 0.001663 loss: 3.345404 (3.460099) time: 1.007325 data: 0.000189 max mem: 18817 Epoch: [81/300] [ 250/1251] eta: 0:16:10 lr: 0.001663 loss: 3.446882 (3.459390) time: 1.028304 data: 0.000185 max mem: 18817 Epoch: [81/300] [ 300/1251] eta: 0:15:18 lr: 0.001662 loss: 3.674304 (3.464221) time: 0.978035 data: 0.000160 max mem: 18817 Epoch: [81/300] [ 350/1251] eta: 0:14:26 lr: 0.001662 loss: 3.580850 (3.457952) time: 0.923081 data: 0.000264 max mem: 18817 Epoch: [81/300] [ 400/1251] eta: 0:13:40 lr: 0.001662 loss: 3.700331 (3.465820) time: 0.924368 data: 0.000220 max mem: 18817 Epoch: [81/300] [ 450/1251] eta: 0:12:51 lr: 0.001661 loss: 3.618554 (3.464132) time: 0.973766 data: 0.000210 max mem: 18817 Epoch: [81/300] [ 500/1251] eta: 0:12:04 lr: 0.001661 loss: 3.576796 (3.468188) time: 1.044370 data: 0.000232 max mem: 18817 Epoch: [81/300] [ 550/1251] eta: 0:11:14 lr: 0.001661 loss: 3.552330 (3.460557) time: 0.965108 data: 0.000211 max mem: 18817 Epoch: [81/300] [ 600/1251] eta: 0:10:25 lr: 0.001660 loss: 3.541917 (3.469907) time: 0.925192 data: 0.000214 max mem: 18817 Epoch: [81/300] [ 650/1251] eta: 0:09:38 lr: 0.001660 loss: 3.607967 (3.477813) time: 0.925186 data: 0.000216 max mem: 18817 Epoch: [81/300] [ 700/1251] eta: 0:08:50 lr: 0.001660 loss: 3.259846 (3.479380) time: 0.981697 data: 0.000205 max mem: 18817 Epoch: [81/300] [ 750/1251] eta: 0:08:02 lr: 0.001660 loss: 3.322406 (3.471605) time: 1.061367 data: 0.000207 max mem: 18817 Epoch: [81/300] [ 800/1251] eta: 0:07:13 lr: 0.001659 loss: 3.458968 (3.465052) time: 0.973459 data: 0.000217 max mem: 18817 Epoch: [81/300] [ 850/1251] eta: 0:06:25 lr: 0.001659 loss: 3.740986 (3.476824) time: 0.931304 data: 0.000207 max mem: 18817 Epoch: [81/300] [ 900/1251] eta: 0:05:38 lr: 0.001659 loss: 3.629033 (3.477108) time: 0.939803 data: 0.000217 max mem: 18817 Epoch: [81/300] [ 950/1251] eta: 0:04:49 lr: 0.001658 loss: 3.536996 (3.473561) time: 0.990668 data: 0.000216 max mem: 18817 Epoch: [81/300] [1000/1251] eta: 0:04:01 lr: 0.001658 loss: 3.490302 (3.472596) time: 1.039215 data: 0.000197 max mem: 18817 Epoch: [81/300] [1050/1251] eta: 0:03:13 lr: 0.001658 loss: 3.379479 (3.473175) time: 0.969150 data: 0.000226 max mem: 18817 Epoch: [81/300] [1100/1251] eta: 0:02:25 lr: 0.001657 loss: 3.343688 (3.465022) time: 0.918758 data: 0.000203 max mem: 18817 Epoch: [81/300] [1150/1251] eta: 0:01:37 lr: 0.001657 loss: 3.314605 (3.459806) time: 0.916854 data: 0.000210 max mem: 18817 Epoch: [81/300] [1200/1251] eta: 0:00:49 lr: 0.001657 loss: 3.709647 (3.464093) time: 0.972627 data: 0.000223 max mem: 18817 Epoch: [81/300] [1250/1251] eta: 0:00:00 lr: 0.001656 loss: 3.595191 (3.467185) time: 1.023875 data: 0.000913 max mem: 18817 Epoch: [81/300] Total time: 0:20:04 (0.962823 s / it) Averaged stats: lr: 0.001656 loss: 3.595191 (3.465308) Test: [ 0/49] eta: 0:01:25 loss: 0.820824 (0.820824) acc1: 82.812500 (82.812500) acc5: 95.312500 (95.312500) time: 1.753541 data: 1.279494 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.820824 (0.882401) acc1: 78.125000 (78.693182) acc5: 93.750000 (94.176136) time: 0.505181 data: 0.116466 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.931799 (0.922207) acc1: 78.125000 (77.976190) acc5: 93.750000 (93.824405) time: 0.370982 data: 0.000154 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.958576 (0.923783) acc1: 76.562500 (78.024194) acc5: 93.750000 (94.304435) time: 0.372789 data: 0.000149 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.965709 (0.941206) acc1: 76.562500 (77.934451) acc5: 95.312500 (94.207317) time: 0.378612 data: 0.000135 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.998991 (0.938972) acc1: 75.000000 (77.952000) acc5: 95.312500 (94.432000) time: 0.369575 data: 0.000107 max mem: 18817 Test: Total time: 0:00:19 (0.401804 s / it) * Acc@1 77.946 Acc@5 94.430 loss 0.933 Max accuracy: 77.95% Epoch: [82/300] [ 0/1251] eta: 0:42:21 lr: 0.001656 loss: 2.212616 (2.212616) time: 2.031263 data: 1.132530 max mem: 18817 Epoch: [82/300] [ 50/1251] eta: 0:19:21 lr: 0.001656 loss: 3.678835 (3.511184) time: 0.928338 data: 0.000188 max mem: 18817 Epoch: [82/300] [ 100/1251] eta: 0:18:38 lr: 0.001656 loss: 3.485274 (3.440358) time: 0.923916 data: 0.000169 max mem: 18817 Epoch: [82/300] [ 150/1251] eta: 0:17:47 lr: 0.001655 loss: 3.527285 (3.459276) time: 0.985832 data: 0.000162 max mem: 18817 Epoch: [82/300] [ 200/1251] eta: 0:16:59 lr: 0.001655 loss: 3.548185 (3.465648) time: 1.042581 data: 0.000160 max mem: 18817 Epoch: [82/300] [ 250/1251] eta: 0:16:08 lr: 0.001655 loss: 3.462522 (3.452164) time: 0.986327 data: 0.000181 max mem: 18817 Epoch: [82/300] [ 300/1251] eta: 0:15:16 lr: 0.001654 loss: 3.421242 (3.454702) time: 0.912939 data: 0.000177 max mem: 18817 Epoch: [82/300] [ 350/1251] eta: 0:14:27 lr: 0.001654 loss: 3.631320 (3.452399) time: 0.923183 data: 0.000157 max mem: 18817 Epoch: [82/300] [ 400/1251] eta: 0:13:40 lr: 0.001654 loss: 3.485779 (3.455045) time: 0.998919 data: 0.000187 max mem: 18817 Epoch: [82/300] [ 450/1251] eta: 0:12:52 lr: 0.001654 loss: 3.440395 (3.453387) time: 1.042718 data: 0.000166 max mem: 18817 Epoch: [82/300] [ 500/1251] eta: 0:12:03 lr: 0.001653 loss: 3.539344 (3.446505) time: 0.965479 data: 0.000187 max mem: 18817 Epoch: [82/300] [ 550/1251] eta: 0:11:14 lr: 0.001653 loss: 3.763488 (3.450812) time: 0.916390 data: 0.000170 max mem: 18817 Epoch: [82/300] [ 600/1251] eta: 0:10:26 lr: 0.001653 loss: 3.216929 (3.445982) time: 0.945097 data: 0.000198 max mem: 18817 Epoch: [82/300] [ 650/1251] eta: 0:09:38 lr: 0.001652 loss: 3.411273 (3.455120) time: 0.996915 data: 0.000162 max mem: 18817 Epoch: [82/300] [ 700/1251] eta: 0:08:50 lr: 0.001652 loss: 3.577436 (3.456567) time: 1.015903 data: 0.000179 max mem: 18817 Epoch: [82/300] [ 750/1251] eta: 0:08:02 lr: 0.001652 loss: 3.582782 (3.457308) time: 0.967406 data: 0.000179 max mem: 18817 Epoch: [82/300] [ 800/1251] eta: 0:07:13 lr: 0.001651 loss: 3.600764 (3.456057) time: 0.939089 data: 0.000184 max mem: 18817 Epoch: [82/300] [ 850/1251] eta: 0:06:25 lr: 0.001651 loss: 3.751361 (3.469033) time: 0.934371 data: 0.000172 max mem: 18817 Epoch: [82/300] [ 900/1251] eta: 0:05:37 lr: 0.001651 loss: 3.506256 (3.464084) time: 0.991759 data: 0.000166 max mem: 18817 Epoch: [82/300] [ 950/1251] eta: 0:04:49 lr: 0.001650 loss: 3.457385 (3.465185) time: 1.014903 data: 0.000172 max mem: 18817 Epoch: [82/300] [1000/1251] eta: 0:04:01 lr: 0.001650 loss: 3.599517 (3.465685) time: 0.984947 data: 0.000179 max mem: 18817 Epoch: [82/300] [1050/1251] eta: 0:03:13 lr: 0.001650 loss: 3.480563 (3.461043) time: 0.931958 data: 0.000183 max mem: 18817 Epoch: [82/300] [1100/1251] eta: 0:02:25 lr: 0.001649 loss: 3.509282 (3.461395) time: 0.931576 data: 0.000168 max mem: 18817 Epoch: [82/300] [1150/1251] eta: 0:01:37 lr: 0.001649 loss: 3.517616 (3.457338) time: 0.990678 data: 0.000173 max mem: 18817 Epoch: [82/300] [1200/1251] eta: 0:00:49 lr: 0.001649 loss: 3.505857 (3.451912) time: 1.022408 data: 0.000174 max mem: 18817 Epoch: [82/300] [1250/1251] eta: 0:00:00 lr: 0.001648 loss: 3.540956 (3.451387) time: 0.966777 data: 0.000802 max mem: 18817 Epoch: [82/300] Total time: 0:20:02 (0.961528 s / it) Averaged stats: lr: 0.001648 loss: 3.540956 (3.461549) Test: [ 0/49] eta: 0:01:13 loss: 0.759860 (0.759860) acc1: 79.687500 (79.687500) acc5: 95.312500 (95.312500) time: 1.495463 data: 1.099951 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.876830 (0.892016) acc1: 79.687500 (78.835227) acc5: 93.750000 (93.892045) time: 0.477003 data: 0.100140 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.917204 (0.917059) acc1: 75.000000 (77.678571) acc5: 93.750000 (93.973214) time: 0.368593 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.917204 (0.902525) acc1: 76.562500 (78.074597) acc5: 95.312500 (94.556452) time: 0.364847 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.919675 (0.915200) acc1: 78.125000 (78.086890) acc5: 95.312500 (94.435976) time: 0.375099 data: 0.000131 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.943762 (0.919903) acc1: 78.125000 (78.048000) acc5: 93.750000 (94.272000) time: 0.384225 data: 0.000101 max mem: 18817 Test: Total time: 0:00:19 (0.399745 s / it) * Acc@1 77.928 Acc@5 94.374 loss 0.920 Max accuracy: 77.95% Epoch: [83/300] [ 0/1251] eta: 0:42:16 lr: 0.001648 loss: 3.114371 (3.114371) time: 2.027481 data: 1.124313 max mem: 18817 Epoch: [83/300] [ 50/1251] eta: 0:19:52 lr: 0.001648 loss: 3.599184 (3.481991) time: 0.949181 data: 0.000172 max mem: 18817 Epoch: [83/300] [ 100/1251] eta: 0:18:42 lr: 0.001648 loss: 3.669170 (3.545021) time: 0.970468 data: 0.000160 max mem: 18817 Epoch: [83/300] [ 150/1251] eta: 0:17:41 lr: 0.001648 loss: 3.536495 (3.499901) time: 0.968820 data: 0.000177 max mem: 18817 Epoch: [83/300] [ 200/1251] eta: 0:16:49 lr: 0.001647 loss: 3.359333 (3.486622) time: 0.910040 data: 0.000170 max mem: 18817 Epoch: [83/300] [ 250/1251] eta: 0:16:00 lr: 0.001647 loss: 3.540864 (3.497809) time: 0.919088 data: 0.000165 max mem: 18817 Epoch: [83/300] [ 300/1251] eta: 0:15:14 lr: 0.001647 loss: 3.542858 (3.487202) time: 0.932264 data: 0.000161 max mem: 18817 Epoch: [83/300] [ 350/1251] eta: 0:14:26 lr: 0.001646 loss: 3.561019 (3.471824) time: 0.972398 data: 0.000170 max mem: 18817 Epoch: [83/300] [ 400/1251] eta: 0:13:37 lr: 0.001646 loss: 3.237819 (3.465850) time: 0.975874 data: 0.000175 max mem: 18817 Epoch: [83/300] [ 450/1251] eta: 0:12:48 lr: 0.001646 loss: 3.321174 (3.459701) time: 0.914373 data: 0.000185 max mem: 18817 Epoch: [83/300] [ 500/1251] eta: 0:12:00 lr: 0.001645 loss: 3.395342 (3.451903) time: 0.923288 data: 0.000160 max mem: 18817 Epoch: [83/300] [ 550/1251] eta: 0:11:12 lr: 0.001645 loss: 3.630417 (3.452624) time: 0.947093 data: 0.000192 max mem: 18817 Epoch: [83/300] [ 600/1251] eta: 0:10:25 lr: 0.001645 loss: 3.363993 (3.451641) time: 0.994105 data: 0.000183 max mem: 18817 Epoch: [83/300] [ 650/1251] eta: 0:09:36 lr: 0.001644 loss: 3.516666 (3.459348) time: 0.972089 data: 0.000168 max mem: 18817 Epoch: [83/300] [ 700/1251] eta: 0:08:48 lr: 0.001644 loss: 3.430265 (3.455085) time: 0.914097 data: 0.000161 max mem: 18817 Epoch: [83/300] [ 750/1251] eta: 0:08:01 lr: 0.001644 loss: 3.500910 (3.454381) time: 0.929290 data: 0.000178 max mem: 18817 Epoch: [83/300] [ 800/1251] eta: 0:07:13 lr: 0.001643 loss: 3.517976 (3.449547) time: 0.947859 data: 0.000209 max mem: 18817 Epoch: [83/300] [ 850/1251] eta: 0:06:25 lr: 0.001643 loss: 3.406845 (3.445093) time: 1.004812 data: 0.000186 max mem: 18817 Epoch: [83/300] [ 900/1251] eta: 0:05:37 lr: 0.001643 loss: 3.554585 (3.448769) time: 0.956892 data: 0.000186 max mem: 18817 Epoch: [83/300] [ 950/1251] eta: 0:04:49 lr: 0.001642 loss: 3.529330 (3.449302) time: 0.931681 data: 0.000165 max mem: 18817 Epoch: [83/300] [1000/1251] eta: 0:04:01 lr: 0.001642 loss: 3.420073 (3.450992) time: 0.915047 data: 0.000179 max mem: 18817 Epoch: [83/300] [1050/1251] eta: 0:03:13 lr: 0.001642 loss: 3.335352 (3.445174) time: 0.937855 data: 0.000186 max mem: 18817 Epoch: [83/300] [1100/1251] eta: 0:02:25 lr: 0.001641 loss: 3.465007 (3.443733) time: 0.967432 data: 0.000170 max mem: 18817 Epoch: [83/300] [1150/1251] eta: 0:01:36 lr: 0.001641 loss: 3.793397 (3.445300) time: 0.979764 data: 0.000199 max mem: 18817 Epoch: [83/300] [1200/1251] eta: 0:00:48 lr: 0.001641 loss: 3.449950 (3.446354) time: 0.915549 data: 0.000185 max mem: 18817 Epoch: [83/300] [1250/1251] eta: 0:00:00 lr: 0.001641 loss: 3.529026 (3.445239) time: 0.914620 data: 0.000757 max mem: 18817 Epoch: [83/300] Total time: 0:20:00 (0.959329 s / it) Averaged stats: lr: 0.001641 loss: 3.529026 (3.439176) Test: [ 0/49] eta: 0:01:26 loss: 0.818556 (0.818556) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.759686 data: 1.326293 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.814803 (0.839651) acc1: 78.125000 (79.687500) acc5: 95.312500 (94.460227) time: 0.495868 data: 0.120743 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.897193 (0.898865) acc1: 76.562500 (78.125000) acc5: 93.750000 (94.345238) time: 0.365958 data: 0.000155 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.928490 (0.896907) acc1: 76.562500 (78.175403) acc5: 93.750000 (94.657258) time: 0.374729 data: 0.000125 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.915816 (0.918119) acc1: 78.125000 (77.934451) acc5: 93.750000 (94.359756) time: 0.461367 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.007039 (0.919922) acc1: 76.562500 (77.920000) acc5: 93.750000 (94.528000) time: 0.451677 data: 0.000100 max mem: 18817 Test: Total time: 0:00:21 (0.434900 s / it) * Acc@1 78.158 Acc@5 94.516 loss 0.917 Max accuracy: 78.16% Epoch: [84/300] [ 0/1251] eta: 0:42:24 lr: 0.001641 loss: 3.389396 (3.389396) time: 2.033948 data: 1.136127 max mem: 18817 Epoch: [84/300] [ 50/1251] eta: 0:19:40 lr: 0.001640 loss: 3.603832 (3.378947) time: 0.970182 data: 0.000167 max mem: 18817 Epoch: [84/300] [ 100/1251] eta: 0:18:28 lr: 0.001640 loss: 3.334957 (3.409140) time: 0.975122 data: 0.000161 max mem: 18817 Epoch: [84/300] [ 150/1251] eta: 0:17:33 lr: 0.001640 loss: 3.634907 (3.447085) time: 0.912717 data: 0.000167 max mem: 18817 Epoch: [84/300] [ 200/1251] eta: 0:16:48 lr: 0.001639 loss: 3.536651 (3.439318) time: 0.927768 data: 0.000169 max mem: 18817 Epoch: [84/300] [ 250/1251] eta: 0:16:03 lr: 0.001639 loss: 3.505592 (3.402381) time: 0.969594 data: 0.000177 max mem: 18817 Epoch: [84/300] [ 300/1251] eta: 0:15:16 lr: 0.001639 loss: 3.394914 (3.404647) time: 0.964418 data: 0.000166 max mem: 18817 Epoch: [84/300] [ 350/1251] eta: 0:14:27 lr: 0.001638 loss: 3.461799 (3.419084) time: 0.987994 data: 0.000160 max mem: 18817 Epoch: [84/300] [ 400/1251] eta: 0:13:38 lr: 0.001638 loss: 3.582297 (3.429476) time: 0.924589 data: 0.000184 max mem: 18817 Epoch: [84/300] [ 450/1251] eta: 0:12:50 lr: 0.001638 loss: 3.637197 (3.434221) time: 0.929257 data: 0.000177 max mem: 18817 Epoch: [84/300] [ 500/1251] eta: 0:12:04 lr: 0.001637 loss: 3.533533 (3.442694) time: 0.945513 data: 0.000172 max mem: 18817 Epoch: [84/300] [ 550/1251] eta: 0:11:15 lr: 0.001637 loss: 3.463830 (3.440669) time: 0.972490 data: 0.000170 max mem: 18817 Epoch: [84/300] [ 600/1251] eta: 0:10:26 lr: 0.001637 loss: 3.424943 (3.435600) time: 0.974329 data: 0.000180 max mem: 18817 Epoch: [84/300] [ 650/1251] eta: 0:09:37 lr: 0.001636 loss: 3.369959 (3.437742) time: 0.907261 data: 0.000170 max mem: 18817 Epoch: [84/300] [ 700/1251] eta: 0:08:50 lr: 0.001636 loss: 3.766153 (3.452161) time: 0.920804 data: 0.000152 max mem: 18817 Epoch: [84/300] [ 750/1251] eta: 0:08:02 lr: 0.001636 loss: 3.551575 (3.454786) time: 0.942536 data: 0.000180 max mem: 18817 Epoch: [84/300] [ 800/1251] eta: 0:07:14 lr: 0.001635 loss: 3.601187 (3.453002) time: 0.977186 data: 0.000175 max mem: 18817 Epoch: [84/300] [ 850/1251] eta: 0:06:25 lr: 0.001635 loss: 3.660204 (3.459904) time: 0.966146 data: 0.000163 max mem: 18817 Epoch: [84/300] [ 900/1251] eta: 0:05:37 lr: 0.001635 loss: 3.565645 (3.462498) time: 0.917518 data: 0.000162 max mem: 18817 Epoch: [84/300] [ 950/1251] eta: 0:04:49 lr: 0.001634 loss: 3.560533 (3.461157) time: 0.942339 data: 0.000176 max mem: 18817 Epoch: [84/300] [1000/1251] eta: 0:04:01 lr: 0.001634 loss: 3.544874 (3.462856) time: 0.925854 data: 0.000176 max mem: 18817 Epoch: [84/300] [1050/1251] eta: 0:03:13 lr: 0.001634 loss: 3.365927 (3.461064) time: 0.994944 data: 0.000182 max mem: 18817 Epoch: [84/300] [1100/1251] eta: 0:02:25 lr: 0.001633 loss: 3.560741 (3.458137) time: 0.968618 data: 0.000183 max mem: 18817 Epoch: [84/300] [1150/1251] eta: 0:01:37 lr: 0.001633 loss: 3.572579 (3.456469) time: 0.941686 data: 0.000169 max mem: 18817 Epoch: [84/300] [1200/1251] eta: 0:00:49 lr: 0.001633 loss: 3.253924 (3.459059) time: 0.922439 data: 0.000175 max mem: 18817 Epoch: [84/300] [1250/1251] eta: 0:00:00 lr: 0.001632 loss: 3.686955 (3.456654) time: 0.929179 data: 0.000758 max mem: 18817 Epoch: [84/300] Total time: 0:20:03 (0.962384 s / it) Averaged stats: lr: 0.001632 loss: 3.686955 (3.454451) Test: [ 0/49] eta: 0:01:17 loss: 0.774474 (0.774474) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.578708 data: 1.136890 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.797736 (0.869412) acc1: 81.250000 (80.255682) acc5: 96.875000 (94.176136) time: 0.484126 data: 0.103502 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.887000 (0.912994) acc1: 78.125000 (78.943452) acc5: 95.312500 (94.122024) time: 0.368356 data: 0.000149 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.937820 (0.917491) acc1: 78.125000 (78.931452) acc5: 95.312500 (94.405242) time: 0.361957 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.960123 (0.929809) acc1: 78.125000 (78.544207) acc5: 93.750000 (94.359756) time: 0.359984 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.974468 (0.929454) acc1: 78.125000 (78.528000) acc5: 93.750000 (94.432000) time: 0.355299 data: 0.000099 max mem: 18817 Test: Total time: 0:00:19 (0.389297 s / it) * Acc@1 78.012 Acc@5 94.420 loss 0.927 Max accuracy: 78.16% Epoch: [85/300] [ 0/1251] eta: 0:42:37 lr: 0.001632 loss: 2.113737 (2.113737) time: 2.044185 data: 1.118730 max mem: 18817 Epoch: [85/300] [ 50/1251] eta: 0:19:18 lr: 0.001632 loss: 3.451048 (3.450928) time: 0.954664 data: 0.000183 max mem: 18817 Epoch: [85/300] [ 100/1251] eta: 0:18:14 lr: 0.001632 loss: 3.484614 (3.428244) time: 0.925356 data: 0.000180 max mem: 18817 Epoch: [85/300] [ 150/1251] eta: 0:17:33 lr: 0.001632 loss: 3.189394 (3.416643) time: 0.935016 data: 0.000169 max mem: 18817 Epoch: [85/300] [ 200/1251] eta: 0:16:51 lr: 0.001631 loss: 3.808839 (3.424078) time: 1.000372 data: 0.000165 max mem: 18817 Epoch: [85/300] [ 250/1251] eta: 0:16:06 lr: 0.001631 loss: 3.465602 (3.411758) time: 1.047055 data: 0.000179 max mem: 18817 Epoch: [85/300] [ 300/1251] eta: 0:15:15 lr: 0.001631 loss: 3.569755 (3.427940) time: 0.975740 data: 0.000164 max mem: 18817 Epoch: [85/300] [ 350/1251] eta: 0:14:25 lr: 0.001630 loss: 3.635989 (3.433726) time: 0.922776 data: 0.000161 max mem: 18817 Epoch: [85/300] [ 400/1251] eta: 0:13:38 lr: 0.001630 loss: 3.335442 (3.429918) time: 0.912391 data: 0.000188 max mem: 18817 Epoch: [85/300] [ 450/1251] eta: 0:12:50 lr: 0.001630 loss: 3.490598 (3.428804) time: 0.972365 data: 0.000181 max mem: 18817 Epoch: [85/300] [ 500/1251] eta: 0:12:01 lr: 0.001629 loss: 3.591168 (3.423640) time: 1.011660 data: 0.000175 max mem: 18817 Epoch: [85/300] [ 550/1251] eta: 0:11:13 lr: 0.001629 loss: 3.430491 (3.421728) time: 0.982704 data: 0.000189 max mem: 18817 Epoch: [85/300] [ 600/1251] eta: 0:10:25 lr: 0.001629 loss: 3.246667 (3.428668) time: 0.936209 data: 0.000165 max mem: 18817 Epoch: [85/300] [ 650/1251] eta: 0:09:37 lr: 0.001628 loss: 3.540163 (3.434253) time: 0.927006 data: 0.000173 max mem: 18817 Epoch: [85/300] [ 700/1251] eta: 0:08:50 lr: 0.001628 loss: 3.548549 (3.434142) time: 1.024064 data: 0.000173 max mem: 18817 Epoch: [85/300] [ 750/1251] eta: 0:08:02 lr: 0.001628 loss: 3.496239 (3.431669) time: 1.017873 data: 0.000170 max mem: 18817 Epoch: [85/300] [ 800/1251] eta: 0:07:14 lr: 0.001627 loss: 3.375632 (3.425393) time: 1.001609 data: 0.000164 max mem: 18817 Epoch: [85/300] [ 850/1251] eta: 0:06:25 lr: 0.001627 loss: 3.201525 (3.420889) time: 0.923390 data: 0.000168 max mem: 18817 Epoch: [85/300] [ 900/1251] eta: 0:05:37 lr: 0.001627 loss: 3.757574 (3.423818) time: 0.935510 data: 0.000195 max mem: 18817 Epoch: [85/300] [ 950/1251] eta: 0:04:49 lr: 0.001626 loss: 3.432808 (3.425480) time: 0.976580 data: 0.000167 max mem: 18817 Epoch: [85/300] [1000/1251] eta: 0:04:01 lr: 0.001626 loss: 3.505165 (3.424159) time: 1.042843 data: 0.000159 max mem: 18817 Epoch: [85/300] [1050/1251] eta: 0:03:13 lr: 0.001626 loss: 3.463250 (3.423615) time: 0.970553 data: 0.000183 max mem: 18817 Epoch: [85/300] [1100/1251] eta: 0:02:25 lr: 0.001625 loss: 3.468944 (3.422806) time: 0.915810 data: 0.000160 max mem: 18817 Epoch: [85/300] [1150/1251] eta: 0:01:37 lr: 0.001625 loss: 3.665761 (3.430405) time: 0.930654 data: 0.000191 max mem: 18817 Epoch: [85/300] [1200/1251] eta: 0:00:49 lr: 0.001625 loss: 3.601511 (3.428918) time: 0.997406 data: 0.000180 max mem: 18817 Epoch: [85/300] [1250/1251] eta: 0:00:00 lr: 0.001624 loss: 3.532196 (3.432741) time: 0.992148 data: 0.000727 max mem: 18817 Epoch: [85/300] Total time: 0:20:05 (0.963395 s / it) Averaged stats: lr: 0.001624 loss: 3.532196 (3.437036) Test: [ 0/49] eta: 0:01:27 loss: 0.734173 (0.734173) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.789186 data: 1.393597 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.819053 (0.838608) acc1: 81.250000 (80.113636) acc5: 95.312500 (94.602273) time: 0.526219 data: 0.126831 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.866784 (0.879410) acc1: 78.125000 (78.720238) acc5: 93.750000 (94.494048) time: 0.381525 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.924826 (0.885969) acc1: 78.125000 (78.578629) acc5: 93.750000 (94.758065) time: 0.368483 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.924826 (0.906137) acc1: 78.125000 (78.277439) acc5: 95.312500 (94.702744) time: 0.376960 data: 0.000150 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.991140 (0.903232) acc1: 76.562500 (78.432000) acc5: 95.312500 (94.880000) time: 0.367230 data: 0.000127 max mem: 18817 Test: Total time: 0:00:19 (0.406111 s / it) * Acc@1 78.036 Acc@5 94.420 loss 0.911 Max accuracy: 78.16% Epoch: [86/300] [ 0/1251] eta: 0:43:08 lr: 0.001624 loss: 3.732497 (3.732497) time: 2.068989 data: 1.165546 max mem: 18817 Epoch: [86/300] [ 50/1251] eta: 0:19:27 lr: 0.001624 loss: 3.526587 (3.550531) time: 0.920907 data: 0.000176 max mem: 18817 Epoch: [86/300] [ 100/1251] eta: 0:18:39 lr: 0.001624 loss: 3.548381 (3.510964) time: 0.939616 data: 0.000185 max mem: 18817 Epoch: [86/300] [ 150/1251] eta: 0:17:51 lr: 0.001623 loss: 3.439938 (3.471827) time: 0.999242 data: 0.000182 max mem: 18817 Epoch: [86/300] [ 200/1251] eta: 0:17:00 lr: 0.001623 loss: 3.472775 (3.468612) time: 1.023079 data: 0.000170 max mem: 18817 Epoch: [86/300] [ 250/1251] eta: 0:16:07 lr: 0.001623 loss: 3.639243 (3.503095) time: 0.986334 data: 0.000169 max mem: 18817 Epoch: [86/300] [ 300/1251] eta: 0:15:17 lr: 0.001622 loss: 3.662212 (3.517972) time: 0.924282 data: 0.000168 max mem: 18817 Epoch: [86/300] [ 350/1251] eta: 0:14:29 lr: 0.001622 loss: 3.490123 (3.505072) time: 0.928117 data: 0.000172 max mem: 18817 Epoch: [86/300] [ 400/1251] eta: 0:13:41 lr: 0.001622 loss: 3.702458 (3.495775) time: 0.991789 data: 0.000162 max mem: 18817 Epoch: [86/300] [ 450/1251] eta: 0:12:53 lr: 0.001621 loss: 3.470131 (3.488005) time: 1.031389 data: 0.000199 max mem: 18817 Epoch: [86/300] [ 500/1251] eta: 0:12:04 lr: 0.001621 loss: 3.606690 (3.477584) time: 0.998141 data: 0.000188 max mem: 18817 Epoch: [86/300] [ 550/1251] eta: 0:11:15 lr: 0.001621 loss: 3.524574 (3.472222) time: 0.918809 data: 0.000180 max mem: 18817 Epoch: [86/300] [ 600/1251] eta: 0:10:27 lr: 0.001620 loss: 3.217103 (3.466345) time: 0.926217 data: 0.000172 max mem: 18817 Epoch: [86/300] [ 650/1251] eta: 0:09:40 lr: 0.001620 loss: 3.633129 (3.475985) time: 0.985141 data: 0.000177 max mem: 18817 Epoch: [86/300] [ 700/1251] eta: 0:08:51 lr: 0.001620 loss: 3.490834 (3.466906) time: 1.002169 data: 0.000173 max mem: 18817 Epoch: [86/300] [ 750/1251] eta: 0:08:03 lr: 0.001620 loss: 3.451627 (3.463452) time: 0.979176 data: 0.000174 max mem: 18817 Epoch: [86/300] [ 800/1251] eta: 0:07:14 lr: 0.001619 loss: 3.640106 (3.465213) time: 0.917046 data: 0.000166 max mem: 18817 Epoch: [86/300] [ 850/1251] eta: 0:06:26 lr: 0.001619 loss: 3.492531 (3.468716) time: 0.921253 data: 0.000188 max mem: 18817 Epoch: [86/300] [ 900/1251] eta: 0:05:38 lr: 0.001619 loss: 3.207990 (3.465598) time: 0.992220 data: 0.000182 max mem: 18817 Epoch: [86/300] [ 950/1251] eta: 0:04:50 lr: 0.001618 loss: 3.488880 (3.463929) time: 1.036787 data: 0.000182 max mem: 18817 Epoch: [86/300] [1000/1251] eta: 0:04:01 lr: 0.001618 loss: 2.964154 (3.454199) time: 0.973607 data: 0.000178 max mem: 18817 Epoch: [86/300] [1050/1251] eta: 0:03:13 lr: 0.001618 loss: 3.375406 (3.449888) time: 0.912008 data: 0.000188 max mem: 18817 Epoch: [86/300] [1100/1251] eta: 0:02:25 lr: 0.001617 loss: 3.523444 (3.451647) time: 0.949569 data: 0.000182 max mem: 18817 Epoch: [86/300] [1150/1251] eta: 0:01:37 lr: 0.001617 loss: 3.730819 (3.456971) time: 1.000305 data: 0.000179 max mem: 18817 Epoch: [86/300] [1200/1251] eta: 0:00:49 lr: 0.001617 loss: 3.427796 (3.450697) time: 0.987035 data: 0.000176 max mem: 18817 Epoch: [86/300] [1250/1251] eta: 0:00:00 lr: 0.001616 loss: 3.568127 (3.452133) time: 0.955349 data: 0.000389 max mem: 18817 Epoch: [86/300] Total time: 0:20:04 (0.963147 s / it) Averaged stats: lr: 0.001616 loss: 3.568127 (3.452384) Test: [ 0/49] eta: 0:01:25 loss: 0.762339 (0.762339) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.743013 data: 1.322095 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.781261 (0.855467) acc1: 81.250000 (78.977273) acc5: 95.312500 (94.034091) time: 0.493978 data: 0.120339 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.914889 (0.902916) acc1: 76.562500 (78.794643) acc5: 93.750000 (94.270833) time: 0.364865 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.914889 (0.895145) acc1: 78.125000 (78.931452) acc5: 95.312500 (94.455645) time: 0.373195 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.912986 (0.912917) acc1: 78.125000 (78.772866) acc5: 93.750000 (94.283537) time: 0.384290 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.945560 (0.908626) acc1: 78.125000 (79.040000) acc5: 93.750000 (94.400000) time: 0.382344 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.407933 s / it) * Acc@1 78.152 Acc@5 94.450 loss 0.917 Max accuracy: 78.16% Epoch: [87/300] [ 0/1251] eta: 0:42:30 lr: 0.001616 loss: 3.316935 (3.316935) time: 2.038903 data: 1.132760 max mem: 18817 Epoch: [87/300] [ 50/1251] eta: 0:19:50 lr: 0.001616 loss: 3.541370 (3.380434) time: 0.930288 data: 0.000182 max mem: 18817 Epoch: [87/300] [ 100/1251] eta: 0:18:49 lr: 0.001616 loss: 3.494963 (3.418086) time: 0.979830 data: 0.000168 max mem: 18817 Epoch: [87/300] [ 150/1251] eta: 0:17:48 lr: 0.001615 loss: 3.397461 (3.353013) time: 0.999437 data: 0.000176 max mem: 18817 Epoch: [87/300] [ 200/1251] eta: 0:16:55 lr: 0.001615 loss: 3.224772 (3.344753) time: 0.967311 data: 0.000171 max mem: 18817 Epoch: [87/300] [ 250/1251] eta: 0:16:04 lr: 0.001615 loss: 3.570583 (3.365557) time: 0.923477 data: 0.000170 max mem: 18817 Epoch: [87/300] [ 300/1251] eta: 0:15:15 lr: 0.001614 loss: 3.563517 (3.369417) time: 0.932238 data: 0.000165 max mem: 18817 Epoch: [87/300] [ 350/1251] eta: 0:14:26 lr: 0.001614 loss: 3.392401 (3.366236) time: 0.979024 data: 0.000172 max mem: 18817 Epoch: [87/300] [ 400/1251] eta: 0:13:38 lr: 0.001614 loss: 3.335607 (3.376558) time: 1.006878 data: 0.000169 max mem: 18817 Epoch: [87/300] [ 450/1251] eta: 0:12:49 lr: 0.001613 loss: 3.506196 (3.374558) time: 0.961904 data: 0.000180 max mem: 18817 Epoch: [87/300] [ 500/1251] eta: 0:12:00 lr: 0.001613 loss: 3.598300 (3.385459) time: 0.919500 data: 0.000192 max mem: 18817 Epoch: [87/300] [ 550/1251] eta: 0:11:14 lr: 0.001613 loss: 3.349200 (3.389803) time: 0.941444 data: 0.000181 max mem: 18817 Epoch: [87/300] [ 600/1251] eta: 0:10:26 lr: 0.001612 loss: 3.522974 (3.397197) time: 1.006389 data: 0.000174 max mem: 18817 Epoch: [87/300] [ 650/1251] eta: 0:09:39 lr: 0.001612 loss: 3.680430 (3.403103) time: 1.040056 data: 0.000175 max mem: 18817 Epoch: [87/300] [ 700/1251] eta: 0:08:50 lr: 0.001612 loss: 3.539970 (3.400552) time: 0.972058 data: 0.000156 max mem: 18817 Epoch: [87/300] [ 750/1251] eta: 0:08:01 lr: 0.001611 loss: 3.223931 (3.400164) time: 0.924466 data: 0.000176 max mem: 18817 Epoch: [87/300] [ 800/1251] eta: 0:07:13 lr: 0.001611 loss: 3.168230 (3.400558) time: 0.935574 data: 0.000174 max mem: 18817 Epoch: [87/300] [ 850/1251] eta: 0:06:25 lr: 0.001611 loss: 3.559072 (3.395335) time: 0.988183 data: 0.000166 max mem: 18817 Epoch: [87/300] [ 900/1251] eta: 0:05:37 lr: 0.001610 loss: 3.476184 (3.397863) time: 1.038633 data: 0.000175 max mem: 18817 Epoch: [87/300] [ 950/1251] eta: 0:04:49 lr: 0.001610 loss: 3.631035 (3.401364) time: 0.978185 data: 0.000179 max mem: 18817 Epoch: [87/300] [1000/1251] eta: 0:04:01 lr: 0.001610 loss: 3.513041 (3.401827) time: 0.918834 data: 0.000159 max mem: 18817 Epoch: [87/300] [1050/1251] eta: 0:03:13 lr: 0.001609 loss: 3.502938 (3.407712) time: 0.914361 data: 0.000179 max mem: 18817 Epoch: [87/300] [1100/1251] eta: 0:02:25 lr: 0.001609 loss: 3.683369 (3.409108) time: 0.976875 data: 0.000188 max mem: 18817 Epoch: [87/300] [1150/1251] eta: 0:01:37 lr: 0.001609 loss: 3.470373 (3.411567) time: 1.040048 data: 0.000168 max mem: 18817 Epoch: [87/300] [1200/1251] eta: 0:00:48 lr: 0.001608 loss: 3.477656 (3.411372) time: 0.965217 data: 0.000167 max mem: 18817 Epoch: [87/300] [1250/1251] eta: 0:00:00 lr: 0.001608 loss: 3.776501 (3.413357) time: 0.919742 data: 0.000733 max mem: 18817 Epoch: [87/300] Total time: 0:20:01 (0.960558 s / it) Averaged stats: lr: 0.001608 loss: 3.776501 (3.420831) Test: [ 0/49] eta: 0:01:15 loss: 0.796423 (0.796423) acc1: 79.687500 (79.687500) acc5: 95.312500 (95.312500) time: 1.534341 data: 1.114735 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.796423 (0.827969) acc1: 78.125000 (78.551136) acc5: 95.312500 (95.170455) time: 0.476758 data: 0.101479 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.845120 (0.879677) acc1: 76.562500 (77.380952) acc5: 93.750000 (94.642857) time: 0.473181 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.881528 (0.875908) acc1: 76.562500 (77.217742) acc5: 95.312500 (94.657258) time: 0.469217 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.881865 (0.891566) acc1: 76.562500 (77.210366) acc5: 95.312500 (94.435976) time: 0.360832 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.907254 (0.887971) acc1: 75.000000 (77.536000) acc5: 95.312500 (94.496000) time: 0.355366 data: 0.000102 max mem: 18817 Test: Total time: 0:00:21 (0.431086 s / it) * Acc@1 78.142 Acc@5 94.462 loss 0.882 Max accuracy: 78.16% Epoch: [88/300] [ 0/1251] eta: 0:42:18 lr: 0.001608 loss: 3.404603 (3.404603) time: 2.029099 data: 1.127096 max mem: 18817 Epoch: [88/300] [ 50/1251] eta: 0:19:34 lr: 0.001608 loss: 3.689523 (3.385926) time: 0.964663 data: 0.000170 max mem: 18817 Epoch: [88/300] [ 100/1251] eta: 0:18:31 lr: 0.001607 loss: 3.557411 (3.381545) time: 1.006088 data: 0.000159 max mem: 18817 Epoch: [88/300] [ 150/1251] eta: 0:17:40 lr: 0.001607 loss: 3.593165 (3.415418) time: 0.979020 data: 0.000174 max mem: 18817 Epoch: [88/300] [ 200/1251] eta: 0:16:48 lr: 0.001607 loss: 3.645911 (3.404234) time: 0.925172 data: 0.000181 max mem: 18817 Epoch: [88/300] [ 250/1251] eta: 0:16:03 lr: 0.001606 loss: 3.482111 (3.405019) time: 0.924617 data: 0.000166 max mem: 18817 Epoch: [88/300] [ 300/1251] eta: 0:15:16 lr: 0.001606 loss: 3.157715 (3.407132) time: 0.982260 data: 0.000163 max mem: 18817 Epoch: [88/300] [ 350/1251] eta: 0:14:29 lr: 0.001606 loss: 3.551968 (3.397012) time: 1.047235 data: 0.000168 max mem: 18817 Epoch: [88/300] [ 400/1251] eta: 0:13:39 lr: 0.001605 loss: 3.202529 (3.395683) time: 0.965742 data: 0.000169 max mem: 18817 Epoch: [88/300] [ 450/1251] eta: 0:12:49 lr: 0.001605 loss: 3.745012 (3.413634) time: 0.932304 data: 0.000189 max mem: 18817 Epoch: [88/300] [ 500/1251] eta: 0:12:02 lr: 0.001605 loss: 3.554569 (3.409955) time: 0.927409 data: 0.000167 max mem: 18817 Epoch: [88/300] [ 550/1251] eta: 0:11:14 lr: 0.001604 loss: 3.448311 (3.407287) time: 0.991741 data: 0.000174 max mem: 18817 Epoch: [88/300] [ 600/1251] eta: 0:10:26 lr: 0.001604 loss: 3.605633 (3.411279) time: 1.027695 data: 0.000182 max mem: 18817 Epoch: [88/300] [ 650/1251] eta: 0:09:36 lr: 0.001604 loss: 3.425557 (3.401703) time: 0.952275 data: 0.000162 max mem: 18817 Epoch: [88/300] [ 700/1251] eta: 0:08:48 lr: 0.001603 loss: 3.622424 (3.406422) time: 0.934163 data: 0.000164 max mem: 18817 Epoch: [88/300] [ 750/1251] eta: 0:08:01 lr: 0.001603 loss: 3.014769 (3.398648) time: 0.944585 data: 0.000176 max mem: 18817 Epoch: [88/300] [ 800/1251] eta: 0:07:13 lr: 0.001603 loss: 3.504643 (3.405339) time: 1.001196 data: 0.000185 max mem: 18817 Epoch: [88/300] [ 850/1251] eta: 0:06:25 lr: 0.001602 loss: 3.345813 (3.407874) time: 1.011135 data: 0.000172 max mem: 18817 Epoch: [88/300] [ 900/1251] eta: 0:05:37 lr: 0.001602 loss: 3.521311 (3.413812) time: 0.983815 data: 0.000175 max mem: 18817 Epoch: [88/300] [ 950/1251] eta: 0:04:49 lr: 0.001602 loss: 3.584008 (3.419572) time: 0.921444 data: 0.000177 max mem: 18817 Epoch: [88/300] [1000/1251] eta: 0:04:01 lr: 0.001601 loss: 3.711806 (3.427631) time: 0.922542 data: 0.000175 max mem: 18817 Epoch: [88/300] [1050/1251] eta: 0:03:13 lr: 0.001601 loss: 3.487628 (3.430915) time: 0.986068 data: 0.000165 max mem: 18817 Epoch: [88/300] [1100/1251] eta: 0:02:25 lr: 0.001601 loss: 3.370416 (3.433977) time: 1.034296 data: 0.000176 max mem: 18817 Epoch: [88/300] [1150/1251] eta: 0:01:37 lr: 0.001600 loss: 3.708127 (3.434347) time: 0.985382 data: 0.000157 max mem: 18817 Epoch: [88/300] [1200/1251] eta: 0:00:49 lr: 0.001600 loss: 3.260753 (3.430575) time: 0.927858 data: 0.000185 max mem: 18817 Epoch: [88/300] [1250/1251] eta: 0:00:00 lr: 0.001600 loss: 3.422411 (3.423594) time: 0.925075 data: 0.000731 max mem: 18817 Epoch: [88/300] Total time: 0:20:03 (0.962008 s / it) Averaged stats: lr: 0.001600 loss: 3.422411 (3.426267) Test: [ 0/49] eta: 0:01:21 loss: 0.696226 (0.696226) acc1: 79.687500 (79.687500) acc5: 96.875000 (96.875000) time: 1.660084 data: 1.258890 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.787506 (0.836560) acc1: 78.125000 (78.835227) acc5: 95.312500 (95.170455) time: 0.485339 data: 0.114603 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.862565 (0.879294) acc1: 76.562500 (78.199405) acc5: 95.312500 (94.940476) time: 0.364330 data: 0.000152 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.905416 (0.889925) acc1: 76.562500 (78.074597) acc5: 95.312500 (95.110887) time: 0.362141 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.898895 (0.901314) acc1: 78.125000 (78.163110) acc5: 95.312500 (94.931402) time: 0.364592 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.908396 (0.907708) acc1: 78.125000 (78.112000) acc5: 95.312500 (94.880000) time: 0.441894 data: 0.000102 max mem: 18817 Test: Total time: 0:00:20 (0.426831 s / it) * Acc@1 78.094 Acc@5 94.562 loss 0.918 Max accuracy: 78.16% Epoch: [89/300] [ 0/1251] eta: 0:41:22 lr: 0.001600 loss: 3.513551 (3.513551) time: 1.984304 data: 1.008982 max mem: 18817 Epoch: [89/300] [ 50/1251] eta: 0:19:42 lr: 0.001599 loss: 3.594247 (3.399177) time: 1.019053 data: 0.000170 max mem: 18817 Epoch: [89/300] [ 100/1251] eta: 0:18:41 lr: 0.001599 loss: 3.351746 (3.334315) time: 0.987403 data: 0.000164 max mem: 18817 Epoch: [89/300] [ 150/1251] eta: 0:17:44 lr: 0.001599 loss: 3.490469 (3.367163) time: 0.928629 data: 0.000161 max mem: 18817 Epoch: [89/300] [ 200/1251] eta: 0:16:56 lr: 0.001598 loss: 3.489963 (3.398356) time: 0.927417 data: 0.000175 max mem: 18817 Epoch: [89/300] [ 250/1251] eta: 0:16:10 lr: 0.001598 loss: 3.525980 (3.415402) time: 0.995512 data: 0.000170 max mem: 18817 Epoch: [89/300] [ 300/1251] eta: 0:15:23 lr: 0.001598 loss: 3.698834 (3.424514) time: 1.034899 data: 0.000194 max mem: 18817 Epoch: [89/300] [ 350/1251] eta: 0:14:31 lr: 0.001597 loss: 3.483518 (3.427287) time: 0.970852 data: 0.000179 max mem: 18817 Epoch: [89/300] [ 400/1251] eta: 0:13:41 lr: 0.001597 loss: 3.313638 (3.415229) time: 0.922767 data: 0.000171 max mem: 18817 Epoch: [89/300] [ 450/1251] eta: 0:12:54 lr: 0.001597 loss: 3.243563 (3.407267) time: 0.938760 data: 0.000169 max mem: 18817 Epoch: [89/300] [ 500/1251] eta: 0:12:06 lr: 0.001596 loss: 3.597254 (3.409013) time: 0.998674 data: 0.000163 max mem: 18817 Epoch: [89/300] [ 550/1251] eta: 0:11:18 lr: 0.001596 loss: 3.531953 (3.404499) time: 1.045265 data: 0.000159 max mem: 18817 Epoch: [89/300] [ 600/1251] eta: 0:10:29 lr: 0.001596 loss: 3.592427 (3.408599) time: 0.971280 data: 0.000158 max mem: 18817 Epoch: [89/300] [ 650/1251] eta: 0:09:40 lr: 0.001595 loss: 3.387309 (3.406895) time: 0.920898 data: 0.000169 max mem: 18817 Epoch: [89/300] [ 700/1251] eta: 0:08:52 lr: 0.001595 loss: 3.686056 (3.423029) time: 0.916026 data: 0.000173 max mem: 18817 Epoch: [89/300] [ 750/1251] eta: 0:08:04 lr: 0.001595 loss: 3.098075 (3.420889) time: 1.004530 data: 0.000158 max mem: 18817 Epoch: [89/300] [ 800/1251] eta: 0:07:16 lr: 0.001594 loss: 3.147526 (3.427337) time: 1.041422 data: 0.000172 max mem: 18817 Epoch: [89/300] [ 850/1251] eta: 0:06:27 lr: 0.001594 loss: 3.483006 (3.430940) time: 0.962019 data: 0.000162 max mem: 18817 Epoch: [89/300] [ 900/1251] eta: 0:05:38 lr: 0.001594 loss: 3.601005 (3.430634) time: 0.919920 data: 0.000168 max mem: 18817 Epoch: [89/300] [ 950/1251] eta: 0:04:50 lr: 0.001593 loss: 3.517858 (3.437697) time: 0.918905 data: 0.000183 max mem: 18817 Epoch: [89/300] [1000/1251] eta: 0:04:02 lr: 0.001593 loss: 3.330770 (3.432580) time: 0.984554 data: 0.000179 max mem: 18817 Epoch: [89/300] [1050/1251] eta: 0:03:14 lr: 0.001593 loss: 3.452742 (3.433007) time: 0.991991 data: 0.000211 max mem: 18817 Epoch: [89/300] [1100/1251] eta: 0:02:25 lr: 0.001592 loss: 3.451904 (3.434230) time: 0.971297 data: 0.000168 max mem: 18817 Epoch: [89/300] [1150/1251] eta: 0:01:37 lr: 0.001592 loss: 3.100581 (3.427986) time: 0.921538 data: 0.000168 max mem: 18817 Epoch: [89/300] [1200/1251] eta: 0:00:49 lr: 0.001592 loss: 3.326248 (3.425696) time: 0.924048 data: 0.000179 max mem: 18817 Epoch: [89/300] [1250/1251] eta: 0:00:00 lr: 0.001591 loss: 3.323694 (3.421181) time: 0.966152 data: 0.000743 max mem: 18817 Epoch: [89/300] Total time: 0:20:07 (0.964902 s / it) Averaged stats: lr: 0.001591 loss: 3.323694 (3.411539) Test: [ 0/49] eta: 0:01:19 loss: 0.826132 (0.826132) acc1: 76.562500 (76.562500) acc5: 92.187500 (92.187500) time: 1.621498 data: 1.186530 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.826132 (0.843933) acc1: 78.125000 (78.835227) acc5: 95.312500 (94.176136) time: 0.490268 data: 0.108034 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.912913 (0.883752) acc1: 78.125000 (78.199405) acc5: 93.750000 (93.973214) time: 0.377805 data: 0.000155 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.928259 (0.886517) acc1: 78.125000 (78.225806) acc5: 95.312500 (94.354839) time: 0.371641 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.900860 (0.899659) acc1: 78.125000 (78.163110) acc5: 95.312500 (94.169207) time: 0.361313 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 1.008409 (0.898956) acc1: 75.000000 (78.176000) acc5: 93.750000 (94.176000) time: 0.355240 data: 0.000098 max mem: 18817 Test: Total time: 0:00:19 (0.394648 s / it) * Acc@1 77.816 Acc@5 94.350 loss 0.911 Max accuracy: 78.16% Epoch: [90/300] [ 0/1251] eta: 0:42:06 lr: 0.001591 loss: 4.135770 (4.135770) time: 2.019944 data: 1.116619 max mem: 18817 Epoch: [90/300] [ 50/1251] eta: 0:19:46 lr: 0.001591 loss: 3.373470 (3.499881) time: 0.969850 data: 0.000161 max mem: 18817 Epoch: [90/300] [ 100/1251] eta: 0:18:34 lr: 0.001591 loss: 3.537185 (3.457697) time: 0.932246 data: 0.000165 max mem: 18817 Epoch: [90/300] [ 150/1251] eta: 0:17:41 lr: 0.001590 loss: 3.289062 (3.388892) time: 0.924643 data: 0.000195 max mem: 18817 Epoch: [90/300] [ 200/1251] eta: 0:16:56 lr: 0.001590 loss: 3.174547 (3.390869) time: 0.994455 data: 0.000157 max mem: 18817 Epoch: [90/300] [ 250/1251] eta: 0:16:01 lr: 0.001590 loss: 3.466392 (3.389897) time: 0.962526 data: 0.000159 max mem: 18817 Epoch: [90/300] [ 300/1251] eta: 0:15:09 lr: 0.001589 loss: 3.789823 (3.405197) time: 0.910947 data: 0.000162 max mem: 18817 Epoch: [90/300] [ 350/1251] eta: 0:14:22 lr: 0.001589 loss: 3.411885 (3.409595) time: 0.925525 data: 0.000161 max mem: 18817 Epoch: [90/300] [ 400/1251] eta: 0:13:35 lr: 0.001589 loss: 3.457081 (3.411940) time: 0.930841 data: 0.000184 max mem: 18817 Epoch: [90/300] [ 450/1251] eta: 0:12:48 lr: 0.001588 loss: 3.646298 (3.424756) time: 1.003208 data: 0.000167 max mem: 18817 Epoch: [90/300] [ 500/1251] eta: 0:11:59 lr: 0.001588 loss: 3.192083 (3.417725) time: 0.985227 data: 0.000185 max mem: 18817 Epoch: [90/300] [ 550/1251] eta: 0:11:11 lr: 0.001588 loss: 3.458752 (3.415520) time: 0.926614 data: 0.000165 max mem: 18817 Epoch: [90/300] [ 600/1251] eta: 0:10:24 lr: 0.001587 loss: 3.535750 (3.413135) time: 0.931007 data: 0.000166 max mem: 18817 Epoch: [90/300] [ 650/1251] eta: 0:09:37 lr: 0.001587 loss: 3.468052 (3.416310) time: 0.953199 data: 0.000160 max mem: 18817 Epoch: [90/300] [ 700/1251] eta: 0:08:49 lr: 0.001587 loss: 3.466584 (3.415954) time: 0.993049 data: 0.000159 max mem: 18817 Epoch: [90/300] [ 750/1251] eta: 0:08:00 lr: 0.001586 loss: 3.595131 (3.421220) time: 0.960070 data: 0.000189 max mem: 18817 Epoch: [90/300] [ 800/1251] eta: 0:07:12 lr: 0.001586 loss: 3.461170 (3.416810) time: 0.915032 data: 0.000177 max mem: 18817 Epoch: [90/300] [ 850/1251] eta: 0:06:25 lr: 0.001586 loss: 3.396750 (3.418228) time: 0.936886 data: 0.000163 max mem: 18817 Epoch: [90/300] [ 900/1251] eta: 0:05:37 lr: 0.001585 loss: 3.377194 (3.417563) time: 0.919522 data: 0.000185 max mem: 18817 Epoch: [90/300] [ 950/1251] eta: 0:04:49 lr: 0.001585 loss: 3.491546 (3.416280) time: 0.986968 data: 0.000173 max mem: 18817 Epoch: [90/300] [1000/1251] eta: 0:04:01 lr: 0.001585 loss: 3.522201 (3.414233) time: 0.974356 data: 0.000156 max mem: 18817 Epoch: [90/300] [1050/1251] eta: 0:03:13 lr: 0.001584 loss: 3.702454 (3.412802) time: 0.957462 data: 0.000174 max mem: 18817 Epoch: [90/300] [1100/1251] eta: 0:02:25 lr: 0.001584 loss: 3.622976 (3.413433) time: 0.931505 data: 0.000183 max mem: 18817 Epoch: [90/300] [1150/1251] eta: 0:01:37 lr: 0.001584 loss: 3.521671 (3.408012) time: 0.930916 data: 0.000197 max mem: 18817 Epoch: [90/300] [1200/1251] eta: 0:00:49 lr: 0.001583 loss: 3.539454 (3.407633) time: 0.982956 data: 0.000176 max mem: 18817 Epoch: [90/300] [1250/1251] eta: 0:00:00 lr: 0.001583 loss: 3.807976 (3.410111) time: 0.982621 data: 0.000750 max mem: 18817 Epoch: [90/300] Total time: 0:20:02 (0.960854 s / it) Averaged stats: lr: 0.001583 loss: 3.807976 (3.405696) Test: [ 0/49] eta: 0:01:17 loss: 0.787658 (0.787658) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.577284 data: 1.169905 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.787658 (0.820392) acc1: 81.250000 (81.392045) acc5: 95.312500 (94.886364) time: 0.478885 data: 0.106509 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.897214 (0.859186) acc1: 78.125000 (79.985119) acc5: 95.312500 (94.791667) time: 0.388839 data: 0.000167 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.910321 (0.874995) acc1: 78.125000 (79.536290) acc5: 95.312500 (94.959677) time: 0.398155 data: 0.000171 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.903238 (0.892248) acc1: 79.687500 (79.382622) acc5: 95.312500 (94.740854) time: 0.375398 data: 0.000159 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.904611 (0.890939) acc1: 78.125000 (79.392000) acc5: 93.750000 (94.752000) time: 0.366780 data: 0.000121 max mem: 18817 Test: Total time: 0:00:19 (0.407907 s / it) * Acc@1 78.344 Acc@5 94.570 loss 0.907 Max accuracy: 78.34% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0090.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0090.pth Epoch: [91/300] [ 0/1251] eta: 0:43:04 lr: 0.001583 loss: 3.260044 (3.260044) time: 2.065651 data: 1.116296 max mem: 18817 Epoch: [91/300] [ 50/1251] eta: 0:19:31 lr: 0.001583 loss: 3.572891 (3.408485) time: 0.969460 data: 0.000157 max mem: 18817 Epoch: [91/300] [ 100/1251] eta: 0:18:24 lr: 0.001582 loss: 3.318304 (3.440377) time: 0.927565 data: 0.000162 max mem: 18817 Epoch: [91/300] [ 150/1251] eta: 0:17:42 lr: 0.001582 loss: 3.429511 (3.416913) time: 0.940762 data: 0.000165 max mem: 18817 Epoch: [91/300] [ 200/1251] eta: 0:16:54 lr: 0.001582 loss: 3.364851 (3.433430) time: 0.998625 data: 0.000169 max mem: 18817 Epoch: [91/300] [ 250/1251] eta: 0:16:10 lr: 0.001581 loss: 3.554966 (3.421931) time: 1.063154 data: 0.000165 max mem: 18817 Epoch: [91/300] [ 300/1251] eta: 0:15:18 lr: 0.001581 loss: 3.555800 (3.424820) time: 0.985281 data: 0.000159 max mem: 18817 Epoch: [91/300] [ 350/1251] eta: 0:14:29 lr: 0.001581 loss: 3.298237 (3.438506) time: 0.937721 data: 0.000146 max mem: 18817 Epoch: [91/300] [ 400/1251] eta: 0:13:40 lr: 0.001580 loss: 3.724195 (3.443913) time: 0.917532 data: 0.000183 max mem: 18817 Epoch: [91/300] [ 450/1251] eta: 0:12:52 lr: 0.001580 loss: 3.292639 (3.426161) time: 0.989497 data: 0.000167 max mem: 18817 Epoch: [91/300] [ 500/1251] eta: 0:12:04 lr: 0.001579 loss: 3.351392 (3.431415) time: 1.029433 data: 0.000173 max mem: 18817 Epoch: [91/300] [ 550/1251] eta: 0:11:15 lr: 0.001579 loss: 3.607321 (3.437027) time: 0.981935 data: 0.000173 max mem: 18817 Epoch: [91/300] [ 600/1251] eta: 0:10:25 lr: 0.001579 loss: 3.533566 (3.436243) time: 0.915966 data: 0.000167 max mem: 18817 Epoch: [91/300] [ 650/1251] eta: 0:09:38 lr: 0.001578 loss: 3.611667 (3.436554) time: 0.930456 data: 0.000153 max mem: 18817 Epoch: [91/300] [ 700/1251] eta: 0:08:49 lr: 0.001578 loss: 3.792233 (3.446368) time: 0.978643 data: 0.000178 max mem: 18817 Epoch: [91/300] [ 750/1251] eta: 0:08:01 lr: 0.001578 loss: 3.515426 (3.446174) time: 0.995220 data: 0.000172 max mem: 18817 Epoch: [91/300] [ 800/1251] eta: 0:07:13 lr: 0.001577 loss: 3.565253 (3.453859) time: 0.975363 data: 0.000174 max mem: 18817 Epoch: [91/300] [ 850/1251] eta: 0:06:25 lr: 0.001577 loss: 3.619619 (3.457668) time: 0.923035 data: 0.000166 max mem: 18817 Epoch: [91/300] [ 900/1251] eta: 0:05:37 lr: 0.001577 loss: 3.537474 (3.455241) time: 0.946430 data: 0.000179 max mem: 18817 Epoch: [91/300] [ 950/1251] eta: 0:04:49 lr: 0.001576 loss: 3.434095 (3.450537) time: 0.992262 data: 0.000170 max mem: 18817 Epoch: [91/300] [1000/1251] eta: 0:04:01 lr: 0.001576 loss: 3.572422 (3.452374) time: 1.005701 data: 0.000165 max mem: 18817 Epoch: [91/300] [1050/1251] eta: 0:03:13 lr: 0.001576 loss: 3.307554 (3.451465) time: 0.950087 data: 0.000165 max mem: 18817 Epoch: [91/300] [1100/1251] eta: 0:02:25 lr: 0.001575 loss: 3.291743 (3.451492) time: 0.929237 data: 0.000165 max mem: 18817 Epoch: [91/300] [1150/1251] eta: 0:01:37 lr: 0.001575 loss: 3.459594 (3.450354) time: 0.919309 data: 0.000176 max mem: 18817 Epoch: [91/300] [1200/1251] eta: 0:00:49 lr: 0.001575 loss: 3.002217 (3.444373) time: 0.977358 data: 0.000175 max mem: 18817 Epoch: [91/300] [1250/1251] eta: 0:00:00 lr: 0.001574 loss: 3.126755 (3.441757) time: 0.992758 data: 0.000954 max mem: 18817 Epoch: [91/300] Total time: 0:20:02 (0.960914 s / it) Averaged stats: lr: 0.001574 loss: 3.126755 (3.444115) Test: [ 0/49] eta: 0:01:16 loss: 0.881443 (0.881443) acc1: 76.562500 (76.562500) acc5: 95.312500 (95.312500) time: 1.566806 data: 1.121253 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.881443 (0.864325) acc1: 78.125000 (79.687500) acc5: 95.312500 (94.744318) time: 0.532746 data: 0.102087 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.913042 (0.913236) acc1: 76.562500 (78.869048) acc5: 95.312500 (94.568452) time: 0.409627 data: 0.000151 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.930133 (0.908404) acc1: 78.125000 (78.729839) acc5: 95.312500 (94.707661) time: 0.380706 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.901207 (0.918001) acc1: 78.125000 (78.544207) acc5: 95.312500 (94.817073) time: 0.369887 data: 0.000135 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.979208 (0.916191) acc1: 78.125000 (78.592000) acc5: 95.312500 (94.880000) time: 0.365758 data: 0.000111 max mem: 18817 Test: Total time: 0:00:20 (0.411949 s / it) * Acc@1 78.110 Acc@5 94.514 loss 0.932 Max accuracy: 78.34% Epoch: [92/300] [ 0/1251] eta: 0:41:58 lr: 0.001574 loss: 3.226676 (3.226676) time: 2.012804 data: 1.104182 max mem: 18817 Epoch: [92/300] [ 50/1251] eta: 0:19:10 lr: 0.001574 loss: 3.294487 (3.270260) time: 0.965459 data: 0.000148 max mem: 18817 Epoch: [92/300] [ 100/1251] eta: 0:18:20 lr: 0.001574 loss: 3.283831 (3.272011) time: 0.925937 data: 0.000165 max mem: 18817 Epoch: [92/300] [ 150/1251] eta: 0:17:37 lr: 0.001573 loss: 3.487839 (3.353623) time: 0.949277 data: 0.000192 max mem: 18817 Epoch: [92/300] [ 200/1251] eta: 0:16:50 lr: 0.001573 loss: 3.385216 (3.375817) time: 0.989543 data: 0.000187 max mem: 18817 Epoch: [92/300] [ 250/1251] eta: 0:15:58 lr: 0.001573 loss: 3.516089 (3.393143) time: 0.980455 data: 0.000175 max mem: 18817 Epoch: [92/300] [ 300/1251] eta: 0:15:10 lr: 0.001572 loss: 3.253324 (3.391304) time: 0.974546 data: 0.000174 max mem: 18817 Epoch: [92/300] [ 350/1251] eta: 0:14:20 lr: 0.001572 loss: 3.546797 (3.406962) time: 0.916050 data: 0.000174 max mem: 18817 Epoch: [92/300] [ 400/1251] eta: 0:13:36 lr: 0.001572 loss: 3.516385 (3.410081) time: 0.931664 data: 0.000193 max mem: 18817 Epoch: [92/300] [ 450/1251] eta: 0:12:50 lr: 0.001571 loss: 3.425581 (3.413789) time: 1.016896 data: 0.000184 max mem: 18817 Epoch: [92/300] [ 500/1251] eta: 0:12:02 lr: 0.001571 loss: 3.430929 (3.421743) time: 1.039688 data: 0.000168 max mem: 18817 Epoch: [92/300] [ 550/1251] eta: 0:11:14 lr: 0.001571 loss: 3.608268 (3.425785) time: 0.992479 data: 0.000169 max mem: 18817 Epoch: [92/300] [ 600/1251] eta: 0:10:25 lr: 0.001570 loss: 3.707199 (3.427292) time: 0.926604 data: 0.000177 max mem: 18817 Epoch: [92/300] [ 650/1251] eta: 0:09:37 lr: 0.001570 loss: 3.618423 (3.428911) time: 0.938712 data: 0.000171 max mem: 18817 Epoch: [92/300] [ 700/1251] eta: 0:08:49 lr: 0.001570 loss: 3.289077 (3.425487) time: 0.970871 data: 0.000173 max mem: 18817 Epoch: [92/300] [ 750/1251] eta: 0:08:01 lr: 0.001569 loss: 3.696503 (3.432396) time: 1.036373 data: 0.000202 max mem: 18817 Epoch: [92/300] [ 800/1251] eta: 0:07:13 lr: 0.001569 loss: 3.604591 (3.428532) time: 0.982413 data: 0.000174 max mem: 18817 Epoch: [92/300] [ 850/1251] eta: 0:06:25 lr: 0.001569 loss: 3.380693 (3.426974) time: 0.919900 data: 0.000166 max mem: 18817 Epoch: [92/300] [ 900/1251] eta: 0:05:37 lr: 0.001568 loss: 3.578959 (3.427641) time: 0.930121 data: 0.000162 max mem: 18817 Epoch: [92/300] [ 950/1251] eta: 0:04:49 lr: 0.001568 loss: 3.323534 (3.424465) time: 0.978157 data: 0.000167 max mem: 18817 Epoch: [92/300] [1000/1251] eta: 0:04:01 lr: 0.001568 loss: 3.537519 (3.420393) time: 1.019231 data: 0.000170 max mem: 18817 Epoch: [92/300] [1050/1251] eta: 0:03:13 lr: 0.001567 loss: 3.535862 (3.418527) time: 0.957146 data: 0.000182 max mem: 18817 Epoch: [92/300] [1100/1251] eta: 0:02:24 lr: 0.001567 loss: 3.498470 (3.416466) time: 0.925989 data: 0.000173 max mem: 18817 Epoch: [92/300] [1150/1251] eta: 0:01:37 lr: 0.001567 loss: 3.464186 (3.417154) time: 0.939320 data: 0.000166 max mem: 18817 Epoch: [92/300] [1200/1251] eta: 0:00:48 lr: 0.001566 loss: 3.198190 (3.416477) time: 0.973966 data: 0.000167 max mem: 18817 Epoch: [92/300] [1250/1251] eta: 0:00:00 lr: 0.001566 loss: 3.366981 (3.419148) time: 1.040139 data: 0.000762 max mem: 18817 Epoch: [92/300] Total time: 0:20:02 (0.961351 s / it) Averaged stats: lr: 0.001566 loss: 3.366981 (3.422329) Test: [ 0/49] eta: 0:01:27 loss: 0.878911 (0.878911) acc1: 79.687500 (79.687500) acc5: 96.875000 (96.875000) time: 1.782753 data: 1.158917 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.876981 (0.872448) acc1: 79.687500 (79.829545) acc5: 95.312500 (94.460227) time: 0.537137 data: 0.105519 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.903260 (0.919825) acc1: 76.562500 (78.794643) acc5: 95.312500 (94.345238) time: 0.390258 data: 0.000156 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.943735 (0.906958) acc1: 78.125000 (78.830645) acc5: 95.312500 (94.556452) time: 0.365645 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.913442 (0.927908) acc1: 78.125000 (78.582317) acc5: 93.750000 (94.435976) time: 0.364358 data: 0.000120 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.935986 (0.930290) acc1: 76.562500 (78.464000) acc5: 93.750000 (94.432000) time: 0.358660 data: 0.000099 max mem: 18817 Test: Total time: 0:00:19 (0.404508 s / it) * Acc@1 78.246 Acc@5 94.564 loss 0.921 Max accuracy: 78.34% Epoch: [93/300] [ 0/1251] eta: 0:43:56 lr: 0.001566 loss: 3.785722 (3.785722) time: 2.107759 data: 1.064658 max mem: 18817 Epoch: [93/300] [ 50/1251] eta: 0:19:21 lr: 0.001565 loss: 3.239958 (3.436628) time: 0.969515 data: 0.000178 max mem: 18817 Epoch: [93/300] [ 100/1251] eta: 0:18:19 lr: 0.001565 loss: 3.698345 (3.464553) time: 0.920105 data: 0.000171 max mem: 18817 Epoch: [93/300] [ 150/1251] eta: 0:17:39 lr: 0.001565 loss: 3.466029 (3.465005) time: 0.924294 data: 0.000170 max mem: 18817 Epoch: [93/300] [ 200/1251] eta: 0:16:52 lr: 0.001564 loss: 3.229831 (3.447684) time: 1.003375 data: 0.000172 max mem: 18817 Epoch: [93/300] [ 250/1251] eta: 0:16:06 lr: 0.001564 loss: 3.610873 (3.429956) time: 1.051642 data: 0.000171 max mem: 18817 Epoch: [93/300] [ 300/1251] eta: 0:15:16 lr: 0.001564 loss: 3.474048 (3.441704) time: 0.979004 data: 0.000163 max mem: 18817 Epoch: [93/300] [ 350/1251] eta: 0:14:26 lr: 0.001563 loss: 3.350889 (3.438305) time: 0.936796 data: 0.000162 max mem: 18817 Epoch: [93/300] [ 400/1251] eta: 0:13:39 lr: 0.001563 loss: 3.416262 (3.425827) time: 0.955870 data: 0.000178 max mem: 18817 Epoch: [93/300] [ 450/1251] eta: 0:12:51 lr: 0.001563 loss: 3.443386 (3.431378) time: 0.969813 data: 0.000175 max mem: 18817 Epoch: [93/300] [ 500/1251] eta: 0:12:04 lr: 0.001562 loss: 3.514330 (3.435639) time: 1.054412 data: 0.000181 max mem: 18817 Epoch: [93/300] [ 550/1251] eta: 0:11:15 lr: 0.001562 loss: 3.422790 (3.421810) time: 0.959334 data: 0.000161 max mem: 18817 Epoch: [93/300] [ 600/1251] eta: 0:10:26 lr: 0.001562 loss: 3.501365 (3.425763) time: 0.913157 data: 0.000168 max mem: 18817 Epoch: [93/300] [ 650/1251] eta: 0:09:38 lr: 0.001561 loss: 3.529320 (3.422863) time: 0.933957 data: 0.000190 max mem: 18817 Epoch: [93/300] [ 700/1251] eta: 0:08:50 lr: 0.001561 loss: 3.546559 (3.423965) time: 0.986657 data: 0.000174 max mem: 18817 Epoch: [93/300] [ 750/1251] eta: 0:08:02 lr: 0.001561 loss: 3.673225 (3.428822) time: 1.024762 data: 0.000160 max mem: 18817 Epoch: [93/300] [ 800/1251] eta: 0:07:14 lr: 0.001560 loss: 3.504746 (3.427792) time: 0.979163 data: 0.000165 max mem: 18817 Epoch: [93/300] [ 850/1251] eta: 0:06:25 lr: 0.001560 loss: 3.338467 (3.425013) time: 0.925666 data: 0.000167 max mem: 18817 Epoch: [93/300] [ 900/1251] eta: 0:05:38 lr: 0.001560 loss: 3.620569 (3.426347) time: 0.935141 data: 0.000177 max mem: 18817 Epoch: [93/300] [ 950/1251] eta: 0:04:49 lr: 0.001559 loss: 3.432883 (3.426652) time: 1.004366 data: 0.000172 max mem: 18817 Epoch: [93/300] [1000/1251] eta: 0:04:01 lr: 0.001559 loss: 3.400746 (3.424425) time: 1.063104 data: 0.000177 max mem: 18817 Epoch: [93/300] [1050/1251] eta: 0:03:13 lr: 0.001559 loss: 3.633785 (3.421151) time: 0.962261 data: 0.000195 max mem: 18817 Epoch: [93/300] [1100/1251] eta: 0:02:25 lr: 0.001558 loss: 3.634247 (3.423656) time: 0.927485 data: 0.000184 max mem: 18817 Epoch: [93/300] [1150/1251] eta: 0:01:37 lr: 0.001558 loss: 3.174977 (3.417365) time: 0.922742 data: 0.000167 max mem: 18817 Epoch: [93/300] [1200/1251] eta: 0:00:49 lr: 0.001558 loss: 3.404830 (3.419439) time: 1.001788 data: 0.000175 max mem: 18817 Epoch: [93/300] [1250/1251] eta: 0:00:00 lr: 0.001557 loss: 3.519718 (3.420871) time: 1.047124 data: 0.000730 max mem: 18817 Epoch: [93/300] Total time: 0:20:06 (0.964277 s / it) Averaged stats: lr: 0.001557 loss: 3.519718 (3.430127) Test: [ 0/49] eta: 0:01:27 loss: 0.604035 (0.604035) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.793187 data: 1.394181 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.738911 (0.825908) acc1: 79.687500 (81.534091) acc5: 96.875000 (96.306818) time: 0.509708 data: 0.126878 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.922458 (0.886596) acc1: 78.125000 (79.985119) acc5: 95.312500 (95.163690) time: 0.372043 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.922458 (0.890099) acc1: 78.125000 (79.334677) acc5: 95.312500 (95.312500) time: 0.372380 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.884898 (0.903898) acc1: 76.562500 (78.849085) acc5: 95.312500 (95.083841) time: 0.382795 data: 0.000139 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.909816 (0.906343) acc1: 76.562500 (78.880000) acc5: 93.750000 (94.944000) time: 0.377624 data: 0.000115 max mem: 18817 Test: Total time: 0:00:19 (0.407428 s / it) * Acc@1 78.324 Acc@5 94.626 loss 0.919 Max accuracy: 78.34% Epoch: [94/300] [ 0/1251] eta: 0:41:12 lr: 0.001557 loss: 3.254504 (3.254504) time: 1.976036 data: 1.081410 max mem: 18817 Epoch: [94/300] [ 50/1251] eta: 0:19:18 lr: 0.001557 loss: 3.394722 (3.370781) time: 0.921620 data: 0.000156 max mem: 18817 Epoch: [94/300] [ 100/1251] eta: 0:18:28 lr: 0.001557 loss: 3.481919 (3.359945) time: 0.919151 data: 0.000174 max mem: 18817 Epoch: [94/300] [ 150/1251] eta: 0:17:40 lr: 0.001556 loss: 3.680864 (3.403005) time: 0.987825 data: 0.000175 max mem: 18817 Epoch: [94/300] [ 200/1251] eta: 0:16:52 lr: 0.001556 loss: 3.228128 (3.388260) time: 1.015852 data: 0.000182 max mem: 18817 Epoch: [94/300] [ 250/1251] eta: 0:16:03 lr: 0.001555 loss: 3.352405 (3.387422) time: 0.995067 data: 0.000167 max mem: 18817 Epoch: [94/300] [ 300/1251] eta: 0:15:15 lr: 0.001555 loss: 3.653875 (3.403284) time: 0.927610 data: 0.000149 max mem: 18817 Epoch: [94/300] [ 350/1251] eta: 0:14:28 lr: 0.001555 loss: 3.479288 (3.415980) time: 0.927941 data: 0.000155 max mem: 18817 Epoch: [94/300] [ 400/1251] eta: 0:13:39 lr: 0.001554 loss: 3.680776 (3.432137) time: 0.969631 data: 0.000168 max mem: 18817 Epoch: [94/300] [ 450/1251] eta: 0:12:52 lr: 0.001554 loss: 3.666039 (3.420591) time: 1.036454 data: 0.000166 max mem: 18817 Epoch: [94/300] [ 500/1251] eta: 0:12:03 lr: 0.001554 loss: 3.406915 (3.403607) time: 0.992591 data: 0.000170 max mem: 18817 Epoch: [94/300] [ 550/1251] eta: 0:11:15 lr: 0.001553 loss: 3.469413 (3.405719) time: 0.927369 data: 0.000161 max mem: 18817 Epoch: [94/300] [ 600/1251] eta: 0:10:28 lr: 0.001553 loss: 3.218204 (3.402022) time: 0.938788 data: 0.000172 max mem: 18817 Epoch: [94/300] [ 650/1251] eta: 0:09:40 lr: 0.001553 loss: 3.471001 (3.398988) time: 0.978462 data: 0.000178 max mem: 18817 Epoch: [94/300] [ 700/1251] eta: 0:08:51 lr: 0.001552 loss: 3.643882 (3.410905) time: 1.009549 data: 0.000177 max mem: 18817 Epoch: [94/300] [ 750/1251] eta: 0:08:02 lr: 0.001552 loss: 3.514249 (3.417575) time: 0.982212 data: 0.000162 max mem: 18817 Epoch: [94/300] [ 800/1251] eta: 0:07:14 lr: 0.001552 loss: 3.543633 (3.417875) time: 0.919078 data: 0.000175 max mem: 18817 Epoch: [94/300] [ 850/1251] eta: 0:06:26 lr: 0.001551 loss: 3.116123 (3.409885) time: 0.940253 data: 0.000159 max mem: 18817 Epoch: [94/300] [ 900/1251] eta: 0:05:38 lr: 0.001551 loss: 3.257161 (3.414998) time: 0.993227 data: 0.000168 max mem: 18817 Epoch: [94/300] [ 950/1251] eta: 0:04:50 lr: 0.001551 loss: 3.394550 (3.410079) time: 1.032996 data: 0.000166 max mem: 18817 Epoch: [94/300] [1000/1251] eta: 0:04:01 lr: 0.001550 loss: 3.568945 (3.413005) time: 0.967735 data: 0.000153 max mem: 18817 Epoch: [94/300] [1050/1251] eta: 0:03:13 lr: 0.001550 loss: 3.687498 (3.411471) time: 0.909304 data: 0.000167 max mem: 18817 Epoch: [94/300] [1100/1251] eta: 0:02:25 lr: 0.001550 loss: 3.308431 (3.411176) time: 0.931895 data: 0.000171 max mem: 18817 Epoch: [94/300] [1150/1251] eta: 0:01:37 lr: 0.001549 loss: 3.208396 (3.407905) time: 0.975903 data: 0.000169 max mem: 18817 Epoch: [94/300] [1200/1251] eta: 0:00:49 lr: 0.001549 loss: 3.212520 (3.403035) time: 1.058514 data: 0.000171 max mem: 18817 Epoch: [94/300] [1250/1251] eta: 0:00:00 lr: 0.001549 loss: 3.504819 (3.401063) time: 0.985649 data: 0.000770 max mem: 18817 Epoch: [94/300] Total time: 0:20:04 (0.963198 s / it) Averaged stats: lr: 0.001549 loss: 3.504819 (3.399456) Test: [ 0/49] eta: 0:01:13 loss: 0.804076 (0.804076) acc1: 81.250000 (81.250000) acc5: 93.750000 (93.750000) time: 1.507161 data: 1.109294 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.804076 (0.843913) acc1: 81.250000 (79.971591) acc5: 93.750000 (95.170455) time: 0.474459 data: 0.100975 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.911956 (0.889822) acc1: 78.125000 (79.538690) acc5: 93.750000 (94.791667) time: 0.368457 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.923870 (0.892206) acc1: 78.125000 (79.082661) acc5: 95.312500 (95.010081) time: 0.407915 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.939740 (0.906566) acc1: 78.125000 (78.925305) acc5: 95.312500 (94.740854) time: 0.414913 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.949022 (0.906499) acc1: 78.125000 (78.976000) acc5: 93.750000 (94.688000) time: 0.375765 data: 0.000098 max mem: 18817 Test: Total time: 0:00:20 (0.413471 s / it) * Acc@1 78.572 Acc@5 94.628 loss 0.917 Max accuracy: 78.57% Epoch: [95/300] [ 0/1251] eta: 0:43:43 lr: 0.001549 loss: 3.856305 (3.856305) time: 2.097485 data: 1.183767 max mem: 18817 Epoch: [95/300] [ 50/1251] eta: 0:19:18 lr: 0.001548 loss: 3.453714 (3.304352) time: 0.911908 data: 0.000176 max mem: 18817 Epoch: [95/300] [ 100/1251] eta: 0:18:31 lr: 0.001548 loss: 3.703594 (3.410509) time: 0.921703 data: 0.000183 max mem: 18817 Epoch: [95/300] [ 150/1251] eta: 0:17:46 lr: 0.001547 loss: 3.438375 (3.418437) time: 0.982697 data: 0.000169 max mem: 18817 Epoch: [95/300] [ 200/1251] eta: 0:17:00 lr: 0.001547 loss: 3.435048 (3.399855) time: 1.000580 data: 0.000168 max mem: 18817 Epoch: [95/300] [ 250/1251] eta: 0:16:06 lr: 0.001547 loss: 3.523847 (3.426007) time: 0.968899 data: 0.000171 max mem: 18817 Epoch: [95/300] [ 300/1251] eta: 0:15:15 lr: 0.001546 loss: 3.462757 (3.433150) time: 0.913327 data: 0.000163 max mem: 18817 Epoch: [95/300] [ 350/1251] eta: 0:14:28 lr: 0.001546 loss: 3.589351 (3.435462) time: 0.925060 data: 0.000184 max mem: 18817 Epoch: [95/300] [ 400/1251] eta: 0:13:40 lr: 0.001546 loss: 3.321326 (3.425159) time: 0.955476 data: 0.000160 max mem: 18817 Epoch: [95/300] [ 450/1251] eta: 0:12:53 lr: 0.001545 loss: 3.472533 (3.413185) time: 0.969546 data: 0.000169 max mem: 18817 Epoch: [95/300] [ 500/1251] eta: 0:12:03 lr: 0.001545 loss: 3.369931 (3.410905) time: 0.979044 data: 0.000176 max mem: 18817 Epoch: [95/300] [ 550/1251] eta: 0:11:14 lr: 0.001545 loss: 3.598311 (3.417335) time: 0.916564 data: 0.000176 max mem: 18817 Epoch: [95/300] [ 600/1251] eta: 0:10:26 lr: 0.001544 loss: 3.101984 (3.412580) time: 0.926683 data: 0.000177 max mem: 18817 Epoch: [95/300] [ 650/1251] eta: 0:09:38 lr: 0.001544 loss: 3.523983 (3.412086) time: 0.950673 data: 0.000166 max mem: 18817 Epoch: [95/300] [ 700/1251] eta: 0:08:50 lr: 0.001544 loss: 3.347036 (3.405377) time: 0.989623 data: 0.000173 max mem: 18817 Epoch: [95/300] [ 750/1251] eta: 0:08:02 lr: 0.001543 loss: 3.438105 (3.397758) time: 0.993755 data: 0.000171 max mem: 18817 Epoch: [95/300] [ 800/1251] eta: 0:07:14 lr: 0.001543 loss: 3.470088 (3.401822) time: 0.925703 data: 0.000167 max mem: 18817 Epoch: [95/300] [ 850/1251] eta: 0:06:26 lr: 0.001543 loss: 3.618221 (3.404268) time: 0.941303 data: 0.000167 max mem: 18817 Epoch: [95/300] [ 900/1251] eta: 0:05:38 lr: 0.001542 loss: 3.474539 (3.406855) time: 0.949437 data: 0.000179 max mem: 18817 Epoch: [95/300] [ 950/1251] eta: 0:04:50 lr: 0.001542 loss: 3.438967 (3.410375) time: 0.993621 data: 0.000176 max mem: 18817 Epoch: [95/300] [1000/1251] eta: 0:04:01 lr: 0.001542 loss: 3.237817 (3.405433) time: 0.978958 data: 0.000167 max mem: 18817 Epoch: [95/300] [1050/1251] eta: 0:03:13 lr: 0.001541 loss: 3.160528 (3.403772) time: 0.948922 data: 0.000164 max mem: 18817 Epoch: [95/300] [1100/1251] eta: 0:02:25 lr: 0.001541 loss: 3.620152 (3.405195) time: 0.943217 data: 0.000179 max mem: 18817 Epoch: [95/300] [1150/1251] eta: 0:01:37 lr: 0.001541 loss: 3.596301 (3.403911) time: 0.921046 data: 0.000185 max mem: 18817 Epoch: [95/300] [1200/1251] eta: 0:00:49 lr: 0.001540 loss: 3.521627 (3.403151) time: 0.991238 data: 0.000169 max mem: 18817 Epoch: [95/300] [1250/1251] eta: 0:00:00 lr: 0.001540 loss: 3.330873 (3.403467) time: 0.983101 data: 0.000767 max mem: 18817 Epoch: [95/300] Total time: 0:20:05 (0.963928 s / it) Averaged stats: lr: 0.001540 loss: 3.330873 (3.400756) Test: [ 0/49] eta: 0:01:28 loss: 0.807146 (0.807146) acc1: 82.812500 (82.812500) acc5: 93.750000 (93.750000) time: 1.807375 data: 1.333094 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.807146 (0.814789) acc1: 82.812500 (80.965909) acc5: 93.750000 (94.886364) time: 0.531284 data: 0.121348 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.868872 (0.853964) acc1: 78.125000 (79.613095) acc5: 93.750000 (94.642857) time: 0.406001 data: 0.000153 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.873611 (0.866418) acc1: 78.125000 (79.233871) acc5: 95.312500 (94.808468) time: 0.385678 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.909489 (0.888161) acc1: 78.125000 (78.734756) acc5: 95.312500 (94.512195) time: 0.360714 data: 0.000120 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.948795 (0.881584) acc1: 75.000000 (78.720000) acc5: 95.312500 (94.656000) time: 0.362361 data: 0.000097 max mem: 18817 Test: Total time: 0:00:20 (0.414623 s / it) * Acc@1 78.726 Acc@5 94.684 loss 0.883 Max accuracy: 78.73% Epoch: [96/300] [ 0/1251] eta: 0:42:00 lr: 0.001540 loss: 4.227387 (4.227387) time: 2.014658 data: 1.117342 max mem: 18817 Epoch: [96/300] [ 50/1251] eta: 0:19:07 lr: 0.001539 loss: 2.848650 (3.245905) time: 0.915590 data: 0.000159 max mem: 18817 Epoch: [96/300] [ 100/1251] eta: 0:18:27 lr: 0.001539 loss: 3.467610 (3.305967) time: 0.930975 data: 0.000166 max mem: 18817 Epoch: [96/300] [ 150/1251] eta: 0:17:36 lr: 0.001539 loss: 3.593222 (3.358212) time: 0.971885 data: 0.000174 max mem: 18817 Epoch: [96/300] [ 200/1251] eta: 0:16:48 lr: 0.001538 loss: 3.676060 (3.374701) time: 1.015200 data: 0.000178 max mem: 18817 Epoch: [96/300] [ 250/1251] eta: 0:16:02 lr: 0.001538 loss: 3.148171 (3.355159) time: 0.980902 data: 0.000161 max mem: 18817 Epoch: [96/300] [ 300/1251] eta: 0:15:13 lr: 0.001538 loss: 3.518737 (3.374962) time: 0.936333 data: 0.000158 max mem: 18817 Epoch: [96/300] [ 350/1251] eta: 0:14:25 lr: 0.001537 loss: 3.320261 (3.384666) time: 0.924927 data: 0.000165 max mem: 18817 Epoch: [96/300] [ 400/1251] eta: 0:13:37 lr: 0.001537 loss: 3.252019 (3.385250) time: 0.976107 data: 0.000204 max mem: 18817 Epoch: [96/300] [ 450/1251] eta: 0:12:50 lr: 0.001537 loss: 3.400930 (3.390531) time: 1.022991 data: 0.000159 max mem: 18817 Epoch: [96/300] [ 500/1251] eta: 0:12:01 lr: 0.001536 loss: 3.179581 (3.377101) time: 0.963415 data: 0.000169 max mem: 18817 Epoch: [96/300] [ 550/1251] eta: 0:11:12 lr: 0.001536 loss: 3.666824 (3.389138) time: 0.928256 data: 0.000165 max mem: 18817 Epoch: [96/300] [ 600/1251] eta: 0:10:25 lr: 0.001536 loss: 3.604466 (3.398826) time: 0.930957 data: 0.000171 max mem: 18817 Epoch: [96/300] [ 650/1251] eta: 0:09:38 lr: 0.001535 loss: 3.540509 (3.409195) time: 0.998752 data: 0.000181 max mem: 18817 Epoch: [96/300] [ 700/1251] eta: 0:08:50 lr: 0.001535 loss: 3.295676 (3.405675) time: 1.052899 data: 0.000173 max mem: 18817 Epoch: [96/300] [ 750/1251] eta: 0:08:01 lr: 0.001535 loss: 3.428859 (3.407843) time: 0.972883 data: 0.000164 max mem: 18817 Epoch: [96/300] [ 800/1251] eta: 0:07:13 lr: 0.001534 loss: 3.505138 (3.413203) time: 0.930810 data: 0.000185 max mem: 18817 Epoch: [96/300] [ 850/1251] eta: 0:06:25 lr: 0.001534 loss: 3.640505 (3.416627) time: 0.924999 data: 0.000174 max mem: 18817 Epoch: [96/300] [ 900/1251] eta: 0:05:37 lr: 0.001533 loss: 3.594533 (3.417050) time: 0.955659 data: 0.000173 max mem: 18817 Epoch: [96/300] [ 950/1251] eta: 0:04:49 lr: 0.001533 loss: 3.380543 (3.409005) time: 1.028976 data: 0.000166 max mem: 18817 Epoch: [96/300] [1000/1251] eta: 0:04:01 lr: 0.001533 loss: 3.203119 (3.410717) time: 0.980555 data: 0.000166 max mem: 18817 Epoch: [96/300] [1050/1251] eta: 0:03:12 lr: 0.001532 loss: 3.135201 (3.406658) time: 0.921958 data: 0.000208 max mem: 18817 Epoch: [96/300] [1100/1251] eta: 0:02:24 lr: 0.001532 loss: 3.226393 (3.411830) time: 0.941627 data: 0.000206 max mem: 18817 Epoch: [96/300] [1150/1251] eta: 0:01:36 lr: 0.001532 loss: 3.603904 (3.415243) time: 0.963296 data: 0.000173 max mem: 18817 Epoch: [96/300] [1200/1251] eta: 0:00:48 lr: 0.001531 loss: 3.568529 (3.413662) time: 1.006905 data: 0.000174 max mem: 18817 Epoch: [96/300] [1250/1251] eta: 0:00:00 lr: 0.001531 loss: 3.169045 (3.411244) time: 0.966101 data: 0.000806 max mem: 18817 Epoch: [96/300] Total time: 0:20:01 (0.960179 s / it) Averaged stats: lr: 0.001531 loss: 3.169045 (3.410399) Test: [ 0/49] eta: 0:01:28 loss: 0.693981 (0.693981) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.798168 data: 1.410462 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.818528 (0.868071) acc1: 79.687500 (79.403409) acc5: 95.312500 (94.744318) time: 0.499644 data: 0.128369 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.875722 (0.898591) acc1: 76.562500 (78.720238) acc5: 93.750000 (94.642857) time: 0.366471 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.876503 (0.893509) acc1: 76.562500 (78.679435) acc5: 95.312500 (95.010081) time: 0.363088 data: 0.000116 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.894361 (0.909695) acc1: 78.125000 (78.544207) acc5: 95.312500 (94.893293) time: 0.366814 data: 0.000119 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.945132 (0.911618) acc1: 78.125000 (78.496000) acc5: 95.312500 (94.848000) time: 0.375788 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.402638 s / it) * Acc@1 78.604 Acc@5 94.830 loss 0.922 Max accuracy: 78.73% Epoch: [97/300] [ 0/1251] eta: 0:41:45 lr: 0.001531 loss: 3.118095 (3.118095) time: 2.002629 data: 1.111045 max mem: 18817 Epoch: [97/300] [ 50/1251] eta: 0:19:37 lr: 0.001531 loss: 3.022214 (3.464700) time: 0.945698 data: 0.000180 max mem: 18817 Epoch: [97/300] [ 100/1251] eta: 0:18:44 lr: 0.001530 loss: 3.318636 (3.392190) time: 0.990383 data: 0.000156 max mem: 18817 Epoch: [97/300] [ 150/1251] eta: 0:17:43 lr: 0.001530 loss: 3.406017 (3.439964) time: 0.970521 data: 0.000188 max mem: 18817 Epoch: [97/300] [ 200/1251] eta: 0:16:52 lr: 0.001530 loss: 3.531799 (3.434444) time: 0.924180 data: 0.000177 max mem: 18817 Epoch: [97/300] [ 250/1251] eta: 0:16:02 lr: 0.001529 loss: 3.641655 (3.424828) time: 0.933485 data: 0.000181 max mem: 18817 Epoch: [97/300] [ 300/1251] eta: 0:15:16 lr: 0.001529 loss: 3.477502 (3.420218) time: 0.930939 data: 0.000170 max mem: 18817 Epoch: [97/300] [ 350/1251] eta: 0:14:28 lr: 0.001529 loss: 3.326802 (3.416746) time: 0.980921 data: 0.000175 max mem: 18817 Epoch: [97/300] [ 400/1251] eta: 0:13:37 lr: 0.001528 loss: 3.529880 (3.419654) time: 0.964586 data: 0.000183 max mem: 18817 Epoch: [97/300] [ 450/1251] eta: 0:12:47 lr: 0.001528 loss: 3.410240 (3.405669) time: 0.910173 data: 0.000178 max mem: 18817 Epoch: [97/300] [ 500/1251] eta: 0:12:00 lr: 0.001527 loss: 3.470267 (3.403866) time: 0.928329 data: 0.000165 max mem: 18817 Epoch: [97/300] [ 550/1251] eta: 0:11:14 lr: 0.001527 loss: 3.371528 (3.396298) time: 0.980815 data: 0.000169 max mem: 18817 Epoch: [97/300] [ 600/1251] eta: 0:10:26 lr: 0.001527 loss: 3.417719 (3.400019) time: 0.999557 data: 0.000166 max mem: 18817 Epoch: [97/300] [ 650/1251] eta: 0:09:37 lr: 0.001526 loss: 3.168161 (3.403696) time: 0.976492 data: 0.000166 max mem: 18817 Epoch: [97/300] [ 700/1251] eta: 0:08:49 lr: 0.001526 loss: 3.316717 (3.403843) time: 0.924934 data: 0.000169 max mem: 18817 Epoch: [97/300] [ 750/1251] eta: 0:08:02 lr: 0.001526 loss: 3.455127 (3.396398) time: 0.923057 data: 0.000166 max mem: 18817 Epoch: [97/300] [ 800/1251] eta: 0:07:14 lr: 0.001525 loss: 3.411005 (3.399658) time: 0.942427 data: 0.000167 max mem: 18817 Epoch: [97/300] [ 850/1251] eta: 0:06:25 lr: 0.001525 loss: 3.548513 (3.401440) time: 0.959810 data: 0.000175 max mem: 18817 Epoch: [97/300] [ 900/1251] eta: 0:05:37 lr: 0.001525 loss: 3.427588 (3.404010) time: 0.981927 data: 0.000177 max mem: 18817 Epoch: [97/300] [ 950/1251] eta: 0:04:49 lr: 0.001524 loss: 3.073379 (3.396965) time: 0.920879 data: 0.000159 max mem: 18817 Epoch: [97/300] [1000/1251] eta: 0:04:01 lr: 0.001524 loss: 3.241950 (3.394751) time: 0.937436 data: 0.000164 max mem: 18817 Epoch: [97/300] [1050/1251] eta: 0:03:13 lr: 0.001524 loss: 3.361313 (3.392116) time: 0.936977 data: 0.000167 max mem: 18817 Epoch: [97/300] [1100/1251] eta: 0:02:25 lr: 0.001523 loss: 3.386650 (3.385856) time: 1.005649 data: 0.000177 max mem: 18817 Epoch: [97/300] [1150/1251] eta: 0:01:37 lr: 0.001523 loss: 3.563910 (3.384598) time: 0.976790 data: 0.000175 max mem: 18817 Epoch: [97/300] [1200/1251] eta: 0:00:49 lr: 0.001523 loss: 3.556680 (3.390254) time: 0.949525 data: 0.000171 max mem: 18817 Epoch: [97/300] [1250/1251] eta: 0:00:00 lr: 0.001522 loss: 3.287055 (3.386998) time: 0.919022 data: 0.000802 max mem: 18817 Epoch: [97/300] Total time: 0:20:04 (0.963070 s / it) Averaged stats: lr: 0.001522 loss: 3.287055 (3.387922) Test: [ 0/49] eta: 0:01:27 loss: 0.795116 (0.795116) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.784414 data: 1.369464 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.846891 (0.864711) acc1: 79.687500 (81.107955) acc5: 95.312500 (94.744318) time: 0.498726 data: 0.124627 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.944779 (0.903758) acc1: 78.125000 (79.241071) acc5: 95.312500 (94.717262) time: 0.366942 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.896241 (0.898757) acc1: 78.125000 (79.233871) acc5: 95.312500 (94.959677) time: 0.458465 data: 0.000143 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.898787 (0.915248) acc1: 78.125000 (78.887195) acc5: 95.312500 (94.778963) time: 0.455741 data: 0.000132 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.937106 (0.911067) acc1: 78.125000 (79.232000) acc5: 93.750000 (94.912000) time: 0.355512 data: 0.000105 max mem: 18817 Test: Total time: 0:00:21 (0.434305 s / it) * Acc@1 78.662 Acc@5 94.702 loss 0.923 Max accuracy: 78.73% Epoch: [98/300] [ 0/1251] eta: 0:42:19 lr: 0.001522 loss: 3.935986 (3.935986) time: 2.029970 data: 1.124651 max mem: 18817 Epoch: [98/300] [ 50/1251] eta: 0:19:41 lr: 0.001522 loss: 3.395294 (3.438678) time: 1.036644 data: 0.000149 max mem: 18817 Epoch: [98/300] [ 100/1251] eta: 0:18:31 lr: 0.001521 loss: 3.576743 (3.413085) time: 0.971685 data: 0.000189 max mem: 18817 Epoch: [98/300] [ 150/1251] eta: 0:17:35 lr: 0.001521 loss: 3.085596 (3.379775) time: 0.925235 data: 0.000186 max mem: 18817 Epoch: [98/300] [ 200/1251] eta: 0:16:52 lr: 0.001521 loss: 3.713021 (3.381425) time: 0.929336 data: 0.000176 max mem: 18817 Epoch: [98/300] [ 250/1251] eta: 0:16:04 lr: 0.001520 loss: 3.454012 (3.404132) time: 0.980775 data: 0.000183 max mem: 18817 Epoch: [98/300] [ 300/1251] eta: 0:15:18 lr: 0.001520 loss: 3.526464 (3.391929) time: 1.046530 data: 0.000184 max mem: 18817 Epoch: [98/300] [ 350/1251] eta: 0:14:26 lr: 0.001520 loss: 3.536820 (3.383962) time: 0.963205 data: 0.000158 max mem: 18817 Epoch: [98/300] [ 400/1251] eta: 0:13:36 lr: 0.001519 loss: 3.526373 (3.386315) time: 0.922882 data: 0.000169 max mem: 18817 Epoch: [98/300] [ 450/1251] eta: 0:12:51 lr: 0.001519 loss: 3.503039 (3.393925) time: 0.943382 data: 0.000166 max mem: 18817 Epoch: [98/300] [ 500/1251] eta: 0:12:03 lr: 0.001519 loss: 2.959499 (3.377389) time: 0.987879 data: 0.000174 max mem: 18817 Epoch: [98/300] [ 550/1251] eta: 0:11:15 lr: 0.001518 loss: 3.129917 (3.374936) time: 1.031852 data: 0.000177 max mem: 18817 Epoch: [98/300] [ 600/1251] eta: 0:10:25 lr: 0.001518 loss: 3.437027 (3.374564) time: 0.951918 data: 0.000165 max mem: 18817 Epoch: [98/300] [ 650/1251] eta: 0:09:37 lr: 0.001518 loss: 3.299429 (3.362332) time: 0.927420 data: 0.000177 max mem: 18817 Epoch: [98/300] [ 700/1251] eta: 0:08:49 lr: 0.001517 loss: 3.581649 (3.373641) time: 0.923447 data: 0.000180 max mem: 18817 Epoch: [98/300] [ 750/1251] eta: 0:08:02 lr: 0.001517 loss: 3.546672 (3.371671) time: 0.982144 data: 0.000180 max mem: 18817 Epoch: [98/300] [ 800/1251] eta: 0:07:14 lr: 0.001516 loss: 3.572457 (3.371618) time: 1.032044 data: 0.000164 max mem: 18817 Epoch: [98/300] [ 850/1251] eta: 0:06:25 lr: 0.001516 loss: 3.519466 (3.362458) time: 0.982803 data: 0.000162 max mem: 18817 Epoch: [98/300] [ 900/1251] eta: 0:05:37 lr: 0.001516 loss: 3.415603 (3.369127) time: 0.913792 data: 0.000172 max mem: 18817 Epoch: [98/300] [ 950/1251] eta: 0:04:49 lr: 0.001515 loss: 3.688865 (3.376707) time: 0.928011 data: 0.000182 max mem: 18817 Epoch: [98/300] [1000/1251] eta: 0:04:01 lr: 0.001515 loss: 3.456680 (3.378023) time: 0.979269 data: 0.000177 max mem: 18817 Epoch: [98/300] [1050/1251] eta: 0:03:13 lr: 0.001515 loss: 3.597207 (3.381521) time: 1.066233 data: 0.000171 max mem: 18817 Epoch: [98/300] [1100/1251] eta: 0:02:25 lr: 0.001514 loss: 3.276043 (3.381692) time: 0.977299 data: 0.000166 max mem: 18817 Epoch: [98/300] [1150/1251] eta: 0:01:37 lr: 0.001514 loss: 3.668264 (3.383817) time: 0.911321 data: 0.000180 max mem: 18817 Epoch: [98/300] [1200/1251] eta: 0:00:49 lr: 0.001514 loss: 3.417141 (3.385508) time: 0.928928 data: 0.000169 max mem: 18817 Epoch: [98/300] [1250/1251] eta: 0:00:00 lr: 0.001513 loss: 3.388295 (3.382805) time: 0.974400 data: 0.000750 max mem: 18817 Epoch: [98/300] Total time: 0:20:03 (0.962132 s / it) Averaged stats: lr: 0.001513 loss: 3.388295 (3.381490) Test: [ 0/49] eta: 0:01:30 loss: 0.765237 (0.765237) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.838497 data: 1.437972 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.885510 (0.883718) acc1: 81.250000 (80.965909) acc5: 95.312500 (95.028409) time: 0.508864 data: 0.130880 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.919657 (0.918635) acc1: 78.125000 (79.389881) acc5: 95.312500 (94.717262) time: 0.374430 data: 0.000163 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.946355 (0.928273) acc1: 76.562500 (78.629032) acc5: 95.312500 (94.707661) time: 0.371028 data: 0.000149 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.969939 (0.943563) acc1: 78.125000 (78.429878) acc5: 93.750000 (94.626524) time: 0.366609 data: 0.000132 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.987363 (0.941694) acc1: 78.125000 (78.496000) acc5: 93.750000 (94.592000) time: 0.360943 data: 0.000107 max mem: 18817 Test: Total time: 0:00:19 (0.400215 s / it) * Acc@1 78.368 Acc@5 94.640 loss 0.939 Max accuracy: 78.73% Epoch: [99/300] [ 0/1251] eta: 0:42:41 lr: 0.001513 loss: 2.660078 (2.660078) time: 2.047496 data: 1.139569 max mem: 18817 Epoch: [99/300] [ 50/1251] eta: 0:19:33 lr: 0.001513 loss: 3.547782 (3.431744) time: 0.975450 data: 0.000198 max mem: 18817 Epoch: [99/300] [ 100/1251] eta: 0:18:29 lr: 0.001513 loss: 3.172353 (3.335786) time: 0.935015 data: 0.000190 max mem: 18817 Epoch: [99/300] [ 150/1251] eta: 0:17:47 lr: 0.001512 loss: 3.152466 (3.300394) time: 0.943421 data: 0.000181 max mem: 18817 Epoch: [99/300] [ 200/1251] eta: 0:17:00 lr: 0.001512 loss: 3.455172 (3.326698) time: 0.990649 data: 0.000177 max mem: 18817 Epoch: [99/300] [ 250/1251] eta: 0:16:10 lr: 0.001511 loss: 3.537020 (3.340880) time: 1.042186 data: 0.000182 max mem: 18817 Epoch: [99/300] [ 300/1251] eta: 0:15:17 lr: 0.001511 loss: 3.209486 (3.335612) time: 0.959349 data: 0.000165 max mem: 18817 Epoch: [99/300] [ 350/1251] eta: 0:14:26 lr: 0.001511 loss: 3.604197 (3.359462) time: 0.934185 data: 0.000166 max mem: 18817 Epoch: [99/300] [ 400/1251] eta: 0:13:39 lr: 0.001510 loss: 3.617238 (3.375886) time: 0.922471 data: 0.000182 max mem: 18817 Epoch: [99/300] [ 450/1251] eta: 0:12:51 lr: 0.001510 loss: 3.522874 (3.377174) time: 1.000713 data: 0.000167 max mem: 18817 Epoch: [99/300] [ 500/1251] eta: 0:12:03 lr: 0.001510 loss: 3.462834 (3.374413) time: 1.040458 data: 0.000184 max mem: 18817 Epoch: [99/300] [ 550/1251] eta: 0:11:14 lr: 0.001509 loss: 3.518464 (3.361847) time: 0.968932 data: 0.000168 max mem: 18817 Epoch: [99/300] [ 600/1251] eta: 0:10:25 lr: 0.001509 loss: 3.501637 (3.365275) time: 0.928352 data: 0.000172 max mem: 18817 Epoch: [99/300] [ 650/1251] eta: 0:09:38 lr: 0.001509 loss: 3.627312 (3.364797) time: 0.954918 data: 0.000189 max mem: 18817 Epoch: [99/300] [ 700/1251] eta: 0:08:50 lr: 0.001508 loss: 3.381883 (3.363042) time: 0.981270 data: 0.000163 max mem: 18817 Epoch: [99/300] [ 750/1251] eta: 0:08:02 lr: 0.001508 loss: 3.642365 (3.364915) time: 1.011987 data: 0.000179 max mem: 18817 Epoch: [99/300] [ 800/1251] eta: 0:07:13 lr: 0.001508 loss: 3.413881 (3.364838) time: 0.968126 data: 0.000178 max mem: 18817 Epoch: [99/300] [ 850/1251] eta: 0:06:24 lr: 0.001507 loss: 3.160491 (3.368715) time: 0.928604 data: 0.000166 max mem: 18817 Epoch: [99/300] [ 900/1251] eta: 0:05:37 lr: 0.001507 loss: 3.462929 (3.372627) time: 0.933146 data: 0.000176 max mem: 18817 Epoch: [99/300] [ 950/1251] eta: 0:04:49 lr: 0.001506 loss: 3.605792 (3.377442) time: 0.990179 data: 0.000188 max mem: 18817 Epoch: [99/300] [1000/1251] eta: 0:04:01 lr: 0.001506 loss: 3.526279 (3.380194) time: 1.035455 data: 0.000180 max mem: 18817 Epoch: [99/300] [1050/1251] eta: 0:03:13 lr: 0.001506 loss: 3.267316 (3.380033) time: 0.978632 data: 0.000174 max mem: 18817 Epoch: [99/300] [1100/1251] eta: 0:02:25 lr: 0.001505 loss: 3.531092 (3.379271) time: 0.925444 data: 0.000158 max mem: 18817 Epoch: [99/300] [1150/1251] eta: 0:01:37 lr: 0.001505 loss: 3.432339 (3.379700) time: 0.911427 data: 0.000168 max mem: 18817 Epoch: [99/300] [1200/1251] eta: 0:00:49 lr: 0.001505 loss: 3.615106 (3.382913) time: 0.985887 data: 0.000168 max mem: 18817 Epoch: [99/300] [1250/1251] eta: 0:00:00 lr: 0.001504 loss: 3.391355 (3.387385) time: 1.014048 data: 0.000772 max mem: 18817 Epoch: [99/300] Total time: 0:20:03 (0.961636 s / it) Averaged stats: lr: 0.001504 loss: 3.391355 (3.386620) Test: [ 0/49] eta: 0:01:21 loss: 0.769503 (0.769503) acc1: 84.375000 (84.375000) acc5: 93.750000 (93.750000) time: 1.672213 data: 1.214869 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.799628 (0.828275) acc1: 81.250000 (80.823864) acc5: 95.312500 (94.744318) time: 0.516365 data: 0.110583 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.871411 (0.878031) acc1: 78.125000 (79.687500) acc5: 93.750000 (94.568452) time: 0.395076 data: 0.000139 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.942023 (0.883368) acc1: 78.125000 (79.334677) acc5: 93.750000 (94.808468) time: 0.376357 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.875445 (0.890872) acc1: 79.687500 (79.192073) acc5: 95.312500 (94.855183) time: 0.367342 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.875445 (0.886778) acc1: 78.125000 (79.392000) acc5: 95.312500 (94.912000) time: 0.362022 data: 0.000099 max mem: 18817 Test: Total time: 0:00:19 (0.406000 s / it) * Acc@1 78.776 Acc@5 94.804 loss 0.904 Max accuracy: 78.78% Epoch: [100/300] [ 0/1251] eta: 0:54:19 lr: 0.001504 loss: 2.438055 (2.438055) time: 2.605125 data: 1.058516 max mem: 18817 Epoch: [100/300] [ 50/1251] eta: 0:19:51 lr: 0.001504 loss: 3.463130 (3.353577) time: 0.925948 data: 0.000171 max mem: 18817 Epoch: [100/300] [ 100/1251] eta: 0:18:52 lr: 0.001504 loss: 3.681060 (3.340265) time: 0.939163 data: 0.000170 max mem: 18817 Epoch: [100/300] [ 150/1251] eta: 0:17:54 lr: 0.001503 loss: 3.209046 (3.294122) time: 0.976096 data: 0.000187 max mem: 18817 Epoch: [100/300] [ 200/1251] eta: 0:16:56 lr: 0.001503 loss: 3.428587 (3.300693) time: 0.961900 data: 0.000168 max mem: 18817 Epoch: [100/300] [ 250/1251] eta: 0:16:07 lr: 0.001502 loss: 3.564595 (3.320484) time: 0.944458 data: 0.000162 max mem: 18817 Epoch: [100/300] [ 300/1251] eta: 0:15:20 lr: 0.001502 loss: 3.385127 (3.314232) time: 0.938516 data: 0.000168 max mem: 18817 Epoch: [100/300] [ 350/1251] eta: 0:14:32 lr: 0.001502 loss: 3.493459 (3.335498) time: 0.922974 data: 0.000161 max mem: 18817 Epoch: [100/300] [ 400/1251] eta: 0:13:43 lr: 0.001501 loss: 3.209283 (3.323875) time: 0.999447 data: 0.000185 max mem: 18817 Epoch: [100/300] [ 450/1251] eta: 0:12:52 lr: 0.001501 loss: 3.218861 (3.326624) time: 0.972893 data: 0.000174 max mem: 18817 Epoch: [100/300] [ 500/1251] eta: 0:12:04 lr: 0.001501 loss: 3.679135 (3.342772) time: 0.952344 data: 0.000212 max mem: 18817 Epoch: [100/300] [ 550/1251] eta: 0:11:15 lr: 0.001500 loss: 3.556638 (3.351106) time: 0.919577 data: 0.000171 max mem: 18817 Epoch: [100/300] [ 600/1251] eta: 0:10:27 lr: 0.001500 loss: 3.431251 (3.356827) time: 0.931540 data: 0.000186 max mem: 18817 Epoch: [100/300] [ 650/1251] eta: 0:09:38 lr: 0.001500 loss: 3.446238 (3.353267) time: 0.980555 data: 0.000183 max mem: 18817 Epoch: [100/300] [ 700/1251] eta: 0:08:49 lr: 0.001499 loss: 3.675187 (3.366772) time: 0.964482 data: 0.000178 max mem: 18817 Epoch: [100/300] [ 750/1251] eta: 0:08:00 lr: 0.001499 loss: 3.200852 (3.361539) time: 0.916695 data: 0.000159 max mem: 18817 Epoch: [100/300] [ 800/1251] eta: 0:07:13 lr: 0.001499 loss: 3.479648 (3.359817) time: 0.929804 data: 0.000172 max mem: 18817 Epoch: [100/300] [ 850/1251] eta: 0:06:25 lr: 0.001498 loss: 3.412904 (3.354134) time: 0.939753 data: 0.000168 max mem: 18817 Epoch: [100/300] [ 900/1251] eta: 0:05:37 lr: 0.001498 loss: 3.563157 (3.356006) time: 0.987454 data: 0.000167 max mem: 18817 Epoch: [100/300] [ 950/1251] eta: 0:04:49 lr: 0.001497 loss: 3.560941 (3.359164) time: 0.971249 data: 0.000168 max mem: 18817 Epoch: [100/300] [1000/1251] eta: 0:04:00 lr: 0.001497 loss: 3.063466 (3.353205) time: 0.923473 data: 0.000167 max mem: 18817 Epoch: [100/300] [1050/1251] eta: 0:03:13 lr: 0.001497 loss: 3.461190 (3.349346) time: 0.928954 data: 0.000188 max mem: 18817 Epoch: [100/300] [1100/1251] eta: 0:02:25 lr: 0.001496 loss: 3.540881 (3.357592) time: 0.938661 data: 0.000180 max mem: 18817 Epoch: [100/300] [1150/1251] eta: 0:01:37 lr: 0.001496 loss: 3.415003 (3.358128) time: 0.975170 data: 0.000196 max mem: 18817 Epoch: [100/300] [1200/1251] eta: 0:00:48 lr: 0.001496 loss: 3.356946 (3.355191) time: 0.964397 data: 0.000168 max mem: 18817 Epoch: [100/300] [1250/1251] eta: 0:00:00 lr: 0.001495 loss: 3.534844 (3.356716) time: 0.918585 data: 0.000761 max mem: 18817 Epoch: [100/300] Total time: 0:20:00 (0.959707 s / it) Averaged stats: lr: 0.001495 loss: 3.534844 (3.366027) Test: [ 0/49] eta: 0:02:11 loss: 0.768718 (0.768718) acc1: 81.250000 (81.250000) acc5: 93.750000 (93.750000) time: 2.692129 data: 1.206516 max mem: 18817 Test: [10/49] eta: 0:00:24 loss: 0.806337 (0.856078) acc1: 81.250000 (79.971591) acc5: 93.750000 (94.318182) time: 0.617700 data: 0.109830 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.882015 (0.884621) acc1: 78.125000 (78.645833) acc5: 93.750000 (94.122024) time: 0.386540 data: 0.000159 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.888232 (0.883979) acc1: 78.125000 (78.578629) acc5: 93.750000 (94.354839) time: 0.365210 data: 0.000159 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.911417 (0.900758) acc1: 76.562500 (78.163110) acc5: 95.312500 (94.550305) time: 0.365770 data: 0.000157 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.945729 (0.895985) acc1: 76.562500 (78.432000) acc5: 95.312500 (94.592000) time: 0.381223 data: 0.000130 max mem: 18817 Test: Total time: 0:00:21 (0.431426 s / it) * Acc@1 78.680 Acc@5 94.810 loss 0.893 Max accuracy: 78.78% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0100.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0100.pth Epoch: [101/300] [ 0/1251] eta: 0:45:09 lr: 0.001495 loss: 3.129290 (3.129290) time: 2.165849 data: 1.250947 max mem: 18817 Epoch: [101/300] [ 50/1251] eta: 0:19:29 lr: 0.001495 loss: 3.592291 (3.327296) time: 0.919679 data: 0.000225 max mem: 18817 Epoch: [101/300] [ 100/1251] eta: 0:18:39 lr: 0.001495 loss: 3.328767 (3.361238) time: 0.940982 data: 0.000216 max mem: 18817 Epoch: [101/300] [ 150/1251] eta: 0:17:51 lr: 0.001494 loss: 3.420955 (3.382140) time: 0.942237 data: 0.000213 max mem: 18817 Epoch: [101/300] [ 200/1251] eta: 0:17:02 lr: 0.001494 loss: 3.458094 (3.387846) time: 0.993978 data: 0.000223 max mem: 18817 Epoch: [101/300] [ 250/1251] eta: 0:16:08 lr: 0.001493 loss: 2.953634 (3.357946) time: 0.974771 data: 0.000221 max mem: 18817 Epoch: [101/300] [ 300/1251] eta: 0:15:16 lr: 0.001493 loss: 3.305845 (3.352282) time: 0.922826 data: 0.000210 max mem: 18817 Epoch: [101/300] [ 350/1251] eta: 0:14:30 lr: 0.001493 loss: 3.295526 (3.359054) time: 0.927498 data: 0.000206 max mem: 18817 Epoch: [101/300] [ 400/1251] eta: 0:13:42 lr: 0.001492 loss: 3.262099 (3.349861) time: 0.942581 data: 0.000205 max mem: 18817 Epoch: [101/300] [ 450/1251] eta: 0:12:54 lr: 0.001492 loss: 3.325556 (3.351355) time: 0.988053 data: 0.000230 max mem: 18817 Epoch: [101/300] [ 500/1251] eta: 0:12:03 lr: 0.001492 loss: 3.333737 (3.356026) time: 0.955171 data: 0.000201 max mem: 18817 Epoch: [101/300] [ 550/1251] eta: 0:11:14 lr: 0.001491 loss: 3.444095 (3.359995) time: 0.921046 data: 0.000213 max mem: 18817 Epoch: [101/300] [ 600/1251] eta: 0:10:26 lr: 0.001491 loss: 3.382456 (3.361756) time: 0.931130 data: 0.000154 max mem: 18817 Epoch: [101/300] [ 650/1251] eta: 0:09:39 lr: 0.001491 loss: 3.587226 (3.362081) time: 0.930970 data: 0.000151 max mem: 18817 Epoch: [101/300] [ 700/1251] eta: 0:08:51 lr: 0.001490 loss: 3.627498 (3.360359) time: 0.979005 data: 0.000159 max mem: 18817 Epoch: [101/300] [ 750/1251] eta: 0:08:02 lr: 0.001490 loss: 3.292686 (3.364003) time: 0.959271 data: 0.000159 max mem: 18817 Epoch: [101/300] [ 800/1251] eta: 0:07:13 lr: 0.001489 loss: 3.327242 (3.371121) time: 0.913850 data: 0.000178 max mem: 18817 Epoch: [101/300] [ 850/1251] eta: 0:06:25 lr: 0.001489 loss: 3.374293 (3.370289) time: 0.927599 data: 0.000171 max mem: 18817 Epoch: [101/300] [ 900/1251] eta: 0:05:37 lr: 0.001489 loss: 3.328736 (3.368873) time: 0.928968 data: 0.000159 max mem: 18817 Epoch: [101/300] [ 950/1251] eta: 0:04:49 lr: 0.001488 loss: 3.640431 (3.370915) time: 0.980617 data: 0.000154 max mem: 18817 Epoch: [101/300] [1000/1251] eta: 0:04:01 lr: 0.001488 loss: 3.417753 (3.373261) time: 0.972270 data: 0.000158 max mem: 18817 Epoch: [101/300] [1050/1251] eta: 0:03:13 lr: 0.001488 loss: 3.530497 (3.373151) time: 0.913620 data: 0.000178 max mem: 18817 Epoch: [101/300] [1100/1251] eta: 0:02:25 lr: 0.001487 loss: 3.421146 (3.373030) time: 0.942702 data: 0.000174 max mem: 18817 Epoch: [101/300] [1150/1251] eta: 0:01:37 lr: 0.001487 loss: 3.192598 (3.376343) time: 0.930203 data: 0.000168 max mem: 18817 Epoch: [101/300] [1200/1251] eta: 0:00:49 lr: 0.001487 loss: 3.623065 (3.378499) time: 0.990183 data: 0.000189 max mem: 18817 Epoch: [101/300] [1250/1251] eta: 0:00:00 lr: 0.001486 loss: 3.545402 (3.381518) time: 0.983422 data: 0.000761 max mem: 18817 Epoch: [101/300] Total time: 0:20:04 (0.962646 s / it) Averaged stats: lr: 0.001486 loss: 3.545402 (3.379450) Test: [ 0/49] eta: 0:01:20 loss: 0.705980 (0.705980) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.635305 data: 1.177600 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.833649 (0.824598) acc1: 81.250000 (81.107955) acc5: 95.312500 (95.170455) time: 0.500855 data: 0.107191 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.934045 (0.872689) acc1: 78.125000 (79.166667) acc5: 93.750000 (94.717262) time: 0.393411 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.934045 (0.868471) acc1: 76.562500 (79.082661) acc5: 95.312500 (95.161290) time: 0.394252 data: 0.000121 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.858640 (0.874242) acc1: 78.125000 (79.115854) acc5: 95.312500 (95.236280) time: 0.373845 data: 0.000119 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.909460 (0.868879) acc1: 78.125000 (79.328000) acc5: 95.312500 (95.136000) time: 0.361303 data: 0.000102 max mem: 18817 Test: Total time: 0:00:20 (0.409627 s / it) * Acc@1 78.910 Acc@5 94.746 loss 0.887 Max accuracy: 78.91% Epoch: [102/300] [ 0/1251] eta: 0:40:55 lr: 0.001486 loss: 3.529209 (3.529209) time: 1.962508 data: 1.064949 max mem: 18817 Epoch: [102/300] [ 50/1251] eta: 0:19:50 lr: 0.001486 loss: 3.497092 (3.342956) time: 0.944034 data: 0.000140 max mem: 18817 Epoch: [102/300] [ 100/1251] eta: 0:18:55 lr: 0.001485 loss: 3.391253 (3.304679) time: 0.951821 data: 0.000188 max mem: 18817 Epoch: [102/300] [ 150/1251] eta: 0:18:03 lr: 0.001485 loss: 3.522166 (3.273756) time: 1.003158 data: 0.000184 max mem: 18817 Epoch: [102/300] [ 200/1251] eta: 0:17:05 lr: 0.001485 loss: 3.132890 (3.278664) time: 0.984987 data: 0.000177 max mem: 18817 Epoch: [102/300] [ 250/1251] eta: 0:16:15 lr: 0.001484 loss: 3.504512 (3.303815) time: 0.972630 data: 0.000145 max mem: 18817 Epoch: [102/300] [ 300/1251] eta: 0:15:24 lr: 0.001484 loss: 3.573776 (3.337527) time: 0.935754 data: 0.000151 max mem: 18817 Epoch: [102/300] [ 350/1251] eta: 0:14:36 lr: 0.001484 loss: 3.250087 (3.353029) time: 0.927062 data: 0.000162 max mem: 18817 Epoch: [102/300] [ 400/1251] eta: 0:13:47 lr: 0.001483 loss: 3.642930 (3.356123) time: 0.997107 data: 0.000166 max mem: 18817 Epoch: [102/300] [ 450/1251] eta: 0:12:55 lr: 0.001483 loss: 3.228926 (3.355479) time: 0.974214 data: 0.000165 max mem: 18817 Epoch: [102/300] [ 500/1251] eta: 0:12:07 lr: 0.001483 loss: 3.189831 (3.348351) time: 0.966099 data: 0.000163 max mem: 18817 Epoch: [102/300] [ 550/1251] eta: 0:11:17 lr: 0.001482 loss: 3.494334 (3.339874) time: 0.932348 data: 0.000158 max mem: 18817 Epoch: [102/300] [ 600/1251] eta: 0:10:29 lr: 0.001482 loss: 3.243021 (3.338681) time: 0.922128 data: 0.000180 max mem: 18817 Epoch: [102/300] [ 650/1251] eta: 0:09:40 lr: 0.001481 loss: 3.081357 (3.341382) time: 0.985128 data: 0.000167 max mem: 18817 Epoch: [102/300] [ 700/1251] eta: 0:08:51 lr: 0.001481 loss: 3.308764 (3.342764) time: 0.988422 data: 0.000164 max mem: 18817 Epoch: [102/300] [ 750/1251] eta: 0:08:04 lr: 0.001481 loss: 3.330250 (3.339541) time: 0.982255 data: 0.000171 max mem: 18817 Epoch: [102/300] [ 800/1251] eta: 0:07:15 lr: 0.001480 loss: 3.673613 (3.351100) time: 0.941268 data: 0.000162 max mem: 18817 Epoch: [102/300] [ 850/1251] eta: 0:06:27 lr: 0.001480 loss: 3.553068 (3.353167) time: 0.933416 data: 0.000183 max mem: 18817 Epoch: [102/300] [ 900/1251] eta: 0:05:39 lr: 0.001480 loss: 3.595766 (3.353161) time: 1.000677 data: 0.000176 max mem: 18817 Epoch: [102/300] [ 950/1251] eta: 0:04:50 lr: 0.001479 loss: 3.625988 (3.362107) time: 1.002525 data: 0.000180 max mem: 18817 Epoch: [102/300] [1000/1251] eta: 0:04:02 lr: 0.001479 loss: 3.177915 (3.358690) time: 0.972767 data: 0.000175 max mem: 18817 Epoch: [102/300] [1050/1251] eta: 0:03:14 lr: 0.001479 loss: 3.346159 (3.360618) time: 0.921324 data: 0.000167 max mem: 18817 Epoch: [102/300] [1100/1251] eta: 0:02:25 lr: 0.001478 loss: 3.516631 (3.360970) time: 0.926112 data: 0.000175 max mem: 18817 Epoch: [102/300] [1150/1251] eta: 0:01:37 lr: 0.001478 loss: 3.013520 (3.355286) time: 0.978699 data: 0.000169 max mem: 18817 Epoch: [102/300] [1200/1251] eta: 0:00:49 lr: 0.001477 loss: 3.217328 (3.355923) time: 0.996785 data: 0.000191 max mem: 18817 Epoch: [102/300] [1250/1251] eta: 0:00:00 lr: 0.001477 loss: 3.614940 (3.355021) time: 0.963985 data: 0.000764 max mem: 18817 Epoch: [102/300] Total time: 0:20:06 (0.964784 s / it) Averaged stats: lr: 0.001477 loss: 3.614940 (3.363724) Test: [ 0/49] eta: 0:01:29 loss: 0.734628 (0.734628) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.819493 data: 1.396283 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.809563 (0.829221) acc1: 78.125000 (79.971591) acc5: 95.312500 (94.744318) time: 0.501431 data: 0.127063 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.867398 (0.878542) acc1: 78.125000 (78.943452) acc5: 93.750000 (93.824405) time: 0.366032 data: 0.000128 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.890611 (0.866842) acc1: 79.687500 (78.981855) acc5: 95.312500 (94.506048) time: 0.362881 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.916503 (0.880918) acc1: 79.687500 (78.963415) acc5: 95.312500 (94.626524) time: 0.379981 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.944464 (0.879827) acc1: 78.125000 (78.944000) acc5: 93.750000 (94.592000) time: 0.388264 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.407581 s / it) * Acc@1 78.878 Acc@5 94.898 loss 0.891 Max accuracy: 78.91% Epoch: [103/300] [ 0/1251] eta: 0:42:57 lr: 0.001477 loss: 3.547460 (3.547460) time: 2.060351 data: 1.155995 max mem: 18817 Epoch: [103/300] [ 50/1251] eta: 0:20:02 lr: 0.001477 loss: 3.112579 (3.246432) time: 0.983609 data: 0.000171 max mem: 18817 Epoch: [103/300] [ 100/1251] eta: 0:18:52 lr: 0.001476 loss: 3.399578 (3.336918) time: 1.018332 data: 0.000178 max mem: 18817 Epoch: [103/300] [ 150/1251] eta: 0:17:48 lr: 0.001476 loss: 3.594796 (3.325553) time: 0.965098 data: 0.000181 max mem: 18817 Epoch: [103/300] [ 200/1251] eta: 0:16:56 lr: 0.001476 loss: 3.322272 (3.332056) time: 0.939957 data: 0.000165 max mem: 18817 Epoch: [103/300] [ 250/1251] eta: 0:16:08 lr: 0.001475 loss: 3.470391 (3.351119) time: 0.934418 data: 0.000184 max mem: 18817 Epoch: [103/300] [ 300/1251] eta: 0:15:22 lr: 0.001475 loss: 3.130513 (3.329956) time: 0.999368 data: 0.000176 max mem: 18817 Epoch: [103/300] [ 350/1251] eta: 0:14:34 lr: 0.001475 loss: 3.219714 (3.323583) time: 0.985315 data: 0.000194 max mem: 18817 Epoch: [103/300] [ 400/1251] eta: 0:13:42 lr: 0.001474 loss: 3.595436 (3.337331) time: 0.977501 data: 0.000184 max mem: 18817 Epoch: [103/300] [ 450/1251] eta: 0:12:52 lr: 0.001474 loss: 3.374979 (3.339956) time: 0.918946 data: 0.000188 max mem: 18817 Epoch: [103/300] [ 500/1251] eta: 0:12:04 lr: 0.001473 loss: 3.390162 (3.332994) time: 0.937602 data: 0.000179 max mem: 18817 Epoch: [103/300] [ 550/1251] eta: 0:11:15 lr: 0.001473 loss: 3.267334 (3.330676) time: 0.968651 data: 0.000180 max mem: 18817 Epoch: [103/300] [ 600/1251] eta: 0:10:28 lr: 0.001473 loss: 3.211455 (3.332918) time: 1.027535 data: 0.000178 max mem: 18817 Epoch: [103/300] [ 650/1251] eta: 0:09:39 lr: 0.001472 loss: 3.566228 (3.345716) time: 0.988774 data: 0.000178 max mem: 18817 Epoch: [103/300] [ 700/1251] eta: 0:08:51 lr: 0.001472 loss: 3.468948 (3.345491) time: 0.917267 data: 0.000150 max mem: 18817 Epoch: [103/300] [ 750/1251] eta: 0:08:03 lr: 0.001472 loss: 3.498784 (3.347585) time: 0.931638 data: 0.000166 max mem: 18817 Epoch: [103/300] [ 800/1251] eta: 0:07:14 lr: 0.001471 loss: 3.092204 (3.337294) time: 0.976780 data: 0.000185 max mem: 18817 Epoch: [103/300] [ 850/1251] eta: 0:06:26 lr: 0.001471 loss: 3.560939 (3.343661) time: 0.993044 data: 0.000174 max mem: 18817 Epoch: [103/300] [ 900/1251] eta: 0:05:38 lr: 0.001470 loss: 2.982524 (3.337125) time: 0.974620 data: 0.000183 max mem: 18817 Epoch: [103/300] [ 950/1251] eta: 0:04:50 lr: 0.001470 loss: 3.649278 (3.346999) time: 0.917386 data: 0.000181 max mem: 18817 Epoch: [103/300] [1000/1251] eta: 0:04:01 lr: 0.001470 loss: 3.451427 (3.353568) time: 0.926459 data: 0.000184 max mem: 18817 Epoch: [103/300] [1050/1251] eta: 0:03:13 lr: 0.001469 loss: 3.437243 (3.352746) time: 0.960716 data: 0.000185 max mem: 18817 Epoch: [103/300] [1100/1251] eta: 0:02:25 lr: 0.001469 loss: 3.471338 (3.350318) time: 0.981759 data: 0.000178 max mem: 18817 Epoch: [103/300] [1150/1251] eta: 0:01:37 lr: 0.001469 loss: 3.334946 (3.353588) time: 0.973544 data: 0.000185 max mem: 18817 Epoch: [103/300] [1200/1251] eta: 0:00:49 lr: 0.001468 loss: 3.377775 (3.351869) time: 0.929427 data: 0.000192 max mem: 18817 Epoch: [103/300] [1250/1251] eta: 0:00:00 lr: 0.001468 loss: 3.412752 (3.352868) time: 0.909828 data: 0.000749 max mem: 18817 Epoch: [103/300] Total time: 0:20:06 (0.964514 s / it) Averaged stats: lr: 0.001468 loss: 3.412752 (3.355719) Test: [ 0/49] eta: 0:01:17 loss: 0.680102 (0.680102) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.579761 data: 1.158412 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.771529 (0.793190) acc1: 81.250000 (80.823864) acc5: 95.312500 (94.886364) time: 0.478522 data: 0.105448 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.823116 (0.823941) acc1: 79.687500 (80.580357) acc5: 95.312500 (94.866071) time: 0.364738 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.828600 (0.823001) acc1: 79.687500 (79.989919) acc5: 95.312500 (95.514113) time: 0.456354 data: 0.000141 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.828600 (0.837498) acc1: 79.687500 (79.763720) acc5: 96.875000 (95.541159) time: 0.454571 data: 0.000143 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.897333 (0.841766) acc1: 78.125000 (79.392000) acc5: 95.312500 (95.552000) time: 0.437890 data: 0.000112 max mem: 18817 Test: Total time: 0:00:20 (0.427019 s / it) * Acc@1 78.792 Acc@5 94.940 loss 0.867 Max accuracy: 78.91% Epoch: [104/300] [ 0/1251] eta: 0:41:11 lr: 0.001468 loss: 3.241432 (3.241432) time: 1.976016 data: 1.074103 max mem: 18817 Epoch: [104/300] [ 50/1251] eta: 0:19:49 lr: 0.001468 loss: 3.153405 (3.214525) time: 1.044910 data: 0.000159 max mem: 18817 Epoch: [104/300] [ 100/1251] eta: 0:18:40 lr: 0.001467 loss: 3.578857 (3.347354) time: 0.982702 data: 0.000179 max mem: 18817 Epoch: [104/300] [ 150/1251] eta: 0:17:43 lr: 0.001467 loss: 3.302226 (3.362970) time: 0.919314 data: 0.000179 max mem: 18817 Epoch: [104/300] [ 200/1251] eta: 0:16:56 lr: 0.001466 loss: 3.503767 (3.378135) time: 0.940322 data: 0.000178 max mem: 18817 Epoch: [104/300] [ 250/1251] eta: 0:16:09 lr: 0.001466 loss: 3.262735 (3.367473) time: 1.005849 data: 0.000169 max mem: 18817 Epoch: [104/300] [ 300/1251] eta: 0:15:21 lr: 0.001466 loss: 3.550204 (3.370645) time: 1.022041 data: 0.000176 max mem: 18817 Epoch: [104/300] [ 350/1251] eta: 0:14:31 lr: 0.001465 loss: 3.243070 (3.351779) time: 0.971727 data: 0.000192 max mem: 18817 Epoch: [104/300] [ 400/1251] eta: 0:13:40 lr: 0.001465 loss: 3.265244 (3.364309) time: 0.908867 data: 0.000199 max mem: 18817 Epoch: [104/300] [ 450/1251] eta: 0:12:52 lr: 0.001465 loss: 3.325450 (3.348062) time: 0.936518 data: 0.000186 max mem: 18817 Epoch: [104/300] [ 500/1251] eta: 0:12:04 lr: 0.001464 loss: 3.470052 (3.353112) time: 0.997915 data: 0.000171 max mem: 18817 Epoch: [104/300] [ 550/1251] eta: 0:11:16 lr: 0.001464 loss: 3.612134 (3.365748) time: 0.999531 data: 0.000195 max mem: 18817 Epoch: [104/300] [ 600/1251] eta: 0:10:27 lr: 0.001463 loss: 3.206078 (3.355529) time: 0.960197 data: 0.000187 max mem: 18817 Epoch: [104/300] [ 650/1251] eta: 0:09:37 lr: 0.001463 loss: 3.058868 (3.351579) time: 0.906076 data: 0.000169 max mem: 18817 Epoch: [104/300] [ 700/1251] eta: 0:08:50 lr: 0.001463 loss: 3.207758 (3.349214) time: 0.940447 data: 0.000179 max mem: 18817 Epoch: [104/300] [ 750/1251] eta: 0:08:01 lr: 0.001462 loss: 3.604260 (3.355592) time: 0.974545 data: 0.000174 max mem: 18817 Epoch: [104/300] [ 800/1251] eta: 0:07:13 lr: 0.001462 loss: 3.573500 (3.360653) time: 1.028935 data: 0.000177 max mem: 18817 Epoch: [104/300] [ 850/1251] eta: 0:06:25 lr: 0.001462 loss: 3.591575 (3.364613) time: 0.962299 data: 0.000201 max mem: 18817 Epoch: [104/300] [ 900/1251] eta: 0:05:37 lr: 0.001461 loss: 3.595592 (3.363395) time: 0.912937 data: 0.000172 max mem: 18817 Epoch: [104/300] [ 950/1251] eta: 0:04:49 lr: 0.001461 loss: 3.346680 (3.358086) time: 0.934562 data: 0.000171 max mem: 18817 Epoch: [104/300] [1000/1251] eta: 0:04:01 lr: 0.001461 loss: 3.373715 (3.353571) time: 0.985347 data: 0.000173 max mem: 18817 Epoch: [104/300] [1050/1251] eta: 0:03:13 lr: 0.001460 loss: 3.416991 (3.353952) time: 0.999600 data: 0.000173 max mem: 18817 Epoch: [104/300] [1100/1251] eta: 0:02:25 lr: 0.001460 loss: 3.099526 (3.351313) time: 0.962070 data: 0.000173 max mem: 18817 Epoch: [104/300] [1150/1251] eta: 0:01:37 lr: 0.001459 loss: 3.631791 (3.352860) time: 0.916268 data: 0.000171 max mem: 18817 Epoch: [104/300] [1200/1251] eta: 0:00:49 lr: 0.001459 loss: 3.429869 (3.351261) time: 0.938601 data: 0.000183 max mem: 18817 Epoch: [104/300] [1250/1251] eta: 0:00:00 lr: 0.001459 loss: 3.381303 (3.352408) time: 1.010715 data: 0.000757 max mem: 18817 Epoch: [104/300] Total time: 0:20:03 (0.962230 s / it) Averaged stats: lr: 0.001459 loss: 3.381303 (3.347217) Test: [ 0/49] eta: 0:01:20 loss: 0.709853 (0.709853) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.637214 data: 1.111989 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.814533 (0.825989) acc1: 78.125000 (80.113636) acc5: 95.312500 (94.744318) time: 0.484856 data: 0.101249 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.831193 (0.835752) acc1: 78.125000 (80.059524) acc5: 95.312500 (94.866071) time: 0.373859 data: 0.000176 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.829305 (0.838516) acc1: 79.687500 (79.788306) acc5: 95.312500 (94.959677) time: 0.370543 data: 0.000150 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.820445 (0.848358) acc1: 78.125000 (79.306402) acc5: 95.312500 (95.007622) time: 0.360817 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.865874 (0.852564) acc1: 78.125000 (79.040000) acc5: 95.312500 (94.944000) time: 0.355668 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.393985 s / it) * Acc@1 78.814 Acc@5 94.888 loss 0.873 Max accuracy: 78.91% Epoch: [105/300] [ 0/1251] eta: 0:46:01 lr: 0.001459 loss: 3.678541 (3.678541) time: 2.207670 data: 1.158299 max mem: 18817 Epoch: [105/300] [ 50/1251] eta: 0:19:33 lr: 0.001458 loss: 3.264060 (3.298152) time: 0.978096 data: 0.000172 max mem: 18817 Epoch: [105/300] [ 100/1251] eta: 0:18:26 lr: 0.001458 loss: 3.310578 (3.233377) time: 0.920799 data: 0.000170 max mem: 18817 Epoch: [105/300] [ 150/1251] eta: 0:17:49 lr: 0.001458 loss: 3.390051 (3.258317) time: 0.940844 data: 0.000184 max mem: 18817 Epoch: [105/300] [ 200/1251] eta: 0:17:01 lr: 0.001457 loss: 3.083997 (3.261240) time: 0.986761 data: 0.000182 max mem: 18817 Epoch: [105/300] [ 250/1251] eta: 0:16:13 lr: 0.001457 loss: 3.505168 (3.282314) time: 1.056304 data: 0.000171 max mem: 18817 Epoch: [105/300] [ 300/1251] eta: 0:15:19 lr: 0.001456 loss: 3.508651 (3.298197) time: 0.963782 data: 0.000183 max mem: 18817 Epoch: [105/300] [ 350/1251] eta: 0:14:28 lr: 0.001456 loss: 3.218083 (3.291122) time: 0.912218 data: 0.000178 max mem: 18817 Epoch: [105/300] [ 400/1251] eta: 0:13:41 lr: 0.001456 loss: 3.466137 (3.286921) time: 0.932686 data: 0.000177 max mem: 18817 Epoch: [105/300] [ 450/1251] eta: 0:12:52 lr: 0.001455 loss: 3.453349 (3.295625) time: 0.972783 data: 0.000177 max mem: 18817 Epoch: [105/300] [ 500/1251] eta: 0:12:04 lr: 0.001455 loss: 3.519760 (3.305532) time: 1.039115 data: 0.000164 max mem: 18817 Epoch: [105/300] [ 550/1251] eta: 0:11:15 lr: 0.001455 loss: 3.288870 (3.307443) time: 0.982874 data: 0.000175 max mem: 18817 Epoch: [105/300] [ 600/1251] eta: 0:10:27 lr: 0.001454 loss: 3.240791 (3.301797) time: 0.925224 data: 0.000177 max mem: 18817 Epoch: [105/300] [ 650/1251] eta: 0:09:39 lr: 0.001454 loss: 3.433639 (3.308367) time: 0.932300 data: 0.000160 max mem: 18817 Epoch: [105/300] [ 700/1251] eta: 0:08:51 lr: 0.001453 loss: 3.450224 (3.310996) time: 0.988786 data: 0.000188 max mem: 18817 Epoch: [105/300] [ 750/1251] eta: 0:08:03 lr: 0.001453 loss: 3.509139 (3.320478) time: 1.042364 data: 0.000173 max mem: 18817 Epoch: [105/300] [ 800/1251] eta: 0:07:14 lr: 0.001453 loss: 3.528372 (3.319704) time: 0.969230 data: 0.000175 max mem: 18817 Epoch: [105/300] [ 850/1251] eta: 0:06:26 lr: 0.001452 loss: 3.164361 (3.323934) time: 0.919239 data: 0.000166 max mem: 18817 Epoch: [105/300] [ 900/1251] eta: 0:05:38 lr: 0.001452 loss: 3.449857 (3.325348) time: 0.931406 data: 0.000187 max mem: 18817 Epoch: [105/300] [ 950/1251] eta: 0:04:50 lr: 0.001452 loss: 3.511668 (3.329703) time: 0.984793 data: 0.000172 max mem: 18817 Epoch: [105/300] [1000/1251] eta: 0:04:01 lr: 0.001451 loss: 3.497470 (3.327062) time: 1.024703 data: 0.000172 max mem: 18817 Epoch: [105/300] [1050/1251] eta: 0:03:13 lr: 0.001451 loss: 3.438401 (3.329185) time: 0.976131 data: 0.000170 max mem: 18817 Epoch: [105/300] [1100/1251] eta: 0:02:25 lr: 0.001451 loss: 3.506371 (3.330382) time: 0.932878 data: 0.000164 max mem: 18817 Epoch: [105/300] [1150/1251] eta: 0:01:37 lr: 0.001450 loss: 3.650885 (3.334485) time: 0.940785 data: 0.000185 max mem: 18817 Epoch: [105/300] [1200/1251] eta: 0:00:49 lr: 0.001450 loss: 3.334902 (3.338074) time: 0.994470 data: 0.000262 max mem: 18817 Epoch: [105/300] [1250/1251] eta: 0:00:00 lr: 0.001449 loss: 3.148679 (3.339377) time: 0.973066 data: 0.000759 max mem: 18817 Epoch: [105/300] Total time: 0:20:06 (0.964342 s / it) Averaged stats: lr: 0.001449 loss: 3.148679 (3.346144) Test: [ 0/49] eta: 0:01:28 loss: 0.766912 (0.766912) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.809412 data: 1.374672 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.812732 (0.838394) acc1: 81.250000 (80.539773) acc5: 95.312500 (94.460227) time: 0.498913 data: 0.125111 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.852212 (0.883757) acc1: 78.125000 (79.538690) acc5: 93.750000 (94.270833) time: 0.365115 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.906235 (0.883390) acc1: 78.125000 (79.334677) acc5: 95.312500 (94.758065) time: 0.366121 data: 0.000141 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.930551 (0.897079) acc1: 78.125000 (79.268293) acc5: 95.312500 (94.626524) time: 0.363597 data: 0.000136 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.955447 (0.891599) acc1: 78.125000 (79.392000) acc5: 93.750000 (94.688000) time: 0.357556 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.394712 s / it) * Acc@1 78.808 Acc@5 94.882 loss 0.906 Max accuracy: 78.91% Epoch: [106/300] [ 0/1251] eta: 0:40:46 lr: 0.001449 loss: 3.463851 (3.463851) time: 1.955646 data: 1.069607 max mem: 18817 Epoch: [106/300] [ 50/1251] eta: 0:19:13 lr: 0.001449 loss: 3.368560 (3.323383) time: 0.919891 data: 0.000173 max mem: 18817 Epoch: [106/300] [ 100/1251] eta: 0:18:30 lr: 0.001449 loss: 3.407376 (3.299266) time: 0.931411 data: 0.000169 max mem: 18817 Epoch: [106/300] [ 150/1251] eta: 0:17:43 lr: 0.001448 loss: 2.891647 (3.229238) time: 0.977450 data: 0.000190 max mem: 18817 Epoch: [106/300] [ 200/1251] eta: 0:16:51 lr: 0.001448 loss: 3.480231 (3.280090) time: 0.998351 data: 0.000175 max mem: 18817 Epoch: [106/300] [ 250/1251] eta: 0:16:01 lr: 0.001448 loss: 3.618952 (3.321688) time: 0.967014 data: 0.000163 max mem: 18817 Epoch: [106/300] [ 300/1251] eta: 0:15:10 lr: 0.001447 loss: 3.310938 (3.340764) time: 0.928201 data: 0.000174 max mem: 18817 Epoch: [106/300] [ 350/1251] eta: 0:14:25 lr: 0.001447 loss: 3.556331 (3.357554) time: 0.939015 data: 0.000157 max mem: 18817 Epoch: [106/300] [ 400/1251] eta: 0:13:37 lr: 0.001446 loss: 3.422324 (3.359083) time: 0.967655 data: 0.000179 max mem: 18817 Epoch: [106/300] [ 450/1251] eta: 0:12:49 lr: 0.001446 loss: 3.506318 (3.358404) time: 1.035437 data: 0.000170 max mem: 18817 Epoch: [106/300] [ 500/1251] eta: 0:12:01 lr: 0.001446 loss: 3.504708 (3.368348) time: 0.965103 data: 0.000164 max mem: 18817 Epoch: [106/300] [ 550/1251] eta: 0:11:13 lr: 0.001445 loss: 3.388111 (3.365303) time: 0.931286 data: 0.000155 max mem: 18817 Epoch: [106/300] [ 600/1251] eta: 0:10:26 lr: 0.001445 loss: 3.456837 (3.356646) time: 0.936507 data: 0.000172 max mem: 18817 Epoch: [106/300] [ 650/1251] eta: 0:09:38 lr: 0.001445 loss: 3.377282 (3.366743) time: 0.984182 data: 0.000166 max mem: 18817 Epoch: [106/300] [ 700/1251] eta: 0:08:50 lr: 0.001444 loss: 3.170435 (3.362550) time: 1.014726 data: 0.000173 max mem: 18817 Epoch: [106/300] [ 750/1251] eta: 0:08:02 lr: 0.001444 loss: 3.624959 (3.368233) time: 0.997004 data: 0.000166 max mem: 18817 Epoch: [106/300] [ 800/1251] eta: 0:07:13 lr: 0.001443 loss: 3.294201 (3.364181) time: 0.938520 data: 0.000183 max mem: 18817 Epoch: [106/300] [ 850/1251] eta: 0:06:25 lr: 0.001443 loss: 3.388789 (3.364480) time: 0.943748 data: 0.000180 max mem: 18817 Epoch: [106/300] [ 900/1251] eta: 0:05:37 lr: 0.001443 loss: 3.444612 (3.365303) time: 0.971940 data: 0.000161 max mem: 18817 Epoch: [106/300] [ 950/1251] eta: 0:04:49 lr: 0.001442 loss: 3.172069 (3.360859) time: 1.059699 data: 0.000191 max mem: 18817 Epoch: [106/300] [1000/1251] eta: 0:04:01 lr: 0.001442 loss: 3.654341 (3.365945) time: 0.974035 data: 0.000163 max mem: 18817 Epoch: [106/300] [1050/1251] eta: 0:03:13 lr: 0.001442 loss: 3.433148 (3.365080) time: 0.916197 data: 0.000165 max mem: 18817 Epoch: [106/300] [1100/1251] eta: 0:02:25 lr: 0.001441 loss: 3.443781 (3.366841) time: 0.926639 data: 0.000190 max mem: 18817 Epoch: [106/300] [1150/1251] eta: 0:01:37 lr: 0.001441 loss: 3.390235 (3.363492) time: 0.972197 data: 0.000169 max mem: 18817 Epoch: [106/300] [1200/1251] eta: 0:00:49 lr: 0.001440 loss: 3.436094 (3.358625) time: 1.042681 data: 0.000170 max mem: 18817 Epoch: [106/300] [1250/1251] eta: 0:00:00 lr: 0.001440 loss: 3.411257 (3.359551) time: 0.980602 data: 0.000747 max mem: 18817 Epoch: [106/300] Total time: 0:20:04 (0.962508 s / it) Averaged stats: lr: 0.001440 loss: 3.411257 (3.359853) Test: [ 0/49] eta: 0:01:20 loss: 0.699760 (0.699760) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.639719 data: 1.210025 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.786876 (0.802635) acc1: 79.687500 (80.681818) acc5: 95.312500 (95.454545) time: 0.492738 data: 0.110137 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.835102 (0.842198) acc1: 78.125000 (79.538690) acc5: 95.312500 (95.163690) time: 0.369950 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.864350 (0.849794) acc1: 78.125000 (78.981855) acc5: 95.312500 (95.262097) time: 0.374160 data: 0.000139 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.865049 (0.873957) acc1: 78.125000 (78.696646) acc5: 95.312500 (94.893293) time: 0.409649 data: 0.000137 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.908586 (0.872415) acc1: 78.125000 (78.688000) acc5: 95.312500 (94.976000) time: 0.422248 data: 0.000109 max mem: 18817 Test: Total time: 0:00:20 (0.421461 s / it) * Acc@1 78.956 Acc@5 94.920 loss 0.870 Max accuracy: 78.96% Epoch: [107/300] [ 0/1251] eta: 0:39:52 lr: 0.001440 loss: 2.261266 (2.261266) time: 1.912449 data: 1.004171 max mem: 18817 Epoch: [107/300] [ 50/1251] eta: 0:19:59 lr: 0.001440 loss: 3.367459 (3.260360) time: 0.937930 data: 0.000176 max mem: 18817 Epoch: [107/300] [ 100/1251] eta: 0:18:49 lr: 0.001439 loss: 3.283379 (3.284584) time: 0.984098 data: 0.000176 max mem: 18817 Epoch: [107/300] [ 150/1251] eta: 0:17:54 lr: 0.001439 loss: 3.315866 (3.291212) time: 1.015919 data: 0.000187 max mem: 18817 Epoch: [107/300] [ 200/1251] eta: 0:16:57 lr: 0.001439 loss: 3.286299 (3.295393) time: 0.969458 data: 0.000171 max mem: 18817 Epoch: [107/300] [ 250/1251] eta: 0:16:05 lr: 0.001438 loss: 3.464610 (3.303636) time: 0.940432 data: 0.000194 max mem: 18817 Epoch: [107/300] [ 300/1251] eta: 0:15:19 lr: 0.001438 loss: 3.612577 (3.317494) time: 0.925923 data: 0.000184 max mem: 18817 Epoch: [107/300] [ 350/1251] eta: 0:14:30 lr: 0.001437 loss: 3.181400 (3.325052) time: 0.976362 data: 0.000155 max mem: 18817 Epoch: [107/300] [ 400/1251] eta: 0:13:38 lr: 0.001437 loss: 3.366146 (3.327627) time: 0.970756 data: 0.000173 max mem: 18817 Epoch: [107/300] [ 450/1251] eta: 0:12:49 lr: 0.001437 loss: 3.342126 (3.330681) time: 0.959186 data: 0.000189 max mem: 18817 Epoch: [107/300] [ 500/1251] eta: 0:12:01 lr: 0.001436 loss: 3.459290 (3.331299) time: 0.934743 data: 0.000172 max mem: 18817 Epoch: [107/300] [ 550/1251] eta: 0:11:14 lr: 0.001436 loss: 3.527494 (3.330744) time: 0.930788 data: 0.000177 max mem: 18817 Epoch: [107/300] [ 600/1251] eta: 0:10:26 lr: 0.001436 loss: 3.251988 (3.325003) time: 0.989922 data: 0.000178 max mem: 18817 Epoch: [107/300] [ 650/1251] eta: 0:09:38 lr: 0.001435 loss: 3.521017 (3.334169) time: 0.996503 data: 0.000177 max mem: 18817 Epoch: [107/300] [ 700/1251] eta: 0:08:50 lr: 0.001435 loss: 3.168448 (3.329287) time: 0.971616 data: 0.000182 max mem: 18817 Epoch: [107/300] [ 750/1251] eta: 0:08:01 lr: 0.001434 loss: 3.244551 (3.321447) time: 0.931383 data: 0.000173 max mem: 18817 Epoch: [107/300] [ 800/1251] eta: 0:07:13 lr: 0.001434 loss: 3.511486 (3.324395) time: 0.922250 data: 0.000178 max mem: 18817 Epoch: [107/300] [ 850/1251] eta: 0:06:26 lr: 0.001434 loss: 3.426390 (3.328992) time: 0.988332 data: 0.000171 max mem: 18817 Epoch: [107/300] [ 900/1251] eta: 0:05:37 lr: 0.001433 loss: 3.208432 (3.329694) time: 1.012812 data: 0.000172 max mem: 18817 Epoch: [107/300] [ 950/1251] eta: 0:04:49 lr: 0.001433 loss: 3.360352 (3.337468) time: 0.989099 data: 0.000174 max mem: 18817 Epoch: [107/300] [1000/1251] eta: 0:04:01 lr: 0.001433 loss: 3.502010 (3.337809) time: 0.936915 data: 0.000160 max mem: 18817 Epoch: [107/300] [1050/1251] eta: 0:03:13 lr: 0.001432 loss: 3.387699 (3.341488) time: 0.939703 data: 0.000200 max mem: 18817 Epoch: [107/300] [1100/1251] eta: 0:02:25 lr: 0.001432 loss: 3.262244 (3.345417) time: 0.991238 data: 0.000185 max mem: 18817 Epoch: [107/300] [1150/1251] eta: 0:01:37 lr: 0.001431 loss: 3.576899 (3.343330) time: 0.993543 data: 0.000168 max mem: 18817 Epoch: [107/300] [1200/1251] eta: 0:00:49 lr: 0.001431 loss: 3.394829 (3.342987) time: 0.967142 data: 0.000184 max mem: 18817 Epoch: [107/300] [1250/1251] eta: 0:00:00 lr: 0.001431 loss: 3.461821 (3.337266) time: 0.942655 data: 0.000737 max mem: 18817 Epoch: [107/300] Total time: 0:20:03 (0.962131 s / it) Averaged stats: lr: 0.001431 loss: 3.461821 (3.333979) Test: [ 0/49] eta: 0:01:27 loss: 0.634829 (0.634829) acc1: 82.812500 (82.812500) acc5: 100.000000 (100.000000) time: 1.776992 data: 1.354131 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.818596 (0.815757) acc1: 81.250000 (79.971591) acc5: 95.312500 (95.454545) time: 0.494107 data: 0.123261 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.853250 (0.860733) acc1: 76.562500 (78.571429) acc5: 95.312500 (95.238095) time: 0.464822 data: 0.000158 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.893688 (0.863809) acc1: 76.562500 (78.377016) acc5: 95.312500 (95.211694) time: 0.462818 data: 0.000147 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.857447 (0.865055) acc1: 78.125000 (78.582317) acc5: 95.312500 (95.198171) time: 0.359673 data: 0.000146 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.858456 (0.863341) acc1: 78.125000 (78.464000) acc5: 95.312500 (95.232000) time: 0.355104 data: 0.000117 max mem: 18817 Test: Total time: 0:00:21 (0.433364 s / it) * Acc@1 79.124 Acc@5 94.942 loss 0.872 Max accuracy: 79.12% Epoch: [108/300] [ 0/1251] eta: 0:40:57 lr: 0.001431 loss: 3.490193 (3.490193) time: 1.964460 data: 1.077027 max mem: 18817 Epoch: [108/300] [ 50/1251] eta: 0:19:33 lr: 0.001430 loss: 3.397553 (3.421137) time: 0.924566 data: 0.000156 max mem: 18817 Epoch: [108/300] [ 100/1251] eta: 0:18:36 lr: 0.001430 loss: 3.242596 (3.367507) time: 0.985561 data: 0.000172 max mem: 18817 Epoch: [108/300] [ 150/1251] eta: 0:17:40 lr: 0.001430 loss: 3.477234 (3.379320) time: 0.985283 data: 0.000196 max mem: 18817 Epoch: [108/300] [ 200/1251] eta: 0:16:48 lr: 0.001429 loss: 3.529877 (3.375567) time: 0.919965 data: 0.000169 max mem: 18817 Epoch: [108/300] [ 250/1251] eta: 0:16:06 lr: 0.001429 loss: 3.402659 (3.371411) time: 0.938543 data: 0.000161 max mem: 18817 Epoch: [108/300] [ 300/1251] eta: 0:15:18 lr: 0.001428 loss: 3.464355 (3.379136) time: 0.941945 data: 0.000168 max mem: 18817 Epoch: [108/300] [ 350/1251] eta: 0:14:32 lr: 0.001428 loss: 3.496137 (3.381495) time: 0.989477 data: 0.000175 max mem: 18817 Epoch: [108/300] [ 400/1251] eta: 0:13:44 lr: 0.001428 loss: 3.082199 (3.350524) time: 1.013027 data: 0.000170 max mem: 18817 Epoch: [108/300] [ 450/1251] eta: 0:12:55 lr: 0.001427 loss: 3.587912 (3.361811) time: 0.993141 data: 0.000194 max mem: 18817 Epoch: [108/300] [ 500/1251] eta: 0:12:07 lr: 0.001427 loss: 3.522042 (3.355797) time: 0.950396 data: 0.000183 max mem: 18817 Epoch: [108/300] [ 550/1251] eta: 0:11:18 lr: 0.001427 loss: 3.448046 (3.364133) time: 0.920020 data: 0.000162 max mem: 18817 Epoch: [108/300] [ 600/1251] eta: 0:10:30 lr: 0.001426 loss: 3.407206 (3.371473) time: 0.990032 data: 0.000174 max mem: 18817 Epoch: [108/300] [ 650/1251] eta: 0:09:42 lr: 0.001426 loss: 3.412953 (3.373025) time: 1.059865 data: 0.000162 max mem: 18817 Epoch: [108/300] [ 700/1251] eta: 0:08:52 lr: 0.001425 loss: 3.263857 (3.367182) time: 0.971625 data: 0.000172 max mem: 18817 Epoch: [108/300] [ 750/1251] eta: 0:08:03 lr: 0.001425 loss: 3.296480 (3.365105) time: 0.918401 data: 0.000155 max mem: 18817 Epoch: [108/300] [ 800/1251] eta: 0:07:15 lr: 0.001425 loss: 3.650562 (3.365111) time: 0.931531 data: 0.000163 max mem: 18817 Epoch: [108/300] [ 850/1251] eta: 0:06:27 lr: 0.001424 loss: 3.629015 (3.374140) time: 0.973131 data: 0.000165 max mem: 18817 Epoch: [108/300] [ 900/1251] eta: 0:05:38 lr: 0.001424 loss: 3.177070 (3.370861) time: 1.048624 data: 0.000178 max mem: 18817 Epoch: [108/300] [ 950/1251] eta: 0:04:50 lr: 0.001424 loss: 3.575379 (3.370042) time: 0.974318 data: 0.000167 max mem: 18817 Epoch: [108/300] [1000/1251] eta: 0:04:01 lr: 0.001423 loss: 3.666259 (3.374337) time: 0.931867 data: 0.000164 max mem: 18817 Epoch: [108/300] [1050/1251] eta: 0:03:13 lr: 0.001423 loss: 3.252634 (3.369103) time: 0.929177 data: 0.000182 max mem: 18817 Epoch: [108/300] [1100/1251] eta: 0:02:25 lr: 0.001422 loss: 3.651948 (3.371330) time: 0.982518 data: 0.000173 max mem: 18817 Epoch: [108/300] [1150/1251] eta: 0:01:37 lr: 0.001422 loss: 3.433086 (3.367684) time: 1.015881 data: 0.000180 max mem: 18817 Epoch: [108/300] [1200/1251] eta: 0:00:49 lr: 0.001422 loss: 3.487258 (3.373521) time: 0.981791 data: 0.000164 max mem: 18817 Epoch: [108/300] [1250/1251] eta: 0:00:00 lr: 0.001421 loss: 3.523038 (3.376261) time: 0.927287 data: 0.000786 max mem: 18817 Epoch: [108/300] Total time: 0:20:03 (0.962287 s / it) Averaged stats: lr: 0.001421 loss: 3.523038 (3.372809) Test: [ 0/49] eta: 0:01:28 loss: 0.650937 (0.650937) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.809579 data: 1.399759 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.800123 (0.833479) acc1: 81.250000 (80.965909) acc5: 95.312500 (94.602273) time: 0.501915 data: 0.127415 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.919849 (0.876279) acc1: 79.687500 (79.538690) acc5: 95.312500 (94.717262) time: 0.377902 data: 0.000156 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.941834 (0.887253) acc1: 78.125000 (79.334677) acc5: 95.312500 (95.060484) time: 0.467535 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.941834 (0.907049) acc1: 78.125000 (78.925305) acc5: 95.312500 (94.740854) time: 0.454358 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.961991 (0.908667) acc1: 76.562500 (78.720000) acc5: 94.339622 (94.944000) time: 0.355176 data: 0.000103 max mem: 18817 Test: Total time: 0:00:21 (0.436999 s / it) * Acc@1 78.702 Acc@5 94.954 loss 0.910 Max accuracy: 79.12% Epoch: [109/300] [ 0/1251] eta: 0:42:04 lr: 0.001421 loss: 3.210594 (3.210594) time: 2.018113 data: 1.136808 max mem: 18817 Epoch: [109/300] [ 50/1251] eta: 0:19:33 lr: 0.001421 loss: 3.475315 (3.299626) time: 0.967785 data: 0.000183 max mem: 18817 Epoch: [109/300] [ 100/1251] eta: 0:18:36 lr: 0.001421 loss: 3.387830 (3.250161) time: 1.015465 data: 0.000187 max mem: 18817 Epoch: [109/300] [ 150/1251] eta: 0:17:40 lr: 0.001420 loss: 3.323722 (3.248271) time: 0.969757 data: 0.000179 max mem: 18817 Epoch: [109/300] [ 200/1251] eta: 0:16:49 lr: 0.001420 loss: 3.362309 (3.245456) time: 0.938783 data: 0.000176 max mem: 18817 Epoch: [109/300] [ 250/1251] eta: 0:16:03 lr: 0.001419 loss: 3.357964 (3.291798) time: 0.928073 data: 0.000180 max mem: 18817 Epoch: [109/300] [ 300/1251] eta: 0:15:17 lr: 0.001419 loss: 3.189444 (3.275130) time: 0.991580 data: 0.000171 max mem: 18817 Epoch: [109/300] [ 350/1251] eta: 0:14:30 lr: 0.001419 loss: 3.279525 (3.271578) time: 1.044543 data: 0.000185 max mem: 18817 Epoch: [109/300] [ 400/1251] eta: 0:13:41 lr: 0.001418 loss: 3.210754 (3.285824) time: 0.994920 data: 0.000201 max mem: 18817 Epoch: [109/300] [ 450/1251] eta: 0:12:51 lr: 0.001418 loss: 3.391013 (3.289072) time: 0.921983 data: 0.000191 max mem: 18817 Epoch: [109/300] [ 500/1251] eta: 0:12:03 lr: 0.001418 loss: 3.401925 (3.298766) time: 0.939565 data: 0.000177 max mem: 18817 Epoch: [109/300] [ 550/1251] eta: 0:11:15 lr: 0.001417 loss: 3.419262 (3.307920) time: 0.970027 data: 0.000176 max mem: 18817 Epoch: [109/300] [ 600/1251] eta: 0:10:27 lr: 0.001417 loss: 3.277150 (3.307114) time: 1.037325 data: 0.000186 max mem: 18817 Epoch: [109/300] [ 650/1251] eta: 0:09:39 lr: 0.001416 loss: 3.464699 (3.317075) time: 0.988855 data: 0.000174 max mem: 18817 Epoch: [109/300] [ 700/1251] eta: 0:08:50 lr: 0.001416 loss: 3.629672 (3.325820) time: 0.929019 data: 0.000164 max mem: 18817 Epoch: [109/300] [ 750/1251] eta: 0:08:02 lr: 0.001416 loss: 3.529417 (3.327803) time: 0.924571 data: 0.000164 max mem: 18817 Epoch: [109/300] [ 800/1251] eta: 0:07:14 lr: 0.001415 loss: 3.379495 (3.324532) time: 0.989265 data: 0.000175 max mem: 18817 Epoch: [109/300] [ 850/1251] eta: 0:06:26 lr: 0.001415 loss: 3.231150 (3.320656) time: 1.052639 data: 0.000192 max mem: 18817 Epoch: [109/300] [ 900/1251] eta: 0:05:38 lr: 0.001415 loss: 3.310630 (3.321832) time: 0.984647 data: 0.000172 max mem: 18817 Epoch: [109/300] [ 950/1251] eta: 0:04:49 lr: 0.001414 loss: 3.331864 (3.325985) time: 0.920378 data: 0.000169 max mem: 18817 Epoch: [109/300] [1000/1251] eta: 0:04:01 lr: 0.001414 loss: 3.473630 (3.323875) time: 0.940205 data: 0.000183 max mem: 18817 Epoch: [109/300] [1050/1251] eta: 0:03:13 lr: 0.001413 loss: 3.235866 (3.317445) time: 0.997649 data: 0.000175 max mem: 18817 Epoch: [109/300] [1100/1251] eta: 0:02:25 lr: 0.001413 loss: 3.509628 (3.319018) time: 1.053523 data: 0.000178 max mem: 18817 Epoch: [109/300] [1150/1251] eta: 0:01:37 lr: 0.001413 loss: 3.351290 (3.320446) time: 0.986371 data: 0.000185 max mem: 18817 Epoch: [109/300] [1200/1251] eta: 0:00:49 lr: 0.001412 loss: 3.340558 (3.318321) time: 0.907491 data: 0.000172 max mem: 18817 Epoch: [109/300] [1250/1251] eta: 0:00:00 lr: 0.001412 loss: 3.457402 (3.317935) time: 0.931596 data: 0.000754 max mem: 18817 Epoch: [109/300] Total time: 0:20:04 (0.963026 s / it) Averaged stats: lr: 0.001412 loss: 3.457402 (3.323254) Test: [ 0/49] eta: 0:01:18 loss: 0.732126 (0.732126) acc1: 81.250000 (81.250000) acc5: 93.750000 (93.750000) time: 1.610851 data: 1.151665 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.736386 (0.804913) acc1: 81.250000 (80.965909) acc5: 95.312500 (94.602273) time: 0.483064 data: 0.104843 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.807588 (0.832269) acc1: 79.687500 (79.389881) acc5: 95.312500 (94.791667) time: 0.366174 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.805685 (0.834802) acc1: 78.125000 (78.981855) acc5: 95.312500 (95.010081) time: 0.362007 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.803102 (0.846397) acc1: 78.125000 (79.230183) acc5: 95.312500 (95.007622) time: 0.363030 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.861036 (0.838428) acc1: 79.687500 (79.328000) acc5: 95.312500 (95.168000) time: 0.443579 data: 0.000104 max mem: 18817 Test: Total time: 0:00:20 (0.425867 s / it) * Acc@1 79.306 Acc@5 95.132 loss 0.846 Max accuracy: 79.31% Epoch: [110/300] [ 0/1251] eta: 0:42:52 lr: 0.001412 loss: 4.017311 (4.017311) time: 2.056284 data: 1.147270 max mem: 18817 Epoch: [110/300] [ 50/1251] eta: 0:19:53 lr: 0.001411 loss: 3.244192 (3.349884) time: 1.042860 data: 0.000160 max mem: 18817 Epoch: [110/300] [ 100/1251] eta: 0:18:30 lr: 0.001411 loss: 3.220735 (3.331293) time: 0.965405 data: 0.000167 max mem: 18817 Epoch: [110/300] [ 150/1251] eta: 0:17:36 lr: 0.001411 loss: 3.486222 (3.325307) time: 0.929265 data: 0.000160 max mem: 18817 Epoch: [110/300] [ 200/1251] eta: 0:16:52 lr: 0.001410 loss: 3.520290 (3.326044) time: 0.946863 data: 0.000169 max mem: 18817 Epoch: [110/300] [ 250/1251] eta: 0:16:04 lr: 0.001410 loss: 3.327432 (3.321438) time: 0.983724 data: 0.000172 max mem: 18817 Epoch: [110/300] [ 300/1251] eta: 0:15:17 lr: 0.001410 loss: 3.336714 (3.323507) time: 1.043281 data: 0.000166 max mem: 18817 Epoch: [110/300] [ 350/1251] eta: 0:14:25 lr: 0.001409 loss: 3.490089 (3.319791) time: 0.952036 data: 0.000155 max mem: 18817 Epoch: [110/300] [ 400/1251] eta: 0:13:34 lr: 0.001409 loss: 3.605812 (3.327459) time: 0.924734 data: 0.000185 max mem: 18817 Epoch: [110/300] [ 450/1251] eta: 0:12:46 lr: 0.001408 loss: 3.064327 (3.326013) time: 0.920905 data: 0.000173 max mem: 18817 Epoch: [110/300] [ 500/1251] eta: 0:11:59 lr: 0.001408 loss: 3.458116 (3.329347) time: 0.989119 data: 0.000196 max mem: 18817 Epoch: [110/300] [ 550/1251] eta: 0:11:11 lr: 0.001408 loss: 3.350857 (3.338111) time: 0.982400 data: 0.000176 max mem: 18817 Epoch: [110/300] [ 600/1251] eta: 0:10:23 lr: 0.001407 loss: 3.519246 (3.337409) time: 0.967267 data: 0.000172 max mem: 18817 Epoch: [110/300] [ 650/1251] eta: 0:09:35 lr: 0.001407 loss: 3.350609 (3.338247) time: 0.923955 data: 0.000158 max mem: 18817 Epoch: [110/300] [ 700/1251] eta: 0:08:47 lr: 0.001407 loss: 3.424242 (3.330698) time: 0.946174 data: 0.000184 max mem: 18817 Epoch: [110/300] [ 750/1251] eta: 0:08:00 lr: 0.001406 loss: 3.187959 (3.327939) time: 0.991552 data: 0.000168 max mem: 18817 Epoch: [110/300] [ 800/1251] eta: 0:07:12 lr: 0.001406 loss: 3.527214 (3.324658) time: 0.998377 data: 0.000196 max mem: 18817 Epoch: [110/300] [ 850/1251] eta: 0:06:24 lr: 0.001405 loss: 3.322029 (3.325397) time: 0.983833 data: 0.000193 max mem: 18817 Epoch: [110/300] [ 900/1251] eta: 0:05:36 lr: 0.001405 loss: 3.412223 (3.319960) time: 0.915734 data: 0.000169 max mem: 18817 Epoch: [110/300] [ 950/1251] eta: 0:04:49 lr: 0.001405 loss: 3.278198 (3.319494) time: 0.942299 data: 0.000179 max mem: 18817 Epoch: [110/300] [1000/1251] eta: 0:04:01 lr: 0.001404 loss: 2.877491 (3.317298) time: 0.994021 data: 0.000183 max mem: 18817 Epoch: [110/300] [1050/1251] eta: 0:03:13 lr: 0.001404 loss: 3.274936 (3.312260) time: 1.027798 data: 0.000163 max mem: 18817 Epoch: [110/300] [1100/1251] eta: 0:02:24 lr: 0.001403 loss: 3.169318 (3.310890) time: 0.962800 data: 0.000199 max mem: 18817 Epoch: [110/300] [1150/1251] eta: 0:01:36 lr: 0.001403 loss: 3.298747 (3.312877) time: 0.915032 data: 0.000180 max mem: 18817 Epoch: [110/300] [1200/1251] eta: 0:00:48 lr: 0.001403 loss: 3.337681 (3.309238) time: 0.936383 data: 0.000188 max mem: 18817 Epoch: [110/300] [1250/1251] eta: 0:00:00 lr: 0.001402 loss: 3.481897 (3.311608) time: 1.001551 data: 0.000758 max mem: 18817 Epoch: [110/300] Total time: 0:20:01 (0.960646 s / it) Averaged stats: lr: 0.001402 loss: 3.481897 (3.310210) Test: [ 0/49] eta: 0:01:29 loss: 0.794342 (0.794342) acc1: 76.562500 (76.562500) acc5: 95.312500 (95.312500) time: 1.819932 data: 1.434065 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.805003 (0.855946) acc1: 78.125000 (80.113636) acc5: 95.312500 (94.460227) time: 0.500274 data: 0.130501 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.902771 (0.886390) acc1: 78.125000 (79.613095) acc5: 95.312500 (94.717262) time: 0.372705 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.886835 (0.871205) acc1: 78.125000 (79.485887) acc5: 95.312500 (95.312500) time: 0.376998 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.886835 (0.886735) acc1: 79.687500 (79.420732) acc5: 95.312500 (95.236280) time: 0.367953 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.931828 (0.888914) acc1: 79.687500 (79.296000) acc5: 95.312500 (95.232000) time: 0.358504 data: 0.000101 max mem: 18817 Test: Total time: 0:00:19 (0.399870 s / it) * Acc@1 79.180 Acc@5 95.040 loss 0.895 Max accuracy: 79.31% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0110.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0110.pth Epoch: [111/300] [ 0/1251] eta: 0:44:43 lr: 0.001402 loss: 3.029564 (3.029564) time: 2.144861 data: 1.260491 max mem: 18817 Epoch: [111/300] [ 50/1251] eta: 0:19:48 lr: 0.001402 loss: 3.176559 (3.271786) time: 0.975704 data: 0.000205 max mem: 18817 Epoch: [111/300] [ 100/1251] eta: 0:18:31 lr: 0.001402 loss: 3.126853 (3.308364) time: 0.966153 data: 0.000204 max mem: 18817 Epoch: [111/300] [ 150/1251] eta: 0:17:35 lr: 0.001401 loss: 3.169098 (3.270493) time: 0.909247 data: 0.000224 max mem: 18817 Epoch: [111/300] [ 200/1251] eta: 0:16:50 lr: 0.001401 loss: 3.330571 (3.297942) time: 0.929650 data: 0.000206 max mem: 18817 Epoch: [111/300] [ 250/1251] eta: 0:16:02 lr: 0.001400 loss: 3.582886 (3.304219) time: 0.929690 data: 0.000227 max mem: 18817 Epoch: [111/300] [ 300/1251] eta: 0:15:17 lr: 0.001400 loss: 3.334142 (3.306786) time: 0.997134 data: 0.000205 max mem: 18817 Epoch: [111/300] [ 350/1251] eta: 0:14:26 lr: 0.001400 loss: 3.120957 (3.305133) time: 0.967704 data: 0.000215 max mem: 18817 Epoch: [111/300] [ 400/1251] eta: 0:13:38 lr: 0.001399 loss: 3.429853 (3.305782) time: 0.931687 data: 0.000154 max mem: 18817 Epoch: [111/300] [ 450/1251] eta: 0:12:51 lr: 0.001399 loss: 3.368399 (3.313671) time: 0.940881 data: 0.000181 max mem: 18817 Epoch: [111/300] [ 500/1251] eta: 0:12:03 lr: 0.001399 loss: 3.258970 (3.317856) time: 0.927794 data: 0.000178 max mem: 18817 Epoch: [111/300] [ 550/1251] eta: 0:11:15 lr: 0.001398 loss: 3.434508 (3.316889) time: 0.985370 data: 0.000160 max mem: 18817 Epoch: [111/300] [ 600/1251] eta: 0:10:26 lr: 0.001398 loss: 3.671045 (3.324819) time: 0.969349 data: 0.000174 max mem: 18817 Epoch: [111/300] [ 650/1251] eta: 0:09:36 lr: 0.001397 loss: 3.441102 (3.327253) time: 0.903997 data: 0.000167 max mem: 18817 Epoch: [111/300] [ 700/1251] eta: 0:08:49 lr: 0.001397 loss: 3.436569 (3.334512) time: 0.940000 data: 0.000169 max mem: 18817 Epoch: [111/300] [ 750/1251] eta: 0:08:01 lr: 0.001397 loss: 3.563730 (3.330829) time: 0.928392 data: 0.000400 max mem: 18817 Epoch: [111/300] [ 800/1251] eta: 0:07:13 lr: 0.001396 loss: 2.995847 (3.329486) time: 0.964666 data: 0.000175 max mem: 18817 Epoch: [111/300] [ 850/1251] eta: 0:06:24 lr: 0.001396 loss: 3.299012 (3.333107) time: 0.970936 data: 0.000173 max mem: 18817 Epoch: [111/300] [ 900/1251] eta: 0:05:36 lr: 0.001395 loss: 3.312699 (3.331586) time: 0.909405 data: 0.000172 max mem: 18817 Epoch: [111/300] [ 950/1251] eta: 0:04:48 lr: 0.001395 loss: 3.401574 (3.331765) time: 0.926304 data: 0.000165 max mem: 18817 Epoch: [111/300] [1000/1251] eta: 0:04:00 lr: 0.001395 loss: 3.457542 (3.336738) time: 0.941933 data: 0.000164 max mem: 18817 Epoch: [111/300] [1050/1251] eta: 0:03:13 lr: 0.001394 loss: 3.585299 (3.343378) time: 0.992719 data: 0.000168 max mem: 18817 Epoch: [111/300] [1100/1251] eta: 0:02:24 lr: 0.001394 loss: 3.020546 (3.337874) time: 0.962485 data: 0.000179 max mem: 18817 Epoch: [111/300] [1150/1251] eta: 0:01:36 lr: 0.001394 loss: 3.387655 (3.335726) time: 0.914097 data: 0.000180 max mem: 18817 Epoch: [111/300] [1200/1251] eta: 0:00:48 lr: 0.001393 loss: 3.706681 (3.342237) time: 0.930660 data: 0.000169 max mem: 18817 Epoch: [111/300] [1250/1251] eta: 0:00:00 lr: 0.001393 loss: 2.989696 (3.340247) time: 0.916935 data: 0.000727 max mem: 18817 Epoch: [111/300] Total time: 0:20:02 (0.961148 s / it) Averaged stats: lr: 0.001393 loss: 2.989696 (3.333105) Test: [ 0/49] eta: 0:01:27 loss: 0.626485 (0.626485) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.780521 data: 1.380855 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.727877 (0.803653) acc1: 82.812500 (81.818182) acc5: 95.312500 (95.170455) time: 0.498318 data: 0.125666 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.851970 (0.851454) acc1: 79.687500 (80.133929) acc5: 95.312500 (95.386905) time: 0.366092 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.851970 (0.849313) acc1: 78.125000 (80.292339) acc5: 95.312500 (95.614919) time: 0.362382 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.836098 (0.861443) acc1: 79.687500 (80.144817) acc5: 95.312500 (95.541159) time: 0.360490 data: 0.000131 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.887313 (0.860146) acc1: 79.687500 (79.776000) acc5: 95.312500 (95.488000) time: 0.355094 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.393006 s / it) * Acc@1 79.390 Acc@5 95.116 loss 0.871 Max accuracy: 79.39% Epoch: [112/300] [ 0/1251] eta: 0:41:10 lr: 0.001393 loss: 3.513001 (3.513001) time: 1.974750 data: 1.071775 max mem: 18817 Epoch: [112/300] [ 50/1251] eta: 0:19:18 lr: 0.001392 loss: 3.251509 (3.315641) time: 0.969526 data: 0.000173 max mem: 18817 Epoch: [112/300] [ 100/1251] eta: 0:18:26 lr: 0.001392 loss: 3.514438 (3.320263) time: 0.930799 data: 0.000170 max mem: 18817 Epoch: [112/300] [ 150/1251] eta: 0:17:43 lr: 0.001392 loss: 3.538539 (3.315225) time: 0.949975 data: 0.000183 max mem: 18817 Epoch: [112/300] [ 200/1251] eta: 0:16:56 lr: 0.001391 loss: 3.222584 (3.314121) time: 0.944663 data: 0.000168 max mem: 18817 Epoch: [112/300] [ 250/1251] eta: 0:16:08 lr: 0.001391 loss: 3.086426 (3.295714) time: 0.991338 data: 0.000164 max mem: 18817 Epoch: [112/300] [ 300/1251] eta: 0:15:15 lr: 0.001391 loss: 3.516659 (3.300855) time: 0.981039 data: 0.000161 max mem: 18817 Epoch: [112/300] [ 350/1251] eta: 0:14:26 lr: 0.001390 loss: 3.464936 (3.302376) time: 0.918830 data: 0.000174 max mem: 18817 Epoch: [112/300] [ 400/1251] eta: 0:13:39 lr: 0.001390 loss: 3.599205 (3.318150) time: 0.930557 data: 0.000170 max mem: 18817 Epoch: [112/300] [ 450/1251] eta: 0:12:53 lr: 0.001389 loss: 3.529318 (3.323621) time: 0.931156 data: 0.000188 max mem: 18817 Epoch: [112/300] [ 500/1251] eta: 0:12:05 lr: 0.001389 loss: 3.475230 (3.322160) time: 0.988084 data: 0.000173 max mem: 18817 Epoch: [112/300] [ 550/1251] eta: 0:11:15 lr: 0.001389 loss: 3.421339 (3.332402) time: 0.962380 data: 0.000175 max mem: 18817 Epoch: [112/300] [ 600/1251] eta: 0:10:27 lr: 0.001388 loss: 3.290356 (3.322787) time: 0.954194 data: 0.000173 max mem: 18817 Epoch: [112/300] [ 650/1251] eta: 0:09:38 lr: 0.001388 loss: 3.356711 (3.319097) time: 0.925160 data: 0.000162 max mem: 18817 Epoch: [112/300] [ 700/1251] eta: 0:08:51 lr: 0.001387 loss: 3.527151 (3.320908) time: 0.949139 data: 0.000179 max mem: 18817 Epoch: [112/300] [ 750/1251] eta: 0:08:03 lr: 0.001387 loss: 3.420012 (3.321670) time: 1.005360 data: 0.000165 max mem: 18817 Epoch: [112/300] [ 800/1251] eta: 0:07:14 lr: 0.001387 loss: 3.124601 (3.321993) time: 0.958227 data: 0.000180 max mem: 18817 Epoch: [112/300] [ 850/1251] eta: 0:06:26 lr: 0.001386 loss: 3.370322 (3.322442) time: 0.974074 data: 0.000159 max mem: 18817 Epoch: [112/300] [ 900/1251] eta: 0:05:38 lr: 0.001386 loss: 3.171269 (3.317772) time: 0.938388 data: 0.000177 max mem: 18817 Epoch: [112/300] [ 950/1251] eta: 0:04:49 lr: 0.001386 loss: 3.077435 (3.315521) time: 0.933039 data: 0.000168 max mem: 18817 Epoch: [112/300] [1000/1251] eta: 0:04:01 lr: 0.001385 loss: 3.661509 (3.323865) time: 1.002025 data: 0.000166 max mem: 18817 Epoch: [112/300] [1050/1251] eta: 0:03:13 lr: 0.001385 loss: 3.119430 (3.319849) time: 0.992995 data: 0.000183 max mem: 18817 Epoch: [112/300] [1100/1251] eta: 0:02:25 lr: 0.001384 loss: 3.049925 (3.318580) time: 0.960890 data: 0.000172 max mem: 18817 Epoch: [112/300] [1150/1251] eta: 0:01:37 lr: 0.001384 loss: 3.185756 (3.313344) time: 0.936428 data: 0.000199 max mem: 18817 Epoch: [112/300] [1200/1251] eta: 0:00:49 lr: 0.001384 loss: 3.406565 (3.313742) time: 0.939099 data: 0.000187 max mem: 18817 Epoch: [112/300] [1250/1251] eta: 0:00:00 lr: 0.001383 loss: 3.269345 (3.310763) time: 0.976295 data: 0.000759 max mem: 18817 Epoch: [112/300] Total time: 0:20:05 (0.963421 s / it) Averaged stats: lr: 0.001383 loss: 3.269345 (3.306007) Test: [ 0/49] eta: 0:01:15 loss: 0.672811 (0.672811) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.539237 data: 1.134326 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.821989 (0.841713) acc1: 82.812500 (80.823864) acc5: 95.312500 (94.602273) time: 0.492429 data: 0.103279 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.892897 (0.876186) acc1: 76.562500 (79.092262) acc5: 93.750000 (94.717262) time: 0.374701 data: 0.000157 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.907537 (0.877539) acc1: 75.000000 (78.528226) acc5: 95.312500 (95.060484) time: 0.361813 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.915109 (0.883864) acc1: 78.125000 (78.849085) acc5: 95.312500 (94.893293) time: 0.364323 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.911300 (0.874837) acc1: 79.687500 (79.104000) acc5: 95.312500 (95.136000) time: 0.359568 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.392304 s / it) * Acc@1 79.224 Acc@5 94.962 loss 0.876 Max accuracy: 79.39% Epoch: [113/300] [ 0/1251] eta: 0:40:24 lr: 0.001383 loss: 2.461769 (2.461769) time: 1.937749 data: 1.032641 max mem: 18817 Epoch: [113/300] [ 50/1251] eta: 0:19:30 lr: 0.001383 loss: 3.134141 (3.186386) time: 0.909717 data: 0.000154 max mem: 18817 Epoch: [113/300] [ 100/1251] eta: 0:18:39 lr: 0.001382 loss: 3.304558 (3.263022) time: 0.932230 data: 0.000157 max mem: 18817 Epoch: [113/300] [ 150/1251] eta: 0:17:44 lr: 0.001382 loss: 3.462868 (3.303844) time: 0.977823 data: 0.000157 max mem: 18817 Epoch: [113/300] [ 200/1251] eta: 0:16:58 lr: 0.001382 loss: 3.308216 (3.296923) time: 1.011938 data: 0.000170 max mem: 18817 Epoch: [113/300] [ 250/1251] eta: 0:16:04 lr: 0.001381 loss: 3.312575 (3.300894) time: 0.969555 data: 0.000159 max mem: 18817 Epoch: [113/300] [ 300/1251] eta: 0:15:13 lr: 0.001381 loss: 3.437828 (3.303255) time: 0.924184 data: 0.000161 max mem: 18817 Epoch: [113/300] [ 350/1251] eta: 0:14:26 lr: 0.001381 loss: 3.270907 (3.313269) time: 0.918162 data: 0.000164 max mem: 18817 Epoch: [113/300] [ 400/1251] eta: 0:13:38 lr: 0.001380 loss: 3.639933 (3.332336) time: 0.979349 data: 0.000168 max mem: 18817 Epoch: [113/300] [ 450/1251] eta: 0:12:51 lr: 0.001380 loss: 3.307854 (3.325500) time: 1.026833 data: 0.000163 max mem: 18817 Epoch: [113/300] [ 500/1251] eta: 0:12:02 lr: 0.001379 loss: 3.411708 (3.326912) time: 0.976752 data: 0.000167 max mem: 18817 Epoch: [113/300] [ 550/1251] eta: 0:11:13 lr: 0.001379 loss: 3.293103 (3.325110) time: 0.925091 data: 0.000166 max mem: 18817 Epoch: [113/300] [ 600/1251] eta: 0:10:26 lr: 0.001379 loss: 3.341957 (3.317545) time: 0.921132 data: 0.000163 max mem: 18817 Epoch: [113/300] [ 650/1251] eta: 0:09:38 lr: 0.001378 loss: 3.352607 (3.311820) time: 0.967020 data: 0.000178 max mem: 18817 Epoch: [113/300] [ 700/1251] eta: 0:08:50 lr: 0.001378 loss: 3.278244 (3.308543) time: 1.004555 data: 0.000157 max mem: 18817 Epoch: [113/300] [ 750/1251] eta: 0:08:01 lr: 0.001377 loss: 3.407502 (3.308336) time: 0.969251 data: 0.000173 max mem: 18817 Epoch: [113/300] [ 800/1251] eta: 0:07:13 lr: 0.001377 loss: 3.382193 (3.313541) time: 0.920951 data: 0.000177 max mem: 18817 Epoch: [113/300] [ 850/1251] eta: 0:06:25 lr: 0.001377 loss: 2.956423 (3.306080) time: 0.927430 data: 0.000161 max mem: 18817 Epoch: [113/300] [ 900/1251] eta: 0:05:37 lr: 0.001376 loss: 3.285333 (3.305880) time: 0.965507 data: 0.000159 max mem: 18817 Epoch: [113/300] [ 950/1251] eta: 0:04:49 lr: 0.001376 loss: 3.431569 (3.308374) time: 0.971073 data: 0.000165 max mem: 18817 Epoch: [113/300] [1000/1251] eta: 0:04:01 lr: 0.001376 loss: 3.547287 (3.305156) time: 0.971127 data: 0.000155 max mem: 18817 Epoch: [113/300] [1050/1251] eta: 0:03:12 lr: 0.001375 loss: 3.090436 (3.301463) time: 0.927070 data: 0.000165 max mem: 18817 Epoch: [113/300] [1100/1251] eta: 0:02:25 lr: 0.001375 loss: 3.450969 (3.303954) time: 0.933621 data: 0.000165 max mem: 18817 Epoch: [113/300] [1150/1251] eta: 0:01:37 lr: 0.001374 loss: 3.435199 (3.302151) time: 0.967643 data: 0.000171 max mem: 18817 Epoch: [113/300] [1200/1251] eta: 0:00:49 lr: 0.001374 loss: 3.520721 (3.306266) time: 1.048519 data: 0.000196 max mem: 18817 Epoch: [113/300] [1250/1251] eta: 0:00:00 lr: 0.001374 loss: 3.345058 (3.304386) time: 0.952323 data: 0.000744 max mem: 18817 Epoch: [113/300] Total time: 0:20:02 (0.960908 s / it) Averaged stats: lr: 0.001374 loss: 3.345058 (3.308659) Test: [ 0/49] eta: 0:01:19 loss: 0.645393 (0.645393) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.620112 data: 1.170807 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.769707 (0.806318) acc1: 81.250000 (81.107955) acc5: 96.875000 (95.454545) time: 0.484909 data: 0.106572 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.826898 (0.836982) acc1: 78.125000 (79.910714) acc5: 96.875000 (95.312500) time: 0.366688 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.853226 (0.839736) acc1: 79.687500 (79.737903) acc5: 96.875000 (95.413306) time: 0.378521 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.876339 (0.856685) acc1: 78.125000 (79.344512) acc5: 95.312500 (95.198171) time: 0.398683 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.925570 (0.857098) acc1: 78.125000 (79.328000) acc5: 95.312500 (95.232000) time: 0.376974 data: 0.000105 max mem: 18817 Test: Total time: 0:00:19 (0.404914 s / it) * Acc@1 79.174 Acc@5 95.150 loss 0.865 Max accuracy: 79.39% Epoch: [114/300] [ 0/1251] eta: 0:41:04 lr: 0.001374 loss: 3.980233 (3.980233) time: 1.969783 data: 1.071804 max mem: 18817 Epoch: [114/300] [ 50/1251] eta: 0:19:47 lr: 0.001373 loss: 3.060013 (3.264802) time: 0.935438 data: 0.000173 max mem: 18817 Epoch: [114/300] [ 100/1251] eta: 0:18:47 lr: 0.001373 loss: 3.432223 (3.274840) time: 0.988271 data: 0.000172 max mem: 18817 Epoch: [114/300] [ 150/1251] eta: 0:17:45 lr: 0.001372 loss: 3.420682 (3.317673) time: 0.972917 data: 0.000165 max mem: 18817 Epoch: [114/300] [ 200/1251] eta: 0:16:59 lr: 0.001372 loss: 3.399987 (3.317273) time: 0.977534 data: 0.000168 max mem: 18817 Epoch: [114/300] [ 250/1251] eta: 0:16:07 lr: 0.001372 loss: 3.581979 (3.307038) time: 0.931939 data: 0.000193 max mem: 18817 Epoch: [114/300] [ 300/1251] eta: 0:15:19 lr: 0.001371 loss: 3.345526 (3.308397) time: 0.921141 data: 0.000163 max mem: 18817 Epoch: [114/300] [ 350/1251] eta: 0:14:32 lr: 0.001371 loss: 3.394975 (3.305877) time: 0.986851 data: 0.000172 max mem: 18817 Epoch: [114/300] [ 400/1251] eta: 0:13:43 lr: 0.001370 loss: 3.409621 (3.311827) time: 1.007886 data: 0.000228 max mem: 18817 Epoch: [114/300] [ 450/1251] eta: 0:12:53 lr: 0.001370 loss: 3.445218 (3.306349) time: 0.983843 data: 0.000202 max mem: 18817 Epoch: [114/300] [ 500/1251] eta: 0:12:03 lr: 0.001370 loss: 2.933244 (3.302495) time: 0.924838 data: 0.000208 max mem: 18817 Epoch: [114/300] [ 550/1251] eta: 0:11:15 lr: 0.001369 loss: 3.017117 (3.291346) time: 0.915566 data: 0.000208 max mem: 18817 Epoch: [114/300] [ 600/1251] eta: 0:10:27 lr: 0.001369 loss: 3.259873 (3.284744) time: 0.970284 data: 0.000168 max mem: 18817 Epoch: [114/300] [ 650/1251] eta: 0:09:38 lr: 0.001369 loss: 3.590024 (3.302172) time: 1.002935 data: 0.000165 max mem: 18817 Epoch: [114/300] [ 700/1251] eta: 0:08:50 lr: 0.001368 loss: 3.306254 (3.305300) time: 0.972366 data: 0.000172 max mem: 18817 Epoch: [114/300] [ 750/1251] eta: 0:08:01 lr: 0.001368 loss: 3.362979 (3.305572) time: 0.918450 data: 0.000171 max mem: 18817 Epoch: [114/300] [ 800/1251] eta: 0:07:13 lr: 0.001367 loss: 3.541384 (3.305873) time: 0.924792 data: 0.000156 max mem: 18817 Epoch: [114/300] [ 850/1251] eta: 0:06:25 lr: 0.001367 loss: 3.538390 (3.308257) time: 0.967113 data: 0.000177 max mem: 18817 Epoch: [114/300] [ 900/1251] eta: 0:05:36 lr: 0.001367 loss: 3.390905 (3.314833) time: 0.961694 data: 0.000166 max mem: 18817 Epoch: [114/300] [ 950/1251] eta: 0:04:48 lr: 0.001366 loss: 3.263223 (3.315510) time: 0.912597 data: 0.000175 max mem: 18817 Epoch: [114/300] [1000/1251] eta: 0:04:00 lr: 0.001366 loss: 3.219432 (3.310164) time: 0.916102 data: 0.000163 max mem: 18817 Epoch: [114/300] [1050/1251] eta: 0:03:13 lr: 0.001365 loss: 3.185842 (3.303955) time: 0.944339 data: 0.000205 max mem: 18817 Epoch: [114/300] [1100/1251] eta: 0:02:25 lr: 0.001365 loss: 3.416622 (3.303512) time: 0.983282 data: 0.000196 max mem: 18817 Epoch: [114/300] [1150/1251] eta: 0:01:36 lr: 0.001365 loss: 3.562817 (3.305883) time: 0.979809 data: 0.000217 max mem: 18817 Epoch: [114/300] [1200/1251] eta: 0:00:48 lr: 0.001364 loss: 3.441258 (3.305461) time: 0.953144 data: 0.000214 max mem: 18817 Epoch: [114/300] [1250/1251] eta: 0:00:00 lr: 0.001364 loss: 3.211734 (3.299385) time: 0.929533 data: 0.000796 max mem: 18817 Epoch: [114/300] Total time: 0:20:01 (0.960304 s / it) Averaged stats: lr: 0.001364 loss: 3.211734 (3.297825) Test: [ 0/49] eta: 0:01:15 loss: 0.670720 (0.670720) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.532477 data: 1.108566 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.729635 (0.787567) acc1: 81.250000 (81.392045) acc5: 95.312500 (95.028409) time: 0.480804 data: 0.100927 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.819346 (0.822731) acc1: 79.687500 (80.133929) acc5: 95.312500 (95.089286) time: 0.368499 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.848765 (0.824222) acc1: 78.125000 (79.687500) acc5: 95.312500 (95.110887) time: 0.448688 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.850389 (0.843955) acc1: 79.687500 (79.496951) acc5: 95.312500 (95.045732) time: 0.447359 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.912794 (0.848138) acc1: 78.125000 (79.424000) acc5: 93.750000 (95.008000) time: 0.355318 data: 0.000100 max mem: 18817 Test: Total time: 0:00:20 (0.424470 s / it) * Acc@1 79.338 Acc@5 95.128 loss 0.839 Max accuracy: 79.39% Epoch: [115/300] [ 0/1251] eta: 0:42:15 lr: 0.001364 loss: 4.138960 (4.138960) time: 2.026381 data: 1.124831 max mem: 18817 Epoch: [115/300] [ 50/1251] eta: 0:19:56 lr: 0.001364 loss: 3.242540 (3.296273) time: 1.040191 data: 0.000192 max mem: 18817 Epoch: [115/300] [ 100/1251] eta: 0:18:34 lr: 0.001363 loss: 3.093884 (3.266019) time: 0.964805 data: 0.000164 max mem: 18817 Epoch: [115/300] [ 150/1251] eta: 0:17:41 lr: 0.001363 loss: 3.171557 (3.263014) time: 0.915038 data: 0.000173 max mem: 18817 Epoch: [115/300] [ 200/1251] eta: 0:16:58 lr: 0.001362 loss: 3.059327 (3.268966) time: 0.933839 data: 0.000161 max mem: 18817 Epoch: [115/300] [ 250/1251] eta: 0:16:10 lr: 0.001362 loss: 3.327942 (3.319894) time: 0.995007 data: 0.000179 max mem: 18817 Epoch: [115/300] [ 300/1251] eta: 0:15:20 lr: 0.001362 loss: 3.166538 (3.306966) time: 0.982083 data: 0.000165 max mem: 18817 Epoch: [115/300] [ 350/1251] eta: 0:14:29 lr: 0.001361 loss: 3.360651 (3.307204) time: 0.961565 data: 0.000168 max mem: 18817 Epoch: [115/300] [ 400/1251] eta: 0:13:37 lr: 0.001361 loss: 3.229046 (3.293995) time: 0.910733 data: 0.000163 max mem: 18817 Epoch: [115/300] [ 450/1251] eta: 0:12:50 lr: 0.001360 loss: 3.490346 (3.292225) time: 0.924768 data: 0.000184 max mem: 18817 Epoch: [115/300] [ 500/1251] eta: 0:12:02 lr: 0.001360 loss: 3.174630 (3.286424) time: 0.981153 data: 0.000173 max mem: 18817 Epoch: [115/300] [ 550/1251] eta: 0:11:14 lr: 0.001360 loss: 3.450737 (3.294975) time: 1.025499 data: 0.000166 max mem: 18817 Epoch: [115/300] [ 600/1251] eta: 0:10:25 lr: 0.001359 loss: 3.409289 (3.288053) time: 0.988286 data: 0.000161 max mem: 18817 Epoch: [115/300] [ 650/1251] eta: 0:09:37 lr: 0.001359 loss: 2.971070 (3.278906) time: 0.914080 data: 0.000164 max mem: 18817 Epoch: [115/300] [ 700/1251] eta: 0:08:49 lr: 0.001358 loss: 3.601723 (3.287998) time: 0.929190 data: 0.000153 max mem: 18817 Epoch: [115/300] [ 750/1251] eta: 0:08:01 lr: 0.001358 loss: 3.457189 (3.292168) time: 0.979583 data: 0.000168 max mem: 18817 Epoch: [115/300] [ 800/1251] eta: 0:07:13 lr: 0.001358 loss: 3.288352 (3.292991) time: 1.039472 data: 0.000172 max mem: 18817 Epoch: [115/300] [ 850/1251] eta: 0:06:25 lr: 0.001357 loss: 3.089970 (3.286068) time: 0.972064 data: 0.000177 max mem: 18817 Epoch: [115/300] [ 900/1251] eta: 0:05:37 lr: 0.001357 loss: 3.424786 (3.281572) time: 0.928837 data: 0.000167 max mem: 18817 Epoch: [115/300] [ 950/1251] eta: 0:04:49 lr: 0.001357 loss: 3.427886 (3.283041) time: 0.940038 data: 0.000185 max mem: 18817 Epoch: [115/300] [1000/1251] eta: 0:04:01 lr: 0.001356 loss: 3.270802 (3.282469) time: 0.972507 data: 0.000160 max mem: 18817 Epoch: [115/300] [1050/1251] eta: 0:03:13 lr: 0.001356 loss: 3.322470 (3.282949) time: 1.032387 data: 0.000192 max mem: 18817 Epoch: [115/300] [1100/1251] eta: 0:02:25 lr: 0.001355 loss: 3.159701 (3.280696) time: 0.966108 data: 0.000170 max mem: 18817 Epoch: [115/300] [1150/1251] eta: 0:01:37 lr: 0.001355 loss: 3.322023 (3.282236) time: 0.908555 data: 0.000164 max mem: 18817 Epoch: [115/300] [1200/1251] eta: 0:00:49 lr: 0.001355 loss: 3.006495 (3.277164) time: 0.936512 data: 0.000165 max mem: 18817 Epoch: [115/300] [1250/1251] eta: 0:00:00 lr: 0.001354 loss: 3.030705 (3.277235) time: 0.984410 data: 0.000745 max mem: 18817 Epoch: [115/300] Total time: 0:20:05 (0.963464 s / it) Averaged stats: lr: 0.001354 loss: 3.030705 (3.282475) Test: [ 0/49] eta: 0:01:17 loss: 0.649273 (0.649273) acc1: 84.375000 (84.375000) acc5: 93.750000 (93.750000) time: 1.578343 data: 1.157865 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.701182 (0.767826) acc1: 84.375000 (82.102273) acc5: 95.312500 (95.028409) time: 0.479589 data: 0.105405 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.790992 (0.815573) acc1: 79.687500 (80.505952) acc5: 95.312500 (95.238095) time: 0.372921 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.822573 (0.822633) acc1: 79.687500 (79.889113) acc5: 95.312500 (95.362903) time: 0.370120 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.836804 (0.838572) acc1: 79.687500 (79.763720) acc5: 95.312500 (95.388720) time: 0.361043 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.859564 (0.838499) acc1: 79.687500 (79.680000) acc5: 95.312500 (95.424000) time: 0.354983 data: 0.000103 max mem: 18817 Test: Total time: 0:00:19 (0.391049 s / it) * Acc@1 79.438 Acc@5 95.132 loss 0.854 Max accuracy: 79.44% Epoch: [116/300] [ 0/1251] eta: 0:43:46 lr: 0.001354 loss: 3.337331 (3.337331) time: 2.099811 data: 1.073474 max mem: 18817 Epoch: [116/300] [ 50/1251] eta: 0:19:12 lr: 0.001354 loss: 3.220343 (3.336316) time: 0.955202 data: 0.000169 max mem: 18817 Epoch: [116/300] [ 100/1251] eta: 0:18:21 lr: 0.001353 loss: 3.295243 (3.338133) time: 0.926728 data: 0.000173 max mem: 18817 Epoch: [116/300] [ 150/1251] eta: 0:17:42 lr: 0.001353 loss: 3.293036 (3.285670) time: 0.953860 data: 0.000180 max mem: 18817 Epoch: [116/300] [ 200/1251] eta: 0:16:52 lr: 0.001353 loss: 3.260070 (3.287718) time: 0.995668 data: 0.000176 max mem: 18817 Epoch: [116/300] [ 250/1251] eta: 0:16:07 lr: 0.001352 loss: 3.071012 (3.276762) time: 1.042442 data: 0.000164 max mem: 18817 Epoch: [116/300] [ 300/1251] eta: 0:15:15 lr: 0.001352 loss: 3.294166 (3.266329) time: 0.963295 data: 0.000186 max mem: 18817 Epoch: [116/300] [ 350/1251] eta: 0:14:25 lr: 0.001351 loss: 3.210845 (3.287486) time: 0.923372 data: 0.000185 max mem: 18817 Epoch: [116/300] [ 400/1251] eta: 0:13:38 lr: 0.001351 loss: 2.774112 (3.287477) time: 0.932642 data: 0.000186 max mem: 18817 Epoch: [116/300] [ 450/1251] eta: 0:12:50 lr: 0.001351 loss: 3.335131 (3.289342) time: 0.985818 data: 0.000192 max mem: 18817 Epoch: [116/300] [ 500/1251] eta: 0:12:03 lr: 0.001350 loss: 3.310681 (3.296772) time: 1.037492 data: 0.000170 max mem: 18817 Epoch: [116/300] [ 550/1251] eta: 0:11:14 lr: 0.001350 loss: 3.331065 (3.285077) time: 0.980707 data: 0.000201 max mem: 18817 Epoch: [116/300] [ 600/1251] eta: 0:10:25 lr: 0.001350 loss: 3.575064 (3.299813) time: 0.916503 data: 0.000161 max mem: 18817 Epoch: [116/300] [ 650/1251] eta: 0:09:38 lr: 0.001349 loss: 3.341395 (3.296618) time: 0.923124 data: 0.000174 max mem: 18817 Epoch: [116/300] [ 700/1251] eta: 0:08:50 lr: 0.001349 loss: 3.301716 (3.298226) time: 0.971019 data: 0.000172 max mem: 18817 Epoch: [116/300] [ 750/1251] eta: 0:08:02 lr: 0.001348 loss: 3.450382 (3.306815) time: 1.039352 data: 0.000178 max mem: 18817 Epoch: [116/300] [ 800/1251] eta: 0:07:13 lr: 0.001348 loss: 3.404358 (3.303610) time: 0.946660 data: 0.000168 max mem: 18817 Epoch: [116/300] [ 850/1251] eta: 0:06:25 lr: 0.001348 loss: 3.307541 (3.300010) time: 0.915272 data: 0.000169 max mem: 18817 Epoch: [116/300] [ 900/1251] eta: 0:05:37 lr: 0.001347 loss: 3.601299 (3.306320) time: 0.919954 data: 0.000180 max mem: 18817 Epoch: [116/300] [ 950/1251] eta: 0:04:49 lr: 0.001347 loss: 3.352156 (3.307732) time: 0.964786 data: 0.000163 max mem: 18817 Epoch: [116/300] [1000/1251] eta: 0:04:01 lr: 0.001346 loss: 3.371580 (3.307122) time: 1.039594 data: 0.000172 max mem: 18817 Epoch: [116/300] [1050/1251] eta: 0:03:13 lr: 0.001346 loss: 3.342726 (3.304834) time: 0.976639 data: 0.000182 max mem: 18817 Epoch: [116/300] [1100/1251] eta: 0:02:25 lr: 0.001346 loss: 3.467510 (3.309630) time: 0.917906 data: 0.000165 max mem: 18817 Epoch: [116/300] [1150/1251] eta: 0:01:37 lr: 0.001345 loss: 3.109803 (3.306541) time: 0.940986 data: 0.000188 max mem: 18817 Epoch: [116/300] [1200/1251] eta: 0:00:49 lr: 0.001345 loss: 3.540188 (3.302763) time: 0.973536 data: 0.000170 max mem: 18817 Epoch: [116/300] [1250/1251] eta: 0:00:00 lr: 0.001344 loss: 3.162627 (3.300876) time: 1.018641 data: 0.000739 max mem: 18817 Epoch: [116/300] Total time: 0:20:03 (0.961947 s / it) Averaged stats: lr: 0.001344 loss: 3.162627 (3.305594) Test: [ 0/49] eta: 0:01:15 loss: 0.686710 (0.686710) acc1: 82.812500 (82.812500) acc5: 95.312500 (95.312500) time: 1.536376 data: 1.109895 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.759389 (0.794679) acc1: 79.687500 (80.397727) acc5: 95.312500 (94.744318) time: 0.486345 data: 0.101045 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.849915 (0.829881) acc1: 79.687500 (80.133929) acc5: 95.312500 (94.717262) time: 0.371882 data: 0.000151 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.843528 (0.839000) acc1: 79.687500 (79.838710) acc5: 95.312500 (95.010081) time: 0.374077 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.841280 (0.850151) acc1: 78.125000 (79.496951) acc5: 95.312500 (95.198171) time: 0.371928 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.842052 (0.848785) acc1: 78.125000 (79.424000) acc5: 95.312500 (95.360000) time: 0.358614 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.396465 s / it) * Acc@1 79.456 Acc@5 95.056 loss 0.856 Max accuracy: 79.46% Epoch: [117/300] [ 0/1251] eta: 0:44:09 lr: 0.001344 loss: 3.622368 (3.622368) time: 2.118045 data: 1.140559 max mem: 18817 Epoch: [117/300] [ 50/1251] eta: 0:19:11 lr: 0.001344 loss: 3.295751 (3.355078) time: 0.924680 data: 0.000147 max mem: 18817 Epoch: [117/300] [ 100/1251] eta: 0:18:35 lr: 0.001344 loss: 3.456788 (3.346293) time: 0.939773 data: 0.000177 max mem: 18817 Epoch: [117/300] [ 150/1251] eta: 0:17:45 lr: 0.001343 loss: 3.465129 (3.329271) time: 0.995421 data: 0.000179 max mem: 18817 Epoch: [117/300] [ 200/1251] eta: 0:16:51 lr: 0.001343 loss: 3.465266 (3.327771) time: 0.965353 data: 0.000187 max mem: 18817 Epoch: [117/300] [ 250/1251] eta: 0:16:06 lr: 0.001342 loss: 3.325561 (3.317053) time: 0.994556 data: 0.000175 max mem: 18817 Epoch: [117/300] [ 300/1251] eta: 0:15:15 lr: 0.001342 loss: 3.243261 (3.300420) time: 0.914213 data: 0.000169 max mem: 18817 Epoch: [117/300] [ 350/1251] eta: 0:14:29 lr: 0.001342 loss: 3.375449 (3.303739) time: 0.946411 data: 0.000189 max mem: 18817 Epoch: [117/300] [ 400/1251] eta: 0:13:42 lr: 0.001341 loss: 3.223131 (3.295657) time: 0.987575 data: 0.000151 max mem: 18817 Epoch: [117/300] [ 450/1251] eta: 0:12:52 lr: 0.001341 loss: 3.293224 (3.304233) time: 0.995412 data: 0.000195 max mem: 18817 Epoch: [117/300] [ 500/1251] eta: 0:12:03 lr: 0.001341 loss: 3.064887 (3.309697) time: 0.961238 data: 0.000161 max mem: 18817 Epoch: [117/300] [ 550/1251] eta: 0:11:14 lr: 0.001340 loss: 3.370832 (3.308667) time: 0.942688 data: 0.000181 max mem: 18817 Epoch: [117/300] [ 600/1251] eta: 0:10:26 lr: 0.001340 loss: 3.424692 (3.307603) time: 0.918236 data: 0.000181 max mem: 18817 Epoch: [117/300] [ 650/1251] eta: 0:09:39 lr: 0.001339 loss: 3.392405 (3.307385) time: 0.991299 data: 0.000164 max mem: 18817 Epoch: [117/300] [ 700/1251] eta: 0:08:50 lr: 0.001339 loss: 3.338850 (3.299545) time: 0.983072 data: 0.000164 max mem: 18817 Epoch: [117/300] [ 750/1251] eta: 0:08:02 lr: 0.001339 loss: 3.234509 (3.296416) time: 0.978745 data: 0.000154 max mem: 18817 Epoch: [117/300] [ 800/1251] eta: 0:07:13 lr: 0.001338 loss: 3.389628 (3.299886) time: 0.922565 data: 0.000159 max mem: 18817 Epoch: [117/300] [ 850/1251] eta: 0:06:25 lr: 0.001338 loss: 3.419631 (3.299242) time: 0.920660 data: 0.000182 max mem: 18817 Epoch: [117/300] [ 900/1251] eta: 0:05:37 lr: 0.001337 loss: 3.187914 (3.295451) time: 0.970519 data: 0.000163 max mem: 18817 Epoch: [117/300] [ 950/1251] eta: 0:04:49 lr: 0.001337 loss: 3.484249 (3.293098) time: 0.979396 data: 0.000180 max mem: 18817 Epoch: [117/300] [1000/1251] eta: 0:04:01 lr: 0.001337 loss: 3.313356 (3.295477) time: 0.990940 data: 0.000164 max mem: 18817 Epoch: [117/300] [1050/1251] eta: 0:03:13 lr: 0.001336 loss: 3.317843 (3.294642) time: 0.932926 data: 0.000172 max mem: 18817 Epoch: [117/300] [1100/1251] eta: 0:02:25 lr: 0.001336 loss: 3.696549 (3.296911) time: 0.941673 data: 0.000192 max mem: 18817 Epoch: [117/300] [1150/1251] eta: 0:01:37 lr: 0.001335 loss: 3.367290 (3.298892) time: 0.988399 data: 0.000177 max mem: 18817 Epoch: [117/300] [1200/1251] eta: 0:00:49 lr: 0.001335 loss: 3.310574 (3.297562) time: 1.019122 data: 0.000171 max mem: 18817 Epoch: [117/300] [1250/1251] eta: 0:00:00 lr: 0.001335 loss: 3.396159 (3.305064) time: 0.974591 data: 0.000740 max mem: 18817 Epoch: [117/300] Total time: 0:20:04 (0.962857 s / it) Averaged stats: lr: 0.001335 loss: 3.396159 (3.308997) Test: [ 0/49] eta: 0:01:17 loss: 0.705590 (0.705590) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.575323 data: 1.109812 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.716397 (0.857465) acc1: 81.250000 (80.539773) acc5: 95.312500 (94.602273) time: 0.479141 data: 0.101060 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.863452 (0.860033) acc1: 79.687500 (80.505952) acc5: 95.312500 (94.791667) time: 0.365543 data: 0.000153 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.871314 (0.855507) acc1: 79.687500 (80.292339) acc5: 95.312500 (95.161290) time: 0.361978 data: 0.000122 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.890350 (0.875860) acc1: 79.687500 (79.725610) acc5: 95.312500 (95.007622) time: 0.377807 data: 0.000120 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.918652 (0.878743) acc1: 78.125000 (79.712000) acc5: 95.312500 (95.136000) time: 0.392464 data: 0.000101 max mem: 18817 Test: Total time: 0:00:19 (0.403545 s / it) * Acc@1 79.226 Acc@5 95.126 loss 0.882 Max accuracy: 79.46% Epoch: [118/300] [ 0/1251] eta: 0:43:49 lr: 0.001335 loss: 2.400606 (2.400606) time: 2.101630 data: 1.186315 max mem: 18817 Epoch: [118/300] [ 50/1251] eta: 0:20:02 lr: 0.001334 loss: 3.506817 (3.390358) time: 0.935900 data: 0.000184 max mem: 18817 Epoch: [118/300] [ 100/1251] eta: 0:18:48 lr: 0.001334 loss: 3.219841 (3.300923) time: 0.973865 data: 0.000154 max mem: 18817 Epoch: [118/300] [ 150/1251] eta: 0:17:53 lr: 0.001333 loss: 3.240047 (3.309230) time: 1.021408 data: 0.000177 max mem: 18817 Epoch: [118/300] [ 200/1251] eta: 0:16:53 lr: 0.001333 loss: 3.248618 (3.332680) time: 0.953486 data: 0.000173 max mem: 18817 Epoch: [118/300] [ 250/1251] eta: 0:16:01 lr: 0.001333 loss: 3.178589 (3.302844) time: 0.933640 data: 0.000192 max mem: 18817 Epoch: [118/300] [ 300/1251] eta: 0:15:13 lr: 0.001332 loss: 3.282782 (3.287261) time: 0.936244 data: 0.000162 max mem: 18817 Epoch: [118/300] [ 350/1251] eta: 0:14:24 lr: 0.001332 loss: 3.468481 (3.285443) time: 0.980843 data: 0.000177 max mem: 18817 Epoch: [118/300] [ 400/1251] eta: 0:13:34 lr: 0.001332 loss: 3.538872 (3.291015) time: 0.970519 data: 0.000175 max mem: 18817 Epoch: [118/300] [ 450/1251] eta: 0:12:48 lr: 0.001331 loss: 3.411064 (3.291192) time: 0.984493 data: 0.000168 max mem: 18817 Epoch: [118/300] [ 500/1251] eta: 0:12:00 lr: 0.001331 loss: 3.384327 (3.288064) time: 0.931266 data: 0.000175 max mem: 18817 Epoch: [118/300] [ 550/1251] eta: 0:11:12 lr: 0.001330 loss: 3.100333 (3.285821) time: 0.917840 data: 0.000163 max mem: 18817 Epoch: [118/300] [ 600/1251] eta: 0:10:24 lr: 0.001330 loss: 3.610507 (3.298795) time: 0.981753 data: 0.000165 max mem: 18817 Epoch: [118/300] [ 650/1251] eta: 0:09:37 lr: 0.001330 loss: 3.443552 (3.301408) time: 1.003779 data: 0.000150 max mem: 18817 Epoch: [118/300] [ 700/1251] eta: 0:08:49 lr: 0.001329 loss: 3.354873 (3.298944) time: 0.978103 data: 0.000167 max mem: 18817 Epoch: [118/300] [ 750/1251] eta: 0:08:00 lr: 0.001329 loss: 3.101978 (3.294823) time: 0.927135 data: 0.000156 max mem: 18817 Epoch: [118/300] [ 800/1251] eta: 0:07:13 lr: 0.001328 loss: 3.448863 (3.303534) time: 0.928978 data: 0.000171 max mem: 18817 Epoch: [118/300] [ 850/1251] eta: 0:06:25 lr: 0.001328 loss: 3.299312 (3.301804) time: 0.989238 data: 0.000160 max mem: 18817 Epoch: [118/300] [ 900/1251] eta: 0:05:37 lr: 0.001328 loss: 3.239290 (3.298015) time: 1.026771 data: 0.000162 max mem: 18817 Epoch: [118/300] [ 950/1251] eta: 0:04:49 lr: 0.001327 loss: 3.447609 (3.299847) time: 0.983240 data: 0.000171 max mem: 18817 Epoch: [118/300] [1000/1251] eta: 0:04:00 lr: 0.001327 loss: 3.131515 (3.295567) time: 0.928790 data: 0.000173 max mem: 18817 Epoch: [118/300] [1050/1251] eta: 0:03:12 lr: 0.001326 loss: 3.308717 (3.300585) time: 0.919343 data: 0.000209 max mem: 18817 Epoch: [118/300] [1100/1251] eta: 0:02:24 lr: 0.001326 loss: 2.855855 (3.296647) time: 0.973722 data: 0.000167 max mem: 18817 Epoch: [118/300] [1150/1251] eta: 0:01:36 lr: 0.001326 loss: 3.563946 (3.303507) time: 0.982014 data: 0.000167 max mem: 18817 Epoch: [118/300] [1200/1251] eta: 0:00:48 lr: 0.001325 loss: 3.317674 (3.300765) time: 0.988499 data: 0.000170 max mem: 18817 Epoch: [118/300] [1250/1251] eta: 0:00:00 lr: 0.001325 loss: 3.348528 (3.301575) time: 0.932326 data: 0.000754 max mem: 18817 Epoch: [118/300] Total time: 0:20:01 (0.960191 s / it) Averaged stats: lr: 0.001325 loss: 3.348528 (3.297552) Test: [ 0/49] eta: 0:01:18 loss: 0.631238 (0.631238) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.592697 data: 1.142339 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.683247 (0.815659) acc1: 79.687500 (79.971591) acc5: 95.312500 (94.602273) time: 0.482736 data: 0.104023 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.863108 (0.837847) acc1: 78.125000 (79.761905) acc5: 95.312500 (94.717262) time: 0.367827 data: 0.000162 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.863108 (0.841815) acc1: 78.125000 (79.435484) acc5: 95.312500 (95.161290) time: 0.470847 data: 0.000135 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.824646 (0.846110) acc1: 78.125000 (79.611280) acc5: 95.312500 (95.198171) time: 0.467883 data: 0.000139 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.817549 (0.843930) acc1: 78.125000 (79.616000) acc5: 95.312500 (95.296000) time: 0.355610 data: 0.000117 max mem: 18817 Test: Total time: 0:00:21 (0.434154 s / it) * Acc@1 79.486 Acc@5 95.290 loss 0.844 Max accuracy: 79.49% Epoch: [119/300] [ 0/1251] eta: 0:39:43 lr: 0.001325 loss: 3.831045 (3.831045) time: 1.905225 data: 1.006687 max mem: 18817 Epoch: [119/300] [ 50/1251] eta: 0:19:41 lr: 0.001324 loss: 3.518891 (3.246170) time: 0.975720 data: 0.000163 max mem: 18817 Epoch: [119/300] [ 100/1251] eta: 0:18:26 lr: 0.001324 loss: 3.474907 (3.197223) time: 0.962264 data: 0.000173 max mem: 18817 Epoch: [119/300] [ 150/1251] eta: 0:17:29 lr: 0.001324 loss: 3.286223 (3.215818) time: 0.913858 data: 0.000162 max mem: 18817 Epoch: [119/300] [ 200/1251] eta: 0:16:47 lr: 0.001323 loss: 3.236424 (3.226111) time: 0.923904 data: 0.000178 max mem: 18817 Epoch: [119/300] [ 250/1251] eta: 0:16:01 lr: 0.001323 loss: 3.202242 (3.252265) time: 0.987406 data: 0.000182 max mem: 18817 Epoch: [119/300] [ 300/1251] eta: 0:15:16 lr: 0.001322 loss: 3.066215 (3.233078) time: 1.016679 data: 0.000163 max mem: 18817 Epoch: [119/300] [ 350/1251] eta: 0:14:25 lr: 0.001322 loss: 3.319514 (3.232800) time: 0.963935 data: 0.000159 max mem: 18817 Epoch: [119/300] [ 400/1251] eta: 0:13:36 lr: 0.001322 loss: 3.088551 (3.238579) time: 0.920909 data: 0.000163 max mem: 18817 Epoch: [119/300] [ 450/1251] eta: 0:12:50 lr: 0.001321 loss: 3.443840 (3.233074) time: 0.922110 data: 0.000172 max mem: 18817 Epoch: [119/300] [ 500/1251] eta: 0:12:03 lr: 0.001321 loss: 3.521127 (3.247835) time: 1.005833 data: 0.000161 max mem: 18817 Epoch: [119/300] [ 550/1251] eta: 0:11:15 lr: 0.001321 loss: 3.543235 (3.253958) time: 0.989092 data: 0.000183 max mem: 18817 Epoch: [119/300] [ 600/1251] eta: 0:10:26 lr: 0.001320 loss: 3.385062 (3.262624) time: 0.968425 data: 0.000162 max mem: 18817 Epoch: [119/300] [ 650/1251] eta: 0:09:37 lr: 0.001320 loss: 3.233116 (3.270828) time: 0.909945 data: 0.000157 max mem: 18817 Epoch: [119/300] [ 700/1251] eta: 0:08:49 lr: 0.001319 loss: 3.381126 (3.272960) time: 0.917890 data: 0.000156 max mem: 18817 Epoch: [119/300] [ 750/1251] eta: 0:08:01 lr: 0.001319 loss: 3.026742 (3.264787) time: 0.988916 data: 0.000176 max mem: 18817 Epoch: [119/300] [ 800/1251] eta: 0:07:13 lr: 0.001319 loss: 3.279053 (3.269584) time: 0.990112 data: 0.000177 max mem: 18817 Epoch: [119/300] [ 850/1251] eta: 0:06:25 lr: 0.001318 loss: 3.300889 (3.271008) time: 0.979214 data: 0.000185 max mem: 18817 Epoch: [119/300] [ 900/1251] eta: 0:05:37 lr: 0.001318 loss: 3.494282 (3.275531) time: 0.916179 data: 0.000163 max mem: 18817 Epoch: [119/300] [ 950/1251] eta: 0:04:49 lr: 0.001317 loss: 3.146978 (3.275722) time: 0.927465 data: 0.000152 max mem: 18817 Epoch: [119/300] [1000/1251] eta: 0:04:01 lr: 0.001317 loss: 3.518246 (3.275715) time: 0.956873 data: 0.000192 max mem: 18817 Epoch: [119/300] [1050/1251] eta: 0:03:13 lr: 0.001317 loss: 3.427975 (3.281382) time: 0.973707 data: 0.000176 max mem: 18817 Epoch: [119/300] [1100/1251] eta: 0:02:25 lr: 0.001316 loss: 3.554474 (3.284032) time: 0.975626 data: 0.000174 max mem: 18817 Epoch: [119/300] [1150/1251] eta: 0:01:36 lr: 0.001316 loss: 3.452855 (3.288468) time: 0.912014 data: 0.000166 max mem: 18817 Epoch: [119/300] [1200/1251] eta: 0:00:48 lr: 0.001315 loss: 3.266405 (3.290007) time: 0.922744 data: 0.000171 max mem: 18817 Epoch: [119/300] [1250/1251] eta: 0:00:00 lr: 0.001315 loss: 3.374288 (3.290165) time: 0.982788 data: 0.000737 max mem: 18817 Epoch: [119/300] Total time: 0:20:01 (0.960553 s / it) Averaged stats: lr: 0.001315 loss: 3.374288 (3.290047) Test: [ 0/49] eta: 0:01:20 loss: 0.639420 (0.639420) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.645074 data: 1.171342 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.756283 (0.799211) acc1: 81.250000 (80.965909) acc5: 95.312500 (95.028409) time: 0.487929 data: 0.106642 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.875798 (0.845513) acc1: 79.687500 (79.910714) acc5: 95.312500 (94.642857) time: 0.374426 data: 0.000170 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.875798 (0.850316) acc1: 78.125000 (79.233871) acc5: 95.312500 (95.010081) time: 0.370264 data: 0.000159 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.819963 (0.849741) acc1: 78.125000 (79.382622) acc5: 95.312500 (95.160061) time: 0.361384 data: 0.000147 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.842329 (0.845654) acc1: 81.250000 (79.680000) acc5: 95.312500 (95.200000) time: 0.355767 data: 0.000120 max mem: 18817 Test: Total time: 0:00:19 (0.393478 s / it) * Acc@1 79.486 Acc@5 95.198 loss 0.848 Max accuracy: 79.49% Epoch: [120/300] [ 0/1251] eta: 0:47:23 lr: 0.001315 loss: 2.838132 (2.838132) time: 2.273054 data: 1.179806 max mem: 18817 Epoch: [120/300] [ 50/1251] eta: 0:19:35 lr: 0.001315 loss: 3.631194 (3.373256) time: 0.981268 data: 0.000170 max mem: 18817 Epoch: [120/300] [ 100/1251] eta: 0:18:37 lr: 0.001314 loss: 3.472298 (3.295099) time: 0.943125 data: 0.000177 max mem: 18817 Epoch: [120/300] [ 150/1251] eta: 0:17:50 lr: 0.001314 loss: 3.133753 (3.279226) time: 0.933401 data: 0.000161 max mem: 18817 Epoch: [120/300] [ 200/1251] eta: 0:17:02 lr: 0.001313 loss: 3.477448 (3.285941) time: 1.003473 data: 0.000173 max mem: 18817 Epoch: [120/300] [ 250/1251] eta: 0:16:12 lr: 0.001313 loss: 3.284770 (3.288651) time: 1.038403 data: 0.000159 max mem: 18817 Epoch: [120/300] [ 300/1251] eta: 0:15:19 lr: 0.001313 loss: 3.462554 (3.304975) time: 0.965973 data: 0.000160 max mem: 18817 Epoch: [120/300] [ 350/1251] eta: 0:14:29 lr: 0.001312 loss: 3.404696 (3.315281) time: 0.924049 data: 0.000165 max mem: 18817 Epoch: [120/300] [ 400/1251] eta: 0:13:41 lr: 0.001312 loss: 3.426188 (3.314969) time: 0.931800 data: 0.000160 max mem: 18817 Epoch: [120/300] [ 450/1251] eta: 0:12:53 lr: 0.001311 loss: 3.297901 (3.315495) time: 0.987477 data: 0.000168 max mem: 18817 Epoch: [120/300] [ 500/1251] eta: 0:12:06 lr: 0.001311 loss: 3.318681 (3.310951) time: 1.039804 data: 0.000183 max mem: 18817 Epoch: [120/300] [ 550/1251] eta: 0:11:16 lr: 0.001311 loss: 3.419802 (3.313271) time: 0.963709 data: 0.000169 max mem: 18817 Epoch: [120/300] [ 600/1251] eta: 0:10:27 lr: 0.001310 loss: 3.430283 (3.312151) time: 0.928957 data: 0.000177 max mem: 18817 Epoch: [120/300] [ 650/1251] eta: 0:09:39 lr: 0.001310 loss: 3.087516 (3.306165) time: 0.936455 data: 0.000158 max mem: 18817 Epoch: [120/300] [ 700/1251] eta: 0:08:51 lr: 0.001309 loss: 3.469922 (3.303982) time: 0.996329 data: 0.000165 max mem: 18817 Epoch: [120/300] [ 750/1251] eta: 0:08:04 lr: 0.001309 loss: 3.224574 (3.304707) time: 1.004987 data: 0.000200 max mem: 18817 Epoch: [120/300] [ 800/1251] eta: 0:07:15 lr: 0.001309 loss: 2.983393 (3.301661) time: 0.968130 data: 0.000171 max mem: 18817 Epoch: [120/300] [ 850/1251] eta: 0:06:26 lr: 0.001308 loss: 3.449811 (3.306263) time: 0.927512 data: 0.000158 max mem: 18817 Epoch: [120/300] [ 900/1251] eta: 0:05:38 lr: 0.001308 loss: 3.128571 (3.304075) time: 0.932831 data: 0.000171 max mem: 18817 Epoch: [120/300] [ 950/1251] eta: 0:04:50 lr: 0.001307 loss: 3.399116 (3.300908) time: 1.002476 data: 0.000159 max mem: 18817 Epoch: [120/300] [1000/1251] eta: 0:04:02 lr: 0.001307 loss: 3.490800 (3.304992) time: 0.997504 data: 0.000191 max mem: 18817 Epoch: [120/300] [1050/1251] eta: 0:03:13 lr: 0.001307 loss: 3.361908 (3.303682) time: 0.982203 data: 0.000172 max mem: 18817 Epoch: [120/300] [1100/1251] eta: 0:02:25 lr: 0.001306 loss: 3.388440 (3.301685) time: 0.919839 data: 0.000165 max mem: 18817 Epoch: [120/300] [1150/1251] eta: 0:01:37 lr: 0.001306 loss: 3.303577 (3.298931) time: 0.934334 data: 0.000167 max mem: 18817 Epoch: [120/300] [1200/1251] eta: 0:00:49 lr: 0.001306 loss: 3.258744 (3.299254) time: 0.954512 data: 0.000167 max mem: 18817 Epoch: [120/300] [1250/1251] eta: 0:00:00 lr: 0.001305 loss: 3.417976 (3.302630) time: 0.989312 data: 0.000761 max mem: 18817 Epoch: [120/300] Total time: 0:20:08 (0.966228 s / it) Averaged stats: lr: 0.001305 loss: 3.417976 (3.300444) Test: [ 0/49] eta: 0:01:26 loss: 0.763606 (0.763606) acc1: 85.937500 (85.937500) acc5: 93.750000 (93.750000) time: 1.767250 data: 1.340955 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.763606 (0.798823) acc1: 81.250000 (81.818182) acc5: 96.875000 (94.886364) time: 0.494137 data: 0.122047 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.854515 (0.834847) acc1: 79.687500 (80.282738) acc5: 95.312500 (95.386905) time: 0.375668 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.857524 (0.836060) acc1: 79.687500 (80.443548) acc5: 95.312500 (95.564516) time: 0.377560 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.841308 (0.849111) acc1: 79.687500 (80.259146) acc5: 95.312500 (95.464939) time: 0.370096 data: 0.000116 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.841308 (0.848727) acc1: 79.687500 (80.064000) acc5: 95.312500 (95.424000) time: 0.361953 data: 0.000099 max mem: 18817 Test: Total time: 0:00:19 (0.400878 s / it) * Acc@1 79.508 Acc@5 95.240 loss 0.862 Max accuracy: 79.51% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0120.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0120.pth Epoch: [121/300] [ 0/1251] eta: 0:42:58 lr: 0.001305 loss: 2.643483 (2.643483) time: 2.061295 data: 1.158636 max mem: 18817 Epoch: [121/300] [ 50/1251] eta: 0:19:40 lr: 0.001305 loss: 3.146855 (3.348352) time: 0.966182 data: 0.000173 max mem: 18817 Epoch: [121/300] [ 100/1251] eta: 0:18:30 lr: 0.001304 loss: 2.985798 (3.259738) time: 0.971491 data: 0.000167 max mem: 18817 Epoch: [121/300] [ 150/1251] eta: 0:17:45 lr: 0.001304 loss: 3.364707 (3.227869) time: 0.973680 data: 0.000153 max mem: 18817 Epoch: [121/300] [ 200/1251] eta: 0:16:50 lr: 0.001304 loss: 3.388473 (3.213688) time: 0.922644 data: 0.000165 max mem: 18817 Epoch: [121/300] [ 250/1251] eta: 0:16:01 lr: 0.001303 loss: 3.313919 (3.232142) time: 0.927878 data: 0.000177 max mem: 18817 Epoch: [121/300] [ 300/1251] eta: 0:15:13 lr: 0.001303 loss: 3.126426 (3.232559) time: 0.976094 data: 0.000170 max mem: 18817 Epoch: [121/300] [ 350/1251] eta: 0:14:23 lr: 0.001302 loss: 3.179412 (3.238311) time: 0.986833 data: 0.000166 max mem: 18817 Epoch: [121/300] [ 400/1251] eta: 0:13:35 lr: 0.001302 loss: 3.255243 (3.242727) time: 0.947608 data: 0.000187 max mem: 18817 Epoch: [121/300] [ 450/1251] eta: 0:12:48 lr: 0.001302 loss: 3.342136 (3.256052) time: 0.943052 data: 0.000173 max mem: 18817 Epoch: [121/300] [ 500/1251] eta: 0:12:01 lr: 0.001301 loss: 3.302506 (3.266191) time: 0.932489 data: 0.000176 max mem: 18817 Epoch: [121/300] [ 550/1251] eta: 0:11:13 lr: 0.001301 loss: 3.450825 (3.265859) time: 0.977335 data: 0.000177 max mem: 18817 Epoch: [121/300] [ 600/1251] eta: 0:10:25 lr: 0.001300 loss: 3.272762 (3.264277) time: 0.982109 data: 0.000168 max mem: 18817 Epoch: [121/300] [ 650/1251] eta: 0:09:38 lr: 0.001300 loss: 3.377775 (3.267857) time: 0.984970 data: 0.000175 max mem: 18817 Epoch: [121/300] [ 700/1251] eta: 0:08:49 lr: 0.001300 loss: 3.203512 (3.267496) time: 0.928722 data: 0.000166 max mem: 18817 Epoch: [121/300] [ 750/1251] eta: 0:08:01 lr: 0.001299 loss: 3.535541 (3.277706) time: 0.934240 data: 0.000170 max mem: 18817 Epoch: [121/300] [ 800/1251] eta: 0:07:13 lr: 0.001299 loss: 3.194127 (3.276798) time: 0.976066 data: 0.000173 max mem: 18817 Epoch: [121/300] [ 850/1251] eta: 0:06:25 lr: 0.001298 loss: 3.337909 (3.273247) time: 0.985747 data: 0.000172 max mem: 18817 Epoch: [121/300] [ 900/1251] eta: 0:05:37 lr: 0.001298 loss: 3.526642 (3.276317) time: 0.958830 data: 0.000163 max mem: 18817 Epoch: [121/300] [ 950/1251] eta: 0:04:49 lr: 0.001298 loss: 3.050192 (3.274564) time: 0.924446 data: 0.000174 max mem: 18817 Epoch: [121/300] [1000/1251] eta: 0:04:01 lr: 0.001297 loss: 3.090303 (3.276881) time: 0.929475 data: 0.000163 max mem: 18817 Epoch: [121/300] [1050/1251] eta: 0:03:13 lr: 0.001297 loss: 3.292961 (3.279191) time: 0.989923 data: 0.000174 max mem: 18817 Epoch: [121/300] [1100/1251] eta: 0:02:25 lr: 0.001296 loss: 3.197448 (3.275852) time: 0.980258 data: 0.000165 max mem: 18817 Epoch: [121/300] [1150/1251] eta: 0:01:37 lr: 0.001296 loss: 3.522687 (3.271065) time: 0.984854 data: 0.000167 max mem: 18817 Epoch: [121/300] [1200/1251] eta: 0:00:49 lr: 0.001296 loss: 3.371182 (3.271673) time: 0.923350 data: 0.000176 max mem: 18817 Epoch: [121/300] [1250/1251] eta: 0:00:00 lr: 0.001295 loss: 3.309635 (3.274219) time: 0.942604 data: 0.000731 max mem: 18817 Epoch: [121/300] Total time: 0:20:03 (0.962109 s / it) Averaged stats: lr: 0.001295 loss: 3.309635 (3.277894) Test: [ 0/49] eta: 0:01:15 loss: 0.782140 (0.782140) acc1: 82.812500 (82.812500) acc5: 93.750000 (93.750000) time: 1.531972 data: 1.108831 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.782140 (0.766040) acc1: 82.812500 (81.250000) acc5: 95.312500 (95.028409) time: 0.479070 data: 0.100982 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.818545 (0.812154) acc1: 78.125000 (80.282738) acc5: 95.312500 (95.312500) time: 0.368527 data: 0.000158 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.834904 (0.815573) acc1: 78.125000 (79.737903) acc5: 95.312500 (95.514113) time: 0.363338 data: 0.000121 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.834904 (0.828892) acc1: 79.687500 (79.573171) acc5: 95.312500 (95.464939) time: 0.360931 data: 0.000118 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.886297 (0.835371) acc1: 79.687500 (79.424000) acc5: 95.312500 (95.360000) time: 0.355057 data: 0.000097 max mem: 18817 Test: Total time: 0:00:19 (0.389312 s / it) * Acc@1 79.796 Acc@5 95.316 loss 0.839 Max accuracy: 79.80% Epoch: [122/300] [ 0/1251] eta: 0:42:18 lr: 0.001295 loss: 3.423319 (3.423319) time: 2.029572 data: 1.049063 max mem: 18817 Epoch: [122/300] [ 50/1251] eta: 0:19:12 lr: 0.001295 loss: 3.237445 (3.237173) time: 0.963117 data: 0.000166 max mem: 18817 Epoch: [122/300] [ 100/1251] eta: 0:18:21 lr: 0.001294 loss: 2.956991 (3.139400) time: 0.923336 data: 0.000188 max mem: 18817 Epoch: [122/300] [ 150/1251] eta: 0:17:39 lr: 0.001294 loss: 3.153000 (3.163553) time: 0.934239 data: 0.000179 max mem: 18817 Epoch: [122/300] [ 200/1251] eta: 0:16:53 lr: 0.001294 loss: 3.245438 (3.165037) time: 0.981076 data: 0.000175 max mem: 18817 Epoch: [122/300] [ 250/1251] eta: 0:16:04 lr: 0.001293 loss: 3.140264 (3.178041) time: 1.019352 data: 0.000176 max mem: 18817 Epoch: [122/300] [ 300/1251] eta: 0:15:13 lr: 0.001293 loss: 3.100531 (3.192616) time: 0.969700 data: 0.000178 max mem: 18817 Epoch: [122/300] [ 350/1251] eta: 0:14:23 lr: 0.001292 loss: 3.292067 (3.193238) time: 0.919241 data: 0.000178 max mem: 18817 Epoch: [122/300] [ 400/1251] eta: 0:13:38 lr: 0.001292 loss: 3.266058 (3.191991) time: 0.928620 data: 0.000183 max mem: 18817 Epoch: [122/300] [ 450/1251] eta: 0:12:50 lr: 0.001292 loss: 3.401129 (3.208524) time: 0.985962 data: 0.000176 max mem: 18817 Epoch: [122/300] [ 500/1251] eta: 0:12:03 lr: 0.001291 loss: 3.242828 (3.208215) time: 1.012491 data: 0.000191 max mem: 18817 Epoch: [122/300] [ 550/1251] eta: 0:11:13 lr: 0.001291 loss: 3.325444 (3.208852) time: 0.970133 data: 0.000181 max mem: 18817 Epoch: [122/300] [ 600/1251] eta: 0:10:25 lr: 0.001290 loss: 3.356392 (3.202490) time: 0.920967 data: 0.000174 max mem: 18817 Epoch: [122/300] [ 650/1251] eta: 0:09:37 lr: 0.001290 loss: 3.535070 (3.216342) time: 0.927461 data: 0.000164 max mem: 18817 Epoch: [122/300] [ 700/1251] eta: 0:08:49 lr: 0.001290 loss: 3.428605 (3.225513) time: 0.975524 data: 0.000168 max mem: 18817 Epoch: [122/300] [ 750/1251] eta: 0:08:01 lr: 0.001289 loss: 3.351057 (3.234407) time: 1.060714 data: 0.000186 max mem: 18817 Epoch: [122/300] [ 800/1251] eta: 0:07:13 lr: 0.001289 loss: 3.425488 (3.241362) time: 0.968874 data: 0.000182 max mem: 18817 Epoch: [122/300] [ 850/1251] eta: 0:06:25 lr: 0.001288 loss: 3.455683 (3.247697) time: 0.916570 data: 0.000179 max mem: 18817 Epoch: [122/300] [ 900/1251] eta: 0:05:37 lr: 0.001288 loss: 3.311419 (3.249412) time: 0.920129 data: 0.000184 max mem: 18817 Epoch: [122/300] [ 950/1251] eta: 0:04:49 lr: 0.001288 loss: 3.029148 (3.251275) time: 1.008005 data: 0.000179 max mem: 18817 Epoch: [122/300] [1000/1251] eta: 0:04:01 lr: 0.001287 loss: 3.096343 (3.250887) time: 0.986253 data: 0.000179 max mem: 18817 Epoch: [122/300] [1050/1251] eta: 0:03:13 lr: 0.001287 loss: 3.507732 (3.252426) time: 0.962718 data: 0.000184 max mem: 18817 Epoch: [122/300] [1100/1251] eta: 0:02:24 lr: 0.001286 loss: 3.274125 (3.249556) time: 0.915053 data: 0.000179 max mem: 18817 Epoch: [122/300] [1150/1251] eta: 0:01:36 lr: 0.001286 loss: 3.426849 (3.252901) time: 0.917258 data: 0.000178 max mem: 18817 Epoch: [122/300] [1200/1251] eta: 0:00:48 lr: 0.001286 loss: 3.470392 (3.252242) time: 0.984666 data: 0.000187 max mem: 18817 Epoch: [122/300] [1250/1251] eta: 0:00:00 lr: 0.001285 loss: 3.487180 (3.253801) time: 1.021606 data: 0.000765 max mem: 18817 Epoch: [122/300] Total time: 0:20:01 (0.960429 s / it) Averaged stats: lr: 0.001285 loss: 3.487180 (3.265346) Test: [ 0/49] eta: 0:01:33 loss: 0.734468 (0.734468) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.905640 data: 1.423953 max mem: 18817 Test: [10/49] eta: 0:00:21 loss: 0.734468 (0.803803) acc1: 81.250000 (81.818182) acc5: 95.312500 (95.312500) time: 0.546831 data: 0.129580 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.881331 (0.841626) acc1: 79.687500 (80.059524) acc5: 95.312500 (95.163690) time: 0.386568 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.870734 (0.845116) acc1: 78.125000 (79.485887) acc5: 96.875000 (95.514113) time: 0.365271 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.867276 (0.854721) acc1: 79.687500 (79.916159) acc5: 96.875000 (95.464939) time: 0.365366 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.879612 (0.854371) acc1: 79.687500 (79.744000) acc5: 95.312500 (95.424000) time: 0.357041 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.407031 s / it) * Acc@1 79.440 Acc@5 95.158 loss 0.854 Max accuracy: 79.80% Epoch: [123/300] [ 0/1251] eta: 0:41:35 lr: 0.001285 loss: 3.574099 (3.574099) time: 1.995087 data: 1.094961 max mem: 18817 Epoch: [123/300] [ 50/1251] eta: 0:19:11 lr: 0.001285 loss: 3.345621 (3.239403) time: 0.920552 data: 0.000162 max mem: 18817 Epoch: [123/300] [ 100/1251] eta: 0:18:26 lr: 0.001284 loss: 3.353361 (3.210021) time: 0.921754 data: 0.000164 max mem: 18817 Epoch: [123/300] [ 150/1251] eta: 0:17:39 lr: 0.001284 loss: 3.543731 (3.250711) time: 0.993352 data: 0.000165 max mem: 18817 Epoch: [123/300] [ 200/1251] eta: 0:16:53 lr: 0.001284 loss: 3.401995 (3.286610) time: 1.031795 data: 0.000170 max mem: 18817 Epoch: [123/300] [ 250/1251] eta: 0:16:03 lr: 0.001283 loss: 3.072630 (3.265424) time: 0.969579 data: 0.000192 max mem: 18817 Epoch: [123/300] [ 300/1251] eta: 0:15:13 lr: 0.001283 loss: 3.025661 (3.240655) time: 0.928829 data: 0.000172 max mem: 18817 Epoch: [123/300] [ 350/1251] eta: 0:14:27 lr: 0.001282 loss: 3.464297 (3.258858) time: 0.944963 data: 0.000155 max mem: 18817 Epoch: [123/300] [ 400/1251] eta: 0:13:40 lr: 0.001282 loss: 3.387273 (3.264215) time: 0.974062 data: 0.000180 max mem: 18817 Epoch: [123/300] [ 450/1251] eta: 0:12:52 lr: 0.001282 loss: 3.296618 (3.264323) time: 1.047510 data: 0.000179 max mem: 18817 Epoch: [123/300] [ 500/1251] eta: 0:12:02 lr: 0.001281 loss: 3.414610 (3.261705) time: 0.963309 data: 0.000165 max mem: 18817 Epoch: [123/300] [ 550/1251] eta: 0:11:13 lr: 0.001281 loss: 3.457935 (3.269788) time: 0.930074 data: 0.000144 max mem: 18817 Epoch: [123/300] [ 600/1251] eta: 0:10:25 lr: 0.001280 loss: 3.184777 (3.274520) time: 0.931568 data: 0.000171 max mem: 18817 Epoch: [123/300] [ 650/1251] eta: 0:09:37 lr: 0.001280 loss: 3.471380 (3.283511) time: 0.970452 data: 0.000176 max mem: 18817 Epoch: [123/300] [ 700/1251] eta: 0:08:49 lr: 0.001280 loss: 3.241701 (3.278253) time: 1.013885 data: 0.000172 max mem: 18817 Epoch: [123/300] [ 750/1251] eta: 0:08:01 lr: 0.001279 loss: 3.276159 (3.278581) time: 0.985345 data: 0.000168 max mem: 18817 Epoch: [123/300] [ 800/1251] eta: 0:07:13 lr: 0.001279 loss: 3.358661 (3.278576) time: 0.920975 data: 0.000169 max mem: 18817 Epoch: [123/300] [ 850/1251] eta: 0:06:25 lr: 0.001278 loss: 3.420891 (3.278051) time: 0.940038 data: 0.000177 max mem: 18817 Epoch: [123/300] [ 900/1251] eta: 0:05:37 lr: 0.001278 loss: 3.328422 (3.274482) time: 1.004543 data: 0.000182 max mem: 18817 Epoch: [123/300] [ 950/1251] eta: 0:04:49 lr: 0.001278 loss: 3.458836 (3.273808) time: 0.994795 data: 0.000179 max mem: 18817 Epoch: [123/300] [1000/1251] eta: 0:04:01 lr: 0.001277 loss: 3.097974 (3.278047) time: 0.970946 data: 0.000179 max mem: 18817 Epoch: [123/300] [1050/1251] eta: 0:03:12 lr: 0.001277 loss: 3.360197 (3.276798) time: 0.925448 data: 0.000177 max mem: 18817 Epoch: [123/300] [1100/1251] eta: 0:02:25 lr: 0.001276 loss: 3.147588 (3.278957) time: 0.939147 data: 0.000174 max mem: 18817 Epoch: [123/300] [1150/1251] eta: 0:01:37 lr: 0.001276 loss: 3.485854 (3.278391) time: 0.981942 data: 0.000179 max mem: 18817 Epoch: [123/300] [1200/1251] eta: 0:00:48 lr: 0.001276 loss: 3.415198 (3.278962) time: 0.994838 data: 0.000176 max mem: 18817 Epoch: [123/300] [1250/1251] eta: 0:00:00 lr: 0.001275 loss: 3.287191 (3.274408) time: 0.977461 data: 0.000734 max mem: 18817 Epoch: [123/300] Total time: 0:20:01 (0.960615 s / it) Averaged stats: lr: 0.001275 loss: 3.287191 (3.278914) Test: [ 0/49] eta: 0:01:18 loss: 0.652601 (0.652601) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.611733 data: 1.177238 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.726844 (0.814068) acc1: 81.250000 (82.812500) acc5: 96.875000 (96.306818) time: 0.482744 data: 0.107172 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.920078 (0.868385) acc1: 78.125000 (80.431548) acc5: 95.312500 (95.610119) time: 0.366845 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.918282 (0.864709) acc1: 78.125000 (80.040323) acc5: 95.312500 (95.665323) time: 0.364033 data: 0.000135 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.905211 (0.875414) acc1: 79.687500 (79.954268) acc5: 95.312500 (95.655488) time: 0.373075 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.918282 (0.877059) acc1: 79.687500 (79.840000) acc5: 95.312500 (95.552000) time: 0.382033 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.402006 s / it) * Acc@1 79.692 Acc@5 95.204 loss 0.889 Max accuracy: 79.80% Epoch: [124/300] [ 0/1251] eta: 0:39:14 lr: 0.001275 loss: 3.070517 (3.070517) time: 1.882244 data: 0.988297 max mem: 18817 Epoch: [124/300] [ 50/1251] eta: 0:19:42 lr: 0.001275 loss: 3.444283 (3.255631) time: 0.930995 data: 0.000159 max mem: 18817 Epoch: [124/300] [ 100/1251] eta: 0:18:43 lr: 0.001274 loss: 3.161146 (3.232061) time: 0.986095 data: 0.000173 max mem: 18817 Epoch: [124/300] [ 150/1251] eta: 0:17:46 lr: 0.001274 loss: 3.192178 (3.248527) time: 1.001350 data: 0.000182 max mem: 18817 Epoch: [124/300] [ 200/1251] eta: 0:16:53 lr: 0.001274 loss: 3.402902 (3.248714) time: 0.978213 data: 0.000179 max mem: 18817 Epoch: [124/300] [ 250/1251] eta: 0:15:59 lr: 0.001273 loss: 3.560809 (3.277623) time: 0.918609 data: 0.000162 max mem: 18817 Epoch: [124/300] [ 300/1251] eta: 0:15:15 lr: 0.001273 loss: 3.185076 (3.266874) time: 0.936710 data: 0.000190 max mem: 18817 Epoch: [124/300] [ 350/1251] eta: 0:14:29 lr: 0.001272 loss: 3.369463 (3.276709) time: 0.991103 data: 0.000165 max mem: 18817 Epoch: [124/300] [ 400/1251] eta: 0:13:42 lr: 0.001272 loss: 3.287422 (3.270679) time: 1.045426 data: 0.000208 max mem: 18817 Epoch: [124/300] [ 450/1251] eta: 0:12:52 lr: 0.001272 loss: 3.445239 (3.270911) time: 0.980542 data: 0.000184 max mem: 18817 Epoch: [124/300] [ 500/1251] eta: 0:12:03 lr: 0.001271 loss: 3.381884 (3.278815) time: 0.921117 data: 0.000168 max mem: 18817 Epoch: [124/300] [ 550/1251] eta: 0:11:15 lr: 0.001271 loss: 3.165335 (3.272976) time: 0.935136 data: 0.000183 max mem: 18817 Epoch: [124/300] [ 600/1251] eta: 0:10:27 lr: 0.001270 loss: 3.256948 (3.271261) time: 0.997812 data: 0.000164 max mem: 18817 Epoch: [124/300] [ 650/1251] eta: 0:09:40 lr: 0.001270 loss: 3.213134 (3.272581) time: 1.069942 data: 0.000174 max mem: 18817 Epoch: [124/300] [ 700/1251] eta: 0:08:51 lr: 0.001270 loss: 3.193736 (3.267524) time: 0.991393 data: 0.000176 max mem: 18817 Epoch: [124/300] [ 750/1251] eta: 0:08:02 lr: 0.001269 loss: 3.516555 (3.274283) time: 0.922356 data: 0.000170 max mem: 18817 Epoch: [124/300] [ 800/1251] eta: 0:07:14 lr: 0.001269 loss: 3.288693 (3.270534) time: 0.931635 data: 0.000174 max mem: 18817 Epoch: [124/300] [ 850/1251] eta: 0:06:26 lr: 0.001268 loss: 3.056787 (3.270273) time: 0.978423 data: 0.000178 max mem: 18817 Epoch: [124/300] [ 900/1251] eta: 0:05:38 lr: 0.001268 loss: 3.454879 (3.270675) time: 1.021449 data: 0.000167 max mem: 18817 Epoch: [124/300] [ 950/1251] eta: 0:04:49 lr: 0.001268 loss: 3.458048 (3.274872) time: 0.955968 data: 0.000172 max mem: 18817 Epoch: [124/300] [1000/1251] eta: 0:04:01 lr: 0.001267 loss: 3.323528 (3.272593) time: 0.929025 data: 0.000165 max mem: 18817 Epoch: [124/300] [1050/1251] eta: 0:03:13 lr: 0.001267 loss: 3.183898 (3.269030) time: 0.930841 data: 0.000188 max mem: 18817 Epoch: [124/300] [1100/1251] eta: 0:02:25 lr: 0.001266 loss: 3.273988 (3.275811) time: 1.001853 data: 0.000195 max mem: 18817 Epoch: [124/300] [1150/1251] eta: 0:01:37 lr: 0.001266 loss: 3.494926 (3.277554) time: 1.035155 data: 0.000190 max mem: 18817 Epoch: [124/300] [1200/1251] eta: 0:00:49 lr: 0.001266 loss: 3.131508 (3.280653) time: 0.974272 data: 0.000188 max mem: 18817 Epoch: [124/300] [1250/1251] eta: 0:00:00 lr: 0.001265 loss: 3.159323 (3.281724) time: 0.928127 data: 0.000756 max mem: 18817 Epoch: [124/300] Total time: 0:20:04 (0.962574 s / it) Averaged stats: lr: 0.001265 loss: 3.159323 (3.286511) Test: [ 0/49] eta: 0:01:29 loss: 0.650191 (0.650191) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.823620 data: 1.399477 max mem: 18817 Test: [10/49] eta: 0:00:23 loss: 0.760301 (0.792495) acc1: 81.250000 (81.392045) acc5: 96.875000 (96.022727) time: 0.612108 data: 0.127364 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.849590 (0.836104) acc1: 78.125000 (79.910714) acc5: 95.312500 (95.684524) time: 0.462786 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.880831 (0.842614) acc1: 78.125000 (79.385081) acc5: 96.875000 (95.866935) time: 0.398639 data: 0.000147 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.868479 (0.862902) acc1: 78.125000 (79.306402) acc5: 95.312500 (95.655488) time: 0.359972 data: 0.000150 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.933929 (0.862752) acc1: 78.125000 (79.520000) acc5: 95.312500 (95.680000) time: 0.354923 data: 0.000115 max mem: 18817 Test: Total time: 0:00:21 (0.433834 s / it) * Acc@1 79.800 Acc@5 95.374 loss 0.871 Max accuracy: 79.80% Epoch: [125/300] [ 0/1251] eta: 0:41:42 lr: 0.001265 loss: 3.437855 (3.437855) time: 2.000333 data: 1.109864 max mem: 18817 Epoch: [125/300] [ 50/1251] eta: 0:19:40 lr: 0.001265 loss: 3.312014 (3.200375) time: 0.988946 data: 0.000152 max mem: 18817 Epoch: [125/300] [ 100/1251] eta: 0:18:38 lr: 0.001264 loss: 3.133815 (3.215573) time: 1.004820 data: 0.000173 max mem: 18817 Epoch: [125/300] [ 150/1251] eta: 0:17:39 lr: 0.001264 loss: 3.401623 (3.271578) time: 0.967579 data: 0.000173 max mem: 18817 Epoch: [125/300] [ 200/1251] eta: 0:16:46 lr: 0.001264 loss: 3.335300 (3.240227) time: 0.922050 data: 0.000182 max mem: 18817 Epoch: [125/300] [ 250/1251] eta: 0:16:01 lr: 0.001263 loss: 3.242478 (3.242717) time: 0.927912 data: 0.000164 max mem: 18817 Epoch: [125/300] [ 300/1251] eta: 0:15:15 lr: 0.001263 loss: 3.246171 (3.269863) time: 0.976087 data: 0.000168 max mem: 18817 Epoch: [125/300] [ 350/1251] eta: 0:14:28 lr: 0.001262 loss: 3.191847 (3.265203) time: 1.040274 data: 0.000172 max mem: 18817 Epoch: [125/300] [ 400/1251] eta: 0:13:38 lr: 0.001262 loss: 3.465208 (3.258979) time: 0.973998 data: 0.000180 max mem: 18817 Epoch: [125/300] [ 450/1251] eta: 0:12:49 lr: 0.001262 loss: 3.399533 (3.262484) time: 0.919701 data: 0.000167 max mem: 18817 Epoch: [125/300] [ 500/1251] eta: 0:12:02 lr: 0.001261 loss: 3.352907 (3.250389) time: 0.938715 data: 0.000189 max mem: 18817 Epoch: [125/300] [ 550/1251] eta: 0:11:14 lr: 0.001261 loss: 3.484308 (3.254994) time: 0.998181 data: 0.000171 max mem: 18817 Epoch: [125/300] [ 600/1251] eta: 0:10:26 lr: 0.001260 loss: 3.304461 (3.253967) time: 1.012269 data: 0.000178 max mem: 18817 Epoch: [125/300] [ 650/1251] eta: 0:09:37 lr: 0.001260 loss: 3.031214 (3.245795) time: 0.970622 data: 0.000165 max mem: 18817 Epoch: [125/300] [ 700/1251] eta: 0:08:49 lr: 0.001260 loss: 3.139926 (3.241853) time: 0.917338 data: 0.000166 max mem: 18817 Epoch: [125/300] [ 750/1251] eta: 0:08:01 lr: 0.001259 loss: 3.329666 (3.244184) time: 0.925218 data: 0.000179 max mem: 18817 Epoch: [125/300] [ 800/1251] eta: 0:07:13 lr: 0.001259 loss: 3.106714 (3.244058) time: 0.974321 data: 0.000181 max mem: 18817 Epoch: [125/300] [ 850/1251] eta: 0:06:25 lr: 0.001258 loss: 3.398601 (3.240042) time: 1.047830 data: 0.000167 max mem: 18817 Epoch: [125/300] [ 900/1251] eta: 0:05:37 lr: 0.001258 loss: 3.529434 (3.246511) time: 0.980121 data: 0.000163 max mem: 18817 Epoch: [125/300] [ 950/1251] eta: 0:04:49 lr: 0.001258 loss: 3.208735 (3.245649) time: 0.926740 data: 0.000169 max mem: 18817 Epoch: [125/300] [1000/1251] eta: 0:04:01 lr: 0.001257 loss: 3.555470 (3.249975) time: 0.925815 data: 0.000156 max mem: 18817 Epoch: [125/300] [1050/1251] eta: 0:03:13 lr: 0.001257 loss: 3.140810 (3.249420) time: 0.982001 data: 0.000165 max mem: 18817 Epoch: [125/300] [1100/1251] eta: 0:02:25 lr: 0.001256 loss: 3.075731 (3.247170) time: 1.046564 data: 0.000184 max mem: 18817 Epoch: [125/300] [1150/1251] eta: 0:01:37 lr: 0.001256 loss: 3.408938 (3.251403) time: 0.975499 data: 0.000169 max mem: 18817 Epoch: [125/300] [1200/1251] eta: 0:00:49 lr: 0.001256 loss: 3.316995 (3.256271) time: 0.927621 data: 0.000172 max mem: 18817 Epoch: [125/300] [1250/1251] eta: 0:00:00 lr: 0.001255 loss: 3.395109 (3.258489) time: 0.931769 data: 0.000748 max mem: 18817 Epoch: [125/300] Total time: 0:20:03 (0.961878 s / it) Averaged stats: lr: 0.001255 loss: 3.395109 (3.260201) Test: [ 0/49] eta: 0:01:16 loss: 0.602824 (0.602824) acc1: 89.062500 (89.062500) acc5: 96.875000 (96.875000) time: 1.569477 data: 1.160213 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.747145 (0.776066) acc1: 79.687500 (80.823864) acc5: 96.875000 (96.306818) time: 0.479677 data: 0.105622 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.837807 (0.830303) acc1: 79.687500 (79.613095) acc5: 95.312500 (95.461310) time: 0.366841 data: 0.000152 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.837807 (0.829015) acc1: 78.125000 (79.284274) acc5: 95.312500 (95.463710) time: 0.363138 data: 0.000139 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.850505 (0.844619) acc1: 79.687500 (79.268293) acc5: 95.312500 (95.274390) time: 0.360742 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.857943 (0.845551) acc1: 79.687500 (79.296000) acc5: 95.312500 (95.232000) time: 0.449144 data: 0.000108 max mem: 18817 Test: Total time: 0:00:20 (0.428125 s / it) * Acc@1 79.892 Acc@5 95.320 loss 0.841 Max accuracy: 79.89% Epoch: [126/300] [ 0/1251] eta: 0:41:22 lr: 0.001255 loss: 3.740695 (3.740695) time: 1.984328 data: 1.054793 max mem: 18817 Epoch: [126/300] [ 50/1251] eta: 0:19:20 lr: 0.001255 loss: 3.507965 (3.329107) time: 0.977420 data: 0.000170 max mem: 18817 Epoch: [126/300] [ 100/1251] eta: 0:18:39 lr: 0.001254 loss: 3.478786 (3.296612) time: 0.980313 data: 0.000173 max mem: 18817 Epoch: [126/300] [ 150/1251] eta: 0:17:42 lr: 0.001254 loss: 3.078288 (3.284276) time: 0.925611 data: 0.000160 max mem: 18817 Epoch: [126/300] [ 200/1251] eta: 0:16:54 lr: 0.001254 loss: 3.442818 (3.296742) time: 0.925927 data: 0.000173 max mem: 18817 Epoch: [126/300] [ 250/1251] eta: 0:16:05 lr: 0.001253 loss: 3.345654 (3.280291) time: 0.983621 data: 0.000171 max mem: 18817 Epoch: [126/300] [ 300/1251] eta: 0:15:13 lr: 0.001253 loss: 3.279226 (3.296384) time: 0.975907 data: 0.000166 max mem: 18817 Epoch: [126/300] [ 350/1251] eta: 0:14:28 lr: 0.001252 loss: 3.388526 (3.293235) time: 0.977940 data: 0.000165 max mem: 18817 Epoch: [126/300] [ 400/1251] eta: 0:13:37 lr: 0.001252 loss: 3.413281 (3.288834) time: 0.930853 data: 0.000173 max mem: 18817 Epoch: [126/300] [ 450/1251] eta: 0:12:50 lr: 0.001252 loss: 3.156979 (3.287290) time: 0.920576 data: 0.000185 max mem: 18817 Epoch: [126/300] [ 500/1251] eta: 0:12:02 lr: 0.001251 loss: 3.270816 (3.286549) time: 0.970214 data: 0.000156 max mem: 18817 Epoch: [126/300] [ 550/1251] eta: 0:11:13 lr: 0.001251 loss: 3.355223 (3.272765) time: 0.980666 data: 0.000181 max mem: 18817 Epoch: [126/300] [ 600/1251] eta: 0:10:26 lr: 0.001250 loss: 3.474022 (3.273842) time: 0.978314 data: 0.000169 max mem: 18817 Epoch: [126/300] [ 650/1251] eta: 0:09:38 lr: 0.001250 loss: 3.480566 (3.274606) time: 0.933021 data: 0.000170 max mem: 18817 Epoch: [126/300] [ 700/1251] eta: 0:08:50 lr: 0.001250 loss: 3.606421 (3.276909) time: 0.927479 data: 0.000171 max mem: 18817 Epoch: [126/300] [ 750/1251] eta: 0:08:02 lr: 0.001249 loss: 3.221386 (3.272587) time: 0.992114 data: 0.000160 max mem: 18817 Epoch: [126/300] [ 800/1251] eta: 0:07:14 lr: 0.001249 loss: 3.323086 (3.269968) time: 1.027740 data: 0.000166 max mem: 18817 Epoch: [126/300] [ 850/1251] eta: 0:06:26 lr: 0.001248 loss: 3.470498 (3.276193) time: 0.978583 data: 0.000175 max mem: 18817 Epoch: [126/300] [ 900/1251] eta: 0:05:37 lr: 0.001248 loss: 3.055761 (3.273443) time: 0.941093 data: 0.000175 max mem: 18817 Epoch: [126/300] [ 950/1251] eta: 0:04:49 lr: 0.001248 loss: 3.284799 (3.269422) time: 0.928703 data: 0.000167 max mem: 18817 Epoch: [126/300] [1000/1251] eta: 0:04:01 lr: 0.001247 loss: 3.614913 (3.271670) time: 0.979129 data: 0.000157 max mem: 18817 Epoch: [126/300] [1050/1251] eta: 0:03:13 lr: 0.001247 loss: 2.800964 (3.269871) time: 1.028078 data: 0.000163 max mem: 18817 Epoch: [126/300] [1100/1251] eta: 0:02:25 lr: 0.001246 loss: 3.184056 (3.267830) time: 0.980994 data: 0.000155 max mem: 18817 Epoch: [126/300] [1150/1251] eta: 0:01:37 lr: 0.001246 loss: 3.320867 (3.267572) time: 0.921279 data: 0.000173 max mem: 18817 Epoch: [126/300] [1200/1251] eta: 0:00:49 lr: 0.001246 loss: 3.596854 (3.265212) time: 0.925812 data: 0.000162 max mem: 18817 Epoch: [126/300] [1250/1251] eta: 0:00:00 lr: 0.001245 loss: 3.480996 (3.271318) time: 0.980521 data: 0.000749 max mem: 18817 Epoch: [126/300] Total time: 0:20:03 (0.962176 s / it) Averaged stats: lr: 0.001245 loss: 3.480996 (3.270788) Test: [ 0/49] eta: 0:01:15 loss: 0.585765 (0.585765) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.539523 data: 1.105365 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.728109 (0.770094) acc1: 81.250000 (81.676136) acc5: 96.875000 (95.454545) time: 0.476053 data: 0.100638 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.817037 (0.817448) acc1: 79.687500 (80.208333) acc5: 95.312500 (95.163690) time: 0.371824 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.869352 (0.824066) acc1: 78.125000 (79.687500) acc5: 95.312500 (95.362903) time: 0.378283 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.859258 (0.834396) acc1: 78.125000 (79.649390) acc5: 95.312500 (95.464939) time: 0.370769 data: 0.000131 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.872351 (0.839461) acc1: 78.125000 (79.456000) acc5: 95.312500 (95.424000) time: 0.361593 data: 0.000110 max mem: 18817 Test: Total time: 0:00:19 (0.393908 s / it) * Acc@1 79.968 Acc@5 95.362 loss 0.835 Max accuracy: 79.97% Epoch: [127/300] [ 0/1251] eta: 0:42:55 lr: 0.001245 loss: 4.106450 (4.106450) time: 2.058921 data: 1.157038 max mem: 18817 Epoch: [127/300] [ 50/1251] eta: 0:19:23 lr: 0.001245 loss: 2.996966 (3.238383) time: 0.981824 data: 0.000151 max mem: 18817 Epoch: [127/300] [ 100/1251] eta: 0:18:27 lr: 0.001244 loss: 3.236924 (3.205573) time: 0.934421 data: 0.000188 max mem: 18817 Epoch: [127/300] [ 150/1251] eta: 0:17:40 lr: 0.001244 loss: 3.353618 (3.208224) time: 0.925881 data: 0.000160 max mem: 18817 Epoch: [127/300] [ 200/1251] eta: 0:16:53 lr: 0.001244 loss: 3.519021 (3.261368) time: 0.991370 data: 0.000155 max mem: 18817 Epoch: [127/300] [ 250/1251] eta: 0:16:05 lr: 0.001243 loss: 3.376822 (3.265690) time: 1.011699 data: 0.000166 max mem: 18817 Epoch: [127/300] [ 300/1251] eta: 0:15:17 lr: 0.001243 loss: 3.312366 (3.253444) time: 0.990263 data: 0.000152 max mem: 18817 Epoch: [127/300] [ 350/1251] eta: 0:14:27 lr: 0.001242 loss: 3.269855 (3.254140) time: 0.930891 data: 0.000160 max mem: 18817 Epoch: [127/300] [ 400/1251] eta: 0:13:40 lr: 0.001242 loss: 3.299205 (3.264979) time: 0.937840 data: 0.000187 max mem: 18817 Epoch: [127/300] [ 450/1251] eta: 0:12:53 lr: 0.001242 loss: 3.315481 (3.258173) time: 0.995806 data: 0.000193 max mem: 18817 Epoch: [127/300] [ 500/1251] eta: 0:12:06 lr: 0.001241 loss: 3.288805 (3.253233) time: 1.054115 data: 0.000159 max mem: 18817 Epoch: [127/300] [ 550/1251] eta: 0:11:17 lr: 0.001241 loss: 3.299976 (3.245984) time: 0.986561 data: 0.000174 max mem: 18817 Epoch: [127/300] [ 600/1251] eta: 0:10:28 lr: 0.001240 loss: 3.232962 (3.246040) time: 0.910815 data: 0.000172 max mem: 18817 Epoch: [127/300] [ 650/1251] eta: 0:09:40 lr: 0.001240 loss: 3.419856 (3.252127) time: 0.938424 data: 0.000177 max mem: 18817 Epoch: [127/300] [ 700/1251] eta: 0:08:52 lr: 0.001239 loss: 3.539405 (3.254524) time: 0.980144 data: 0.000171 max mem: 18817 Epoch: [127/300] [ 750/1251] eta: 0:08:04 lr: 0.001239 loss: 3.474839 (3.258358) time: 1.067551 data: 0.000176 max mem: 18817 Epoch: [127/300] [ 800/1251] eta: 0:07:15 lr: 0.001239 loss: 3.388606 (3.251618) time: 0.979135 data: 0.000176 max mem: 18817 Epoch: [127/300] [ 850/1251] eta: 0:06:26 lr: 0.001238 loss: 3.438961 (3.253037) time: 0.914294 data: 0.000171 max mem: 18817 Epoch: [127/300] [ 900/1251] eta: 0:05:38 lr: 0.001238 loss: 3.206628 (3.244941) time: 0.942722 data: 0.000161 max mem: 18817 Epoch: [127/300] [ 950/1251] eta: 0:04:50 lr: 0.001237 loss: 3.463688 (3.250840) time: 0.996070 data: 0.000185 max mem: 18817 Epoch: [127/300] [1000/1251] eta: 0:04:02 lr: 0.001237 loss: 3.377710 (3.252480) time: 1.048224 data: 0.000187 max mem: 18817 Epoch: [127/300] [1050/1251] eta: 0:03:13 lr: 0.001237 loss: 3.391041 (3.248293) time: 0.959888 data: 0.000184 max mem: 18817 Epoch: [127/300] [1100/1251] eta: 0:02:25 lr: 0.001236 loss: 3.299818 (3.246729) time: 0.925971 data: 0.000178 max mem: 18817 Epoch: [127/300] [1150/1251] eta: 0:01:37 lr: 0.001236 loss: 3.245585 (3.241690) time: 0.930208 data: 0.000177 max mem: 18817 Epoch: [127/300] [1200/1251] eta: 0:00:49 lr: 0.001235 loss: 3.412080 (3.243308) time: 0.996723 data: 0.000167 max mem: 18817 Epoch: [127/300] [1250/1251] eta: 0:00:00 lr: 0.001235 loss: 3.327917 (3.243573) time: 1.006371 data: 0.000759 max mem: 18817 Epoch: [127/300] Total time: 0:20:04 (0.963148 s / it) Averaged stats: lr: 0.001235 loss: 3.327917 (3.242441) Test: [ 0/49] eta: 0:01:30 loss: 0.648925 (0.648925) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.849492 data: 1.453475 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.696653 (0.766333) acc1: 81.250000 (81.250000) acc5: 96.875000 (95.738636) time: 0.517472 data: 0.132280 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.756357 (0.801416) acc1: 79.687500 (80.357143) acc5: 95.312500 (95.610119) time: 0.376201 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.817231 (0.808697) acc1: 78.125000 (80.141129) acc5: 95.312500 (95.766129) time: 0.366247 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.834342 (0.828377) acc1: 79.687500 (79.992378) acc5: 95.312500 (95.579268) time: 0.364966 data: 0.000141 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.862091 (0.832619) acc1: 79.687500 (79.616000) acc5: 95.312500 (95.648000) time: 0.359296 data: 0.000118 max mem: 18817 Test: Total time: 0:00:19 (0.400723 s / it) * Acc@1 79.744 Acc@5 95.286 loss 0.850 Max accuracy: 79.97% Epoch: [128/300] [ 0/1251] eta: 0:56:38 lr: 0.001235 loss: 2.869868 (2.869868) time: 2.716809 data: 1.201761 max mem: 18817 Epoch: [128/300] [ 50/1251] eta: 0:19:37 lr: 0.001235 loss: 3.440263 (3.163442) time: 0.923560 data: 0.000160 max mem: 18817 Epoch: [128/300] [ 100/1251] eta: 0:18:48 lr: 0.001234 loss: 3.420901 (3.264109) time: 0.951613 data: 0.000167 max mem: 18817 Epoch: [128/300] [ 150/1251] eta: 0:17:58 lr: 0.001234 loss: 3.397071 (3.265302) time: 1.003404 data: 0.000166 max mem: 18817 Epoch: [128/300] [ 200/1251] eta: 0:16:58 lr: 0.001233 loss: 3.481359 (3.290538) time: 0.970233 data: 0.000181 max mem: 18817 Epoch: [128/300] [ 250/1251] eta: 0:16:13 lr: 0.001233 loss: 3.354320 (3.268157) time: 0.987661 data: 0.000161 max mem: 18817 Epoch: [128/300] [ 300/1251] eta: 0:15:20 lr: 0.001233 loss: 3.301608 (3.252122) time: 0.947382 data: 0.000157 max mem: 18817 Epoch: [128/300] [ 350/1251] eta: 0:14:32 lr: 0.001232 loss: 3.212332 (3.240351) time: 0.936510 data: 0.000164 max mem: 18817 Epoch: [128/300] [ 400/1251] eta: 0:13:44 lr: 0.001232 loss: 3.489031 (3.258269) time: 0.995492 data: 0.000166 max mem: 18817 Epoch: [128/300] [ 450/1251] eta: 0:12:53 lr: 0.001231 loss: 3.306949 (3.246838) time: 0.970243 data: 0.000173 max mem: 18817 Epoch: [128/300] [ 500/1251] eta: 0:12:05 lr: 0.001231 loss: 3.310444 (3.250920) time: 0.998713 data: 0.000165 max mem: 18817 Epoch: [128/300] [ 550/1251] eta: 0:11:16 lr: 0.001231 loss: 3.473739 (3.254866) time: 0.932999 data: 0.000155 max mem: 18817 Epoch: [128/300] [ 600/1251] eta: 0:10:28 lr: 0.001230 loss: 3.156500 (3.251344) time: 0.928330 data: 0.000173 max mem: 18817 Epoch: [128/300] [ 650/1251] eta: 0:09:40 lr: 0.001230 loss: 3.080605 (3.246096) time: 0.989042 data: 0.000160 max mem: 18817 Epoch: [128/300] [ 700/1251] eta: 0:08:51 lr: 0.001229 loss: 3.406994 (3.244172) time: 0.965204 data: 0.000166 max mem: 18817 Epoch: [128/300] [ 750/1251] eta: 0:08:03 lr: 0.001229 loss: 3.216505 (3.242961) time: 1.000855 data: 0.000185 max mem: 18817 Epoch: [128/300] [ 800/1251] eta: 0:07:15 lr: 0.001229 loss: 3.466002 (3.240000) time: 0.936186 data: 0.000172 max mem: 18817 Epoch: [128/300] [ 850/1251] eta: 0:06:27 lr: 0.001228 loss: 3.213533 (3.243300) time: 0.924656 data: 0.000176 max mem: 18817 Epoch: [128/300] [ 900/1251] eta: 0:05:38 lr: 0.001228 loss: 2.941712 (3.240204) time: 0.990586 data: 0.000162 max mem: 18817 Epoch: [128/300] [ 950/1251] eta: 0:04:50 lr: 0.001227 loss: 3.233024 (3.240356) time: 1.061645 data: 0.000172 max mem: 18817 Epoch: [128/300] [1000/1251] eta: 0:04:02 lr: 0.001227 loss: 3.292707 (3.243409) time: 0.969069 data: 0.000153 max mem: 18817 Epoch: [128/300] [1050/1251] eta: 0:03:13 lr: 0.001227 loss: 3.466375 (3.246028) time: 0.920681 data: 0.000176 max mem: 18817 Epoch: [128/300] [1100/1251] eta: 0:02:25 lr: 0.001226 loss: 3.548944 (3.246064) time: 0.934114 data: 0.000197 max mem: 18817 Epoch: [128/300] [1150/1251] eta: 0:01:37 lr: 0.001226 loss: 3.264115 (3.246487) time: 0.976396 data: 0.000178 max mem: 18817 Epoch: [128/300] [1200/1251] eta: 0:00:49 lr: 0.001225 loss: 3.270432 (3.248194) time: 0.961836 data: 0.000178 max mem: 18817 Epoch: [128/300] [1250/1251] eta: 0:00:00 lr: 0.001225 loss: 2.933698 (3.246281) time: 0.946757 data: 0.000751 max mem: 18817 Epoch: [128/300] Total time: 0:20:04 (0.962782 s / it) Averaged stats: lr: 0.001225 loss: 2.933698 (3.244851) Test: [ 0/49] eta: 0:01:16 loss: 0.733410 (0.733410) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.312500) time: 1.565442 data: 1.118134 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.750893 (0.810598) acc1: 81.250000 (80.823864) acc5: 95.312500 (96.022727) time: 0.478487 data: 0.101825 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.851791 (0.842490) acc1: 79.687500 (79.464286) acc5: 95.312500 (95.461310) time: 0.366460 data: 0.000160 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.851791 (0.836934) acc1: 79.687500 (79.435484) acc5: 95.312500 (95.564516) time: 0.362570 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.848662 (0.849900) acc1: 78.125000 (79.115854) acc5: 95.312500 (95.579268) time: 0.360077 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.875094 (0.852785) acc1: 78.125000 (79.040000) acc5: 95.312500 (95.456000) time: 0.380484 data: 0.000103 max mem: 18817 Test: Total time: 0:00:19 (0.398710 s / it) * Acc@1 79.786 Acc@5 95.430 loss 0.844 Max accuracy: 79.97% Epoch: [129/300] [ 0/1251] eta: 0:40:29 lr: 0.001225 loss: 3.778840 (3.778840) time: 1.941943 data: 1.033092 max mem: 18817 Epoch: [129/300] [ 50/1251] eta: 0:19:28 lr: 0.001225 loss: 2.954240 (3.228101) time: 0.921447 data: 0.000144 max mem: 18817 Epoch: [129/300] [ 100/1251] eta: 0:18:39 lr: 0.001224 loss: 3.209154 (3.186510) time: 0.931681 data: 0.000191 max mem: 18817 Epoch: [129/300] [ 150/1251] eta: 0:17:49 lr: 0.001224 loss: 3.208874 (3.210242) time: 1.002234 data: 0.000173 max mem: 18817 Epoch: [129/300] [ 200/1251] eta: 0:17:01 lr: 0.001223 loss: 3.467417 (3.229406) time: 1.005984 data: 0.000176 max mem: 18817 Epoch: [129/300] [ 250/1251] eta: 0:16:06 lr: 0.001223 loss: 3.193300 (3.245448) time: 0.962138 data: 0.000173 max mem: 18817 Epoch: [129/300] [ 300/1251] eta: 0:15:14 lr: 0.001222 loss: 3.167824 (3.247427) time: 0.915315 data: 0.000150 max mem: 18817 Epoch: [129/300] [ 350/1251] eta: 0:14:27 lr: 0.001222 loss: 3.368102 (3.253406) time: 0.929030 data: 0.000172 max mem: 18817 Epoch: [129/300] [ 400/1251] eta: 0:13:39 lr: 0.001222 loss: 3.368606 (3.272396) time: 0.971026 data: 0.000172 max mem: 18817 Epoch: [129/300] [ 450/1251] eta: 0:12:52 lr: 0.001221 loss: 3.451082 (3.262268) time: 0.999688 data: 0.000168 max mem: 18817 Epoch: [129/300] [ 500/1251] eta: 0:12:03 lr: 0.001221 loss: 3.144477 (3.268759) time: 0.988277 data: 0.000191 max mem: 18817 Epoch: [129/300] [ 550/1251] eta: 0:11:13 lr: 0.001220 loss: 3.237868 (3.260262) time: 0.914447 data: 0.000167 max mem: 18817 Epoch: [129/300] [ 600/1251] eta: 0:10:26 lr: 0.001220 loss: 3.375618 (3.270536) time: 0.930977 data: 0.000181 max mem: 18817 Epoch: [129/300] [ 650/1251] eta: 0:09:39 lr: 0.001220 loss: 3.443323 (3.272503) time: 0.982427 data: 0.000159 max mem: 18817 Epoch: [129/300] [ 700/1251] eta: 0:08:51 lr: 0.001219 loss: 3.359380 (3.268032) time: 0.984616 data: 0.000162 max mem: 18817 Epoch: [129/300] [ 750/1251] eta: 0:08:02 lr: 0.001219 loss: 3.279391 (3.268057) time: 0.973146 data: 0.000176 max mem: 18817 Epoch: [129/300] [ 800/1251] eta: 0:07:13 lr: 0.001218 loss: 3.432184 (3.271562) time: 0.903293 data: 0.000169 max mem: 18817 Epoch: [129/300] [ 850/1251] eta: 0:06:25 lr: 0.001218 loss: 3.209480 (3.268893) time: 0.914499 data: 0.000160 max mem: 18817 Epoch: [129/300] [ 900/1251] eta: 0:05:37 lr: 0.001218 loss: 3.540883 (3.276423) time: 1.020052 data: 0.000205 max mem: 18817 Epoch: [129/300] [ 950/1251] eta: 0:04:49 lr: 0.001217 loss: 3.206323 (3.278421) time: 1.009925 data: 0.000168 max mem: 18817 Epoch: [129/300] [1000/1251] eta: 0:04:01 lr: 0.001217 loss: 3.264736 (3.280054) time: 0.966328 data: 0.000179 max mem: 18817 Epoch: [129/300] [1050/1251] eta: 0:03:13 lr: 0.001216 loss: 3.192310 (3.285093) time: 0.914837 data: 0.000163 max mem: 18817 Epoch: [129/300] [1100/1251] eta: 0:02:25 lr: 0.001216 loss: 3.265937 (3.287104) time: 0.937922 data: 0.000184 max mem: 18817 Epoch: [129/300] [1150/1251] eta: 0:01:37 lr: 0.001216 loss: 2.934217 (3.279946) time: 0.977153 data: 0.000187 max mem: 18817 Epoch: [129/300] [1200/1251] eta: 0:00:49 lr: 0.001215 loss: 3.500381 (3.282256) time: 1.029051 data: 0.000175 max mem: 18817 Epoch: [129/300] [1250/1251] eta: 0:00:00 lr: 0.001215 loss: 3.490963 (3.281376) time: 0.949385 data: 0.000752 max mem: 18817 Epoch: [129/300] Total time: 0:20:03 (0.961665 s / it) Averaged stats: lr: 0.001215 loss: 3.490963 (3.270369) Test: [ 0/49] eta: 0:01:22 loss: 0.622053 (0.622053) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.682166 data: 1.244210 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.769264 (0.831176) acc1: 81.250000 (81.107955) acc5: 95.312500 (95.596591) time: 0.488454 data: 0.113257 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.871941 (0.855916) acc1: 78.125000 (79.985119) acc5: 95.312500 (95.386905) time: 0.392884 data: 0.000149 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.871941 (0.844465) acc1: 78.125000 (80.090726) acc5: 95.312500 (95.463710) time: 0.416631 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.839939 (0.852111) acc1: 79.687500 (80.068598) acc5: 95.312500 (95.541159) time: 0.402834 data: 0.000123 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.839939 (0.850507) acc1: 79.687500 (80.064000) acc5: 96.875000 (95.680000) time: 0.372952 data: 0.000102 max mem: 18817 Test: Total time: 0:00:20 (0.419735 s / it) * Acc@1 79.962 Acc@5 95.398 loss 0.847 Max accuracy: 79.97% Epoch: [130/300] [ 0/1251] eta: 0:40:39 lr: 0.001215 loss: 3.711094 (3.711094) time: 1.949833 data: 1.054456 max mem: 18817 Epoch: [130/300] [ 50/1251] eta: 0:19:37 lr: 0.001214 loss: 3.386633 (3.355766) time: 0.918920 data: 0.000163 max mem: 18817 Epoch: [130/300] [ 100/1251] eta: 0:18:45 lr: 0.001214 loss: 3.018491 (3.257915) time: 0.983376 data: 0.000160 max mem: 18817 Epoch: [130/300] [ 150/1251] eta: 0:17:54 lr: 0.001214 loss: 3.267655 (3.279975) time: 1.018445 data: 0.000179 max mem: 18817 Epoch: [130/300] [ 200/1251] eta: 0:17:01 lr: 0.001213 loss: 3.322021 (3.284479) time: 0.981022 data: 0.000161 max mem: 18817 Epoch: [130/300] [ 250/1251] eta: 0:16:07 lr: 0.001213 loss: 2.811637 (3.259548) time: 0.938856 data: 0.000161 max mem: 18817 Epoch: [130/300] [ 300/1251] eta: 0:15:22 lr: 0.001212 loss: 3.680189 (3.286779) time: 0.939105 data: 0.000145 max mem: 18817 Epoch: [130/300] [ 350/1251] eta: 0:14:33 lr: 0.001212 loss: 3.118590 (3.273256) time: 0.998562 data: 0.000168 max mem: 18817 Epoch: [130/300] [ 400/1251] eta: 0:13:46 lr: 0.001212 loss: 3.285876 (3.273094) time: 1.040558 data: 0.000163 max mem: 18817 Epoch: [130/300] [ 450/1251] eta: 0:12:56 lr: 0.001211 loss: 3.562506 (3.273185) time: 0.982573 data: 0.000174 max mem: 18817 Epoch: [130/300] [ 500/1251] eta: 0:12:07 lr: 0.001211 loss: 3.355278 (3.274372) time: 0.933144 data: 0.000177 max mem: 18817 Epoch: [130/300] [ 550/1251] eta: 0:11:19 lr: 0.001210 loss: 3.524553 (3.281548) time: 0.936909 data: 0.000183 max mem: 18817 Epoch: [130/300] [ 600/1251] eta: 0:10:30 lr: 0.001210 loss: 3.299822 (3.281415) time: 0.970604 data: 0.000159 max mem: 18817 Epoch: [130/300] [ 650/1251] eta: 0:09:42 lr: 0.001210 loss: 3.148306 (3.275677) time: 1.069051 data: 0.000155 max mem: 18817 Epoch: [130/300] [ 700/1251] eta: 0:08:53 lr: 0.001209 loss: 3.246357 (3.269294) time: 0.987332 data: 0.000171 max mem: 18817 Epoch: [130/300] [ 750/1251] eta: 0:08:04 lr: 0.001209 loss: 3.473304 (3.274390) time: 0.924956 data: 0.000164 max mem: 18817 Epoch: [130/300] [ 800/1251] eta: 0:07:16 lr: 0.001208 loss: 3.139627 (3.271750) time: 0.927716 data: 0.000170 max mem: 18817 Epoch: [130/300] [ 850/1251] eta: 0:06:27 lr: 0.001208 loss: 3.387794 (3.266785) time: 0.980755 data: 0.000169 max mem: 18817 Epoch: [130/300] [ 900/1251] eta: 0:05:39 lr: 0.001207 loss: 3.409199 (3.271695) time: 1.033353 data: 0.000177 max mem: 18817 Epoch: [130/300] [ 950/1251] eta: 0:04:50 lr: 0.001207 loss: 3.405052 (3.277064) time: 0.968847 data: 0.000186 max mem: 18817 Epoch: [130/300] [1000/1251] eta: 0:04:02 lr: 0.001207 loss: 3.261373 (3.279627) time: 0.920650 data: 0.000168 max mem: 18817 Epoch: [130/300] [1050/1251] eta: 0:03:13 lr: 0.001206 loss: 3.375233 (3.279355) time: 0.925603 data: 0.000180 max mem: 18817 Epoch: [130/300] [1100/1251] eta: 0:02:25 lr: 0.001206 loss: 3.257312 (3.273632) time: 0.994341 data: 0.000194 max mem: 18817 Epoch: [130/300] [1150/1251] eta: 0:01:37 lr: 0.001205 loss: 3.249267 (3.275513) time: 1.045398 data: 0.000165 max mem: 18817 Epoch: [130/300] [1200/1251] eta: 0:00:49 lr: 0.001205 loss: 3.447488 (3.277992) time: 0.959713 data: 0.000161 max mem: 18817 Epoch: [130/300] [1250/1251] eta: 0:00:00 lr: 0.001205 loss: 3.363595 (3.279229) time: 0.911870 data: 0.000737 max mem: 18817 Epoch: [130/300] Total time: 0:20:06 (0.964140 s / it) Averaged stats: lr: 0.001205 loss: 3.363595 (3.282669) Test: [ 0/49] eta: 0:01:28 loss: 0.566073 (0.566073) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.802629 data: 1.407437 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.812668 (0.802402) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.738636) time: 0.532536 data: 0.128084 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.841072 (0.835001) acc1: 78.125000 (79.910714) acc5: 95.312500 (95.758929) time: 0.477127 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.838689 (0.834701) acc1: 79.687500 (80.393145) acc5: 96.875000 (95.866935) time: 0.455422 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.838859 (0.841619) acc1: 79.687500 (80.411585) acc5: 95.312500 (95.807927) time: 0.359609 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.872104 (0.841149) acc1: 79.687500 (80.608000) acc5: 95.312500 (95.904000) time: 0.355166 data: 0.000099 max mem: 18817 Test: Total time: 0:00:21 (0.438537 s / it) * Acc@1 80.014 Acc@5 95.358 loss 0.858 Max accuracy: 80.01% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0130.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0130.pth Epoch: [131/300] [ 0/1251] eta: 0:55:30 lr: 0.001205 loss: 3.267378 (3.267378) time: 2.662034 data: 1.178809 max mem: 18817 Epoch: [131/300] [ 50/1251] eta: 0:19:53 lr: 0.001204 loss: 3.374954 (3.209078) time: 0.941100 data: 0.000168 max mem: 18817 Epoch: [131/300] [ 100/1251] eta: 0:18:52 lr: 0.001204 loss: 3.325633 (3.196162) time: 0.925396 data: 0.000177 max mem: 18817 Epoch: [131/300] [ 150/1251] eta: 0:17:56 lr: 0.001203 loss: 3.255919 (3.187419) time: 0.984216 data: 0.000177 max mem: 18817 Epoch: [131/300] [ 200/1251] eta: 0:16:58 lr: 0.001203 loss: 3.185625 (3.194024) time: 0.967035 data: 0.000180 max mem: 18817 Epoch: [131/300] [ 250/1251] eta: 0:16:12 lr: 0.001203 loss: 3.355170 (3.189077) time: 0.979913 data: 0.000167 max mem: 18817 Epoch: [131/300] [ 300/1251] eta: 0:15:20 lr: 0.001202 loss: 3.346084 (3.218397) time: 0.934298 data: 0.000154 max mem: 18817 Epoch: [131/300] [ 350/1251] eta: 0:14:34 lr: 0.001202 loss: 3.145684 (3.217327) time: 0.931789 data: 0.000170 max mem: 18817 Epoch: [131/300] [ 400/1251] eta: 0:13:46 lr: 0.001201 loss: 3.218646 (3.213859) time: 0.978758 data: 0.000174 max mem: 18817 Epoch: [131/300] [ 450/1251] eta: 0:12:56 lr: 0.001201 loss: 3.174474 (3.203597) time: 1.026465 data: 0.000172 max mem: 18817 Epoch: [131/300] [ 500/1251] eta: 0:12:07 lr: 0.001201 loss: 3.015296 (3.190598) time: 0.981259 data: 0.000166 max mem: 18817 Epoch: [131/300] [ 550/1251] eta: 0:11:17 lr: 0.001200 loss: 3.434389 (3.199580) time: 0.929175 data: 0.000182 max mem: 18817 Epoch: [131/300] [ 600/1251] eta: 0:10:29 lr: 0.001200 loss: 3.372807 (3.203770) time: 0.923830 data: 0.000178 max mem: 18817 Epoch: [131/300] [ 650/1251] eta: 0:09:40 lr: 0.001199 loss: 3.101322 (3.203440) time: 0.975756 data: 0.000178 max mem: 18817 Epoch: [131/300] [ 700/1251] eta: 0:08:51 lr: 0.001199 loss: 3.340564 (3.208689) time: 0.975385 data: 0.000173 max mem: 18817 Epoch: [131/300] [ 750/1251] eta: 0:08:04 lr: 0.001199 loss: 3.348846 (3.213182) time: 1.003731 data: 0.000184 max mem: 18817 Epoch: [131/300] [ 800/1251] eta: 0:07:15 lr: 0.001198 loss: 3.154057 (3.211936) time: 0.925824 data: 0.000174 max mem: 18817 Epoch: [131/300] [ 850/1251] eta: 0:06:27 lr: 0.001198 loss: 3.435399 (3.219419) time: 0.927923 data: 0.000182 max mem: 18817 Epoch: [131/300] [ 900/1251] eta: 0:05:38 lr: 0.001197 loss: 3.009935 (3.219037) time: 0.966674 data: 0.000187 max mem: 18817 Epoch: [131/300] [ 950/1251] eta: 0:04:49 lr: 0.001197 loss: 3.128829 (3.219182) time: 0.958029 data: 0.000183 max mem: 18817 Epoch: [131/300] [1000/1251] eta: 0:04:01 lr: 0.001196 loss: 3.046476 (3.218279) time: 0.976657 data: 0.000172 max mem: 18817 Epoch: [131/300] [1050/1251] eta: 0:03:13 lr: 0.001196 loss: 3.435251 (3.223429) time: 0.933504 data: 0.000179 max mem: 18817 Epoch: [131/300] [1100/1251] eta: 0:02:25 lr: 0.001196 loss: 3.423547 (3.229976) time: 0.945612 data: 0.000170 max mem: 18817 Epoch: [131/300] [1150/1251] eta: 0:01:37 lr: 0.001195 loss: 3.229568 (3.231064) time: 0.984386 data: 0.000173 max mem: 18817 Epoch: [131/300] [1200/1251] eta: 0:00:49 lr: 0.001195 loss: 3.189579 (3.237508) time: 0.968904 data: 0.000177 max mem: 18817 Epoch: [131/300] [1250/1251] eta: 0:00:00 lr: 0.001194 loss: 3.061707 (3.237067) time: 0.979507 data: 0.000756 max mem: 18817 Epoch: [131/300] Total time: 0:20:04 (0.963171 s / it) Averaged stats: lr: 0.001194 loss: 3.061707 (3.238223) Test: [ 0/49] eta: 0:01:24 loss: 0.629923 (0.629923) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.719426 data: 1.336007 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.734838 (0.814415) acc1: 81.250000 (80.539773) acc5: 96.875000 (95.880682) time: 0.491663 data: 0.121583 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.863677 (0.846699) acc1: 79.687500 (79.761905) acc5: 96.875000 (95.684524) time: 0.365373 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.863677 (0.849091) acc1: 79.687500 (79.586694) acc5: 96.875000 (95.917339) time: 0.362413 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.842290 (0.858917) acc1: 79.687500 (80.144817) acc5: 95.312500 (95.807927) time: 0.388384 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.894819 (0.859217) acc1: 81.250000 (80.416000) acc5: 95.312500 (95.808000) time: 0.383717 data: 0.000103 max mem: 18817 Test: Total time: 0:00:19 (0.402905 s / it) * Acc@1 80.064 Acc@5 95.410 loss 0.872 Max accuracy: 80.06% Epoch: [132/300] [ 0/1251] eta: 0:46:45 lr: 0.001194 loss: 3.406767 (3.406767) time: 2.242892 data: 1.259511 max mem: 18817 Epoch: [132/300] [ 50/1251] eta: 0:20:08 lr: 0.001194 loss: 3.084605 (3.131697) time: 0.923659 data: 0.000170 max mem: 18817 Epoch: [132/300] [ 100/1251] eta: 0:18:58 lr: 0.001194 loss: 2.863229 (3.150645) time: 0.929510 data: 0.000156 max mem: 18817 Epoch: [132/300] [ 150/1251] eta: 0:18:04 lr: 0.001193 loss: 3.141745 (3.171221) time: 0.983133 data: 0.000161 max mem: 18817 Epoch: [132/300] [ 200/1251] eta: 0:17:03 lr: 0.001193 loss: 3.395225 (3.200433) time: 0.962255 data: 0.000187 max mem: 18817 Epoch: [132/300] [ 250/1251] eta: 0:16:10 lr: 0.001192 loss: 3.324805 (3.198118) time: 0.921754 data: 0.000164 max mem: 18817 Epoch: [132/300] [ 300/1251] eta: 0:15:24 lr: 0.001192 loss: 2.976741 (3.194769) time: 0.938255 data: 0.000171 max mem: 18817 Epoch: [132/300] [ 350/1251] eta: 0:14:35 lr: 0.001192 loss: 3.389789 (3.209418) time: 0.943069 data: 0.000188 max mem: 18817 Epoch: [132/300] [ 400/1251] eta: 0:13:46 lr: 0.001191 loss: 3.511737 (3.221244) time: 0.984179 data: 0.000177 max mem: 18817 Epoch: [132/300] [ 450/1251] eta: 0:12:55 lr: 0.001191 loss: 3.335550 (3.215584) time: 0.971210 data: 0.000171 max mem: 18817 Epoch: [132/300] [ 500/1251] eta: 0:12:08 lr: 0.001190 loss: 2.991804 (3.215040) time: 0.979460 data: 0.000167 max mem: 18817 Epoch: [132/300] [ 550/1251] eta: 0:11:18 lr: 0.001190 loss: 3.214604 (3.227771) time: 0.914880 data: 0.000171 max mem: 18817 Epoch: [132/300] [ 600/1251] eta: 0:10:30 lr: 0.001190 loss: 3.185102 (3.232737) time: 0.948955 data: 0.000165 max mem: 18817 Epoch: [132/300] [ 650/1251] eta: 0:09:41 lr: 0.001189 loss: 3.305947 (3.233269) time: 0.988537 data: 0.000160 max mem: 18817 Epoch: [132/300] [ 700/1251] eta: 0:08:52 lr: 0.001189 loss: 3.240167 (3.226612) time: 0.953487 data: 0.000173 max mem: 18817 Epoch: [132/300] [ 750/1251] eta: 0:08:04 lr: 0.001188 loss: 3.344495 (3.224335) time: 0.991246 data: 0.000155 max mem: 18817 Epoch: [132/300] [ 800/1251] eta: 0:07:15 lr: 0.001188 loss: 3.184265 (3.220732) time: 0.919803 data: 0.000162 max mem: 18817 Epoch: [132/300] [ 850/1251] eta: 0:06:27 lr: 0.001188 loss: 3.236656 (3.223013) time: 0.925406 data: 0.000170 max mem: 18817 Epoch: [132/300] [ 900/1251] eta: 0:05:39 lr: 0.001187 loss: 3.313884 (3.222635) time: 0.999285 data: 0.000183 max mem: 18817 Epoch: [132/300] [ 950/1251] eta: 0:04:50 lr: 0.001187 loss: 3.381454 (3.229472) time: 1.020604 data: 0.000173 max mem: 18817 Epoch: [132/300] [1000/1251] eta: 0:04:02 lr: 0.001186 loss: 3.302008 (3.229470) time: 0.991060 data: 0.000168 max mem: 18817 Epoch: [132/300] [1050/1251] eta: 0:03:14 lr: 0.001186 loss: 3.292797 (3.231856) time: 0.927689 data: 0.000165 max mem: 18817 Epoch: [132/300] [1100/1251] eta: 0:02:25 lr: 0.001185 loss: 3.060018 (3.230755) time: 0.927278 data: 0.000173 max mem: 18817 Epoch: [132/300] [1150/1251] eta: 0:01:37 lr: 0.001185 loss: 3.452139 (3.232461) time: 0.980147 data: 0.000180 max mem: 18817 Epoch: [132/300] [1200/1251] eta: 0:00:49 lr: 0.001185 loss: 3.270337 (3.238250) time: 0.994133 data: 0.000177 max mem: 18817 Epoch: [132/300] [1250/1251] eta: 0:00:00 lr: 0.001184 loss: 3.378515 (3.237878) time: 0.969617 data: 0.000728 max mem: 18817 Epoch: [132/300] Total time: 0:20:07 (0.964901 s / it) Averaged stats: lr: 0.001184 loss: 3.378515 (3.242409) Test: [ 0/49] eta: 0:01:28 loss: 0.525918 (0.525918) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.809175 data: 1.407518 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.762111 (0.777100) acc1: 79.687500 (80.823864) acc5: 95.312500 (95.028409) time: 0.500812 data: 0.128087 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.826516 (0.805013) acc1: 78.125000 (80.357143) acc5: 95.312500 (95.461310) time: 0.366674 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.843225 (0.811088) acc1: 81.250000 (80.090726) acc5: 96.875000 (95.715726) time: 0.363291 data: 0.000125 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.801657 (0.820615) acc1: 81.250000 (80.335366) acc5: 95.312500 (95.617378) time: 0.373139 data: 0.000117 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.869315 (0.825340) acc1: 79.687500 (80.352000) acc5: 95.312500 (95.552000) time: 0.373325 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.402174 s / it) * Acc@1 80.094 Acc@5 95.494 loss 0.826 Max accuracy: 80.09% Epoch: [133/300] [ 0/1251] eta: 0:42:33 lr: 0.001184 loss: 3.624552 (3.624552) time: 2.040787 data: 1.162465 max mem: 18817 Epoch: [133/300] [ 50/1251] eta: 0:19:49 lr: 0.001184 loss: 3.368762 (3.137219) time: 0.934497 data: 0.000152 max mem: 18817 Epoch: [133/300] [ 100/1251] eta: 0:18:50 lr: 0.001183 loss: 3.452356 (3.166670) time: 0.999303 data: 0.000171 max mem: 18817 Epoch: [133/300] [ 150/1251] eta: 0:17:54 lr: 0.001183 loss: 3.353836 (3.206632) time: 1.015859 data: 0.000171 max mem: 18817 Epoch: [133/300] [ 200/1251] eta: 0:17:03 lr: 0.001183 loss: 3.301603 (3.217320) time: 0.981092 data: 0.000171 max mem: 18817 Epoch: [133/300] [ 250/1251] eta: 0:16:11 lr: 0.001182 loss: 3.349839 (3.228593) time: 0.932920 data: 0.000165 max mem: 18817 Epoch: [133/300] [ 300/1251] eta: 0:15:24 lr: 0.001182 loss: 3.199962 (3.204885) time: 0.955077 data: 0.000171 max mem: 18817 Epoch: [133/300] [ 350/1251] eta: 0:14:35 lr: 0.001181 loss: 3.222802 (3.218529) time: 0.992692 data: 0.000165 max mem: 18817 Epoch: [133/300] [ 400/1251] eta: 0:13:46 lr: 0.001181 loss: 3.430177 (3.213659) time: 1.026234 data: 0.000164 max mem: 18817 Epoch: [133/300] [ 450/1251] eta: 0:12:56 lr: 0.001181 loss: 3.445910 (3.213779) time: 0.968855 data: 0.000176 max mem: 18817 Epoch: [133/300] [ 500/1251] eta: 0:12:06 lr: 0.001180 loss: 3.431397 (3.219705) time: 0.929691 data: 0.000173 max mem: 18817 Epoch: [133/300] [ 550/1251] eta: 0:11:18 lr: 0.001180 loss: 3.224328 (3.227190) time: 0.935646 data: 0.000157 max mem: 18817 Epoch: [133/300] [ 600/1251] eta: 0:10:30 lr: 0.001179 loss: 3.152120 (3.228825) time: 0.987353 data: 0.000187 max mem: 18817 Epoch: [133/300] [ 650/1251] eta: 0:09:42 lr: 0.001179 loss: 3.466166 (3.220797) time: 1.061806 data: 0.000156 max mem: 18817 Epoch: [133/300] [ 700/1251] eta: 0:08:52 lr: 0.001179 loss: 3.238662 (3.211546) time: 0.983610 data: 0.000166 max mem: 18817 Epoch: [133/300] [ 750/1251] eta: 0:08:04 lr: 0.001178 loss: 3.347728 (3.215973) time: 0.926517 data: 0.000168 max mem: 18817 Epoch: [133/300] [ 800/1251] eta: 0:07:16 lr: 0.001178 loss: 3.292358 (3.215089) time: 0.928513 data: 0.000180 max mem: 18817 Epoch: [133/300] [ 850/1251] eta: 0:06:27 lr: 0.001177 loss: 3.189254 (3.220544) time: 0.971414 data: 0.000197 max mem: 18817 Epoch: [133/300] [ 900/1251] eta: 0:05:39 lr: 0.001177 loss: 3.088229 (3.217990) time: 1.027588 data: 0.000184 max mem: 18817 Epoch: [133/300] [ 950/1251] eta: 0:04:50 lr: 0.001176 loss: 3.187769 (3.213843) time: 0.982281 data: 0.000177 max mem: 18817 Epoch: [133/300] [1000/1251] eta: 0:04:02 lr: 0.001176 loss: 3.359215 (3.215925) time: 0.932154 data: 0.000163 max mem: 18817 Epoch: [133/300] [1050/1251] eta: 0:03:14 lr: 0.001176 loss: 3.360882 (3.214728) time: 0.930183 data: 0.000171 max mem: 18817 Epoch: [133/300] [1100/1251] eta: 0:02:25 lr: 0.001175 loss: 3.066937 (3.215553) time: 1.012259 data: 0.000181 max mem: 18817 Epoch: [133/300] [1150/1251] eta: 0:01:37 lr: 0.001175 loss: 3.377924 (3.218821) time: 1.007194 data: 0.000178 max mem: 18817 Epoch: [133/300] [1200/1251] eta: 0:00:49 lr: 0.001174 loss: 3.192866 (3.218860) time: 0.974743 data: 0.000171 max mem: 18817 Epoch: [133/300] [1250/1251] eta: 0:00:00 lr: 0.001174 loss: 3.228710 (3.219335) time: 0.915485 data: 0.000731 max mem: 18817 Epoch: [133/300] Total time: 0:20:07 (0.965553 s / it) Averaged stats: lr: 0.001174 loss: 3.228710 (3.230355) Test: [ 0/49] eta: 0:01:28 loss: 0.613285 (0.613285) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.796352 data: 1.379070 max mem: 18817 Test: [10/49] eta: 0:00:26 loss: 0.688330 (0.780370) acc1: 82.812500 (82.102273) acc5: 95.312500 (95.312500) time: 0.689863 data: 0.125499 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.819521 (0.814002) acc1: 78.125000 (80.282738) acc5: 95.312500 (95.461310) time: 0.471103 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.860075 (0.810282) acc1: 78.125000 (80.040323) acc5: 95.312500 (95.816532) time: 0.364272 data: 0.000141 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.841945 (0.819149) acc1: 79.687500 (80.259146) acc5: 95.312500 (95.655488) time: 0.361966 data: 0.000143 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.841945 (0.814416) acc1: 81.250000 (80.480000) acc5: 95.312500 (95.744000) time: 0.355930 data: 0.000115 max mem: 18817 Test: Total time: 0:00:21 (0.437937 s / it) * Acc@1 80.180 Acc@5 95.448 loss 0.826 Max accuracy: 80.18% Epoch: [134/300] [ 0/1251] eta: 0:41:34 lr: 0.001174 loss: 3.229012 (3.229012) time: 1.994181 data: 1.096721 max mem: 18817 Epoch: [134/300] [ 50/1251] eta: 0:20:04 lr: 0.001174 loss: 3.079494 (3.198126) time: 1.013036 data: 0.000167 max mem: 18817 Epoch: [134/300] [ 100/1251] eta: 0:19:02 lr: 0.001173 loss: 3.278965 (3.174333) time: 1.057230 data: 0.000177 max mem: 18817 Epoch: [134/300] [ 150/1251] eta: 0:17:57 lr: 0.001173 loss: 2.830444 (3.184700) time: 0.971344 data: 0.000170 max mem: 18817 Epoch: [134/300] [ 200/1251] eta: 0:17:02 lr: 0.001172 loss: 3.433950 (3.168104) time: 0.921862 data: 0.000163 max mem: 18817 Epoch: [134/300] [ 250/1251] eta: 0:16:12 lr: 0.001172 loss: 3.218506 (3.183119) time: 0.935346 data: 0.000173 max mem: 18817 Epoch: [134/300] [ 300/1251] eta: 0:15:23 lr: 0.001172 loss: 3.347210 (3.217538) time: 1.002405 data: 0.000164 max mem: 18817 Epoch: [134/300] [ 350/1251] eta: 0:14:34 lr: 0.001171 loss: 3.413450 (3.197279) time: 1.041536 data: 0.000162 max mem: 18817 Epoch: [134/300] [ 400/1251] eta: 0:13:45 lr: 0.001171 loss: 3.618146 (3.226267) time: 0.989334 data: 0.000173 max mem: 18817 Epoch: [134/300] [ 450/1251] eta: 0:12:55 lr: 0.001170 loss: 2.657934 (3.202516) time: 0.923980 data: 0.000166 max mem: 18817 Epoch: [134/300] [ 500/1251] eta: 0:12:07 lr: 0.001170 loss: 3.337975 (3.207075) time: 0.915540 data: 0.000172 max mem: 18817 Epoch: [134/300] [ 550/1251] eta: 0:11:18 lr: 0.001170 loss: 3.365219 (3.214636) time: 0.981496 data: 0.000175 max mem: 18817 Epoch: [134/300] [ 600/1251] eta: 0:10:31 lr: 0.001169 loss: 3.196036 (3.216498) time: 1.034564 data: 0.000175 max mem: 18817 Epoch: [134/300] [ 650/1251] eta: 0:09:41 lr: 0.001169 loss: 3.552578 (3.233521) time: 0.974878 data: 0.000181 max mem: 18817 Epoch: [134/300] [ 700/1251] eta: 0:08:52 lr: 0.001168 loss: 2.810678 (3.225215) time: 0.924234 data: 0.000151 max mem: 18817 Epoch: [134/300] [ 750/1251] eta: 0:08:04 lr: 0.001168 loss: 3.448004 (3.229315) time: 0.918607 data: 0.000172 max mem: 18817 Epoch: [134/300] [ 800/1251] eta: 0:07:16 lr: 0.001167 loss: 3.097515 (3.231819) time: 0.970764 data: 0.000192 max mem: 18817 Epoch: [134/300] [ 850/1251] eta: 0:06:27 lr: 0.001167 loss: 3.103741 (3.235116) time: 0.990190 data: 0.000186 max mem: 18817 Epoch: [134/300] [ 900/1251] eta: 0:05:38 lr: 0.001167 loss: 3.304223 (3.236338) time: 0.968962 data: 0.000169 max mem: 18817 Epoch: [134/300] [ 950/1251] eta: 0:04:50 lr: 0.001166 loss: 3.161644 (3.231588) time: 0.915477 data: 0.000183 max mem: 18817 Epoch: [134/300] [1000/1251] eta: 0:04:02 lr: 0.001166 loss: 3.061642 (3.228420) time: 0.932242 data: 0.000153 max mem: 18817 Epoch: [134/300] [1050/1251] eta: 0:03:14 lr: 0.001165 loss: 3.366360 (3.231319) time: 1.007890 data: 0.000185 max mem: 18817 Epoch: [134/300] [1100/1251] eta: 0:02:25 lr: 0.001165 loss: 3.225020 (3.231563) time: 0.968521 data: 0.000177 max mem: 18817 Epoch: [134/300] [1150/1251] eta: 0:01:37 lr: 0.001165 loss: 3.517395 (3.234766) time: 0.963680 data: 0.000161 max mem: 18817 Epoch: [134/300] [1200/1251] eta: 0:00:49 lr: 0.001164 loss: 3.014521 (3.231743) time: 0.923685 data: 0.000170 max mem: 18817 Epoch: [134/300] [1250/1251] eta: 0:00:00 lr: 0.001164 loss: 3.027763 (3.232779) time: 0.935939 data: 0.000778 max mem: 18817 Epoch: [134/300] Total time: 0:20:07 (0.965196 s / it) Averaged stats: lr: 0.001164 loss: 3.027763 (3.230390) Test: [ 0/49] eta: 0:01:17 loss: 0.571472 (0.571472) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.579439 data: 1.159605 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.827692 (0.799234) acc1: 78.125000 (80.113636) acc5: 95.312500 (95.170455) time: 0.482466 data: 0.105577 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.846741 (0.824717) acc1: 78.125000 (79.836310) acc5: 95.312500 (95.461310) time: 0.367693 data: 0.000164 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.822387 (0.826164) acc1: 79.687500 (79.687500) acc5: 95.312500 (95.614919) time: 0.373684 data: 0.000143 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.808783 (0.833289) acc1: 79.687500 (79.649390) acc5: 95.312500 (95.464939) time: 0.472256 data: 0.000136 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.881954 (0.837170) acc1: 79.687500 (79.584000) acc5: 95.312500 (95.392000) time: 0.466609 data: 0.000112 max mem: 18817 Test: Total time: 0:00:21 (0.435443 s / it) * Acc@1 80.050 Acc@5 95.412 loss 0.843 Max accuracy: 80.18% Epoch: [135/300] [ 0/1251] eta: 0:41:03 lr: 0.001164 loss: 3.857196 (3.857196) time: 1.969596 data: 1.085483 max mem: 18817 Epoch: [135/300] [ 50/1251] eta: 0:20:03 lr: 0.001163 loss: 3.257273 (3.219001) time: 0.996450 data: 0.000183 max mem: 18817 Epoch: [135/300] [ 100/1251] eta: 0:18:38 lr: 0.001163 loss: 3.424189 (3.253371) time: 0.960284 data: 0.000174 max mem: 18817 Epoch: [135/300] [ 150/1251] eta: 0:17:44 lr: 0.001163 loss: 3.389412 (3.269964) time: 0.916788 data: 0.000168 max mem: 18817 Epoch: [135/300] [ 200/1251] eta: 0:16:55 lr: 0.001162 loss: 3.453632 (3.296236) time: 0.908158 data: 0.000164 max mem: 18817 Epoch: [135/300] [ 250/1251] eta: 0:16:07 lr: 0.001162 loss: 3.335300 (3.271786) time: 0.932102 data: 0.000164 max mem: 18817 Epoch: [135/300] [ 300/1251] eta: 0:15:19 lr: 0.001161 loss: 3.230903 (3.260208) time: 0.996793 data: 0.000158 max mem: 18817 Epoch: [135/300] [ 350/1251] eta: 0:14:28 lr: 0.001161 loss: 2.981870 (3.258428) time: 0.962737 data: 0.000152 max mem: 18817 Epoch: [135/300] [ 400/1251] eta: 0:13:37 lr: 0.001160 loss: 3.332006 (3.257845) time: 0.914904 data: 0.000457 max mem: 18817 Epoch: [135/300] [ 450/1251] eta: 0:12:50 lr: 0.001160 loss: 3.413317 (3.259872) time: 0.924604 data: 0.000175 max mem: 18817 Epoch: [135/300] [ 500/1251] eta: 0:12:03 lr: 0.001160 loss: 3.312097 (3.262323) time: 0.957562 data: 0.000168 max mem: 18817 Epoch: [135/300] [ 550/1251] eta: 0:11:15 lr: 0.001159 loss: 3.484325 (3.267450) time: 0.999711 data: 0.000168 max mem: 18817 Epoch: [135/300] [ 600/1251] eta: 0:10:27 lr: 0.001159 loss: 3.363096 (3.265927) time: 0.973976 data: 0.000159 max mem: 18817 Epoch: [135/300] [ 650/1251] eta: 0:09:38 lr: 0.001158 loss: 3.203969 (3.260222) time: 0.917875 data: 0.000161 max mem: 18817 Epoch: [135/300] [ 700/1251] eta: 0:08:50 lr: 0.001158 loss: 3.310678 (3.259315) time: 0.929098 data: 0.000166 max mem: 18817 Epoch: [135/300] [ 750/1251] eta: 0:08:02 lr: 0.001158 loss: 3.258954 (3.261127) time: 0.932241 data: 0.000174 max mem: 18817 Epoch: [135/300] [ 800/1251] eta: 0:07:14 lr: 0.001157 loss: 3.281914 (3.260504) time: 0.999146 data: 0.000171 max mem: 18817 Epoch: [135/300] [ 850/1251] eta: 0:06:26 lr: 0.001157 loss: 3.322052 (3.260335) time: 0.986975 data: 0.000176 max mem: 18817 Epoch: [135/300] [ 900/1251] eta: 0:05:37 lr: 0.001156 loss: 3.180316 (3.257506) time: 0.905796 data: 0.000167 max mem: 18817 Epoch: [135/300] [ 950/1251] eta: 0:04:49 lr: 0.001156 loss: 3.252238 (3.253199) time: 0.947177 data: 0.000173 max mem: 18817 Epoch: [135/300] [1000/1251] eta: 0:04:01 lr: 0.001156 loss: 3.304112 (3.251689) time: 0.936410 data: 0.000164 max mem: 18817 Epoch: [135/300] [1050/1251] eta: 0:03:13 lr: 0.001155 loss: 3.359461 (3.250213) time: 0.985757 data: 0.000174 max mem: 18817 Epoch: [135/300] [1100/1251] eta: 0:02:25 lr: 0.001155 loss: 3.383188 (3.250028) time: 0.991377 data: 0.000178 max mem: 18817 Epoch: [135/300] [1150/1251] eta: 0:01:37 lr: 0.001154 loss: 3.268722 (3.245350) time: 0.948534 data: 0.000172 max mem: 18817 Epoch: [135/300] [1200/1251] eta: 0:00:49 lr: 0.001154 loss: 3.541457 (3.246137) time: 0.919353 data: 0.000171 max mem: 18817 Epoch: [135/300] [1250/1251] eta: 0:00:00 lr: 0.001154 loss: 3.435551 (3.247189) time: 0.927823 data: 0.000749 max mem: 18817 Epoch: [135/300] Total time: 0:20:04 (0.963051 s / it) Averaged stats: lr: 0.001154 loss: 3.435551 (3.247942) Test: [ 0/49] eta: 0:01:17 loss: 0.622311 (0.622311) acc1: 81.250000 (81.250000) acc5: 98.437500 (98.437500) time: 1.582921 data: 1.139004 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.727883 (0.792104) acc1: 81.250000 (80.539773) acc5: 95.312500 (95.880682) time: 0.487351 data: 0.103687 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.858444 (0.827709) acc1: 81.250000 (79.613095) acc5: 95.312500 (95.833333) time: 0.370159 data: 0.000149 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.827198 (0.827211) acc1: 78.125000 (79.435484) acc5: 96.875000 (95.866935) time: 0.363378 data: 0.000142 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.819657 (0.838712) acc1: 81.250000 (79.420732) acc5: 93.750000 (95.426829) time: 0.361394 data: 0.000140 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.836470 (0.835852) acc1: 79.687500 (79.616000) acc5: 93.750000 (95.488000) time: 0.355583 data: 0.000118 max mem: 18817 Test: Total time: 0:00:19 (0.391429 s / it) * Acc@1 80.266 Acc@5 95.596 loss 0.840 Max accuracy: 80.27% Epoch: [136/300] [ 0/1251] eta: 0:41:54 lr: 0.001154 loss: 2.940479 (2.940479) time: 2.010112 data: 1.073621 max mem: 18817 Epoch: [136/300] [ 50/1251] eta: 0:19:15 lr: 0.001153 loss: 3.100361 (3.191045) time: 0.970753 data: 0.000164 max mem: 18817 Epoch: [136/300] [ 100/1251] eta: 0:18:25 lr: 0.001153 loss: 3.311942 (3.116785) time: 0.936967 data: 0.000166 max mem: 18817 Epoch: [136/300] [ 150/1251] eta: 0:17:44 lr: 0.001152 loss: 3.375087 (3.206047) time: 0.930613 data: 0.000162 max mem: 18817 Epoch: [136/300] [ 200/1251] eta: 0:16:52 lr: 0.001152 loss: 3.354221 (3.247125) time: 0.961859 data: 0.000161 max mem: 18817 Epoch: [136/300] [ 250/1251] eta: 0:16:06 lr: 0.001151 loss: 3.272488 (3.240089) time: 1.000792 data: 0.000174 max mem: 18817 Epoch: [136/300] [ 300/1251] eta: 0:15:14 lr: 0.001151 loss: 3.320441 (3.244062) time: 0.965889 data: 0.000172 max mem: 18817 Epoch: [136/300] [ 350/1251] eta: 0:14:24 lr: 0.001151 loss: 3.329445 (3.244896) time: 0.909579 data: 0.000161 max mem: 18817 Epoch: [136/300] [ 400/1251] eta: 0:13:38 lr: 0.001150 loss: 3.088775 (3.238273) time: 0.935872 data: 0.000181 max mem: 18817 Epoch: [136/300] [ 450/1251] eta: 0:12:50 lr: 0.001150 loss: 3.187564 (3.233533) time: 0.986730 data: 0.000157 max mem: 18817 Epoch: [136/300] [ 500/1251] eta: 0:12:02 lr: 0.001149 loss: 3.379761 (3.231788) time: 0.999550 data: 0.000166 max mem: 18817 Epoch: [136/300] [ 550/1251] eta: 0:11:13 lr: 0.001149 loss: 3.557262 (3.241952) time: 0.962565 data: 0.000169 max mem: 18817 Epoch: [136/300] [ 600/1251] eta: 0:10:25 lr: 0.001149 loss: 3.257912 (3.243754) time: 0.920389 data: 0.000174 max mem: 18817 Epoch: [136/300] [ 650/1251] eta: 0:09:37 lr: 0.001148 loss: 3.362235 (3.247073) time: 0.927831 data: 0.000169 max mem: 18817 Epoch: [136/300] [ 700/1251] eta: 0:08:49 lr: 0.001148 loss: 3.351065 (3.249076) time: 0.979277 data: 0.000172 max mem: 18817 Epoch: [136/300] [ 750/1251] eta: 0:08:02 lr: 0.001147 loss: 3.330585 (3.252009) time: 1.025837 data: 0.000165 max mem: 18817 Epoch: [136/300] [ 800/1251] eta: 0:07:14 lr: 0.001147 loss: 3.130663 (3.249100) time: 0.983962 data: 0.000168 max mem: 18817 Epoch: [136/300] [ 850/1251] eta: 0:06:25 lr: 0.001147 loss: 3.331156 (3.248659) time: 0.905451 data: 0.000168 max mem: 18817 Epoch: [136/300] [ 900/1251] eta: 0:05:37 lr: 0.001146 loss: 3.388449 (3.247156) time: 0.940085 data: 0.000178 max mem: 18817 Epoch: [136/300] [ 950/1251] eta: 0:04:49 lr: 0.001146 loss: 2.991687 (3.239700) time: 0.967742 data: 0.000176 max mem: 18817 Epoch: [136/300] [1000/1251] eta: 0:04:01 lr: 0.001145 loss: 3.391730 (3.239539) time: 0.991163 data: 0.000154 max mem: 18817 Epoch: [136/300] [1050/1251] eta: 0:03:13 lr: 0.001145 loss: 3.258490 (3.237634) time: 0.982735 data: 0.000175 max mem: 18817 Epoch: [136/300] [1100/1251] eta: 0:02:25 lr: 0.001144 loss: 3.032988 (3.237606) time: 0.905144 data: 0.000172 max mem: 18817 Epoch: [136/300] [1150/1251] eta: 0:01:37 lr: 0.001144 loss: 3.371300 (3.241918) time: 0.934432 data: 0.000178 max mem: 18817 Epoch: [136/300] [1200/1251] eta: 0:00:49 lr: 0.001144 loss: 3.279838 (3.239622) time: 0.966927 data: 0.000163 max mem: 18817 Epoch: [136/300] [1250/1251] eta: 0:00:00 lr: 0.001143 loss: 3.403650 (3.240814) time: 0.969835 data: 0.000727 max mem: 18817 Epoch: [136/300] Total time: 0:20:03 (0.962116 s / it) Averaged stats: lr: 0.001143 loss: 3.403650 (3.233888) Test: [ 0/49] eta: 0:01:30 loss: 0.627631 (0.627631) acc1: 81.250000 (81.250000) acc5: 98.437500 (98.437500) time: 1.836797 data: 1.434988 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.681256 (0.793969) acc1: 82.812500 (81.392045) acc5: 95.312500 (95.312500) time: 0.508054 data: 0.130622 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.855746 (0.826765) acc1: 79.687500 (80.654762) acc5: 95.312500 (95.312500) time: 0.368537 data: 0.000150 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.861948 (0.833646) acc1: 78.125000 (80.241935) acc5: 95.312500 (95.413306) time: 0.373264 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.861948 (0.840755) acc1: 78.125000 (80.068598) acc5: 95.312500 (95.388720) time: 0.371307 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.848372 (0.835228) acc1: 78.125000 (80.320000) acc5: 96.875000 (95.584000) time: 0.360412 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.400147 s / it) * Acc@1 80.172 Acc@5 95.504 loss 0.833 Max accuracy: 80.27% Epoch: [137/300] [ 0/1251] eta: 0:41:20 lr: 0.001143 loss: 3.167957 (3.167957) time: 1.983126 data: 1.091886 max mem: 18817 Epoch: [137/300] [ 50/1251] eta: 0:19:29 lr: 0.001143 loss: 3.104190 (3.191802) time: 0.923697 data: 0.000151 max mem: 18817 Epoch: [137/300] [ 100/1251] eta: 0:18:40 lr: 0.001142 loss: 3.220573 (3.189907) time: 0.931887 data: 0.000182 max mem: 18817 Epoch: [137/300] [ 150/1251] eta: 0:17:48 lr: 0.001142 loss: 3.162058 (3.194708) time: 0.996114 data: 0.000169 max mem: 18817 Epoch: [137/300] [ 200/1251] eta: 0:17:00 lr: 0.001142 loss: 3.609340 (3.214878) time: 0.968036 data: 0.000170 max mem: 18817 Epoch: [137/300] [ 250/1251] eta: 0:16:09 lr: 0.001141 loss: 2.892504 (3.196131) time: 0.981869 data: 0.000175 max mem: 18817 Epoch: [137/300] [ 300/1251] eta: 0:15:16 lr: 0.001141 loss: 3.242797 (3.199088) time: 0.914419 data: 0.000180 max mem: 18817 Epoch: [137/300] [ 350/1251] eta: 0:14:27 lr: 0.001140 loss: 3.272457 (3.184561) time: 0.928608 data: 0.000171 max mem: 18817 Epoch: [137/300] [ 400/1251] eta: 0:13:39 lr: 0.001140 loss: 3.441447 (3.192596) time: 0.992488 data: 0.000165 max mem: 18817 Epoch: [137/300] [ 450/1251] eta: 0:12:52 lr: 0.001140 loss: 3.107246 (3.183670) time: 1.023438 data: 0.000175 max mem: 18817 Epoch: [137/300] [ 500/1251] eta: 0:12:03 lr: 0.001139 loss: 3.495663 (3.191126) time: 0.982533 data: 0.000181 max mem: 18817 Epoch: [137/300] [ 550/1251] eta: 0:11:14 lr: 0.001139 loss: 2.956072 (3.183205) time: 0.919401 data: 0.000183 max mem: 18817 Epoch: [137/300] [ 600/1251] eta: 0:10:26 lr: 0.001138 loss: 3.071760 (3.183975) time: 0.911117 data: 0.000177 max mem: 18817 Epoch: [137/300] [ 650/1251] eta: 0:09:38 lr: 0.001138 loss: 3.241491 (3.191972) time: 0.991798 data: 0.000173 max mem: 18817 Epoch: [137/300] [ 700/1251] eta: 0:08:50 lr: 0.001137 loss: 3.176539 (3.191330) time: 0.984619 data: 0.000157 max mem: 18817 Epoch: [137/300] [ 750/1251] eta: 0:08:01 lr: 0.001137 loss: 3.064036 (3.184807) time: 0.998234 data: 0.000164 max mem: 18817 Epoch: [137/300] [ 800/1251] eta: 0:07:13 lr: 0.001137 loss: 3.563475 (3.186531) time: 0.911214 data: 0.000168 max mem: 18817 Epoch: [137/300] [ 850/1251] eta: 0:06:25 lr: 0.001136 loss: 3.273953 (3.184911) time: 0.913401 data: 0.000164 max mem: 18817 Epoch: [137/300] [ 900/1251] eta: 0:05:37 lr: 0.001136 loss: 3.351047 (3.192209) time: 0.988263 data: 0.000164 max mem: 18817 Epoch: [137/300] [ 950/1251] eta: 0:04:49 lr: 0.001135 loss: 3.253524 (3.198822) time: 0.986191 data: 0.000164 max mem: 18817 Epoch: [137/300] [1000/1251] eta: 0:04:00 lr: 0.001135 loss: 3.277325 (3.209415) time: 0.948082 data: 0.000164 max mem: 18817 Epoch: [137/300] [1050/1251] eta: 0:03:12 lr: 0.001135 loss: 3.185971 (3.211636) time: 0.913339 data: 0.000188 max mem: 18817 Epoch: [137/300] [1100/1251] eta: 0:02:24 lr: 0.001134 loss: 3.525572 (3.216129) time: 0.928207 data: 0.000175 max mem: 18817 Epoch: [137/300] [1150/1251] eta: 0:01:36 lr: 0.001134 loss: 3.157834 (3.216505) time: 0.984553 data: 0.000169 max mem: 18817 Epoch: [137/300] [1200/1251] eta: 0:00:48 lr: 0.001133 loss: 3.134629 (3.214213) time: 1.038373 data: 0.000169 max mem: 18817 Epoch: [137/300] [1250/1251] eta: 0:00:00 lr: 0.001133 loss: 2.958872 (3.213008) time: 0.997544 data: 0.000762 max mem: 18817 Epoch: [137/300] Total time: 0:20:01 (0.960827 s / it) Averaged stats: lr: 0.001133 loss: 2.958872 (3.209201) Test: [ 0/49] eta: 0:01:30 loss: 0.622527 (0.622527) acc1: 82.812500 (82.812500) acc5: 100.000000 (100.000000) time: 1.855287 data: 1.470490 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.718109 (0.769851) acc1: 82.812500 (82.244318) acc5: 95.312500 (95.738636) time: 0.501894 data: 0.133823 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.785103 (0.805558) acc1: 79.687500 (80.654762) acc5: 95.312500 (95.758929) time: 0.367395 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.813246 (0.804742) acc1: 79.687500 (80.393145) acc5: 95.312500 (95.816532) time: 0.394941 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.812723 (0.817734) acc1: 79.687500 (80.487805) acc5: 95.312500 (95.807927) time: 0.411012 data: 0.000140 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.833184 (0.819987) acc1: 78.125000 (80.192000) acc5: 95.312500 (95.840000) time: 0.382408 data: 0.000120 max mem: 18817 Test: Total time: 0:00:20 (0.416376 s / it) * Acc@1 80.384 Acc@5 95.512 loss 0.829 Max accuracy: 80.38% Epoch: [138/300] [ 0/1251] eta: 0:42:53 lr: 0.001133 loss: 3.632037 (3.632037) time: 2.056907 data: 1.152026 max mem: 18817 Epoch: [138/300] [ 50/1251] eta: 0:19:46 lr: 0.001133 loss: 3.333245 (3.099536) time: 0.934430 data: 0.000173 max mem: 18817 Epoch: [138/300] [ 100/1251] eta: 0:18:41 lr: 0.001132 loss: 3.337139 (3.196885) time: 0.997224 data: 0.000179 max mem: 18817 Epoch: [138/300] [ 150/1251] eta: 0:17:50 lr: 0.001132 loss: 3.292559 (3.208500) time: 1.049255 data: 0.000186 max mem: 18817 Epoch: [138/300] [ 200/1251] eta: 0:16:58 lr: 0.001131 loss: 3.259018 (3.217170) time: 0.977698 data: 0.000174 max mem: 18817 Epoch: [138/300] [ 250/1251] eta: 0:16:05 lr: 0.001131 loss: 2.855576 (3.199226) time: 0.921960 data: 0.000172 max mem: 18817 Epoch: [138/300] [ 300/1251] eta: 0:15:18 lr: 0.001130 loss: 3.158550 (3.209220) time: 0.926051 data: 0.000164 max mem: 18817 Epoch: [138/300] [ 350/1251] eta: 0:14:31 lr: 0.001130 loss: 3.390712 (3.189804) time: 0.999693 data: 0.000163 max mem: 18817 Epoch: [138/300] [ 400/1251] eta: 0:13:43 lr: 0.001130 loss: 3.072107 (3.180207) time: 1.040210 data: 0.000183 max mem: 18817 Epoch: [138/300] [ 450/1251] eta: 0:12:53 lr: 0.001129 loss: 3.297789 (3.181715) time: 0.971921 data: 0.000197 max mem: 18817 Epoch: [138/300] [ 500/1251] eta: 0:12:04 lr: 0.001129 loss: 3.164091 (3.182320) time: 0.921099 data: 0.000170 max mem: 18817 Epoch: [138/300] [ 550/1251] eta: 0:11:17 lr: 0.001128 loss: 3.247782 (3.186236) time: 0.938035 data: 0.000194 max mem: 18817 Epoch: [138/300] [ 600/1251] eta: 0:10:28 lr: 0.001128 loss: 3.113423 (3.171764) time: 0.971414 data: 0.000158 max mem: 18817 Epoch: [138/300] [ 650/1251] eta: 0:09:40 lr: 0.001128 loss: 3.306488 (3.173349) time: 0.989330 data: 0.000289 max mem: 18817 Epoch: [138/300] [ 700/1251] eta: 0:08:51 lr: 0.001127 loss: 3.394116 (3.182553) time: 0.971510 data: 0.000169 max mem: 18817 Epoch: [138/300] [ 750/1251] eta: 0:08:02 lr: 0.001127 loss: 2.984451 (3.180738) time: 0.909574 data: 0.000174 max mem: 18817 Epoch: [138/300] [ 800/1251] eta: 0:07:14 lr: 0.001126 loss: 3.243091 (3.185405) time: 0.918721 data: 0.000175 max mem: 18817 Epoch: [138/300] [ 850/1251] eta: 0:06:26 lr: 0.001126 loss: 3.415416 (3.191194) time: 0.969397 data: 0.000169 max mem: 18817 Epoch: [138/300] [ 900/1251] eta: 0:05:38 lr: 0.001126 loss: 3.500298 (3.197894) time: 1.016752 data: 0.000172 max mem: 18817 Epoch: [138/300] [ 950/1251] eta: 0:04:49 lr: 0.001125 loss: 3.476365 (3.201261) time: 0.975724 data: 0.000181 max mem: 18817 Epoch: [138/300] [1000/1251] eta: 0:04:01 lr: 0.001125 loss: 3.406059 (3.204747) time: 0.929553 data: 0.000167 max mem: 18817 Epoch: [138/300] [1050/1251] eta: 0:03:13 lr: 0.001124 loss: 3.091113 (3.206712) time: 0.940775 data: 0.000176 max mem: 18817 Epoch: [138/300] [1100/1251] eta: 0:02:25 lr: 0.001124 loss: 3.294230 (3.208728) time: 0.988400 data: 0.000162 max mem: 18817 Epoch: [138/300] [1150/1251] eta: 0:01:37 lr: 0.001123 loss: 3.194406 (3.212154) time: 0.968163 data: 0.000161 max mem: 18817 Epoch: [138/300] [1200/1251] eta: 0:00:49 lr: 0.001123 loss: 3.183517 (3.208873) time: 0.957023 data: 0.000170 max mem: 18817 Epoch: [138/300] [1250/1251] eta: 0:00:00 lr: 0.001123 loss: 3.387837 (3.213895) time: 0.918690 data: 0.000749 max mem: 18817 Epoch: [138/300] Total time: 0:20:03 (0.961922 s / it) Averaged stats: lr: 0.001123 loss: 3.387837 (3.218809) Test: [ 0/49] eta: 0:01:29 loss: 0.636981 (0.636981) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.832194 data: 1.396579 max mem: 18817 Test: [10/49] eta: 0:00:26 loss: 0.649445 (0.781262) acc1: 82.812500 (82.386364) acc5: 95.312500 (95.454545) time: 0.690941 data: 0.127123 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.854551 (0.809248) acc1: 81.250000 (81.473214) acc5: 95.312500 (95.312500) time: 0.469238 data: 0.000156 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.875022 (0.819812) acc1: 79.687500 (81.098790) acc5: 96.875000 (95.665323) time: 0.362111 data: 0.000139 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.839876 (0.837560) acc1: 79.687500 (80.716463) acc5: 95.312500 (95.388720) time: 0.360126 data: 0.000147 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.839876 (0.831961) acc1: 79.687500 (80.768000) acc5: 95.312500 (95.488000) time: 0.364797 data: 0.000121 max mem: 18817 Test: Total time: 0:00:21 (0.439473 s / it) * Acc@1 80.242 Acc@5 95.520 loss 0.842 Max accuracy: 80.38% Epoch: [139/300] [ 0/1251] eta: 0:44:09 lr: 0.001123 loss: 2.321048 (2.321048) time: 2.118288 data: 1.220366 max mem: 18817 Epoch: [139/300] [ 50/1251] eta: 0:19:58 lr: 0.001122 loss: 3.030179 (3.127740) time: 0.970183 data: 0.000180 max mem: 18817 Epoch: [139/300] [ 100/1251] eta: 0:18:53 lr: 0.001122 loss: 3.099867 (3.141907) time: 0.977964 data: 0.000175 max mem: 18817 Epoch: [139/300] [ 150/1251] eta: 0:17:48 lr: 0.001121 loss: 3.254011 (3.190809) time: 0.979847 data: 0.000177 max mem: 18817 Epoch: [139/300] [ 200/1251] eta: 0:16:55 lr: 0.001121 loss: 3.188715 (3.187294) time: 0.918465 data: 0.000173 max mem: 18817 Epoch: [139/300] [ 250/1251] eta: 0:16:07 lr: 0.001121 loss: 3.200828 (3.196348) time: 0.918424 data: 0.000170 max mem: 18817 Epoch: [139/300] [ 300/1251] eta: 0:15:21 lr: 0.001120 loss: 3.244221 (3.185668) time: 0.947924 data: 0.000161 max mem: 18817 Epoch: [139/300] [ 350/1251] eta: 0:14:32 lr: 0.001120 loss: 2.938371 (3.180017) time: 0.973350 data: 0.000162 max mem: 18817 Epoch: [139/300] [ 400/1251] eta: 0:13:40 lr: 0.001119 loss: 3.232738 (3.200498) time: 0.964399 data: 0.000187 max mem: 18817 Epoch: [139/300] [ 450/1251] eta: 0:12:51 lr: 0.001119 loss: 3.193897 (3.202612) time: 0.917972 data: 0.000175 max mem: 18817 Epoch: [139/300] [ 500/1251] eta: 0:12:02 lr: 0.001119 loss: 3.234167 (3.202650) time: 0.931844 data: 0.000163 max mem: 18817 Epoch: [139/300] [ 550/1251] eta: 0:11:15 lr: 0.001118 loss: 3.336249 (3.204752) time: 0.943290 data: 0.000178 max mem: 18817 Epoch: [139/300] [ 600/1251] eta: 0:10:28 lr: 0.001118 loss: 3.300362 (3.206505) time: 0.983594 data: 0.000180 max mem: 18817 Epoch: [139/300] [ 650/1251] eta: 0:09:39 lr: 0.001117 loss: 3.499344 (3.209105) time: 0.973395 data: 0.000179 max mem: 18817 Epoch: [139/300] [ 700/1251] eta: 0:08:50 lr: 0.001117 loss: 3.196962 (3.212718) time: 0.927079 data: 0.000167 max mem: 18817 Epoch: [139/300] [ 750/1251] eta: 0:08:02 lr: 0.001116 loss: 3.092026 (3.206157) time: 0.924309 data: 0.000165 max mem: 18817 Epoch: [139/300] [ 800/1251] eta: 0:07:14 lr: 0.001116 loss: 3.224238 (3.203291) time: 0.928991 data: 0.000162 max mem: 18817 Epoch: [139/300] [ 850/1251] eta: 0:06:27 lr: 0.001116 loss: 2.991906 (3.197033) time: 1.000175 data: 0.000180 max mem: 18817 Epoch: [139/300] [ 900/1251] eta: 0:05:38 lr: 0.001115 loss: 3.424675 (3.199789) time: 0.980612 data: 0.000191 max mem: 18817 Epoch: [139/300] [ 950/1251] eta: 0:04:50 lr: 0.001115 loss: 3.337560 (3.196975) time: 0.980087 data: 0.000190 max mem: 18817 Epoch: [139/300] [1000/1251] eta: 0:04:02 lr: 0.001114 loss: 3.112794 (3.194584) time: 0.925352 data: 0.000169 max mem: 18817 Epoch: [139/300] [1050/1251] eta: 0:03:13 lr: 0.001114 loss: 3.033951 (3.190706) time: 0.937008 data: 0.000172 max mem: 18817 Epoch: [139/300] [1100/1251] eta: 0:02:25 lr: 0.001114 loss: 3.239895 (3.194343) time: 0.991889 data: 0.000167 max mem: 18817 Epoch: [139/300] [1150/1251] eta: 0:01:37 lr: 0.001113 loss: 3.087770 (3.193635) time: 1.048625 data: 0.000180 max mem: 18817 Epoch: [139/300] [1200/1251] eta: 0:00:49 lr: 0.001113 loss: 3.282058 (3.195300) time: 0.984055 data: 0.000174 max mem: 18817 Epoch: [139/300] [1250/1251] eta: 0:00:00 lr: 0.001112 loss: 3.422746 (3.200138) time: 0.936185 data: 0.000754 max mem: 18817 Epoch: [139/300] Total time: 0:20:07 (0.965159 s / it) Averaged stats: lr: 0.001112 loss: 3.422746 (3.200252) Test: [ 0/49] eta: 0:01:16 loss: 0.528950 (0.528950) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.569271 data: 1.136283 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.679685 (0.758022) acc1: 79.687500 (81.107955) acc5: 95.312500 (96.022727) time: 0.476400 data: 0.103437 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.827836 (0.786981) acc1: 79.687500 (80.357143) acc5: 95.312500 (96.056548) time: 0.461228 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.806036 (0.790243) acc1: 78.125000 (79.989919) acc5: 95.312500 (95.766129) time: 0.458988 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.804323 (0.798911) acc1: 79.687500 (80.259146) acc5: 95.312500 (95.769817) time: 0.360152 data: 0.000131 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.847177 (0.804310) acc1: 78.125000 (80.192000) acc5: 95.312500 (95.872000) time: 0.354653 data: 0.000099 max mem: 18817 Test: Total time: 0:00:20 (0.427497 s / it) * Acc@1 80.308 Acc@5 95.614 loss 0.810 Max accuracy: 80.38% Epoch: [140/300] [ 0/1251] eta: 0:40:46 lr: 0.001112 loss: 3.677796 (3.677796) time: 1.955878 data: 1.059681 max mem: 18817 Epoch: [140/300] [ 50/1251] eta: 0:19:32 lr: 0.001112 loss: 3.166695 (3.228451) time: 0.973738 data: 0.000151 max mem: 18817 Epoch: [140/300] [ 100/1251] eta: 0:18:34 lr: 0.001112 loss: 3.439070 (3.260188) time: 0.991023 data: 0.000187 max mem: 18817 Epoch: [140/300] [ 150/1251] eta: 0:17:37 lr: 0.001111 loss: 3.164162 (3.202513) time: 0.915132 data: 0.000172 max mem: 18817 Epoch: [140/300] [ 200/1251] eta: 0:16:53 lr: 0.001111 loss: 3.190293 (3.223134) time: 0.931418 data: 0.000158 max mem: 18817 Epoch: [140/300] [ 250/1251] eta: 0:16:04 lr: 0.001110 loss: 3.104619 (3.210005) time: 0.985620 data: 0.000174 max mem: 18817 Epoch: [140/300] [ 300/1251] eta: 0:15:18 lr: 0.001110 loss: 3.200690 (3.205524) time: 0.990500 data: 0.000182 max mem: 18817 Epoch: [140/300] [ 350/1251] eta: 0:14:27 lr: 0.001109 loss: 3.195906 (3.201280) time: 0.977896 data: 0.000180 max mem: 18817 Epoch: [140/300] [ 400/1251] eta: 0:13:39 lr: 0.001109 loss: 3.298724 (3.212173) time: 0.922084 data: 0.000189 max mem: 18817 Epoch: [140/300] [ 450/1251] eta: 0:12:52 lr: 0.001109 loss: 3.154862 (3.212486) time: 0.936371 data: 0.000169 max mem: 18817 Epoch: [140/300] [ 500/1251] eta: 0:12:05 lr: 0.001108 loss: 3.451192 (3.218815) time: 0.939889 data: 0.000184 max mem: 18817 Epoch: [140/300] [ 550/1251] eta: 0:11:17 lr: 0.001108 loss: 3.127379 (3.219957) time: 0.992550 data: 0.000162 max mem: 18817 Epoch: [140/300] [ 600/1251] eta: 0:10:28 lr: 0.001107 loss: 3.240909 (3.216731) time: 0.972814 data: 0.000188 max mem: 18817 Epoch: [140/300] [ 650/1251] eta: 0:09:39 lr: 0.001107 loss: 3.422141 (3.226159) time: 0.933941 data: 0.000173 max mem: 18817 Epoch: [140/300] [ 700/1251] eta: 0:08:51 lr: 0.001107 loss: 3.067102 (3.225861) time: 0.927690 data: 0.000166 max mem: 18817 Epoch: [140/300] [ 750/1251] eta: 0:08:03 lr: 0.001106 loss: 3.128406 (3.222316) time: 0.933998 data: 0.000177 max mem: 18817 Epoch: [140/300] [ 800/1251] eta: 0:07:15 lr: 0.001106 loss: 3.098876 (3.215925) time: 1.003299 data: 0.000170 max mem: 18817 Epoch: [140/300] [ 850/1251] eta: 0:06:27 lr: 0.001105 loss: 3.015394 (3.221831) time: 0.982637 data: 0.000170 max mem: 18817 Epoch: [140/300] [ 900/1251] eta: 0:05:38 lr: 0.001105 loss: 3.296089 (3.223291) time: 0.962539 data: 0.000189 max mem: 18817 Epoch: [140/300] [ 950/1251] eta: 0:04:50 lr: 0.001105 loss: 3.202571 (3.218355) time: 0.933360 data: 0.000186 max mem: 18817 Epoch: [140/300] [1000/1251] eta: 0:04:02 lr: 0.001104 loss: 3.174549 (3.217365) time: 0.915211 data: 0.000153 max mem: 18817 Epoch: [140/300] [1050/1251] eta: 0:03:13 lr: 0.001104 loss: 3.191062 (3.215850) time: 0.988865 data: 0.000171 max mem: 18817 Epoch: [140/300] [1100/1251] eta: 0:02:25 lr: 0.001103 loss: 3.388386 (3.218728) time: 0.979155 data: 0.000188 max mem: 18817 Epoch: [140/300] [1150/1251] eta: 0:01:37 lr: 0.001103 loss: 3.540670 (3.222191) time: 0.954984 data: 0.000171 max mem: 18817 Epoch: [140/300] [1200/1251] eta: 0:00:49 lr: 0.001102 loss: 3.520574 (3.224534) time: 0.931334 data: 0.000168 max mem: 18817 Epoch: [140/300] [1250/1251] eta: 0:00:00 lr: 0.001102 loss: 3.172298 (3.229267) time: 0.921411 data: 0.000740 max mem: 18817 Epoch: [140/300] Total time: 0:20:06 (0.964210 s / it) Averaged stats: lr: 0.001102 loss: 3.172298 (3.222034) Test: [ 0/49] eta: 0:01:18 loss: 0.655815 (0.655815) acc1: 81.250000 (81.250000) acc5: 98.437500 (98.437500) time: 1.596453 data: 1.145814 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.736088 (0.763286) acc1: 81.250000 (80.965909) acc5: 95.312500 (95.596591) time: 0.485470 data: 0.104315 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.756144 (0.781305) acc1: 81.250000 (80.654762) acc5: 95.312500 (95.907738) time: 0.367880 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.770974 (0.782720) acc1: 79.687500 (80.796371) acc5: 96.875000 (96.219758) time: 0.361469 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.791152 (0.798387) acc1: 79.687500 (80.487805) acc5: 96.875000 (95.884146) time: 0.359854 data: 0.000131 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.837574 (0.794388) acc1: 78.125000 (80.576000) acc5: 95.312500 (95.872000) time: 0.355430 data: 0.000101 max mem: 18817 Test: Total time: 0:00:19 (0.390197 s / it) * Acc@1 80.272 Acc@5 95.710 loss 0.810 Max accuracy: 80.38% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0140.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0140.pth Epoch: [141/300] [ 0/1251] eta: 0:42:59 lr: 0.001102 loss: 2.239941 (2.239941) time: 2.061739 data: 1.167203 max mem: 18817 Epoch: [141/300] [ 50/1251] eta: 0:19:52 lr: 0.001102 loss: 3.389981 (3.189131) time: 0.998199 data: 0.000168 max mem: 18817 Epoch: [141/300] [ 100/1251] eta: 0:18:52 lr: 0.001101 loss: 3.147069 (3.139026) time: 1.006551 data: 0.000172 max mem: 18817 Epoch: [141/300] [ 150/1251] eta: 0:17:49 lr: 0.001101 loss: 3.024248 (3.090862) time: 0.975744 data: 0.000200 max mem: 18817 Epoch: [141/300] [ 200/1251] eta: 0:16:54 lr: 0.001100 loss: 3.201990 (3.086257) time: 0.926296 data: 0.000169 max mem: 18817 Epoch: [141/300] [ 250/1251] eta: 0:16:07 lr: 0.001100 loss: 3.221960 (3.098733) time: 0.925372 data: 0.000161 max mem: 18817 Epoch: [141/300] [ 300/1251] eta: 0:15:19 lr: 0.001100 loss: 3.182047 (3.110303) time: 0.976497 data: 0.000165 max mem: 18817 Epoch: [141/300] [ 350/1251] eta: 0:14:29 lr: 0.001099 loss: 3.260444 (3.101571) time: 0.972113 data: 0.000177 max mem: 18817 Epoch: [141/300] [ 400/1251] eta: 0:13:39 lr: 0.001099 loss: 3.250093 (3.112446) time: 0.986089 data: 0.000182 max mem: 18817 Epoch: [141/300] [ 450/1251] eta: 0:12:49 lr: 0.001098 loss: 3.110118 (3.112561) time: 0.903665 data: 0.000172 max mem: 18817 Epoch: [141/300] [ 500/1251] eta: 0:12:02 lr: 0.001098 loss: 3.039637 (3.118454) time: 0.926701 data: 0.000170 max mem: 18817 Epoch: [141/300] [ 550/1251] eta: 0:11:14 lr: 0.001097 loss: 3.394402 (3.131001) time: 0.987757 data: 0.000179 max mem: 18817 Epoch: [141/300] [ 600/1251] eta: 0:10:26 lr: 0.001097 loss: 3.169837 (3.136944) time: 0.984859 data: 0.000162 max mem: 18817 Epoch: [141/300] [ 650/1251] eta: 0:09:38 lr: 0.001097 loss: 3.288705 (3.146309) time: 0.983811 data: 0.000183 max mem: 18817 Epoch: [141/300] [ 700/1251] eta: 0:08:50 lr: 0.001096 loss: 3.283016 (3.150118) time: 0.919987 data: 0.000173 max mem: 18817 Epoch: [141/300] [ 750/1251] eta: 0:08:02 lr: 0.001096 loss: 3.489033 (3.154876) time: 0.944041 data: 0.000172 max mem: 18817 Epoch: [141/300] [ 800/1251] eta: 0:07:14 lr: 0.001095 loss: 3.118999 (3.150889) time: 0.949815 data: 0.000166 max mem: 18817 Epoch: [141/300] [ 850/1251] eta: 0:06:26 lr: 0.001095 loss: 3.542028 (3.162678) time: 0.987657 data: 0.000173 max mem: 18817 Epoch: [141/300] [ 900/1251] eta: 0:05:37 lr: 0.001095 loss: 3.402562 (3.166158) time: 0.963122 data: 0.000175 max mem: 18817 Epoch: [141/300] [ 950/1251] eta: 0:04:49 lr: 0.001094 loss: 3.462000 (3.170124) time: 0.921479 data: 0.000163 max mem: 18817 Epoch: [141/300] [1000/1251] eta: 0:04:01 lr: 0.001094 loss: 3.475931 (3.179560) time: 0.934037 data: 0.000174 max mem: 18817 Epoch: [141/300] [1050/1251] eta: 0:03:13 lr: 0.001093 loss: 3.515044 (3.189072) time: 0.939644 data: 0.000173 max mem: 18817 Epoch: [141/300] [1100/1251] eta: 0:02:25 lr: 0.001093 loss: 3.191352 (3.192603) time: 0.993182 data: 0.000175 max mem: 18817 Epoch: [141/300] [1150/1251] eta: 0:01:37 lr: 0.001093 loss: 3.614597 (3.202239) time: 0.959985 data: 0.000168 max mem: 18817 Epoch: [141/300] [1200/1251] eta: 0:00:49 lr: 0.001092 loss: 3.128236 (3.201147) time: 0.912024 data: 0.000171 max mem: 18817 Epoch: [141/300] [1250/1251] eta: 0:00:00 lr: 0.001092 loss: 3.407841 (3.203840) time: 0.929071 data: 0.000767 max mem: 18817 Epoch: [141/300] Total time: 0:20:05 (0.963944 s / it) Averaged stats: lr: 0.001092 loss: 3.407841 (3.202773) Test: [ 0/49] eta: 0:01:28 loss: 0.613348 (0.613348) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.799869 data: 1.389764 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.671836 (0.750937) acc1: 82.812500 (82.528409) acc5: 96.875000 (95.738636) time: 0.496612 data: 0.126484 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.777434 (0.782142) acc1: 81.250000 (81.324405) acc5: 96.875000 (95.907738) time: 0.364000 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.793986 (0.782980) acc1: 79.687500 (80.897177) acc5: 96.875000 (96.118952) time: 0.457933 data: 0.000122 max mem: 18817 Test: [40/49] eta: 0:00:04 loss: 0.820939 (0.799236) acc1: 79.687500 (80.830793) acc5: 96.875000 (95.960366) time: 0.458678 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.872452 (0.802357) acc1: 79.687500 (80.672000) acc5: 95.312500 (95.936000) time: 0.360425 data: 0.000114 max mem: 18817 Test: Total time: 0:00:21 (0.432979 s / it) * Acc@1 80.220 Acc@5 95.636 loss 0.815 Max accuracy: 80.38% Epoch: [142/300] [ 0/1251] eta: 0:43:20 lr: 0.001092 loss: 2.358150 (2.358150) time: 2.078905 data: 1.175321 max mem: 18817 Epoch: [142/300] [ 50/1251] eta: 0:19:49 lr: 0.001091 loss: 3.397853 (3.189276) time: 0.987151 data: 0.000172 max mem: 18817 Epoch: [142/300] [ 100/1251] eta: 0:18:38 lr: 0.001091 loss: 3.261863 (3.165003) time: 0.992240 data: 0.000185 max mem: 18817 Epoch: [142/300] [ 150/1251] eta: 0:17:44 lr: 0.001090 loss: 3.453937 (3.157876) time: 0.917247 data: 0.000169 max mem: 18817 Epoch: [142/300] [ 200/1251] eta: 0:16:57 lr: 0.001090 loss: 3.281532 (3.150643) time: 0.930148 data: 0.000173 max mem: 18817 Epoch: [142/300] [ 250/1251] eta: 0:16:09 lr: 0.001090 loss: 3.232680 (3.157728) time: 0.947893 data: 0.000178 max mem: 18817 Epoch: [142/300] [ 300/1251] eta: 0:15:19 lr: 0.001089 loss: 3.243800 (3.169445) time: 0.970757 data: 0.000151 max mem: 18817 Epoch: [142/300] [ 350/1251] eta: 0:14:26 lr: 0.001089 loss: 3.044842 (3.162698) time: 0.942136 data: 0.000165 max mem: 18817 Epoch: [142/300] [ 400/1251] eta: 0:13:37 lr: 0.001088 loss: 3.278612 (3.158228) time: 0.914525 data: 0.000178 max mem: 18817 Epoch: [142/300] [ 450/1251] eta: 0:12:51 lr: 0.001088 loss: 3.046725 (3.155732) time: 0.926862 data: 0.000191 max mem: 18817 Epoch: [142/300] [ 500/1251] eta: 0:12:03 lr: 0.001088 loss: 3.441588 (3.168225) time: 0.932109 data: 0.000158 max mem: 18817 Epoch: [142/300] [ 550/1251] eta: 0:11:15 lr: 0.001087 loss: 3.223928 (3.161508) time: 0.978408 data: 0.000168 max mem: 18817 Epoch: [142/300] [ 600/1251] eta: 0:10:26 lr: 0.001087 loss: 3.165370 (3.168631) time: 0.978624 data: 0.000166 max mem: 18817 Epoch: [142/300] [ 650/1251] eta: 0:09:37 lr: 0.001086 loss: 3.045761 (3.167843) time: 0.913158 data: 0.000159 max mem: 18817 Epoch: [142/300] [ 700/1251] eta: 0:08:50 lr: 0.001086 loss: 3.133379 (3.166134) time: 0.939026 data: 0.000167 max mem: 18817 Epoch: [142/300] [ 750/1251] eta: 0:08:02 lr: 0.001085 loss: 3.206624 (3.166907) time: 0.925069 data: 0.000184 max mem: 18817 Epoch: [142/300] [ 800/1251] eta: 0:07:14 lr: 0.001085 loss: 3.272944 (3.168998) time: 0.987728 data: 0.000189 max mem: 18817 Epoch: [142/300] [ 850/1251] eta: 0:06:25 lr: 0.001085 loss: 3.336427 (3.171514) time: 0.978426 data: 0.000168 max mem: 18817 Epoch: [142/300] [ 900/1251] eta: 0:05:38 lr: 0.001084 loss: 3.290147 (3.179518) time: 0.963176 data: 0.000167 max mem: 18817 Epoch: [142/300] [ 950/1251] eta: 0:04:49 lr: 0.001084 loss: 3.085560 (3.181939) time: 0.933562 data: 0.000168 max mem: 18817 Epoch: [142/300] [1000/1251] eta: 0:04:01 lr: 0.001083 loss: 3.166564 (3.185587) time: 0.935800 data: 0.000181 max mem: 18817 Epoch: [142/300] [1050/1251] eta: 0:03:13 lr: 0.001083 loss: 3.162205 (3.187471) time: 0.986500 data: 0.000175 max mem: 18817 Epoch: [142/300] [1100/1251] eta: 0:02:25 lr: 0.001083 loss: 3.241210 (3.187313) time: 0.951167 data: 0.000170 max mem: 18817 Epoch: [142/300] [1150/1251] eta: 0:01:37 lr: 0.001082 loss: 3.298372 (3.183903) time: 0.953363 data: 0.000157 max mem: 18817 Epoch: [142/300] [1200/1251] eta: 0:00:49 lr: 0.001082 loss: 3.304846 (3.183921) time: 0.943235 data: 0.000163 max mem: 18817 Epoch: [142/300] [1250/1251] eta: 0:00:00 lr: 0.001081 loss: 3.295043 (3.185723) time: 0.935516 data: 0.000756 max mem: 18817 Epoch: [142/300] Total time: 0:20:05 (0.963315 s / it) Averaged stats: lr: 0.001081 loss: 3.295043 (3.187749) Test: [ 0/49] eta: 0:01:19 loss: 0.599629 (0.599629) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.631160 data: 1.189780 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.703269 (0.754085) acc1: 82.812500 (82.102273) acc5: 95.312500 (95.738636) time: 0.486476 data: 0.108318 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.798604 (0.790896) acc1: 79.687500 (80.877976) acc5: 95.312500 (95.461310) time: 0.367193 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.794537 (0.785691) acc1: 79.687500 (80.544355) acc5: 96.875000 (95.715726) time: 0.362762 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.794537 (0.806592) acc1: 81.250000 (80.830793) acc5: 95.312500 (95.426829) time: 0.360718 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.819810 (0.799280) acc1: 81.250000 (80.800000) acc5: 95.312500 (95.552000) time: 0.355152 data: 0.000103 max mem: 18817 Test: Total time: 0:00:19 (0.391423 s / it) * Acc@1 80.474 Acc@5 95.592 loss 0.814 Max accuracy: 80.47% Epoch: [143/300] [ 0/1251] eta: 0:43:27 lr: 0.001081 loss: 2.726024 (2.726024) time: 2.084222 data: 1.050101 max mem: 18817 Epoch: [143/300] [ 50/1251] eta: 0:19:34 lr: 0.001081 loss: 3.189810 (3.253839) time: 0.969228 data: 0.000177 max mem: 18817 Epoch: [143/300] [ 100/1251] eta: 0:18:24 lr: 0.001081 loss: 3.371232 (3.246174) time: 0.916610 data: 0.000164 max mem: 18817 Epoch: [143/300] [ 150/1251] eta: 0:17:40 lr: 0.001080 loss: 3.147635 (3.197188) time: 0.919158 data: 0.000178 max mem: 18817 Epoch: [143/300] [ 200/1251] eta: 0:16:49 lr: 0.001080 loss: 3.343686 (3.221612) time: 0.976914 data: 0.000181 max mem: 18817 Epoch: [143/300] [ 250/1251] eta: 0:16:03 lr: 0.001079 loss: 3.401802 (3.227263) time: 1.016279 data: 0.000179 max mem: 18817 Epoch: [143/300] [ 300/1251] eta: 0:15:13 lr: 0.001079 loss: 3.337527 (3.201645) time: 0.983058 data: 0.000182 max mem: 18817 Epoch: [143/300] [ 350/1251] eta: 0:14:24 lr: 0.001078 loss: 3.166169 (3.201993) time: 0.915039 data: 0.000170 max mem: 18817 Epoch: [143/300] [ 400/1251] eta: 0:13:36 lr: 0.001078 loss: 3.382875 (3.198500) time: 0.912063 data: 0.000175 max mem: 18817 Epoch: [143/300] [ 450/1251] eta: 0:12:49 lr: 0.001078 loss: 3.139049 (3.181208) time: 0.969050 data: 0.000162 max mem: 18817 Epoch: [143/300] [ 500/1251] eta: 0:12:01 lr: 0.001077 loss: 3.002769 (3.170857) time: 0.976910 data: 0.000165 max mem: 18817 Epoch: [143/300] [ 550/1251] eta: 0:11:13 lr: 0.001077 loss: 3.427242 (3.175554) time: 0.981642 data: 0.000168 max mem: 18817 Epoch: [143/300] [ 600/1251] eta: 0:10:24 lr: 0.001076 loss: 3.213295 (3.181240) time: 0.922699 data: 0.000200 max mem: 18817 Epoch: [143/300] [ 650/1251] eta: 0:09:37 lr: 0.001076 loss: 3.097297 (3.174237) time: 0.933841 data: 0.000175 max mem: 18817 Epoch: [143/300] [ 700/1251] eta: 0:08:50 lr: 0.001076 loss: 3.060796 (3.176299) time: 0.955997 data: 0.000168 max mem: 18817 Epoch: [143/300] [ 750/1251] eta: 0:08:02 lr: 0.001075 loss: 3.037567 (3.167903) time: 0.986891 data: 0.000163 max mem: 18817 Epoch: [143/300] [ 800/1251] eta: 0:07:13 lr: 0.001075 loss: 3.227571 (3.167461) time: 0.971038 data: 0.000174 max mem: 18817 Epoch: [143/300] [ 850/1251] eta: 0:06:25 lr: 0.001074 loss: 3.333458 (3.172628) time: 0.909749 data: 0.000164 max mem: 18817 Epoch: [143/300] [ 900/1251] eta: 0:05:37 lr: 0.001074 loss: 3.154717 (3.170350) time: 0.923888 data: 0.000164 max mem: 18817 Epoch: [143/300] [ 950/1251] eta: 0:04:49 lr: 0.001073 loss: 3.345498 (3.173181) time: 0.966562 data: 0.000175 max mem: 18817 Epoch: [143/300] [1000/1251] eta: 0:04:01 lr: 0.001073 loss: 3.114440 (3.168859) time: 0.975685 data: 0.000162 max mem: 18817 Epoch: [143/300] [1050/1251] eta: 0:03:13 lr: 0.001073 loss: 2.916229 (3.163303) time: 0.964148 data: 0.000182 max mem: 18817 Epoch: [143/300] [1100/1251] eta: 0:02:25 lr: 0.001072 loss: 3.262307 (3.163890) time: 0.922515 data: 0.000185 max mem: 18817 Epoch: [143/300] [1150/1251] eta: 0:01:37 lr: 0.001072 loss: 3.162917 (3.166734) time: 0.924698 data: 0.000160 max mem: 18817 Epoch: [143/300] [1200/1251] eta: 0:00:49 lr: 0.001071 loss: 3.425498 (3.172295) time: 0.972389 data: 0.000173 max mem: 18817 Epoch: [143/300] [1250/1251] eta: 0:00:00 lr: 0.001071 loss: 3.295358 (3.175426) time: 0.988069 data: 0.000767 max mem: 18817 Epoch: [143/300] Total time: 0:20:03 (0.962381 s / it) Averaged stats: lr: 0.001071 loss: 3.295358 (3.169579) Test: [ 0/49] eta: 0:01:33 loss: 0.546567 (0.546567) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.903363 data: 1.481390 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.662194 (0.748484) acc1: 81.250000 (82.528409) acc5: 96.875000 (95.880682) time: 0.508776 data: 0.134821 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.793153 (0.781883) acc1: 79.687500 (81.101190) acc5: 95.312500 (95.833333) time: 0.365385 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.777808 (0.786556) acc1: 79.687500 (80.594758) acc5: 96.875000 (95.917339) time: 0.374257 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.824958 (0.798435) acc1: 78.125000 (80.373476) acc5: 96.875000 (95.846037) time: 0.372728 data: 0.000139 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.870443 (0.804843) acc1: 78.125000 (80.160000) acc5: 95.312500 (95.776000) time: 0.355693 data: 0.000118 max mem: 18817 Test: Total time: 0:00:19 (0.399851 s / it) * Acc@1 80.498 Acc@5 95.600 loss 0.808 Max accuracy: 80.50% Epoch: [144/300] [ 0/1251] eta: 0:42:50 lr: 0.001071 loss: 3.130401 (3.130401) time: 2.055121 data: 1.159120 max mem: 18817 Epoch: [144/300] [ 50/1251] eta: 0:19:31 lr: 0.001071 loss: 3.063810 (3.007701) time: 0.941034 data: 0.000153 max mem: 18817 Epoch: [144/300] [ 100/1251] eta: 0:18:36 lr: 0.001070 loss: 3.457070 (3.135672) time: 0.923954 data: 0.000167 max mem: 18817 Epoch: [144/300] [ 150/1251] eta: 0:17:46 lr: 0.001070 loss: 2.922633 (3.119932) time: 0.995368 data: 0.000171 max mem: 18817 Epoch: [144/300] [ 200/1251] eta: 0:16:56 lr: 0.001069 loss: 3.349400 (3.112359) time: 1.017034 data: 0.000180 max mem: 18817 Epoch: [144/300] [ 250/1251] eta: 0:16:05 lr: 0.001069 loss: 3.186455 (3.119553) time: 0.979624 data: 0.000171 max mem: 18817 Epoch: [144/300] [ 300/1251] eta: 0:15:14 lr: 0.001069 loss: 3.170053 (3.128067) time: 0.920624 data: 0.000169 max mem: 18817 Epoch: [144/300] [ 350/1251] eta: 0:14:27 lr: 0.001068 loss: 3.122269 (3.137647) time: 0.938591 data: 0.000170 max mem: 18817 Epoch: [144/300] [ 400/1251] eta: 0:13:41 lr: 0.001068 loss: 3.347747 (3.142660) time: 1.007231 data: 0.000172 max mem: 18817 Epoch: [144/300] [ 450/1251] eta: 0:12:54 lr: 0.001067 loss: 3.233767 (3.153272) time: 1.051367 data: 0.000180 max mem: 18817 Epoch: [144/300] [ 500/1251] eta: 0:12:04 lr: 0.001067 loss: 3.302367 (3.145772) time: 0.980482 data: 0.000194 max mem: 18817 Epoch: [144/300] [ 550/1251] eta: 0:11:15 lr: 0.001066 loss: 3.119152 (3.158246) time: 0.918462 data: 0.000174 max mem: 18817 Epoch: [144/300] [ 600/1251] eta: 0:10:27 lr: 0.001066 loss: 3.234049 (3.165571) time: 0.933890 data: 0.000191 max mem: 18817 Epoch: [144/300] [ 650/1251] eta: 0:09:39 lr: 0.001066 loss: 3.209610 (3.160933) time: 0.989623 data: 0.000181 max mem: 18817 Epoch: [144/300] [ 700/1251] eta: 0:08:51 lr: 0.001065 loss: 3.138136 (3.158545) time: 1.040258 data: 0.000164 max mem: 18817 Epoch: [144/300] [ 750/1251] eta: 0:08:02 lr: 0.001065 loss: 3.053996 (3.157859) time: 0.980162 data: 0.000163 max mem: 18817 Epoch: [144/300] [ 800/1251] eta: 0:07:14 lr: 0.001064 loss: 3.383301 (3.165774) time: 0.924915 data: 0.000182 max mem: 18817 Epoch: [144/300] [ 850/1251] eta: 0:06:26 lr: 0.001064 loss: 3.231311 (3.166323) time: 0.927579 data: 0.000185 max mem: 18817 Epoch: [144/300] [ 900/1251] eta: 0:05:38 lr: 0.001064 loss: 3.345613 (3.166356) time: 1.004738 data: 0.000166 max mem: 18817 Epoch: [144/300] [ 950/1251] eta: 0:04:50 lr: 0.001063 loss: 3.162974 (3.172387) time: 1.022123 data: 0.000169 max mem: 18817 Epoch: [144/300] [1000/1251] eta: 0:04:01 lr: 0.001063 loss: 3.010128 (3.165762) time: 0.957323 data: 0.000179 max mem: 18817 Epoch: [144/300] [1050/1251] eta: 0:03:13 lr: 0.001062 loss: 3.358662 (3.166684) time: 0.935346 data: 0.000169 max mem: 18817 Epoch: [144/300] [1100/1251] eta: 0:02:25 lr: 0.001062 loss: 2.984257 (3.170015) time: 0.936051 data: 0.000195 max mem: 18817 Epoch: [144/300] [1150/1251] eta: 0:01:37 lr: 0.001061 loss: 3.245584 (3.167925) time: 1.009894 data: 0.000174 max mem: 18817 Epoch: [144/300] [1200/1251] eta: 0:00:49 lr: 0.001061 loss: 3.322794 (3.171342) time: 0.985250 data: 0.000179 max mem: 18817 Epoch: [144/300] [1250/1251] eta: 0:00:00 lr: 0.001061 loss: 3.170617 (3.171336) time: 0.957764 data: 0.000790 max mem: 18817 Epoch: [144/300] Total time: 0:20:07 (0.964994 s / it) Averaged stats: lr: 0.001061 loss: 3.170617 (3.171620) Test: [ 0/49] eta: 0:01:31 loss: 0.555200 (0.555200) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.862243 data: 1.442863 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.743801 (0.766941) acc1: 81.250000 (82.102273) acc5: 95.312500 (95.312500) time: 0.505165 data: 0.131310 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.812032 (0.799291) acc1: 78.125000 (79.910714) acc5: 95.312500 (95.461310) time: 0.375566 data: 0.000152 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.811963 (0.795513) acc1: 78.125000 (80.040323) acc5: 95.312500 (95.715726) time: 0.396918 data: 0.000146 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.820914 (0.810475) acc1: 79.687500 (79.916159) acc5: 95.312500 (95.464939) time: 0.401665 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.831961 (0.810495) acc1: 81.250000 (80.160000) acc5: 95.312500 (95.488000) time: 0.375316 data: 0.000109 max mem: 18817 Test: Total time: 0:00:20 (0.415691 s / it) * Acc@1 80.398 Acc@5 95.556 loss 0.812 Max accuracy: 80.50% Epoch: [145/300] [ 0/1251] eta: 0:42:22 lr: 0.001061 loss: 2.223340 (2.223340) time: 2.032456 data: 1.136539 max mem: 18817 Epoch: [145/300] [ 50/1251] eta: 0:19:40 lr: 0.001060 loss: 3.179596 (3.078923) time: 0.926538 data: 0.000147 max mem: 18817 Epoch: [145/300] [ 100/1251] eta: 0:18:45 lr: 0.001060 loss: 3.197113 (3.134713) time: 0.994879 data: 0.000170 max mem: 18817 Epoch: [145/300] [ 150/1251] eta: 0:17:54 lr: 0.001059 loss: 3.533198 (3.180112) time: 1.037921 data: 0.000181 max mem: 18817 Epoch: [145/300] [ 200/1251] eta: 0:16:56 lr: 0.001059 loss: 3.119219 (3.174602) time: 0.965228 data: 0.000182 max mem: 18817 Epoch: [145/300] [ 250/1251] eta: 0:16:04 lr: 0.001059 loss: 3.221089 (3.177741) time: 0.912075 data: 0.000182 max mem: 18817 Epoch: [145/300] [ 300/1251] eta: 0:15:17 lr: 0.001058 loss: 3.295880 (3.189519) time: 0.912142 data: 0.000157 max mem: 18817 Epoch: [145/300] [ 350/1251] eta: 0:14:29 lr: 0.001058 loss: 3.295421 (3.180852) time: 0.978496 data: 0.000160 max mem: 18817 Epoch: [145/300] [ 400/1251] eta: 0:13:43 lr: 0.001057 loss: 3.375429 (3.186270) time: 1.072683 data: 0.000189 max mem: 18817 Epoch: [145/300] [ 450/1251] eta: 0:12:52 lr: 0.001057 loss: 3.006617 (3.176181) time: 0.967153 data: 0.000175 max mem: 18817 Epoch: [145/300] [ 500/1251] eta: 0:12:03 lr: 0.001056 loss: 2.975519 (3.167576) time: 0.913736 data: 0.000171 max mem: 18817 Epoch: [145/300] [ 550/1251] eta: 0:11:16 lr: 0.001056 loss: 3.166475 (3.176759) time: 0.934740 data: 0.000175 max mem: 18817 Epoch: [145/300] [ 600/1251] eta: 0:10:27 lr: 0.001056 loss: 3.106337 (3.172526) time: 0.986895 data: 0.000182 max mem: 18817 Epoch: [145/300] [ 650/1251] eta: 0:09:40 lr: 0.001055 loss: 3.291270 (3.174108) time: 1.004743 data: 0.000177 max mem: 18817 Epoch: [145/300] [ 700/1251] eta: 0:08:50 lr: 0.001055 loss: 3.341556 (3.171319) time: 0.964240 data: 0.000163 max mem: 18817 Epoch: [145/300] [ 750/1251] eta: 0:08:02 lr: 0.001054 loss: 3.240347 (3.171403) time: 0.919018 data: 0.000182 max mem: 18817 Epoch: [145/300] [ 800/1251] eta: 0:07:14 lr: 0.001054 loss: 3.121972 (3.172529) time: 0.925289 data: 0.000169 max mem: 18817 Epoch: [145/300] [ 850/1251] eta: 0:06:26 lr: 0.001054 loss: 3.282525 (3.170271) time: 0.985554 data: 0.000162 max mem: 18817 Epoch: [145/300] [ 900/1251] eta: 0:05:37 lr: 0.001053 loss: 3.139634 (3.167655) time: 1.035712 data: 0.000169 max mem: 18817 Epoch: [145/300] [ 950/1251] eta: 0:04:49 lr: 0.001053 loss: 3.117930 (3.167396) time: 0.971698 data: 0.000174 max mem: 18817 Epoch: [145/300] [1000/1251] eta: 0:04:01 lr: 0.001052 loss: 3.284221 (3.166307) time: 0.917856 data: 0.000160 max mem: 18817 Epoch: [145/300] [1050/1251] eta: 0:03:13 lr: 0.001052 loss: 3.057825 (3.162466) time: 0.928466 data: 0.000180 max mem: 18817 Epoch: [145/300] [1100/1251] eta: 0:02:25 lr: 0.001052 loss: 3.398674 (3.166986) time: 1.017648 data: 0.000176 max mem: 18817 Epoch: [145/300] [1150/1251] eta: 0:01:37 lr: 0.001051 loss: 2.905800 (3.165560) time: 1.025604 data: 0.000182 max mem: 18817 Epoch: [145/300] [1200/1251] eta: 0:00:49 lr: 0.001051 loss: 3.458234 (3.171767) time: 0.955124 data: 0.000164 max mem: 18817 Epoch: [145/300] [1250/1251] eta: 0:00:00 lr: 0.001050 loss: 3.210759 (3.173099) time: 0.921018 data: 0.000760 max mem: 18817 Epoch: [145/300] Total time: 0:20:02 (0.961023 s / it) Averaged stats: lr: 0.001050 loss: 3.210759 (3.180536) Test: [ 0/49] eta: 0:01:27 loss: 0.585044 (0.585044) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.780453 data: 1.377197 max mem: 18817 Test: [10/49] eta: 0:00:24 loss: 0.733796 (0.758132) acc1: 79.687500 (81.250000) acc5: 96.875000 (96.022727) time: 0.622110 data: 0.125347 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.824070 (0.795709) acc1: 78.125000 (80.133929) acc5: 95.312500 (95.833333) time: 0.447457 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.771012 (0.788406) acc1: 78.125000 (80.594758) acc5: 95.312500 (95.816532) time: 0.376904 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.771012 (0.799505) acc1: 79.687500 (80.335366) acc5: 96.875000 (95.922256) time: 0.361634 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.825371 (0.795680) acc1: 79.687500 (80.544000) acc5: 96.875000 (96.064000) time: 0.359255 data: 0.000101 max mem: 18817 Test: Total time: 0:00:21 (0.428597 s / it) * Acc@1 80.682 Acc@5 95.648 loss 0.802 Max accuracy: 80.68% Epoch: [146/300] [ 0/1251] eta: 0:44:08 lr: 0.001050 loss: 3.823044 (3.823044) time: 2.117393 data: 1.211669 max mem: 18817 Epoch: [146/300] [ 50/1251] eta: 0:19:47 lr: 0.001050 loss: 3.099834 (3.117175) time: 0.984342 data: 0.000164 max mem: 18817 Epoch: [146/300] [ 100/1251] eta: 0:18:49 lr: 0.001049 loss: 3.250589 (3.101991) time: 1.024039 data: 0.000174 max mem: 18817 Epoch: [146/300] [ 150/1251] eta: 0:17:50 lr: 0.001049 loss: 3.464000 (3.107844) time: 0.981802 data: 0.000177 max mem: 18817 Epoch: [146/300] [ 200/1251] eta: 0:16:54 lr: 0.001049 loss: 3.199899 (3.125112) time: 0.923206 data: 0.000169 max mem: 18817 Epoch: [146/300] [ 250/1251] eta: 0:16:06 lr: 0.001048 loss: 3.116240 (3.146670) time: 0.926597 data: 0.000169 max mem: 18817 Epoch: [146/300] [ 300/1251] eta: 0:15:17 lr: 0.001048 loss: 3.292592 (3.145989) time: 0.974833 data: 0.000167 max mem: 18817 Epoch: [146/300] [ 350/1251] eta: 0:14:28 lr: 0.001047 loss: 3.349901 (3.152155) time: 1.030427 data: 0.000158 max mem: 18817 Epoch: [146/300] [ 400/1251] eta: 0:13:40 lr: 0.001047 loss: 3.171758 (3.149802) time: 0.990474 data: 0.000221 max mem: 18817 Epoch: [146/300] [ 450/1251] eta: 0:12:51 lr: 0.001047 loss: 3.177173 (3.149592) time: 0.929782 data: 0.000203 max mem: 18817 Epoch: [146/300] [ 500/1251] eta: 0:12:04 lr: 0.001046 loss: 3.378441 (3.160262) time: 0.942104 data: 0.000188 max mem: 18817 Epoch: [146/300] [ 550/1251] eta: 0:11:16 lr: 0.001046 loss: 3.220358 (3.157232) time: 1.001739 data: 0.000148 max mem: 18817 Epoch: [146/300] [ 600/1251] eta: 0:10:28 lr: 0.001045 loss: 3.259115 (3.158471) time: 1.036901 data: 0.000168 max mem: 18817 Epoch: [146/300] [ 650/1251] eta: 0:09:40 lr: 0.001045 loss: 3.351176 (3.160677) time: 0.987714 data: 0.000176 max mem: 18817 Epoch: [146/300] [ 700/1251] eta: 0:08:51 lr: 0.001044 loss: 3.305578 (3.159697) time: 0.929095 data: 0.000174 max mem: 18817 Epoch: [146/300] [ 750/1251] eta: 0:08:03 lr: 0.001044 loss: 3.242144 (3.158365) time: 0.921248 data: 0.000170 max mem: 18817 Epoch: [146/300] [ 800/1251] eta: 0:07:14 lr: 0.001044 loss: 3.121099 (3.159093) time: 0.982420 data: 0.000169 max mem: 18817 Epoch: [146/300] [ 850/1251] eta: 0:06:26 lr: 0.001043 loss: 3.093487 (3.156696) time: 1.037817 data: 0.000164 max mem: 18817 Epoch: [146/300] [ 900/1251] eta: 0:05:38 lr: 0.001043 loss: 3.237158 (3.160345) time: 0.971412 data: 0.000195 max mem: 18817 Epoch: [146/300] [ 950/1251] eta: 0:04:49 lr: 0.001042 loss: 3.039682 (3.161211) time: 0.926178 data: 0.000200 max mem: 18817 Epoch: [146/300] [1000/1251] eta: 0:04:01 lr: 0.001042 loss: 3.304727 (3.164845) time: 0.931354 data: 0.000173 max mem: 18817 Epoch: [146/300] [1050/1251] eta: 0:03:13 lr: 0.001042 loss: 3.214288 (3.163406) time: 0.980339 data: 0.000177 max mem: 18817 Epoch: [146/300] [1100/1251] eta: 0:02:25 lr: 0.001041 loss: 3.399211 (3.163021) time: 1.055831 data: 0.000184 max mem: 18817 Epoch: [146/300] [1150/1251] eta: 0:01:37 lr: 0.001041 loss: 3.233203 (3.163948) time: 0.968090 data: 0.000171 max mem: 18817 Epoch: [146/300] [1200/1251] eta: 0:00:49 lr: 0.001040 loss: 2.988423 (3.163530) time: 0.914108 data: 0.000175 max mem: 18817 Epoch: [146/300] [1250/1251] eta: 0:00:00 lr: 0.001040 loss: 3.222867 (3.162449) time: 0.926919 data: 0.000761 max mem: 18817 Epoch: [146/300] Total time: 0:20:04 (0.963028 s / it) Averaged stats: lr: 0.001040 loss: 3.222867 (3.163893) Test: [ 0/49] eta: 0:01:23 loss: 0.615744 (0.615744) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.705537 data: 1.280882 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.728432 (0.773070) acc1: 81.250000 (81.250000) acc5: 95.312500 (95.454545) time: 0.491700 data: 0.116604 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.801581 (0.793286) acc1: 81.250000 (81.101190) acc5: 95.312500 (95.535714) time: 0.365950 data: 0.000151 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.748103 (0.782858) acc1: 79.687500 (80.997984) acc5: 96.875000 (95.866935) time: 0.362477 data: 0.000124 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.748103 (0.792704) acc1: 81.250000 (81.097561) acc5: 96.875000 (95.884146) time: 0.391813 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.823175 (0.791163) acc1: 81.250000 (81.056000) acc5: 96.875000 (96.000000) time: 0.442668 data: 0.000103 max mem: 18817 Test: Total time: 0:00:20 (0.426951 s / it) * Acc@1 80.732 Acc@5 95.746 loss 0.805 Max accuracy: 80.73% Epoch: [147/300] [ 0/1251] eta: 0:42:27 lr: 0.001040 loss: 3.300163 (3.300163) time: 2.036131 data: 1.150069 max mem: 18817 Epoch: [147/300] [ 50/1251] eta: 0:19:34 lr: 0.001039 loss: 3.185591 (3.081447) time: 0.995654 data: 0.000181 max mem: 18817 Epoch: [147/300] [ 100/1251] eta: 0:18:43 lr: 0.001039 loss: 3.331308 (3.148348) time: 0.982705 data: 0.000167 max mem: 18817 Epoch: [147/300] [ 150/1251] eta: 0:17:48 lr: 0.001039 loss: 3.025031 (3.096064) time: 0.936315 data: 0.000169 max mem: 18817 Epoch: [147/300] [ 200/1251] eta: 0:17:02 lr: 0.001038 loss: 3.264264 (3.130040) time: 0.930796 data: 0.000158 max mem: 18817 Epoch: [147/300] [ 250/1251] eta: 0:16:13 lr: 0.001038 loss: 3.221988 (3.135134) time: 0.999152 data: 0.000173 max mem: 18817 Epoch: [147/300] [ 300/1251] eta: 0:15:23 lr: 0.001037 loss: 3.401233 (3.120628) time: 1.022202 data: 0.000168 max mem: 18817 Epoch: [147/300] [ 350/1251] eta: 0:14:31 lr: 0.001037 loss: 3.120970 (3.116265) time: 0.960272 data: 0.000167 max mem: 18817 Epoch: [147/300] [ 400/1251] eta: 0:13:40 lr: 0.001037 loss: 3.102974 (3.114156) time: 0.929239 data: 0.000178 max mem: 18817 Epoch: [147/300] [ 450/1251] eta: 0:12:53 lr: 0.001036 loss: 3.410576 (3.141620) time: 0.925697 data: 0.000178 max mem: 18817 Epoch: [147/300] [ 500/1251] eta: 0:12:04 lr: 0.001036 loss: 3.250386 (3.141579) time: 0.961205 data: 0.000170 max mem: 18817 Epoch: [147/300] [ 550/1251] eta: 0:11:15 lr: 0.001035 loss: 3.234498 (3.146555) time: 1.001230 data: 0.000177 max mem: 18817 Epoch: [147/300] [ 600/1251] eta: 0:10:26 lr: 0.001035 loss: 3.385921 (3.149461) time: 0.980018 data: 0.000184 max mem: 18817 Epoch: [147/300] [ 650/1251] eta: 0:09:37 lr: 0.001034 loss: 3.092569 (3.149501) time: 0.926491 data: 0.000177 max mem: 18817 Epoch: [147/300] [ 700/1251] eta: 0:08:50 lr: 0.001034 loss: 2.906525 (3.140850) time: 0.938840 data: 0.000161 max mem: 18817 Epoch: [147/300] [ 750/1251] eta: 0:08:02 lr: 0.001034 loss: 3.441402 (3.155708) time: 1.004031 data: 0.000176 max mem: 18817 Epoch: [147/300] [ 800/1251] eta: 0:07:14 lr: 0.001033 loss: 3.176396 (3.156513) time: 0.992366 data: 0.000157 max mem: 18817 Epoch: [147/300] [ 850/1251] eta: 0:06:25 lr: 0.001033 loss: 3.363738 (3.164020) time: 0.960895 data: 0.000189 max mem: 18817 Epoch: [147/300] [ 900/1251] eta: 0:05:37 lr: 0.001032 loss: 2.975940 (3.165024) time: 0.932071 data: 0.000162 max mem: 18817 Epoch: [147/300] [ 950/1251] eta: 0:04:49 lr: 0.001032 loss: 3.237958 (3.163909) time: 0.942500 data: 0.000171 max mem: 18817 Epoch: [147/300] [1000/1251] eta: 0:04:01 lr: 0.001032 loss: 3.370710 (3.158675) time: 0.992418 data: 0.000165 max mem: 18817 Epoch: [147/300] [1050/1251] eta: 0:03:13 lr: 0.001031 loss: 3.030212 (3.159662) time: 1.002761 data: 0.000170 max mem: 18817 Epoch: [147/300] [1100/1251] eta: 0:02:25 lr: 0.001031 loss: 3.237923 (3.157521) time: 0.970063 data: 0.000179 max mem: 18817 Epoch: [147/300] [1150/1251] eta: 0:01:37 lr: 0.001030 loss: 3.204962 (3.156512) time: 0.922708 data: 0.000177 max mem: 18817 Epoch: [147/300] [1200/1251] eta: 0:00:49 lr: 0.001030 loss: 3.276798 (3.159519) time: 0.939494 data: 0.000186 max mem: 18817 Epoch: [147/300] [1250/1251] eta: 0:00:00 lr: 0.001030 loss: 3.246389 (3.159255) time: 0.995068 data: 0.000725 max mem: 18817 Epoch: [147/300] Total time: 0:20:05 (0.963506 s / it) Averaged stats: lr: 0.001030 loss: 3.246389 (3.161910) Test: [ 0/49] eta: 0:01:32 loss: 0.660369 (0.660369) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.895297 data: 1.475824 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.787473 (0.812215) acc1: 81.250000 (82.812500) acc5: 95.312500 (95.312500) time: 0.528208 data: 0.134383 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.851663 (0.833695) acc1: 79.687500 (81.026786) acc5: 95.312500 (95.833333) time: 0.378078 data: 0.000186 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.851663 (0.830289) acc1: 79.687500 (80.645161) acc5: 96.875000 (95.967742) time: 0.364487 data: 0.000142 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.842148 (0.839493) acc1: 79.687500 (80.449695) acc5: 95.312500 (95.884146) time: 0.361554 data: 0.000152 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.857030 (0.836692) acc1: 79.687500 (80.352000) acc5: 95.312500 (95.904000) time: 0.355752 data: 0.000122 max mem: 18817 Test: Total time: 0:00:19 (0.401505 s / it) * Acc@1 80.668 Acc@5 95.718 loss 0.840 Max accuracy: 80.73% Epoch: [148/300] [ 0/1251] eta: 0:41:47 lr: 0.001030 loss: 3.103118 (3.103118) time: 2.004436 data: 1.108416 max mem: 18817 Epoch: [148/300] [ 50/1251] eta: 0:19:19 lr: 0.001029 loss: 3.194152 (3.261491) time: 0.958437 data: 0.000170 max mem: 18817 Epoch: [148/300] [ 100/1251] eta: 0:18:26 lr: 0.001029 loss: 3.287199 (3.205477) time: 0.936280 data: 0.000166 max mem: 18817 Epoch: [148/300] [ 150/1251] eta: 0:17:39 lr: 0.001028 loss: 3.117106 (3.174467) time: 0.920908 data: 0.000158 max mem: 18817 Epoch: [148/300] [ 200/1251] eta: 0:16:50 lr: 0.001028 loss: 3.284620 (3.155719) time: 0.981763 data: 0.000164 max mem: 18817 Epoch: [148/300] [ 250/1251] eta: 0:16:02 lr: 0.001027 loss: 3.192531 (3.152755) time: 0.994834 data: 0.000181 max mem: 18817 Epoch: [148/300] [ 300/1251] eta: 0:15:14 lr: 0.001027 loss: 3.184257 (3.147357) time: 0.977948 data: 0.000171 max mem: 18817 Epoch: [148/300] [ 350/1251] eta: 0:14:25 lr: 0.001027 loss: 3.352238 (3.165564) time: 0.942121 data: 0.000169 max mem: 18817 Epoch: [148/300] [ 400/1251] eta: 0:13:37 lr: 0.001026 loss: 3.287033 (3.162094) time: 0.923288 data: 0.000187 max mem: 18817 Epoch: [148/300] [ 450/1251] eta: 0:12:50 lr: 0.001026 loss: 3.124698 (3.151236) time: 0.994734 data: 0.000187 max mem: 18817 Epoch: [148/300] [ 500/1251] eta: 0:12:01 lr: 0.001025 loss: 3.066662 (3.160489) time: 0.984240 data: 0.000165 max mem: 18817 Epoch: [148/300] [ 550/1251] eta: 0:11:13 lr: 0.001025 loss: 3.160309 (3.163865) time: 0.974151 data: 0.000168 max mem: 18817 Epoch: [148/300] [ 600/1251] eta: 0:10:24 lr: 0.001025 loss: 3.201905 (3.175769) time: 0.928847 data: 0.000159 max mem: 18817 Epoch: [148/300] [ 650/1251] eta: 0:09:37 lr: 0.001024 loss: 3.531173 (3.177894) time: 0.929044 data: 0.000155 max mem: 18817 Epoch: [148/300] [ 700/1251] eta: 0:08:50 lr: 0.001024 loss: 3.444086 (3.178037) time: 1.010769 data: 0.000171 max mem: 18817 Epoch: [148/300] [ 750/1251] eta: 0:08:02 lr: 0.001023 loss: 3.153860 (3.177084) time: 1.054886 data: 0.000173 max mem: 18817 Epoch: [148/300] [ 800/1251] eta: 0:07:14 lr: 0.001023 loss: 3.245843 (3.169480) time: 0.963424 data: 0.000165 max mem: 18817 Epoch: [148/300] [ 850/1251] eta: 0:06:25 lr: 0.001022 loss: 3.386979 (3.177911) time: 0.915450 data: 0.000168 max mem: 18817 Epoch: [148/300] [ 900/1251] eta: 0:05:37 lr: 0.001022 loss: 3.159065 (3.177228) time: 0.932042 data: 0.000174 max mem: 18817 Epoch: [148/300] [ 950/1251] eta: 0:04:50 lr: 0.001022 loss: 3.196352 (3.175478) time: 0.983020 data: 0.000168 max mem: 18817 Epoch: [148/300] [1000/1251] eta: 0:04:02 lr: 0.001021 loss: 3.106655 (3.180053) time: 1.052230 data: 0.000189 max mem: 18817 Epoch: [148/300] [1050/1251] eta: 0:03:13 lr: 0.001021 loss: 3.197317 (3.182997) time: 0.984460 data: 0.000174 max mem: 18817 Epoch: [148/300] [1100/1251] eta: 0:02:25 lr: 0.001020 loss: 3.215551 (3.184005) time: 0.931644 data: 0.000163 max mem: 18817 Epoch: [148/300] [1150/1251] eta: 0:01:37 lr: 0.001020 loss: 2.940823 (3.181402) time: 0.936461 data: 0.000182 max mem: 18817 Epoch: [148/300] [1200/1251] eta: 0:00:49 lr: 0.001020 loss: 3.335124 (3.184427) time: 0.987648 data: 0.000184 max mem: 18817 Epoch: [148/300] [1250/1251] eta: 0:00:00 lr: 0.001019 loss: 3.161200 (3.183852) time: 1.037637 data: 0.000748 max mem: 18817 Epoch: [148/300] Total time: 0:20:05 (0.963440 s / it) Averaged stats: lr: 0.001019 loss: 3.161200 (3.175818) Test: [ 0/49] eta: 0:01:17 loss: 0.582546 (0.582546) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.583546 data: 1.130889 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.737660 (0.804788) acc1: 79.687500 (80.681818) acc5: 96.875000 (96.306818) time: 0.492407 data: 0.102941 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.846615 (0.818241) acc1: 79.687500 (80.803571) acc5: 96.875000 (95.982143) time: 0.373053 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.789814 (0.815605) acc1: 79.687500 (80.745968) acc5: 96.875000 (96.068548) time: 0.362937 data: 0.000146 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.782755 (0.824104) acc1: 81.250000 (80.678354) acc5: 95.312500 (95.960366) time: 0.375579 data: 0.000149 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.785829 (0.821092) acc1: 81.250000 (80.736000) acc5: 95.312500 (96.032000) time: 0.370397 data: 0.000120 max mem: 18817 Test: Total time: 0:00:19 (0.398275 s / it) * Acc@1 80.516 Acc@5 95.704 loss 0.831 Max accuracy: 80.73% Epoch: [149/300] [ 0/1251] eta: 0:43:23 lr: 0.001019 loss: 3.570437 (3.570437) time: 2.081351 data: 1.181979 max mem: 18817 Epoch: [149/300] [ 50/1251] eta: 0:19:07 lr: 0.001019 loss: 3.212545 (3.071814) time: 0.904454 data: 0.000166 max mem: 18817 Epoch: [149/300] [ 100/1251] eta: 0:18:29 lr: 0.001018 loss: 3.328425 (3.153875) time: 0.924179 data: 0.000165 max mem: 18817 Epoch: [149/300] [ 150/1251] eta: 0:17:39 lr: 0.001018 loss: 3.242544 (3.157607) time: 0.976444 data: 0.000198 max mem: 18817 Epoch: [149/300] [ 200/1251] eta: 0:16:55 lr: 0.001017 loss: 3.199884 (3.161585) time: 1.046093 data: 0.000175 max mem: 18817 Epoch: [149/300] [ 250/1251] eta: 0:16:04 lr: 0.001017 loss: 3.555239 (3.178890) time: 0.965497 data: 0.000175 max mem: 18817 Epoch: [149/300] [ 300/1251] eta: 0:15:15 lr: 0.001017 loss: 3.355349 (3.168695) time: 0.951372 data: 0.000168 max mem: 18817 Epoch: [149/300] [ 350/1251] eta: 0:14:28 lr: 0.001016 loss: 3.231016 (3.164050) time: 0.926749 data: 0.000166 max mem: 18817 Epoch: [149/300] [ 400/1251] eta: 0:13:40 lr: 0.001016 loss: 3.215959 (3.174591) time: 0.989543 data: 0.000185 max mem: 18817 Epoch: [149/300] [ 450/1251] eta: 0:12:52 lr: 0.001015 loss: 3.270406 (3.172708) time: 1.024757 data: 0.000181 max mem: 18817 Epoch: [149/300] [ 500/1251] eta: 0:12:02 lr: 0.001015 loss: 3.350543 (3.175110) time: 0.986708 data: 0.000168 max mem: 18817 Epoch: [149/300] [ 550/1251] eta: 0:11:14 lr: 0.001015 loss: 3.329846 (3.167009) time: 0.931491 data: 0.000173 max mem: 18817 Epoch: [149/300] [ 600/1251] eta: 0:10:26 lr: 0.001014 loss: 3.164659 (3.165210) time: 0.929249 data: 0.000210 max mem: 18817 Epoch: [149/300] [ 650/1251] eta: 0:09:38 lr: 0.001014 loss: 3.320293 (3.163087) time: 0.977265 data: 0.000179 max mem: 18817 Epoch: [149/300] [ 700/1251] eta: 0:08:50 lr: 0.001013 loss: 3.018668 (3.159953) time: 1.040369 data: 0.000164 max mem: 18817 Epoch: [149/300] [ 750/1251] eta: 0:08:01 lr: 0.001013 loss: 2.952769 (3.154411) time: 0.981720 data: 0.000185 max mem: 18817 Epoch: [149/300] [ 800/1251] eta: 0:07:13 lr: 0.001013 loss: 3.023404 (3.145559) time: 0.920592 data: 0.000178 max mem: 18817 Epoch: [149/300] [ 850/1251] eta: 0:06:25 lr: 0.001012 loss: 3.295442 (3.144376) time: 0.914567 data: 0.000174 max mem: 18817 Epoch: [149/300] [ 900/1251] eta: 0:05:37 lr: 0.001012 loss: 3.421752 (3.144675) time: 0.978990 data: 0.000172 max mem: 18817 Epoch: [149/300] [ 950/1251] eta: 0:04:49 lr: 0.001011 loss: 3.319781 (3.149072) time: 1.037438 data: 0.000164 max mem: 18817 Epoch: [149/300] [1000/1251] eta: 0:04:01 lr: 0.001011 loss: 3.120543 (3.152773) time: 0.983993 data: 0.000156 max mem: 18817 Epoch: [149/300] [1050/1251] eta: 0:03:13 lr: 0.001010 loss: 2.866843 (3.152891) time: 0.918325 data: 0.000174 max mem: 18817 Epoch: [149/300] [1100/1251] eta: 0:02:25 lr: 0.001010 loss: 3.413802 (3.155144) time: 0.925169 data: 0.000186 max mem: 18817 Epoch: [149/300] [1150/1251] eta: 0:01:37 lr: 0.001010 loss: 3.347930 (3.157713) time: 0.987214 data: 0.000183 max mem: 18817 Epoch: [149/300] [1200/1251] eta: 0:00:49 lr: 0.001009 loss: 3.164047 (3.155294) time: 1.042560 data: 0.000170 max mem: 18817 Epoch: [149/300] [1250/1251] eta: 0:00:00 lr: 0.001009 loss: 2.947450 (3.153412) time: 0.979230 data: 0.000739 max mem: 18817 Epoch: [149/300] Total time: 0:20:02 (0.961201 s / it) Averaged stats: lr: 0.001009 loss: 2.947450 (3.151874) Test: [ 0/49] eta: 0:01:29 loss: 0.620793 (0.620793) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.819619 data: 1.404180 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.702898 (0.772462) acc1: 82.812500 (82.954545) acc5: 96.875000 (96.022727) time: 0.502110 data: 0.127792 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.824098 (0.799456) acc1: 79.687500 (81.398810) acc5: 95.312500 (95.461310) time: 0.366631 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.822851 (0.785861) acc1: 79.687500 (80.997984) acc5: 95.312500 (95.917339) time: 0.368493 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.822851 (0.805582) acc1: 79.687500 (80.716463) acc5: 95.312500 (95.846037) time: 0.385683 data: 0.000133 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.843840 (0.801106) acc1: 79.687500 (80.928000) acc5: 95.312500 (95.840000) time: 0.395156 data: 0.000104 max mem: 18817 Test: Total time: 0:00:20 (0.410094 s / it) * Acc@1 80.764 Acc@5 95.738 loss 0.817 Max accuracy: 80.76% Epoch: [150/300] [ 0/1251] eta: 0:41:16 lr: 0.001009 loss: 3.381042 (3.381042) time: 1.979884 data: 1.082731 max mem: 18817 Epoch: [150/300] [ 50/1251] eta: 0:19:54 lr: 0.001008 loss: 3.441262 (3.248062) time: 0.928848 data: 0.000175 max mem: 18817 Epoch: [150/300] [ 100/1251] eta: 0:18:51 lr: 0.001008 loss: 3.071509 (3.196752) time: 0.995879 data: 0.000179 max mem: 18817 Epoch: [150/300] [ 150/1251] eta: 0:17:59 lr: 0.001008 loss: 3.236659 (3.150628) time: 1.050277 data: 0.000171 max mem: 18817 Epoch: [150/300] [ 200/1251] eta: 0:17:02 lr: 0.001007 loss: 3.153261 (3.159213) time: 0.979073 data: 0.000170 max mem: 18817 Epoch: [150/300] [ 250/1251] eta: 0:16:06 lr: 0.001007 loss: 2.932734 (3.145697) time: 0.921787 data: 0.000164 max mem: 18817 Epoch: [150/300] [ 300/1251] eta: 0:15:16 lr: 0.001006 loss: 3.060813 (3.138756) time: 0.921578 data: 0.000176 max mem: 18817 Epoch: [150/300] [ 350/1251] eta: 0:14:29 lr: 0.001006 loss: 3.261493 (3.159570) time: 0.984615 data: 0.000159 max mem: 18817 Epoch: [150/300] [ 400/1251] eta: 0:13:40 lr: 0.001005 loss: 3.185393 (3.138404) time: 1.020378 data: 0.000180 max mem: 18817 Epoch: [150/300] [ 450/1251] eta: 0:12:51 lr: 0.001005 loss: 3.101579 (3.136143) time: 0.971807 data: 0.000175 max mem: 18817 Epoch: [150/300] [ 500/1251] eta: 0:12:01 lr: 0.001005 loss: 3.042698 (3.117665) time: 0.913923 data: 0.000166 max mem: 18817 Epoch: [150/300] [ 550/1251] eta: 0:11:15 lr: 0.001004 loss: 3.295166 (3.125733) time: 0.945984 data: 0.000181 max mem: 18817 Epoch: [150/300] [ 600/1251] eta: 0:10:27 lr: 0.001004 loss: 3.026866 (3.122023) time: 0.980632 data: 0.000178 max mem: 18817 Epoch: [150/300] [ 650/1251] eta: 0:09:39 lr: 0.001003 loss: 3.375611 (3.127357) time: 1.037383 data: 0.000159 max mem: 18817 Epoch: [150/300] [ 700/1251] eta: 0:08:50 lr: 0.001003 loss: 3.021868 (3.129785) time: 0.962018 data: 0.000159 max mem: 18817 Epoch: [150/300] [ 750/1251] eta: 0:08:01 lr: 0.001003 loss: 3.269245 (3.135009) time: 0.930489 data: 0.000172 max mem: 18817 Epoch: [150/300] [ 800/1251] eta: 0:07:14 lr: 0.001002 loss: 3.203923 (3.134252) time: 0.936859 data: 0.000185 max mem: 18817 Epoch: [150/300] [ 850/1251] eta: 0:06:26 lr: 0.001002 loss: 3.134947 (3.133573) time: 0.980479 data: 0.000166 max mem: 18817 Epoch: [150/300] [ 900/1251] eta: 0:05:38 lr: 0.001001 loss: 3.328023 (3.135027) time: 1.028613 data: 0.000182 max mem: 18817 Epoch: [150/300] [ 950/1251] eta: 0:04:49 lr: 0.001001 loss: 3.249530 (3.135431) time: 0.970152 data: 0.000195 max mem: 18817 Epoch: [150/300] [1000/1251] eta: 0:04:01 lr: 0.001000 loss: 3.106237 (3.132114) time: 0.928462 data: 0.000189 max mem: 18817 Epoch: [150/300] [1050/1251] eta: 0:03:13 lr: 0.001000 loss: 3.111754 (3.133914) time: 0.919371 data: 0.000182 max mem: 18817 Epoch: [150/300] [1100/1251] eta: 0:02:25 lr: 0.001000 loss: 3.423419 (3.142549) time: 0.976888 data: 0.000173 max mem: 18817 Epoch: [150/300] [1150/1251] eta: 0:01:37 lr: 0.000999 loss: 3.135164 (3.133673) time: 1.008628 data: 0.000169 max mem: 18817 Epoch: [150/300] [1200/1251] eta: 0:00:49 lr: 0.000999 loss: 3.276929 (3.137883) time: 0.970429 data: 0.000179 max mem: 18817 Epoch: [150/300] [1250/1251] eta: 0:00:00 lr: 0.000998 loss: 3.218656 (3.137489) time: 0.913645 data: 0.000737 max mem: 18817 Epoch: [150/300] Total time: 0:20:02 (0.961140 s / it) Averaged stats: lr: 0.000998 loss: 3.218656 (3.136977) Test: [ 0/49] eta: 0:01:28 loss: 0.542967 (0.542967) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.798398 data: 1.397796 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.693643 (0.741043) acc1: 82.812500 (82.528409) acc5: 96.875000 (95.880682) time: 0.500347 data: 0.127221 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.809941 (0.772024) acc1: 79.687500 (81.398810) acc5: 96.875000 (95.907738) time: 0.402708 data: 0.000147 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.790536 (0.779982) acc1: 79.687500 (81.149194) acc5: 96.875000 (95.866935) time: 0.454887 data: 0.000140 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.793457 (0.796684) acc1: 79.687500 (80.792683) acc5: 95.312500 (95.617378) time: 0.416676 data: 0.000148 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.855916 (0.796886) acc1: 79.687500 (80.704000) acc5: 95.312500 (95.712000) time: 0.355621 data: 0.000120 max mem: 18817 Test: Total time: 0:00:21 (0.431249 s / it) * Acc@1 80.856 Acc@5 95.680 loss 0.796 Max accuracy: 80.86% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0150.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0150.pth Epoch: [151/300] [ 0/1251] eta: 0:41:53 lr: 0.000998 loss: 3.372192 (3.372192) time: 2.009167 data: 1.120242 max mem: 18817 Epoch: [151/300] [ 50/1251] eta: 0:20:01 lr: 0.000998 loss: 3.031681 (3.131314) time: 0.930380 data: 0.000170 max mem: 18817 Epoch: [151/300] [ 100/1251] eta: 0:18:52 lr: 0.000998 loss: 3.263180 (3.188746) time: 0.962054 data: 0.000189 max mem: 18817 Epoch: [151/300] [ 150/1251] eta: 0:17:54 lr: 0.000997 loss: 3.238258 (3.200507) time: 0.980807 data: 0.000172 max mem: 18817 Epoch: [151/300] [ 200/1251] eta: 0:17:01 lr: 0.000997 loss: 3.111459 (3.183673) time: 0.991508 data: 0.000199 max mem: 18817 Epoch: [151/300] [ 250/1251] eta: 0:16:08 lr: 0.000996 loss: 3.420077 (3.195498) time: 0.916221 data: 0.000194 max mem: 18817 Epoch: [151/300] [ 300/1251] eta: 0:15:19 lr: 0.000996 loss: 3.234289 (3.173874) time: 0.926484 data: 0.000176 max mem: 18817 Epoch: [151/300] [ 350/1251] eta: 0:14:32 lr: 0.000995 loss: 3.243594 (3.173591) time: 0.947138 data: 0.000184 max mem: 18817 Epoch: [151/300] [ 400/1251] eta: 0:13:45 lr: 0.000995 loss: 3.228753 (3.180268) time: 0.994998 data: 0.000192 max mem: 18817 Epoch: [151/300] [ 450/1251] eta: 0:12:53 lr: 0.000995 loss: 3.338953 (3.175562) time: 0.956005 data: 0.000173 max mem: 18817 Epoch: [151/300] [ 500/1251] eta: 0:12:04 lr: 0.000994 loss: 3.312455 (3.171968) time: 0.921373 data: 0.000178 max mem: 18817 Epoch: [151/300] [ 550/1251] eta: 0:11:16 lr: 0.000994 loss: 3.384565 (3.182715) time: 0.934131 data: 0.000191 max mem: 18817 Epoch: [151/300] [ 600/1251] eta: 0:10:28 lr: 0.000993 loss: 3.171851 (3.177227) time: 0.934830 data: 0.000187 max mem: 18817 Epoch: [151/300] [ 650/1251] eta: 0:09:41 lr: 0.000993 loss: 3.177955 (3.175142) time: 0.989292 data: 0.000166 max mem: 18817 Epoch: [151/300] [ 700/1251] eta: 0:08:51 lr: 0.000993 loss: 3.004532 (3.166798) time: 0.978787 data: 0.000180 max mem: 18817 Epoch: [151/300] [ 750/1251] eta: 0:08:02 lr: 0.000992 loss: 3.282017 (3.168691) time: 0.919291 data: 0.000165 max mem: 18817 Epoch: [151/300] [ 800/1251] eta: 0:07:15 lr: 0.000992 loss: 3.323835 (3.171507) time: 0.936961 data: 0.000175 max mem: 18817 Epoch: [151/300] [ 850/1251] eta: 0:06:26 lr: 0.000991 loss: 3.010607 (3.172602) time: 0.930485 data: 0.000177 max mem: 18817 Epoch: [151/300] [ 900/1251] eta: 0:05:38 lr: 0.000991 loss: 3.137722 (3.175505) time: 0.997120 data: 0.000162 max mem: 18817 Epoch: [151/300] [ 950/1251] eta: 0:04:50 lr: 0.000991 loss: 3.215829 (3.174902) time: 0.970518 data: 0.000182 max mem: 18817 Epoch: [151/300] [1000/1251] eta: 0:04:01 lr: 0.000990 loss: 3.464267 (3.173917) time: 0.920863 data: 0.000175 max mem: 18817 Epoch: [151/300] [1050/1251] eta: 0:03:13 lr: 0.000990 loss: 3.334625 (3.181013) time: 0.930830 data: 0.000169 max mem: 18817 Epoch: [151/300] [1100/1251] eta: 0:02:25 lr: 0.000989 loss: 3.313531 (3.182150) time: 0.934374 data: 0.000187 max mem: 18817 Epoch: [151/300] [1150/1251] eta: 0:01:37 lr: 0.000989 loss: 3.078141 (3.178601) time: 0.969849 data: 0.000179 max mem: 18817 Epoch: [151/300] [1200/1251] eta: 0:00:49 lr: 0.000988 loss: 3.275281 (3.180821) time: 0.977923 data: 0.000169 max mem: 18817 Epoch: [151/300] [1250/1251] eta: 0:00:00 lr: 0.000988 loss: 3.423804 (3.182172) time: 0.910001 data: 0.000794 max mem: 18817 Epoch: [151/300] Total time: 0:20:05 (0.963439 s / it) Averaged stats: lr: 0.000988 loss: 3.423804 (3.191079) Test: [ 0/49] eta: 0:01:33 loss: 0.593082 (0.593082) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.911974 data: 1.126821 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.673217 (0.758272) acc1: 82.812500 (83.238636) acc5: 96.875000 (95.880682) time: 0.508624 data: 0.102580 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.821548 (0.814173) acc1: 79.687500 (81.101190) acc5: 95.312500 (95.833333) time: 0.365054 data: 0.000154 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.839294 (0.809246) acc1: 79.687500 (80.846774) acc5: 96.875000 (95.967742) time: 0.362679 data: 0.000151 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.820054 (0.822234) acc1: 79.687500 (80.716463) acc5: 95.312500 (95.922256) time: 0.360260 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.840690 (0.823845) acc1: 79.687500 (80.704000) acc5: 95.312500 (95.936000) time: 0.382447 data: 0.000107 max mem: 18817 Test: Total time: 0:00:19 (0.406170 s / it) * Acc@1 80.640 Acc@5 95.688 loss 0.825 Max accuracy: 80.86% Epoch: [152/300] [ 0/1251] eta: 0:40:36 lr: 0.000988 loss: 3.626401 (3.626401) time: 1.947676 data: 1.051858 max mem: 18817 Epoch: [152/300] [ 50/1251] eta: 0:19:42 lr: 0.000988 loss: 3.289181 (3.177262) time: 0.990235 data: 0.000164 max mem: 18817 Epoch: [152/300] [ 100/1251] eta: 0:18:42 lr: 0.000987 loss: 3.241583 (3.160937) time: 1.025853 data: 0.000168 max mem: 18817 Epoch: [152/300] [ 150/1251] eta: 0:17:44 lr: 0.000987 loss: 3.134706 (3.168079) time: 0.965817 data: 0.000158 max mem: 18817 Epoch: [152/300] [ 200/1251] eta: 0:16:55 lr: 0.000986 loss: 3.278295 (3.158572) time: 0.931417 data: 0.000175 max mem: 18817 Epoch: [152/300] [ 250/1251] eta: 0:16:10 lr: 0.000986 loss: 3.282962 (3.147594) time: 0.930365 data: 0.000187 max mem: 18817 Epoch: [152/300] [ 300/1251] eta: 0:15:22 lr: 0.000986 loss: 3.353662 (3.160317) time: 1.002802 data: 0.000157 max mem: 18817 Epoch: [152/300] [ 350/1251] eta: 0:14:34 lr: 0.000985 loss: 3.183893 (3.163348) time: 1.016744 data: 0.000162 max mem: 18817 Epoch: [152/300] [ 400/1251] eta: 0:13:43 lr: 0.000985 loss: 3.260583 (3.162006) time: 0.977867 data: 0.000175 max mem: 18817 Epoch: [152/300] [ 450/1251] eta: 0:12:52 lr: 0.000984 loss: 3.023305 (3.150501) time: 0.912790 data: 0.000175 max mem: 18817 Epoch: [152/300] [ 500/1251] eta: 0:12:04 lr: 0.000984 loss: 3.319902 (3.152208) time: 0.931351 data: 0.000191 max mem: 18817 Epoch: [152/300] [ 550/1251] eta: 0:11:16 lr: 0.000983 loss: 3.270127 (3.145585) time: 0.997214 data: 0.000179 max mem: 18817 Epoch: [152/300] [ 600/1251] eta: 0:10:29 lr: 0.000983 loss: 3.078825 (3.132262) time: 1.030434 data: 0.000171 max mem: 18817 Epoch: [152/300] [ 650/1251] eta: 0:09:40 lr: 0.000983 loss: 3.273132 (3.134189) time: 0.967298 data: 0.000172 max mem: 18817 Epoch: [152/300] [ 700/1251] eta: 0:08:50 lr: 0.000982 loss: 3.146109 (3.138269) time: 0.908247 data: 0.000173 max mem: 18817 Epoch: [152/300] [ 750/1251] eta: 0:08:03 lr: 0.000982 loss: 3.007393 (3.137444) time: 0.931395 data: 0.000161 max mem: 18817 Epoch: [152/300] [ 800/1251] eta: 0:07:15 lr: 0.000981 loss: 2.807055 (3.138344) time: 0.978089 data: 0.000161 max mem: 18817 Epoch: [152/300] [ 850/1251] eta: 0:06:26 lr: 0.000981 loss: 3.359053 (3.144994) time: 0.966474 data: 0.000183 max mem: 18817 Epoch: [152/300] [ 900/1251] eta: 0:05:38 lr: 0.000981 loss: 3.109110 (3.152801) time: 0.962209 data: 0.000171 max mem: 18817 Epoch: [152/300] [ 950/1251] eta: 0:04:49 lr: 0.000980 loss: 2.969210 (3.155486) time: 0.912593 data: 0.000166 max mem: 18817 Epoch: [152/300] [1000/1251] eta: 0:04:01 lr: 0.000980 loss: 3.108422 (3.155440) time: 0.927385 data: 0.000179 max mem: 18817 Epoch: [152/300] [1050/1251] eta: 0:03:13 lr: 0.000979 loss: 2.902518 (3.144567) time: 0.981357 data: 0.000179 max mem: 18817 Epoch: [152/300] [1100/1251] eta: 0:02:25 lr: 0.000979 loss: 3.241464 (3.150103) time: 0.968421 data: 0.000190 max mem: 18817 Epoch: [152/300] [1150/1251] eta: 0:01:37 lr: 0.000978 loss: 3.189102 (3.150804) time: 0.991282 data: 0.000170 max mem: 18817 Epoch: [152/300] [1200/1251] eta: 0:00:49 lr: 0.000978 loss: 3.134039 (3.154358) time: 0.924715 data: 0.000184 max mem: 18817 Epoch: [152/300] [1250/1251] eta: 0:00:00 lr: 0.000978 loss: 2.964336 (3.152975) time: 0.931055 data: 0.000750 max mem: 18817 Epoch: [152/300] Total time: 0:20:04 (0.962518 s / it) Averaged stats: lr: 0.000978 loss: 2.964336 (3.156582) Test: [ 0/49] eta: 0:01:28 loss: 0.651610 (0.651610) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.814368 data: 1.412032 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.746141 (0.780180) acc1: 82.812500 (82.670455) acc5: 95.312500 (95.596591) time: 0.496674 data: 0.128621 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.792358 (0.815051) acc1: 79.687500 (81.770833) acc5: 95.312500 (95.535714) time: 0.364125 data: 0.000202 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.792358 (0.811118) acc1: 78.125000 (80.897177) acc5: 95.312500 (95.766129) time: 0.363118 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.798133 (0.819898) acc1: 79.687500 (80.907012) acc5: 95.312500 (95.617378) time: 0.449431 data: 0.000130 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.840548 (0.816932) acc1: 79.687500 (81.024000) acc5: 95.312500 (95.776000) time: 0.444555 data: 0.000105 max mem: 18817 Test: Total time: 0:00:21 (0.429723 s / it) * Acc@1 80.842 Acc@5 95.870 loss 0.811 Max accuracy: 80.86% Epoch: [153/300] [ 0/1251] eta: 0:44:58 lr: 0.000978 loss: 2.818629 (2.818629) time: 2.156718 data: 1.253107 max mem: 18817 Epoch: [153/300] [ 50/1251] eta: 0:19:56 lr: 0.000977 loss: 3.155750 (2.989654) time: 1.047240 data: 0.000189 max mem: 18817 Epoch: [153/300] [ 100/1251] eta: 0:18:37 lr: 0.000977 loss: 2.960485 (3.065625) time: 0.975613 data: 0.000163 max mem: 18817 Epoch: [153/300] [ 150/1251] eta: 0:17:37 lr: 0.000976 loss: 3.266628 (3.092317) time: 0.920135 data: 0.000171 max mem: 18817 Epoch: [153/300] [ 200/1251] eta: 0:16:54 lr: 0.000976 loss: 2.900267 (3.083214) time: 0.930374 data: 0.000181 max mem: 18817 Epoch: [153/300] [ 250/1251] eta: 0:16:05 lr: 0.000976 loss: 3.374432 (3.108015) time: 0.976435 data: 0.000162 max mem: 18817 Epoch: [153/300] [ 300/1251] eta: 0:15:18 lr: 0.000975 loss: 3.112286 (3.112559) time: 0.994627 data: 0.000172 max mem: 18817 Epoch: [153/300] [ 350/1251] eta: 0:14:27 lr: 0.000975 loss: 3.056307 (3.114860) time: 0.973690 data: 0.000156 max mem: 18817 Epoch: [153/300] [ 400/1251] eta: 0:13:38 lr: 0.000974 loss: 3.217697 (3.116237) time: 0.915018 data: 0.000173 max mem: 18817 Epoch: [153/300] [ 450/1251] eta: 0:12:51 lr: 0.000974 loss: 3.262801 (3.125014) time: 0.918913 data: 0.000171 max mem: 18817 Epoch: [153/300] [ 500/1251] eta: 0:12:03 lr: 0.000973 loss: 3.029057 (3.122585) time: 0.988810 data: 0.000186 max mem: 18817 Epoch: [153/300] [ 550/1251] eta: 0:11:16 lr: 0.000973 loss: 3.286139 (3.125343) time: 1.006630 data: 0.000184 max mem: 18817 Epoch: [153/300] [ 600/1251] eta: 0:10:27 lr: 0.000973 loss: 3.264061 (3.129438) time: 0.986591 data: 0.000178 max mem: 18817 Epoch: [153/300] [ 650/1251] eta: 0:09:38 lr: 0.000972 loss: 3.094591 (3.132828) time: 0.920639 data: 0.000153 max mem: 18817 Epoch: [153/300] [ 700/1251] eta: 0:08:50 lr: 0.000972 loss: 3.065968 (3.127798) time: 0.920568 data: 0.000159 max mem: 18817 Epoch: [153/300] [ 750/1251] eta: 0:08:02 lr: 0.000971 loss: 3.454698 (3.137880) time: 0.982456 data: 0.000158 max mem: 18817 Epoch: [153/300] [ 800/1251] eta: 0:07:14 lr: 0.000971 loss: 3.077771 (3.135215) time: 1.009386 data: 0.000183 max mem: 18817 Epoch: [153/300] [ 850/1251] eta: 0:06:25 lr: 0.000971 loss: 3.261787 (3.137462) time: 0.972119 data: 0.000172 max mem: 18817 Epoch: [153/300] [ 900/1251] eta: 0:05:37 lr: 0.000970 loss: 3.193629 (3.132679) time: 0.914989 data: 0.000171 max mem: 18817 Epoch: [153/300] [ 950/1251] eta: 0:04:49 lr: 0.000970 loss: 3.294814 (3.140369) time: 0.928509 data: 0.000182 max mem: 18817 Epoch: [153/300] [1000/1251] eta: 0:04:01 lr: 0.000969 loss: 3.010185 (3.140373) time: 0.970446 data: 0.000163 max mem: 18817 Epoch: [153/300] [1050/1251] eta: 0:03:13 lr: 0.000969 loss: 3.212211 (3.144986) time: 1.000368 data: 0.000199 max mem: 18817 Epoch: [153/300] [1100/1251] eta: 0:02:25 lr: 0.000969 loss: 3.299592 (3.144612) time: 0.958422 data: 0.000177 max mem: 18817 Epoch: [153/300] [1150/1251] eta: 0:01:36 lr: 0.000968 loss: 3.202178 (3.143107) time: 0.916035 data: 0.000161 max mem: 18817 Epoch: [153/300] [1200/1251] eta: 0:00:48 lr: 0.000968 loss: 3.064859 (3.137076) time: 0.927880 data: 0.000166 max mem: 18817 Epoch: [153/300] [1250/1251] eta: 0:00:00 lr: 0.000967 loss: 3.261651 (3.141043) time: 1.002654 data: 0.000753 max mem: 18817 Epoch: [153/300] Total time: 0:20:02 (0.961240 s / it) Averaged stats: lr: 0.000967 loss: 3.261651 (3.128379) Test: [ 0/49] eta: 0:01:22 loss: 0.654909 (0.654909) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.682119 data: 1.229804 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.679168 (0.762392) acc1: 82.812500 (82.670455) acc5: 95.312500 (95.880682) time: 0.495562 data: 0.111930 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.790757 (0.801367) acc1: 79.687500 (81.026786) acc5: 95.312500 (95.907738) time: 0.378100 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.835928 (0.803056) acc1: 78.125000 (80.594758) acc5: 95.312500 (95.967742) time: 0.370202 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.835928 (0.811293) acc1: 79.687500 (80.373476) acc5: 95.312500 (95.922256) time: 0.359087 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.837698 (0.809881) acc1: 79.687500 (80.480000) acc5: 95.312500 (96.096000) time: 0.354263 data: 0.000110 max mem: 18817 Test: Total time: 0:00:19 (0.395205 s / it) * Acc@1 80.944 Acc@5 95.776 loss 0.806 Max accuracy: 80.94% Epoch: [154/300] [ 0/1251] eta: 0:43:32 lr: 0.000967 loss: 2.925797 (2.925797) time: 2.088469 data: 1.186504 max mem: 18817 Epoch: [154/300] [ 50/1251] eta: 0:19:27 lr: 0.000967 loss: 2.919750 (2.984168) time: 0.973308 data: 0.000172 max mem: 18817 Epoch: [154/300] [ 100/1251] eta: 0:18:19 lr: 0.000966 loss: 3.393802 (3.073227) time: 0.915098 data: 0.000170 max mem: 18817 Epoch: [154/300] [ 150/1251] eta: 0:17:34 lr: 0.000966 loss: 3.273413 (3.112265) time: 0.921988 data: 0.000174 max mem: 18817 Epoch: [154/300] [ 200/1251] eta: 0:16:50 lr: 0.000966 loss: 3.192726 (3.115059) time: 0.981900 data: 0.000157 max mem: 18817 Epoch: [154/300] [ 250/1251] eta: 0:15:59 lr: 0.000965 loss: 3.258504 (3.113499) time: 0.994331 data: 0.000173 max mem: 18817 Epoch: [154/300] [ 300/1251] eta: 0:15:11 lr: 0.000965 loss: 3.077728 (3.103491) time: 0.968670 data: 0.000168 max mem: 18817 Epoch: [154/300] [ 350/1251] eta: 0:14:21 lr: 0.000964 loss: 3.125744 (3.102404) time: 0.928126 data: 0.000161 max mem: 18817 Epoch: [154/300] [ 400/1251] eta: 0:13:35 lr: 0.000964 loss: 3.242436 (3.113931) time: 0.943439 data: 0.000172 max mem: 18817 Epoch: [154/300] [ 450/1251] eta: 0:12:49 lr: 0.000964 loss: 3.390996 (3.118351) time: 1.004542 data: 0.000162 max mem: 18817 Epoch: [154/300] [ 500/1251] eta: 0:11:59 lr: 0.000963 loss: 3.142238 (3.111759) time: 0.964014 data: 0.000176 max mem: 18817 Epoch: [154/300] [ 550/1251] eta: 0:11:12 lr: 0.000963 loss: 3.349907 (3.114392) time: 0.959149 data: 0.000165 max mem: 18817 Epoch: [154/300] [ 600/1251] eta: 0:10:24 lr: 0.000962 loss: 3.211153 (3.115283) time: 0.950886 data: 0.000180 max mem: 18817 Epoch: [154/300] [ 650/1251] eta: 0:09:37 lr: 0.000962 loss: 3.103866 (3.122819) time: 0.940882 data: 0.000167 max mem: 18817 Epoch: [154/300] [ 700/1251] eta: 0:08:49 lr: 0.000961 loss: 3.447807 (3.125516) time: 0.975705 data: 0.000152 max mem: 18817 Epoch: [154/300] [ 750/1251] eta: 0:08:01 lr: 0.000961 loss: 2.890440 (3.128321) time: 1.015316 data: 0.000161 max mem: 18817 Epoch: [154/300] [ 800/1251] eta: 0:07:13 lr: 0.000961 loss: 3.448822 (3.125777) time: 0.989473 data: 0.000177 max mem: 18817 Epoch: [154/300] [ 850/1251] eta: 0:06:25 lr: 0.000960 loss: 3.186113 (3.125722) time: 0.925771 data: 0.000162 max mem: 18817 Epoch: [154/300] [ 900/1251] eta: 0:05:37 lr: 0.000960 loss: 3.030126 (3.119541) time: 0.953823 data: 0.000171 max mem: 18817 Epoch: [154/300] [ 950/1251] eta: 0:04:49 lr: 0.000959 loss: 3.446957 (3.119665) time: 1.004854 data: 0.000168 max mem: 18817 Epoch: [154/300] [1000/1251] eta: 0:04:01 lr: 0.000959 loss: 2.910707 (3.118900) time: 1.049058 data: 0.000162 max mem: 18817 Epoch: [154/300] [1050/1251] eta: 0:03:13 lr: 0.000959 loss: 3.303165 (3.122105) time: 0.983482 data: 0.000182 max mem: 18817 Epoch: [154/300] [1100/1251] eta: 0:02:25 lr: 0.000958 loss: 3.170617 (3.124441) time: 0.922238 data: 0.000164 max mem: 18817 Epoch: [154/300] [1150/1251] eta: 0:01:37 lr: 0.000958 loss: 3.354090 (3.126859) time: 0.930290 data: 0.000161 max mem: 18817 Epoch: [154/300] [1200/1251] eta: 0:00:49 lr: 0.000957 loss: 2.983113 (3.129193) time: 0.986533 data: 0.000162 max mem: 18817 Epoch: [154/300] [1250/1251] eta: 0:00:00 lr: 0.000957 loss: 3.353828 (3.131757) time: 1.018449 data: 0.000785 max mem: 18817 Epoch: [154/300] Total time: 0:20:04 (0.962774 s / it) Averaged stats: lr: 0.000957 loss: 3.353828 (3.124071) Test: [ 0/49] eta: 0:01:26 loss: 0.552796 (0.552796) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.765817 data: 1.178350 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.709334 (0.762566) acc1: 84.375000 (82.670455) acc5: 96.875000 (96.022727) time: 0.513818 data: 0.107278 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.837963 (0.812690) acc1: 81.250000 (81.696429) acc5: 95.312500 (95.758929) time: 0.376456 data: 0.000168 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.804076 (0.797881) acc1: 81.250000 (81.602823) acc5: 95.312500 (95.866935) time: 0.364491 data: 0.000156 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.797101 (0.817574) acc1: 79.687500 (81.402439) acc5: 95.312500 (95.731707) time: 0.368966 data: 0.000149 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.882056 (0.815506) acc1: 79.687500 (81.216000) acc5: 95.312500 (95.776000) time: 0.362907 data: 0.000120 max mem: 18817 Test: Total time: 0:00:19 (0.400408 s / it) * Acc@1 80.978 Acc@5 95.772 loss 0.822 Max accuracy: 80.98% Epoch: [155/300] [ 0/1251] eta: 0:42:37 lr: 0.000957 loss: 3.357645 (3.357645) time: 2.044285 data: 1.130988 max mem: 18817 Epoch: [155/300] [ 50/1251] eta: 0:19:37 lr: 0.000956 loss: 3.301807 (3.178737) time: 0.986642 data: 0.000198 max mem: 18817 Epoch: [155/300] [ 100/1251] eta: 0:18:30 lr: 0.000956 loss: 3.146219 (3.108153) time: 0.936331 data: 0.000175 max mem: 18817 Epoch: [155/300] [ 150/1251] eta: 0:17:44 lr: 0.000956 loss: 3.128855 (3.120892) time: 0.941090 data: 0.000177 max mem: 18817 Epoch: [155/300] [ 200/1251] eta: 0:16:56 lr: 0.000955 loss: 3.208046 (3.122527) time: 0.991177 data: 0.000169 max mem: 18817 Epoch: [155/300] [ 250/1251] eta: 0:16:08 lr: 0.000955 loss: 3.256269 (3.100825) time: 1.025506 data: 0.000159 max mem: 18817 Epoch: [155/300] [ 300/1251] eta: 0:15:19 lr: 0.000954 loss: 3.353765 (3.124070) time: 0.977635 data: 0.000167 max mem: 18817 Epoch: [155/300] [ 350/1251] eta: 0:14:28 lr: 0.000954 loss: 3.150907 (3.127090) time: 0.925207 data: 0.000170 max mem: 18817 Epoch: [155/300] [ 400/1251] eta: 0:13:41 lr: 0.000954 loss: 3.016063 (3.117926) time: 0.927702 data: 0.000172 max mem: 18817 Epoch: [155/300] [ 450/1251] eta: 0:12:53 lr: 0.000953 loss: 3.136859 (3.111806) time: 0.982283 data: 0.000180 max mem: 18817 Epoch: [155/300] [ 500/1251] eta: 0:12:04 lr: 0.000953 loss: 3.466221 (3.116514) time: 1.031744 data: 0.000169 max mem: 18817 Epoch: [155/300] [ 550/1251] eta: 0:11:15 lr: 0.000952 loss: 3.228240 (3.107417) time: 0.960043 data: 0.000172 max mem: 18817 Epoch: [155/300] [ 600/1251] eta: 0:10:26 lr: 0.000952 loss: 3.251214 (3.113027) time: 0.920129 data: 0.000169 max mem: 18817 Epoch: [155/300] [ 650/1251] eta: 0:09:38 lr: 0.000952 loss: 3.199324 (3.112816) time: 0.942239 data: 0.000158 max mem: 18817 Epoch: [155/300] [ 700/1251] eta: 0:08:51 lr: 0.000951 loss: 3.246614 (3.121710) time: 0.992816 data: 0.000168 max mem: 18817 Epoch: [155/300] [ 750/1251] eta: 0:08:03 lr: 0.000951 loss: 3.292844 (3.131608) time: 1.033338 data: 0.000172 max mem: 18817 Epoch: [155/300] [ 800/1251] eta: 0:07:14 lr: 0.000950 loss: 3.285764 (3.130950) time: 0.954767 data: 0.000164 max mem: 18817 Epoch: [155/300] [ 850/1251] eta: 0:06:25 lr: 0.000950 loss: 3.367971 (3.136090) time: 0.927134 data: 0.000168 max mem: 18817 Epoch: [155/300] [ 900/1251] eta: 0:05:37 lr: 0.000949 loss: 3.274179 (3.139970) time: 0.931338 data: 0.000177 max mem: 18817 Epoch: [155/300] [ 950/1251] eta: 0:04:49 lr: 0.000949 loss: 3.070995 (3.134282) time: 0.987117 data: 0.000159 max mem: 18817 Epoch: [155/300] [1000/1251] eta: 0:04:01 lr: 0.000949 loss: 3.341295 (3.138904) time: 1.015806 data: 0.000165 max mem: 18817 Epoch: [155/300] [1050/1251] eta: 0:03:13 lr: 0.000948 loss: 3.256688 (3.142962) time: 0.978485 data: 0.000185 max mem: 18817 Epoch: [155/300] [1100/1251] eta: 0:02:25 lr: 0.000948 loss: 2.998307 (3.139405) time: 0.925705 data: 0.000170 max mem: 18817 Epoch: [155/300] [1150/1251] eta: 0:01:37 lr: 0.000947 loss: 3.142565 (3.138256) time: 0.939575 data: 0.000179 max mem: 18817 Epoch: [155/300] [1200/1251] eta: 0:00:49 lr: 0.000947 loss: 3.346180 (3.138768) time: 1.005395 data: 0.000164 max mem: 18817 Epoch: [155/300] [1250/1251] eta: 0:00:00 lr: 0.000947 loss: 3.168421 (3.133185) time: 1.021096 data: 0.000806 max mem: 18817 Epoch: [155/300] Total time: 0:20:04 (0.963195 s / it) Averaged stats: lr: 0.000947 loss: 3.168421 (3.131374) Test: [ 0/49] eta: 0:01:23 loss: 0.640789 (0.640789) acc1: 79.687500 (79.687500) acc5: 98.437500 (98.437500) time: 1.705497 data: 1.177057 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.697099 (0.774786) acc1: 79.687500 (81.534091) acc5: 95.312500 (95.312500) time: 0.502451 data: 0.107163 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.817880 (0.792134) acc1: 79.687500 (80.877976) acc5: 95.312500 (95.610119) time: 0.376672 data: 0.000166 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.817880 (0.792374) acc1: 79.687500 (80.695565) acc5: 96.875000 (95.816532) time: 0.367267 data: 0.000153 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.813903 (0.804491) acc1: 79.687500 (80.487805) acc5: 95.312500 (95.693598) time: 0.368294 data: 0.000139 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.868501 (0.803701) acc1: 78.125000 (80.448000) acc5: 95.312500 (95.744000) time: 0.362958 data: 0.000113 max mem: 18817 Test: Total time: 0:00:19 (0.400340 s / it) * Acc@1 80.836 Acc@5 95.872 loss 0.806 Max accuracy: 80.98% Epoch: [156/300] [ 0/1251] eta: 1:52:02 lr: 0.000947 loss: 3.371571 (3.371571) time: 5.373366 data: 1.959567 max mem: 18510 Epoch: [156/300] [ 50/1251] eta: 0:20:45 lr: 0.000946 loss: 3.305335 (3.342968) time: 0.931198 data: 0.000160 max mem: 18814 Epoch: [156/300] [ 100/1251] eta: 0:19:15 lr: 0.000946 loss: 3.353502 (3.242011) time: 0.948150 data: 0.000166 max mem: 18814 Epoch: [156/300] [ 150/1251] eta: 0:18:08 lr: 0.000945 loss: 3.084138 (3.213005) time: 0.991545 data: 0.000152 max mem: 18814 Epoch: [156/300] [ 200/1251] eta: 0:17:09 lr: 0.000945 loss: 3.068011 (3.176480) time: 0.981910 data: 0.000161 max mem: 18814 Epoch: [156/300] [ 250/1251] eta: 0:16:14 lr: 0.000944 loss: 2.902584 (3.144363) time: 0.936417 data: 0.000152 max mem: 18814 Epoch: [156/300] [ 300/1251] eta: 0:15:26 lr: 0.000944 loss: 2.977988 (3.120858) time: 0.933907 data: 0.000152 max mem: 18814 Epoch: [156/300] [ 350/1251] eta: 0:14:38 lr: 0.000944 loss: 3.326516 (3.133487) time: 1.001185 data: 0.000149 max mem: 18814 Epoch: [156/300] [ 400/1251] eta: 0:13:48 lr: 0.000943 loss: 3.109276 (3.133324) time: 0.978905 data: 0.000158 max mem: 18814 Epoch: [156/300] [ 450/1251] eta: 0:12:58 lr: 0.000943 loss: 3.143888 (3.143741) time: 0.992552 data: 0.000161 max mem: 18814 Epoch: [156/300] [ 500/1251] eta: 0:12:08 lr: 0.000942 loss: 3.201432 (3.129256) time: 0.929554 data: 0.000155 max mem: 18814 Epoch: [156/300] [ 550/1251] eta: 0:11:19 lr: 0.000942 loss: 3.242352 (3.132855) time: 0.923539 data: 0.000155 max mem: 18814 Epoch: [156/300] [ 600/1251] eta: 0:10:30 lr: 0.000942 loss: 3.345581 (3.143959) time: 0.982212 data: 0.000158 max mem: 18814 Epoch: [156/300] [ 650/1251] eta: 0:09:42 lr: 0.000941 loss: 3.253122 (3.141463) time: 0.977979 data: 0.000139 max mem: 18814 Epoch: [156/300] [ 700/1251] eta: 0:08:53 lr: 0.000941 loss: 2.903759 (3.134415) time: 0.998056 data: 0.000148 max mem: 18814 Epoch: [156/300] [ 750/1251] eta: 0:08:04 lr: 0.000940 loss: 3.375378 (3.137552) time: 0.934761 data: 0.000158 max mem: 18814 Epoch: [156/300] [ 800/1251] eta: 0:07:16 lr: 0.000940 loss: 3.197771 (3.142314) time: 0.932884 data: 0.000156 max mem: 18814 Epoch: [156/300] [ 850/1251] eta: 0:06:28 lr: 0.000940 loss: 3.072269 (3.132740) time: 1.005284 data: 0.000169 max mem: 18814 Epoch: [156/300] [ 900/1251] eta: 0:05:39 lr: 0.000939 loss: 3.231726 (3.131105) time: 0.970805 data: 0.000158 max mem: 18814 Epoch: [156/300] [ 950/1251] eta: 0:04:51 lr: 0.000939 loss: 2.680843 (3.131092) time: 0.983300 data: 0.000152 max mem: 18814 Epoch: [156/300] [1000/1251] eta: 0:04:02 lr: 0.000938 loss: 3.224945 (3.132313) time: 0.924651 data: 0.000150 max mem: 18814 Epoch: [156/300] [1050/1251] eta: 0:03:14 lr: 0.000938 loss: 3.266685 (3.132240) time: 0.937201 data: 0.000154 max mem: 18814 Epoch: [156/300] [1100/1251] eta: 0:02:26 lr: 0.000937 loss: 3.171500 (3.132010) time: 0.974317 data: 0.000151 max mem: 18814 Epoch: [156/300] [1150/1251] eta: 0:01:37 lr: 0.000937 loss: 2.999721 (3.133167) time: 0.987744 data: 0.000153 max mem: 18814 Epoch: [156/300] [1200/1251] eta: 0:00:49 lr: 0.000937 loss: 3.359755 (3.138094) time: 0.977465 data: 0.000165 max mem: 18814 Epoch: [156/300] [1250/1251] eta: 0:00:00 lr: 0.000936 loss: 3.203650 (3.134061) time: 0.991102 data: 0.000755 max mem: 18814 Epoch: [156/300] Total time: 0:20:11 (0.968133 s / it) Averaged stats: lr: 0.000936 loss: 3.203650 (3.132024) Test: [ 0/49] eta: 0:01:29 loss: 0.667119 (0.667119) acc1: 79.687500 (79.687500) acc5: 96.875000 (96.875000) time: 1.822263 data: 1.401376 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.667119 (0.731158) acc1: 79.687500 (81.107955) acc5: 95.312500 (95.596591) time: 0.506358 data: 0.127535 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.798803 (0.772708) acc1: 79.687500 (80.877976) acc5: 95.312500 (95.758929) time: 0.369254 data: 0.000153 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.810455 (0.774549) acc1: 79.687500 (80.443548) acc5: 95.312500 (95.866935) time: 0.364331 data: 0.000143 max mem: 18814 Test: [40/49] eta: 0:00:04 loss: 0.795623 (0.791806) acc1: 81.250000 (80.716463) acc5: 96.875000 (95.769817) time: 0.462901 data: 0.000130 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.795623 (0.787836) acc1: 79.687500 (80.544000) acc5: 96.875000 (95.872000) time: 0.462583 data: 0.000113 max mem: 18814 Test: Total time: 0:00:21 (0.439410 s / it) * Acc@1 81.020 Acc@5 95.920 loss 0.794 Max accuracy: 81.02% Epoch: [157/300] [ 0/1251] eta: 0:41:39 lr: 0.000936 loss: 3.303178 (3.303178) time: 1.998006 data: 1.088759 max mem: 18814 Epoch: [157/300] [ 50/1251] eta: 0:19:41 lr: 0.000936 loss: 3.376705 (3.244893) time: 0.967487 data: 0.000166 max mem: 18814 Epoch: [157/300] [ 100/1251] eta: 0:18:35 lr: 0.000935 loss: 3.229698 (3.181253) time: 0.962317 data: 0.000157 max mem: 18814 Epoch: [157/300] [ 150/1251] eta: 0:17:39 lr: 0.000935 loss: 3.399442 (3.192516) time: 0.968817 data: 0.000147 max mem: 18814 Epoch: [157/300] [ 200/1251] eta: 0:16:47 lr: 0.000935 loss: 3.371287 (3.191388) time: 0.933079 data: 0.000165 max mem: 18814 Epoch: [157/300] [ 250/1251] eta: 0:16:04 lr: 0.000934 loss: 2.934135 (3.140339) time: 0.923418 data: 0.000146 max mem: 18814 Epoch: [157/300] [ 300/1251] eta: 0:15:17 lr: 0.000934 loss: 2.956274 (3.136979) time: 0.988014 data: 0.000158 max mem: 18814 Epoch: [157/300] [ 350/1251] eta: 0:14:31 lr: 0.000933 loss: 3.239595 (3.141770) time: 0.990925 data: 0.000161 max mem: 18814 Epoch: [157/300] [ 400/1251] eta: 0:13:40 lr: 0.000933 loss: 2.769861 (3.129988) time: 0.974886 data: 0.000149 max mem: 18814 Epoch: [157/300] [ 450/1251] eta: 0:12:51 lr: 0.000932 loss: 3.354762 (3.134384) time: 0.932526 data: 0.000149 max mem: 18814 Epoch: [157/300] [ 500/1251] eta: 0:12:03 lr: 0.000932 loss: 2.898792 (3.124670) time: 0.932646 data: 0.000158 max mem: 18814 Epoch: [157/300] [ 550/1251] eta: 0:11:17 lr: 0.000932 loss: 3.199959 (3.129644) time: 0.982904 data: 0.000163 max mem: 18814 Epoch: [157/300] [ 600/1251] eta: 0:10:27 lr: 0.000931 loss: 2.854208 (3.124586) time: 0.964248 data: 0.000157 max mem: 18814 Epoch: [157/300] [ 650/1251] eta: 0:09:38 lr: 0.000931 loss: 2.921279 (3.122102) time: 0.928671 data: 0.000155 max mem: 18814 Epoch: [157/300] [ 700/1251] eta: 0:08:50 lr: 0.000930 loss: 3.161786 (3.111379) time: 0.949589 data: 0.000155 max mem: 18814 Epoch: [157/300] [ 750/1251] eta: 0:08:03 lr: 0.000930 loss: 3.067981 (3.102395) time: 0.995622 data: 0.000156 max mem: 18814 Epoch: [157/300] [ 800/1251] eta: 0:07:15 lr: 0.000930 loss: 3.100339 (3.109874) time: 1.036897 data: 0.000153 max mem: 18814 Epoch: [157/300] [ 850/1251] eta: 0:06:26 lr: 0.000929 loss: 3.432105 (3.110057) time: 0.974680 data: 0.000158 max mem: 18814 Epoch: [157/300] [ 900/1251] eta: 0:05:38 lr: 0.000929 loss: 3.428272 (3.113099) time: 0.925277 data: 0.000154 max mem: 18814 Epoch: [157/300] [ 950/1251] eta: 0:04:50 lr: 0.000928 loss: 3.272137 (3.116904) time: 0.949120 data: 0.000159 max mem: 18814 Epoch: [157/300] [1000/1251] eta: 0:04:02 lr: 0.000928 loss: 2.994166 (3.111353) time: 0.926298 data: 0.000149 max mem: 18814 Epoch: [157/300] [1050/1251] eta: 0:03:13 lr: 0.000928 loss: 2.877853 (3.112152) time: 1.027183 data: 0.000162 max mem: 18814 Epoch: [157/300] [1100/1251] eta: 0:02:25 lr: 0.000927 loss: 3.149564 (3.115027) time: 0.997305 data: 0.000163 max mem: 18814 Epoch: [157/300] [1150/1251] eta: 0:01:37 lr: 0.000927 loss: 2.973298 (3.110332) time: 0.923687 data: 0.000170 max mem: 18814 Epoch: [157/300] [1200/1251] eta: 0:00:49 lr: 0.000926 loss: 3.390528 (3.113570) time: 0.946262 data: 0.000160 max mem: 18814 Epoch: [157/300] [1250/1251] eta: 0:00:00 lr: 0.000926 loss: 3.178593 (3.112064) time: 0.920443 data: 0.000736 max mem: 18814 Epoch: [157/300] Total time: 0:20:06 (0.964762 s / it) Averaged stats: lr: 0.000926 loss: 3.178593 (3.118795) Test: [ 0/49] eta: 0:01:25 loss: 0.687585 (0.687585) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.746142 data: 1.352893 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.699993 (0.745808) acc1: 82.812500 (83.096591) acc5: 96.875000 (95.596591) time: 0.494724 data: 0.123112 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.758265 (0.796576) acc1: 81.250000 (82.068452) acc5: 95.312500 (95.610119) time: 0.368043 data: 0.000124 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.864678 (0.793895) acc1: 81.250000 (81.401210) acc5: 96.875000 (95.917339) time: 0.368326 data: 0.000129 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.822950 (0.806665) acc1: 79.687500 (81.097561) acc5: 96.875000 (95.960366) time: 0.363051 data: 0.000133 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.853415 (0.800977) acc1: 79.687500 (81.024000) acc5: 96.875000 (96.064000) time: 0.353211 data: 0.000105 max mem: 18814 Test: Total time: 0:00:19 (0.393297 s / it) * Acc@1 81.088 Acc@5 95.886 loss 0.812 Max accuracy: 81.09% Epoch: [158/300] [ 0/1251] eta: 0:48:13 lr: 0.000926 loss: 2.794977 (2.794977) time: 2.312863 data: 1.416540 max mem: 18814 Epoch: [158/300] [ 50/1251] eta: 0:19:50 lr: 0.000925 loss: 3.203193 (3.128793) time: 0.983937 data: 0.000155 max mem: 18814 Epoch: [158/300] [ 100/1251] eta: 0:18:36 lr: 0.000925 loss: 3.033377 (3.159328) time: 0.986805 data: 0.000150 max mem: 18814 Epoch: [158/300] [ 150/1251] eta: 0:17:39 lr: 0.000925 loss: 3.211123 (3.136051) time: 0.923693 data: 0.000159 max mem: 18814 Epoch: [158/300] [ 200/1251] eta: 0:16:54 lr: 0.000924 loss: 3.290374 (3.169506) time: 0.948196 data: 0.000167 max mem: 18814 Epoch: [158/300] [ 250/1251] eta: 0:16:07 lr: 0.000924 loss: 2.879484 (3.168320) time: 0.945724 data: 0.000156 max mem: 18814 Epoch: [158/300] [ 300/1251] eta: 0:15:18 lr: 0.000923 loss: 2.924447 (3.158953) time: 0.949720 data: 0.000164 max mem: 18814 Epoch: [158/300] [ 350/1251] eta: 0:14:28 lr: 0.000923 loss: 3.019684 (3.151807) time: 0.986273 data: 0.000155 max mem: 18814 Epoch: [158/300] [ 400/1251] eta: 0:13:39 lr: 0.000923 loss: 3.234404 (3.127205) time: 0.929073 data: 0.000162 max mem: 18814 Epoch: [158/300] [ 450/1251] eta: 0:12:51 lr: 0.000922 loss: 3.253160 (3.128796) time: 0.940180 data: 0.000168 max mem: 18814 Epoch: [158/300] [ 500/1251] eta: 0:12:04 lr: 0.000922 loss: 3.314122 (3.123700) time: 0.925875 data: 0.000156 max mem: 18814 Epoch: [158/300] [ 550/1251] eta: 0:11:16 lr: 0.000921 loss: 3.281056 (3.123779) time: 0.965235 data: 0.000183 max mem: 18814 Epoch: [158/300] [ 600/1251] eta: 0:10:27 lr: 0.000921 loss: 3.219665 (3.114716) time: 0.971606 data: 0.000167 max mem: 18814 Epoch: [158/300] [ 650/1251] eta: 0:09:38 lr: 0.000920 loss: 3.060941 (3.115037) time: 0.922317 data: 0.000160 max mem: 18814 Epoch: [158/300] [ 700/1251] eta: 0:08:50 lr: 0.000920 loss: 3.073280 (3.116103) time: 0.932148 data: 0.000161 max mem: 18814 Epoch: [158/300] [ 750/1251] eta: 0:08:02 lr: 0.000920 loss: 3.211696 (3.123665) time: 0.943678 data: 0.000167 max mem: 18814 Epoch: [158/300] [ 800/1251] eta: 0:07:14 lr: 0.000919 loss: 3.094821 (3.126389) time: 0.982143 data: 0.000161 max mem: 18814 Epoch: [158/300] [ 850/1251] eta: 0:06:25 lr: 0.000919 loss: 3.192276 (3.126040) time: 0.980439 data: 0.000158 max mem: 18814 Epoch: [158/300] [ 900/1251] eta: 0:05:37 lr: 0.000918 loss: 3.183160 (3.125651) time: 0.927417 data: 0.000166 max mem: 18814 Epoch: [158/300] [ 950/1251] eta: 0:04:49 lr: 0.000918 loss: 2.938338 (3.122119) time: 0.930619 data: 0.000166 max mem: 18814 Epoch: [158/300] [1000/1251] eta: 0:04:01 lr: 0.000918 loss: 3.222574 (3.119757) time: 0.915819 data: 0.000145 max mem: 18814 Epoch: [158/300] [1050/1251] eta: 0:03:13 lr: 0.000917 loss: 2.985647 (3.115290) time: 0.967163 data: 0.000156 max mem: 18814 Epoch: [158/300] [1100/1251] eta: 0:02:25 lr: 0.000917 loss: 3.245001 (3.119905) time: 0.995936 data: 0.000154 max mem: 18814 Epoch: [158/300] [1150/1251] eta: 0:01:37 lr: 0.000916 loss: 3.231512 (3.126483) time: 0.932604 data: 0.000163 max mem: 18814 Epoch: [158/300] [1200/1251] eta: 0:00:49 lr: 0.000916 loss: 3.185542 (3.127190) time: 0.944042 data: 0.000171 max mem: 18814 Epoch: [158/300] [1250/1251] eta: 0:00:00 lr: 0.000916 loss: 3.211888 (3.128030) time: 0.925107 data: 0.000752 max mem: 18814 Epoch: [158/300] Total time: 0:20:05 (0.963983 s / it) Averaged stats: lr: 0.000916 loss: 3.211888 (3.132034) Test: [ 0/49] eta: 0:01:26 loss: 0.644413 (0.644413) acc1: 87.500000 (87.500000) acc5: 96.875000 (96.875000) time: 1.759499 data: 1.350888 max mem: 18814 Test: [10/49] eta: 0:00:20 loss: 0.756667 (0.767438) acc1: 82.812500 (81.960227) acc5: 96.875000 (95.880682) time: 0.526033 data: 0.122937 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.788999 (0.802207) acc1: 79.687500 (80.877976) acc5: 96.875000 (95.758929) time: 0.381208 data: 0.000129 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.801371 (0.805708) acc1: 79.687500 (80.393145) acc5: 96.875000 (96.018145) time: 0.362041 data: 0.000132 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.803338 (0.819925) acc1: 79.687500 (80.259146) acc5: 95.312500 (95.731707) time: 0.359687 data: 0.000133 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.833155 (0.818734) acc1: 79.687500 (80.096000) acc5: 95.312500 (95.904000) time: 0.352958 data: 0.000109 max mem: 18814 Test: Total time: 0:00:19 (0.398282 s / it) * Acc@1 80.998 Acc@5 95.802 loss 0.816 Max accuracy: 81.09% Epoch: [159/300] [ 0/1251] eta: 0:40:03 lr: 0.000916 loss: 3.155198 (3.155198) time: 1.921294 data: 1.009667 max mem: 18814 Epoch: [159/300] [ 50/1251] eta: 0:19:31 lr: 0.000915 loss: 3.283644 (3.209227) time: 0.985990 data: 0.000169 max mem: 18814 Epoch: [159/300] [ 100/1251] eta: 0:18:29 lr: 0.000915 loss: 3.325262 (3.154652) time: 0.949808 data: 0.000161 max mem: 18814 Epoch: [159/300] [ 150/1251] eta: 0:17:41 lr: 0.000914 loss: 3.069378 (3.088575) time: 0.929196 data: 0.000154 max mem: 18814 Epoch: [159/300] [ 200/1251] eta: 0:16:54 lr: 0.000914 loss: 2.892930 (3.082089) time: 0.982011 data: 0.000151 max mem: 18814 Epoch: [159/300] [ 250/1251] eta: 0:16:07 lr: 0.000913 loss: 3.137593 (3.091648) time: 1.036673 data: 0.000251 max mem: 18814 Epoch: [159/300] [ 300/1251] eta: 0:15:17 lr: 0.000913 loss: 3.362850 (3.110327) time: 0.987864 data: 0.000152 max mem: 18814 Epoch: [159/300] [ 350/1251] eta: 0:14:27 lr: 0.000913 loss: 2.949200 (3.089889) time: 0.927685 data: 0.000156 max mem: 18814 Epoch: [159/300] [ 400/1251] eta: 0:13:39 lr: 0.000912 loss: 3.043736 (3.086500) time: 0.928034 data: 0.000162 max mem: 18814 Epoch: [159/300] [ 450/1251] eta: 0:12:52 lr: 0.000912 loss: 3.007712 (3.089492) time: 0.984597 data: 0.000174 max mem: 18814 Epoch: [159/300] [ 500/1251] eta: 0:12:04 lr: 0.000911 loss: 3.223897 (3.094660) time: 1.020909 data: 0.000170 max mem: 18814 Epoch: [159/300] [ 550/1251] eta: 0:11:14 lr: 0.000911 loss: 3.089400 (3.092433) time: 0.971667 data: 0.000171 max mem: 18814 Epoch: [159/300] [ 600/1251] eta: 0:10:25 lr: 0.000911 loss: 3.057737 (3.092836) time: 0.933578 data: 0.000144 max mem: 18814 Epoch: [159/300] [ 650/1251] eta: 0:09:38 lr: 0.000910 loss: 3.214466 (3.098465) time: 0.933646 data: 0.000149 max mem: 18814 Epoch: [159/300] [ 700/1251] eta: 0:08:50 lr: 0.000910 loss: 3.217434 (3.095843) time: 0.964066 data: 0.000144 max mem: 18814 Epoch: [159/300] [ 750/1251] eta: 0:08:02 lr: 0.000909 loss: 3.001004 (3.096844) time: 1.016298 data: 0.000175 max mem: 18814 Epoch: [159/300] [ 800/1251] eta: 0:07:13 lr: 0.000909 loss: 3.143217 (3.098391) time: 0.973162 data: 0.000152 max mem: 18814 Epoch: [159/300] [ 850/1251] eta: 0:06:25 lr: 0.000909 loss: 3.256866 (3.104141) time: 0.934234 data: 0.000159 max mem: 18814 Epoch: [159/300] [ 900/1251] eta: 0:05:37 lr: 0.000908 loss: 3.270694 (3.109628) time: 0.937024 data: 0.000171 max mem: 18814 Epoch: [159/300] [ 950/1251] eta: 0:04:49 lr: 0.000908 loss: 3.379591 (3.119917) time: 0.983333 data: 0.000184 max mem: 18814 Epoch: [159/300] [1000/1251] eta: 0:04:01 lr: 0.000907 loss: 3.340434 (3.118374) time: 1.037348 data: 0.000159 max mem: 18814 Epoch: [159/300] [1050/1251] eta: 0:03:13 lr: 0.000907 loss: 3.257275 (3.116618) time: 0.969083 data: 0.000164 max mem: 18814 Epoch: [159/300] [1100/1251] eta: 0:02:25 lr: 0.000906 loss: 3.219875 (3.117134) time: 0.949792 data: 0.000168 max mem: 18814 Epoch: [159/300] [1150/1251] eta: 0:01:37 lr: 0.000906 loss: 2.999625 (3.112828) time: 0.924602 data: 0.000163 max mem: 18814 Epoch: [159/300] [1200/1251] eta: 0:00:49 lr: 0.000906 loss: 3.323242 (3.116142) time: 0.987666 data: 0.000155 max mem: 18814 Epoch: [159/300] [1250/1251] eta: 0:00:00 lr: 0.000905 loss: 3.434093 (3.119463) time: 1.018240 data: 0.000770 max mem: 18814 Epoch: [159/300] Total time: 0:20:03 (0.961998 s / it) Averaged stats: lr: 0.000905 loss: 3.434093 (3.116843) Test: [ 0/49] eta: 0:01:25 loss: 0.597689 (0.597689) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.751388 data: 1.354396 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.713844 (0.738265) acc1: 84.375000 (83.380682) acc5: 96.875000 (95.880682) time: 0.502649 data: 0.123276 max mem: 18814 Test: [20/49] eta: 0:00:13 loss: 0.782057 (0.782605) acc1: 79.687500 (81.994048) acc5: 95.312500 (95.684524) time: 0.390487 data: 0.000149 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.782057 (0.783439) acc1: 79.687500 (81.754032) acc5: 95.312500 (95.766129) time: 0.390452 data: 0.000133 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.830343 (0.803132) acc1: 81.250000 (81.516768) acc5: 95.312500 (95.579268) time: 0.366526 data: 0.000124 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.865231 (0.799930) acc1: 81.250000 (81.600000) acc5: 95.312500 (95.776000) time: 0.355496 data: 0.000103 max mem: 18814 Test: Total time: 0:00:19 (0.404734 s / it) * Acc@1 81.094 Acc@5 95.836 loss 0.804 Max accuracy: 81.09% Epoch: [160/300] [ 0/1251] eta: 0:48:54 lr: 0.000905 loss: 3.500518 (3.500518) time: 2.345970 data: 1.047578 max mem: 18814 Epoch: [160/300] [ 50/1251] eta: 0:19:28 lr: 0.000905 loss: 3.158409 (3.067736) time: 0.933666 data: 0.000158 max mem: 18814 Epoch: [160/300] [ 100/1251] eta: 0:18:44 lr: 0.000904 loss: 3.244302 (3.067910) time: 0.931583 data: 0.000155 max mem: 18814 Epoch: [160/300] [ 150/1251] eta: 0:17:51 lr: 0.000904 loss: 3.085998 (3.084764) time: 0.961780 data: 0.000151 max mem: 18814 Epoch: [160/300] [ 200/1251] eta: 0:17:03 lr: 0.000904 loss: 3.081838 (3.081125) time: 1.034055 data: 0.000158 max mem: 18814 Epoch: [160/300] [ 250/1251] eta: 0:16:09 lr: 0.000903 loss: 3.299074 (3.111495) time: 0.982894 data: 0.000159 max mem: 18814 Epoch: [160/300] [ 300/1251] eta: 0:15:17 lr: 0.000903 loss: 3.329543 (3.124310) time: 0.925197 data: 0.000154 max mem: 18814 Epoch: [160/300] [ 350/1251] eta: 0:14:29 lr: 0.000902 loss: 3.305804 (3.118757) time: 0.928478 data: 0.000154 max mem: 18814 Epoch: [160/300] [ 400/1251] eta: 0:13:39 lr: 0.000902 loss: 3.150466 (3.108789) time: 0.912536 data: 0.000163 max mem: 18814 Epoch: [160/300] [ 450/1251] eta: 0:12:52 lr: 0.000902 loss: 3.082712 (3.104532) time: 0.999502 data: 0.000155 max mem: 18814 Epoch: [160/300] [ 500/1251] eta: 0:12:04 lr: 0.000901 loss: 3.243866 (3.103702) time: 0.989680 data: 0.000174 max mem: 18814 Epoch: [160/300] [ 550/1251] eta: 0:11:15 lr: 0.000901 loss: 3.172602 (3.094650) time: 0.983062 data: 0.000170 max mem: 18814 Epoch: [160/300] [ 600/1251] eta: 0:10:26 lr: 0.000900 loss: 3.094657 (3.093752) time: 0.921785 data: 0.000161 max mem: 18814 Epoch: [160/300] [ 650/1251] eta: 0:09:38 lr: 0.000900 loss: 3.254964 (3.100716) time: 0.930753 data: 0.000155 max mem: 18814 Epoch: [160/300] [ 700/1251] eta: 0:08:50 lr: 0.000899 loss: 3.227125 (3.103773) time: 1.002293 data: 0.000162 max mem: 18814 Epoch: [160/300] [ 750/1251] eta: 0:08:03 lr: 0.000899 loss: 3.158918 (3.099812) time: 1.011993 data: 0.000148 max mem: 18814 Epoch: [160/300] [ 800/1251] eta: 0:07:14 lr: 0.000899 loss: 3.121545 (3.099928) time: 0.968975 data: 0.000160 max mem: 18814 Epoch: [160/300] [ 850/1251] eta: 0:06:25 lr: 0.000898 loss: 2.956840 (3.102086) time: 0.923510 data: 0.000159 max mem: 18814 Epoch: [160/300] [ 900/1251] eta: 0:05:37 lr: 0.000898 loss: 2.768224 (3.106050) time: 0.928732 data: 0.000162 max mem: 18814 Epoch: [160/300] [ 950/1251] eta: 0:04:49 lr: 0.000897 loss: 3.166471 (3.109678) time: 0.985333 data: 0.000155 max mem: 18814 Epoch: [160/300] [1000/1251] eta: 0:04:01 lr: 0.000897 loss: 3.073039 (3.111922) time: 0.973834 data: 0.000157 max mem: 18814 Epoch: [160/300] [1050/1251] eta: 0:03:13 lr: 0.000897 loss: 3.336200 (3.112482) time: 0.981528 data: 0.000159 max mem: 18814 Epoch: [160/300] [1100/1251] eta: 0:02:25 lr: 0.000896 loss: 3.217724 (3.116104) time: 0.932531 data: 0.000162 max mem: 18814 Epoch: [160/300] [1150/1251] eta: 0:01:37 lr: 0.000896 loss: 3.014538 (3.113837) time: 0.929863 data: 0.000151 max mem: 18814 Epoch: [160/300] [1200/1251] eta: 0:00:49 lr: 0.000895 loss: 3.241807 (3.116907) time: 0.981394 data: 0.000168 max mem: 18814 Epoch: [160/300] [1250/1251] eta: 0:00:00 lr: 0.000895 loss: 3.252177 (3.116695) time: 0.973977 data: 0.000754 max mem: 18814 Epoch: [160/300] Total time: 0:20:05 (0.963234 s / it) Averaged stats: lr: 0.000895 loss: 3.252177 (3.126299) Test: [ 0/49] eta: 0:01:30 loss: 0.638722 (0.638722) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.842895 data: 1.430415 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.722130 (0.743129) acc1: 79.687500 (81.960227) acc5: 95.312500 (96.164773) time: 0.498800 data: 0.130168 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.817748 (0.773753) acc1: 79.687500 (81.175595) acc5: 95.312500 (96.354167) time: 0.361718 data: 0.000130 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.754013 (0.774135) acc1: 79.687500 (81.451613) acc5: 96.875000 (96.421371) time: 0.359471 data: 0.000122 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.832343 (0.797249) acc1: 81.250000 (81.364329) acc5: 96.875000 (96.150915) time: 0.357080 data: 0.000120 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.863559 (0.800946) acc1: 81.250000 (81.088000) acc5: 96.875000 (96.288000) time: 0.351806 data: 0.000098 max mem: 18814 Test: Total time: 0:00:19 (0.390582 s / it) * Acc@1 81.246 Acc@5 95.940 loss 0.806 Max accuracy: 81.25% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0160.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0160.pth Epoch: [161/300] [ 0/1251] eta: 0:40:58 lr: 0.000895 loss: 3.224262 (3.224262) time: 1.965338 data: 1.069983 max mem: 18814 Epoch: [161/300] [ 50/1251] eta: 0:19:47 lr: 0.000894 loss: 3.309710 (3.279427) time: 0.981303 data: 0.000158 max mem: 18814 Epoch: [161/300] [ 100/1251] eta: 0:18:46 lr: 0.000894 loss: 3.177077 (3.187387) time: 1.032182 data: 0.000181 max mem: 18814 Epoch: [161/300] [ 150/1251] eta: 0:17:46 lr: 0.000894 loss: 3.219428 (3.200802) time: 0.978185 data: 0.000171 max mem: 18814 Epoch: [161/300] [ 200/1251] eta: 0:16:52 lr: 0.000893 loss: 3.111217 (3.163137) time: 0.936910 data: 0.000170 max mem: 18814 Epoch: [161/300] [ 250/1251] eta: 0:16:07 lr: 0.000893 loss: 3.256044 (3.135072) time: 0.935536 data: 0.000159 max mem: 18814 Epoch: [161/300] [ 300/1251] eta: 0:15:17 lr: 0.000892 loss: 3.100602 (3.137862) time: 0.974567 data: 0.000161 max mem: 18814 Epoch: [161/300] [ 350/1251] eta: 0:14:28 lr: 0.000892 loss: 3.090693 (3.131802) time: 1.014255 data: 0.000166 max mem: 18814 Epoch: [161/300] [ 400/1251] eta: 0:13:40 lr: 0.000892 loss: 3.199965 (3.128658) time: 1.008752 data: 0.000158 max mem: 18814 Epoch: [161/300] [ 450/1251] eta: 0:12:50 lr: 0.000891 loss: 3.188217 (3.122056) time: 0.929847 data: 0.000167 max mem: 18814 Epoch: [161/300] [ 500/1251] eta: 0:12:03 lr: 0.000891 loss: 2.746076 (3.118509) time: 0.929306 data: 0.000151 max mem: 18814 Epoch: [161/300] [ 550/1251] eta: 0:11:16 lr: 0.000890 loss: 3.165864 (3.115913) time: 0.985193 data: 0.000167 max mem: 18814 Epoch: [161/300] [ 600/1251] eta: 0:10:28 lr: 0.000890 loss: 3.077289 (3.110956) time: 1.002626 data: 0.000170 max mem: 18814 Epoch: [161/300] [ 650/1251] eta: 0:09:39 lr: 0.000890 loss: 3.050197 (3.105447) time: 0.998074 data: 0.000149 max mem: 18814 Epoch: [161/300] [ 700/1251] eta: 0:08:50 lr: 0.000889 loss: 2.870717 (3.100929) time: 0.928877 data: 0.000154 max mem: 18814 Epoch: [161/300] [ 750/1251] eta: 0:08:02 lr: 0.000889 loss: 2.693154 (3.094199) time: 0.928428 data: 0.000156 max mem: 18814 Epoch: [161/300] [ 800/1251] eta: 0:07:14 lr: 0.000888 loss: 3.141127 (3.093176) time: 0.951235 data: 0.000150 max mem: 18814 Epoch: [161/300] [ 850/1251] eta: 0:06:26 lr: 0.000888 loss: 3.117831 (3.093259) time: 0.978881 data: 0.000157 max mem: 18814 Epoch: [161/300] [ 900/1251] eta: 0:05:38 lr: 0.000887 loss: 3.459455 (3.096995) time: 0.995159 data: 0.000163 max mem: 18814 Epoch: [161/300] [ 950/1251] eta: 0:04:49 lr: 0.000887 loss: 3.160492 (3.097681) time: 0.931059 data: 0.000164 max mem: 18814 Epoch: [161/300] [1000/1251] eta: 0:04:01 lr: 0.000887 loss: 3.222097 (3.099930) time: 0.928716 data: 0.000157 max mem: 18814 Epoch: [161/300] [1050/1251] eta: 0:03:13 lr: 0.000886 loss: 3.319806 (3.103866) time: 0.951117 data: 0.000159 max mem: 18814 Epoch: [161/300] [1100/1251] eta: 0:02:25 lr: 0.000886 loss: 3.245976 (3.107418) time: 1.018573 data: 0.000184 max mem: 18814 Epoch: [161/300] [1150/1251] eta: 0:01:37 lr: 0.000885 loss: 3.053533 (3.105216) time: 0.981937 data: 0.000164 max mem: 18814 Epoch: [161/300] [1200/1251] eta: 0:00:49 lr: 0.000885 loss: 3.126505 (3.104415) time: 0.927402 data: 0.000156 max mem: 18814 Epoch: [161/300] [1250/1251] eta: 0:00:00 lr: 0.000885 loss: 3.215216 (3.106804) time: 0.928435 data: 0.000760 max mem: 18814 Epoch: [161/300] Total time: 0:20:05 (0.963894 s / it) Averaged stats: lr: 0.000885 loss: 3.215216 (3.104081) Test: [ 0/49] eta: 0:01:14 loss: 0.685058 (0.685058) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.530538 data: 1.121379 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.685058 (0.752676) acc1: 82.812500 (83.380682) acc5: 95.312500 (95.596591) time: 0.477287 data: 0.102081 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.767919 (0.774338) acc1: 81.250000 (81.696429) acc5: 95.312500 (95.535714) time: 0.366495 data: 0.000140 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.773898 (0.776197) acc1: 79.687500 (81.451613) acc5: 96.875000 (95.665323) time: 0.363872 data: 0.000137 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.820056 (0.789962) acc1: 79.687500 (81.402439) acc5: 96.875000 (95.617378) time: 0.452156 data: 0.000141 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.821979 (0.789243) acc1: 81.250000 (81.472000) acc5: 96.875000 (95.648000) time: 0.447341 data: 0.000115 max mem: 18814 Test: Total time: 0:00:20 (0.425601 s / it) * Acc@1 81.224 Acc@5 95.850 loss 0.790 Max accuracy: 81.25% Epoch: [162/300] [ 0/1251] eta: 0:41:12 lr: 0.000885 loss: 2.491780 (2.491780) time: 1.976762 data: 1.071892 max mem: 18814 Epoch: [162/300] [ 50/1251] eta: 0:20:08 lr: 0.000884 loss: 3.080765 (3.015007) time: 1.002929 data: 0.000164 max mem: 18814 Epoch: [162/300] [ 100/1251] eta: 0:18:46 lr: 0.000884 loss: 2.981536 (3.052358) time: 0.966547 data: 0.000182 max mem: 18814 Epoch: [162/300] [ 150/1251] eta: 0:17:47 lr: 0.000883 loss: 3.185516 (3.067448) time: 0.921911 data: 0.000167 max mem: 18814 Epoch: [162/300] [ 200/1251] eta: 0:17:02 lr: 0.000883 loss: 3.290624 (3.088745) time: 0.938431 data: 0.000169 max mem: 18814 Epoch: [162/300] [ 250/1251] eta: 0:16:12 lr: 0.000883 loss: 3.016617 (3.067030) time: 0.926487 data: 0.000182 max mem: 18814 Epoch: [162/300] [ 300/1251] eta: 0:15:21 lr: 0.000882 loss: 3.028548 (3.071324) time: 0.963585 data: 0.000197 max mem: 18814 Epoch: [162/300] [ 350/1251] eta: 0:14:30 lr: 0.000882 loss: 3.176709 (3.075840) time: 0.968805 data: 0.000151 max mem: 18814 Epoch: [162/300] [ 400/1251] eta: 0:13:41 lr: 0.000881 loss: 3.386961 (3.095803) time: 0.937355 data: 0.000182 max mem: 18814 Epoch: [162/300] [ 450/1251] eta: 0:12:52 lr: 0.000881 loss: 3.115166 (3.093418) time: 0.922960 data: 0.000163 max mem: 18814 Epoch: [162/300] [ 500/1251] eta: 0:12:03 lr: 0.000880 loss: 3.063192 (3.085598) time: 0.926471 data: 0.000170 max mem: 18814 Epoch: [162/300] [ 550/1251] eta: 0:11:15 lr: 0.000880 loss: 2.751517 (3.074535) time: 0.979500 data: 0.000182 max mem: 18814 Epoch: [162/300] [ 600/1251] eta: 0:10:27 lr: 0.000880 loss: 3.202499 (3.084926) time: 0.982828 data: 0.000191 max mem: 18814 Epoch: [162/300] [ 650/1251] eta: 0:09:39 lr: 0.000879 loss: 3.185588 (3.083979) time: 0.978195 data: 0.000164 max mem: 18814 Epoch: [162/300] [ 700/1251] eta: 0:08:50 lr: 0.000879 loss: 3.113010 (3.083948) time: 0.931278 data: 0.000162 max mem: 18814 Epoch: [162/300] [ 750/1251] eta: 0:08:03 lr: 0.000878 loss: 3.252893 (3.081119) time: 0.930586 data: 0.000178 max mem: 18814 Epoch: [162/300] [ 800/1251] eta: 0:07:15 lr: 0.000878 loss: 3.046073 (3.086784) time: 0.983650 data: 0.000156 max mem: 18814 Epoch: [162/300] [ 850/1251] eta: 0:06:26 lr: 0.000878 loss: 2.964582 (3.090596) time: 1.013876 data: 0.000162 max mem: 18814 Epoch: [162/300] [ 900/1251] eta: 0:05:38 lr: 0.000877 loss: 3.333048 (3.097812) time: 0.992312 data: 0.000163 max mem: 18814 Epoch: [162/300] [ 950/1251] eta: 0:04:50 lr: 0.000877 loss: 3.325535 (3.099454) time: 0.939223 data: 0.000163 max mem: 18814 Epoch: [162/300] [1000/1251] eta: 0:04:02 lr: 0.000876 loss: 3.423984 (3.102683) time: 0.928806 data: 0.000152 max mem: 18814 Epoch: [162/300] [1050/1251] eta: 0:03:14 lr: 0.000876 loss: 3.472129 (3.108404) time: 0.994246 data: 0.000166 max mem: 18814 Epoch: [162/300] [1100/1251] eta: 0:02:25 lr: 0.000876 loss: 3.330184 (3.108199) time: 1.012982 data: 0.000159 max mem: 18814 Epoch: [162/300] [1150/1251] eta: 0:01:37 lr: 0.000875 loss: 2.995223 (3.107567) time: 0.982842 data: 0.000153 max mem: 18814 Epoch: [162/300] [1200/1251] eta: 0:00:49 lr: 0.000875 loss: 3.042053 (3.106212) time: 0.925993 data: 0.000168 max mem: 18814 Epoch: [162/300] [1250/1251] eta: 0:00:00 lr: 0.000874 loss: 3.213856 (3.112598) time: 0.929425 data: 0.000749 max mem: 18814 Epoch: [162/300] Total time: 0:20:07 (0.964904 s / it) Averaged stats: lr: 0.000874 loss: 3.213856 (3.106836) Test: [ 0/49] eta: 0:01:32 loss: 0.566562 (0.566562) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.890168 data: 1.445880 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.695506 (0.735647) acc1: 84.375000 (83.096591) acc5: 96.875000 (96.164773) time: 0.502259 data: 0.131585 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.836826 (0.782091) acc1: 79.687500 (81.547619) acc5: 95.312500 (96.130952) time: 0.369641 data: 0.000140 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.817138 (0.781103) acc1: 79.687500 (81.804435) acc5: 95.312500 (96.118952) time: 0.367899 data: 0.000140 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.792312 (0.791081) acc1: 81.250000 (81.707317) acc5: 95.312500 (95.960366) time: 0.358064 data: 0.000152 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.792312 (0.789653) acc1: 81.250000 (81.472000) acc5: 96.875000 (96.096000) time: 0.459940 data: 0.000119 max mem: 18814 Test: Total time: 0:00:21 (0.438389 s / it) * Acc@1 81.320 Acc@5 95.882 loss 0.786 Max accuracy: 81.32% Epoch: [163/300] [ 0/1251] eta: 0:43:45 lr: 0.000874 loss: 3.096908 (3.096908) time: 2.098453 data: 1.125691 max mem: 18814 Epoch: [163/300] [ 50/1251] eta: 0:19:43 lr: 0.000874 loss: 3.041929 (3.104452) time: 1.010572 data: 0.000158 max mem: 18814 Epoch: [163/300] [ 100/1251] eta: 0:18:45 lr: 0.000873 loss: 3.085255 (3.122708) time: 0.993189 data: 0.000174 max mem: 18814 Epoch: [163/300] [ 150/1251] eta: 0:17:46 lr: 0.000873 loss: 2.967579 (3.101048) time: 0.927809 data: 0.000157 max mem: 18814 Epoch: [163/300] [ 200/1251] eta: 0:16:59 lr: 0.000873 loss: 2.954243 (3.107947) time: 0.923112 data: 0.000173 max mem: 18814 Epoch: [163/300] [ 250/1251] eta: 0:16:05 lr: 0.000872 loss: 2.920606 (3.097618) time: 0.919184 data: 0.000161 max mem: 18814 Epoch: [163/300] [ 300/1251] eta: 0:15:19 lr: 0.000872 loss: 2.847753 (3.092463) time: 0.997750 data: 0.000156 max mem: 18814 Epoch: [163/300] [ 350/1251] eta: 0:14:30 lr: 0.000871 loss: 3.090138 (3.090073) time: 1.015388 data: 0.000174 max mem: 18814 Epoch: [163/300] [ 400/1251] eta: 0:13:41 lr: 0.000871 loss: 3.178768 (3.082023) time: 0.982245 data: 0.000175 max mem: 18814 Epoch: [163/300] [ 450/1251] eta: 0:12:50 lr: 0.000871 loss: 3.201353 (3.074306) time: 0.923778 data: 0.000261 max mem: 18814 Epoch: [163/300] [ 500/1251] eta: 0:12:03 lr: 0.000870 loss: 2.983659 (3.077721) time: 0.943131 data: 0.000168 max mem: 18814 Epoch: [163/300] [ 550/1251] eta: 0:11:15 lr: 0.000870 loss: 3.209727 (3.084956) time: 0.983471 data: 0.000159 max mem: 18814 Epoch: [163/300] [ 600/1251] eta: 0:10:27 lr: 0.000869 loss: 3.110373 (3.075724) time: 0.984225 data: 0.000160 max mem: 18814 Epoch: [163/300] [ 650/1251] eta: 0:09:38 lr: 0.000869 loss: 3.269868 (3.082176) time: 0.959328 data: 0.000177 max mem: 18814 Epoch: [163/300] [ 700/1251] eta: 0:08:50 lr: 0.000869 loss: 2.880946 (3.089175) time: 0.926085 data: 0.000162 max mem: 18814 Epoch: [163/300] [ 750/1251] eta: 0:08:02 lr: 0.000868 loss: 2.756577 (3.089421) time: 0.928995 data: 0.000163 max mem: 18814 Epoch: [163/300] [ 800/1251] eta: 0:07:14 lr: 0.000868 loss: 2.986961 (3.089548) time: 1.005111 data: 0.000170 max mem: 18814 Epoch: [163/300] [ 850/1251] eta: 0:06:26 lr: 0.000867 loss: 2.825006 (3.086519) time: 0.971053 data: 0.000168 max mem: 18814 Epoch: [163/300] [ 900/1251] eta: 0:05:38 lr: 0.000867 loss: 3.058002 (3.085600) time: 0.990532 data: 0.000153 max mem: 18814 Epoch: [163/300] [ 950/1251] eta: 0:04:50 lr: 0.000867 loss: 3.066060 (3.093645) time: 0.940876 data: 0.000168 max mem: 18814 Epoch: [163/300] [1000/1251] eta: 0:04:01 lr: 0.000866 loss: 3.107874 (3.092282) time: 0.917977 data: 0.000159 max mem: 18814 Epoch: [163/300] [1050/1251] eta: 0:03:13 lr: 0.000866 loss: 3.186735 (3.092152) time: 0.990831 data: 0.000174 max mem: 18814 Epoch: [163/300] [1100/1251] eta: 0:02:25 lr: 0.000865 loss: 2.990506 (3.088971) time: 0.968797 data: 0.000166 max mem: 18814 Epoch: [163/300] [1150/1251] eta: 0:01:37 lr: 0.000865 loss: 3.000354 (3.088263) time: 0.976543 data: 0.000162 max mem: 18814 Epoch: [163/300] [1200/1251] eta: 0:00:49 lr: 0.000864 loss: 3.052579 (3.090584) time: 0.937112 data: 0.000278 max mem: 18814 Epoch: [163/300] [1250/1251] eta: 0:00:00 lr: 0.000864 loss: 3.122585 (3.091056) time: 0.935080 data: 0.000759 max mem: 18814 Epoch: [163/300] Total time: 0:20:05 (0.963749 s / it) Averaged stats: lr: 0.000864 loss: 3.122585 (3.096611) Test: [ 0/49] eta: 0:01:30 loss: 0.637274 (0.637274) acc1: 82.812500 (82.812500) acc5: 93.750000 (93.750000) time: 1.844042 data: 1.439008 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.658760 (0.730834) acc1: 82.812500 (82.812500) acc5: 95.312500 (96.022727) time: 0.501332 data: 0.130968 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.769664 (0.763385) acc1: 79.687500 (81.547619) acc5: 96.875000 (96.056548) time: 0.365704 data: 0.000154 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.784187 (0.766980) acc1: 78.125000 (81.098790) acc5: 96.875000 (96.169355) time: 0.368335 data: 0.000141 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.793998 (0.783101) acc1: 79.687500 (80.830793) acc5: 95.312500 (95.922256) time: 0.366565 data: 0.000135 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.824370 (0.790914) acc1: 79.687500 (80.800000) acc5: 95.312500 (96.000000) time: 0.355430 data: 0.000115 max mem: 18814 Test: Total time: 0:00:19 (0.396228 s / it) * Acc@1 81.168 Acc@5 95.944 loss 0.793 Max accuracy: 81.32% Epoch: [164/300] [ 0/1251] eta: 0:44:37 lr: 0.000864 loss: 2.738081 (2.738081) time: 2.140488 data: 1.215770 max mem: 18814 Epoch: [164/300] [ 50/1251] eta: 0:19:35 lr: 0.000864 loss: 3.171491 (3.170916) time: 0.994902 data: 0.000175 max mem: 18814 Epoch: [164/300] [ 100/1251] eta: 0:18:34 lr: 0.000863 loss: 3.222271 (3.170514) time: 0.991618 data: 0.000165 max mem: 18814 Epoch: [164/300] [ 150/1251] eta: 0:17:39 lr: 0.000863 loss: 2.951039 (3.141680) time: 0.936509 data: 0.000162 max mem: 18814 Epoch: [164/300] [ 200/1251] eta: 0:16:54 lr: 0.000862 loss: 3.194448 (3.156124) time: 0.930479 data: 0.000172 max mem: 18814 Epoch: [164/300] [ 250/1251] eta: 0:16:09 lr: 0.000862 loss: 3.429878 (3.154154) time: 0.992400 data: 0.000172 max mem: 18814 Epoch: [164/300] [ 300/1251] eta: 0:15:20 lr: 0.000862 loss: 3.080952 (3.146165) time: 0.961237 data: 0.000149 max mem: 18814 Epoch: [164/300] [ 350/1251] eta: 0:14:29 lr: 0.000861 loss: 3.259647 (3.140169) time: 0.984765 data: 0.000171 max mem: 18814 Epoch: [164/300] [ 400/1251] eta: 0:13:39 lr: 0.000861 loss: 3.248984 (3.126094) time: 0.928761 data: 0.000170 max mem: 18814 Epoch: [164/300] [ 450/1251] eta: 0:12:52 lr: 0.000860 loss: 3.073388 (3.117246) time: 0.923717 data: 0.000155 max mem: 18814 Epoch: [164/300] [ 500/1251] eta: 0:12:04 lr: 0.000860 loss: 3.373477 (3.118394) time: 0.968727 data: 0.000154 max mem: 18814 Epoch: [164/300] [ 550/1251] eta: 0:11:16 lr: 0.000860 loss: 3.167015 (3.109988) time: 0.976306 data: 0.000152 max mem: 18814 Epoch: [164/300] [ 600/1251] eta: 0:10:27 lr: 0.000859 loss: 3.042099 (3.102706) time: 0.982011 data: 0.000155 max mem: 18814 Epoch: [164/300] [ 650/1251] eta: 0:09:39 lr: 0.000859 loss: 3.079503 (3.105090) time: 0.919432 data: 0.000172 max mem: 18814 Epoch: [164/300] [ 700/1251] eta: 0:08:51 lr: 0.000858 loss: 3.217562 (3.108715) time: 0.917700 data: 0.000172 max mem: 18814 Epoch: [164/300] [ 750/1251] eta: 0:08:03 lr: 0.000858 loss: 3.168046 (3.109745) time: 1.008024 data: 0.000167 max mem: 18814 Epoch: [164/300] [ 800/1251] eta: 0:07:15 lr: 0.000857 loss: 3.137491 (3.106003) time: 0.969896 data: 0.000165 max mem: 18814 Epoch: [164/300] [ 850/1251] eta: 0:06:27 lr: 0.000857 loss: 3.231988 (3.107093) time: 0.976457 data: 0.000175 max mem: 18814 Epoch: [164/300] [ 900/1251] eta: 0:05:38 lr: 0.000857 loss: 3.041903 (3.105445) time: 0.940166 data: 0.000155 max mem: 18814 Epoch: [164/300] [ 950/1251] eta: 0:04:50 lr: 0.000856 loss: 3.088566 (3.099150) time: 0.921852 data: 0.000170 max mem: 18814 Epoch: [164/300] [1000/1251] eta: 0:04:02 lr: 0.000856 loss: 3.246455 (3.105307) time: 1.002388 data: 0.000168 max mem: 18814 Epoch: [164/300] [1050/1251] eta: 0:03:14 lr: 0.000855 loss: 3.006981 (3.105330) time: 0.977857 data: 0.000157 max mem: 18814 Epoch: [164/300] [1100/1251] eta: 0:02:25 lr: 0.000855 loss: 3.219577 (3.105283) time: 0.976755 data: 0.000163 max mem: 18814 Epoch: [164/300] [1150/1251] eta: 0:01:37 lr: 0.000855 loss: 3.171971 (3.107056) time: 1.010010 data: 0.000170 max mem: 18814 Epoch: [164/300] [1200/1251] eta: 0:00:49 lr: 0.000854 loss: 3.245280 (3.102042) time: 0.924721 data: 0.000161 max mem: 18814 Epoch: [164/300] [1250/1251] eta: 0:00:00 lr: 0.000854 loss: 3.233498 (3.103112) time: 0.927915 data: 0.000751 max mem: 18814 Epoch: [164/300] Total time: 0:20:08 (0.965909 s / it) Averaged stats: lr: 0.000854 loss: 3.233498 (3.103260) Test: [ 0/49] eta: 0:01:17 loss: 0.566173 (0.566173) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.589568 data: 1.171369 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.622150 (0.729842) acc1: 82.812500 (82.528409) acc5: 96.875000 (96.875000) time: 0.477465 data: 0.106619 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.793041 (0.782962) acc1: 81.250000 (81.473214) acc5: 96.875000 (96.428571) time: 0.378420 data: 0.000144 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.812563 (0.780065) acc1: 81.250000 (81.451613) acc5: 96.875000 (96.622984) time: 0.386436 data: 0.000138 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.816006 (0.795689) acc1: 79.687500 (81.021341) acc5: 96.875000 (96.455793) time: 0.368285 data: 0.000123 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.863434 (0.801382) acc1: 79.687500 (80.800000) acc5: 96.875000 (96.288000) time: 0.355796 data: 0.000098 max mem: 18814 Test: Total time: 0:00:19 (0.398171 s / it) * Acc@1 81.100 Acc@5 96.004 loss 0.813 Max accuracy: 81.32% Epoch: [165/300] [ 0/1251] eta: 0:39:36 lr: 0.000854 loss: 2.567613 (2.567613) time: 1.900071 data: 1.009840 max mem: 18814 Epoch: [165/300] [ 50/1251] eta: 0:19:44 lr: 0.000853 loss: 3.168697 (3.087608) time: 0.972422 data: 0.000154 max mem: 18814 Epoch: [165/300] [ 100/1251] eta: 0:18:46 lr: 0.000853 loss: 3.233310 (3.123328) time: 1.044508 data: 0.000154 max mem: 18814 Epoch: [165/300] [ 150/1251] eta: 0:17:49 lr: 0.000853 loss: 3.081595 (3.094330) time: 0.994424 data: 0.000158 max mem: 18814 Epoch: [165/300] [ 200/1251] eta: 0:16:54 lr: 0.000852 loss: 3.220947 (3.117155) time: 0.918610 data: 0.000160 max mem: 18814 Epoch: [165/300] [ 250/1251] eta: 0:16:09 lr: 0.000852 loss: 3.371596 (3.143097) time: 0.946995 data: 0.000169 max mem: 18814 Epoch: [165/300] [ 300/1251] eta: 0:15:22 lr: 0.000851 loss: 3.001687 (3.129988) time: 0.989867 data: 0.000156 max mem: 18814 Epoch: [165/300] [ 350/1251] eta: 0:14:34 lr: 0.000851 loss: 3.202773 (3.107036) time: 0.976508 data: 0.000148 max mem: 18814 Epoch: [165/300] [ 400/1251] eta: 0:13:43 lr: 0.000851 loss: 3.144558 (3.099530) time: 0.974544 data: 0.000160 max mem: 18814 Epoch: [165/300] [ 450/1251] eta: 0:12:53 lr: 0.000850 loss: 3.281695 (3.109299) time: 0.936648 data: 0.000152 max mem: 18814 Epoch: [165/300] [ 500/1251] eta: 0:12:05 lr: 0.000850 loss: 3.070013 (3.095019) time: 0.932686 data: 0.000161 max mem: 18814 Epoch: [165/300] [ 550/1251] eta: 0:11:16 lr: 0.000849 loss: 3.174099 (3.100185) time: 0.976089 data: 0.000165 max mem: 18814 Epoch: [165/300] [ 600/1251] eta: 0:10:27 lr: 0.000849 loss: 3.175625 (3.099478) time: 0.915654 data: 0.000155 max mem: 18814 Epoch: [165/300] [ 650/1251] eta: 0:09:39 lr: 0.000848 loss: 3.313632 (3.101297) time: 0.986929 data: 0.000155 max mem: 18814 Epoch: [165/300] [ 700/1251] eta: 0:08:50 lr: 0.000848 loss: 3.227824 (3.099047) time: 0.976617 data: 0.000147 max mem: 18814 Epoch: [165/300] [ 750/1251] eta: 0:08:02 lr: 0.000848 loss: 3.235531 (3.095103) time: 0.922075 data: 0.000149 max mem: 18814 Epoch: [165/300] [ 800/1251] eta: 0:07:13 lr: 0.000847 loss: 3.275002 (3.094562) time: 0.932504 data: 0.000165 max mem: 18814 Epoch: [165/300] [ 850/1251] eta: 0:06:26 lr: 0.000847 loss: 2.947896 (3.091167) time: 0.922381 data: 0.000165 max mem: 18814 Epoch: [165/300] [ 900/1251] eta: 0:05:38 lr: 0.000846 loss: 3.271607 (3.086308) time: 0.969340 data: 0.000156 max mem: 18814 Epoch: [165/300] [ 950/1251] eta: 0:04:49 lr: 0.000846 loss: 2.871355 (3.077862) time: 0.963096 data: 0.000164 max mem: 18814 Epoch: [165/300] [1000/1251] eta: 0:04:01 lr: 0.000846 loss: 3.204503 (3.081456) time: 0.928025 data: 0.000155 max mem: 18814 Epoch: [165/300] [1050/1251] eta: 0:03:13 lr: 0.000845 loss: 3.097237 (3.081796) time: 0.926491 data: 0.000164 max mem: 18814 Epoch: [165/300] [1100/1251] eta: 0:02:25 lr: 0.000845 loss: 3.006315 (3.081497) time: 0.920571 data: 0.000150 max mem: 18814 Epoch: [165/300] [1150/1251] eta: 0:01:37 lr: 0.000844 loss: 3.006001 (3.078198) time: 0.979758 data: 0.000158 max mem: 18814 Epoch: [165/300] [1200/1251] eta: 0:00:49 lr: 0.000844 loss: 3.059094 (3.072412) time: 0.992578 data: 0.000156 max mem: 18814 Epoch: [165/300] [1250/1251] eta: 0:00:00 lr: 0.000844 loss: 3.322786 (3.076395) time: 0.938106 data: 0.000747 max mem: 18814 Epoch: [165/300] Total time: 0:20:03 (0.961853 s / it) Averaged stats: lr: 0.000844 loss: 3.322786 (3.076387) Test: [ 0/49] eta: 0:01:26 loss: 0.615576 (0.615576) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.759177 data: 1.362610 max mem: 18814 Test: [10/49] eta: 0:00:26 loss: 0.624244 (0.709838) acc1: 84.375000 (83.806818) acc5: 96.875000 (96.306818) time: 0.686178 data: 0.124004 max mem: 18814 Test: [20/49] eta: 0:00:15 loss: 0.816749 (0.744853) acc1: 81.250000 (81.994048) acc5: 96.875000 (96.428571) time: 0.469186 data: 0.000132 max mem: 18814 Test: [30/49] eta: 0:00:09 loss: 0.797703 (0.742134) acc1: 81.250000 (81.552419) acc5: 96.875000 (96.471774) time: 0.360184 data: 0.000129 max mem: 18814 Test: [40/49] eta: 0:00:04 loss: 0.761217 (0.758067) acc1: 81.250000 (81.288110) acc5: 95.312500 (96.303354) time: 0.358503 data: 0.000136 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.817968 (0.757949) acc1: 79.687500 (81.120000) acc5: 95.312500 (96.320000) time: 0.353047 data: 0.000112 max mem: 18814 Test: Total time: 0:00:21 (0.433395 s / it) * Acc@1 81.446 Acc@5 96.038 loss 0.762 Max accuracy: 81.45% Epoch: [166/300] [ 0/1251] eta: 0:43:47 lr: 0.000844 loss: 3.540082 (3.540082) time: 2.100624 data: 1.155911 max mem: 18814 Epoch: [166/300] [ 50/1251] eta: 0:19:20 lr: 0.000843 loss: 3.111994 (3.155125) time: 0.920399 data: 0.000148 max mem: 18814 Epoch: [166/300] [ 100/1251] eta: 0:18:27 lr: 0.000843 loss: 3.057986 (3.131355) time: 0.928497 data: 0.000172 max mem: 18814 Epoch: [166/300] [ 150/1251] eta: 0:17:42 lr: 0.000842 loss: 3.243329 (3.110751) time: 0.968647 data: 0.000154 max mem: 18814 Epoch: [166/300] [ 200/1251] eta: 0:16:50 lr: 0.000842 loss: 3.214867 (3.118244) time: 0.980116 data: 0.000157 max mem: 18814 Epoch: [166/300] [ 250/1251] eta: 0:16:03 lr: 0.000842 loss: 3.000273 (3.113151) time: 0.984792 data: 0.000162 max mem: 18814 Epoch: [166/300] [ 300/1251] eta: 0:15:13 lr: 0.000841 loss: 3.067838 (3.104016) time: 0.922599 data: 0.000150 max mem: 18814 Epoch: [166/300] [ 350/1251] eta: 0:14:27 lr: 0.000841 loss: 3.110658 (3.102583) time: 0.932888 data: 0.000166 max mem: 18814 Epoch: [166/300] [ 400/1251] eta: 0:13:40 lr: 0.000840 loss: 3.393691 (3.115647) time: 0.998477 data: 0.000174 max mem: 18814 Epoch: [166/300] [ 450/1251] eta: 0:12:52 lr: 0.000840 loss: 3.246475 (3.120969) time: 1.005227 data: 0.000162 max mem: 18814 Epoch: [166/300] [ 500/1251] eta: 0:12:04 lr: 0.000839 loss: 3.288458 (3.106969) time: 0.981691 data: 0.000181 max mem: 18814 Epoch: [166/300] [ 550/1251] eta: 0:11:14 lr: 0.000839 loss: 3.090596 (3.118220) time: 0.914738 data: 0.000175 max mem: 18814 Epoch: [166/300] [ 600/1251] eta: 0:10:26 lr: 0.000839 loss: 3.182806 (3.110378) time: 0.920646 data: 0.000159 max mem: 18814 Epoch: [166/300] [ 650/1251] eta: 0:09:38 lr: 0.000838 loss: 3.144827 (3.108693) time: 0.975648 data: 0.000159 max mem: 18814 Epoch: [166/300] [ 700/1251] eta: 0:08:49 lr: 0.000838 loss: 3.296820 (3.108204) time: 0.980784 data: 0.000155 max mem: 18814 Epoch: [166/300] [ 750/1251] eta: 0:08:01 lr: 0.000837 loss: 3.044808 (3.104031) time: 0.978533 data: 0.000161 max mem: 18814 Epoch: [166/300] [ 800/1251] eta: 0:07:12 lr: 0.000837 loss: 2.893397 (3.095845) time: 0.927552 data: 0.000143 max mem: 18814 Epoch: [166/300] [ 850/1251] eta: 0:06:25 lr: 0.000837 loss: 3.142496 (3.094814) time: 0.939889 data: 0.000161 max mem: 18814 Epoch: [166/300] [ 900/1251] eta: 0:05:37 lr: 0.000836 loss: 2.469580 (3.088391) time: 0.988744 data: 0.000170 max mem: 18814 Epoch: [166/300] [ 950/1251] eta: 0:04:49 lr: 0.000836 loss: 2.927666 (3.090865) time: 1.019499 data: 0.000165 max mem: 18814 Epoch: [166/300] [1000/1251] eta: 0:04:01 lr: 0.000835 loss: 3.059977 (3.091114) time: 0.968120 data: 0.000165 max mem: 18814 Epoch: [166/300] [1050/1251] eta: 0:03:13 lr: 0.000835 loss: 3.116622 (3.083886) time: 0.929629 data: 0.000163 max mem: 18814 Epoch: [166/300] [1100/1251] eta: 0:02:25 lr: 0.000835 loss: 3.125168 (3.080003) time: 0.922553 data: 0.000156 max mem: 18814 Epoch: [166/300] [1150/1251] eta: 0:01:37 lr: 0.000834 loss: 2.615071 (3.073946) time: 0.994780 data: 0.000162 max mem: 18814 Epoch: [166/300] [1200/1251] eta: 0:00:49 lr: 0.000834 loss: 3.215626 (3.075968) time: 1.004762 data: 0.000167 max mem: 18814 Epoch: [166/300] [1250/1251] eta: 0:00:00 lr: 0.000833 loss: 3.286870 (3.078884) time: 0.978829 data: 0.000749 max mem: 18814 Epoch: [166/300] Total time: 0:20:04 (0.962909 s / it) Averaged stats: lr: 0.000833 loss: 3.286870 (3.083635) Test: [ 0/49] eta: 0:01:28 loss: 0.564063 (0.564063) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.812268 data: 1.388843 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.698572 (0.736203) acc1: 82.812500 (83.664773) acc5: 95.312500 (95.880682) time: 0.500473 data: 0.126393 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.816317 (0.776567) acc1: 81.250000 (81.622024) acc5: 96.875000 (95.982143) time: 0.369394 data: 0.000146 max mem: 18814 Test: [30/49] eta: 0:00:09 loss: 0.816317 (0.780842) acc1: 79.687500 (81.350806) acc5: 96.875000 (96.118952) time: 0.465757 data: 0.000134 max mem: 18814 Test: [40/49] eta: 0:00:04 loss: 0.822834 (0.801954) acc1: 81.250000 (81.059451) acc5: 95.312500 (95.922256) time: 0.458905 data: 0.000117 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.842772 (0.797939) acc1: 81.250000 (81.216000) acc5: 95.312500 (96.032000) time: 0.352990 data: 0.000098 max mem: 18814 Test: Total time: 0:00:21 (0.434650 s / it) * Acc@1 81.414 Acc@5 96.004 loss 0.802 Max accuracy: 81.45% Epoch: [167/300] [ 0/1251] eta: 0:42:19 lr: 0.000833 loss: 3.658435 (3.658435) time: 2.030303 data: 1.115170 max mem: 18814 Epoch: [167/300] [ 50/1251] eta: 0:19:49 lr: 0.000833 loss: 3.058932 (3.110829) time: 0.935076 data: 0.000165 max mem: 18814 Epoch: [167/300] [ 100/1251] eta: 0:18:52 lr: 0.000833 loss: 3.211551 (3.100595) time: 0.985353 data: 0.000169 max mem: 18814 Epoch: [167/300] [ 150/1251] eta: 0:17:48 lr: 0.000832 loss: 3.241199 (3.095743) time: 0.980951 data: 0.000146 max mem: 18814 Epoch: [167/300] [ 200/1251] eta: 0:17:02 lr: 0.000832 loss: 2.973159 (3.061382) time: 0.973292 data: 0.000158 max mem: 18814 Epoch: [167/300] [ 250/1251] eta: 0:16:09 lr: 0.000831 loss: 3.332181 (3.085361) time: 0.931127 data: 0.000153 max mem: 18814 Epoch: [167/300] [ 300/1251] eta: 0:15:20 lr: 0.000831 loss: 3.040997 (3.077016) time: 0.935413 data: 0.000147 max mem: 18814 Epoch: [167/300] [ 350/1251] eta: 0:14:32 lr: 0.000830 loss: 2.883846 (3.084031) time: 0.971103 data: 0.000155 max mem: 18814 Epoch: [167/300] [ 400/1251] eta: 0:13:40 lr: 0.000830 loss: 3.264368 (3.090646) time: 0.962738 data: 0.000156 max mem: 18814 Epoch: [167/300] [ 450/1251] eta: 0:12:53 lr: 0.000830 loss: 2.908371 (3.080363) time: 0.972413 data: 0.000156 max mem: 18814 Epoch: [167/300] [ 500/1251] eta: 0:12:03 lr: 0.000829 loss: 3.122620 (3.078277) time: 0.921395 data: 0.000152 max mem: 18814 Epoch: [167/300] [ 550/1251] eta: 0:11:15 lr: 0.000829 loss: 2.793333 (3.071311) time: 0.926226 data: 0.000155 max mem: 18814 Epoch: [167/300] [ 600/1251] eta: 0:10:27 lr: 0.000828 loss: 3.132097 (3.074549) time: 0.996325 data: 0.000160 max mem: 18814 Epoch: [167/300] [ 650/1251] eta: 0:09:38 lr: 0.000828 loss: 3.381829 (3.085066) time: 0.972628 data: 0.000147 max mem: 18814 Epoch: [167/300] [ 700/1251] eta: 0:08:51 lr: 0.000828 loss: 3.266517 (3.090073) time: 1.001750 data: 0.000165 max mem: 18814 Epoch: [167/300] [ 750/1251] eta: 0:08:02 lr: 0.000827 loss: 3.070657 (3.097041) time: 0.931495 data: 0.000158 max mem: 18814 Epoch: [167/300] [ 800/1251] eta: 0:07:14 lr: 0.000827 loss: 3.124789 (3.099695) time: 0.925862 data: 0.000161 max mem: 18814 Epoch: [167/300] [ 850/1251] eta: 0:06:26 lr: 0.000826 loss: 3.456785 (3.106440) time: 0.982226 data: 0.000162 max mem: 18814 Epoch: [167/300] [ 900/1251] eta: 0:05:38 lr: 0.000826 loss: 3.176858 (3.104221) time: 1.017460 data: 0.000178 max mem: 18814 Epoch: [167/300] [ 950/1251] eta: 0:04:50 lr: 0.000826 loss: 2.696871 (3.094742) time: 0.974552 data: 0.000168 max mem: 18814 Epoch: [167/300] [1000/1251] eta: 0:04:01 lr: 0.000825 loss: 3.137968 (3.093085) time: 0.923140 data: 0.000147 max mem: 18814 Epoch: [167/300] [1050/1251] eta: 0:03:13 lr: 0.000825 loss: 3.177330 (3.094129) time: 0.919108 data: 0.000159 max mem: 18814 Epoch: [167/300] [1100/1251] eta: 0:02:25 lr: 0.000824 loss: 3.280345 (3.093995) time: 0.980649 data: 0.000157 max mem: 18814 Epoch: [167/300] [1150/1251] eta: 0:01:37 lr: 0.000824 loss: 3.105955 (3.092994) time: 1.002505 data: 0.000151 max mem: 18814 Epoch: [167/300] [1200/1251] eta: 0:00:49 lr: 0.000824 loss: 3.176360 (3.093986) time: 0.981643 data: 0.000160 max mem: 18814 Epoch: [167/300] [1250/1251] eta: 0:00:00 lr: 0.000823 loss: 3.315110 (3.094351) time: 0.934945 data: 0.000756 max mem: 18814 Epoch: [167/300] Total time: 0:20:05 (0.963572 s / it) Averaged stats: lr: 0.000823 loss: 3.315110 (3.093339) Test: [ 0/49] eta: 0:01:29 loss: 0.731073 (0.731073) acc1: 81.250000 (81.250000) acc5: 96.875000 (96.875000) time: 1.823813 data: 1.366258 max mem: 18814 Test: [10/49] eta: 0:00:20 loss: 0.731073 (0.739625) acc1: 82.812500 (83.096591) acc5: 96.875000 (95.738636) time: 0.516407 data: 0.124328 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.766203 (0.761481) acc1: 82.812500 (81.622024) acc5: 96.875000 (95.907738) time: 0.374200 data: 0.000135 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.749196 (0.770407) acc1: 78.125000 (81.149194) acc5: 95.312500 (95.766129) time: 0.361096 data: 0.000131 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.769706 (0.778410) acc1: 81.250000 (81.288110) acc5: 95.312500 (95.807927) time: 0.357951 data: 0.000128 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.769706 (0.772880) acc1: 81.250000 (81.344000) acc5: 95.312500 (95.904000) time: 0.353178 data: 0.000107 max mem: 18814 Test: Total time: 0:00:19 (0.395171 s / it) * Acc@1 81.278 Acc@5 95.894 loss 0.782 Max accuracy: 81.45% Epoch: [168/300] [ 0/1251] eta: 0:40:44 lr: 0.000823 loss: 3.643166 (3.643166) time: 1.953906 data: 1.060266 max mem: 18814 Epoch: [168/300] [ 50/1251] eta: 0:20:11 lr: 0.000823 loss: 3.037025 (2.969233) time: 1.035315 data: 0.000158 max mem: 18814 Epoch: [168/300] [ 100/1251] eta: 0:19:02 lr: 0.000822 loss: 3.232443 (3.001643) time: 0.981050 data: 0.000171 max mem: 18814 Epoch: [168/300] [ 150/1251] eta: 0:17:58 lr: 0.000822 loss: 3.236816 (3.036328) time: 0.983264 data: 0.000148 max mem: 18814 Epoch: [168/300] [ 200/1251] eta: 0:16:59 lr: 0.000822 loss: 3.169418 (3.067082) time: 0.920594 data: 0.000156 max mem: 18814 Epoch: [168/300] [ 250/1251] eta: 0:16:13 lr: 0.000821 loss: 3.027195 (3.065563) time: 0.929698 data: 0.000170 max mem: 18814 Epoch: [168/300] [ 300/1251] eta: 0:15:24 lr: 0.000821 loss: 3.062700 (3.074622) time: 0.992122 data: 0.000162 max mem: 18814 Epoch: [168/300] [ 350/1251] eta: 0:14:36 lr: 0.000820 loss: 3.212780 (3.072754) time: 0.983409 data: 0.000151 max mem: 18814 Epoch: [168/300] [ 400/1251] eta: 0:13:44 lr: 0.000820 loss: 2.997207 (3.066465) time: 0.960832 data: 0.000168 max mem: 18814 Epoch: [168/300] [ 450/1251] eta: 0:12:54 lr: 0.000819 loss: 3.074185 (3.060856) time: 0.925289 data: 0.000155 max mem: 18814 Epoch: [168/300] [ 500/1251] eta: 0:12:06 lr: 0.000819 loss: 3.324697 (3.065268) time: 0.925998 data: 0.000149 max mem: 18814 Epoch: [168/300] [ 550/1251] eta: 0:11:18 lr: 0.000819 loss: 3.184124 (3.070593) time: 0.975932 data: 0.000152 max mem: 18814 Epoch: [168/300] [ 600/1251] eta: 0:10:30 lr: 0.000818 loss: 3.241694 (3.071138) time: 0.984360 data: 0.000157 max mem: 18814 Epoch: [168/300] [ 650/1251] eta: 0:09:41 lr: 0.000818 loss: 3.013309 (3.073904) time: 0.966041 data: 0.000144 max mem: 18814 Epoch: [168/300] [ 700/1251] eta: 0:08:52 lr: 0.000817 loss: 3.027663 (3.073040) time: 0.928839 data: 0.000157 max mem: 18814 Epoch: [168/300] [ 750/1251] eta: 0:08:04 lr: 0.000817 loss: 3.207214 (3.072769) time: 0.932858 data: 0.000156 max mem: 18814 Epoch: [168/300] [ 800/1251] eta: 0:07:15 lr: 0.000817 loss: 3.089271 (3.069794) time: 0.987686 data: 0.000168 max mem: 18814 Epoch: [168/300] [ 850/1251] eta: 0:06:27 lr: 0.000816 loss: 3.365452 (3.077747) time: 0.992723 data: 0.000187 max mem: 18814 Epoch: [168/300] [ 900/1251] eta: 0:05:39 lr: 0.000816 loss: 3.018147 (3.082088) time: 0.982834 data: 0.000163 max mem: 18814 Epoch: [168/300] [ 950/1251] eta: 0:04:50 lr: 0.000815 loss: 2.992341 (3.073936) time: 0.918150 data: 0.000158 max mem: 18814 Epoch: [168/300] [1000/1251] eta: 0:04:02 lr: 0.000815 loss: 3.334769 (3.072770) time: 0.939790 data: 0.000158 max mem: 18814 Epoch: [168/300] [1050/1251] eta: 0:03:14 lr: 0.000815 loss: 3.081486 (3.070694) time: 0.971920 data: 0.000165 max mem: 18814 Epoch: [168/300] [1100/1251] eta: 0:02:25 lr: 0.000814 loss: 3.050185 (3.071376) time: 0.977533 data: 0.000156 max mem: 18814 Epoch: [168/300] [1150/1251] eta: 0:01:37 lr: 0.000814 loss: 3.035305 (3.071198) time: 0.992978 data: 0.000167 max mem: 18814 Epoch: [168/300] [1200/1251] eta: 0:00:49 lr: 0.000813 loss: 3.424263 (3.071971) time: 0.935028 data: 0.000162 max mem: 18814 Epoch: [168/300] [1250/1251] eta: 0:00:00 lr: 0.000813 loss: 2.918584 (3.071639) time: 0.983379 data: 0.000764 max mem: 18814 Epoch: [168/300] Total time: 0:20:07 (0.965300 s / it) Averaged stats: lr: 0.000813 loss: 2.918584 (3.073416) Test: [ 0/49] eta: 0:01:25 loss: 0.547140 (0.547140) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.750088 data: 1.339366 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.640861 (0.719268) acc1: 82.812500 (83.522727) acc5: 96.875000 (95.880682) time: 0.489615 data: 0.121917 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.777190 (0.753619) acc1: 79.687500 (81.994048) acc5: 95.312500 (95.758929) time: 0.365387 data: 0.000147 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.777190 (0.762529) acc1: 79.687500 (81.502016) acc5: 95.312500 (95.866935) time: 0.368261 data: 0.000128 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.740768 (0.766009) acc1: 81.250000 (81.592988) acc5: 96.875000 (95.960366) time: 0.366368 data: 0.000124 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.740768 (0.765628) acc1: 81.250000 (81.600000) acc5: 96.875000 (96.032000) time: 0.356649 data: 0.000100 max mem: 18814 Test: Total time: 0:00:19 (0.393257 s / it) * Acc@1 81.686 Acc@5 96.036 loss 0.772 Max accuracy: 81.69% Epoch: [169/300] [ 0/1251] eta: 0:41:29 lr: 0.000813 loss: 2.007767 (2.007767) time: 1.989785 data: 1.020591 max mem: 18814 Epoch: [169/300] [ 50/1251] eta: 0:19:55 lr: 0.000813 loss: 3.019141 (2.981849) time: 1.026430 data: 0.000174 max mem: 18814 Epoch: [169/300] [ 100/1251] eta: 0:18:31 lr: 0.000812 loss: 2.947640 (2.989285) time: 0.951518 data: 0.000158 max mem: 18814 Epoch: [169/300] [ 150/1251] eta: 0:17:41 lr: 0.000812 loss: 3.163044 (3.022001) time: 0.921067 data: 0.000161 max mem: 18814 Epoch: [169/300] [ 200/1251] eta: 0:16:57 lr: 0.000811 loss: 2.890973 (3.031275) time: 0.940773 data: 0.000166 max mem: 18814 Epoch: [169/300] [ 250/1251] eta: 0:16:11 lr: 0.000811 loss: 3.335399 (3.059335) time: 0.991252 data: 0.000152 max mem: 18814 Epoch: [169/300] [ 300/1251] eta: 0:15:23 lr: 0.000811 loss: 2.981292 (3.037640) time: 1.045848 data: 0.000152 max mem: 18814 Epoch: [169/300] [ 350/1251] eta: 0:14:32 lr: 0.000810 loss: 2.883287 (3.040162) time: 0.975788 data: 0.000164 max mem: 18814 Epoch: [169/300] [ 400/1251] eta: 0:13:40 lr: 0.000810 loss: 3.210347 (3.044072) time: 0.927056 data: 0.000168 max mem: 18814 Epoch: [169/300] [ 450/1251] eta: 0:12:52 lr: 0.000809 loss: 3.321505 (3.050201) time: 0.929275 data: 0.000199 max mem: 18814 Epoch: [169/300] [ 500/1251] eta: 0:12:05 lr: 0.000809 loss: 3.247234 (3.052106) time: 1.000257 data: 0.000174 max mem: 18814 Epoch: [169/300] [ 550/1251] eta: 0:11:17 lr: 0.000808 loss: 2.861922 (3.046488) time: 1.034619 data: 0.000158 max mem: 18814 Epoch: [169/300] [ 600/1251] eta: 0:10:28 lr: 0.000808 loss: 3.107976 (3.050915) time: 0.970845 data: 0.000162 max mem: 18814 Epoch: [169/300] [ 650/1251] eta: 0:09:39 lr: 0.000808 loss: 2.811437 (3.047321) time: 0.924540 data: 0.000156 max mem: 18814 Epoch: [169/300] [ 700/1251] eta: 0:08:51 lr: 0.000807 loss: 3.273615 (3.054980) time: 0.921449 data: 0.000145 max mem: 18814 Epoch: [169/300] [ 750/1251] eta: 0:08:02 lr: 0.000807 loss: 3.253601 (3.051792) time: 0.984340 data: 0.000155 max mem: 18814 Epoch: [169/300] [ 800/1251] eta: 0:07:15 lr: 0.000806 loss: 2.818688 (3.047345) time: 1.037215 data: 0.000176 max mem: 18814 Epoch: [169/300] [ 850/1251] eta: 0:06:26 lr: 0.000806 loss: 3.084602 (3.050693) time: 1.002296 data: 0.000161 max mem: 18814 Epoch: [169/300] [ 900/1251] eta: 0:05:38 lr: 0.000806 loss: 3.287220 (3.056309) time: 0.931369 data: 0.000169 max mem: 18814 Epoch: [169/300] [ 950/1251] eta: 0:04:50 lr: 0.000805 loss: 3.187867 (3.054705) time: 0.940053 data: 0.000158 max mem: 18814 Epoch: [169/300] [1000/1251] eta: 0:04:02 lr: 0.000805 loss: 3.081548 (3.057750) time: 0.996232 data: 0.000157 max mem: 18814 Epoch: [169/300] [1050/1251] eta: 0:03:13 lr: 0.000804 loss: 3.071323 (3.060024) time: 1.039932 data: 0.000169 max mem: 18814 Epoch: [169/300] [1100/1251] eta: 0:02:25 lr: 0.000804 loss: 3.178887 (3.059213) time: 0.987577 data: 0.000162 max mem: 18814 Epoch: [169/300] [1150/1251] eta: 0:01:37 lr: 0.000804 loss: 3.122437 (3.057352) time: 0.927998 data: 0.000159 max mem: 18814 Epoch: [169/300] [1200/1251] eta: 0:00:49 lr: 0.000803 loss: 2.817395 (3.056691) time: 0.935589 data: 0.000165 max mem: 18814 Epoch: [169/300] [1250/1251] eta: 0:00:00 lr: 0.000803 loss: 2.994788 (3.056532) time: 0.981271 data: 0.000737 max mem: 18814 Epoch: [169/300] Total time: 0:20:06 (0.964600 s / it) Averaged stats: lr: 0.000803 loss: 2.994788 (3.054218) Test: [ 0/49] eta: 0:01:17 loss: 0.582963 (0.582963) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.578631 data: 1.100842 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.717749 (0.755455) acc1: 84.375000 (82.528409) acc5: 96.875000 (96.164773) time: 0.476777 data: 0.100255 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.801445 (0.793505) acc1: 81.250000 (81.398810) acc5: 96.875000 (96.056548) time: 0.364759 data: 0.000172 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.798459 (0.783056) acc1: 81.250000 (81.653226) acc5: 96.875000 (96.169355) time: 0.369771 data: 0.000153 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.798286 (0.790029) acc1: 81.250000 (81.402439) acc5: 96.875000 (96.112805) time: 0.369246 data: 0.000157 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.811516 (0.792144) acc1: 79.687500 (81.344000) acc5: 96.875000 (96.128000) time: 0.359059 data: 0.000127 max mem: 18814 Test: Total time: 0:00:19 (0.392621 s / it) * Acc@1 81.454 Acc@5 96.120 loss 0.789 Max accuracy: 81.69% Epoch: [170/300] [ 0/1251] eta: 0:43:04 lr: 0.000803 loss: 3.536688 (3.536688) time: 2.066119 data: 1.182713 max mem: 18814 Epoch: [170/300] [ 50/1251] eta: 0:19:32 lr: 0.000802 loss: 3.029357 (3.031884) time: 1.000979 data: 0.000157 max mem: 18814 Epoch: [170/300] [ 100/1251] eta: 0:18:35 lr: 0.000802 loss: 3.071004 (3.060701) time: 0.941298 data: 0.000145 max mem: 18814 Epoch: [170/300] [ 150/1251] eta: 0:17:47 lr: 0.000802 loss: 3.061584 (3.052423) time: 0.935486 data: 0.000163 max mem: 18814 Epoch: [170/300] [ 200/1251] eta: 0:17:00 lr: 0.000801 loss: 3.148003 (3.042407) time: 0.949917 data: 0.000156 max mem: 18814 Epoch: [170/300] [ 250/1251] eta: 0:16:10 lr: 0.000801 loss: 3.293924 (3.045265) time: 0.982606 data: 0.000148 max mem: 18814 Epoch: [170/300] [ 300/1251] eta: 0:15:19 lr: 0.000800 loss: 3.074171 (3.037166) time: 0.980462 data: 0.000158 max mem: 18814 Epoch: [170/300] [ 350/1251] eta: 0:14:28 lr: 0.000800 loss: 2.971188 (3.029671) time: 0.921056 data: 0.000152 max mem: 18814 Epoch: [170/300] [ 400/1251] eta: 0:13:41 lr: 0.000800 loss: 3.188595 (3.041733) time: 0.932243 data: 0.000166 max mem: 18814 Epoch: [170/300] [ 450/1251] eta: 0:12:53 lr: 0.000799 loss: 2.833323 (3.039015) time: 0.920494 data: 0.000175 max mem: 18814 Epoch: [170/300] [ 500/1251] eta: 0:12:06 lr: 0.000799 loss: 3.187972 (3.036904) time: 0.982618 data: 0.000164 max mem: 18814 Epoch: [170/300] [ 550/1251] eta: 0:11:17 lr: 0.000798 loss: 3.153207 (3.033513) time: 0.978081 data: 0.000160 max mem: 18814 Epoch: [170/300] [ 600/1251] eta: 0:10:28 lr: 0.000798 loss: 3.289024 (3.050229) time: 0.928979 data: 0.000154 max mem: 18814 Epoch: [170/300] [ 650/1251] eta: 0:09:40 lr: 0.000798 loss: 2.902383 (3.047802) time: 0.928070 data: 0.000165 max mem: 18814 Epoch: [170/300] [ 700/1251] eta: 0:08:52 lr: 0.000797 loss: 3.074739 (3.055286) time: 0.921659 data: 0.000155 max mem: 18814 Epoch: [170/300] [ 750/1251] eta: 0:08:03 lr: 0.000797 loss: 3.326094 (3.065664) time: 0.961760 data: 0.000161 max mem: 18814 Epoch: [170/300] [ 800/1251] eta: 0:07:15 lr: 0.000796 loss: 3.055167 (3.066721) time: 0.982602 data: 0.000159 max mem: 18814 Epoch: [170/300] [ 850/1251] eta: 0:06:26 lr: 0.000796 loss: 2.978026 (3.063429) time: 0.923949 data: 0.000155 max mem: 18814 Epoch: [170/300] [ 900/1251] eta: 0:05:38 lr: 0.000796 loss: 3.219406 (3.068616) time: 0.921599 data: 0.000162 max mem: 18814 Epoch: [170/300] [ 950/1251] eta: 0:04:50 lr: 0.000795 loss: 3.069173 (3.064752) time: 0.927315 data: 0.000157 max mem: 18814 Epoch: [170/300] [1000/1251] eta: 0:04:02 lr: 0.000795 loss: 3.278939 (3.070536) time: 0.982315 data: 0.000154 max mem: 18814 Epoch: [170/300] [1050/1251] eta: 0:03:13 lr: 0.000794 loss: 3.129001 (3.066926) time: 0.983538 data: 0.000169 max mem: 18814 Epoch: [170/300] [1100/1251] eta: 0:02:25 lr: 0.000794 loss: 3.214354 (3.066707) time: 0.923685 data: 0.000164 max mem: 18814 Epoch: [170/300] [1150/1251] eta: 0:01:37 lr: 0.000793 loss: 3.053636 (3.062534) time: 0.930109 data: 0.000165 max mem: 18814 Epoch: [170/300] [1200/1251] eta: 0:00:49 lr: 0.000793 loss: 2.883915 (3.063149) time: 0.920377 data: 0.000153 max mem: 18814 Epoch: [170/300] [1250/1251] eta: 0:00:00 lr: 0.000793 loss: 3.043430 (3.057518) time: 0.972552 data: 0.000747 max mem: 18814 Epoch: [170/300] Total time: 0:20:06 (0.964196 s / it) Averaged stats: lr: 0.000793 loss: 3.043430 (3.060458) Test: [ 0/49] eta: 0:01:27 loss: 0.589825 (0.589825) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.781742 data: 1.386535 max mem: 18814 Test: [10/49] eta: 0:00:20 loss: 0.683563 (0.732440) acc1: 81.250000 (82.528409) acc5: 96.875000 (95.596591) time: 0.515245 data: 0.126188 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.792843 (0.769807) acc1: 79.687500 (81.398810) acc5: 95.312500 (95.758929) time: 0.374640 data: 0.000136 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.770578 (0.766450) acc1: 81.250000 (81.401210) acc5: 96.875000 (96.018145) time: 0.361221 data: 0.000125 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.770578 (0.779460) acc1: 81.250000 (81.288110) acc5: 96.875000 (95.998476) time: 0.374810 data: 0.000122 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.808764 (0.774803) acc1: 81.250000 (81.248000) acc5: 96.875000 (95.968000) time: 0.368267 data: 0.000098 max mem: 18814 Test: Total time: 0:00:19 (0.402477 s / it) * Acc@1 81.748 Acc@5 96.046 loss 0.764 Max accuracy: 81.75% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0170.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0170.pth Epoch: [171/300] [ 0/1251] eta: 0:41:09 lr: 0.000793 loss: 3.586599 (3.586599) time: 1.974280 data: 1.083679 max mem: 18814 Epoch: [171/300] [ 50/1251] eta: 0:19:56 lr: 0.000792 loss: 3.182079 (3.187518) time: 0.935329 data: 0.000157 max mem: 18814 Epoch: [171/300] [ 100/1251] eta: 0:18:50 lr: 0.000792 loss: 3.193370 (3.081883) time: 0.985658 data: 0.000152 max mem: 18814 Epoch: [171/300] [ 150/1251] eta: 0:17:58 lr: 0.000791 loss: 2.960499 (3.081118) time: 1.032733 data: 0.000161 max mem: 18814 Epoch: [171/300] [ 200/1251] eta: 0:17:00 lr: 0.000791 loss: 3.021437 (3.056112) time: 0.953930 data: 0.000155 max mem: 18814 Epoch: [171/300] [ 250/1251] eta: 0:16:09 lr: 0.000791 loss: 3.187372 (3.050049) time: 0.932181 data: 0.000161 max mem: 18814 Epoch: [171/300] [ 300/1251] eta: 0:15:21 lr: 0.000790 loss: 3.106614 (3.048924) time: 0.933902 data: 0.000147 max mem: 18814 Epoch: [171/300] [ 350/1251] eta: 0:14:33 lr: 0.000790 loss: 3.219011 (3.064939) time: 0.966923 data: 0.000162 max mem: 18814 Epoch: [171/300] [ 400/1251] eta: 0:13:45 lr: 0.000789 loss: 3.017374 (3.048386) time: 1.041479 data: 0.000162 max mem: 18814 Epoch: [171/300] [ 450/1251] eta: 0:12:55 lr: 0.000789 loss: 3.266419 (3.067396) time: 1.001695 data: 0.000162 max mem: 18814 Epoch: [171/300] [ 500/1251] eta: 0:12:05 lr: 0.000789 loss: 2.983088 (3.066808) time: 0.926664 data: 0.000153 max mem: 18814 Epoch: [171/300] [ 550/1251] eta: 0:11:17 lr: 0.000788 loss: 2.926287 (3.060200) time: 0.922503 data: 0.000160 max mem: 18814 Epoch: [171/300] [ 600/1251] eta: 0:10:29 lr: 0.000788 loss: 3.162251 (3.061540) time: 1.000664 data: 0.000162 max mem: 18814 Epoch: [171/300] [ 650/1251] eta: 0:09:41 lr: 0.000787 loss: 3.205830 (3.052867) time: 1.053697 data: 0.000164 max mem: 18814 Epoch: [171/300] [ 700/1251] eta: 0:08:52 lr: 0.000787 loss: 3.262924 (3.064161) time: 0.976013 data: 0.000154 max mem: 18814 Epoch: [171/300] [ 750/1251] eta: 0:08:03 lr: 0.000787 loss: 3.030837 (3.065185) time: 0.942587 data: 0.000164 max mem: 18814 Epoch: [171/300] [ 800/1251] eta: 0:07:15 lr: 0.000786 loss: 3.025426 (3.067334) time: 0.937342 data: 0.000153 max mem: 18814 Epoch: [171/300] [ 850/1251] eta: 0:06:27 lr: 0.000786 loss: 3.147963 (3.072088) time: 0.988407 data: 0.000161 max mem: 18814 Epoch: [171/300] [ 900/1251] eta: 0:05:39 lr: 0.000785 loss: 2.989099 (3.063700) time: 1.060965 data: 0.000169 max mem: 18814 Epoch: [171/300] [ 950/1251] eta: 0:04:50 lr: 0.000785 loss: 2.997854 (3.062604) time: 0.982788 data: 0.000191 max mem: 18814 Epoch: [171/300] [1000/1251] eta: 0:04:02 lr: 0.000785 loss: 3.120257 (3.067425) time: 0.966862 data: 0.000168 max mem: 18814 Epoch: [171/300] [1050/1251] eta: 0:03:13 lr: 0.000784 loss: 2.895126 (3.060558) time: 0.922638 data: 0.000173 max mem: 18814 Epoch: [171/300] [1100/1251] eta: 0:02:25 lr: 0.000784 loss: 3.090063 (3.060487) time: 0.930537 data: 0.000155 max mem: 18814 Epoch: [171/300] [1150/1251] eta: 0:01:37 lr: 0.000783 loss: 3.307363 (3.063162) time: 0.979149 data: 0.000164 max mem: 18814 Epoch: [171/300] [1200/1251] eta: 0:00:49 lr: 0.000783 loss: 3.302150 (3.065931) time: 0.968842 data: 0.000155 max mem: 18814 Epoch: [171/300] [1250/1251] eta: 0:00:00 lr: 0.000783 loss: 3.027479 (3.062862) time: 0.976964 data: 0.000752 max mem: 18814 Epoch: [171/300] Total time: 0:20:07 (0.965342 s / it) Averaged stats: lr: 0.000783 loss: 3.027479 (3.062584) Test: [ 0/49] eta: 0:01:16 loss: 0.588686 (0.588686) acc1: 82.812500 (82.812500) acc5: 100.000000 (100.000000) time: 1.571097 data: 1.129099 max mem: 18814 Test: [10/49] eta: 0:00:22 loss: 0.656777 (0.750225) acc1: 81.250000 (82.386364) acc5: 96.875000 (96.732955) time: 0.579289 data: 0.102783 max mem: 18814 Test: [20/49] eta: 0:00:14 loss: 0.780844 (0.766630) acc1: 81.250000 (82.366071) acc5: 96.875000 (96.056548) time: 0.461322 data: 0.000133 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.780844 (0.770053) acc1: 79.687500 (81.905242) acc5: 95.312500 (96.169355) time: 0.400841 data: 0.000123 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.791795 (0.778337) acc1: 79.687500 (81.745427) acc5: 95.312500 (96.227134) time: 0.357363 data: 0.000123 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.820644 (0.777073) acc1: 79.687500 (81.600000) acc5: 96.875000 (96.352000) time: 0.352551 data: 0.000099 max mem: 18814 Test: Total time: 0:00:20 (0.425040 s / it) * Acc@1 81.656 Acc@5 96.124 loss 0.779 Max accuracy: 81.75% Epoch: [172/300] [ 0/1251] eta: 0:39:43 lr: 0.000783 loss: 3.358889 (3.358889) time: 1.905470 data: 1.004558 max mem: 18814 Epoch: [172/300] [ 50/1251] eta: 0:19:28 lr: 0.000782 loss: 3.301319 (3.130221) time: 0.927432 data: 0.000150 max mem: 18814 Epoch: [172/300] [ 100/1251] eta: 0:18:39 lr: 0.000782 loss: 2.937897 (3.110795) time: 0.927616 data: 0.000176 max mem: 18814 Epoch: [172/300] [ 150/1251] eta: 0:17:54 lr: 0.000781 loss: 3.103263 (3.058710) time: 0.987675 data: 0.000177 max mem: 18814 Epoch: [172/300] [ 200/1251] eta: 0:17:03 lr: 0.000781 loss: 2.917115 (3.025530) time: 1.023162 data: 0.000154 max mem: 18814 Epoch: [172/300] [ 250/1251] eta: 0:16:10 lr: 0.000781 loss: 2.888485 (3.030333) time: 0.978908 data: 0.000159 max mem: 18814 Epoch: [172/300] [ 300/1251] eta: 0:15:19 lr: 0.000780 loss: 3.031637 (3.041556) time: 0.930512 data: 0.000157 max mem: 18814 Epoch: [172/300] [ 350/1251] eta: 0:14:33 lr: 0.000780 loss: 3.023326 (3.034045) time: 0.929749 data: 0.000155 max mem: 18814 Epoch: [172/300] [ 400/1251] eta: 0:13:46 lr: 0.000779 loss: 2.996049 (3.040075) time: 0.993868 data: 0.000164 max mem: 18814 Epoch: [172/300] [ 450/1251] eta: 0:12:58 lr: 0.000779 loss: 3.116791 (3.042846) time: 0.971339 data: 0.000160 max mem: 18814 Epoch: [172/300] [ 500/1251] eta: 0:12:08 lr: 0.000779 loss: 3.209871 (3.048418) time: 0.987046 data: 0.000177 max mem: 18814 Epoch: [172/300] [ 550/1251] eta: 0:11:18 lr: 0.000778 loss: 2.989927 (3.050594) time: 0.921558 data: 0.000169 max mem: 18814 Epoch: [172/300] [ 600/1251] eta: 0:10:30 lr: 0.000778 loss: 3.147127 (3.062258) time: 0.925917 data: 0.000173 max mem: 18814 Epoch: [172/300] [ 650/1251] eta: 0:09:42 lr: 0.000777 loss: 3.133632 (3.060602) time: 0.993916 data: 0.000171 max mem: 18814 Epoch: [172/300] [ 700/1251] eta: 0:08:54 lr: 0.000777 loss: 3.107657 (3.062331) time: 0.961265 data: 0.000169 max mem: 18814 Epoch: [172/300] [ 750/1251] eta: 0:08:04 lr: 0.000777 loss: 3.057818 (3.061789) time: 0.985377 data: 0.000171 max mem: 18814 Epoch: [172/300] [ 800/1251] eta: 0:07:16 lr: 0.000776 loss: 3.218933 (3.062068) time: 0.926138 data: 0.000163 max mem: 18814 Epoch: [172/300] [ 850/1251] eta: 0:06:28 lr: 0.000776 loss: 2.999432 (3.057449) time: 0.940561 data: 0.000170 max mem: 18814 Epoch: [172/300] [ 900/1251] eta: 0:05:39 lr: 0.000775 loss: 3.015820 (3.057140) time: 0.992772 data: 0.000169 max mem: 18814 Epoch: [172/300] [ 950/1251] eta: 0:04:51 lr: 0.000775 loss: 3.269326 (3.054035) time: 0.985573 data: 0.000171 max mem: 18814 Epoch: [172/300] [1000/1251] eta: 0:04:02 lr: 0.000774 loss: 2.962661 (3.050485) time: 0.954453 data: 0.000181 max mem: 18814 Epoch: [172/300] [1050/1251] eta: 0:03:14 lr: 0.000774 loss: 3.031592 (3.050548) time: 0.941239 data: 0.000151 max mem: 18814 Epoch: [172/300] [1100/1251] eta: 0:02:25 lr: 0.000774 loss: 2.825320 (3.046511) time: 0.927378 data: 0.000171 max mem: 18814 Epoch: [172/300] [1150/1251] eta: 0:01:37 lr: 0.000773 loss: 3.179736 (3.044541) time: 0.989377 data: 0.000163 max mem: 18814 Epoch: [172/300] [1200/1251] eta: 0:00:49 lr: 0.000773 loss: 3.223684 (3.049741) time: 0.972337 data: 0.000165 max mem: 18814 Epoch: [172/300] [1250/1251] eta: 0:00:00 lr: 0.000772 loss: 3.113625 (3.050570) time: 0.958413 data: 0.000775 max mem: 18814 Epoch: [172/300] Total time: 0:20:08 (0.966033 s / it) Averaged stats: lr: 0.000772 loss: 3.113625 (3.056315) Test: [ 0/49] eta: 0:01:29 loss: 0.516289 (0.516289) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.823625 data: 1.409025 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.659191 (0.701694) acc1: 82.812500 (82.954545) acc5: 96.875000 (96.022727) time: 0.506273 data: 0.128232 max mem: 18814 Test: [20/49] eta: 0:00:15 loss: 0.757426 (0.740086) acc1: 81.250000 (81.696429) acc5: 95.312500 (96.056548) time: 0.477326 data: 0.000136 max mem: 18814 Test: [30/49] eta: 0:00:09 loss: 0.757426 (0.741312) acc1: 81.250000 (81.401210) acc5: 95.312500 (96.270161) time: 0.469734 data: 0.000123 max mem: 18814 Test: [40/49] eta: 0:00:04 loss: 0.784240 (0.752902) acc1: 81.250000 (81.516768) acc5: 96.875000 (96.303354) time: 0.357064 data: 0.000120 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.805815 (0.754498) acc1: 81.250000 (81.568000) acc5: 96.875000 (96.256000) time: 0.352255 data: 0.000102 max mem: 18814 Test: Total time: 0:00:21 (0.437218 s / it) * Acc@1 81.606 Acc@5 96.100 loss 0.764 Max accuracy: 81.75% Epoch: [173/300] [ 0/1251] eta: 0:41:14 lr: 0.000772 loss: 2.826250 (2.826250) time: 1.977762 data: 1.084356 max mem: 18814 Epoch: [173/300] [ 50/1251] eta: 0:19:53 lr: 0.000772 loss: 2.579749 (2.894272) time: 0.931824 data: 0.000157 max mem: 18814 Epoch: [173/300] [ 100/1251] eta: 0:18:44 lr: 0.000772 loss: 3.271896 (3.007344) time: 0.962647 data: 0.000179 max mem: 18814 Epoch: [173/300] [ 150/1251] eta: 0:17:50 lr: 0.000771 loss: 2.913375 (3.007104) time: 1.003385 data: 0.000147 max mem: 18814 Epoch: [173/300] [ 200/1251] eta: 0:16:57 lr: 0.000771 loss: 2.943418 (2.992965) time: 0.972398 data: 0.000169 max mem: 18814 Epoch: [173/300] [ 250/1251] eta: 0:16:06 lr: 0.000770 loss: 2.790868 (2.978180) time: 0.930769 data: 0.000165 max mem: 18814 Epoch: [173/300] [ 300/1251] eta: 0:15:18 lr: 0.000770 loss: 3.078508 (2.981911) time: 0.937900 data: 0.000146 max mem: 18814 Epoch: [173/300] [ 350/1251] eta: 0:14:30 lr: 0.000770 loss: 2.628604 (2.966609) time: 0.971221 data: 0.000168 max mem: 18814 Epoch: [173/300] [ 400/1251] eta: 0:13:44 lr: 0.000769 loss: 2.817493 (2.979848) time: 1.070929 data: 0.000162 max mem: 18814 Epoch: [173/300] [ 450/1251] eta: 0:12:54 lr: 0.000769 loss: 3.174515 (2.993359) time: 0.972892 data: 0.000167 max mem: 18814 Epoch: [173/300] [ 500/1251] eta: 0:12:04 lr: 0.000768 loss: 3.183804 (3.004508) time: 0.924281 data: 0.000152 max mem: 18814 Epoch: [173/300] [ 550/1251] eta: 0:11:16 lr: 0.000768 loss: 3.392385 (3.010368) time: 0.935981 data: 0.000169 max mem: 18814 Epoch: [173/300] [ 600/1251] eta: 0:10:29 lr: 0.000768 loss: 3.046161 (3.013265) time: 1.015766 data: 0.000171 max mem: 18814 Epoch: [173/300] [ 650/1251] eta: 0:09:41 lr: 0.000767 loss: 3.034738 (3.019241) time: 0.986560 data: 0.000158 max mem: 18814 Epoch: [173/300] [ 700/1251] eta: 0:08:52 lr: 0.000767 loss: 3.068788 (3.020900) time: 0.975419 data: 0.000161 max mem: 18814 Epoch: [173/300] [ 750/1251] eta: 0:08:03 lr: 0.000766 loss: 3.261429 (3.027438) time: 0.919416 data: 0.000153 max mem: 18814 Epoch: [173/300] [ 800/1251] eta: 0:07:15 lr: 0.000766 loss: 3.132988 (3.026875) time: 0.926609 data: 0.000159 max mem: 18814 Epoch: [173/300] [ 850/1251] eta: 0:06:27 lr: 0.000766 loss: 3.067382 (3.023435) time: 0.984143 data: 0.000165 max mem: 18814 Epoch: [173/300] [ 900/1251] eta: 0:05:38 lr: 0.000765 loss: 3.033343 (3.023140) time: 0.981678 data: 0.000155 max mem: 18814 Epoch: [173/300] [ 950/1251] eta: 0:04:50 lr: 0.000765 loss: 3.112794 (3.025693) time: 0.966475 data: 0.000159 max mem: 18814 Epoch: [173/300] [1000/1251] eta: 0:04:02 lr: 0.000764 loss: 3.259372 (3.033661) time: 0.935816 data: 0.000146 max mem: 18814 Epoch: [173/300] [1050/1251] eta: 0:03:13 lr: 0.000764 loss: 3.074415 (3.036156) time: 0.930661 data: 0.000173 max mem: 18814 Epoch: [173/300] [1100/1251] eta: 0:02:25 lr: 0.000764 loss: 3.080073 (3.043152) time: 0.996565 data: 0.000155 max mem: 18814 Epoch: [173/300] [1150/1251] eta: 0:01:37 lr: 0.000763 loss: 3.216363 (3.046988) time: 0.967534 data: 0.000166 max mem: 18814 Epoch: [173/300] [1200/1251] eta: 0:00:49 lr: 0.000763 loss: 3.286846 (3.052207) time: 0.975144 data: 0.000177 max mem: 18814 Epoch: [173/300] [1250/1251] eta: 0:00:00 lr: 0.000762 loss: 3.384271 (3.056081) time: 0.932023 data: 0.000765 max mem: 18814 Epoch: [173/300] Total time: 0:20:05 (0.963830 s / it) Averaged stats: lr: 0.000762 loss: 3.384271 (3.055009) Test: [ 0/49] eta: 0:01:19 loss: 0.536372 (0.536372) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.626092 data: 1.172241 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.673581 (0.757781) acc1: 81.250000 (81.676136) acc5: 96.875000 (96.022727) time: 0.479486 data: 0.106715 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.812171 (0.774356) acc1: 78.125000 (81.026786) acc5: 96.875000 (95.982143) time: 0.373159 data: 0.000150 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.796177 (0.770168) acc1: 79.687500 (81.199597) acc5: 96.875000 (96.219758) time: 0.370936 data: 0.000153 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.773801 (0.775217) acc1: 82.812500 (81.631098) acc5: 96.875000 (96.265244) time: 0.357556 data: 0.000148 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.786404 (0.773211) acc1: 82.812500 (81.952000) acc5: 96.875000 (96.384000) time: 0.377657 data: 0.000123 max mem: 18814 Test: Total time: 0:00:19 (0.400625 s / it) * Acc@1 81.636 Acc@5 96.082 loss 0.789 Max accuracy: 81.75% Epoch: [174/300] [ 0/1251] eta: 0:42:49 lr: 0.000762 loss: 2.602779 (2.602779) time: 2.054292 data: 1.148980 max mem: 18814 Epoch: [174/300] [ 50/1251] eta: 0:19:56 lr: 0.000762 loss: 2.963226 (2.949171) time: 0.986251 data: 0.000158 max mem: 18814 Epoch: [174/300] [ 100/1251] eta: 0:18:52 lr: 0.000762 loss: 3.120323 (2.975198) time: 1.030968 data: 0.000167 max mem: 18814 Epoch: [174/300] [ 150/1251] eta: 0:17:51 lr: 0.000761 loss: 3.277171 (3.046911) time: 0.977460 data: 0.000151 max mem: 18814 Epoch: [174/300] [ 200/1251] eta: 0:16:59 lr: 0.000761 loss: 3.138184 (3.009066) time: 0.929556 data: 0.000164 max mem: 18814 Epoch: [174/300] [ 250/1251] eta: 0:16:11 lr: 0.000760 loss: 3.236860 (3.025857) time: 0.935557 data: 0.000162 max mem: 18814 Epoch: [174/300] [ 300/1251] eta: 0:15:21 lr: 0.000760 loss: 3.310639 (3.030684) time: 0.971076 data: 0.000161 max mem: 18814 Epoch: [174/300] [ 350/1251] eta: 0:14:33 lr: 0.000760 loss: 3.035849 (3.027822) time: 0.989718 data: 0.000145 max mem: 18814 Epoch: [174/300] [ 400/1251] eta: 0:13:43 lr: 0.000759 loss: 3.230758 (3.024161) time: 1.000524 data: 0.000169 max mem: 18814 Epoch: [174/300] [ 450/1251] eta: 0:12:53 lr: 0.000759 loss: 3.402921 (3.029014) time: 0.922907 data: 0.000181 max mem: 18814 Epoch: [174/300] [ 500/1251] eta: 0:12:06 lr: 0.000758 loss: 3.349323 (3.031423) time: 0.932644 data: 0.000184 max mem: 18814 Epoch: [174/300] [ 550/1251] eta: 0:11:18 lr: 0.000758 loss: 3.117263 (3.023899) time: 0.987435 data: 0.000177 max mem: 18814 Epoch: [174/300] [ 600/1251] eta: 0:10:30 lr: 0.000758 loss: 3.184824 (3.021962) time: 0.981172 data: 0.000170 max mem: 18814 Epoch: [174/300] [ 650/1251] eta: 0:09:41 lr: 0.000757 loss: 2.988027 (3.025858) time: 0.973679 data: 0.000157 max mem: 18814 Epoch: [174/300] [ 700/1251] eta: 0:08:52 lr: 0.000757 loss: 3.180346 (3.028833) time: 0.930802 data: 0.000153 max mem: 18814 Epoch: [174/300] [ 750/1251] eta: 0:08:04 lr: 0.000756 loss: 3.201665 (3.037542) time: 0.928498 data: 0.000154 max mem: 18814 Epoch: [174/300] [ 800/1251] eta: 0:07:16 lr: 0.000756 loss: 3.301351 (3.036359) time: 0.985151 data: 0.000158 max mem: 18814 Epoch: [174/300] [ 850/1251] eta: 0:06:28 lr: 0.000756 loss: 3.198911 (3.039302) time: 0.994710 data: 0.000166 max mem: 18814 Epoch: [174/300] [ 900/1251] eta: 0:05:39 lr: 0.000755 loss: 2.767643 (3.031816) time: 0.996543 data: 0.000158 max mem: 18814 Epoch: [174/300] [ 950/1251] eta: 0:04:50 lr: 0.000755 loss: 3.261536 (3.033664) time: 0.927940 data: 0.000167 max mem: 18814 Epoch: [174/300] [1000/1251] eta: 0:04:02 lr: 0.000754 loss: 3.102328 (3.033373) time: 0.922956 data: 0.000170 max mem: 18814 Epoch: [174/300] [1050/1251] eta: 0:03:14 lr: 0.000754 loss: 3.025283 (3.035361) time: 0.986402 data: 0.000179 max mem: 18814 Epoch: [174/300] [1100/1251] eta: 0:02:25 lr: 0.000754 loss: 3.214864 (3.042411) time: 0.974458 data: 0.000169 max mem: 18814 Epoch: [174/300] [1150/1251] eta: 0:01:37 lr: 0.000753 loss: 3.088048 (3.041526) time: 0.991888 data: 0.000172 max mem: 18814 Epoch: [174/300] [1200/1251] eta: 0:00:49 lr: 0.000753 loss: 3.205902 (3.043347) time: 0.943907 data: 0.000159 max mem: 18814 Epoch: [174/300] [1250/1251] eta: 0:00:00 lr: 0.000752 loss: 3.005115 (3.043109) time: 0.921698 data: 0.000733 max mem: 18814 Epoch: [174/300] Total time: 0:20:08 (0.965958 s / it) Averaged stats: lr: 0.000752 loss: 3.005115 (3.034304) Test: [ 0/49] eta: 0:01:26 loss: 0.576713 (0.576713) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.757558 data: 1.342044 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.681218 (0.743758) acc1: 82.812500 (83.096591) acc5: 96.875000 (96.448864) time: 0.493023 data: 0.122127 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.761860 (0.763847) acc1: 81.250000 (82.366071) acc5: 96.875000 (96.354167) time: 0.371160 data: 0.000126 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.763537 (0.768995) acc1: 81.250000 (82.258065) acc5: 96.875000 (96.320565) time: 0.367915 data: 0.000125 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.768761 (0.776149) acc1: 81.250000 (82.126524) acc5: 95.312500 (96.150915) time: 0.358118 data: 0.000127 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.798621 (0.779904) acc1: 81.250000 (81.728000) acc5: 95.312500 (96.224000) time: 0.353422 data: 0.000104 max mem: 18814 Test: Total time: 0:00:19 (0.393221 s / it) * Acc@1 81.738 Acc@5 96.196 loss 0.789 Max accuracy: 81.75% Epoch: [175/300] [ 0/1251] eta: 0:40:47 lr: 0.000752 loss: 2.928541 (2.928541) time: 1.956711 data: 1.061370 max mem: 18814 Epoch: [175/300] [ 50/1251] eta: 0:19:44 lr: 0.000752 loss: 3.037765 (2.880854) time: 1.035195 data: 0.000158 max mem: 18814 Epoch: [175/300] [ 100/1251] eta: 0:18:29 lr: 0.000752 loss: 3.266544 (2.992148) time: 0.965490 data: 0.000156 max mem: 18814 Epoch: [175/300] [ 150/1251] eta: 0:17:34 lr: 0.000751 loss: 3.102579 (2.995945) time: 0.928585 data: 0.000168 max mem: 18814 Epoch: [175/300] [ 200/1251] eta: 0:16:52 lr: 0.000751 loss: 3.085382 (3.016403) time: 0.933304 data: 0.000158 max mem: 18814 Epoch: [175/300] [ 250/1251] eta: 0:16:04 lr: 0.000750 loss: 3.194670 (3.036304) time: 0.983951 data: 0.000171 max mem: 18814 Epoch: [175/300] [ 300/1251] eta: 0:15:17 lr: 0.000750 loss: 2.987270 (3.027033) time: 1.032506 data: 0.000158 max mem: 18814 Epoch: [175/300] [ 350/1251] eta: 0:14:27 lr: 0.000750 loss: 3.104636 (3.034441) time: 0.972594 data: 0.000151 max mem: 18814 Epoch: [175/300] [ 400/1251] eta: 0:13:37 lr: 0.000749 loss: 3.286922 (3.046216) time: 0.928237 data: 0.000156 max mem: 18814 Epoch: [175/300] [ 450/1251] eta: 0:12:51 lr: 0.000749 loss: 3.212605 (3.044565) time: 0.932237 data: 0.000174 max mem: 18814 Epoch: [175/300] [ 500/1251] eta: 0:12:03 lr: 0.000748 loss: 3.059013 (3.038416) time: 0.997195 data: 0.000167 max mem: 18814 Epoch: [175/300] [ 550/1251] eta: 0:11:15 lr: 0.000748 loss: 3.175861 (3.041775) time: 0.986305 data: 0.000150 max mem: 18814 Epoch: [175/300] [ 600/1251] eta: 0:10:26 lr: 0.000748 loss: 2.896386 (3.039127) time: 0.989469 data: 0.000168 max mem: 18814 Epoch: [175/300] [ 650/1251] eta: 0:09:38 lr: 0.000747 loss: 3.341501 (3.043326) time: 0.923281 data: 0.000145 max mem: 18814 Epoch: [175/300] [ 700/1251] eta: 0:08:49 lr: 0.000747 loss: 3.115297 (3.033459) time: 0.922067 data: 0.000154 max mem: 18814 Epoch: [175/300] [ 750/1251] eta: 0:08:02 lr: 0.000746 loss: 3.170452 (3.038498) time: 0.991638 data: 0.000150 max mem: 18814 Epoch: [175/300] [ 800/1251] eta: 0:07:14 lr: 0.000746 loss: 2.873124 (3.041188) time: 1.041544 data: 0.000158 max mem: 18814 Epoch: [175/300] [ 850/1251] eta: 0:06:25 lr: 0.000746 loss: 2.826173 (3.040622) time: 0.985105 data: 0.000170 max mem: 18814 Epoch: [175/300] [ 900/1251] eta: 0:05:37 lr: 0.000745 loss: 3.438016 (3.047044) time: 0.933078 data: 0.000174 max mem: 18814 Epoch: [175/300] [ 950/1251] eta: 0:04:49 lr: 0.000745 loss: 3.156177 (3.048665) time: 0.920785 data: 0.000169 max mem: 18814 Epoch: [175/300] [1000/1251] eta: 0:04:01 lr: 0.000744 loss: 3.170105 (3.055807) time: 0.983200 data: 0.000149 max mem: 18814 Epoch: [175/300] [1050/1251] eta: 0:03:13 lr: 0.000744 loss: 3.055848 (3.050678) time: 0.980046 data: 0.000152 max mem: 18814 Epoch: [175/300] [1100/1251] eta: 0:02:25 lr: 0.000744 loss: 3.315568 (3.053119) time: 0.984314 data: 0.000170 max mem: 18814 Epoch: [175/300] [1150/1251] eta: 0:01:37 lr: 0.000743 loss: 2.967578 (3.047609) time: 0.933486 data: 0.000168 max mem: 18814 Epoch: [175/300] [1200/1251] eta: 0:00:49 lr: 0.000743 loss: 3.264941 (3.049977) time: 0.931964 data: 0.000157 max mem: 18814 Epoch: [175/300] [1250/1251] eta: 0:00:00 lr: 0.000742 loss: 3.299274 (3.050041) time: 0.979307 data: 0.000753 max mem: 18814 Epoch: [175/300] Total time: 0:20:05 (0.963841 s / it) Averaged stats: lr: 0.000742 loss: 3.299274 (3.054689) Test: [ 0/49] eta: 0:01:13 loss: 0.500765 (0.500765) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.493961 data: 1.078418 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.623391 (0.702292) acc1: 82.812500 (83.096591) acc5: 96.875000 (96.732955) time: 0.469912 data: 0.098168 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.780031 (0.732732) acc1: 81.250000 (81.845238) acc5: 96.875000 (96.354167) time: 0.375955 data: 0.000132 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.755465 (0.737519) acc1: 82.812500 (82.006048) acc5: 96.875000 (96.471774) time: 0.408880 data: 0.000124 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.728495 (0.745767) acc1: 82.812500 (81.897866) acc5: 96.875000 (96.570122) time: 0.394164 data: 0.000120 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.755465 (0.743070) acc1: 81.250000 (81.984000) acc5: 96.875000 (96.480000) time: 0.360867 data: 0.000103 max mem: 18814 Test: Total time: 0:00:19 (0.403945 s / it) * Acc@1 81.558 Acc@5 96.116 loss 0.756 Max accuracy: 81.75% Epoch: [176/300] [ 0/1251] eta: 0:41:43 lr: 0.000742 loss: 2.994103 (2.994103) time: 2.001109 data: 1.102998 max mem: 18814 Epoch: [176/300] [ 50/1251] eta: 0:19:26 lr: 0.000742 loss: 2.846069 (2.963232) time: 0.986193 data: 0.000144 max mem: 18814 Epoch: [176/300] [ 100/1251] eta: 0:18:22 lr: 0.000742 loss: 2.606377 (2.927022) time: 0.935277 data: 0.000183 max mem: 18814 Epoch: [176/300] [ 150/1251] eta: 0:17:36 lr: 0.000741 loss: 3.037235 (2.973483) time: 0.941278 data: 0.000154 max mem: 18814 Epoch: [176/300] [ 200/1251] eta: 0:16:52 lr: 0.000741 loss: 3.467953 (3.021361) time: 0.998412 data: 0.000169 max mem: 18814 Epoch: [176/300] [ 250/1251] eta: 0:16:07 lr: 0.000740 loss: 2.987381 (3.027833) time: 1.054094 data: 0.000166 max mem: 18814 Epoch: [176/300] [ 300/1251] eta: 0:15:16 lr: 0.000740 loss: 3.028682 (3.031558) time: 0.981176 data: 0.000161 max mem: 18814 Epoch: [176/300] [ 350/1251] eta: 0:14:25 lr: 0.000740 loss: 2.976645 (3.032718) time: 0.929352 data: 0.000152 max mem: 18814 Epoch: [176/300] [ 400/1251] eta: 0:13:39 lr: 0.000739 loss: 3.059712 (3.024497) time: 0.935103 data: 0.000155 max mem: 18814 Epoch: [176/300] [ 450/1251] eta: 0:12:52 lr: 0.000739 loss: 2.550084 (3.022345) time: 0.970539 data: 0.000157 max mem: 18814 Epoch: [176/300] [ 500/1251] eta: 0:12:04 lr: 0.000738 loss: 2.936432 (3.012466) time: 1.029071 data: 0.000200 max mem: 18814 Epoch: [176/300] [ 550/1251] eta: 0:11:15 lr: 0.000738 loss: 3.224058 (3.003140) time: 0.987674 data: 0.000171 max mem: 18814 Epoch: [176/300] [ 600/1251] eta: 0:10:26 lr: 0.000738 loss: 3.303621 (3.013617) time: 0.920013 data: 0.000160 max mem: 18814 Epoch: [176/300] [ 650/1251] eta: 0:09:39 lr: 0.000737 loss: 3.049226 (3.006690) time: 0.927850 data: 0.000163 max mem: 18814 Epoch: [176/300] [ 700/1251] eta: 0:08:51 lr: 0.000737 loss: 3.311850 (3.006194) time: 1.015939 data: 0.000161 max mem: 18814 Epoch: [176/300] [ 750/1251] eta: 0:08:03 lr: 0.000736 loss: 3.170822 (3.006206) time: 0.961050 data: 0.000166 max mem: 18814 Epoch: [176/300] [ 800/1251] eta: 0:07:14 lr: 0.000736 loss: 3.213111 (3.008262) time: 0.971722 data: 0.000157 max mem: 18814 Epoch: [176/300] [ 850/1251] eta: 0:06:26 lr: 0.000736 loss: 3.284123 (3.014999) time: 0.939542 data: 0.000147 max mem: 18814 Epoch: [176/300] [ 900/1251] eta: 0:05:38 lr: 0.000735 loss: 3.117444 (3.007710) time: 0.919937 data: 0.000152 max mem: 18814 Epoch: [176/300] [ 950/1251] eta: 0:04:50 lr: 0.000735 loss: 3.068902 (3.006129) time: 0.994061 data: 0.000169 max mem: 18814 Epoch: [176/300] [1000/1251] eta: 0:04:02 lr: 0.000734 loss: 3.036427 (3.003270) time: 0.979035 data: 0.000155 max mem: 18814 Epoch: [176/300] [1050/1251] eta: 0:03:13 lr: 0.000734 loss: 2.964320 (3.003278) time: 0.998392 data: 0.000191 max mem: 18814 Epoch: [176/300] [1100/1251] eta: 0:02:25 lr: 0.000734 loss: 3.049618 (3.002112) time: 0.918450 data: 0.000159 max mem: 18814 Epoch: [176/300] [1150/1251] eta: 0:01:37 lr: 0.000733 loss: 3.112215 (3.002133) time: 0.932702 data: 0.000165 max mem: 18814 Epoch: [176/300] [1200/1251] eta: 0:00:49 lr: 0.000733 loss: 3.029246 (3.005581) time: 0.976505 data: 0.000203 max mem: 18814 Epoch: [176/300] [1250/1251] eta: 0:00:00 lr: 0.000732 loss: 3.292511 (3.010331) time: 0.962621 data: 0.000756 max mem: 18814 Epoch: [176/300] Total time: 0:20:07 (0.964838 s / it) Averaged stats: lr: 0.000732 loss: 3.292511 (3.012610) Test: [ 0/49] eta: 0:01:23 loss: 0.635330 (0.635330) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.699116 data: 1.266321 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.671173 (0.748153) acc1: 82.812500 (82.812500) acc5: 96.875000 (95.596591) time: 0.487309 data: 0.115276 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.768949 (0.753715) acc1: 79.687500 (81.696429) acc5: 96.875000 (96.056548) time: 0.363705 data: 0.000156 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.738400 (0.754215) acc1: 79.687500 (81.804435) acc5: 96.875000 (96.118952) time: 0.361107 data: 0.000151 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.769030 (0.767143) acc1: 79.687500 (81.707317) acc5: 96.875000 (96.189024) time: 0.358473 data: 0.000154 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.777497 (0.767183) acc1: 79.687500 (81.600000) acc5: 96.875000 (96.224000) time: 0.374143 data: 0.000118 max mem: 18814 Test: Total time: 0:00:19 (0.396943 s / it) * Acc@1 81.702 Acc@5 96.086 loss 0.767 Max accuracy: 81.75% Epoch: [177/300] [ 0/1251] eta: 0:42:00 lr: 0.000732 loss: 2.515081 (2.515081) time: 2.014924 data: 1.117940 max mem: 18814 Epoch: [177/300] [ 50/1251] eta: 0:19:25 lr: 0.000732 loss: 3.103031 (3.047587) time: 0.926487 data: 0.000159 max mem: 18814 Epoch: [177/300] [ 100/1251] eta: 0:18:36 lr: 0.000732 loss: 3.200540 (3.049757) time: 0.927675 data: 0.000163 max mem: 18814 Epoch: [177/300] [ 150/1251] eta: 0:17:45 lr: 0.000731 loss: 3.177933 (3.061227) time: 0.959617 data: 0.000178 max mem: 18814 Epoch: [177/300] [ 200/1251] eta: 0:16:55 lr: 0.000731 loss: 3.125632 (3.054564) time: 0.959441 data: 0.000162 max mem: 18814 Epoch: [177/300] [ 250/1251] eta: 0:16:02 lr: 0.000730 loss: 3.103255 (3.038187) time: 0.974476 data: 0.000172 max mem: 18814 Epoch: [177/300] [ 300/1251] eta: 0:15:12 lr: 0.000730 loss: 3.281832 (3.054287) time: 0.935882 data: 0.000161 max mem: 18814 Epoch: [177/300] [ 350/1251] eta: 0:14:28 lr: 0.000730 loss: 2.925038 (3.042240) time: 0.927078 data: 0.000159 max mem: 18814 Epoch: [177/300] [ 400/1251] eta: 0:13:39 lr: 0.000729 loss: 3.280987 (3.044566) time: 0.975024 data: 0.000174 max mem: 18814 Epoch: [177/300] [ 450/1251] eta: 0:12:52 lr: 0.000729 loss: 3.060313 (3.052055) time: 0.968744 data: 0.000160 max mem: 18814 Epoch: [177/300] [ 500/1251] eta: 0:12:03 lr: 0.000728 loss: 3.054429 (3.047809) time: 0.973000 data: 0.000168 max mem: 18814 Epoch: [177/300] [ 550/1251] eta: 0:11:14 lr: 0.000728 loss: 3.104182 (3.055711) time: 0.933563 data: 0.000161 max mem: 18814 Epoch: [177/300] [ 600/1251] eta: 0:10:26 lr: 0.000728 loss: 3.297996 (3.058109) time: 0.931135 data: 0.000159 max mem: 18814 Epoch: [177/300] [ 650/1251] eta: 0:09:38 lr: 0.000727 loss: 3.167304 (3.060281) time: 0.962922 data: 0.000177 max mem: 18814 Epoch: [177/300] [ 700/1251] eta: 0:08:51 lr: 0.000727 loss: 3.141161 (3.059965) time: 0.988395 data: 0.000160 max mem: 18814 Epoch: [177/300] [ 750/1251] eta: 0:08:02 lr: 0.000726 loss: 2.887304 (3.054639) time: 0.980531 data: 0.000172 max mem: 18814 Epoch: [177/300] [ 800/1251] eta: 0:07:14 lr: 0.000726 loss: 2.871968 (3.050577) time: 0.974596 data: 0.000168 max mem: 18814 Epoch: [177/300] [ 850/1251] eta: 0:06:26 lr: 0.000726 loss: 3.224414 (3.058773) time: 0.920494 data: 0.000168 max mem: 18814 Epoch: [177/300] [ 900/1251] eta: 0:05:38 lr: 0.000725 loss: 3.172249 (3.061974) time: 0.974085 data: 0.000175 max mem: 18814 Epoch: [177/300] [ 950/1251] eta: 0:04:50 lr: 0.000725 loss: 2.663190 (3.057033) time: 0.978458 data: 0.000161 max mem: 18814 Epoch: [177/300] [1000/1251] eta: 0:04:02 lr: 0.000724 loss: 3.219345 (3.055382) time: 0.984198 data: 0.000159 max mem: 18814 Epoch: [177/300] [1050/1251] eta: 0:03:14 lr: 0.000724 loss: 3.076549 (3.052107) time: 0.992922 data: 0.000185 max mem: 18814 Epoch: [177/300] [1100/1251] eta: 0:02:25 lr: 0.000724 loss: 3.170254 (3.049372) time: 0.939545 data: 0.000161 max mem: 18814 Epoch: [177/300] [1150/1251] eta: 0:01:37 lr: 0.000723 loss: 3.068273 (3.049581) time: 0.964152 data: 0.000182 max mem: 18814 Epoch: [177/300] [1200/1251] eta: 0:00:49 lr: 0.000723 loss: 2.968112 (3.044497) time: 0.972083 data: 0.000160 max mem: 18814 Epoch: [177/300] [1250/1251] eta: 0:00:00 lr: 0.000722 loss: 2.764521 (3.043200) time: 0.976161 data: 0.000809 max mem: 18814 Epoch: [177/300] Total time: 0:20:07 (0.964871 s / it) Averaged stats: lr: 0.000722 loss: 2.764521 (3.046457) Test: [ 0/49] eta: 0:01:27 loss: 0.453326 (0.453326) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.776816 data: 1.383307 max mem: 18814 Test: [10/49] eta: 0:00:25 loss: 0.605323 (0.696643) acc1: 82.812500 (82.670455) acc5: 95.312500 (96.448864) time: 0.660248 data: 0.125881 max mem: 18814 Test: [20/49] eta: 0:00:15 loss: 0.760329 (0.723826) acc1: 81.250000 (82.068452) acc5: 95.312500 (96.354167) time: 0.454741 data: 0.000145 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.739282 (0.729621) acc1: 81.250000 (81.703629) acc5: 96.875000 (96.370968) time: 0.360821 data: 0.000155 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.739282 (0.746035) acc1: 81.250000 (81.631098) acc5: 96.875000 (96.379573) time: 0.358948 data: 0.000147 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.761543 (0.749070) acc1: 81.250000 (81.728000) acc5: 96.875000 (96.416000) time: 0.353721 data: 0.000124 max mem: 18814 Test: Total time: 0:00:20 (0.427887 s / it) * Acc@1 81.922 Acc@5 96.214 loss 0.750 Max accuracy: 81.92% Epoch: [178/300] [ 0/1251] eta: 0:41:44 lr: 0.000722 loss: 3.947288 (3.947288) time: 2.002376 data: 1.113054 max mem: 18814 Epoch: [178/300] [ 50/1251] eta: 0:19:24 lr: 0.000722 loss: 3.054443 (3.073673) time: 0.919694 data: 0.000168 max mem: 18814 Epoch: [178/300] [ 100/1251] eta: 0:18:41 lr: 0.000722 loss: 2.764656 (3.001165) time: 0.922757 data: 0.000171 max mem: 18814 Epoch: [178/300] [ 150/1251] eta: 0:17:51 lr: 0.000721 loss: 3.211941 (3.030285) time: 0.993880 data: 0.000161 max mem: 18814 Epoch: [178/300] [ 200/1251] eta: 0:17:01 lr: 0.000721 loss: 3.034402 (3.015595) time: 0.967205 data: 0.000152 max mem: 18814 Epoch: [178/300] [ 250/1251] eta: 0:16:08 lr: 0.000720 loss: 3.022575 (3.011187) time: 0.962041 data: 0.000169 max mem: 18814 Epoch: [178/300] [ 300/1251] eta: 0:15:16 lr: 0.000720 loss: 3.116517 (3.013271) time: 0.917456 data: 0.000144 max mem: 18814 Epoch: [178/300] [ 350/1251] eta: 0:14:29 lr: 0.000720 loss: 3.126484 (3.026774) time: 0.938964 data: 0.000157 max mem: 18814 Epoch: [178/300] [ 400/1251] eta: 0:13:43 lr: 0.000719 loss: 3.086859 (3.022415) time: 0.962850 data: 0.000174 max mem: 18814 Epoch: [178/300] [ 450/1251] eta: 0:12:55 lr: 0.000719 loss: 3.181126 (3.025468) time: 0.989840 data: 0.000165 max mem: 18814 Epoch: [178/300] [ 500/1251] eta: 0:12:06 lr: 0.000718 loss: 2.907470 (3.022182) time: 1.003860 data: 0.000158 max mem: 18814 Epoch: [178/300] [ 550/1251] eta: 0:11:18 lr: 0.000718 loss: 2.950449 (3.015795) time: 0.982120 data: 0.000156 max mem: 18814 Epoch: [178/300] [ 600/1251] eta: 0:10:29 lr: 0.000718 loss: 3.211736 (3.024662) time: 0.925304 data: 0.000174 max mem: 18814 Epoch: [178/300] [ 650/1251] eta: 0:09:40 lr: 0.000717 loss: 3.082339 (3.021085) time: 0.916896 data: 0.000154 max mem: 18814 Epoch: [178/300] [ 700/1251] eta: 0:08:52 lr: 0.000717 loss: 2.886927 (3.016635) time: 0.987711 data: 0.000149 max mem: 18814 Epoch: [178/300] [ 750/1251] eta: 0:08:03 lr: 0.000717 loss: 3.126265 (3.019700) time: 0.980733 data: 0.000157 max mem: 18814 Epoch: [178/300] [ 800/1251] eta: 0:07:15 lr: 0.000716 loss: 3.000043 (3.019107) time: 0.950292 data: 0.000165 max mem: 18814 Epoch: [178/300] [ 850/1251] eta: 0:06:26 lr: 0.000716 loss: 3.162180 (3.016666) time: 0.938376 data: 0.000168 max mem: 18814 Epoch: [178/300] [ 900/1251] eta: 0:05:38 lr: 0.000715 loss: 2.923969 (3.015075) time: 0.925550 data: 0.000168 max mem: 18814 Epoch: [178/300] [ 950/1251] eta: 0:04:50 lr: 0.000715 loss: 3.019913 (3.010211) time: 0.982746 data: 0.000157 max mem: 18814 Epoch: [178/300] [1000/1251] eta: 0:04:01 lr: 0.000715 loss: 3.108807 (3.012062) time: 1.011006 data: 0.000163 max mem: 18814 Epoch: [178/300] [1050/1251] eta: 0:03:13 lr: 0.000714 loss: 3.222821 (3.013041) time: 0.998006 data: 0.000160 max mem: 18814 Epoch: [178/300] [1100/1251] eta: 0:02:25 lr: 0.000714 loss: 3.217221 (3.018321) time: 0.928027 data: 0.000145 max mem: 18814 Epoch: [178/300] [1150/1251] eta: 0:01:37 lr: 0.000713 loss: 3.070478 (3.016483) time: 0.928441 data: 0.000157 max mem: 18814 Epoch: [178/300] [1200/1251] eta: 0:00:49 lr: 0.000713 loss: 3.269236 (3.020034) time: 0.997500 data: 0.000155 max mem: 18814 Epoch: [178/300] [1250/1251] eta: 0:00:00 lr: 0.000713 loss: 2.937851 (3.019028) time: 0.985179 data: 0.000719 max mem: 18814 Epoch: [178/300] Total time: 0:20:07 (0.965321 s / it) Averaged stats: lr: 0.000713 loss: 2.937851 (3.025079) Test: [ 0/49] eta: 0:01:42 loss: 0.588032 (0.588032) acc1: 81.250000 (81.250000) acc5: 98.437500 (98.437500) time: 2.091084 data: 1.423978 max mem: 18814 Test: [10/49] eta: 0:00:20 loss: 0.670330 (0.719916) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.164773) time: 0.518954 data: 0.129598 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.755830 (0.756074) acc1: 81.250000 (81.026786) acc5: 95.312500 (96.056548) time: 0.360636 data: 0.000145 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.744510 (0.748342) acc1: 79.687500 (81.250000) acc5: 96.875000 (96.219758) time: 0.359991 data: 0.000141 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.749373 (0.763818) acc1: 81.250000 (81.478659) acc5: 96.875000 (96.227134) time: 0.358085 data: 0.000153 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.792584 (0.760723) acc1: 81.250000 (81.632000) acc5: 96.875000 (96.288000) time: 0.352793 data: 0.000123 max mem: 18814 Test: Total time: 0:00:19 (0.396910 s / it) * Acc@1 81.894 Acc@5 96.076 loss 0.764 Max accuracy: 81.92% Epoch: [179/300] [ 0/1251] eta: 2:00:23 lr: 0.000713 loss: 3.835334 (3.835334) time: 5.774495 data: 1.689544 max mem: 18510 Epoch: [179/300] [ 50/1251] eta: 0:20:45 lr: 0.000712 loss: 3.161159 (3.208507) time: 0.935020 data: 0.000164 max mem: 18814 Epoch: [179/300] [ 100/1251] eta: 0:19:08 lr: 0.000712 loss: 3.191861 (3.122391) time: 0.955632 data: 0.000173 max mem: 18814 Epoch: [179/300] [ 150/1251] eta: 0:18:10 lr: 0.000711 loss: 3.079408 (3.083680) time: 1.044906 data: 0.000170 max mem: 18814 Epoch: [179/300] [ 200/1251] eta: 0:17:02 lr: 0.000711 loss: 3.137457 (3.054668) time: 0.915482 data: 0.000159 max mem: 18814 Epoch: [179/300] [ 250/1251] eta: 0:16:12 lr: 0.000711 loss: 2.965560 (3.030999) time: 0.928753 data: 0.000161 max mem: 18814 Epoch: [179/300] [ 300/1251] eta: 0:15:20 lr: 0.000710 loss: 2.726500 (3.017426) time: 0.926367 data: 0.000162 max mem: 18814 Epoch: [179/300] [ 350/1251] eta: 0:14:30 lr: 0.000710 loss: 3.129434 (3.022992) time: 0.981771 data: 0.000142 max mem: 18814 Epoch: [179/300] [ 400/1251] eta: 0:13:38 lr: 0.000709 loss: 3.032359 (3.022713) time: 0.942178 data: 0.000140 max mem: 18814 Epoch: [179/300] [ 450/1251] eta: 0:12:48 lr: 0.000709 loss: 3.287845 (3.037107) time: 0.914029 data: 0.000146 max mem: 18814 Epoch: [179/300] [ 500/1251] eta: 0:12:01 lr: 0.000709 loss: 3.032830 (3.019944) time: 0.990522 data: 0.000150 max mem: 18814 Epoch: [179/300] [ 550/1251] eta: 0:11:12 lr: 0.000708 loss: 3.094473 (3.022130) time: 0.940247 data: 0.000163 max mem: 18814 Epoch: [179/300] [ 600/1251] eta: 0:10:24 lr: 0.000708 loss: 3.219880 (3.032367) time: 0.959816 data: 0.000162 max mem: 18814 Epoch: [179/300] [ 650/1251] eta: 0:09:36 lr: 0.000707 loss: 3.012724 (3.035128) time: 1.004077 data: 0.000169 max mem: 18814 Epoch: [179/300] [ 700/1251] eta: 0:08:46 lr: 0.000707 loss: 2.869698 (3.031029) time: 0.916513 data: 0.000152 max mem: 18814 Epoch: [179/300] [ 750/1251] eta: 0:07:59 lr: 0.000707 loss: 3.227377 (3.028740) time: 0.916327 data: 0.000232 max mem: 18814 Epoch: [179/300] [ 800/1251] eta: 0:07:11 lr: 0.000706 loss: 3.034488 (3.031003) time: 0.938407 data: 0.000162 max mem: 18814 Epoch: [179/300] [ 850/1251] eta: 0:06:23 lr: 0.000706 loss: 3.085012 (3.020684) time: 0.954586 data: 0.000161 max mem: 18814 Epoch: [179/300] [ 900/1251] eta: 0:05:35 lr: 0.000705 loss: 3.149214 (3.020026) time: 0.919620 data: 0.000162 max mem: 18814 Epoch: [179/300] [ 950/1251] eta: 0:04:47 lr: 0.000705 loss: 2.457059 (3.018561) time: 0.908360 data: 0.000161 max mem: 18814 Epoch: [179/300] [1000/1251] eta: 0:03:59 lr: 0.000705 loss: 3.161336 (3.023109) time: 0.950082 data: 0.000146 max mem: 18814 Epoch: [179/300] [1050/1251] eta: 0:03:12 lr: 0.000704 loss: 3.201674 (3.023582) time: 0.927752 data: 0.000177 max mem: 18814 Epoch: [179/300] [1100/1251] eta: 0:02:24 lr: 0.000704 loss: 3.067068 (3.022681) time: 0.946518 data: 0.000166 max mem: 18814 Epoch: [179/300] [1150/1251] eta: 0:01:36 lr: 0.000703 loss: 3.038271 (3.026229) time: 0.914378 data: 0.000157 max mem: 18814 Epoch: [179/300] [1200/1251] eta: 0:00:48 lr: 0.000703 loss: 3.361967 (3.033150) time: 0.929958 data: 0.000162 max mem: 18814 Epoch: [179/300] [1250/1251] eta: 0:00:00 lr: 0.000703 loss: 3.074484 (3.031443) time: 0.934236 data: 0.000715 max mem: 18814 Epoch: [179/300] Total time: 0:19:54 (0.954995 s / it) Averaged stats: lr: 0.000703 loss: 3.074484 (3.020791) Test: [ 0/49] eta: 0:01:25 loss: 0.609147 (0.609147) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.751250 data: 1.371652 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.609147 (0.706303) acc1: 82.812500 (83.096591) acc5: 96.875000 (96.875000) time: 0.483522 data: 0.124825 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.767143 (0.737790) acc1: 81.250000 (81.770833) acc5: 96.875000 (96.502976) time: 0.359144 data: 0.000132 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.728898 (0.727229) acc1: 81.250000 (81.905242) acc5: 96.875000 (96.522177) time: 0.355981 data: 0.000130 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.728898 (0.739150) acc1: 81.250000 (81.707317) acc5: 96.875000 (96.455793) time: 0.348152 data: 0.000126 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.752009 (0.739503) acc1: 81.250000 (81.536000) acc5: 96.875000 (96.544000) time: 0.347073 data: 0.000100 max mem: 18814 Test: Total time: 0:00:18 (0.383429 s / it) * Acc@1 82.080 Acc@5 96.272 loss 0.751 Max accuracy: 82.08% Epoch: [180/300] [ 0/1251] eta: 0:41:18 lr: 0.000703 loss: 2.899716 (2.899716) time: 1.981614 data: 1.095758 max mem: 18814 Epoch: [180/300] [ 50/1251] eta: 0:19:04 lr: 0.000702 loss: 3.062949 (3.136754) time: 0.963304 data: 0.000169 max mem: 18814 Epoch: [180/300] [ 100/1251] eta: 0:18:13 lr: 0.000702 loss: 3.149795 (3.063112) time: 0.945351 data: 0.000155 max mem: 18814 Epoch: [180/300] [ 150/1251] eta: 0:17:24 lr: 0.000701 loss: 3.149890 (3.074975) time: 0.914830 data: 0.000157 max mem: 18814 Epoch: [180/300] [ 200/1251] eta: 0:16:41 lr: 0.000701 loss: 3.130359 (3.073468) time: 0.982418 data: 0.000166 max mem: 18814 Epoch: [180/300] [ 250/1251] eta: 0:15:51 lr: 0.000701 loss: 2.937628 (3.037764) time: 0.953195 data: 0.000163 max mem: 18814 Epoch: [180/300] [ 300/1251] eta: 0:15:01 lr: 0.000700 loss: 2.869805 (3.025410) time: 0.966343 data: 0.000159 max mem: 18814 Epoch: [180/300] [ 350/1251] eta: 0:14:12 lr: 0.000700 loss: 3.304701 (3.027689) time: 0.915310 data: 0.000170 max mem: 18814 Epoch: [180/300] [ 400/1251] eta: 0:13:26 lr: 0.000700 loss: 2.903898 (3.017966) time: 0.901266 data: 0.000150 max mem: 18814 Epoch: [180/300] [ 450/1251] eta: 0:12:40 lr: 0.000699 loss: 3.238258 (3.018142) time: 0.994321 data: 0.000179 max mem: 18814 Epoch: [180/300] [ 500/1251] eta: 0:11:52 lr: 0.000699 loss: 2.893166 (3.011016) time: 0.921898 data: 0.000159 max mem: 18814 Epoch: [180/300] [ 550/1251] eta: 0:11:05 lr: 0.000698 loss: 3.218407 (3.020370) time: 0.949922 data: 0.000171 max mem: 18814 Epoch: [180/300] [ 600/1251] eta: 0:10:17 lr: 0.000698 loss: 2.817844 (3.018921) time: 0.911344 data: 0.000158 max mem: 18814 Epoch: [180/300] [ 650/1251] eta: 0:09:30 lr: 0.000698 loss: 2.942901 (3.017152) time: 0.926788 data: 0.000162 max mem: 18814 Epoch: [180/300] [ 700/1251] eta: 0:08:44 lr: 0.000697 loss: 2.918778 (3.007762) time: 0.932002 data: 0.000147 max mem: 18814 Epoch: [180/300] [ 750/1251] eta: 0:07:57 lr: 0.000697 loss: 3.042863 (2.996524) time: 0.924820 data: 0.000153 max mem: 18814 Epoch: [180/300] [ 800/1251] eta: 0:07:09 lr: 0.000696 loss: 2.960605 (3.004454) time: 1.042947 data: 0.000164 max mem: 18814 Epoch: [180/300] [ 850/1251] eta: 0:06:21 lr: 0.000696 loss: 3.193043 (3.007197) time: 0.959623 data: 0.000168 max mem: 18814 Epoch: [180/300] [ 900/1251] eta: 0:05:33 lr: 0.000696 loss: 3.178023 (3.011650) time: 0.915171 data: 0.000159 max mem: 18814 Epoch: [180/300] [ 950/1251] eta: 0:04:46 lr: 0.000695 loss: 3.076026 (3.012351) time: 0.933569 data: 0.000173 max mem: 18814 Epoch: [180/300] [1000/1251] eta: 0:03:59 lr: 0.000695 loss: 2.837461 (3.004726) time: 0.944425 data: 0.000166 max mem: 18814 Epoch: [180/300] [1050/1251] eta: 0:03:11 lr: 0.000694 loss: 2.882313 (3.003506) time: 1.023288 data: 0.000168 max mem: 18814 Epoch: [180/300] [1100/1251] eta: 0:02:23 lr: 0.000694 loss: 3.041117 (3.007270) time: 0.972489 data: 0.000157 max mem: 18814 Epoch: [180/300] [1150/1251] eta: 0:01:36 lr: 0.000694 loss: 2.989229 (3.004341) time: 0.919302 data: 0.000157 max mem: 18814 Epoch: [180/300] [1200/1251] eta: 0:00:48 lr: 0.000693 loss: 3.174533 (3.008440) time: 0.919281 data: 0.000172 max mem: 18814 Epoch: [180/300] [1250/1251] eta: 0:00:00 lr: 0.000693 loss: 2.964129 (3.007436) time: 0.964619 data: 0.000742 max mem: 18814 Epoch: [180/300] Total time: 0:19:52 (0.953329 s / it) Averaged stats: lr: 0.000693 loss: 2.964129 (3.010261) Test: [ 0/49] eta: 0:01:26 loss: 0.563487 (0.563487) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.775486 data: 1.389153 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.617037 (0.721239) acc1: 84.375000 (83.806818) acc5: 96.875000 (96.448864) time: 0.512776 data: 0.126418 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.778234 (0.739031) acc1: 81.250000 (82.440476) acc5: 95.312500 (96.130952) time: 0.369257 data: 0.000132 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.739153 (0.736318) acc1: 79.687500 (82.157258) acc5: 96.875000 (96.471774) time: 0.355728 data: 0.000126 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.732176 (0.746819) acc1: 82.812500 (82.050305) acc5: 96.875000 (96.455793) time: 0.360849 data: 0.000126 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.745176 (0.747349) acc1: 79.687500 (81.952000) acc5: 96.875000 (96.416000) time: 0.352413 data: 0.000105 max mem: 18814 Test: Total time: 0:00:19 (0.393700 s / it) * Acc@1 82.004 Acc@5 96.220 loss 0.764 Max accuracy: 82.08% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0180.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0180.pth Epoch: [181/300] [ 0/1251] eta: 0:44:40 lr: 0.000693 loss: 2.868365 (2.868365) time: 2.142460 data: 1.284802 max mem: 18814 Epoch: [181/300] [ 50/1251] eta: 0:19:23 lr: 0.000692 loss: 3.111965 (3.079915) time: 0.955129 data: 0.000171 max mem: 18814 Epoch: [181/300] [ 100/1251] eta: 0:18:14 lr: 0.000692 loss: 3.046623 (3.081891) time: 0.958220 data: 0.000171 max mem: 18814 Epoch: [181/300] [ 150/1251] eta: 0:17:22 lr: 0.000692 loss: 3.190533 (3.067503) time: 0.907499 data: 0.000160 max mem: 18814 Epoch: [181/300] [ 200/1251] eta: 0:16:39 lr: 0.000691 loss: 3.230967 (3.080581) time: 0.936422 data: 0.000171 max mem: 18814 Epoch: [181/300] [ 250/1251] eta: 0:15:53 lr: 0.000691 loss: 3.013278 (3.068778) time: 0.922966 data: 0.000159 max mem: 18814 Epoch: [181/300] [ 300/1251] eta: 0:15:05 lr: 0.000690 loss: 2.784919 (3.056807) time: 1.018158 data: 0.000166 max mem: 18814 Epoch: [181/300] [ 350/1251] eta: 0:14:17 lr: 0.000690 loss: 2.936692 (3.055976) time: 0.980561 data: 0.000147 max mem: 18814 Epoch: [181/300] [ 400/1251] eta: 0:13:28 lr: 0.000690 loss: 3.000669 (3.028744) time: 0.913599 data: 0.000154 max mem: 18814 Epoch: [181/300] [ 450/1251] eta: 0:12:41 lr: 0.000689 loss: 2.979269 (3.029218) time: 0.933394 data: 0.000154 max mem: 18814 Epoch: [181/300] [ 500/1251] eta: 0:11:54 lr: 0.000689 loss: 3.150012 (3.025910) time: 0.931635 data: 0.000162 max mem: 18814 Epoch: [181/300] [ 550/1251] eta: 0:11:07 lr: 0.000688 loss: 3.090098 (3.022879) time: 1.003660 data: 0.000159 max mem: 18814 Epoch: [181/300] [ 600/1251] eta: 0:10:18 lr: 0.000688 loss: 3.135363 (3.016082) time: 0.973677 data: 0.000167 max mem: 18814 Epoch: [181/300] [ 650/1251] eta: 0:09:30 lr: 0.000688 loss: 2.790256 (3.013021) time: 0.909109 data: 0.000158 max mem: 18814 Epoch: [181/300] [ 700/1251] eta: 0:08:43 lr: 0.000687 loss: 3.157261 (3.013688) time: 0.933786 data: 0.000170 max mem: 18814 Epoch: [181/300] [ 750/1251] eta: 0:07:56 lr: 0.000687 loss: 3.211700 (3.021793) time: 0.977128 data: 0.000160 max mem: 18814 Epoch: [181/300] [ 800/1251] eta: 0:07:09 lr: 0.000687 loss: 3.141137 (3.025774) time: 1.009629 data: 0.000156 max mem: 18814 Epoch: [181/300] [ 850/1251] eta: 0:06:20 lr: 0.000686 loss: 3.117620 (3.022801) time: 0.912110 data: 0.000160 max mem: 18814 Epoch: [181/300] [ 900/1251] eta: 0:05:33 lr: 0.000686 loss: 2.887554 (3.020748) time: 0.973073 data: 0.000161 max mem: 18814 Epoch: [181/300] [ 950/1251] eta: 0:04:45 lr: 0.000685 loss: 2.963127 (3.020111) time: 0.959660 data: 0.000162 max mem: 18814 Epoch: [181/300] [1000/1251] eta: 0:03:58 lr: 0.000685 loss: 3.218539 (3.018594) time: 0.965748 data: 0.000151 max mem: 18814 Epoch: [181/300] [1050/1251] eta: 0:03:10 lr: 0.000685 loss: 2.893230 (3.015767) time: 0.943847 data: 0.000161 max mem: 18814 Epoch: [181/300] [1100/1251] eta: 0:02:23 lr: 0.000684 loss: 3.170749 (3.018904) time: 0.913497 data: 0.000152 max mem: 18814 Epoch: [181/300] [1150/1251] eta: 0:01:35 lr: 0.000684 loss: 3.190237 (3.024560) time: 0.953255 data: 0.000160 max mem: 18814 Epoch: [181/300] [1200/1251] eta: 0:00:48 lr: 0.000683 loss: 3.098542 (3.025984) time: 0.975364 data: 0.000162 max mem: 18814 Epoch: [181/300] [1250/1251] eta: 0:00:00 lr: 0.000683 loss: 3.191960 (3.028407) time: 0.968641 data: 0.000754 max mem: 18814 Epoch: [181/300] Total time: 0:19:47 (0.949426 s / it) Averaged stats: lr: 0.000683 loss: 3.191960 (3.023431) Test: [ 0/49] eta: 0:01:13 loss: 0.494545 (0.494545) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.491565 data: 1.088885 max mem: 18814 Test: [10/49] eta: 0:00:17 loss: 0.637190 (0.718163) acc1: 82.812500 (83.664773) acc5: 96.875000 (96.732955) time: 0.461367 data: 0.099140 max mem: 18814 Test: [20/49] eta: 0:00:11 loss: 0.782475 (0.742823) acc1: 79.687500 (82.738095) acc5: 96.875000 (96.205357) time: 0.354542 data: 0.000140 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.758295 (0.743640) acc1: 79.687500 (82.157258) acc5: 96.875000 (96.320565) time: 0.452632 data: 0.000130 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.744670 (0.755617) acc1: 82.812500 (82.431402) acc5: 95.312500 (96.189024) time: 0.450002 data: 0.000130 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.773550 (0.754431) acc1: 82.812500 (82.176000) acc5: 96.875000 (96.320000) time: 0.342889 data: 0.000105 max mem: 18814 Test: Total time: 0:00:20 (0.416496 s / it) * Acc@1 81.980 Acc@5 96.192 loss 0.776 Max accuracy: 82.08% Epoch: [182/300] [ 0/1251] eta: 0:41:19 lr: 0.000683 loss: 2.813349 (2.813349) time: 1.982181 data: 1.113847 max mem: 18814 Epoch: [182/300] [ 50/1251] eta: 0:19:14 lr: 0.000683 loss: 3.042187 (3.058532) time: 0.905776 data: 0.000160 max mem: 18814 Epoch: [182/300] [ 100/1251] eta: 0:18:31 lr: 0.000682 loss: 3.078056 (3.023090) time: 0.929975 data: 0.000167 max mem: 18814 Epoch: [182/300] [ 150/1251] eta: 0:17:41 lr: 0.000682 loss: 3.019875 (2.979559) time: 0.929978 data: 0.000161 max mem: 18814 Epoch: [182/300] [ 200/1251] eta: 0:16:53 lr: 0.000681 loss: 2.860121 (2.962190) time: 1.018538 data: 0.000150 max mem: 18814 Epoch: [182/300] [ 250/1251] eta: 0:16:00 lr: 0.000681 loss: 3.118192 (2.983964) time: 0.964854 data: 0.000147 max mem: 18814 Epoch: [182/300] [ 300/1251] eta: 0:15:11 lr: 0.000681 loss: 3.193384 (2.998650) time: 0.912899 data: 0.000153 max mem: 18814 Epoch: [182/300] [ 350/1251] eta: 0:14:23 lr: 0.000680 loss: 2.915492 (2.986333) time: 0.977916 data: 0.000161 max mem: 18814 Epoch: [182/300] [ 400/1251] eta: 0:13:32 lr: 0.000680 loss: 3.033882 (2.983959) time: 0.963669 data: 0.000189 max mem: 18814 Epoch: [182/300] [ 450/1251] eta: 0:12:45 lr: 0.000679 loss: 2.976697 (2.985206) time: 0.988585 data: 0.000172 max mem: 18814 Epoch: [182/300] [ 500/1251] eta: 0:11:57 lr: 0.000679 loss: 3.254036 (2.985994) time: 0.917361 data: 0.000173 max mem: 18814 Epoch: [182/300] [ 550/1251] eta: 0:11:08 lr: 0.000679 loss: 2.880549 (2.982214) time: 0.916639 data: 0.000155 max mem: 18814 Epoch: [182/300] [ 600/1251] eta: 0:10:20 lr: 0.000678 loss: 2.993030 (2.980688) time: 0.929509 data: 0.000173 max mem: 18814 Epoch: [182/300] [ 650/1251] eta: 0:09:33 lr: 0.000678 loss: 3.042833 (2.981728) time: 0.977079 data: 0.000167 max mem: 18814 Epoch: [182/300] [ 700/1251] eta: 0:08:46 lr: 0.000678 loss: 3.233344 (2.984992) time: 1.057018 data: 0.000163 max mem: 18814 Epoch: [182/300] [ 750/1251] eta: 0:07:57 lr: 0.000677 loss: 2.895211 (2.988256) time: 0.915604 data: 0.000160 max mem: 18814 Epoch: [182/300] [ 800/1251] eta: 0:07:10 lr: 0.000677 loss: 3.050279 (2.989091) time: 0.914885 data: 0.000157 max mem: 18814 Epoch: [182/300] [ 850/1251] eta: 0:06:22 lr: 0.000676 loss: 3.085041 (2.994091) time: 0.923680 data: 0.000179 max mem: 18814 Epoch: [182/300] [ 900/1251] eta: 0:05:34 lr: 0.000676 loss: 3.167719 (2.996123) time: 0.955435 data: 0.000194 max mem: 18814 Epoch: [182/300] [ 950/1251] eta: 0:04:47 lr: 0.000676 loss: 3.354283 (3.002879) time: 1.044948 data: 0.000175 max mem: 18814 Epoch: [182/300] [1000/1251] eta: 0:03:59 lr: 0.000675 loss: 3.058749 (3.000314) time: 0.907347 data: 0.000160 max mem: 18814 Epoch: [182/300] [1050/1251] eta: 0:03:11 lr: 0.000675 loss: 3.210830 (2.999168) time: 0.930570 data: 0.000170 max mem: 18814 Epoch: [182/300] [1100/1251] eta: 0:02:24 lr: 0.000674 loss: 3.023528 (2.997939) time: 0.971468 data: 0.000163 max mem: 18814 Epoch: [182/300] [1150/1251] eta: 0:01:36 lr: 0.000674 loss: 2.912465 (2.991508) time: 0.977143 data: 0.000163 max mem: 18814 Epoch: [182/300] [1200/1251] eta: 0:00:48 lr: 0.000674 loss: 3.226421 (2.995773) time: 0.908417 data: 0.000167 max mem: 18814 Epoch: [182/300] [1250/1251] eta: 0:00:00 lr: 0.000673 loss: 3.296007 (2.999923) time: 0.908715 data: 0.000730 max mem: 18814 Epoch: [182/300] Total time: 0:19:51 (0.952831 s / it) Averaged stats: lr: 0.000673 loss: 3.296007 (3.008196) Test: [ 0/49] eta: 0:01:15 loss: 0.543090 (0.543090) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.550512 data: 1.123823 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.633213 (0.709437) acc1: 82.812500 (81.818182) acc5: 98.437500 (96.732955) time: 0.478929 data: 0.102304 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.768347 (0.737738) acc1: 81.250000 (81.845238) acc5: 96.875000 (96.651786) time: 0.361366 data: 0.000145 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.732400 (0.739357) acc1: 81.250000 (81.703629) acc5: 96.875000 (96.673387) time: 0.350774 data: 0.000136 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.759460 (0.754773) acc1: 82.812500 (81.745427) acc5: 96.875000 (96.493902) time: 0.352641 data: 0.000127 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.783258 (0.756282) acc1: 81.250000 (81.728000) acc5: 96.875000 (96.512000) time: 0.351928 data: 0.000101 max mem: 18814 Test: Total time: 0:00:18 (0.383030 s / it) * Acc@1 82.126 Acc@5 96.386 loss 0.761 Max accuracy: 82.13% Epoch: [183/300] [ 0/1251] eta: 2:09:39 lr: 0.000673 loss: 3.748489 (3.748489) time: 6.218781 data: 2.045910 max mem: 18510 Epoch: [183/300] [ 50/1251] eta: 0:21:29 lr: 0.000673 loss: 3.041399 (3.109699) time: 0.947011 data: 0.000157 max mem: 18814 Epoch: [183/300] [ 100/1251] eta: 0:19:20 lr: 0.000672 loss: 3.120090 (3.062929) time: 0.927730 data: 0.000160 max mem: 18814 Epoch: [183/300] [ 150/1251] eta: 0:18:10 lr: 0.000672 loss: 2.900438 (3.038718) time: 0.965885 data: 0.000161 max mem: 18814 Epoch: [183/300] [ 200/1251] eta: 0:17:06 lr: 0.000672 loss: 2.930257 (3.024421) time: 0.965797 data: 0.000165 max mem: 18814 Epoch: [183/300] [ 250/1251] eta: 0:16:15 lr: 0.000671 loss: 2.921224 (2.994310) time: 0.946230 data: 0.000155 max mem: 18814 Epoch: [183/300] [ 300/1251] eta: 0:15:24 lr: 0.000671 loss: 2.708578 (2.977975) time: 0.959745 data: 0.000134 max mem: 18814 Epoch: [183/300] [ 350/1251] eta: 0:14:32 lr: 0.000671 loss: 3.046833 (2.990677) time: 0.935561 data: 0.000147 max mem: 18814 Epoch: [183/300] [ 400/1251] eta: 0:13:43 lr: 0.000670 loss: 3.085613 (2.991138) time: 1.046489 data: 0.000167 max mem: 18814 Epoch: [183/300] [ 450/1251] eta: 0:12:54 lr: 0.000670 loss: 2.957232 (2.999942) time: 0.990800 data: 0.000172 max mem: 18814 Epoch: [183/300] [ 500/1251] eta: 0:12:04 lr: 0.000669 loss: 3.019659 (2.988498) time: 0.934106 data: 0.000185 max mem: 18814 Epoch: [183/300] [ 550/1251] eta: 0:11:16 lr: 0.000669 loss: 3.170532 (2.992190) time: 0.941635 data: 0.000160 max mem: 18814 Epoch: [183/300] [ 600/1251] eta: 0:10:28 lr: 0.000669 loss: 3.084899 (2.997165) time: 0.994679 data: 0.000203 max mem: 18814 Epoch: [183/300] [ 650/1251] eta: 0:09:39 lr: 0.000668 loss: 2.865747 (2.997744) time: 0.981790 data: 0.000148 max mem: 18814 Epoch: [183/300] [ 700/1251] eta: 0:08:51 lr: 0.000668 loss: 2.893009 (2.997191) time: 0.965197 data: 0.000161 max mem: 18814 Epoch: [183/300] [ 750/1251] eta: 0:08:01 lr: 0.000667 loss: 3.257151 (2.998756) time: 0.908931 data: 0.000149 max mem: 18814 Epoch: [183/300] [ 800/1251] eta: 0:07:13 lr: 0.000667 loss: 3.100270 (3.004308) time: 0.930977 data: 0.000165 max mem: 18814 Epoch: [183/300] [ 850/1251] eta: 0:06:25 lr: 0.000667 loss: 3.217072 (2.995336) time: 0.985487 data: 0.000158 max mem: 18814 Epoch: [183/300] [ 900/1251] eta: 0:05:37 lr: 0.000666 loss: 3.033375 (2.990802) time: 0.998388 data: 0.000159 max mem: 18814 Epoch: [183/300] [ 950/1251] eta: 0:04:49 lr: 0.000666 loss: 2.415947 (2.985295) time: 0.979588 data: 0.000167 max mem: 18814 Epoch: [183/300] [1000/1251] eta: 0:04:00 lr: 0.000665 loss: 3.114599 (2.987186) time: 0.905588 data: 0.000140 max mem: 18814 Epoch: [183/300] [1050/1251] eta: 0:03:12 lr: 0.000665 loss: 3.300375 (2.988698) time: 0.926686 data: 0.000167 max mem: 18814 Epoch: [183/300] [1100/1251] eta: 0:02:24 lr: 0.000665 loss: 3.010535 (2.988571) time: 0.910937 data: 0.000168 max mem: 18814 Epoch: [183/300] [1150/1251] eta: 0:01:36 lr: 0.000664 loss: 2.964067 (2.992886) time: 0.971987 data: 0.000170 max mem: 18814 Epoch: [183/300] [1200/1251] eta: 0:00:48 lr: 0.000664 loss: 3.095214 (2.997524) time: 0.938166 data: 0.000165 max mem: 18814 Epoch: [183/300] [1250/1251] eta: 0:00:00 lr: 0.000663 loss: 2.992568 (2.995381) time: 0.920968 data: 0.000735 max mem: 18814 Epoch: [183/300] Total time: 0:19:59 (0.958962 s / it) Averaged stats: lr: 0.000663 loss: 2.992568 (3.004309) Test: [ 0/49] eta: 0:01:20 loss: 0.548418 (0.548418) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.650798 data: 1.248268 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.575167 (0.700834) acc1: 82.812500 (82.670455) acc5: 96.875000 (95.738636) time: 0.480321 data: 0.113661 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.788906 (0.729598) acc1: 81.250000 (81.919643) acc5: 96.875000 (95.907738) time: 0.358459 data: 0.000168 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.749740 (0.730401) acc1: 81.250000 (81.854839) acc5: 96.875000 (96.219758) time: 0.353274 data: 0.000153 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.738127 (0.741540) acc1: 82.812500 (81.935976) acc5: 96.875000 (96.150915) time: 0.349917 data: 0.000163 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.746746 (0.733419) acc1: 82.812500 (82.272000) acc5: 96.875000 (96.288000) time: 0.357661 data: 0.000134 max mem: 18814 Test: Total time: 0:00:18 (0.387331 s / it) * Acc@1 82.040 Acc@5 96.178 loss 0.748 Max accuracy: 82.13% Epoch: [184/300] [ 0/1251] eta: 0:44:25 lr: 0.000663 loss: 2.824208 (2.824208) time: 2.130975 data: 1.239630 max mem: 18814 Epoch: [184/300] [ 50/1251] eta: 0:19:41 lr: 0.000663 loss: 3.099287 (3.122700) time: 0.977338 data: 0.000183 max mem: 18814 Epoch: [184/300] [ 100/1251] eta: 0:18:50 lr: 0.000663 loss: 3.213211 (3.019719) time: 1.064573 data: 0.000170 max mem: 18814 Epoch: [184/300] [ 150/1251] eta: 0:17:47 lr: 0.000662 loss: 3.162670 (3.031562) time: 0.975204 data: 0.000190 max mem: 18814 Epoch: [184/300] [ 200/1251] eta: 0:16:53 lr: 0.000662 loss: 3.155770 (3.040986) time: 0.935650 data: 0.000162 max mem: 18814 Epoch: [184/300] [ 250/1251] eta: 0:16:03 lr: 0.000662 loss: 2.920762 (3.010356) time: 0.945458 data: 0.000171 max mem: 18814 Epoch: [184/300] [ 300/1251] eta: 0:15:13 lr: 0.000661 loss: 2.851892 (3.005314) time: 0.918098 data: 0.000164 max mem: 18814 Epoch: [184/300] [ 350/1251] eta: 0:14:24 lr: 0.000661 loss: 3.013487 (3.007361) time: 1.027535 data: 0.000150 max mem: 18814 Epoch: [184/300] [ 400/1251] eta: 0:13:34 lr: 0.000660 loss: 2.746255 (2.999257) time: 0.979683 data: 0.000193 max mem: 18814 Epoch: [184/300] [ 450/1251] eta: 0:12:45 lr: 0.000660 loss: 3.175627 (3.007804) time: 0.907302 data: 0.000155 max mem: 18814 Epoch: [184/300] [ 500/1251] eta: 0:11:56 lr: 0.000660 loss: 2.853756 (3.004058) time: 0.925519 data: 0.000172 max mem: 18814 Epoch: [184/300] [ 550/1251] eta: 0:11:09 lr: 0.000659 loss: 3.059954 (3.008055) time: 0.922719 data: 0.000163 max mem: 18814 Epoch: [184/300] [ 600/1251] eta: 0:10:22 lr: 0.000659 loss: 2.854264 (3.002912) time: 1.046622 data: 0.000172 max mem: 18814 Epoch: [184/300] [ 650/1251] eta: 0:09:33 lr: 0.000658 loss: 2.870353 (3.003278) time: 0.916167 data: 0.000177 max mem: 18814 Epoch: [184/300] [ 700/1251] eta: 0:08:46 lr: 0.000658 loss: 3.019596 (2.997350) time: 0.924946 data: 0.000159 max mem: 18814 Epoch: [184/300] [ 750/1251] eta: 0:07:59 lr: 0.000658 loss: 2.946107 (2.987462) time: 0.923696 data: 0.000166 max mem: 18814 Epoch: [184/300] [ 800/1251] eta: 0:07:11 lr: 0.000657 loss: 3.063320 (2.991773) time: 0.972854 data: 0.000163 max mem: 18814 Epoch: [184/300] [ 850/1251] eta: 0:06:23 lr: 0.000657 loss: 3.090876 (2.990883) time: 0.963531 data: 0.000174 max mem: 18814 Epoch: [184/300] [ 900/1251] eta: 0:05:35 lr: 0.000657 loss: 3.040060 (2.996416) time: 0.968110 data: 0.000191 max mem: 18814 Epoch: [184/300] [ 950/1251] eta: 0:04:47 lr: 0.000656 loss: 3.134954 (2.996405) time: 0.917867 data: 0.000173 max mem: 18814 Epoch: [184/300] [1000/1251] eta: 0:03:59 lr: 0.000656 loss: 2.972446 (2.992105) time: 0.931453 data: 0.000162 max mem: 18814 Epoch: [184/300] [1050/1251] eta: 0:03:12 lr: 0.000655 loss: 2.824612 (2.992118) time: 0.978470 data: 0.000158 max mem: 18814 Epoch: [184/300] [1100/1251] eta: 0:02:24 lr: 0.000655 loss: 3.010572 (2.996371) time: 1.024624 data: 0.000172 max mem: 18814 Epoch: [184/300] [1150/1251] eta: 0:01:36 lr: 0.000655 loss: 2.908520 (2.996327) time: 0.934243 data: 0.000165 max mem: 18814 Epoch: [184/300] [1200/1251] eta: 0:00:48 lr: 0.000654 loss: 3.146440 (2.999869) time: 0.925116 data: 0.000163 max mem: 18814 Epoch: [184/300] [1250/1251] eta: 0:00:00 lr: 0.000654 loss: 3.071856 (2.998325) time: 0.905572 data: 0.000773 max mem: 18814 Epoch: [184/300] Total time: 0:19:55 (0.955899 s / it) Averaged stats: lr: 0.000654 loss: 3.071856 (2.992667) Test: [ 0/49] eta: 0:01:15 loss: 0.599800 (0.599800) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.537022 data: 1.151268 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.691824 (0.725916) acc1: 81.250000 (82.386364) acc5: 95.312500 (96.164773) time: 0.500475 data: 0.104829 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.796396 (0.765308) acc1: 81.250000 (81.547619) acc5: 96.875000 (96.502976) time: 0.373981 data: 0.000161 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.794345 (0.748518) acc1: 81.250000 (81.804435) acc5: 96.875000 (96.572581) time: 0.350997 data: 0.000135 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.738310 (0.758618) acc1: 81.250000 (81.821646) acc5: 96.875000 (96.417683) time: 0.362215 data: 0.000124 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.769151 (0.758537) acc1: 81.250000 (81.792000) acc5: 96.875000 (96.544000) time: 0.357906 data: 0.000103 max mem: 18814 Test: Total time: 0:00:19 (0.389698 s / it) * Acc@1 82.172 Acc@5 96.264 loss 0.762 Max accuracy: 82.17% Epoch: [185/300] [ 0/1251] eta: 2:28:16 lr: 0.000654 loss: 3.686020 (3.686020) time: 7.111720 data: 1.813628 max mem: 18510 Epoch: [185/300] [ 50/1251] eta: 0:21:19 lr: 0.000653 loss: 3.071047 (3.191508) time: 0.912447 data: 0.000160 max mem: 18814 Epoch: [185/300] [ 100/1251] eta: 0:19:19 lr: 0.000653 loss: 3.073071 (3.062839) time: 0.933562 data: 0.000163 max mem: 18814 Epoch: [185/300] [ 150/1251] eta: 0:18:11 lr: 0.000653 loss: 2.780482 (3.021591) time: 0.925641 data: 0.000154 max mem: 18814 Epoch: [185/300] [ 200/1251] eta: 0:17:07 lr: 0.000652 loss: 2.898022 (2.997248) time: 0.961746 data: 0.000165 max mem: 18814 Epoch: [185/300] [ 250/1251] eta: 0:16:16 lr: 0.000652 loss: 2.930629 (2.977204) time: 0.982499 data: 0.000165 max mem: 18814 Epoch: [185/300] [ 300/1251] eta: 0:15:23 lr: 0.000651 loss: 2.765139 (2.962068) time: 0.911534 data: 0.000142 max mem: 18814 Epoch: [185/300] [ 350/1251] eta: 0:14:35 lr: 0.000651 loss: 3.085321 (2.972515) time: 0.988309 data: 0.000166 max mem: 18814 Epoch: [185/300] [ 400/1251] eta: 0:13:44 lr: 0.000651 loss: 3.028000 (2.970673) time: 0.914606 data: 0.000159 max mem: 18814 Epoch: [185/300] [ 450/1251] eta: 0:12:55 lr: 0.000650 loss: 3.056452 (2.985100) time: 1.018634 data: 0.000168 max mem: 18814 Epoch: [185/300] [ 500/1251] eta: 0:12:08 lr: 0.000650 loss: 3.083692 (2.980560) time: 1.046883 data: 0.000153 max mem: 18814 Epoch: [185/300] [ 550/1251] eta: 0:11:16 lr: 0.000650 loss: 3.048278 (2.980739) time: 0.916516 data: 0.000159 max mem: 18814 Epoch: [185/300] [ 600/1251] eta: 0:10:28 lr: 0.000649 loss: 3.064522 (2.990550) time: 0.984356 data: 0.000153 max mem: 18814 Epoch: [185/300] [ 650/1251] eta: 0:09:40 lr: 0.000649 loss: 2.898067 (2.991414) time: 0.975817 data: 0.000160 max mem: 18814 Epoch: [185/300] [ 700/1251] eta: 0:08:51 lr: 0.000648 loss: 2.993068 (2.989916) time: 0.969251 data: 0.000157 max mem: 18814 Epoch: [185/300] [ 750/1251] eta: 0:08:02 lr: 0.000648 loss: 3.329992 (2.993004) time: 0.972904 data: 0.000160 max mem: 18814 Epoch: [185/300] [ 800/1251] eta: 0:07:14 lr: 0.000648 loss: 3.034015 (2.992925) time: 0.923585 data: 0.000173 max mem: 18814 Epoch: [185/300] [ 850/1251] eta: 0:06:26 lr: 0.000647 loss: 3.020335 (2.985101) time: 0.984734 data: 0.000167 max mem: 18814 Epoch: [185/300] [ 900/1251] eta: 0:05:38 lr: 0.000647 loss: 2.928200 (2.980350) time: 0.977193 data: 0.000165 max mem: 18814 Epoch: [185/300] [ 950/1251] eta: 0:04:50 lr: 0.000646 loss: 2.657204 (2.979924) time: 0.981766 data: 0.000157 max mem: 18814 Epoch: [185/300] [1000/1251] eta: 0:04:01 lr: 0.000646 loss: 3.157090 (2.985848) time: 0.967050 data: 0.000155 max mem: 18814 Epoch: [185/300] [1050/1251] eta: 0:03:13 lr: 0.000646 loss: 3.049145 (2.986721) time: 0.911938 data: 0.000153 max mem: 18814 Epoch: [185/300] [1100/1251] eta: 0:02:25 lr: 0.000645 loss: 3.098477 (2.987527) time: 0.992407 data: 0.000190 max mem: 18814 Epoch: [185/300] [1150/1251] eta: 0:01:37 lr: 0.000645 loss: 2.945037 (2.990753) time: 1.008768 data: 0.000162 max mem: 18814 Epoch: [185/300] [1200/1251] eta: 0:00:49 lr: 0.000645 loss: 3.218580 (2.993666) time: 0.972422 data: 0.000160 max mem: 18814 Epoch: [185/300] [1250/1251] eta: 0:00:00 lr: 0.000644 loss: 3.040934 (2.991038) time: 0.998293 data: 0.000722 max mem: 18814 Epoch: [185/300] Total time: 0:20:06 (0.964157 s / it) Averaged stats: lr: 0.000644 loss: 3.040934 (2.994623) Test: [ 0/49] eta: 0:01:21 loss: 0.468556 (0.468556) acc1: 89.062500 (89.062500) acc5: 98.437500 (98.437500) time: 1.671870 data: 1.156592 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.602214 (0.690597) acc1: 82.812500 (81.960227) acc5: 98.437500 (97.159091) time: 0.495496 data: 0.105313 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.768497 (0.724907) acc1: 79.687500 (81.696429) acc5: 96.875000 (96.800595) time: 0.369734 data: 0.000176 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.754527 (0.718119) acc1: 81.250000 (81.854839) acc5: 96.875000 (96.723790) time: 0.360932 data: 0.000173 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.751531 (0.735557) acc1: 81.250000 (81.516768) acc5: 96.875000 (96.646341) time: 0.357660 data: 0.000163 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.759289 (0.734534) acc1: 79.687500 (81.472000) acc5: 96.875000 (96.608000) time: 0.371386 data: 0.000123 max mem: 18814 Test: Total time: 0:00:19 (0.397808 s / it) * Acc@1 82.020 Acc@5 96.174 loss 0.748 Max accuracy: 82.17% Epoch: [186/300] [ 0/1251] eta: 0:45:10 lr: 0.000644 loss: 2.980855 (2.980855) time: 2.166996 data: 1.164767 max mem: 18814 Epoch: [186/300] [ 50/1251] eta: 0:19:42 lr: 0.000644 loss: 3.015619 (3.110943) time: 0.964084 data: 0.000155 max mem: 18814 Epoch: [186/300] [ 100/1251] eta: 0:18:45 lr: 0.000643 loss: 3.201839 (3.035776) time: 0.976610 data: 0.000163 max mem: 18814 Epoch: [186/300] [ 150/1251] eta: 0:17:48 lr: 0.000643 loss: 3.038241 (3.055405) time: 0.991959 data: 0.000185 max mem: 18814 Epoch: [186/300] [ 200/1251] eta: 0:17:00 lr: 0.000643 loss: 3.047427 (3.042896) time: 0.989851 data: 0.000164 max mem: 18814 Epoch: [186/300] [ 250/1251] eta: 0:16:07 lr: 0.000642 loss: 2.743671 (3.009273) time: 0.926264 data: 0.000164 max mem: 18814 Epoch: [186/300] [ 300/1251] eta: 0:15:20 lr: 0.000642 loss: 2.895313 (3.005804) time: 0.979002 data: 0.000154 max mem: 18814 Epoch: [186/300] [ 350/1251] eta: 0:14:31 lr: 0.000641 loss: 3.112872 (3.009423) time: 0.967934 data: 0.000155 max mem: 18814 Epoch: [186/300] [ 400/1251] eta: 0:13:40 lr: 0.000641 loss: 2.711153 (2.993539) time: 0.971674 data: 0.000170 max mem: 18814 Epoch: [186/300] [ 450/1251] eta: 0:12:52 lr: 0.000641 loss: 3.272976 (2.995563) time: 0.990561 data: 0.000181 max mem: 18814 Epoch: [186/300] [ 500/1251] eta: 0:12:02 lr: 0.000640 loss: 2.775233 (2.990991) time: 0.909462 data: 0.000160 max mem: 18814 Epoch: [186/300] [ 550/1251] eta: 0:11:14 lr: 0.000640 loss: 3.081729 (2.996608) time: 1.020525 data: 0.000167 max mem: 18814 Epoch: [186/300] [ 600/1251] eta: 0:10:24 lr: 0.000640 loss: 2.976887 (2.995911) time: 0.950190 data: 0.000180 max mem: 18814 Epoch: [186/300] [ 650/1251] eta: 0:09:36 lr: 0.000639 loss: 3.035452 (2.992916) time: 0.929093 data: 0.000173 max mem: 18814 Epoch: [186/300] [ 700/1251] eta: 0:08:49 lr: 0.000639 loss: 3.062897 (2.984181) time: 0.929515 data: 0.000155 max mem: 18814 Epoch: [186/300] [ 750/1251] eta: 0:08:01 lr: 0.000638 loss: 3.065196 (2.974889) time: 0.920665 data: 0.000165 max mem: 18814 n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<14446> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<24328> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<30916> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43380> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36794> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<26910> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43398> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36820> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<26936> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43430> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36842> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<26960> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43450> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36862> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<26978> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43466> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36878> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<26996> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43478> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36892> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<27006> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43494> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36908> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<27022> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43506> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36926> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<27040> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43526> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36942> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<27056> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43538> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36956> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<27070> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<35070> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<38554> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<46984> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50460> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<46996> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50472> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50486> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47024> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50508> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47040> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50522> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47054> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50540> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47070> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50550> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47080> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50560> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47092> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50574> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47104> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<50594> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<47126> Epoch: [186/300] [ 800/1251] eta: 0:07:13 lr: 0.000638 loss: 2.956492 (2.978230) time: 1.028002 data: 0.000178 max mem: 18814 n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<36686> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<40094> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43586> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<26816> n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<43236> Epoch: [186/300] [ 850/1251] eta: 0:06:24 lr: 0.000638 loss: 3.148085 (2.981820) time: 0.950842 data: 0.000183 max mem: 18814 n130-017-149:1013:1439 [3] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<305> n130-017-149:1016:1443 [6] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<331> n130-017-149:1017:1442 [7] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<398> n130-017-149:1010:1438 [0] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<924> n130-017-149:1015:1444 [5] NCCL INFO [Rem Allocator] invalid request from 10.174.108.46<657> Epoch: [186/300] [ 900/1251] eta: 0:05:36 lr: 0.000637 loss: 3.250516 (2.989532) time: 0.935301 data: 0.000172 max mem: 18814 Epoch: [186/300] [ 950/1251] eta: 0:04:49 lr: 0.000637 loss: 3.057788 (2.991367) time: 0.940111 data: 0.000158 max mem: 18814 Epoch: [186/300] [1000/1251] eta: 0:04:00 lr: 0.000636 loss: 2.900942 (2.982892) time: 0.908372 data: 0.000280 max mem: 18814 Epoch: [186/300] [1050/1251] eta: 0:03:13 lr: 0.000636 loss: 3.000708 (2.981744) time: 1.025070 data: 0.000162 max mem: 18814 Epoch: [186/300] [1100/1251] eta: 0:02:24 lr: 0.000636 loss: 2.996430 (2.984279) time: 0.921623 data: 0.000175 max mem: 18814 Epoch: [186/300] [1150/1251] eta: 0:01:36 lr: 0.000635 loss: 2.799593 (2.980576) time: 0.929129 data: 0.000176 max mem: 18814 Epoch: [186/300] [1200/1251] eta: 0:00:48 lr: 0.000635 loss: 3.078014 (2.982816) time: 0.942401 data: 0.000188 max mem: 18814 Epoch: [186/300] [1250/1251] eta: 0:00:00 lr: 0.000635 loss: 3.092775 (2.983181) time: 0.989093 data: 0.000774 max mem: 18814 Epoch: [186/300] Total time: 0:20:01 (0.960748 s / it) Averaged stats: lr: 0.000635 loss: 3.092775 (2.983559) Test: [ 0/49] eta: 0:01:18 loss: 0.542726 (0.542726) acc1: 87.500000 (87.500000) acc5: 96.875000 (96.875000) time: 1.605341 data: 1.081912 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.606101 (0.707073) acc1: 84.375000 (82.954545) acc5: 96.875000 (96.590909) time: 0.477182 data: 0.098495 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.806694 (0.747305) acc1: 81.250000 (81.473214) acc5: 96.875000 (96.354167) time: 0.359227 data: 0.000140 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.796993 (0.738571) acc1: 79.687500 (81.804435) acc5: 96.875000 (96.471774) time: 0.356715 data: 0.000128 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.756493 (0.753606) acc1: 81.250000 (81.783537) acc5: 96.875000 (96.455793) time: 0.363624 data: 0.000124 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.754176 (0.750895) acc1: 81.250000 (81.856000) acc5: 96.875000 (96.480000) time: 0.358145 data: 0.000103 max mem: 18814 Test: Total time: 0:00:18 (0.386876 s / it) * Acc@1 81.910 Acc@5 96.202 loss 0.769 Max accuracy: 82.17% Epoch: [187/300] [ 0/1251] eta: 0:41:10 lr: 0.000635 loss: 2.579028 (2.579028) time: 1.975117 data: 1.123281 max mem: 18814 Epoch: [187/300] [ 50/1251] eta: 0:18:52 lr: 0.000634 loss: 3.024695 (3.061651) time: 0.930216 data: 0.000145 max mem: 18814 Epoch: [187/300] [ 100/1251] eta: 0:18:24 lr: 0.000634 loss: 2.968312 (3.056477) time: 0.939996 data: 0.000176 max mem: 18814 Epoch: [187/300] [ 150/1251] eta: 0:17:43 lr: 0.000633 loss: 3.098356 (3.027287) time: 0.945545 data: 0.000168 max mem: 18814 Epoch: [187/300] [ 200/1251] eta: 0:16:47 lr: 0.000633 loss: 3.212169 (3.055549) time: 0.967762 data: 0.000179 max mem: 18814 Epoch: [187/300] [ 250/1251] eta: 0:16:02 lr: 0.000633 loss: 2.763510 (3.044407) time: 0.996370 data: 0.000174 max mem: 18814 Epoch: [187/300] [ 300/1251] eta: 0:15:13 lr: 0.000632 loss: 2.825245 (3.038627) time: 0.927487 data: 0.000163 max mem: 18814 Epoch: [187/300] [ 350/1251] eta: 0:14:27 lr: 0.000632 loss: 2.967730 (3.026494) time: 0.981464 data: 0.000174 max mem: 18814 Epoch: [187/300] [ 400/1251] eta: 0:13:40 lr: 0.000631 loss: 3.048550 (3.005845) time: 0.979084 data: 0.000163 max mem: 18814 Epoch: [187/300] [ 450/1251] eta: 0:12:51 lr: 0.000631 loss: 3.039181 (3.002396) time: 0.989952 data: 0.000166 max mem: 18814 Epoch: [187/300] [ 500/1251] eta: 0:12:03 lr: 0.000631 loss: 3.300580 (2.999917) time: 0.970691 data: 0.000279 max mem: 18814 Epoch: [187/300] [ 550/1251] eta: 0:11:13 lr: 0.000630 loss: 3.193470 (3.000634) time: 0.914632 data: 0.000161 max mem: 18814 Epoch: [187/300] [ 600/1251] eta: 0:10:25 lr: 0.000630 loss: 3.026327 (2.992062) time: 0.972388 data: 0.000167 max mem: 18814 Epoch: [187/300] [ 650/1251] eta: 0:09:37 lr: 0.000630 loss: 2.980835 (2.988991) time: 0.974614 data: 0.000149 max mem: 18814 Epoch: [187/300] [ 700/1251] eta: 0:08:49 lr: 0.000629 loss: 3.043329 (2.984870) time: 0.981019 data: 0.000169 max mem: 18814 Epoch: [187/300] [ 750/1251] eta: 0:08:01 lr: 0.000629 loss: 3.105632 (2.988849) time: 0.977437 data: 0.000163 max mem: 18814 Epoch: [187/300] [ 800/1251] eta: 0:07:12 lr: 0.000628 loss: 2.983436 (2.991244) time: 0.917543 data: 0.000161 max mem: 18814 Epoch: [187/300] [ 850/1251] eta: 0:06:24 lr: 0.000628 loss: 3.069963 (2.990966) time: 0.994458 data: 0.000170 max mem: 18814 Epoch: [187/300] [ 900/1251] eta: 0:05:37 lr: 0.000628 loss: 2.974645 (2.990010) time: 1.009860 data: 0.000162 max mem: 18814 Epoch: [187/300] [ 950/1251] eta: 0:04:49 lr: 0.000627 loss: 2.901457 (2.988193) time: 0.988397 data: 0.000176 max mem: 18814 Epoch: [187/300] [1000/1251] eta: 0:04:01 lr: 0.000627 loss: 3.016602 (2.984948) time: 0.980413 data: 0.000169 max mem: 18814 Epoch: [187/300] [1050/1251] eta: 0:03:12 lr: 0.000626 loss: 2.989936 (2.982678) time: 0.915184 data: 0.000159 max mem: 18814 torch.distributed: socket accepted connection from 10.226.124.200:44030. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:60530. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:22804. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:31718. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:36824. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:47410. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:64536. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) Epoch: [187/300] [1100/1251] eta: 0:02:25 lr: 0.000626 loss: 3.051995 (2.987701) time: 0.973402 data: 0.000169 max mem: 18814 torch.distributed: socket accepted connection from 10.226.124.200:16618. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:21418. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:35996. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:50042. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.226.124.200:58222. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) Epoch: [187/300] [1150/1251] eta: 0:01:37 lr: 0.000626 loss: 3.229637 (2.994811) time: 0.976613 data: 0.000169 max mem: 18814 torch.distributed: socket accepted connection from 10.226.124.200:16. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) Epoch: [187/300] [1200/1251] eta: 0:00:48 lr: 0.000625 loss: 3.004583 (2.995198) time: 0.970442 data: 0.000154 max mem: 18814 Epoch: [187/300] [1250/1251] eta: 0:00:00 lr: 0.000625 loss: 3.145226 (2.996165) time: 0.987132 data: 0.000779 max mem: 18814 Epoch: [187/300] Total time: 0:20:01 (0.960429 s / it) Averaged stats: lr: 0.000625 loss: 3.145226 (2.993728) Test: [ 0/49] eta: 0:01:27 loss: 0.584066 (0.584066) acc1: 87.500000 (87.500000) acc5: 95.312500 (95.312500) time: 1.790682 data: 1.416725 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.674106 (0.722768) acc1: 82.812500 (83.380682) acc5: 96.875000 (96.022727) time: 0.493208 data: 0.128928 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.786849 (0.758648) acc1: 79.687500 (81.919643) acc5: 96.875000 (96.130952) time: 0.359255 data: 0.000148 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.776551 (0.748119) acc1: 79.687500 (81.905242) acc5: 96.875000 (96.270161) time: 0.354212 data: 0.000158 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.738689 (0.752905) acc1: 81.250000 (82.012195) acc5: 96.875000 (96.189024) time: 0.351177 data: 0.000158 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.738689 (0.749756) acc1: 81.250000 (82.080000) acc5: 96.875000 (96.288000) time: 0.346408 data: 0.000133 max mem: 18814 Test: Total time: 0:00:18 (0.384610 s / it) * Acc@1 82.222 Acc@5 96.240 loss 0.754 Max accuracy: 82.22% Epoch: [188/300] [ 0/1251] eta: 0:39:54 lr: 0.000625 loss: 3.066494 (3.066494) time: 1.914052 data: 1.060709 max mem: 18814 Epoch: [188/300] [ 50/1251] eta: 0:19:34 lr: 0.000625 loss: 3.088746 (3.033944) time: 0.963545 data: 0.000186 max mem: 18814 Epoch: [188/300] [ 100/1251] eta: 0:18:24 lr: 0.000624 loss: 3.090997 (2.982133) time: 0.906818 data: 0.000165 max mem: 18814 Epoch: [188/300] [ 150/1251] eta: 0:17:38 lr: 0.000624 loss: 2.960921 (2.926957) time: 0.974954 data: 0.000171 max mem: 18814 Epoch: [188/300] [ 200/1251] eta: 0:16:49 lr: 0.000623 loss: 2.858440 (2.919930) time: 0.960352 data: 0.000182 max mem: 18814 Epoch: [188/300] [ 250/1251] eta: 0:16:02 lr: 0.000623 loss: 2.996675 (2.940322) time: 0.932700 data: 0.000173 max mem: 18814 Epoch: [188/300] [ 300/1251] eta: 0:15:16 lr: 0.000623 loss: 3.213199 (2.956731) time: 0.968773 data: 0.000178 max mem: 18814 Epoch: [188/300] [ 350/1251] eta: 0:14:25 lr: 0.000622 loss: 3.024459 (2.943154) time: 0.918855 data: 0.000167 max mem: 18814 Epoch: [188/300] [ 400/1251] eta: 0:13:36 lr: 0.000622 loss: 3.044924 (2.943783) time: 0.960488 data: 0.000160 max mem: 18814 Epoch: [188/300] [ 450/1251] eta: 0:12:47 lr: 0.000622 loss: 2.922290 (2.949732) time: 0.932976 data: 0.000185 max mem: 18814 Epoch: [188/300] [ 500/1251] eta: 0:11:59 lr: 0.000621 loss: 3.044842 (2.953437) time: 0.909929 data: 0.000171 max mem: 18814 Epoch: [188/300] [ 550/1251] eta: 0:11:12 lr: 0.000621 loss: 3.025109 (2.949422) time: 0.981140 data: 0.000169 max mem: 18814 Epoch: [188/300] [ 600/1251] eta: 0:10:23 lr: 0.000620 loss: 3.054697 (2.950708) time: 0.914240 data: 0.000167 max mem: 18814 Epoch: [188/300] [ 650/1251] eta: 0:09:36 lr: 0.000620 loss: 3.024952 (2.954524) time: 0.969900 data: 0.000176 max mem: 18814 Epoch: [188/300] [ 700/1251] eta: 0:08:47 lr: 0.000620 loss: 3.180202 (2.958528) time: 0.921096 data: 0.000153 max mem: 18814 Epoch: [188/300] [ 750/1251] eta: 0:08:00 lr: 0.000619 loss: 2.875991 (2.959887) time: 0.926331 data: 0.000165 max mem: 18814 Epoch: [188/300] [ 800/1251] eta: 0:07:12 lr: 0.000619 loss: 3.040351 (2.961383) time: 0.976766 data: 0.000173 max mem: 18814 Epoch: [188/300] [ 850/1251] eta: 0:06:24 lr: 0.000618 loss: 3.171053 (2.967197) time: 0.914124 data: 0.000170 max mem: 18814 Epoch: [188/300] [ 900/1251] eta: 0:05:36 lr: 0.000618 loss: 3.042383 (2.968857) time: 0.979241 data: 0.000165 max mem: 18814 Epoch: [188/300] [ 950/1251] eta: 0:04:48 lr: 0.000618 loss: 3.276318 (2.976383) time: 0.927228 data: 0.000158 max mem: 18814 Epoch: [188/300] [1000/1251] eta: 0:04:00 lr: 0.000617 loss: 3.123896 (2.975302) time: 0.910814 data: 0.000157 max mem: 18814 Epoch: [188/300] [1050/1251] eta: 0:03:12 lr: 0.000617 loss: 3.178373 (2.976351) time: 0.971789 data: 0.000175 max mem: 18814 Epoch: [188/300] [1100/1251] eta: 0:02:24 lr: 0.000617 loss: 2.886245 (2.975201) time: 0.906202 data: 0.000166 max mem: 18814 Epoch: [188/300] [1150/1251] eta: 0:01:36 lr: 0.000616 loss: 2.814847 (2.968775) time: 0.980940 data: 0.000164 max mem: 18814 Epoch: [188/300] [1200/1251] eta: 0:00:48 lr: 0.000616 loss: 3.220174 (2.972841) time: 0.977484 data: 0.000164 max mem: 18814 Epoch: [188/300] [1250/1251] eta: 0:00:00 lr: 0.000615 loss: 3.102407 (2.976857) time: 0.921529 data: 0.000763 max mem: 18814 Epoch: [188/300] Total time: 0:19:58 (0.957855 s / it) Averaged stats: lr: 0.000615 loss: 3.102407 (2.977913) Test: [ 0/49] eta: 0:01:30 loss: 0.535915 (0.535915) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.842574 data: 1.413769 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.577975 (0.718638) acc1: 82.812500 (82.528409) acc5: 98.437500 (96.875000) time: 0.502850 data: 0.128682 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.775599 (0.738414) acc1: 81.250000 (81.473214) acc5: 96.875000 (96.875000) time: 0.365212 data: 0.000160 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.740608 (0.729285) acc1: 81.250000 (81.653226) acc5: 96.875000 (96.975806) time: 0.457251 data: 0.000143 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.732805 (0.743506) acc1: 81.250000 (81.821646) acc5: 96.875000 (96.798780) time: 0.451103 data: 0.000139 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.732805 (0.741859) acc1: 81.250000 (81.920000) acc5: 95.312500 (96.864000) time: 0.345904 data: 0.000114 max mem: 18814 Test: Total time: 0:00:20 (0.428554 s / it) * Acc@1 82.094 Acc@5 96.278 loss 0.755 Max accuracy: 82.22% Epoch: [189/300] [ 0/1251] eta: 0:40:39 lr: 0.000615 loss: 3.089742 (3.089742) time: 1.950357 data: 1.088163 max mem: 18814 Epoch: [189/300] [ 50/1251] eta: 0:19:39 lr: 0.000615 loss: 3.192181 (2.944088) time: 0.913089 data: 0.000166 max mem: 18814 Epoch: [189/300] [ 100/1251] eta: 0:18:43 lr: 0.000615 loss: 2.854096 (2.916506) time: 1.023353 data: 0.000189 max mem: 18814 Epoch: [189/300] [ 150/1251] eta: 0:17:40 lr: 0.000614 loss: 2.946250 (2.939745) time: 0.957229 data: 0.000429 max mem: 18814 Epoch: [189/300] [ 200/1251] eta: 0:16:50 lr: 0.000614 loss: 3.010993 (2.954869) time: 0.921521 data: 0.000158 max mem: 18814 Epoch: [189/300] [ 250/1251] eta: 0:16:02 lr: 0.000613 loss: 3.258029 (2.974948) time: 0.918703 data: 0.000165 max mem: 18814 Epoch: [189/300] [ 300/1251] eta: 0:15:13 lr: 0.000613 loss: 3.268465 (2.983827) time: 0.913720 data: 0.000157 max mem: 18814 Epoch: [189/300] [ 350/1251] eta: 0:14:27 lr: 0.000613 loss: 3.214829 (2.981460) time: 1.054124 data: 0.000166 max mem: 18814 Epoch: [189/300] [ 400/1251] eta: 0:13:34 lr: 0.000612 loss: 3.029849 (2.975824) time: 0.920073 data: 0.000160 max mem: 18814 Epoch: [189/300] [ 450/1251] eta: 0:12:45 lr: 0.000612 loss: 3.009278 (2.973276) time: 0.955774 data: 0.000160 max mem: 18814 Epoch: [189/300] [ 500/1251] eta: 0:11:57 lr: 0.000612 loss: 3.133856 (2.973150) time: 0.929369 data: 0.000160 max mem: 18814 Epoch: [189/300] [ 550/1251] eta: 0:11:09 lr: 0.000611 loss: 3.010303 (2.966061) time: 0.908978 data: 0.000156 max mem: 18814 Epoch: [189/300] [ 600/1251] eta: 0:10:23 lr: 0.000611 loss: 3.193569 (2.964539) time: 0.991611 data: 0.000165 max mem: 18814 Epoch: [189/300] [ 650/1251] eta: 0:09:34 lr: 0.000610 loss: 3.117223 (2.968630) time: 0.920237 data: 0.000165 max mem: 18814 Epoch: [189/300] [ 700/1251] eta: 0:08:46 lr: 0.000610 loss: 3.038781 (2.970750) time: 0.975312 data: 0.000162 max mem: 18814 Epoch: [189/300] [ 750/1251] eta: 0:07:58 lr: 0.000610 loss: 2.931749 (2.965123) time: 0.923476 data: 0.000157 max mem: 18814 Epoch: [189/300] [ 800/1251] eta: 0:07:11 lr: 0.000609 loss: 2.909533 (2.966186) time: 0.905535 data: 0.000163 max mem: 18814 Epoch: [189/300] [ 850/1251] eta: 0:06:23 lr: 0.000609 loss: 2.956573 (2.966951) time: 0.975137 data: 0.000159 max mem: 18814 Epoch: [189/300] [ 900/1251] eta: 0:05:35 lr: 0.000609 loss: 2.863514 (2.972851) time: 0.917526 data: 0.000167 max mem: 18814 Epoch: [189/300] [ 950/1251] eta: 0:04:47 lr: 0.000608 loss: 3.177113 (2.977167) time: 0.993717 data: 0.000172 max mem: 18814 Epoch: [189/300] [1000/1251] eta: 0:03:59 lr: 0.000608 loss: 3.009434 (2.979936) time: 0.925367 data: 0.000160 max mem: 18814 Epoch: [189/300] [1050/1251] eta: 0:03:12 lr: 0.000607 loss: 3.002206 (2.978597) time: 0.910706 data: 0.000163 max mem: 18814 Epoch: [189/300] [1100/1251] eta: 0:02:24 lr: 0.000607 loss: 3.005070 (2.980268) time: 0.962079 data: 0.000163 max mem: 18814 Epoch: [189/300] [1150/1251] eta: 0:01:36 lr: 0.000607 loss: 3.022627 (2.977908) time: 0.918046 data: 0.000167 max mem: 18814 Epoch: [189/300] [1200/1251] eta: 0:00:48 lr: 0.000606 loss: 3.213641 (2.981776) time: 0.995459 data: 0.000179 max mem: 18814 Epoch: [189/300] [1250/1251] eta: 0:00:00 lr: 0.000606 loss: 3.125602 (2.981308) time: 0.926368 data: 0.000776 max mem: 18814 Epoch: [189/300] Total time: 0:19:56 (0.956543 s / it) Averaged stats: lr: 0.000606 loss: 3.125602 (2.985056) Test: [ 0/49] eta: 0:01:22 loss: 0.557959 (0.557959) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.690647 data: 1.280981 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.622820 (0.740555) acc1: 82.812500 (83.096591) acc5: 96.875000 (96.590909) time: 0.483116 data: 0.116595 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.788015 (0.764687) acc1: 81.250000 (81.994048) acc5: 96.875000 (96.354167) time: 0.359414 data: 0.000164 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.783184 (0.752370) acc1: 81.250000 (82.358871) acc5: 96.875000 (96.572581) time: 0.353917 data: 0.000163 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.729776 (0.760050) acc1: 81.250000 (82.240854) acc5: 96.875000 (96.608232) time: 0.349991 data: 0.000151 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.766612 (0.758025) acc1: 81.250000 (82.272000) acc5: 96.875000 (96.736000) time: 0.345434 data: 0.000123 max mem: 18814 Test: Total time: 0:00:18 (0.381801 s / it) * Acc@1 82.204 Acc@5 96.320 loss 0.772 Max accuracy: 82.22% Epoch: [190/300] [ 0/1251] eta: 0:42:14 lr: 0.000606 loss: 2.965487 (2.965487) time: 2.026030 data: 1.166137 max mem: 18814 Epoch: [190/300] [ 50/1251] eta: 0:19:55 lr: 0.000606 loss: 3.250975 (3.148921) time: 0.977441 data: 0.000162 max mem: 18814 Epoch: [190/300] [ 100/1251] eta: 0:18:31 lr: 0.000605 loss: 3.064149 (3.065669) time: 0.909431 data: 0.000175 max mem: 18814 Epoch: [190/300] [ 150/1251] eta: 0:17:47 lr: 0.000605 loss: 2.975195 (3.065964) time: 0.992207 data: 0.000172 max mem: 18814 Epoch: [190/300] [ 200/1251] eta: 0:16:54 lr: 0.000604 loss: 3.073759 (3.022055) time: 0.924435 data: 0.000162 max mem: 18814 Epoch: [190/300] [ 250/1251] eta: 0:16:09 lr: 0.000604 loss: 3.074748 (2.988878) time: 0.922243 data: 0.000178 max mem: 18814 Epoch: [190/300] [ 300/1251] eta: 0:15:21 lr: 0.000604 loss: 3.150267 (2.991636) time: 0.987726 data: 0.000157 max mem: 18814 Epoch: [190/300] [ 350/1251] eta: 0:14:29 lr: 0.000603 loss: 3.040793 (2.988578) time: 0.940454 data: 0.000168 max mem: 18814 Epoch: [190/300] [ 400/1251] eta: 0:13:41 lr: 0.000603 loss: 3.082365 (2.987830) time: 0.974433 data: 0.000168 max mem: 18814 Epoch: [190/300] [ 450/1251] eta: 0:12:51 lr: 0.000603 loss: 3.213799 (2.980255) time: 0.925369 data: 0.000211 max mem: 18814 Epoch: [190/300] [ 500/1251] eta: 0:12:03 lr: 0.000602 loss: 2.726932 (2.977410) time: 0.912082 data: 0.000175 max mem: 18814 Epoch: [190/300] [ 550/1251] eta: 0:11:15 lr: 0.000602 loss: 3.176553 (2.974979) time: 0.985971 data: 0.000182 max mem: 18814 Epoch: [190/300] [ 600/1251] eta: 0:10:26 lr: 0.000601 loss: 2.837488 (2.969469) time: 0.927903 data: 0.000197 max mem: 18814 Epoch: [190/300] [ 650/1251] eta: 0:09:38 lr: 0.000601 loss: 2.944722 (2.965540) time: 0.974431 data: 0.000160 max mem: 18814 Epoch: [190/300] [ 700/1251] eta: 0:08:49 lr: 0.000601 loss: 3.007360 (2.963068) time: 0.918572 data: 0.000167 max mem: 18814 Epoch: [190/300] [ 750/1251] eta: 0:08:02 lr: 0.000600 loss: 2.609582 (2.958177) time: 0.917883 data: 0.000170 max mem: 18814 Epoch: [190/300] [ 800/1251] eta: 0:07:13 lr: 0.000600 loss: 2.840953 (2.957616) time: 0.963899 data: 0.000171 max mem: 18814 Epoch: [190/300] [ 850/1251] eta: 0:06:25 lr: 0.000599 loss: 2.980079 (2.955716) time: 0.914535 data: 0.000163 max mem: 18814 Epoch: [190/300] [ 900/1251] eta: 0:05:37 lr: 0.000599 loss: 3.197530 (2.956802) time: 0.973614 data: 0.000163 max mem: 18814 Epoch: [190/300] [ 950/1251] eta: 0:04:49 lr: 0.000599 loss: 3.104319 (2.957470) time: 0.923342 data: 0.000160 max mem: 18814 Epoch: [190/300] [1000/1251] eta: 0:04:01 lr: 0.000598 loss: 3.125707 (2.959986) time: 0.920596 data: 0.000149 max mem: 18814 Epoch: [190/300] [1050/1251] eta: 0:03:13 lr: 0.000598 loss: 3.169166 (2.964686) time: 0.994739 data: 0.000159 max mem: 18814 Epoch: [190/300] [1100/1251] eta: 0:02:25 lr: 0.000598 loss: 3.169566 (2.966409) time: 0.927830 data: 0.000174 max mem: 18814 Epoch: [190/300] [1150/1251] eta: 0:01:37 lr: 0.000597 loss: 2.988867 (2.965259) time: 0.999125 data: 0.000167 max mem: 18814 Epoch: [190/300] [1200/1251] eta: 0:00:49 lr: 0.000597 loss: 2.888430 (2.965149) time: 0.922238 data: 0.000161 max mem: 18814 Epoch: [190/300] [1250/1251] eta: 0:00:00 lr: 0.000596 loss: 2.926106 (2.967104) time: 0.915757 data: 0.000763 max mem: 18814 Epoch: [190/300] Total time: 0:20:03 (0.962012 s / it) Averaged stats: lr: 0.000596 loss: 2.926106 (2.964386) Test: [ 0/49] eta: 0:01:18 loss: 0.584411 (0.584411) acc1: 81.250000 (81.250000) acc5: 98.437500 (98.437500) time: 1.597353 data: 1.102196 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.584411 (0.703765) acc1: 81.250000 (82.670455) acc5: 96.875000 (95.880682) time: 0.492089 data: 0.100390 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.776026 (0.739145) acc1: 79.687500 (81.622024) acc5: 95.312500 (95.907738) time: 0.378719 data: 0.000166 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.764194 (0.732266) acc1: 81.250000 (81.854839) acc5: 95.312500 (96.320565) time: 0.364895 data: 0.000133 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.750919 (0.743127) acc1: 81.250000 (81.821646) acc5: 95.312500 (96.227134) time: 0.351994 data: 0.000129 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.750919 (0.740998) acc1: 81.250000 (81.984000) acc5: 95.312500 (96.256000) time: 0.358671 data: 0.000108 max mem: 18814 Test: Total time: 0:00:19 (0.394494 s / it) * Acc@1 82.330 Acc@5 96.312 loss 0.746 Max accuracy: 82.33% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0190.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0190.pth Epoch: [191/300] [ 0/1251] eta: 0:39:49 lr: 0.000596 loss: 2.496076 (2.496076) time: 1.910029 data: 1.059943 max mem: 18814 Epoch: [191/300] [ 50/1251] eta: 0:19:39 lr: 0.000596 loss: 2.911757 (2.891433) time: 0.911687 data: 0.000162 max mem: 18814 Epoch: [191/300] [ 100/1251] eta: 0:18:33 lr: 0.000596 loss: 3.143738 (2.914779) time: 0.969754 data: 0.000154 max mem: 18814 Epoch: [191/300] [ 150/1251] eta: 0:17:32 lr: 0.000595 loss: 3.000929 (2.926336) time: 0.911946 data: 0.000161 max mem: 18814 Epoch: [191/300] [ 200/1251] eta: 0:16:46 lr: 0.000595 loss: 3.125841 (2.949590) time: 0.988412 data: 0.000172 max mem: 18814 Epoch: [191/300] [ 250/1251] eta: 0:15:54 lr: 0.000595 loss: 2.847305 (2.937205) time: 0.918182 data: 0.000158 max mem: 18814 Epoch: [191/300] [ 300/1251] eta: 0:15:08 lr: 0.000594 loss: 3.079639 (2.938171) time: 0.940587 data: 0.000147 max mem: 18814 Epoch: [191/300] [ 350/1251] eta: 0:14:23 lr: 0.000594 loss: 2.953212 (2.940620) time: 0.985874 data: 0.000162 max mem: 18814 Epoch: [191/300] [ 400/1251] eta: 0:13:34 lr: 0.000593 loss: 3.220390 (2.961245) time: 0.914912 data: 0.000166 max mem: 18814 Epoch: [191/300] [ 450/1251] eta: 0:12:48 lr: 0.000593 loss: 3.050206 (2.957724) time: 0.993060 data: 0.000176 max mem: 18814 Epoch: [191/300] [ 500/1251] eta: 0:11:59 lr: 0.000593 loss: 2.980987 (2.954542) time: 0.924085 data: 0.000166 max mem: 18814 Epoch: [191/300] [ 550/1251] eta: 0:11:11 lr: 0.000592 loss: 2.735712 (2.945358) time: 0.929068 data: 0.000167 max mem: 18814 Epoch: [191/300] [ 600/1251] eta: 0:10:24 lr: 0.000592 loss: 3.055978 (2.949169) time: 0.975399 data: 0.000191 max mem: 18814 Epoch: [191/300] [ 650/1251] eta: 0:09:35 lr: 0.000592 loss: 3.111705 (2.950563) time: 0.920896 data: 0.000160 max mem: 18814 Epoch: [191/300] [ 700/1251] eta: 0:08:48 lr: 0.000591 loss: 2.961027 (2.951242) time: 0.991800 data: 0.000174 max mem: 18814 Epoch: [191/300] [ 750/1251] eta: 0:07:59 lr: 0.000591 loss: 3.094318 (2.951649) time: 0.922918 data: 0.000164 max mem: 18814 Epoch: [191/300] [ 800/1251] eta: 0:07:12 lr: 0.000590 loss: 2.893558 (2.956745) time: 0.942004 data: 0.000177 max mem: 18814 Epoch: [191/300] [ 850/1251] eta: 0:06:24 lr: 0.000590 loss: 2.966112 (2.957880) time: 0.987327 data: 0.000176 max mem: 18814 Epoch: [191/300] [ 900/1251] eta: 0:05:36 lr: 0.000590 loss: 3.057562 (2.963904) time: 0.909828 data: 0.000172 max mem: 18814 Epoch: [191/300] [ 950/1251] eta: 0:04:48 lr: 0.000589 loss: 3.134357 (2.966202) time: 0.966178 data: 0.000168 max mem: 18814 Epoch: [191/300] [1000/1251] eta: 0:04:00 lr: 0.000589 loss: 3.198604 (2.969192) time: 0.923802 data: 0.000162 max mem: 18814 Epoch: [191/300] [1050/1251] eta: 0:03:12 lr: 0.000589 loss: 3.110138 (2.972902) time: 0.975650 data: 0.000180 max mem: 18814 Epoch: [191/300] [1100/1251] eta: 0:02:24 lr: 0.000588 loss: 3.139347 (2.971842) time: 0.984401 data: 0.000164 max mem: 18814 Epoch: [191/300] [1150/1251] eta: 0:01:36 lr: 0.000588 loss: 3.060971 (2.973904) time: 0.948611 data: 0.000192 max mem: 18814 Epoch: [191/300] [1200/1251] eta: 0:00:48 lr: 0.000587 loss: 3.037127 (2.970193) time: 0.971926 data: 0.000178 max mem: 18814 Epoch: [191/300] [1250/1251] eta: 0:00:00 lr: 0.000587 loss: 3.162751 (2.974305) time: 0.911270 data: 0.000762 max mem: 18814 Epoch: [191/300] Total time: 0:19:58 (0.957752 s / it) Averaged stats: lr: 0.000587 loss: 3.162751 (2.966117) Test: [ 0/49] eta: 0:01:17 loss: 0.533561 (0.533561) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.581411 data: 1.154111 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.620335 (0.725222) acc1: 84.375000 (83.380682) acc5: 95.312500 (96.022727) time: 0.471306 data: 0.105082 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.777806 (0.740022) acc1: 81.250000 (82.217262) acc5: 95.312500 (96.205357) time: 0.360130 data: 0.000150 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.759917 (0.734989) acc1: 81.250000 (82.258065) acc5: 96.875000 (96.522177) time: 0.364639 data: 0.000139 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.713585 (0.746104) acc1: 82.812500 (82.431402) acc5: 96.875000 (96.379573) time: 0.368980 data: 0.000154 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.740138 (0.742758) acc1: 82.812500 (82.560000) acc5: 96.875000 (96.480000) time: 0.355475 data: 0.000119 max mem: 18814 Test: Total time: 0:00:18 (0.387539 s / it) * Acc@1 82.370 Acc@5 96.260 loss 0.751 Max accuracy: 82.37% Epoch: [192/300] [ 0/1251] eta: 0:43:46 lr: 0.000587 loss: 3.219164 (3.219164) time: 2.099672 data: 1.218133 max mem: 18814 Epoch: [192/300] [ 50/1251] eta: 0:19:37 lr: 0.000587 loss: 2.947884 (2.974687) time: 0.953344 data: 0.000172 max mem: 18814 Epoch: [192/300] [ 100/1251] eta: 0:18:32 lr: 0.000586 loss: 2.951412 (2.999759) time: 0.978078 data: 0.000169 max mem: 18814 Epoch: [192/300] [ 150/1251] eta: 0:17:41 lr: 0.000586 loss: 2.742203 (2.948561) time: 0.977301 data: 0.000175 max mem: 18814 Epoch: [192/300] [ 200/1251] eta: 0:16:44 lr: 0.000586 loss: 2.842371 (2.965092) time: 0.914928 data: 0.000157 max mem: 18814 Epoch: [192/300] [ 250/1251] eta: 0:15:58 lr: 0.000585 loss: 2.768167 (2.951556) time: 1.026981 data: 0.000172 max mem: 18814 Epoch: [192/300] [ 300/1251] eta: 0:15:06 lr: 0.000585 loss: 2.780762 (2.948235) time: 0.923223 data: 0.000159 max mem: 18814 Epoch: [192/300] [ 350/1251] eta: 0:14:20 lr: 0.000584 loss: 2.944391 (2.950454) time: 0.924687 data: 0.000152 max mem: 18814 Epoch: [192/300] [ 400/1251] eta: 0:13:33 lr: 0.000584 loss: 2.807740 (2.944602) time: 0.925263 data: 0.000163 max mem: 18814 Epoch: [192/300] [ 450/1251] eta: 0:12:47 lr: 0.000584 loss: 3.135336 (2.939131) time: 0.973077 data: 0.000176 max mem: 18814 Epoch: [192/300] [ 500/1251] eta: 0:12:00 lr: 0.000583 loss: 3.096363 (2.943616) time: 1.027119 data: 0.000164 max mem: 18814 Epoch: [192/300] [ 550/1251] eta: 0:11:10 lr: 0.000583 loss: 2.984069 (2.946390) time: 0.929926 data: 0.000163 max mem: 18814 Epoch: [192/300] [ 600/1251] eta: 0:10:23 lr: 0.000583 loss: 2.992462 (2.940351) time: 0.913981 data: 0.000175 max mem: 18814 Epoch: [192/300] [ 650/1251] eta: 0:09:36 lr: 0.000582 loss: 3.032468 (2.943586) time: 0.955553 data: 0.000166 max mem: 18814 Epoch: [192/300] [ 700/1251] eta: 0:08:49 lr: 0.000582 loss: 2.925654 (2.951775) time: 0.935540 data: 0.000159 max mem: 18814 Epoch: [192/300] [ 750/1251] eta: 0:08:01 lr: 0.000581 loss: 2.495486 (2.950644) time: 1.043825 data: 0.000167 max mem: 18814 Epoch: [192/300] [ 800/1251] eta: 0:07:13 lr: 0.000581 loss: 2.893771 (2.951615) time: 0.953377 data: 0.000165 max mem: 18814 Epoch: [192/300] [ 850/1251] eta: 0:06:25 lr: 0.000581 loss: 2.955819 (2.948981) time: 0.926157 data: 0.000166 max mem: 18814 Epoch: [192/300] [ 900/1251] eta: 0:05:37 lr: 0.000580 loss: 2.953942 (2.947203) time: 0.941697 data: 0.000159 max mem: 18814 Epoch: [192/300] [ 950/1251] eta: 0:04:49 lr: 0.000580 loss: 3.168666 (2.956231) time: 0.916456 data: 0.000158 max mem: 18814 Epoch: [192/300] [1000/1251] eta: 0:04:01 lr: 0.000580 loss: 3.066590 (2.954595) time: 1.034433 data: 0.000157 max mem: 18814 Epoch: [192/300] [1050/1251] eta: 0:03:12 lr: 0.000579 loss: 3.045784 (2.955230) time: 0.948729 data: 0.000157 max mem: 18814 Epoch: [192/300] [1100/1251] eta: 0:02:24 lr: 0.000579 loss: 2.788225 (2.952394) time: 0.918942 data: 0.000166 max mem: 18814 Epoch: [192/300] [1150/1251] eta: 0:01:37 lr: 0.000578 loss: 3.016867 (2.954514) time: 0.924629 data: 0.000152 max mem: 18814 Epoch: [192/300] [1200/1251] eta: 0:00:48 lr: 0.000578 loss: 2.942170 (2.956841) time: 0.938561 data: 0.000151 max mem: 18814 Epoch: [192/300] [1250/1251] eta: 0:00:00 lr: 0.000578 loss: 2.888626 (2.957485) time: 0.976669 data: 0.000747 max mem: 18814 Epoch: [192/300] Total time: 0:20:01 (0.960307 s / it) Averaged stats: lr: 0.000578 loss: 2.888626 (2.955287) Test: [ 0/49] eta: 0:01:27 loss: 0.529253 (0.529253) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.775570 data: 1.350214 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.554287 (0.696272) acc1: 82.812500 (83.948864) acc5: 98.437500 (96.875000) time: 0.487565 data: 0.122912 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.812762 (0.740996) acc1: 81.250000 (82.961310) acc5: 96.875000 (96.651786) time: 0.355840 data: 0.000169 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.812762 (0.741908) acc1: 81.250000 (82.459677) acc5: 96.875000 (96.723790) time: 0.352634 data: 0.000151 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.721490 (0.751354) acc1: 82.812500 (82.393293) acc5: 96.875000 (96.760671) time: 0.350832 data: 0.000130 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.753837 (0.748304) acc1: 82.812500 (82.464000) acc5: 96.875000 (96.768000) time: 0.443159 data: 0.000101 max mem: 18814 Test: Total time: 0:00:20 (0.422715 s / it) * Acc@1 82.360 Acc@5 96.332 loss 0.749 Max accuracy: 82.37% Epoch: [193/300] [ 0/1251] eta: 0:42:22 lr: 0.000578 loss: 3.030838 (3.030838) time: 2.032479 data: 1.173224 max mem: 18814 Epoch: [193/300] [ 50/1251] eta: 0:19:54 lr: 0.000577 loss: 3.072358 (3.114773) time: 0.968456 data: 0.000158 max mem: 18814 Epoch: [193/300] [ 100/1251] eta: 0:18:48 lr: 0.000577 loss: 3.102164 (3.058392) time: 0.992927 data: 0.000152 max mem: 18814 Epoch: [193/300] [ 150/1251] eta: 0:17:45 lr: 0.000577 loss: 2.824269 (3.010625) time: 0.952066 data: 0.000167 max mem: 18814 Epoch: [193/300] [ 200/1251] eta: 0:17:00 lr: 0.000576 loss: 3.044421 (3.006212) time: 0.990658 data: 0.000192 max mem: 18814 Epoch: [193/300] [ 250/1251] eta: 0:16:07 lr: 0.000576 loss: 3.190254 (2.993941) time: 0.934684 data: 0.000166 max mem: 18814 Epoch: [193/300] [ 300/1251] eta: 0:15:18 lr: 0.000575 loss: 2.954556 (2.999781) time: 0.979602 data: 0.000163 max mem: 18814 Epoch: [193/300] [ 350/1251] eta: 0:14:32 lr: 0.000575 loss: 3.161299 (2.997197) time: 0.991929 data: 0.000151 max mem: 18814 Epoch: [193/300] [ 400/1251] eta: 0:13:40 lr: 0.000575 loss: 3.075121 (2.980936) time: 0.937389 data: 0.000163 max mem: 18814 Epoch: [193/300] [ 450/1251] eta: 0:12:54 lr: 0.000574 loss: 3.050930 (2.975829) time: 0.991971 data: 0.000163 max mem: 18814 Epoch: [193/300] [ 500/1251] eta: 0:12:04 lr: 0.000574 loss: 3.128205 (2.974756) time: 0.928156 data: 0.000164 max mem: 18814 Epoch: [193/300] [ 550/1251] eta: 0:11:15 lr: 0.000574 loss: 3.012156 (2.969282) time: 0.970441 data: 0.000164 max mem: 18814 Epoch: [193/300] [ 600/1251] eta: 0:10:28 lr: 0.000573 loss: 2.939009 (2.962787) time: 0.982311 data: 0.000185 max mem: 18814 Epoch: [193/300] [ 650/1251] eta: 0:09:39 lr: 0.000573 loss: 2.651921 (2.959538) time: 0.916173 data: 0.000162 max mem: 18814 Epoch: [193/300] [ 700/1251] eta: 0:08:51 lr: 0.000572 loss: 2.895013 (2.965476) time: 0.993965 data: 0.000160 max mem: 18814 Epoch: [193/300] [ 750/1251] eta: 0:08:02 lr: 0.000572 loss: 3.019914 (2.965300) time: 0.932880 data: 0.000177 max mem: 18814 Epoch: [193/300] [ 800/1251] eta: 0:07:14 lr: 0.000572 loss: 2.879159 (2.960832) time: 0.935797 data: 0.000154 max mem: 18814 Epoch: [193/300] [ 850/1251] eta: 0:06:26 lr: 0.000571 loss: 3.056653 (2.960590) time: 0.989079 data: 0.000177 max mem: 18814 Epoch: [193/300] [ 900/1251] eta: 0:05:37 lr: 0.000571 loss: 2.853919 (2.960131) time: 0.913952 data: 0.000157 max mem: 18814 Epoch: [193/300] [ 950/1251] eta: 0:04:49 lr: 0.000571 loss: 2.858680 (2.957364) time: 0.980184 data: 0.000158 max mem: 18814 Epoch: [193/300] [1000/1251] eta: 0:04:01 lr: 0.000570 loss: 3.085363 (2.963556) time: 0.924043 data: 0.000171 max mem: 18814 Epoch: [193/300] [1050/1251] eta: 0:03:13 lr: 0.000570 loss: 2.755913 (2.964042) time: 0.965088 data: 0.000170 max mem: 18814 Epoch: [193/300] [1100/1251] eta: 0:02:25 lr: 0.000569 loss: 2.967950 (2.962016) time: 0.983507 data: 0.000175 max mem: 18814 Epoch: [193/300] [1150/1251] eta: 0:01:37 lr: 0.000569 loss: 3.015733 (2.964149) time: 0.950632 data: 0.000159 max mem: 18814 Epoch: [193/300] [1200/1251] eta: 0:00:49 lr: 0.000569 loss: 3.057641 (2.960046) time: 0.992612 data: 0.000177 max mem: 18814 Epoch: [193/300] [1250/1251] eta: 0:00:00 lr: 0.000568 loss: 3.176214 (2.959810) time: 0.923451 data: 0.000770 max mem: 18814 Epoch: [193/300] Total time: 0:20:04 (0.962648 s / it) Averaged stats: lr: 0.000568 loss: 3.176214 (2.962566) Test: [ 0/49] eta: 0:01:32 loss: 0.612314 (0.612314) acc1: 87.500000 (87.500000) acc5: 96.875000 (96.875000) time: 1.885396 data: 1.511102 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.644171 (0.715815) acc1: 81.250000 (83.806818) acc5: 96.875000 (96.448864) time: 0.506886 data: 0.137516 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.782244 (0.747157) acc1: 81.250000 (82.961310) acc5: 96.875000 (96.130952) time: 0.375431 data: 0.000152 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.779418 (0.738328) acc1: 81.250000 (82.358871) acc5: 96.875000 (96.471774) time: 0.374279 data: 0.000162 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.753736 (0.750968) acc1: 82.812500 (82.583841) acc5: 96.875000 (96.341463) time: 0.365226 data: 0.000151 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.755077 (0.744888) acc1: 82.812500 (82.624000) acc5: 96.875000 (96.480000) time: 0.355069 data: 0.000114 max mem: 18814 Test: Total time: 0:00:19 (0.399820 s / it) * Acc@1 82.350 Acc@5 96.262 loss 0.753 Max accuracy: 82.37% Epoch: [194/300] [ 0/1251] eta: 0:40:42 lr: 0.000568 loss: 2.251875 (2.251875) time: 1.952604 data: 1.075959 max mem: 18814 Epoch: [194/300] [ 50/1251] eta: 0:19:46 lr: 0.000568 loss: 3.111925 (2.899999) time: 0.968244 data: 0.000158 max mem: 18814 Epoch: [194/300] [ 100/1251] eta: 0:18:29 lr: 0.000568 loss: 3.068937 (2.955488) time: 0.972167 data: 0.000175 max mem: 18814 Epoch: [194/300] [ 150/1251] eta: 0:17:44 lr: 0.000567 loss: 2.872033 (2.924468) time: 0.989166 data: 0.000158 max mem: 18814 Epoch: [194/300] [ 200/1251] eta: 0:16:49 lr: 0.000567 loss: 3.049540 (2.946545) time: 0.921728 data: 0.000161 max mem: 18814 Epoch: [194/300] [ 250/1251] eta: 0:16:04 lr: 0.000567 loss: 3.291442 (2.981054) time: 0.994724 data: 0.000187 max mem: 18814 Epoch: [194/300] [ 300/1251] eta: 0:15:16 lr: 0.000566 loss: 2.964681 (2.969571) time: 0.957953 data: 0.000161 max mem: 18814 Epoch: [194/300] [ 350/1251] eta: 0:14:26 lr: 0.000566 loss: 2.943289 (2.958154) time: 0.968571 data: 0.000165 max mem: 18814 Epoch: [194/300] [ 400/1251] eta: 0:13:39 lr: 0.000565 loss: 3.051591 (2.951042) time: 0.989850 data: 0.000180 max mem: 18814 Epoch: [194/300] [ 450/1251] eta: 0:12:49 lr: 0.000565 loss: 3.100365 (2.956233) time: 0.918359 data: 0.000158 max mem: 18814 Epoch: [194/300] [ 500/1251] eta: 0:12:02 lr: 0.000565 loss: 2.841649 (2.936499) time: 0.974634 data: 0.000169 max mem: 18814 Epoch: [194/300] [ 550/1251] eta: 0:11:15 lr: 0.000564 loss: 2.965809 (2.941950) time: 0.973226 data: 0.000166 max mem: 18814 Epoch: [194/300] [ 600/1251] eta: 0:10:25 lr: 0.000564 loss: 3.014447 (2.943930) time: 0.971564 data: 0.000169 max mem: 18814 Epoch: [194/300] [ 650/1251] eta: 0:09:38 lr: 0.000564 loss: 3.222411 (2.948849) time: 0.978635 data: 0.000161 max mem: 18814 Epoch: [194/300] [ 700/1251] eta: 0:08:49 lr: 0.000563 loss: 2.993415 (2.945621) time: 0.918672 data: 0.000189 max mem: 18814 Epoch: [194/300] [ 750/1251] eta: 0:08:01 lr: 0.000563 loss: 3.216758 (2.946725) time: 0.989932 data: 0.000164 max mem: 18814 Epoch: [194/300] [ 800/1251] eta: 0:07:13 lr: 0.000562 loss: 3.019473 (2.946747) time: 0.980862 data: 0.000164 max mem: 18814 Epoch: [194/300] [ 850/1251] eta: 0:06:25 lr: 0.000562 loss: 2.707122 (2.942229) time: 0.982221 data: 0.000166 max mem: 18814 Epoch: [194/300] [ 900/1251] eta: 0:05:37 lr: 0.000562 loss: 3.096181 (2.938901) time: 0.992837 data: 0.000183 max mem: 18814 Epoch: [194/300] [ 950/1251] eta: 0:04:48 lr: 0.000561 loss: 2.879394 (2.933213) time: 0.917907 data: 0.000162 max mem: 18814 Epoch: [194/300] [1000/1251] eta: 0:04:00 lr: 0.000561 loss: 2.945140 (2.933872) time: 0.956650 data: 0.000160 max mem: 18814 Epoch: [194/300] [1050/1251] eta: 0:03:12 lr: 0.000561 loss: 3.084008 (2.933975) time: 0.998320 data: 0.000160 max mem: 18814 Epoch: [194/300] [1100/1251] eta: 0:02:24 lr: 0.000560 loss: 2.933907 (2.935229) time: 0.996398 data: 0.000168 max mem: 18814 Epoch: [194/300] [1150/1251] eta: 0:01:36 lr: 0.000560 loss: 3.069415 (2.931884) time: 0.978380 data: 0.000170 max mem: 18814 Epoch: [194/300] [1200/1251] eta: 0:00:48 lr: 0.000560 loss: 2.922289 (2.927038) time: 0.934268 data: 0.000162 max mem: 18814 Epoch: [194/300] [1250/1251] eta: 0:00:00 lr: 0.000559 loss: 3.197063 (2.928354) time: 0.968810 data: 0.000752 max mem: 18814 Epoch: [194/300] Total time: 0:20:01 (0.960174 s / it) Averaged stats: lr: 0.000559 loss: 3.197063 (2.936419) Test: [ 0/49] eta: 0:01:28 loss: 0.563888 (0.563888) acc1: 81.250000 (81.250000) acc5: 98.437500 (98.437500) time: 1.814688 data: 1.383798 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.563888 (0.686838) acc1: 82.812500 (83.664773) acc5: 98.437500 (96.448864) time: 0.491236 data: 0.125959 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.768549 (0.715427) acc1: 81.250000 (82.738095) acc5: 96.875000 (96.354167) time: 0.377992 data: 0.000159 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.713847 (0.703138) acc1: 81.250000 (82.762097) acc5: 96.875000 (96.723790) time: 0.385140 data: 0.000138 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.713847 (0.720246) acc1: 82.812500 (82.469512) acc5: 96.875000 (96.493902) time: 0.361324 data: 0.000127 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.748096 (0.721855) acc1: 79.687500 (82.272000) acc5: 96.875000 (96.576000) time: 0.353396 data: 0.000104 max mem: 18814 Test: Total time: 0:00:19 (0.396896 s / it) * Acc@1 82.338 Acc@5 96.306 loss 0.732 Max accuracy: 82.37% Epoch: [195/300] [ 0/1251] eta: 0:55:24 lr: 0.000559 loss: 3.172350 (3.172350) time: 2.657528 data: 1.098927 max mem: 18814 Epoch: [195/300] [ 50/1251] eta: 0:19:59 lr: 0.000559 loss: 2.983954 (2.913895) time: 1.021869 data: 0.000173 max mem: 18814 Epoch: [195/300] [ 100/1251] eta: 0:18:55 lr: 0.000558 loss: 2.786293 (2.914401) time: 1.005146 data: 0.000162 max mem: 18814 Epoch: [195/300] [ 150/1251] eta: 0:17:51 lr: 0.000558 loss: 3.040281 (2.927748) time: 0.930923 data: 0.000162 max mem: 18814 Epoch: [195/300] [ 200/1251] eta: 0:16:58 lr: 0.000558 loss: 3.144519 (2.946163) time: 0.970130 data: 0.000157 max mem: 18814 Epoch: [195/300] [ 250/1251] eta: 0:16:08 lr: 0.000557 loss: 2.903716 (2.944515) time: 0.955194 data: 0.000188 max mem: 18814 Epoch: [195/300] [ 300/1251] eta: 0:15:17 lr: 0.000557 loss: 2.891321 (2.932623) time: 0.967854 data: 0.000159 max mem: 18814 Epoch: [195/300] [ 350/1251] eta: 0:14:31 lr: 0.000557 loss: 2.888266 (2.937832) time: 1.017124 data: 0.000155 max mem: 18814 Epoch: [195/300] [ 400/1251] eta: 0:13:39 lr: 0.000556 loss: 3.245198 (2.952157) time: 0.918298 data: 0.000169 max mem: 18814 Epoch: [195/300] [ 450/1251] eta: 0:12:51 lr: 0.000556 loss: 3.124992 (2.958104) time: 1.002524 data: 0.000151 max mem: 18814 Epoch: [195/300] [ 500/1251] eta: 0:12:03 lr: 0.000555 loss: 3.204430 (2.954233) time: 0.958198 data: 0.000154 max mem: 18814 Epoch: [195/300] [ 550/1251] eta: 0:11:14 lr: 0.000555 loss: 2.942205 (2.959204) time: 0.978705 data: 0.000167 max mem: 18814 Epoch: [195/300] [ 600/1251] eta: 0:10:26 lr: 0.000555 loss: 3.064701 (2.959813) time: 0.987467 data: 0.000185 max mem: 18814 Epoch: [195/300] [ 650/1251] eta: 0:09:37 lr: 0.000554 loss: 3.110889 (2.960998) time: 0.921384 data: 0.000170 max mem: 18814 Epoch: [195/300] [ 700/1251] eta: 0:08:49 lr: 0.000554 loss: 3.080123 (2.963465) time: 0.960406 data: 0.000160 max mem: 18814 Epoch: [195/300] [ 750/1251] eta: 0:08:00 lr: 0.000554 loss: 2.896141 (2.961203) time: 0.917987 data: 0.000168 max mem: 18814 Epoch: [195/300] [ 800/1251] eta: 0:07:13 lr: 0.000553 loss: 2.682204 (2.955351) time: 0.987111 data: 0.000160 max mem: 18814 Epoch: [195/300] [ 850/1251] eta: 0:06:25 lr: 0.000553 loss: 3.019836 (2.954286) time: 0.986304 data: 0.000156 max mem: 18814 Epoch: [195/300] [ 900/1251] eta: 0:05:37 lr: 0.000552 loss: 2.258162 (2.946983) time: 0.917683 data: 0.000166 max mem: 18814 Epoch: [195/300] [ 950/1251] eta: 0:04:49 lr: 0.000552 loss: 2.830046 (2.950654) time: 0.973171 data: 0.000154 max mem: 18814 Epoch: [195/300] [1000/1251] eta: 0:04:00 lr: 0.000552 loss: 2.918669 (2.951741) time: 0.910784 data: 0.000153 max mem: 18814 Epoch: [195/300] [1050/1251] eta: 0:03:12 lr: 0.000551 loss: 3.082721 (2.946270) time: 0.976748 data: 0.000161 max mem: 18814 Epoch: [195/300] [1100/1251] eta: 0:02:24 lr: 0.000551 loss: 2.718227 (2.943973) time: 1.018967 data: 0.000165 max mem: 18814 Epoch: [195/300] [1150/1251] eta: 0:01:36 lr: 0.000551 loss: 2.587891 (2.938672) time: 0.916500 data: 0.000161 max mem: 18814 Epoch: [195/300] [1200/1251] eta: 0:00:48 lr: 0.000550 loss: 3.133760 (2.939754) time: 0.927806 data: 0.000170 max mem: 18814 Epoch: [195/300] [1250/1251] eta: 0:00:00 lr: 0.000550 loss: 3.065996 (2.941973) time: 0.924741 data: 0.000776 max mem: 18814 Epoch: [195/300] Total time: 0:20:00 (0.959855 s / it) Averaged stats: lr: 0.000550 loss: 3.065996 (2.941623) Test: [ 0/49] eta: 0:01:28 loss: 0.568178 (0.568178) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.810587 data: 1.424602 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.578654 (0.707840) acc1: 82.812500 (83.380682) acc5: 95.312500 (96.448864) time: 0.492441 data: 0.129650 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.753833 (0.724532) acc1: 82.812500 (82.961310) acc5: 95.312500 (96.651786) time: 0.357069 data: 0.000144 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.718354 (0.717495) acc1: 81.250000 (82.862903) acc5: 96.875000 (96.925403) time: 0.364976 data: 0.000152 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.718354 (0.734852) acc1: 81.250000 (82.774390) acc5: 96.875000 (96.532012) time: 0.362883 data: 0.000160 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.793183 (0.735661) acc1: 81.250000 (82.752000) acc5: 95.312500 (96.640000) time: 0.362718 data: 0.000127 max mem: 18814 Test: Total time: 0:00:19 (0.393811 s / it) * Acc@1 82.472 Acc@5 96.332 loss 0.748 Max accuracy: 82.47% Epoch: [196/300] [ 0/1251] eta: 0:41:56 lr: 0.000550 loss: 3.471622 (3.471622) time: 2.011652 data: 1.139741 max mem: 18814 Epoch: [196/300] [ 50/1251] eta: 0:18:51 lr: 0.000550 loss: 2.820514 (2.952690) time: 0.910062 data: 0.000164 max mem: 18814 Epoch: [196/300] [ 100/1251] eta: 0:18:22 lr: 0.000549 loss: 2.867022 (2.959365) time: 0.925504 data: 0.000159 max mem: 18814 Epoch: [196/300] [ 150/1251] eta: 0:17:35 lr: 0.000549 loss: 3.111078 (2.975622) time: 0.944423 data: 0.000166 max mem: 18814 Epoch: [196/300] [ 200/1251] eta: 0:16:49 lr: 0.000548 loss: 2.803029 (2.932554) time: 0.910373 data: 0.000184 max mem: 18814 Epoch: [196/300] [ 250/1251] eta: 0:16:04 lr: 0.000548 loss: 3.226089 (2.945524) time: 1.045428 data: 0.000178 max mem: 18814 Epoch: [196/300] [ 300/1251] eta: 0:15:10 lr: 0.000548 loss: 2.864703 (2.938925) time: 0.923936 data: 0.000168 max mem: 18814 Epoch: [196/300] [ 350/1251] eta: 0:14:23 lr: 0.000547 loss: 2.887981 (2.954623) time: 0.929327 data: 0.000159 max mem: 18814 Epoch: [196/300] [ 400/1251] eta: 0:13:37 lr: 0.000547 loss: 3.066129 (2.951213) time: 0.924979 data: 0.000156 max mem: 18814 Epoch: [196/300] [ 450/1251] eta: 0:12:49 lr: 0.000547 loss: 2.740026 (2.943450) time: 0.906599 data: 0.000160 max mem: 18814 Epoch: [196/300] [ 500/1251] eta: 0:12:02 lr: 0.000546 loss: 2.897571 (2.941525) time: 1.018692 data: 0.000651 max mem: 18814 Epoch: [196/300] [ 550/1251] eta: 0:11:11 lr: 0.000546 loss: 2.813448 (2.938778) time: 0.920900 data: 0.000174 max mem: 18814 Epoch: [196/300] [ 600/1251] eta: 0:10:24 lr: 0.000546 loss: 2.939578 (2.941824) time: 0.915717 data: 0.000162 max mem: 18814 Epoch: [196/300] [ 650/1251] eta: 0:09:37 lr: 0.000545 loss: 3.091530 (2.950562) time: 0.928821 data: 0.000159 max mem: 18814 Epoch: [196/300] [ 700/1251] eta: 0:08:49 lr: 0.000545 loss: 3.088280 (2.955037) time: 0.914267 data: 0.000157 max mem: 18814 Epoch: [196/300] [ 750/1251] eta: 0:08:01 lr: 0.000544 loss: 2.847910 (2.963242) time: 1.030485 data: 0.000180 max mem: 18814 Epoch: [196/300] [ 800/1251] eta: 0:07:12 lr: 0.000544 loss: 3.057374 (2.966864) time: 0.918864 data: 0.000160 max mem: 18814 Epoch: [196/300] [ 850/1251] eta: 0:06:24 lr: 0.000544 loss: 3.096163 (2.967639) time: 0.923228 data: 0.000174 max mem: 18814 Epoch: [196/300] [ 900/1251] eta: 0:05:37 lr: 0.000543 loss: 3.144695 (2.963838) time: 0.957619 data: 0.000164 max mem: 18814 Epoch: [196/300] [ 950/1251] eta: 0:04:49 lr: 0.000543 loss: 2.736566 (2.951121) time: 0.912582 data: 0.000171 max mem: 18814 Epoch: [196/300] [1000/1251] eta: 0:04:01 lr: 0.000543 loss: 3.023406 (2.948607) time: 1.060382 data: 0.000160 max mem: 18814 Epoch: [196/300] [1050/1251] eta: 0:03:13 lr: 0.000542 loss: 3.028952 (2.947943) time: 0.977017 data: 0.000168 max mem: 18814 Epoch: [196/300] [1100/1251] eta: 0:02:25 lr: 0.000542 loss: 3.116667 (2.948131) time: 0.929267 data: 0.000169 max mem: 18814 Epoch: [196/300] [1150/1251] eta: 0:01:37 lr: 0.000541 loss: 3.079601 (2.950041) time: 0.938758 data: 0.000173 max mem: 18814 Epoch: [196/300] [1200/1251] eta: 0:00:49 lr: 0.000541 loss: 3.074054 (2.950981) time: 0.910872 data: 0.000179 max mem: 18814 Epoch: [196/300] [1250/1251] eta: 0:00:00 lr: 0.000541 loss: 3.091483 (2.951208) time: 1.006825 data: 0.000760 max mem: 18814 Epoch: [196/300] Total time: 0:20:02 (0.961487 s / it) Averaged stats: lr: 0.000541 loss: 3.091483 (2.950007) Test: [ 0/49] eta: 0:01:24 loss: 0.532716 (0.532716) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.731096 data: 1.323373 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.621151 (0.704389) acc1: 82.812500 (83.238636) acc5: 96.875000 (95.880682) time: 0.485168 data: 0.120477 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.773468 (0.724764) acc1: 79.687500 (82.738095) acc5: 96.875000 (96.205357) time: 0.356886 data: 0.000170 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.735604 (0.723709) acc1: 81.250000 (82.358871) acc5: 96.875000 (96.370968) time: 0.360910 data: 0.000164 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.796559 (0.743499) acc1: 82.812500 (82.202744) acc5: 95.312500 (96.189024) time: 0.382848 data: 0.000517 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.796559 (0.740606) acc1: 82.812500 (82.336000) acc5: 95.312500 (96.320000) time: 0.370731 data: 0.000480 max mem: 18814 Test: Total time: 0:00:19 (0.395162 s / it) * Acc@1 82.336 Acc@5 96.286 loss 0.747 Max accuracy: 82.47% Epoch: [197/300] [ 0/1251] eta: 0:40:43 lr: 0.000541 loss: 3.486852 (3.486852) time: 1.953181 data: 1.082308 max mem: 18814 Epoch: [197/300] [ 50/1251] eta: 0:19:50 lr: 0.000540 loss: 2.838423 (2.901261) time: 0.950359 data: 0.000167 max mem: 18814 Epoch: [197/300] [ 100/1251] eta: 0:18:45 lr: 0.000540 loss: 2.977387 (2.912181) time: 0.937209 data: 0.000173 max mem: 18814 Epoch: [197/300] [ 150/1251] eta: 0:17:51 lr: 0.000540 loss: 3.069607 (2.931367) time: 0.970648 data: 0.000165 max mem: 18814 Epoch: [197/300] [ 200/1251] eta: 0:17:00 lr: 0.000539 loss: 3.032919 (2.943406) time: 1.011251 data: 0.000158 max mem: 18814 Epoch: [197/300] [ 250/1251] eta: 0:16:02 lr: 0.000539 loss: 3.165661 (2.948671) time: 0.911138 data: 0.000182 max mem: 18814 Epoch: [197/300] [ 300/1251] eta: 0:15:15 lr: 0.000539 loss: 2.919606 (2.957198) time: 0.923594 data: 0.000177 max mem: 18814 Epoch: [197/300] [ 350/1251] eta: 0:14:28 lr: 0.000538 loss: 3.075072 (2.946590) time: 0.926587 data: 0.000154 max mem: 18814 Epoch: [197/300] [ 400/1251] eta: 0:13:42 lr: 0.000538 loss: 2.948489 (2.945655) time: 0.987384 data: 0.000177 max mem: 18814 Epoch: [197/300] [ 450/1251] eta: 0:12:53 lr: 0.000537 loss: 3.037255 (2.941665) time: 1.014859 data: 0.000206 max mem: 18814 Epoch: [197/300] [ 500/1251] eta: 0:12:02 lr: 0.000537 loss: 3.071840 (2.942193) time: 0.935329 data: 0.000210 max mem: 18814 Epoch: [197/300] [ 550/1251] eta: 0:11:14 lr: 0.000537 loss: 2.975746 (2.946633) time: 0.935196 data: 0.000208 max mem: 18814 Epoch: [197/300] [ 600/1251] eta: 0:10:26 lr: 0.000536 loss: 3.141092 (2.944041) time: 0.931883 data: 0.000174 max mem: 18814 Epoch: [197/300] [ 650/1251] eta: 0:09:38 lr: 0.000536 loss: 3.107711 (2.945930) time: 0.963532 data: 0.000145 max mem: 18814 Epoch: [197/300] [ 700/1251] eta: 0:08:50 lr: 0.000536 loss: 3.019189 (2.944944) time: 1.066614 data: 0.000145 max mem: 18814 Epoch: [197/300] [ 750/1251] eta: 0:08:01 lr: 0.000535 loss: 3.001703 (2.947859) time: 0.930764 data: 0.000168 max mem: 18814 Epoch: [197/300] [ 800/1251] eta: 0:07:13 lr: 0.000535 loss: 2.832774 (2.944150) time: 0.932129 data: 0.000160 max mem: 18814 Epoch: [197/300] [ 850/1251] eta: 0:06:25 lr: 0.000535 loss: 3.183478 (2.951665) time: 0.942033 data: 0.000171 max mem: 18814 Epoch: [197/300] [ 900/1251] eta: 0:05:37 lr: 0.000534 loss: 2.892442 (2.955186) time: 0.959072 data: 0.000179 max mem: 18814 Epoch: [197/300] [ 950/1251] eta: 0:04:49 lr: 0.000534 loss: 2.923837 (2.946346) time: 1.037778 data: 0.000169 max mem: 18814 Epoch: [197/300] [1000/1251] eta: 0:04:01 lr: 0.000533 loss: 3.086375 (2.943032) time: 0.922330 data: 0.000176 max mem: 18814 Epoch: [197/300] [1050/1251] eta: 0:03:13 lr: 0.000533 loss: 2.836468 (2.940736) time: 0.929376 data: 0.000201 max mem: 18814 Epoch: [197/300] [1100/1251] eta: 0:02:25 lr: 0.000533 loss: 3.002323 (2.941809) time: 0.937210 data: 0.000157 max mem: 18814 Epoch: [197/300] [1150/1251] eta: 0:01:37 lr: 0.000532 loss: 3.008543 (2.942858) time: 0.962402 data: 0.000169 max mem: 18814 Epoch: [197/300] [1200/1251] eta: 0:00:49 lr: 0.000532 loss: 3.105056 (2.942474) time: 1.048519 data: 0.000208 max mem: 18814 Epoch: [197/300] [1250/1251] eta: 0:00:00 lr: 0.000532 loss: 3.141554 (2.942225) time: 0.919784 data: 0.000761 max mem: 18814 Epoch: [197/300] Total time: 0:20:02 (0.960858 s / it) Averaged stats: lr: 0.000532 loss: 3.141554 (2.930808) Test: [ 0/49] eta: 0:01:27 loss: 0.520947 (0.520947) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.776307 data: 1.160666 max mem: 18814 Test: [10/49] eta: 0:00:25 loss: 0.574364 (0.682339) acc1: 82.812500 (83.096591) acc5: 98.437500 (97.159091) time: 0.662965 data: 0.105670 max mem: 18814 Test: [20/49] eta: 0:00:14 loss: 0.719167 (0.704927) acc1: 81.250000 (82.663690) acc5: 96.875000 (96.949405) time: 0.452509 data: 0.000148 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.719167 (0.713315) acc1: 82.812500 (82.560484) acc5: 96.875000 (96.975806) time: 0.353978 data: 0.000138 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.758608 (0.729711) acc1: 82.812500 (82.507622) acc5: 96.875000 (96.646341) time: 0.352133 data: 0.000156 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.761581 (0.729268) acc1: 82.812500 (82.624000) acc5: 96.875000 (96.800000) time: 0.347075 data: 0.000131 max mem: 18814 Test: Total time: 0:00:20 (0.422613 s / it) * Acc@1 82.404 Acc@5 96.364 loss 0.751 Max accuracy: 82.47% Epoch: [198/300] [ 0/1251] eta: 0:43:05 lr: 0.000532 loss: 1.799539 (1.799539) time: 2.066563 data: 1.183142 max mem: 18814 Epoch: [198/300] [ 50/1251] eta: 0:19:54 lr: 0.000531 loss: 2.976308 (2.880558) time: 0.924571 data: 0.000151 max mem: 18814 Epoch: [198/300] [ 100/1251] eta: 0:18:49 lr: 0.000531 loss: 2.695669 (2.869700) time: 1.035733 data: 0.000190 max mem: 18814 Epoch: [198/300] [ 150/1251] eta: 0:17:54 lr: 0.000531 loss: 3.022581 (2.889840) time: 1.006820 data: 0.000177 max mem: 18814 Epoch: [198/300] [ 200/1251] eta: 0:16:56 lr: 0.000530 loss: 2.855107 (2.886577) time: 0.932580 data: 0.000162 max mem: 18814 Epoch: [198/300] [ 250/1251] eta: 0:16:09 lr: 0.000530 loss: 3.116307 (2.908178) time: 0.944394 data: 0.000154 max mem: 18814 Epoch: [198/300] [ 300/1251] eta: 0:15:22 lr: 0.000529 loss: 2.966455 (2.891859) time: 0.928975 data: 0.000164 max mem: 18814 Epoch: [198/300] [ 350/1251] eta: 0:14:33 lr: 0.000529 loss: 2.872115 (2.890902) time: 1.039102 data: 0.000172 max mem: 18814 Epoch: [198/300] [ 400/1251] eta: 0:13:46 lr: 0.000529 loss: 3.062100 (2.895694) time: 1.045276 data: 0.000175 max mem: 18814 Epoch: [198/300] [ 450/1251] eta: 0:12:53 lr: 0.000528 loss: 3.200932 (2.908451) time: 0.907789 data: 0.000172 max mem: 18814 Epoch: [198/300] [ 500/1251] eta: 0:12:05 lr: 0.000528 loss: 3.084208 (2.910847) time: 0.936127 data: 0.000165 max mem: 18814 Epoch: [198/300] [ 550/1251] eta: 0:11:17 lr: 0.000528 loss: 2.939466 (2.906327) time: 0.919402 data: 0.000175 max mem: 18814 Epoch: [198/300] [ 600/1251] eta: 0:10:28 lr: 0.000527 loss: 3.140694 (2.907298) time: 1.020554 data: 0.000167 max mem: 18814 Epoch: [198/300] [ 650/1251] eta: 0:09:40 lr: 0.000527 loss: 2.648365 (2.904865) time: 1.036722 data: 0.000161 max mem: 18814 Epoch: [198/300] [ 700/1251] eta: 0:08:50 lr: 0.000527 loss: 3.187356 (2.909883) time: 0.932365 data: 0.000156 max mem: 18814 Epoch: [198/300] [ 750/1251] eta: 0:08:02 lr: 0.000526 loss: 3.066179 (2.910066) time: 0.938186 data: 0.000173 max mem: 18814 Epoch: [198/300] [ 800/1251] eta: 0:07:14 lr: 0.000526 loss: 3.063365 (2.907101) time: 0.931948 data: 0.000164 max mem: 18814 Epoch: [198/300] [ 850/1251] eta: 0:06:26 lr: 0.000525 loss: 3.048132 (2.911424) time: 1.038170 data: 0.000183 max mem: 18814 Epoch: [198/300] [ 900/1251] eta: 0:05:37 lr: 0.000525 loss: 3.176362 (2.916890) time: 0.914644 data: 0.000164 max mem: 18814 Epoch: [198/300] [ 950/1251] eta: 0:04:49 lr: 0.000525 loss: 3.102636 (2.917128) time: 0.922736 data: 0.000161 max mem: 18814 Epoch: [198/300] [1000/1251] eta: 0:04:01 lr: 0.000524 loss: 2.998408 (2.919586) time: 0.999239 data: 0.000161 max mem: 18814 Epoch: [198/300] [1050/1251] eta: 0:03:13 lr: 0.000524 loss: 2.999837 (2.920918) time: 0.907296 data: 0.000172 max mem: 18814 Epoch: [198/300] [1100/1251] eta: 0:02:25 lr: 0.000524 loss: 2.867958 (2.921167) time: 0.974064 data: 0.000162 max mem: 18814 Epoch: [198/300] [1150/1251] eta: 0:01:37 lr: 0.000523 loss: 3.060703 (2.919337) time: 0.922520 data: 0.000169 max mem: 18814 Epoch: [198/300] [1200/1251] eta: 0:00:48 lr: 0.000523 loss: 2.852261 (2.918925) time: 0.915615 data: 0.000173 max mem: 18814 Epoch: [198/300] [1250/1251] eta: 0:00:00 lr: 0.000523 loss: 2.961998 (2.919754) time: 0.964524 data: 0.000754 max mem: 18814 Epoch: [198/300] Total time: 0:20:02 (0.961182 s / it) Averaged stats: lr: 0.000523 loss: 2.961998 (2.913797) Test: [ 0/49] eta: 0:01:27 loss: 0.575018 (0.575018) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.790112 data: 1.392317 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.646877 (0.727494) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.448864) time: 0.490930 data: 0.126735 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.784282 (0.758613) acc1: 81.250000 (82.366071) acc5: 96.875000 (96.354167) time: 0.357923 data: 0.000177 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.750195 (0.746777) acc1: 81.250000 (82.308468) acc5: 96.875000 (96.572581) time: 0.356069 data: 0.000162 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.742111 (0.758875) acc1: 82.812500 (82.355183) acc5: 96.875000 (96.570122) time: 0.358796 data: 0.000153 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.742111 (0.755459) acc1: 81.250000 (82.304000) acc5: 96.875000 (96.608000) time: 0.377800 data: 0.000128 max mem: 18814 Test: Total time: 0:00:19 (0.397423 s / it) * Acc@1 82.530 Acc@5 96.404 loss 0.756 Max accuracy: 82.53% Epoch: [199/300] [ 0/1251] eta: 0:39:45 lr: 0.000523 loss: 3.450837 (3.450837) time: 1.906782 data: 1.051388 max mem: 18814 Epoch: [199/300] [ 50/1251] eta: 0:19:46 lr: 0.000522 loss: 2.728122 (2.855975) time: 1.040045 data: 0.000172 max mem: 18814 Epoch: [199/300] [ 100/1251] eta: 0:18:18 lr: 0.000522 loss: 2.828633 (2.911979) time: 0.913666 data: 0.000158 max mem: 18814 Epoch: [199/300] [ 150/1251] eta: 0:17:37 lr: 0.000521 loss: 3.025551 (2.914265) time: 0.928079 data: 0.000174 max mem: 18814 Epoch: [199/300] [ 200/1251] eta: 0:16:51 lr: 0.000521 loss: 3.151194 (2.898791) time: 0.935404 data: 0.000156 max mem: 18814 Epoch: [199/300] [ 250/1251] eta: 0:16:04 lr: 0.000521 loss: 3.055084 (2.908257) time: 0.972632 data: 0.000170 max mem: 18814 Epoch: [199/300] [ 300/1251] eta: 0:15:18 lr: 0.000520 loss: 3.001250 (2.901962) time: 1.023434 data: 0.000182 max mem: 18814 Epoch: [199/300] [ 350/1251] eta: 0:14:24 lr: 0.000520 loss: 2.793224 (2.894564) time: 0.910266 data: 0.000162 max mem: 18814 Epoch: [199/300] [ 400/1251] eta: 0:13:38 lr: 0.000520 loss: 3.184584 (2.909657) time: 0.941939 data: 0.000174 max mem: 18814 Epoch: [199/300] [ 450/1251] eta: 0:12:50 lr: 0.000519 loss: 2.777380 (2.906525) time: 0.920673 data: 0.000158 max mem: 18814 Epoch: [199/300] [ 500/1251] eta: 0:12:01 lr: 0.000519 loss: 3.094952 (2.910112) time: 0.942963 data: 0.000172 max mem: 18814 Epoch: [199/300] [ 550/1251] eta: 0:11:15 lr: 0.000519 loss: 2.890545 (2.906915) time: 1.031277 data: 0.000162 max mem: 18814 Epoch: [199/300] [ 600/1251] eta: 0:10:24 lr: 0.000518 loss: 3.072170 (2.922567) time: 0.914261 data: 0.000174 max mem: 18814 Epoch: [199/300] [ 650/1251] eta: 0:09:37 lr: 0.000518 loss: 2.883958 (2.915810) time: 0.934490 data: 0.000161 max mem: 18814 Epoch: [199/300] [ 700/1251] eta: 0:08:49 lr: 0.000518 loss: 3.037298 (2.921722) time: 0.931867 data: 0.000149 max mem: 18814 Epoch: [199/300] [ 750/1251] eta: 0:08:01 lr: 0.000517 loss: 3.281830 (2.932869) time: 0.987097 data: 0.000174 max mem: 18814 Epoch: [199/300] [ 800/1251] eta: 0:07:13 lr: 0.000517 loss: 3.008229 (2.936046) time: 1.010333 data: 0.000473 max mem: 18814 Epoch: [199/300] [ 850/1251] eta: 0:06:24 lr: 0.000516 loss: 2.994081 (2.932917) time: 0.911032 data: 0.000154 max mem: 18814 Epoch: [199/300] [ 900/1251] eta: 0:05:36 lr: 0.000516 loss: 3.130765 (2.937396) time: 0.924171 data: 0.000160 max mem: 18814 Epoch: [199/300] [ 950/1251] eta: 0:04:48 lr: 0.000516 loss: 2.807784 (2.931603) time: 0.919638 data: 0.000187 max mem: 18814 Epoch: [199/300] [1000/1251] eta: 0:04:01 lr: 0.000515 loss: 2.971874 (2.937448) time: 0.983127 data: 0.000165 max mem: 18814 Epoch: [199/300] [1050/1251] eta: 0:03:13 lr: 0.000515 loss: 2.982673 (2.934916) time: 1.038683 data: 0.000157 max mem: 18814 Epoch: [199/300] [1100/1251] eta: 0:02:24 lr: 0.000515 loss: 3.111806 (2.934867) time: 0.935135 data: 0.000179 max mem: 18814 Epoch: [199/300] [1150/1251] eta: 0:01:36 lr: 0.000514 loss: 2.986475 (2.932545) time: 0.920090 data: 0.000165 max mem: 18814 Epoch: [199/300] [1200/1251] eta: 0:00:49 lr: 0.000514 loss: 2.686855 (2.931340) time: 0.946622 data: 0.000175 max mem: 18814 Epoch: [199/300] [1250/1251] eta: 0:00:00 lr: 0.000514 loss: 2.883745 (2.929018) time: 0.969536 data: 0.000763 max mem: 18814 Epoch: [199/300] Total time: 0:20:02 (0.961407 s / it) Averaged stats: lr: 0.000514 loss: 2.883745 (2.917469) Test: [ 0/49] eta: 0:01:30 loss: 0.525276 (0.525276) acc1: 84.375000 (84.375000) acc5: 95.312500 (95.312500) time: 1.856035 data: 1.399232 max mem: 18814 Test: [10/49] eta: 0:00:21 loss: 0.565169 (0.697791) acc1: 82.812500 (83.238636) acc5: 95.312500 (96.164773) time: 0.543539 data: 0.127368 max mem: 18814 Test: [20/49] eta: 0:00:13 loss: 0.737570 (0.720973) acc1: 81.250000 (81.845238) acc5: 96.875000 (96.354167) time: 0.386832 data: 0.000164 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.741297 (0.716830) acc1: 81.250000 (81.754032) acc5: 96.875000 (96.572581) time: 0.357581 data: 0.000151 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.756039 (0.730115) acc1: 81.250000 (81.859756) acc5: 96.875000 (96.532012) time: 0.368864 data: 0.000158 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.756039 (0.723553) acc1: 81.250000 (82.048000) acc5: 96.875000 (96.544000) time: 0.363811 data: 0.000128 max mem: 18814 Test: Total time: 0:00:19 (0.405782 s / it) * Acc@1 82.482 Acc@5 96.370 loss 0.727 Max accuracy: 82.53% Epoch: [200/300] [ 0/1251] eta: 0:42:49 lr: 0.000514 loss: 3.163160 (3.163160) time: 2.053856 data: 1.181486 max mem: 18814 Epoch: [200/300] [ 50/1251] eta: 0:19:22 lr: 0.000513 loss: 3.023960 (3.017900) time: 0.970772 data: 0.000158 max mem: 18814 Epoch: [200/300] [ 100/1251] eta: 0:18:26 lr: 0.000513 loss: 3.037999 (2.919063) time: 0.924201 data: 0.000167 max mem: 18814 Epoch: [200/300] [ 150/1251] eta: 0:17:42 lr: 0.000513 loss: 2.920096 (2.927664) time: 0.929298 data: 0.000163 max mem: 18814 Epoch: [200/300] [ 200/1251] eta: 0:16:53 lr: 0.000512 loss: 2.939638 (2.900391) time: 0.916595 data: 0.000167 max mem: 18814 Epoch: [200/300] [ 250/1251] eta: 0:16:05 lr: 0.000512 loss: 2.859433 (2.901831) time: 1.031888 data: 0.000170 max mem: 18814 Epoch: [200/300] [ 300/1251] eta: 0:15:13 lr: 0.000511 loss: 3.120613 (2.900587) time: 0.971853 data: 0.000164 max mem: 18814 Epoch: [200/300] [ 350/1251] eta: 0:14:24 lr: 0.000511 loss: 3.169848 (2.921602) time: 0.936615 data: 0.000162 max mem: 18814 Epoch: [200/300] [ 400/1251] eta: 0:13:38 lr: 0.000511 loss: 2.884212 (2.906239) time: 0.943758 data: 0.000165 max mem: 18814 Epoch: [200/300] [ 450/1251] eta: 0:12:51 lr: 0.000510 loss: 2.982151 (2.923997) time: 0.916897 data: 0.000172 max mem: 18814 Epoch: [200/300] [ 500/1251] eta: 0:12:03 lr: 0.000510 loss: 2.941655 (2.918869) time: 1.030114 data: 0.000164 max mem: 18814 Epoch: [200/300] [ 550/1251] eta: 0:11:14 lr: 0.000510 loss: 2.960578 (2.913967) time: 0.980056 data: 0.000154 max mem: 18814 Epoch: [200/300] [ 600/1251] eta: 0:10:25 lr: 0.000509 loss: 2.911511 (2.918592) time: 0.928788 data: 0.000166 max mem: 18814 Epoch: [200/300] [ 650/1251] eta: 0:09:37 lr: 0.000509 loss: 3.146024 (2.907514) time: 0.925773 data: 0.000163 max mem: 18814 Epoch: [200/300] [ 700/1251] eta: 0:08:49 lr: 0.000509 loss: 3.099734 (2.917899) time: 0.930100 data: 0.000157 max mem: 18814 Epoch: [200/300] [ 750/1251] eta: 0:08:02 lr: 0.000508 loss: 2.894421 (2.920171) time: 1.042127 data: 0.000165 max mem: 18814 Epoch: [200/300] [ 800/1251] eta: 0:07:13 lr: 0.000508 loss: 3.006784 (2.920510) time: 0.979792 data: 0.000167 max mem: 18814 Epoch: [200/300] [ 850/1251] eta: 0:06:25 lr: 0.000507 loss: 2.921690 (2.922983) time: 0.923417 data: 0.000162 max mem: 18814 Epoch: [200/300] [ 900/1251] eta: 0:05:37 lr: 0.000507 loss: 2.910414 (2.916854) time: 0.947618 data: 0.000172 max mem: 18814 Epoch: [200/300] [ 950/1251] eta: 0:04:49 lr: 0.000507 loss: 2.780792 (2.913307) time: 0.937068 data: 0.000162 max mem: 18814 Epoch: [200/300] [1000/1251] eta: 0:04:01 lr: 0.000506 loss: 3.075742 (2.916438) time: 1.019853 data: 0.000157 max mem: 18814 Epoch: [200/300] [1050/1251] eta: 0:03:13 lr: 0.000506 loss: 2.744928 (2.911145) time: 0.969393 data: 0.000160 max mem: 18814 Epoch: [200/300] [1100/1251] eta: 0:02:25 lr: 0.000506 loss: 2.946498 (2.909670) time: 0.922147 data: 0.000153 max mem: 18814 Epoch: [200/300] [1150/1251] eta: 0:01:37 lr: 0.000505 loss: 2.883230 (2.913640) time: 0.937444 data: 0.000159 max mem: 18814 Epoch: [200/300] [1200/1251] eta: 0:00:49 lr: 0.000505 loss: 3.071011 (2.915858) time: 0.914802 data: 0.000166 max mem: 18814 Epoch: [200/300] [1250/1251] eta: 0:00:00 lr: 0.000505 loss: 2.939938 (2.915443) time: 1.034848 data: 0.000772 max mem: 18814 Epoch: [200/300] Total time: 0:20:03 (0.961930 s / it) Averaged stats: lr: 0.000505 loss: 2.939938 (2.918459) Test: [ 0/49] eta: 0:01:27 loss: 0.467735 (0.467735) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.794252 data: 1.393278 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.598002 (0.714654) acc1: 82.812500 (83.238636) acc5: 96.875000 (96.022727) time: 0.489259 data: 0.126808 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.747367 (0.737284) acc1: 81.250000 (82.217262) acc5: 96.875000 (96.428571) time: 0.358234 data: 0.000149 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.723680 (0.727272) acc1: 81.250000 (82.358871) acc5: 96.875000 (96.572581) time: 0.390570 data: 0.000131 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.746243 (0.733514) acc1: 82.812500 (82.545732) acc5: 96.875000 (96.532012) time: 0.395766 data: 0.000120 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.763645 (0.730089) acc1: 82.812500 (82.464000) acc5: 96.875000 (96.608000) time: 0.361713 data: 0.000103 max mem: 18814 Test: Total time: 0:00:19 (0.403413 s / it) * Acc@1 82.422 Acc@5 96.342 loss 0.739 Max accuracy: 82.53% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0200.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0200.pth Epoch: [201/300] [ 0/1251] eta: 0:59:18 lr: 0.000505 loss: 3.308391 (3.308391) time: 2.844687 data: 1.193085 max mem: 18814 Epoch: [201/300] [ 50/1251] eta: 0:19:39 lr: 0.000504 loss: 3.038030 (2.990895) time: 0.964969 data: 0.000184 max mem: 18814 Epoch: [201/300] [ 100/1251] eta: 0:18:43 lr: 0.000504 loss: 2.860082 (2.968627) time: 0.978432 data: 0.000153 max mem: 18814 Epoch: [201/300] [ 150/1251] eta: 0:17:45 lr: 0.000504 loss: 3.027369 (2.945477) time: 0.922683 data: 0.000168 max mem: 18814 Epoch: [201/300] [ 200/1251] eta: 0:16:55 lr: 0.000503 loss: 2.836467 (2.914520) time: 0.968045 data: 0.000169 max mem: 18814 Epoch: [201/300] [ 250/1251] eta: 0:16:06 lr: 0.000503 loss: 2.695464 (2.904063) time: 0.960359 data: 0.000175 max mem: 18814 Epoch: [201/300] [ 300/1251] eta: 0:15:15 lr: 0.000502 loss: 2.831402 (2.907298) time: 0.980392 data: 0.000164 max mem: 18814 Epoch: [201/300] [ 350/1251] eta: 0:14:25 lr: 0.000502 loss: 2.682598 (2.896722) time: 0.960391 data: 0.000166 max mem: 18814 Epoch: [201/300] [ 400/1251] eta: 0:13:37 lr: 0.000502 loss: 2.795048 (2.898018) time: 0.922896 data: 0.000160 max mem: 18814 Epoch: [201/300] [ 450/1251] eta: 0:12:49 lr: 0.000501 loss: 3.035335 (2.905754) time: 0.970948 data: 0.000159 max mem: 18814 Epoch: [201/300] [ 500/1251] eta: 0:12:00 lr: 0.000501 loss: 3.062939 (2.908287) time: 0.912873 data: 0.000160 max mem: 18814 Epoch: [201/300] [ 550/1251] eta: 0:11:13 lr: 0.000501 loss: 2.862138 (2.912823) time: 0.988294 data: 0.000169 max mem: 18814 Epoch: [201/300] [ 600/1251] eta: 0:10:25 lr: 0.000500 loss: 2.971485 (2.920051) time: 0.954006 data: 0.000158 max mem: 18814 Epoch: [201/300] [ 650/1251] eta: 0:09:36 lr: 0.000500 loss: 3.171004 (2.921039) time: 0.917505 data: 0.000164 max mem: 18814 Epoch: [201/300] [ 700/1251] eta: 0:08:48 lr: 0.000500 loss: 2.938955 (2.923580) time: 0.977165 data: 0.000160 max mem: 18814 Epoch: [201/300] [ 750/1251] eta: 0:08:00 lr: 0.000499 loss: 2.903911 (2.920950) time: 0.924570 data: 0.000216 max mem: 18814 Epoch: [201/300] [ 800/1251] eta: 0:07:12 lr: 0.000499 loss: 3.021744 (2.919995) time: 1.036262 data: 0.000162 max mem: 18814 Epoch: [201/300] [ 850/1251] eta: 0:06:24 lr: 0.000499 loss: 2.781868 (2.918225) time: 0.987175 data: 0.000159 max mem: 18814 Epoch: [201/300] [ 900/1251] eta: 0:05:36 lr: 0.000498 loss: 2.759194 (2.917901) time: 0.916910 data: 0.000168 max mem: 18814 Epoch: [201/300] [ 950/1251] eta: 0:04:48 lr: 0.000498 loss: 3.146557 (2.916117) time: 0.928207 data: 0.000154 max mem: 18814 Epoch: [201/300] [1000/1251] eta: 0:04:00 lr: 0.000498 loss: 2.943623 (2.911843) time: 0.919939 data: 0.000148 max mem: 18814 Epoch: [201/300] [1050/1251] eta: 0:03:12 lr: 0.000497 loss: 2.820282 (2.910057) time: 1.043727 data: 0.000171 max mem: 18814 Epoch: [201/300] [1100/1251] eta: 0:02:24 lr: 0.000497 loss: 2.822496 (2.906038) time: 1.005265 data: 0.000161 max mem: 18814 Epoch: [201/300] [1150/1251] eta: 0:01:36 lr: 0.000496 loss: 2.981559 (2.902917) time: 0.930359 data: 0.000170 max mem: 18814 Epoch: [201/300] [1200/1251] eta: 0:00:48 lr: 0.000496 loss: 3.019346 (2.906655) time: 0.928031 data: 0.000168 max mem: 18814 Epoch: [201/300] [1250/1251] eta: 0:00:00 lr: 0.000496 loss: 2.998253 (2.907285) time: 0.926687 data: 0.000772 max mem: 18814 Epoch: [201/300] Total time: 0:20:01 (0.960311 s / it) Averaged stats: lr: 0.000496 loss: 2.998253 (2.913270) Test: [ 0/49] eta: 0:01:15 loss: 0.490909 (0.490909) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.545277 data: 1.123962 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.559356 (0.685517) acc1: 85.937500 (84.517045) acc5: 96.875000 (96.022727) time: 0.468823 data: 0.102330 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.714150 (0.711275) acc1: 81.250000 (83.110119) acc5: 96.875000 (96.428571) time: 0.357854 data: 0.000146 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.743820 (0.718312) acc1: 81.250000 (82.762097) acc5: 96.875000 (96.572581) time: 0.374045 data: 0.000134 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.721953 (0.733305) acc1: 82.812500 (82.660061) acc5: 96.875000 (96.570122) time: 0.371411 data: 0.000143 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.721953 (0.732984) acc1: 82.812500 (82.592000) acc5: 96.875000 (96.640000) time: 0.351303 data: 0.000117 max mem: 18814 Test: Total time: 0:00:19 (0.389244 s / it) * Acc@1 82.602 Acc@5 96.476 loss 0.739 Max accuracy: 82.60% Epoch: [202/300] [ 0/1251] eta: 0:42:17 lr: 0.000496 loss: 2.534059 (2.534059) time: 2.028503 data: 1.167012 max mem: 18814 Epoch: [202/300] [ 50/1251] eta: 0:18:48 lr: 0.000495 loss: 2.237869 (2.755446) time: 0.912843 data: 0.000167 max mem: 18814 Epoch: [202/300] [ 100/1251] eta: 0:18:27 lr: 0.000495 loss: 3.039109 (2.886173) time: 0.940844 data: 0.000181 max mem: 18814 Epoch: [202/300] [ 150/1251] eta: 0:17:42 lr: 0.000495 loss: 2.781612 (2.870525) time: 0.936564 data: 0.000176 max mem: 18814 Epoch: [202/300] [ 200/1251] eta: 0:16:45 lr: 0.000494 loss: 2.952896 (2.862770) time: 0.954943 data: 0.000183 max mem: 18814 Epoch: [202/300] [ 250/1251] eta: 0:16:02 lr: 0.000494 loss: 2.795188 (2.848556) time: 0.995566 data: 0.000156 max mem: 18814 Epoch: [202/300] [ 300/1251] eta: 0:15:10 lr: 0.000494 loss: 3.026873 (2.857266) time: 0.923481 data: 0.000164 max mem: 18814 Epoch: [202/300] [ 350/1251] eta: 0:14:24 lr: 0.000493 loss: 2.684578 (2.842255) time: 0.977197 data: 0.000164 max mem: 18814 Epoch: [202/300] [ 400/1251] eta: 0:13:37 lr: 0.000493 loss: 2.874296 (2.854565) time: 0.983651 data: 0.000160 max mem: 18814 Epoch: [202/300] [ 450/1251] eta: 0:12:47 lr: 0.000493 loss: 3.020080 (2.870868) time: 0.981540 data: 0.000158 max mem: 18814 Epoch: [202/300] [ 500/1251] eta: 0:12:01 lr: 0.000492 loss: 3.070443 (2.876613) time: 0.992104 data: 0.000163 max mem: 18814 Epoch: [202/300] [ 550/1251] eta: 0:11:13 lr: 0.000492 loss: 3.039724 (2.877846) time: 0.927477 data: 0.000190 max mem: 18814 Epoch: [202/300] [ 600/1251] eta: 0:10:25 lr: 0.000491 loss: 2.755670 (2.882666) time: 0.987087 data: 0.000169 max mem: 18814 Epoch: [202/300] [ 650/1251] eta: 0:09:38 lr: 0.000491 loss: 2.825004 (2.885334) time: 0.961264 data: 0.000159 max mem: 18814 Epoch: [202/300] [ 700/1251] eta: 0:08:49 lr: 0.000491 loss: 2.715239 (2.884187) time: 0.969973 data: 0.000186 max mem: 18814 Epoch: [202/300] [ 750/1251] eta: 0:08:01 lr: 0.000490 loss: 3.022107 (2.889901) time: 0.996794 data: 0.000173 max mem: 18814 Epoch: [202/300] [ 800/1251] eta: 0:07:13 lr: 0.000490 loss: 3.118785 (2.889595) time: 0.919942 data: 0.000158 max mem: 18814 Epoch: [202/300] [ 850/1251] eta: 0:06:25 lr: 0.000490 loss: 3.005409 (2.886946) time: 0.986297 data: 0.000162 max mem: 18814 Epoch: [202/300] [ 900/1251] eta: 0:05:37 lr: 0.000489 loss: 2.810589 (2.888647) time: 0.976644 data: 0.000156 max mem: 18814 Epoch: [202/300] [ 950/1251] eta: 0:04:49 lr: 0.000489 loss: 3.095935 (2.887040) time: 0.950612 data: 0.000165 max mem: 18814 Epoch: [202/300] [1000/1251] eta: 0:04:01 lr: 0.000489 loss: 3.162933 (2.894765) time: 0.999234 data: 0.000154 max mem: 18814 Epoch: [202/300] [1050/1251] eta: 0:03:13 lr: 0.000488 loss: 3.027634 (2.896539) time: 0.933048 data: 0.000188 max mem: 18814 Epoch: [202/300] [1100/1251] eta: 0:02:25 lr: 0.000488 loss: 2.969588 (2.903635) time: 1.001254 data: 0.000173 max mem: 18814 Epoch: [202/300] [1150/1251] eta: 0:01:37 lr: 0.000488 loss: 3.064801 (2.907117) time: 0.975126 data: 0.000167 max mem: 18814 Epoch: [202/300] [1200/1251] eta: 0:00:49 lr: 0.000487 loss: 3.129106 (2.910995) time: 0.978722 data: 0.000178 max mem: 18814 Epoch: [202/300] [1250/1251] eta: 0:00:00 lr: 0.000487 loss: 3.227138 (2.915092) time: 0.990058 data: 0.000755 max mem: 18814 Epoch: [202/300] Total time: 0:20:03 (0.962212 s / it) Averaged stats: lr: 0.000487 loss: 3.227138 (2.910578) Test: [ 0/49] eta: 0:01:18 loss: 0.537755 (0.537755) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.595012 data: 1.193817 max mem: 18814 Test: [10/49] eta: 0:00:18 loss: 0.595878 (0.713799) acc1: 82.812500 (82.954545) acc5: 98.437500 (97.017045) time: 0.475184 data: 0.108691 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.753755 (0.730558) acc1: 81.250000 (82.589286) acc5: 96.875000 (97.023810) time: 0.357584 data: 0.000181 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.748433 (0.725979) acc1: 81.250000 (82.510081) acc5: 96.875000 (96.975806) time: 0.352982 data: 0.000182 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.744422 (0.733979) acc1: 82.812500 (82.736280) acc5: 96.875000 (96.722561) time: 0.351367 data: 0.000162 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.746122 (0.733142) acc1: 82.812500 (82.944000) acc5: 96.875000 (96.800000) time: 0.436643 data: 0.000128 max mem: 18814 Test: Total time: 0:00:20 (0.416630 s / it) * Acc@1 82.608 Acc@5 96.446 loss 0.747 Max accuracy: 82.61% Epoch: [203/300] [ 0/1251] eta: 0:41:17 lr: 0.000487 loss: 2.622159 (2.622159) time: 1.980326 data: 1.114530 max mem: 18814 Epoch: [203/300] [ 50/1251] eta: 0:19:43 lr: 0.000487 loss: 2.779658 (2.778511) time: 0.972532 data: 0.000152 max mem: 18814 Epoch: [203/300] [ 100/1251] eta: 0:18:47 lr: 0.000486 loss: 3.025552 (2.827775) time: 0.983490 data: 0.000162 max mem: 18814 Epoch: [203/300] [ 150/1251] eta: 0:17:46 lr: 0.000486 loss: 3.184329 (2.895561) time: 0.987062 data: 0.000178 max mem: 18814 Epoch: [203/300] [ 200/1251] eta: 0:17:01 lr: 0.000485 loss: 3.091754 (2.852569) time: 0.990794 data: 0.000164 max mem: 18814 Epoch: [203/300] [ 250/1251] eta: 0:16:07 lr: 0.000485 loss: 3.084306 (2.872036) time: 0.913649 data: 0.000160 max mem: 18814 Epoch: [203/300] [ 300/1251] eta: 0:15:17 lr: 0.000485 loss: 2.989270 (2.874503) time: 0.960135 data: 0.000158 max mem: 18814 Epoch: [203/300] [ 350/1251] eta: 0:14:27 lr: 0.000484 loss: 3.060519 (2.880120) time: 0.964356 data: 0.000170 max mem: 18814 Epoch: [203/300] [ 400/1251] eta: 0:13:36 lr: 0.000484 loss: 3.032468 (2.882703) time: 0.969482 data: 0.000168 max mem: 18814 Epoch: [203/300] [ 450/1251] eta: 0:12:50 lr: 0.000484 loss: 2.911351 (2.885567) time: 0.977805 data: 0.000179 max mem: 18814 Epoch: [203/300] [ 500/1251] eta: 0:12:00 lr: 0.000483 loss: 2.998660 (2.890962) time: 0.915206 data: 0.000180 max mem: 18814 Epoch: [203/300] [ 550/1251] eta: 0:11:12 lr: 0.000483 loss: 3.008272 (2.884784) time: 0.992378 data: 0.000155 max mem: 18814 Epoch: [203/300] [ 600/1251] eta: 0:10:25 lr: 0.000483 loss: 3.012665 (2.882671) time: 0.977072 data: 0.000164 max mem: 18814 Epoch: [203/300] [ 650/1251] eta: 0:09:36 lr: 0.000482 loss: 2.874366 (2.884903) time: 0.983185 data: 0.000153 max mem: 18814 Epoch: [203/300] [ 700/1251] eta: 0:08:49 lr: 0.000482 loss: 2.888813 (2.888674) time: 0.999432 data: 0.000169 max mem: 18814 Epoch: [203/300] [ 750/1251] eta: 0:08:01 lr: 0.000482 loss: 3.196007 (2.894966) time: 0.911024 data: 0.000162 max mem: 18814 Epoch: [203/300] [ 800/1251] eta: 0:07:13 lr: 0.000481 loss: 3.120593 (2.895022) time: 0.985561 data: 0.000176 max mem: 18814 Epoch: [203/300] [ 850/1251] eta: 0:06:24 lr: 0.000481 loss: 2.845516 (2.893136) time: 0.913228 data: 0.000167 max mem: 18814 Epoch: [203/300] [ 900/1251] eta: 0:05:37 lr: 0.000481 loss: 2.671833 (2.883229) time: 0.987237 data: 0.000180 max mem: 18814 Epoch: [203/300] [ 950/1251] eta: 0:04:48 lr: 0.000480 loss: 3.030455 (2.884811) time: 0.914736 data: 0.000173 max mem: 18814 Epoch: [203/300] [1000/1251] eta: 0:04:00 lr: 0.000480 loss: 2.988266 (2.883026) time: 0.930580 data: 0.000166 max mem: 18814 Epoch: [203/300] [1050/1251] eta: 0:03:12 lr: 0.000480 loss: 2.946259 (2.886628) time: 1.052745 data: 0.000176 max mem: 18814 Epoch: [203/300] [1100/1251] eta: 0:02:24 lr: 0.000479 loss: 3.212346 (2.895461) time: 0.971989 data: 0.000181 max mem: 18814 Epoch: [203/300] [1150/1251] eta: 0:01:36 lr: 0.000479 loss: 2.939467 (2.894845) time: 0.938248 data: 0.000164 max mem: 18814 Epoch: [203/300] [1200/1251] eta: 0:00:48 lr: 0.000478 loss: 3.075003 (2.897190) time: 0.995767 data: 0.000175 max mem: 18814 Epoch: [203/300] [1250/1251] eta: 0:00:00 lr: 0.000478 loss: 3.064158 (2.898999) time: 0.914827 data: 0.000768 max mem: 18814 Epoch: [203/300] Total time: 0:20:00 (0.959302 s / it) Averaged stats: lr: 0.000478 loss: 3.064158 (2.889205) Test: [ 0/49] eta: 0:01:21 loss: 0.566342 (0.566342) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.665176 data: 1.192655 max mem: 18814 Test: [10/49] eta: 0:00:25 loss: 0.566342 (0.685643) acc1: 82.812500 (84.090909) acc5: 98.437500 (96.448864) time: 0.641971 data: 0.108573 max mem: 18814 Test: [20/49] eta: 0:00:14 loss: 0.740352 (0.715220) acc1: 82.812500 (82.886905) acc5: 96.875000 (96.354167) time: 0.446583 data: 0.000148 max mem: 18814 Test: [30/49] eta: 0:00:08 loss: 0.725467 (0.713939) acc1: 82.812500 (82.711694) acc5: 96.875000 (96.723790) time: 0.354233 data: 0.000136 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.738452 (0.727887) acc1: 81.250000 (82.240854) acc5: 98.437500 (96.722561) time: 0.352514 data: 0.000132 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.749074 (0.722018) acc1: 81.250000 (82.432000) acc5: 96.875000 (96.736000) time: 0.346745 data: 0.000104 max mem: 18814 Test: Total time: 0:00:20 (0.417903 s / it) * Acc@1 82.578 Acc@5 96.500 loss 0.732 Max accuracy: 82.61% Epoch: [204/300] [ 0/1251] eta: 0:41:23 lr: 0.000478 loss: 2.821573 (2.821573) time: 1.985074 data: 1.114470 max mem: 18814 Epoch: [204/300] [ 50/1251] eta: 0:19:23 lr: 0.000478 loss: 2.866262 (2.712319) time: 0.911701 data: 0.000146 max mem: 18814 Epoch: [204/300] [ 100/1251] eta: 0:18:21 lr: 0.000477 loss: 3.039748 (2.826420) time: 0.922622 data: 0.000160 max mem: 18814 Epoch: [204/300] [ 150/1251] eta: 0:17:40 lr: 0.000477 loss: 3.015784 (2.848463) time: 0.935714 data: 0.000170 max mem: 18814 Epoch: [204/300] [ 200/1251] eta: 0:16:48 lr: 0.000477 loss: 2.899987 (2.861788) time: 1.000873 data: 0.000149 max mem: 18814 Epoch: [204/300] [ 250/1251] eta: 0:16:06 lr: 0.000476 loss: 3.027984 (2.882006) time: 1.072990 data: 0.000199 max mem: 18814 Epoch: [204/300] [ 300/1251] eta: 0:15:12 lr: 0.000476 loss: 2.889318 (2.876514) time: 0.921281 data: 0.000168 max mem: 18814 Epoch: [204/300] [ 350/1251] eta: 0:14:25 lr: 0.000476 loss: 3.011325 (2.879752) time: 0.926290 data: 0.000150 max mem: 18814 Epoch: [204/300] [ 400/1251] eta: 0:13:38 lr: 0.000475 loss: 3.083480 (2.893892) time: 0.929980 data: 0.000168 max mem: 18814 Epoch: [204/300] [ 450/1251] eta: 0:12:50 lr: 0.000475 loss: 2.998952 (2.894169) time: 1.030398 data: 0.000160 max mem: 18814 Epoch: [204/300] [ 500/1251] eta: 0:12:02 lr: 0.000475 loss: 3.000542 (2.893745) time: 1.021948 data: 0.000179 max mem: 18814 Epoch: [204/300] [ 550/1251] eta: 0:11:12 lr: 0.000474 loss: 3.149555 (2.895130) time: 0.917239 data: 0.000159 max mem: 18814 Epoch: [204/300] [ 600/1251] eta: 0:10:24 lr: 0.000474 loss: 2.711437 (2.891533) time: 0.922657 data: 0.000166 max mem: 18814 Epoch: [204/300] [ 650/1251] eta: 0:09:35 lr: 0.000474 loss: 2.894231 (2.887460) time: 0.922976 data: 0.000152 max mem: 18814 Epoch: [204/300] [ 700/1251] eta: 0:08:47 lr: 0.000473 loss: 2.962415 (2.877554) time: 0.974784 data: 0.000157 max mem: 18814 Epoch: [204/300] [ 750/1251] eta: 0:08:00 lr: 0.000473 loss: 2.935763 (2.880724) time: 0.966268 data: 0.000173 max mem: 18814 Epoch: [204/300] [ 800/1251] eta: 0:07:12 lr: 0.000473 loss: 2.853960 (2.882013) time: 0.908444 data: 0.000160 max mem: 18814 Epoch: [204/300] [ 850/1251] eta: 0:06:24 lr: 0.000472 loss: 3.007085 (2.881839) time: 0.977282 data: 0.000173 max mem: 18814 Epoch: [204/300] [ 900/1251] eta: 0:05:36 lr: 0.000472 loss: 3.182651 (2.887659) time: 0.915320 data: 0.000155 max mem: 18814 Epoch: [204/300] [ 950/1251] eta: 0:04:48 lr: 0.000471 loss: 3.118396 (2.888243) time: 0.998047 data: 0.000158 max mem: 18814 Epoch: [204/300] [1000/1251] eta: 0:04:00 lr: 0.000471 loss: 2.923028 (2.897708) time: 0.947993 data: 0.000163 max mem: 18814 Epoch: [204/300] [1050/1251] eta: 0:03:12 lr: 0.000471 loss: 3.083245 (2.895374) time: 0.913319 data: 0.000153 max mem: 18814 Epoch: [204/300] [1100/1251] eta: 0:02:24 lr: 0.000470 loss: 3.076615 (2.896028) time: 0.979639 data: 0.000161 max mem: 18814 Epoch: [204/300] [1150/1251] eta: 0:01:36 lr: 0.000470 loss: 2.664413 (2.890868) time: 0.919775 data: 0.000156 max mem: 18814 Epoch: [204/300] [1200/1251] eta: 0:00:48 lr: 0.000470 loss: 3.159268 (2.895886) time: 0.978969 data: 0.000174 max mem: 18814 Epoch: [204/300] [1250/1251] eta: 0:00:00 lr: 0.000469 loss: 2.952488 (2.899142) time: 0.967785 data: 0.000810 max mem: 18814 Epoch: [204/300] Total time: 0:20:00 (0.959483 s / it) Averaged stats: lr: 0.000469 loss: 2.952488 (2.909564) Test: [ 0/49] eta: 0:01:17 loss: 0.506008 (0.506008) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.587267 data: 1.109544 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.545413 (0.688358) acc1: 84.375000 (84.090909) acc5: 98.437500 (96.306818) time: 0.496824 data: 0.101021 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.744707 (0.705231) acc1: 81.250000 (82.961310) acc5: 98.437500 (96.875000) time: 0.372096 data: 0.000160 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.706739 (0.705181) acc1: 81.250000 (82.661290) acc5: 98.437500 (97.127016) time: 0.361408 data: 0.000144 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.692844 (0.720491) acc1: 82.812500 (82.545732) acc5: 96.875000 (96.913110) time: 0.358167 data: 0.000131 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.706739 (0.715715) acc1: 81.250000 (82.592000) acc5: 96.875000 (96.928000) time: 0.347002 data: 0.000109 max mem: 18814 Test: Total time: 0:00:19 (0.389556 s / it) * Acc@1 82.532 Acc@5 96.438 loss 0.726 Max accuracy: 82.61% Epoch: [205/300] [ 0/1251] eta: 0:43:14 lr: 0.000469 loss: 2.852783 (2.852783) time: 2.073707 data: 1.160314 max mem: 18814 Epoch: [205/300] [ 50/1251] eta: 0:19:39 lr: 0.000469 loss: 2.773597 (2.828980) time: 0.978479 data: 0.000178 max mem: 18814 Epoch: [205/300] [ 100/1251] eta: 0:18:41 lr: 0.000469 loss: 2.386440 (2.807661) time: 0.950294 data: 0.000155 max mem: 18814 Epoch: [205/300] [ 150/1251] eta: 0:17:49 lr: 0.000468 loss: 2.940075 (2.815899) time: 1.001793 data: 0.000167 max mem: 18814 Epoch: [205/300] [ 200/1251] eta: 0:17:01 lr: 0.000468 loss: 3.141723 (2.855659) time: 0.999183 data: 0.000156 max mem: 18814 Epoch: [205/300] [ 250/1251] eta: 0:16:07 lr: 0.000468 loss: 3.078145 (2.871303) time: 0.917668 data: 0.000162 max mem: 18814 Epoch: [205/300] [ 300/1251] eta: 0:15:19 lr: 0.000467 loss: 2.971659 (2.881498) time: 0.969833 data: 0.000154 max mem: 18814 Epoch: [205/300] [ 350/1251] eta: 0:14:30 lr: 0.000467 loss: 3.023747 (2.883883) time: 0.954144 data: 0.000173 max mem: 18814 Epoch: [205/300] [ 400/1251] eta: 0:13:40 lr: 0.000467 loss: 2.901720 (2.874803) time: 0.969024 data: 0.000172 max mem: 18814 Epoch: [205/300] [ 450/1251] eta: 0:12:52 lr: 0.000466 loss: 2.593778 (2.867772) time: 0.983231 data: 0.000176 max mem: 18814 Epoch: [205/300] [ 500/1251] eta: 0:12:02 lr: 0.000466 loss: 2.875139 (2.859673) time: 0.920653 data: 0.000177 max mem: 18814 Epoch: [205/300] [ 550/1251] eta: 0:11:15 lr: 0.000466 loss: 2.870231 (2.852117) time: 0.991037 data: 0.000162 max mem: 18814 Epoch: [205/300] [ 600/1251] eta: 0:10:27 lr: 0.000465 loss: 3.099288 (2.859594) time: 0.971413 data: 0.000165 max mem: 18814 Epoch: [205/300] [ 650/1251] eta: 0:09:38 lr: 0.000465 loss: 2.910259 (2.857841) time: 0.991154 data: 0.000179 max mem: 18814 Epoch: [205/300] [ 700/1251] eta: 0:08:51 lr: 0.000465 loss: 3.043428 (2.858885) time: 0.984607 data: 0.000156 max mem: 18814 Epoch: [205/300] [ 750/1251] eta: 0:08:01 lr: 0.000464 loss: 2.958358 (2.857893) time: 0.908489 data: 0.000190 max mem: 18814 Epoch: [205/300] [ 800/1251] eta: 0:07:13 lr: 0.000464 loss: 3.067992 (2.863020) time: 0.964902 data: 0.000166 max mem: 18814 Epoch: [205/300] [ 850/1251] eta: 0:06:25 lr: 0.000463 loss: 3.161766 (2.868409) time: 0.949232 data: 0.000172 max mem: 18814 Epoch: [205/300] [ 900/1251] eta: 0:05:37 lr: 0.000463 loss: 2.961314 (2.861893) time: 0.964947 data: 0.000178 max mem: 18814 Epoch: [205/300] [ 950/1251] eta: 0:04:49 lr: 0.000463 loss: 2.766378 (2.857108) time: 0.994878 data: 0.000161 max mem: 18814 Epoch: [205/300] [1000/1251] eta: 0:04:01 lr: 0.000462 loss: 3.022665 (2.855981) time: 0.906444 data: 0.000163 max mem: 18814 Epoch: [205/300] [1050/1251] eta: 0:03:13 lr: 0.000462 loss: 2.853740 (2.855784) time: 0.978912 data: 0.000160 max mem: 18814 Epoch: [205/300] [1100/1251] eta: 0:02:24 lr: 0.000462 loss: 2.849095 (2.857287) time: 0.913253 data: 0.000186 max mem: 18814 Epoch: [205/300] [1150/1251] eta: 0:01:36 lr: 0.000461 loss: 3.089745 (2.856622) time: 0.968450 data: 0.000157 max mem: 18814 Epoch: [205/300] [1200/1251] eta: 0:00:48 lr: 0.000461 loss: 2.907948 (2.857946) time: 0.939960 data: 0.000168 max mem: 18814 Epoch: [205/300] [1250/1251] eta: 0:00:00 lr: 0.000461 loss: 3.159906 (2.862271) time: 0.920704 data: 0.000765 max mem: 18814 Epoch: [205/300] Total time: 0:20:00 (0.959859 s / it) Averaged stats: lr: 0.000461 loss: 3.159906 (2.865387) Test: [ 0/49] eta: 0:01:30 loss: 0.587980 (0.587980) acc1: 82.812500 (82.812500) acc5: 100.000000 (100.000000) time: 1.856359 data: 1.415939 max mem: 18814 Test: [10/49] eta: 0:00:20 loss: 0.587980 (0.698104) acc1: 84.375000 (85.085227) acc5: 96.875000 (96.875000) time: 0.523582 data: 0.128897 max mem: 18814 Test: [20/49] eta: 0:00:13 loss: 0.780018 (0.724810) acc1: 82.812500 (84.002976) acc5: 96.875000 (96.577381) time: 0.383347 data: 0.000163 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.719677 (0.722402) acc1: 81.250000 (83.165323) acc5: 96.875000 (96.622984) time: 0.364614 data: 0.000143 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.717696 (0.736108) acc1: 81.250000 (82.850610) acc5: 96.875000 (96.493902) time: 0.352589 data: 0.000149 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.717696 (0.731862) acc1: 81.250000 (82.784000) acc5: 95.312500 (96.576000) time: 0.355665 data: 0.000122 max mem: 18814 Test: Total time: 0:00:19 (0.401060 s / it) * Acc@1 82.570 Acc@5 96.454 loss 0.743 Max accuracy: 82.61% Epoch: [206/300] [ 0/1251] eta: 0:41:05 lr: 0.000461 loss: 2.436816 (2.436816) time: 1.970606 data: 1.101260 max mem: 18814 Epoch: [206/300] [ 50/1251] eta: 0:19:28 lr: 0.000460 loss: 3.028143 (2.926556) time: 0.990029 data: 0.000153 max mem: 18814 Epoch: [206/300] [ 100/1251] eta: 0:18:38 lr: 0.000460 loss: 3.051819 (2.930324) time: 0.996051 data: 0.000161 max mem: 18814 Epoch: [206/300] [ 150/1251] eta: 0:17:43 lr: 0.000460 loss: 3.110100 (2.932843) time: 0.918483 data: 0.000168 max mem: 18814 Epoch: [206/300] [ 200/1251] eta: 0:16:55 lr: 0.000459 loss: 3.006533 (2.927824) time: 0.989828 data: 0.000164 max mem: 18814 Epoch: [206/300] [ 250/1251] eta: 0:16:09 lr: 0.000459 loss: 3.043241 (2.909447) time: 0.990653 data: 0.000173 max mem: 18814 Epoch: [206/300] [ 300/1251] eta: 0:15:18 lr: 0.000459 loss: 2.968853 (2.923028) time: 0.946314 data: 0.000165 max mem: 18814 Epoch: [206/300] [ 350/1251] eta: 0:14:29 lr: 0.000458 loss: 2.875479 (2.909398) time: 0.981034 data: 0.000178 max mem: 18814 Epoch: [206/300] [ 400/1251] eta: 0:13:39 lr: 0.000458 loss: 3.136017 (2.907745) time: 0.936351 data: 0.000208 max mem: 18814 Epoch: [206/300] [ 450/1251] eta: 0:12:51 lr: 0.000458 loss: 3.028338 (2.914839) time: 0.962637 data: 0.000218 max mem: 18814 Epoch: [206/300] [ 500/1251] eta: 0:12:03 lr: 0.000457 loss: 2.663899 (2.904941) time: 0.996238 data: 0.000156 max mem: 18814 Epoch: [206/300] [ 550/1251] eta: 0:11:14 lr: 0.000457 loss: 3.075216 (2.905527) time: 0.955917 data: 0.000167 max mem: 18814 Epoch: [206/300] [ 600/1251] eta: 0:10:26 lr: 0.000457 loss: 3.119915 (2.907425) time: 0.969058 data: 0.000202 max mem: 18814 Epoch: [206/300] [ 650/1251] eta: 0:09:37 lr: 0.000456 loss: 3.019843 (2.911053) time: 0.930015 data: 0.000164 max mem: 18814 Epoch: [206/300] [ 700/1251] eta: 0:08:50 lr: 0.000456 loss: 2.844739 (2.906732) time: 0.959005 data: 0.000160 max mem: 18814 Epoch: [206/300] [ 750/1251] eta: 0:08:02 lr: 0.000456 loss: 2.731808 (2.902593) time: 0.968864 data: 0.000158 max mem: 18814 Epoch: [206/300] [ 800/1251] eta: 0:07:13 lr: 0.000455 loss: 2.871356 (2.904598) time: 0.913971 data: 0.000170 max mem: 18814 Epoch: [206/300] [ 850/1251] eta: 0:06:25 lr: 0.000455 loss: 3.089421 (2.914517) time: 0.929732 data: 0.000173 max mem: 18814 Epoch: [206/300] [ 900/1251] eta: 0:05:37 lr: 0.000455 loss: 2.872550 (2.914863) time: 0.933242 data: 0.000188 max mem: 18814 Epoch: [206/300] [ 950/1251] eta: 0:04:49 lr: 0.000454 loss: 2.746281 (2.913579) time: 1.042848 data: 0.000170 max mem: 18814 Epoch: [206/300] [1000/1251] eta: 0:04:00 lr: 0.000454 loss: 2.984350 (2.910026) time: 0.922072 data: 0.000163 max mem: 18814 Epoch: [206/300] [1050/1251] eta: 0:03:12 lr: 0.000454 loss: 3.187541 (2.908922) time: 0.913812 data: 0.000195 max mem: 18814 Epoch: [206/300] [1100/1251] eta: 0:02:25 lr: 0.000453 loss: 2.847081 (2.907504) time: 0.986314 data: 0.000207 max mem: 18814 Epoch: [206/300] [1150/1251] eta: 0:01:36 lr: 0.000453 loss: 3.052135 (2.908293) time: 0.918120 data: 0.000202 max mem: 18814 Epoch: [206/300] [1200/1251] eta: 0:00:48 lr: 0.000452 loss: 2.854168 (2.904866) time: 0.997779 data: 0.000204 max mem: 18814 Epoch: [206/300] [1250/1251] eta: 0:00:00 lr: 0.000452 loss: 2.546733 (2.901613) time: 0.952041 data: 0.000813 max mem: 18814 Epoch: [206/300] Total time: 0:20:01 (0.960807 s / it) Averaged stats: lr: 0.000452 loss: 2.546733 (2.898516) Test: [ 0/49] eta: 0:01:30 loss: 0.514783 (0.514783) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.846130 data: 1.451883 max mem: 18814 Test: [10/49] eta: 0:00:20 loss: 0.552773 (0.674636) acc1: 84.375000 (83.806818) acc5: 98.437500 (97.017045) time: 0.524636 data: 0.132134 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.752074 (0.700921) acc1: 81.250000 (82.886905) acc5: 96.875000 (96.949405) time: 0.373058 data: 0.000152 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.752074 (0.709959) acc1: 81.250000 (82.762097) acc5: 96.875000 (96.824597) time: 0.353816 data: 0.000162 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.735405 (0.722696) acc1: 82.812500 (82.583841) acc5: 96.875000 (96.798780) time: 0.351640 data: 0.000158 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.735405 (0.714863) acc1: 82.812500 (82.976000) acc5: 96.875000 (96.864000) time: 0.346283 data: 0.000118 max mem: 18814 Test: Total time: 0:00:19 (0.391936 s / it) * Acc@1 82.706 Acc@5 96.572 loss 0.723 Max accuracy: 82.71% Epoch: [207/300] [ 0/1251] eta: 0:41:26 lr: 0.000452 loss: 3.567383 (3.567383) time: 1.987295 data: 1.129700 max mem: 18814 Epoch: [207/300] [ 50/1251] eta: 0:19:40 lr: 0.000452 loss: 2.882490 (2.892200) time: 0.960716 data: 0.000176 max mem: 18814 Epoch: [207/300] [ 100/1251] eta: 0:18:33 lr: 0.000451 loss: 2.680350 (2.854531) time: 0.976094 data: 0.000163 max mem: 18814 Epoch: [207/300] [ 150/1251] eta: 0:17:46 lr: 0.000451 loss: 2.920959 (2.875374) time: 1.000697 data: 0.000174 max mem: 18814 Epoch: [207/300] [ 200/1251] eta: 0:16:50 lr: 0.000451 loss: 2.793577 (2.869456) time: 0.928071 data: 0.000169 max mem: 18814 Epoch: [207/300] [ 250/1251] eta: 0:16:04 lr: 0.000450 loss: 2.897549 (2.853909) time: 0.987790 data: 0.000188 max mem: 18814 Epoch: [207/300] [ 300/1251] eta: 0:15:18 lr: 0.000450 loss: 3.170685 (2.860732) time: 0.985009 data: 0.000164 max mem: 18814 Epoch: [207/300] [ 350/1251] eta: 0:14:27 lr: 0.000450 loss: 3.101693 (2.872035) time: 0.968760 data: 0.000159 max mem: 18814 Epoch: [207/300] [ 400/1251] eta: 0:13:39 lr: 0.000449 loss: 2.958025 (2.868131) time: 0.999404 data: 0.000171 max mem: 18814 Epoch: [207/300] [ 450/1251] eta: 0:12:49 lr: 0.000449 loss: 2.976758 (2.868125) time: 0.919309 data: 0.000159 max mem: 18814 Epoch: [207/300] [ 500/1251] eta: 0:12:00 lr: 0.000449 loss: 2.902746 (2.869364) time: 0.926372 data: 0.000152 max mem: 18814 Epoch: [207/300] [ 550/1251] eta: 0:11:13 lr: 0.000448 loss: 2.892448 (2.865747) time: 0.944588 data: 0.000176 max mem: 18814 Epoch: [207/300] [ 600/1251] eta: 0:10:26 lr: 0.000448 loss: 2.966868 (2.868430) time: 0.990027 data: 0.000166 max mem: 18814 Epoch: [207/300] [ 650/1251] eta: 0:09:39 lr: 0.000448 loss: 2.862126 (2.870039) time: 1.045959 data: 0.000166 max mem: 18814 Epoch: [207/300] [ 700/1251] eta: 0:08:49 lr: 0.000447 loss: 2.758000 (2.867570) time: 0.964726 data: 0.000153 max mem: 18814 Epoch: [207/300] [ 750/1251] eta: 0:08:01 lr: 0.000447 loss: 2.882948 (2.868395) time: 0.931340 data: 0.000160 max mem: 18814 Epoch: [207/300] [ 800/1251] eta: 0:07:13 lr: 0.000447 loss: 2.991281 (2.868075) time: 0.937582 data: 0.000161 max mem: 18814 Epoch: [207/300] [ 850/1251] eta: 0:06:25 lr: 0.000446 loss: 3.104295 (2.867098) time: 0.930362 data: 0.000160 max mem: 18814 Epoch: [207/300] [ 900/1251] eta: 0:05:38 lr: 0.000446 loss: 2.743574 (2.866139) time: 1.041444 data: 0.000158 max mem: 18814 Epoch: [207/300] [ 950/1251] eta: 0:04:49 lr: 0.000446 loss: 2.894913 (2.865906) time: 0.960362 data: 0.000165 max mem: 18814 Epoch: [207/300] [1000/1251] eta: 0:04:01 lr: 0.000445 loss: 2.899086 (2.869003) time: 0.929171 data: 0.000145 max mem: 18814 Epoch: [207/300] [1050/1251] eta: 0:03:13 lr: 0.000445 loss: 3.107126 (2.869071) time: 0.925897 data: 0.000164 max mem: 18814 Epoch: [207/300] [1100/1251] eta: 0:02:25 lr: 0.000445 loss: 3.189570 (2.874152) time: 0.939059 data: 0.000166 max mem: 18814 Epoch: [207/300] [1150/1251] eta: 0:01:37 lr: 0.000444 loss: 2.900980 (2.874190) time: 1.038874 data: 0.000175 max mem: 18814 Epoch: [207/300] [1200/1251] eta: 0:00:49 lr: 0.000444 loss: 3.177899 (2.878246) time: 0.998996 data: 0.000158 max mem: 18814 Epoch: [207/300] [1250/1251] eta: 0:00:00 lr: 0.000444 loss: 2.822596 (2.877906) time: 0.932473 data: 0.000746 max mem: 18814 Epoch: [207/300] Total time: 0:20:02 (0.961183 s / it) Averaged stats: lr: 0.000444 loss: 2.822596 (2.877134) Test: [ 0/49] eta: 0:01:27 loss: 0.562541 (0.562541) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.787557 data: 1.414424 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.576028 (0.665999) acc1: 82.812500 (84.943182) acc5: 96.875000 (96.164773) time: 0.491694 data: 0.128739 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.768618 (0.707475) acc1: 81.250000 (83.333333) acc5: 96.875000 (96.354167) time: 0.358011 data: 0.000162 max mem: 18814 Test: [30/49] eta: 0:00:09 loss: 0.745580 (0.706161) acc1: 81.250000 (82.963710) acc5: 96.875000 (96.320565) time: 0.464080 data: 0.000147 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.708099 (0.710369) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.303354) time: 0.461536 data: 0.000159 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.708099 (0.709245) acc1: 82.812500 (82.912000) acc5: 96.875000 (96.480000) time: 0.346207 data: 0.000136 max mem: 18814 Test: Total time: 0:00:21 (0.429707 s / it) * Acc@1 82.452 Acc@5 96.384 loss 0.724 Max accuracy: 82.71% Epoch: [208/300] [ 0/1251] eta: 2:16:41 lr: 0.000444 loss: 3.759382 (3.759382) time: 6.555966 data: 1.885853 max mem: 18510 Epoch: [208/300] [ 50/1251] eta: 0:21:48 lr: 0.000443 loss: 3.027862 (3.061368) time: 0.993255 data: 0.000160 max mem: 18814 Epoch: [208/300] [ 100/1251] eta: 0:19:36 lr: 0.000443 loss: 3.147747 (2.969067) time: 0.989149 data: 0.000153 max mem: 18814 Epoch: [208/300] [ 150/1251] eta: 0:18:20 lr: 0.000443 loss: 2.856766 (2.935210) time: 0.949833 data: 0.000166 max mem: 18814 Epoch: [208/300] [ 200/1251] eta: 0:17:21 lr: 0.000442 loss: 2.906574 (2.920979) time: 0.998269 data: 0.000162 max mem: 18814 Epoch: [208/300] [ 250/1251] eta: 0:16:29 lr: 0.000442 loss: 2.789497 (2.907051) time: 0.992378 data: 0.000165 max mem: 18814 Epoch: [208/300] [ 300/1251] eta: 0:15:34 lr: 0.000442 loss: 2.583311 (2.880995) time: 0.921830 data: 0.000145 max mem: 18814 Epoch: [208/300] [ 350/1251] eta: 0:14:42 lr: 0.000441 loss: 3.108508 (2.895985) time: 0.987855 data: 0.000173 max mem: 18814 Epoch: [208/300] [ 400/1251] eta: 0:13:52 lr: 0.000441 loss: 2.891778 (2.886127) time: 0.966653 data: 0.000183 max mem: 18814 Epoch: [208/300] [ 450/1251] eta: 0:13:00 lr: 0.000441 loss: 2.866877 (2.893489) time: 0.968311 data: 0.000196 max mem: 18814 Epoch: [208/300] [ 500/1251] eta: 0:12:11 lr: 0.000440 loss: 2.857787 (2.881896) time: 0.981450 data: 0.000159 max mem: 18814 Epoch: [208/300] [ 550/1251] eta: 0:11:19 lr: 0.000440 loss: 3.000110 (2.876451) time: 0.904980 data: 0.000160 max mem: 18814 Epoch: [208/300] [ 600/1251] eta: 0:10:31 lr: 0.000440 loss: 2.972651 (2.881333) time: 0.975878 data: 0.000160 max mem: 18814 Epoch: [208/300] [ 650/1251] eta: 0:09:42 lr: 0.000439 loss: 2.939317 (2.883865) time: 0.938878 data: 0.000163 max mem: 18814 Epoch: [208/300] [ 700/1251] eta: 0:08:53 lr: 0.000439 loss: 2.605635 (2.878325) time: 0.989719 data: 0.000164 max mem: 18814 Epoch: [208/300] [ 750/1251] eta: 0:08:05 lr: 0.000439 loss: 3.053746 (2.879385) time: 0.989051 data: 0.000162 max mem: 18814 Epoch: [208/300] [ 800/1251] eta: 0:07:15 lr: 0.000438 loss: 2.932221 (2.879905) time: 0.904075 data: 0.000164 max mem: 18814 Epoch: [208/300] [ 850/1251] eta: 0:06:27 lr: 0.000438 loss: 2.813078 (2.868006) time: 0.976605 data: 0.000179 max mem: 18814 Epoch: [208/300] [ 900/1251] eta: 0:05:38 lr: 0.000437 loss: 2.876396 (2.867066) time: 0.905233 data: 0.000164 max mem: 18814 Epoch: [208/300] [ 950/1251] eta: 0:04:50 lr: 0.000437 loss: 2.466626 (2.867077) time: 1.003950 data: 0.000176 max mem: 18814 Epoch: [208/300] [1000/1251] eta: 0:04:01 lr: 0.000437 loss: 3.094022 (2.868839) time: 0.929295 data: 0.000148 max mem: 18814 Epoch: [208/300] [1050/1251] eta: 0:03:13 lr: 0.000436 loss: 2.996115 (2.868780) time: 0.920469 data: 0.000166 max mem: 18814 Epoch: [208/300] [1100/1251] eta: 0:02:25 lr: 0.000436 loss: 2.856632 (2.867763) time: 0.971786 data: 0.000170 max mem: 18814 Epoch: [208/300] [1150/1251] eta: 0:01:37 lr: 0.000436 loss: 2.803890 (2.869422) time: 0.903589 data: 0.000184 max mem: 18814 Epoch: [208/300] [1200/1251] eta: 0:00:49 lr: 0.000435 loss: 3.160712 (2.873979) time: 0.971848 data: 0.000164 max mem: 18814 Epoch: [208/300] [1250/1251] eta: 0:00:00 lr: 0.000435 loss: 3.077091 (2.873930) time: 0.929932 data: 0.000815 max mem: 18814 Epoch: [208/300] Total time: 0:20:04 (0.963080 s / it) Averaged stats: lr: 0.000435 loss: 3.077091 (2.877641) Test: [ 0/49] eta: 0:01:39 loss: 0.542052 (0.542052) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 2.034451 data: 1.211511 max mem: 18814 Test: [10/49] eta: 0:00:19 loss: 0.549261 (0.700061) acc1: 82.812500 (83.238636) acc5: 98.437500 (96.732955) time: 0.511982 data: 0.110287 max mem: 18814 Test: [20/49] eta: 0:00:12 loss: 0.734721 (0.713821) acc1: 82.812500 (83.407738) acc5: 96.875000 (96.354167) time: 0.356325 data: 0.000145 max mem: 18814 Test: [30/49] eta: 0:00:07 loss: 0.734721 (0.715314) acc1: 82.812500 (83.014113) acc5: 96.875000 (96.320565) time: 0.353221 data: 0.000138 max mem: 18814 Test: [40/49] eta: 0:00:03 loss: 0.719563 (0.721373) acc1: 82.812500 (83.193598) acc5: 96.875000 (96.493902) time: 0.350742 data: 0.000132 max mem: 18814 Test: [48/49] eta: 0:00:00 loss: 0.723934 (0.717630) acc1: 82.812500 (83.232000) acc5: 96.875000 (96.512000) time: 0.351327 data: 0.000099 max mem: 18814 Test: Total time: 0:00:19 (0.392022 s / it) * Acc@1 82.810 Acc@5 96.474 loss 0.727 Max accuracy: 82.81% Epoch: [209/300] [ 0/1251] eta: 2:02:44 lr: 0.000435 loss: 3.429220 (3.429220) time: 5.887105 data: 1.976226 max mem: 18510 Epoch: [209/300] [ 50/1251] eta: 0:21:13 lr: 0.000435 loss: 2.912154 (3.011839) time: 0.918260 data: 0.000168 max mem: 18817 Epoch: [209/300] [ 100/1251] eta: 0:19:20 lr: 0.000434 loss: 3.001601 (2.943445) time: 1.015892 data: 0.000168 max mem: 18817 Epoch: [209/300] [ 150/1251] eta: 0:17:55 lr: 0.000434 loss: 2.821223 (2.923668) time: 0.913242 data: 0.000172 max mem: 18817 Epoch: [209/300] [ 200/1251] eta: 0:17:02 lr: 0.000434 loss: 2.851861 (2.895852) time: 0.967153 data: 0.000182 max mem: 18817 Epoch: [209/300] [ 250/1251] eta: 0:16:07 lr: 0.000433 loss: 2.776141 (2.866115) time: 0.970039 data: 0.000168 max mem: 18817 Epoch: [209/300] [ 300/1251] eta: 0:15:14 lr: 0.000433 loss: 2.548492 (2.845904) time: 0.969713 data: 0.000165 max mem: 18817 Epoch: [209/300] [ 350/1251] eta: 0:14:23 lr: 0.000433 loss: 2.954580 (2.862637) time: 0.965806 data: 0.000156 max mem: 18817 Epoch: [209/300] [ 400/1251] eta: 0:13:32 lr: 0.000432 loss: 2.823421 (2.855944) time: 0.956211 data: 0.000166 max mem: 18817 Epoch: [209/300] [ 450/1251] eta: 0:12:43 lr: 0.000432 loss: 2.894331 (2.870015) time: 0.927816 data: 0.000166 max mem: 18817 Epoch: [209/300] [ 500/1251] eta: 0:11:53 lr: 0.000432 loss: 2.887049 (2.857910) time: 0.953712 data: 0.000183 max mem: 18817 Epoch: [209/300] [ 550/1251] eta: 0:11:06 lr: 0.000431 loss: 2.979520 (2.860120) time: 0.981566 data: 0.000185 max mem: 18817 Epoch: [209/300] [ 600/1251] eta: 0:10:18 lr: 0.000431 loss: 3.065218 (2.869512) time: 0.958254 data: 0.000179 max mem: 18817 Epoch: [209/300] [ 650/1251] eta: 0:09:30 lr: 0.000431 loss: 2.879197 (2.871495) time: 0.957301 data: 0.000165 max mem: 18817 Epoch: [209/300] [ 700/1251] eta: 0:08:42 lr: 0.000430 loss: 2.820809 (2.868613) time: 0.917868 data: 0.000179 max mem: 18817 Epoch: [209/300] [ 750/1251] eta: 0:07:54 lr: 0.000430 loss: 2.957981 (2.868593) time: 0.945896 data: 0.000174 max mem: 18817 Epoch: [209/300] [ 800/1251] eta: 0:07:06 lr: 0.000430 loss: 2.876620 (2.871234) time: 0.922451 data: 0.000165 max mem: 18817 Epoch: [209/300] [ 850/1251] eta: 0:06:19 lr: 0.000429 loss: 2.840877 (2.863706) time: 0.963032 data: 0.000190 max mem: 18817 Epoch: [209/300] [ 900/1251] eta: 0:05:31 lr: 0.000429 loss: 2.929150 (2.862371) time: 0.932968 data: 0.000166 max mem: 18817 Epoch: [209/300] [ 950/1251] eta: 0:04:44 lr: 0.000429 loss: 2.461593 (2.860646) time: 1.046029 data: 0.000165 max mem: 18817 Epoch: [209/300] [1000/1251] eta: 0:03:57 lr: 0.000428 loss: 3.088037 (2.863609) time: 0.921762 data: 0.000168 max mem: 18817 Epoch: [209/300] [1050/1251] eta: 0:03:10 lr: 0.000428 loss: 2.942135 (2.863612) time: 0.986797 data: 0.000167 max mem: 18817 Epoch: [209/300] [1100/1251] eta: 0:02:22 lr: 0.000428 loss: 2.916154 (2.864066) time: 0.902371 data: 0.000177 max mem: 18817 Epoch: [209/300] [1150/1251] eta: 0:01:35 lr: 0.000427 loss: 2.741190 (2.866616) time: 0.983223 data: 0.000179 max mem: 18817 Epoch: [209/300] [1200/1251] eta: 0:00:48 lr: 0.000427 loss: 2.899888 (2.870114) time: 0.914982 data: 0.000164 max mem: 18817 Epoch: [209/300] [1250/1251] eta: 0:00:00 lr: 0.000427 loss: 2.956308 (2.867594) time: 0.951905 data: 0.000745 max mem: 18817 Epoch: [209/300] Total time: 0:19:41 (0.944559 s / it) Averaged stats: lr: 0.000427 loss: 2.956308 (2.869034) Test: [ 0/49] eta: 0:01:19 loss: 0.564065 (0.564065) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.613136 data: 1.199953 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.605197 (0.676127) acc1: 84.375000 (83.238636) acc5: 96.875000 (96.306818) time: 0.474158 data: 0.109231 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.745366 (0.714461) acc1: 81.250000 (82.217262) acc5: 96.875000 (96.354167) time: 0.355919 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.734517 (0.713383) acc1: 79.687500 (82.106855) acc5: 96.875000 (96.572581) time: 0.365375 data: 0.000120 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.726243 (0.723936) acc1: 82.812500 (82.240854) acc5: 96.875000 (96.455793) time: 0.367328 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.765267 (0.722837) acc1: 82.812500 (82.432000) acc5: 96.875000 (96.544000) time: 0.361880 data: 0.000108 max mem: 18817 Test: Total time: 0:00:19 (0.390340 s / it) * Acc@1 82.702 Acc@5 96.396 loss 0.734 Max accuracy: 82.81% Epoch: [210/300] [ 0/1251] eta: 0:40:42 lr: 0.000427 loss: 2.749877 (2.749877) time: 1.952514 data: 1.053203 max mem: 18817 Epoch: [210/300] [ 50/1251] eta: 0:19:42 lr: 0.000426 loss: 2.993291 (2.987717) time: 0.923412 data: 0.000169 max mem: 18817 Epoch: [210/300] [ 100/1251] eta: 0:18:40 lr: 0.000426 loss: 3.074547 (2.897087) time: 0.927038 data: 0.000182 max mem: 18817 Epoch: [210/300] [ 150/1251] eta: 0:17:45 lr: 0.000426 loss: 2.965070 (2.899203) time: 0.965224 data: 0.000178 max mem: 18817 Epoch: [210/300] [ 200/1251] eta: 0:16:50 lr: 0.000425 loss: 2.989846 (2.912197) time: 0.971076 data: 0.000185 max mem: 18817 Epoch: [210/300] [ 250/1251] eta: 0:15:56 lr: 0.000425 loss: 2.762729 (2.877486) time: 0.954962 data: 0.000191 max mem: 18817 Epoch: [210/300] [ 300/1251] eta: 0:15:04 lr: 0.000425 loss: 2.829890 (2.880165) time: 0.908718 data: 0.000173 max mem: 18817 Epoch: [210/300] [ 350/1251] eta: 0:14:19 lr: 0.000424 loss: 2.999873 (2.883027) time: 0.985265 data: 0.000166 max mem: 18817 Epoch: [210/300] [ 400/1251] eta: 0:13:32 lr: 0.000424 loss: 2.661210 (2.872473) time: 1.021253 data: 0.000161 max mem: 18817 Epoch: [210/300] [ 450/1251] eta: 0:12:43 lr: 0.000424 loss: 3.235456 (2.879255) time: 0.955014 data: 0.000187 max mem: 18817 Epoch: [210/300] [ 500/1251] eta: 0:11:55 lr: 0.000423 loss: 2.805815 (2.869721) time: 0.984842 data: 0.000174 max mem: 18817 Epoch: [210/300] [ 550/1251] eta: 0:11:06 lr: 0.000423 loss: 2.960880 (2.873394) time: 0.915837 data: 0.000178 max mem: 18817 Epoch: [210/300] [ 600/1251] eta: 0:10:19 lr: 0.000423 loss: 2.759035 (2.872335) time: 0.997706 data: 0.000175 max mem: 18817 Epoch: [210/300] [ 650/1251] eta: 0:09:32 lr: 0.000422 loss: 2.899474 (2.869085) time: 1.029588 data: 0.000164 max mem: 18817 Epoch: [210/300] [ 700/1251] eta: 0:08:44 lr: 0.000422 loss: 2.859451 (2.860457) time: 0.974471 data: 0.000187 max mem: 18817 Epoch: [210/300] [ 750/1251] eta: 0:07:56 lr: 0.000422 loss: 2.922952 (2.851871) time: 0.910878 data: 0.000173 max mem: 18817 Epoch: [210/300] [ 800/1251] eta: 0:07:08 lr: 0.000421 loss: 2.908092 (2.856750) time: 0.915765 data: 0.000185 max mem: 18817 Epoch: [210/300] [ 850/1251] eta: 0:06:21 lr: 0.000421 loss: 3.014636 (2.856262) time: 0.975921 data: 0.000167 max mem: 18817 Epoch: [210/300] [ 900/1251] eta: 0:05:34 lr: 0.000421 loss: 3.061231 (2.858712) time: 1.024440 data: 0.000167 max mem: 18817 Epoch: [210/300] [ 950/1251] eta: 0:04:46 lr: 0.000420 loss: 2.868520 (2.858306) time: 0.908340 data: 0.000184 max mem: 18817 Epoch: [210/300] [1000/1251] eta: 0:03:58 lr: 0.000420 loss: 2.760942 (2.854389) time: 0.980005 data: 0.000180 max mem: 18817 Epoch: [210/300] [1050/1251] eta: 0:03:11 lr: 0.000420 loss: 2.798864 (2.855178) time: 0.906652 data: 0.000168 max mem: 18817 Epoch: [210/300] [1100/1251] eta: 0:02:23 lr: 0.000419 loss: 2.809372 (2.858438) time: 0.949576 data: 0.000186 max mem: 18817 Epoch: [210/300] [1150/1251] eta: 0:01:35 lr: 0.000419 loss: 2.747290 (2.857455) time: 0.969668 data: 0.000169 max mem: 18817 Epoch: [210/300] [1200/1251] eta: 0:00:48 lr: 0.000419 loss: 3.108500 (2.860450) time: 0.917367 data: 0.000179 max mem: 18817 Epoch: [210/300] [1250/1251] eta: 0:00:00 lr: 0.000418 loss: 2.875549 (2.859110) time: 0.924260 data: 0.000777 max mem: 18817 Epoch: [210/300] Total time: 0:19:48 (0.949963 s / it) Averaged stats: lr: 0.000418 loss: 2.875549 (2.859425) Test: [ 0/49] eta: 0:01:35 loss: 0.578229 (0.578229) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.942583 data: 1.497807 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.578229 (0.672916) acc1: 82.812500 (84.232955) acc5: 98.437500 (96.590909) time: 0.500092 data: 0.136285 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.714769 (0.693883) acc1: 82.812500 (83.110119) acc5: 96.875000 (96.651786) time: 0.353802 data: 0.000124 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.730367 (0.703412) acc1: 79.687500 (82.711694) acc5: 96.875000 (96.723790) time: 0.351260 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.737808 (0.717222) acc1: 82.812500 (82.621951) acc5: 96.875000 (96.570122) time: 0.405928 data: 0.000140 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.763076 (0.712756) acc1: 82.812500 (82.848000) acc5: 96.875000 (96.736000) time: 0.433382 data: 0.000123 max mem: 18817 Test: Total time: 0:00:20 (0.422036 s / it) * Acc@1 82.856 Acc@5 96.568 loss 0.727 Max accuracy: 82.86% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0210.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0210.pth Epoch: [211/300] [ 0/1251] eta: 0:43:45 lr: 0.000418 loss: 2.730502 (2.730502) time: 2.098451 data: 1.223786 max mem: 18817 Epoch: [211/300] [ 50/1251] eta: 0:19:05 lr: 0.000418 loss: 2.922567 (2.903287) time: 0.920951 data: 0.000177 max mem: 18817 Epoch: [211/300] [ 100/1251] eta: 0:18:20 lr: 0.000418 loss: 3.066688 (2.945691) time: 0.910655 data: 0.000190 max mem: 18817 Epoch: [211/300] [ 150/1251] eta: 0:17:32 lr: 0.000417 loss: 3.140826 (2.909410) time: 0.968707 data: 0.000174 max mem: 18817 Epoch: [211/300] [ 200/1251] eta: 0:16:40 lr: 0.000417 loss: 3.095593 (2.930953) time: 0.970789 data: 0.000187 max mem: 18817 Epoch: [211/300] [ 250/1251] eta: 0:15:53 lr: 0.000417 loss: 2.732802 (2.924548) time: 0.950073 data: 0.000176 max mem: 18817 Epoch: [211/300] [ 300/1251] eta: 0:15:04 lr: 0.000416 loss: 2.862645 (2.905832) time: 0.913751 data: 0.000167 max mem: 18817 Epoch: [211/300] [ 350/1251] eta: 0:14:15 lr: 0.000416 loss: 2.830408 (2.897770) time: 0.911929 data: 0.000158 max mem: 18817 Epoch: [211/300] [ 400/1251] eta: 0:13:28 lr: 0.000416 loss: 2.959836 (2.873719) time: 0.916434 data: 0.000175 max mem: 18817 Epoch: [211/300] [ 450/1251] eta: 0:12:41 lr: 0.000415 loss: 2.724079 (2.875133) time: 0.952880 data: 0.000184 max mem: 18817 Epoch: [211/300] [ 500/1251] eta: 0:11:52 lr: 0.000415 loss: 3.087740 (2.872080) time: 0.976370 data: 0.000186 max mem: 18817 Epoch: [211/300] [ 550/1251] eta: 0:11:04 lr: 0.000415 loss: 3.050726 (2.871276) time: 0.922482 data: 0.000152 max mem: 18817 Epoch: [211/300] [ 600/1251] eta: 0:10:18 lr: 0.000414 loss: 2.928850 (2.864043) time: 0.921035 data: 0.000188 max mem: 18817 Epoch: [211/300] [ 650/1251] eta: 0:09:30 lr: 0.000414 loss: 2.802284 (2.860532) time: 0.913141 data: 0.000160 max mem: 18817 Epoch: [211/300] [ 700/1251] eta: 0:08:43 lr: 0.000414 loss: 2.960348 (2.858819) time: 0.949611 data: 0.000165 max mem: 18817 Epoch: [211/300] [ 750/1251] eta: 0:07:56 lr: 0.000413 loss: 3.038963 (2.866348) time: 0.956196 data: 0.000439 max mem: 18817 Epoch: [211/300] [ 800/1251] eta: 0:07:08 lr: 0.000413 loss: 2.916592 (2.866152) time: 0.967362 data: 0.000176 max mem: 18817 Epoch: [211/300] [ 850/1251] eta: 0:06:20 lr: 0.000413 loss: 2.911824 (2.864612) time: 0.911573 data: 0.000185 max mem: 18817 Epoch: [211/300] [ 900/1251] eta: 0:05:33 lr: 0.000412 loss: 2.861050 (2.862364) time: 0.908002 data: 0.000173 max mem: 18817 Epoch: [211/300] [ 950/1251] eta: 0:04:45 lr: 0.000412 loss: 2.631886 (2.859427) time: 0.949695 data: 0.000171 max mem: 18817 Epoch: [211/300] [1000/1251] eta: 0:03:58 lr: 0.000412 loss: 2.903735 (2.857003) time: 0.975013 data: 0.000443 max mem: 18817 Epoch: [211/300] [1050/1251] eta: 0:03:10 lr: 0.000411 loss: 2.839695 (2.855471) time: 0.960714 data: 0.000169 max mem: 18817 Epoch: [211/300] [1100/1251] eta: 0:02:23 lr: 0.000411 loss: 2.829698 (2.857380) time: 0.907838 data: 0.000176 max mem: 18817 Epoch: [211/300] [1150/1251] eta: 0:01:35 lr: 0.000411 loss: 2.919947 (2.862903) time: 0.905729 data: 0.000167 max mem: 18817 Epoch: [211/300] [1200/1251] eta: 0:00:48 lr: 0.000410 loss: 2.863775 (2.864771) time: 0.933667 data: 0.000181 max mem: 18817 Epoch: [211/300] [1250/1251] eta: 0:00:00 lr: 0.000410 loss: 2.957560 (2.868587) time: 0.963224 data: 0.000774 max mem: 18817 Epoch: [211/300] Total time: 0:19:47 (0.949585 s / it) Averaged stats: lr: 0.000410 loss: 2.957560 (2.870421) Test: [ 0/49] eta: 0:01:16 loss: 0.539973 (0.539973) acc1: 84.375000 (84.375000) acc5: 96.875000 (96.875000) time: 1.564010 data: 1.174432 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.594779 (0.703969) acc1: 84.375000 (83.380682) acc5: 96.875000 (96.306818) time: 0.469429 data: 0.106895 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.744285 (0.727241) acc1: 82.812500 (83.333333) acc5: 96.875000 (96.651786) time: 0.362788 data: 0.000129 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.725102 (0.729533) acc1: 82.812500 (82.862903) acc5: 98.437500 (96.925403) time: 0.358438 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.722232 (0.742347) acc1: 82.812500 (82.736280) acc5: 96.875000 (96.760671) time: 0.348501 data: 0.000146 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.774375 (0.734423) acc1: 82.812500 (83.168000) acc5: 96.875000 (96.864000) time: 0.342960 data: 0.000122 max mem: 18817 Test: Total time: 0:00:18 (0.379722 s / it) * Acc@1 82.766 Acc@5 96.500 loss 0.754 Max accuracy: 82.86% Epoch: [212/300] [ 0/1251] eta: 0:40:55 lr: 0.000410 loss: 2.573201 (2.573201) time: 1.962704 data: 1.088464 max mem: 18817 Epoch: [212/300] [ 50/1251] eta: 0:19:40 lr: 0.000410 loss: 2.949645 (2.845366) time: 0.917346 data: 0.000183 max mem: 18817 Epoch: [212/300] [ 100/1251] eta: 0:18:36 lr: 0.000409 loss: 3.029136 (2.810517) time: 0.961323 data: 0.000165 max mem: 18817 Epoch: [212/300] [ 150/1251] eta: 0:17:32 lr: 0.000409 loss: 2.779046 (2.781623) time: 0.964085 data: 0.000181 max mem: 18817 Epoch: [212/300] [ 200/1251] eta: 0:16:46 lr: 0.000409 loss: 2.620475 (2.785505) time: 0.971078 data: 0.000164 max mem: 18817 Epoch: [212/300] [ 250/1251] eta: 0:15:53 lr: 0.000408 loss: 2.745121 (2.805788) time: 0.912843 data: 0.000166 max mem: 18817 Epoch: [212/300] [ 300/1251] eta: 0:15:07 lr: 0.000408 loss: 3.134574 (2.822013) time: 0.953336 data: 0.000169 max mem: 18817 Epoch: [212/300] [ 350/1251] eta: 0:14:18 lr: 0.000408 loss: 2.760598 (2.802327) time: 0.955047 data: 0.000182 max mem: 18817 Epoch: [212/300] [ 400/1251] eta: 0:13:30 lr: 0.000407 loss: 2.950007 (2.808586) time: 0.983345 data: 0.000174 max mem: 18817 Epoch: [212/300] [ 450/1251] eta: 0:12:41 lr: 0.000407 loss: 2.712466 (2.818353) time: 0.923351 data: 0.000182 max mem: 18817 Epoch: [212/300] [ 500/1251] eta: 0:11:53 lr: 0.000407 loss: 3.018734 (2.823357) time: 0.908233 data: 0.000187 max mem: 18817 Epoch: [212/300] [ 550/1251] eta: 0:11:07 lr: 0.000406 loss: 2.829182 (2.821317) time: 0.987160 data: 0.000183 max mem: 18817 Epoch: [212/300] [ 600/1251] eta: 0:10:20 lr: 0.000406 loss: 2.955309 (2.821621) time: 0.968611 data: 0.000167 max mem: 18817 Epoch: [212/300] [ 650/1251] eta: 0:09:31 lr: 0.000406 loss: 3.019712 (2.828693) time: 0.957672 data: 0.000168 max mem: 18817 Epoch: [212/300] [ 700/1251] eta: 0:08:43 lr: 0.000405 loss: 2.851269 (2.823239) time: 0.922559 data: 0.000163 max mem: 18817 Epoch: [212/300] [ 750/1251] eta: 0:07:56 lr: 0.000405 loss: 2.746333 (2.825991) time: 0.922502 data: 0.000177 max mem: 18817 Epoch: [212/300] [ 800/1251] eta: 0:07:09 lr: 0.000405 loss: 2.938587 (2.828470) time: 0.959483 data: 0.000180 max mem: 18817 Epoch: [212/300] [ 850/1251] eta: 0:06:21 lr: 0.000404 loss: 2.924397 (2.830856) time: 1.024856 data: 0.000168 max mem: 18817 Epoch: [212/300] [ 900/1251] eta: 0:05:33 lr: 0.000404 loss: 2.919578 (2.833969) time: 0.980706 data: 0.000173 max mem: 18817 Epoch: [212/300] [ 950/1251] eta: 0:04:46 lr: 0.000404 loss: 3.082051 (2.841670) time: 0.921989 data: 0.000177 max mem: 18817 Epoch: [212/300] [1000/1251] eta: 0:03:58 lr: 0.000403 loss: 3.003982 (2.842274) time: 0.915145 data: 0.000164 max mem: 18817 Epoch: [212/300] [1050/1251] eta: 0:03:11 lr: 0.000403 loss: 3.013151 (2.843360) time: 0.970500 data: 0.000189 max mem: 18817 Epoch: [212/300] [1100/1251] eta: 0:02:23 lr: 0.000403 loss: 2.977362 (2.843532) time: 0.954211 data: 0.000178 max mem: 18817 Epoch: [212/300] [1150/1251] eta: 0:01:36 lr: 0.000403 loss: 2.764713 (2.838584) time: 0.971879 data: 0.000186 max mem: 18817 Epoch: [212/300] [1200/1251] eta: 0:00:48 lr: 0.000402 loss: 3.068343 (2.844800) time: 0.919463 data: 0.000178 max mem: 18817 Epoch: [212/300] [1250/1251] eta: 0:00:00 lr: 0.000402 loss: 2.991142 (2.846855) time: 0.916337 data: 0.000788 max mem: 18817 Epoch: [212/300] Total time: 0:19:49 (0.950763 s / it) Averaged stats: lr: 0.000402 loss: 2.991142 (2.854183) Test: [ 0/49] eta: 0:01:28 loss: 0.597639 (0.597639) acc1: 81.250000 (81.250000) acc5: 96.875000 (96.875000) time: 1.797122 data: 1.379536 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.597639 (0.688128) acc1: 82.812500 (83.948864) acc5: 96.875000 (96.448864) time: 0.488056 data: 0.125549 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.779528 (0.715530) acc1: 82.812500 (83.184524) acc5: 96.875000 (96.502976) time: 0.402108 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.729099 (0.706718) acc1: 82.812500 (83.064516) acc5: 96.875000 (96.673387) time: 0.442870 data: 0.000146 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.713830 (0.723222) acc1: 82.812500 (82.964939) acc5: 96.875000 (96.455793) time: 0.393051 data: 0.000139 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.757757 (0.721726) acc1: 81.250000 (82.848000) acc5: 95.312500 (96.480000) time: 0.344334 data: 0.000112 max mem: 18817 Test: Total time: 0:00:20 (0.419151 s / it) * Acc@1 82.824 Acc@5 96.548 loss 0.725 Max accuracy: 82.86% Epoch: [213/300] [ 0/1251] eta: 0:39:39 lr: 0.000402 loss: 2.994387 (2.994387) time: 1.902416 data: 1.052127 max mem: 18817 Epoch: [213/300] [ 50/1251] eta: 0:18:53 lr: 0.000402 loss: 2.967396 (2.803802) time: 0.946576 data: 0.000170 max mem: 18817 Epoch: [213/300] [ 100/1251] eta: 0:17:59 lr: 0.000401 loss: 2.895923 (2.805848) time: 0.913193 data: 0.000183 max mem: 18817 Epoch: [213/300] [ 150/1251] eta: 0:17:17 lr: 0.000401 loss: 2.777758 (2.809469) time: 0.912967 data: 0.000174 max mem: 18817 Epoch: [213/300] [ 200/1251] eta: 0:16:37 lr: 0.000401 loss: 2.852136 (2.836444) time: 0.984747 data: 0.000181 max mem: 18817 Epoch: [213/300] [ 250/1251] eta: 0:15:51 lr: 0.000400 loss: 3.146953 (2.861164) time: 1.019937 data: 0.000170 max mem: 18817 Epoch: [213/300] [ 300/1251] eta: 0:15:04 lr: 0.000400 loss: 2.948543 (2.864003) time: 1.000967 data: 0.000170 max mem: 18817 Epoch: [213/300] [ 350/1251] eta: 0:14:15 lr: 0.000400 loss: 3.045284 (2.859490) time: 0.928002 data: 0.000169 max mem: 18817 Epoch: [213/300] [ 400/1251] eta: 0:13:27 lr: 0.000399 loss: 3.039540 (2.853770) time: 0.913968 data: 0.000171 max mem: 18817 Epoch: [213/300] [ 450/1251] eta: 0:12:40 lr: 0.000399 loss: 2.906479 (2.852691) time: 0.965234 data: 0.000163 max mem: 18817 Epoch: [213/300] [ 500/1251] eta: 0:11:52 lr: 0.000399 loss: 2.980883 (2.852120) time: 0.977765 data: 0.000190 max mem: 18817 Epoch: [213/300] [ 550/1251] eta: 0:11:05 lr: 0.000398 loss: 2.895706 (2.844526) time: 0.970286 data: 0.000178 max mem: 18817 Epoch: [213/300] [ 600/1251] eta: 0:10:18 lr: 0.000398 loss: 2.828341 (2.839771) time: 0.929140 data: 0.000185 max mem: 18817 Epoch: [213/300] [ 650/1251] eta: 0:09:31 lr: 0.000398 loss: 2.933559 (2.848796) time: 0.925062 data: 0.000172 max mem: 18817 Epoch: [213/300] [ 700/1251] eta: 0:08:44 lr: 0.000397 loss: 2.933850 (2.850043) time: 0.960069 data: 0.000175 max mem: 18817 Epoch: [213/300] [ 750/1251] eta: 0:07:56 lr: 0.000397 loss: 2.820710 (2.846021) time: 0.980765 data: 0.000170 max mem: 18817 Epoch: [213/300] [ 800/1251] eta: 0:07:09 lr: 0.000397 loss: 2.711049 (2.847729) time: 0.957337 data: 0.000177 max mem: 18817 Epoch: [213/300] [ 850/1251] eta: 0:06:21 lr: 0.000396 loss: 2.781133 (2.849467) time: 0.904291 data: 0.000173 max mem: 18817 Epoch: [213/300] [ 900/1251] eta: 0:05:33 lr: 0.000396 loss: 2.789938 (2.855254) time: 0.912992 data: 0.000178 max mem: 18817 Epoch: [213/300] [ 950/1251] eta: 0:04:46 lr: 0.000396 loss: 2.983813 (2.857094) time: 0.979101 data: 0.000175 max mem: 18817 Epoch: [213/300] [1000/1251] eta: 0:03:58 lr: 0.000395 loss: 2.908634 (2.860811) time: 0.970881 data: 0.000179 max mem: 18817 Epoch: [213/300] [1050/1251] eta: 0:03:11 lr: 0.000395 loss: 2.869562 (2.859490) time: 0.968474 data: 0.000188 max mem: 18817 Epoch: [213/300] [1100/1251] eta: 0:02:23 lr: 0.000395 loss: 2.909923 (2.865054) time: 0.908322 data: 0.000187 max mem: 18817 Epoch: [213/300] [1150/1251] eta: 0:01:36 lr: 0.000394 loss: 2.855867 (2.861586) time: 0.969073 data: 0.000167 max mem: 18817 Epoch: [213/300] [1200/1251] eta: 0:00:48 lr: 0.000394 loss: 3.067199 (2.863685) time: 0.966493 data: 0.000174 max mem: 18817 Epoch: [213/300] [1250/1251] eta: 0:00:00 lr: 0.000394 loss: 3.193816 (2.863666) time: 0.955520 data: 0.000771 max mem: 18817 Epoch: [213/300] Total time: 0:19:49 (0.951096 s / it) Averaged stats: lr: 0.000394 loss: 3.193816 (2.861140) Test: [ 0/49] eta: 0:01:20 loss: 0.550413 (0.550413) acc1: 82.812500 (82.812500) acc5: 96.875000 (96.875000) time: 1.636424 data: 1.212856 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.640825 (0.697567) acc1: 82.812500 (83.380682) acc5: 96.875000 (96.306818) time: 0.482222 data: 0.110386 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.745692 (0.721166) acc1: 81.250000 (82.366071) acc5: 96.875000 (96.354167) time: 0.453244 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.741825 (0.717041) acc1: 81.250000 (82.610887) acc5: 96.875000 (96.622984) time: 0.445484 data: 0.000145 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.722478 (0.732248) acc1: 84.375000 (82.888720) acc5: 96.875000 (96.532012) time: 0.348687 data: 0.000141 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.751901 (0.733020) acc1: 84.375000 (82.784000) acc5: 96.875000 (96.576000) time: 0.343741 data: 0.000119 max mem: 18817 Test: Total time: 0:00:20 (0.419144 s / it) * Acc@1 82.830 Acc@5 96.476 loss 0.749 Max accuracy: 82.86% Epoch: [214/300] [ 0/1251] eta: 0:42:30 lr: 0.000394 loss: 3.054783 (3.054783) time: 2.038882 data: 1.160975 max mem: 18817 Epoch: [214/300] [ 50/1251] eta: 0:19:07 lr: 0.000393 loss: 3.172383 (2.989666) time: 0.907239 data: 0.000181 max mem: 18817 Epoch: [214/300] [ 100/1251] eta: 0:18:18 lr: 0.000393 loss: 2.914330 (2.900884) time: 0.977384 data: 0.000193 max mem: 18817 Epoch: [214/300] [ 150/1251] eta: 0:17:36 lr: 0.000393 loss: 2.976514 (2.909072) time: 0.981550 data: 0.000180 max mem: 18817 Epoch: [214/300] [ 200/1251] eta: 0:16:42 lr: 0.000392 loss: 2.993768 (2.873472) time: 0.975643 data: 0.000181 max mem: 18817 Epoch: [214/300] [ 250/1251] eta: 0:15:51 lr: 0.000392 loss: 2.909461 (2.854106) time: 0.913491 data: 0.000171 max mem: 18817 Epoch: [214/300] [ 300/1251] eta: 0:15:05 lr: 0.000392 loss: 2.840713 (2.860237) time: 0.906981 data: 0.000186 max mem: 18817 Epoch: [214/300] [ 350/1251] eta: 0:14:19 lr: 0.000391 loss: 2.753940 (2.858049) time: 0.987148 data: 0.000189 max mem: 18817 Epoch: [214/300] [ 400/1251] eta: 0:13:32 lr: 0.000391 loss: 2.803815 (2.852379) time: 0.970943 data: 0.000164 max mem: 18817 Epoch: [214/300] [ 450/1251] eta: 0:12:42 lr: 0.000391 loss: 3.009731 (2.847181) time: 0.955627 data: 0.000177 max mem: 18817 Epoch: [214/300] [ 500/1251] eta: 0:11:53 lr: 0.000390 loss: 2.647129 (2.847009) time: 0.915307 data: 0.000173 max mem: 18817 Epoch: [214/300] [ 550/1251] eta: 0:11:06 lr: 0.000390 loss: 2.952947 (2.848251) time: 0.986148 data: 0.000182 max mem: 18817 Epoch: [214/300] [ 600/1251] eta: 0:10:19 lr: 0.000390 loss: 2.675755 (2.844825) time: 1.015260 data: 0.000179 max mem: 18817 Epoch: [214/300] [ 650/1251] eta: 0:09:31 lr: 0.000389 loss: 2.796042 (2.842231) time: 0.946505 data: 0.000169 max mem: 18817 Epoch: [214/300] [ 700/1251] eta: 0:08:43 lr: 0.000389 loss: 2.570921 (2.833348) time: 0.910878 data: 0.000164 max mem: 18817 Epoch: [214/300] [ 750/1251] eta: 0:07:56 lr: 0.000389 loss: 2.767634 (2.828509) time: 0.911121 data: 0.000192 max mem: 18817 Epoch: [214/300] [ 800/1251] eta: 0:07:08 lr: 0.000389 loss: 2.626021 (2.829946) time: 0.972214 data: 0.000171 max mem: 18817 Epoch: [214/300] [ 850/1251] eta: 0:06:20 lr: 0.000388 loss: 2.992752 (2.829510) time: 0.951724 data: 0.000184 max mem: 18817 Epoch: [214/300] [ 900/1251] eta: 0:05:33 lr: 0.000388 loss: 3.113091 (2.831311) time: 0.912803 data: 0.000169 max mem: 18817 Epoch: [214/300] [ 950/1251] eta: 0:04:46 lr: 0.000388 loss: 2.871601 (2.830527) time: 0.936260 data: 0.000178 max mem: 18817 Epoch: [214/300] [1000/1251] eta: 0:03:58 lr: 0.000387 loss: 2.930332 (2.834245) time: 0.980112 data: 0.000165 max mem: 18817 Epoch: [214/300] [1050/1251] eta: 0:03:10 lr: 0.000387 loss: 2.972793 (2.837031) time: 0.971063 data: 0.000195 max mem: 18817 Epoch: [214/300] [1100/1251] eta: 0:02:23 lr: 0.000387 loss: 2.977491 (2.838741) time: 0.919262 data: 0.000173 max mem: 18817 Epoch: [214/300] [1150/1251] eta: 0:01:35 lr: 0.000386 loss: 2.780402 (2.837710) time: 0.914683 data: 0.000166 max mem: 18817 Epoch: [214/300] [1200/1251] eta: 0:00:48 lr: 0.000386 loss: 2.812263 (2.836180) time: 0.980370 data: 0.000173 max mem: 18817 Epoch: [214/300] [1250/1251] eta: 0:00:00 lr: 0.000386 loss: 2.852838 (2.836214) time: 1.024985 data: 0.000759 max mem: 18817 Epoch: [214/300] Total time: 0:19:50 (0.951418 s / it) Averaged stats: lr: 0.000386 loss: 2.852838 (2.838866) Test: [ 0/49] eta: 0:01:28 loss: 0.555256 (0.555256) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.875000) time: 1.813045 data: 1.402372 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.555256 (0.677826) acc1: 84.375000 (83.238636) acc5: 96.875000 (96.306818) time: 0.488830 data: 0.127622 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.717427 (0.695551) acc1: 81.250000 (82.812500) acc5: 96.875000 (96.205357) time: 0.354109 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.708477 (0.697414) acc1: 81.250000 (82.862903) acc5: 96.875000 (96.471774) time: 0.351771 data: 0.000140 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.708477 (0.709133) acc1: 82.812500 (82.964939) acc5: 96.875000 (96.455793) time: 0.348903 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.725897 (0.709802) acc1: 82.812500 (83.136000) acc5: 96.875000 (96.544000) time: 0.343724 data: 0.000102 max mem: 18817 Test: Total time: 0:00:18 (0.381447 s / it) * Acc@1 83.030 Acc@5 96.578 loss 0.723 Max accuracy: 83.03% Epoch: [215/300] [ 0/1251] eta: 0:38:53 lr: 0.000386 loss: 2.275071 (2.275071) time: 1.865481 data: 0.987132 max mem: 18817 Epoch: [215/300] [ 50/1251] eta: 0:19:20 lr: 0.000385 loss: 3.004951 (2.790197) time: 0.905355 data: 0.000172 max mem: 18817 Epoch: [215/300] [ 100/1251] eta: 0:18:18 lr: 0.000385 loss: 2.800784 (2.807855) time: 0.956963 data: 0.000157 max mem: 18817 Epoch: [215/300] [ 150/1251] eta: 0:17:33 lr: 0.000385 loss: 2.840244 (2.819437) time: 0.977635 data: 0.000179 max mem: 18817 Epoch: [215/300] [ 200/1251] eta: 0:16:40 lr: 0.000384 loss: 3.055587 (2.849193) time: 0.971380 data: 0.000175 max mem: 18817 Epoch: [215/300] [ 250/1251] eta: 0:15:50 lr: 0.000384 loss: 2.847425 (2.826256) time: 0.910231 data: 0.000184 max mem: 18817 Epoch: [215/300] [ 300/1251] eta: 0:15:04 lr: 0.000384 loss: 2.817604 (2.831296) time: 0.923651 data: 0.000167 max mem: 18817 Epoch: [215/300] [ 350/1251] eta: 0:14:17 lr: 0.000383 loss: 2.711727 (2.828408) time: 0.966066 data: 0.000173 max mem: 18817 Epoch: [215/300] [ 400/1251] eta: 0:13:31 lr: 0.000383 loss: 2.953108 (2.838639) time: 0.973678 data: 0.000162 max mem: 18817 Epoch: [215/300] [ 450/1251] eta: 0:12:42 lr: 0.000383 loss: 2.866605 (2.841371) time: 0.971212 data: 0.000182 max mem: 18817 Epoch: [215/300] [ 500/1251] eta: 0:11:53 lr: 0.000382 loss: 2.800274 (2.835395) time: 0.915416 data: 0.000170 max mem: 18817 Epoch: [215/300] [ 550/1251] eta: 0:11:05 lr: 0.000382 loss: 2.713714 (2.828425) time: 0.923559 data: 0.000174 max mem: 18817 Epoch: [215/300] [ 600/1251] eta: 0:10:19 lr: 0.000382 loss: 3.011700 (2.832340) time: 0.971604 data: 0.000182 max mem: 18817 Epoch: [215/300] [ 650/1251] eta: 0:09:31 lr: 0.000381 loss: 2.873056 (2.831849) time: 1.000207 data: 0.000187 max mem: 18817 Epoch: [215/300] [ 700/1251] eta: 0:08:43 lr: 0.000381 loss: 2.898556 (2.834820) time: 0.963744 data: 0.000173 max mem: 18817 Epoch: [215/300] [ 750/1251] eta: 0:07:55 lr: 0.000381 loss: 2.888302 (2.828920) time: 0.914954 data: 0.000182 max mem: 18817 Epoch: [215/300] [ 800/1251] eta: 0:07:08 lr: 0.000380 loss: 2.769601 (2.831131) time: 0.919548 data: 0.000186 max mem: 18817 Epoch: [215/300] [ 850/1251] eta: 0:06:21 lr: 0.000380 loss: 2.876556 (2.833450) time: 0.960865 data: 0.000179 max mem: 18817 Epoch: [215/300] [ 900/1251] eta: 0:05:33 lr: 0.000380 loss: 2.866548 (2.837459) time: 0.967919 data: 0.000177 max mem: 18817 Epoch: [215/300] [ 950/1251] eta: 0:04:45 lr: 0.000379 loss: 2.934370 (2.838235) time: 0.945815 data: 0.000183 max mem: 18817 torch.distributed: socket accepted connection from 10.248.221.206:18886. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:26104. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:29874. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:35708. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:41774. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:45514. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:48536. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) Epoch: [215/300] [1000/1251] eta: 0:03:58 lr: 0.000379 loss: 3.206818 (2.840563) time: 0.914894 data: 0.000168 max mem: 18817 torch.distributed: socket accepted connection from 10.248.221.206:52006. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:56254. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:61356. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:10154. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) torch.distributed: socket accepted connection from 10.248.221.206:13962. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) Epoch: [215/300] [1050/1251] eta: 0:03:10 lr: 0.000379 loss: 3.028788 (2.844336) time: 0.913893 data: 0.000175 max mem: 18817 torch.distributed: socket accepted connection from 10.248.221.206:914. (To turn off this message, please set BYTED_TORCH_C10D_LOG_LEVEL={WARNING, ERROR, CRITICAL}) Epoch: [215/300] [1100/1251] eta: 0:02:23 lr: 0.000379 loss: 2.917890 (2.846521) time: 0.965284 data: 0.000176 max mem: 18817 Epoch: [215/300] [1150/1251] eta: 0:01:35 lr: 0.000378 loss: 2.979493 (2.845903) time: 0.976525 data: 0.000165 max mem: 18817 Epoch: [215/300] [1200/1251] eta: 0:00:48 lr: 0.000378 loss: 2.882793 (2.843921) time: 0.937159 data: 0.000171 max mem: 18817 Epoch: [215/300] [1250/1251] eta: 0:00:00 lr: 0.000378 loss: 2.997112 (2.847724) time: 0.914510 data: 0.000868 max mem: 18817 Epoch: [215/300] Total time: 0:19:48 (0.950156 s / it) Averaged stats: lr: 0.000378 loss: 2.997112 (2.841274) Test: [ 0/49] eta: 0:01:27 loss: 0.494167 (0.494167) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.787681 data: 1.358303 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.571393 (0.689186) acc1: 84.375000 (83.522727) acc5: 98.437500 (96.732955) time: 0.487498 data: 0.123618 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.715451 (0.704254) acc1: 82.812500 (82.514881) acc5: 96.875000 (96.800595) time: 0.374287 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.692117 (0.706370) acc1: 81.250000 (82.610887) acc5: 96.875000 (96.975806) time: 0.371644 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.694713 (0.718832) acc1: 82.812500 (82.736280) acc5: 96.875000 (96.875000) time: 0.353932 data: 0.000119 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.781941 (0.717834) acc1: 82.812500 (82.944000) acc5: 96.875000 (96.928000) time: 0.348539 data: 0.000099 max mem: 18817 Test: Total time: 0:00:19 (0.391788 s / it) * Acc@1 82.968 Acc@5 96.596 loss 0.729 Max accuracy: 83.03% Epoch: [216/300] [ 0/1251] eta: 0:39:34 lr: 0.000378 loss: 2.891284 (2.891284) time: 1.897993 data: 1.031039 max mem: 18817 Epoch: [216/300] [ 50/1251] eta: 0:19:37 lr: 0.000377 loss: 2.784231 (2.841300) time: 1.019795 data: 0.000188 max mem: 18817 Epoch: [216/300] [ 100/1251] eta: 0:18:29 lr: 0.000377 loss: 2.839287 (2.862862) time: 0.985506 data: 0.000178 max mem: 18817 Epoch: [216/300] [ 150/1251] eta: 0:17:30 lr: 0.000377 loss: 2.776162 (2.848376) time: 0.917891 data: 0.000189 max mem: 18817 Epoch: [216/300] [ 200/1251] eta: 0:16:43 lr: 0.000376 loss: 2.694990 (2.858378) time: 0.904285 data: 0.000175 max mem: 18817 Epoch: [216/300] [ 250/1251] eta: 0:15:56 lr: 0.000376 loss: 2.452936 (2.846780) time: 0.972235 data: 0.000186 max mem: 18817 Epoch: [216/300] [ 300/1251] eta: 0:15:07 lr: 0.000376 loss: 2.690389 (2.842859) time: 0.991875 data: 0.000180 max mem: 18817 Epoch: [216/300] [ 350/1251] eta: 0:14:19 lr: 0.000375 loss: 2.983650 (2.843708) time: 0.970630 data: 0.000176 max mem: 18817 Epoch: [216/300] [ 400/1251] eta: 0:13:31 lr: 0.000375 loss: 2.834756 (2.833686) time: 0.929212 data: 0.000174 max mem: 18817 Epoch: [216/300] [ 450/1251] eta: 0:12:43 lr: 0.000375 loss: 2.991889 (2.825864) time: 0.905806 data: 0.000180 max mem: 18817 Epoch: [216/300] [ 500/1251] eta: 0:11:57 lr: 0.000374 loss: 2.944182 (2.829864) time: 0.980638 data: 0.000182 max mem: 18817 Epoch: [216/300] [ 550/1251] eta: 0:11:09 lr: 0.000374 loss: 2.870330 (2.835403) time: 0.977774 data: 0.000182 max mem: 18817 Epoch: [216/300] [ 600/1251] eta: 0:10:22 lr: 0.000374 loss: 2.766445 (2.826983) time: 1.006703 data: 0.000193 max mem: 18817 Epoch: [216/300] [ 650/1251] eta: 0:09:34 lr: 0.000373 loss: 3.059335 (2.830205) time: 0.925799 data: 0.000165 max mem: 18817 Epoch: [216/300] [ 700/1251] eta: 0:08:46 lr: 0.000373 loss: 2.827932 (2.836391) time: 0.906630 data: 0.000171 max mem: 18817 Epoch: [216/300] [ 750/1251] eta: 0:07:59 lr: 0.000373 loss: 2.427409 (2.836877) time: 0.992261 data: 0.000169 max mem: 18817 Epoch: [216/300] [ 800/1251] eta: 0:07:10 lr: 0.000372 loss: 2.730005 (2.837197) time: 0.955133 data: 0.000165 max mem: 18817 Epoch: [216/300] [ 850/1251] eta: 0:06:22 lr: 0.000372 loss: 2.665917 (2.834453) time: 0.938528 data: 0.000183 max mem: 18817 Epoch: [216/300] [ 900/1251] eta: 0:05:35 lr: 0.000372 loss: 2.774263 (2.832148) time: 0.919080 data: 0.000182 max mem: 18817 Epoch: [216/300] [ 950/1251] eta: 0:04:47 lr: 0.000372 loss: 3.111692 (2.843014) time: 0.948737 data: 0.000168 max mem: 18817 Epoch: [216/300] [1000/1251] eta: 0:03:59 lr: 0.000371 loss: 2.827785 (2.841626) time: 0.975693 data: 0.000173 max mem: 18817 Epoch: [216/300] [1050/1251] eta: 0:03:11 lr: 0.000371 loss: 2.904618 (2.843710) time: 0.948043 data: 0.000184 max mem: 18817 Epoch: [216/300] [1100/1251] eta: 0:02:23 lr: 0.000371 loss: 2.773733 (2.839056) time: 0.915826 data: 0.000175 max mem: 18817 Epoch: [216/300] [1150/1251] eta: 0:01:36 lr: 0.000370 loss: 2.801648 (2.839172) time: 0.913358 data: 0.000177 max mem: 18817 Epoch: [216/300] [1200/1251] eta: 0:00:48 lr: 0.000370 loss: 2.881951 (2.837826) time: 0.963244 data: 0.000163 max mem: 18817 Epoch: [216/300] [1250/1251] eta: 0:00:00 lr: 0.000370 loss: 2.944416 (2.838521) time: 1.000940 data: 0.000759 max mem: 18817 Epoch: [216/300] Total time: 0:19:53 (0.953884 s / it) Averaged stats: lr: 0.000370 loss: 2.944416 (2.834515) Test: [ 0/49] eta: 0:01:25 loss: 0.464180 (0.464180) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.743633 data: 1.340275 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.574163 (0.668992) acc1: 84.375000 (83.806818) acc5: 98.437500 (97.301136) time: 0.486498 data: 0.121989 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.705929 (0.687868) acc1: 82.812500 (83.258929) acc5: 96.875000 (97.098214) time: 0.357581 data: 0.000168 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.705884 (0.689469) acc1: 81.250000 (82.812500) acc5: 96.875000 (97.127016) time: 0.353345 data: 0.000160 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.712945 (0.707267) acc1: 82.812500 (83.041159) acc5: 96.875000 (96.989329) time: 0.349661 data: 0.000143 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.727925 (0.705956) acc1: 82.812500 (82.880000) acc5: 96.875000 (97.024000) time: 0.343999 data: 0.000121 max mem: 18817 Test: Total time: 0:00:18 (0.381785 s / it) * Acc@1 83.062 Acc@5 96.526 loss 0.715 Max accuracy: 83.06% Epoch: [217/300] [ 0/1251] eta: 0:42:28 lr: 0.000370 loss: 2.783849 (2.783849) time: 2.037225 data: 1.163620 max mem: 18817 Epoch: [217/300] [ 50/1251] eta: 0:19:43 lr: 0.000369 loss: 2.956763 (2.924640) time: 0.909085 data: 0.000172 max mem: 18817 Epoch: [217/300] [ 100/1251] eta: 0:18:47 lr: 0.000369 loss: 2.961913 (2.894130) time: 0.986264 data: 0.000187 max mem: 18817 Epoch: [217/300] [ 150/1251] eta: 0:17:44 lr: 0.000369 loss: 2.589512 (2.871552) time: 0.978569 data: 0.000180 max mem: 18817 Epoch: [217/300] [ 200/1251] eta: 0:16:47 lr: 0.000368 loss: 3.207279 (2.884552) time: 0.930875 data: 0.000171 max mem: 18817 Epoch: [217/300] [ 250/1251] eta: 0:16:02 lr: 0.000368 loss: 3.068213 (2.880339) time: 0.934726 data: 0.000169 max mem: 18817 Epoch: [217/300] [ 300/1251] eta: 0:15:15 lr: 0.000368 loss: 2.923927 (2.882930) time: 0.917382 data: 0.000167 max mem: 18817 Epoch: [217/300] [ 350/1251] eta: 0:14:28 lr: 0.000367 loss: 2.883429 (2.873786) time: 0.987983 data: 0.000166 max mem: 18817 Epoch: [217/300] [ 400/1251] eta: 0:13:37 lr: 0.000367 loss: 2.963196 (2.863759) time: 0.980544 data: 0.000169 max mem: 18817 Epoch: [217/300] [ 450/1251] eta: 0:12:47 lr: 0.000367 loss: 2.934197 (2.850823) time: 0.960102 data: 0.000183 max mem: 18817 Epoch: [217/300] [ 500/1251] eta: 0:11:59 lr: 0.000366 loss: 2.874183 (2.850005) time: 0.966437 data: 0.000188 max mem: 18817 Epoch: [217/300] [ 550/1251] eta: 0:11:09 lr: 0.000366 loss: 2.884684 (2.843318) time: 0.913467 data: 0.000180 max mem: 18817 Epoch: [217/300] [ 600/1251] eta: 0:10:22 lr: 0.000366 loss: 2.787696 (2.833158) time: 0.909490 data: 0.000175 max mem: 18817 Epoch: [217/300] [ 650/1251] eta: 0:09:34 lr: 0.000366 loss: 2.608667 (2.833133) time: 0.978360 data: 0.000170 max mem: 18817 Epoch: [217/300] [ 700/1251] eta: 0:08:46 lr: 0.000365 loss: 2.850818 (2.837420) time: 0.986344 data: 0.000156 max mem: 18817 Epoch: [217/300] [ 750/1251] eta: 0:07:58 lr: 0.000365 loss: 2.853888 (2.840164) time: 0.928761 data: 0.000173 max mem: 18817 Epoch: [217/300] [ 800/1251] eta: 0:07:10 lr: 0.000365 loss: 2.892039 (2.839287) time: 0.918068 data: 0.000168 max mem: 18817 Epoch: [217/300] [ 850/1251] eta: 0:06:23 lr: 0.000364 loss: 2.864478 (2.840208) time: 1.016835 data: 0.000189 max mem: 18817 Epoch: [217/300] [ 900/1251] eta: 0:05:35 lr: 0.000364 loss: 2.811108 (2.839289) time: 0.910226 data: 0.000183 max mem: 18817 Epoch: [217/300] [ 950/1251] eta: 0:04:47 lr: 0.000364 loss: 2.859499 (2.836335) time: 0.952700 data: 0.000176 max mem: 18817 Epoch: [217/300] [1000/1251] eta: 0:03:59 lr: 0.000363 loss: 2.980629 (2.841475) time: 0.964267 data: 0.000179 max mem: 18817 Epoch: [217/300] [1050/1251] eta: 0:03:11 lr: 0.000363 loss: 2.706742 (2.840058) time: 0.917628 data: 0.000179 max mem: 18817 Epoch: [217/300] [1100/1251] eta: 0:02:24 lr: 0.000363 loss: 2.902042 (2.839534) time: 0.936478 data: 0.000180 max mem: 18817 Epoch: [217/300] [1150/1251] eta: 0:01:36 lr: 0.000362 loss: 3.000312 (2.841711) time: 0.914871 data: 0.000185 max mem: 18817 Epoch: [217/300] [1200/1251] eta: 0:00:48 lr: 0.000362 loss: 2.813570 (2.837476) time: 0.980042 data: 0.000170 max mem: 18817 Epoch: [217/300] [1250/1251] eta: 0:00:00 lr: 0.000362 loss: 3.074590 (2.838661) time: 0.961033 data: 0.000755 max mem: 18817 Epoch: [217/300] Total time: 0:19:54 (0.954486 s / it) Averaged stats: lr: 0.000362 loss: 3.074590 (2.837009) Test: [ 0/49] eta: 0:01:24 loss: 0.473916 (0.473916) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.724086 data: 1.323525 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.572784 (0.674478) acc1: 84.375000 (83.948864) acc5: 98.437500 (97.159091) time: 0.481528 data: 0.120469 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.732017 (0.703197) acc1: 82.812500 (83.333333) acc5: 96.875000 (96.949405) time: 0.355240 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.727045 (0.698694) acc1: 82.812500 (83.014113) acc5: 96.875000 (96.925403) time: 0.352368 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.711310 (0.710631) acc1: 82.812500 (83.346037) acc5: 96.875000 (96.798780) time: 0.371153 data: 0.000151 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.741115 (0.711756) acc1: 82.812500 (83.264000) acc5: 96.875000 (96.896000) time: 0.371985 data: 0.000121 max mem: 18817 Test: Total time: 0:00:19 (0.392603 s / it) * Acc@1 83.020 Acc@5 96.600 loss 0.725 Max accuracy: 83.06% Epoch: [218/300] [ 0/1251] eta: 0:41:46 lr: 0.000362 loss: 2.077452 (2.077452) time: 2.003979 data: 1.137063 max mem: 18817 Epoch: [218/300] [ 50/1251] eta: 0:19:05 lr: 0.000361 loss: 2.985559 (2.833290) time: 0.909359 data: 0.000188 max mem: 18817 Epoch: [218/300] [ 100/1251] eta: 0:18:19 lr: 0.000361 loss: 3.004010 (2.856364) time: 0.928823 data: 0.000166 max mem: 18817 Epoch: [218/300] [ 150/1251] eta: 0:17:32 lr: 0.000361 loss: 2.692505 (2.819116) time: 0.978792 data: 0.000172 max mem: 18817 Epoch: [218/300] [ 200/1251] eta: 0:16:45 lr: 0.000361 loss: 2.826724 (2.846023) time: 1.033012 data: 0.000175 max mem: 18817 Epoch: [218/300] [ 250/1251] eta: 0:15:53 lr: 0.000360 loss: 3.022814 (2.870940) time: 0.966901 data: 0.000178 max mem: 18817 Epoch: [218/300] [ 300/1251] eta: 0:15:04 lr: 0.000360 loss: 2.862935 (2.865135) time: 0.920886 data: 0.000184 max mem: 18817 Epoch: [218/300] [ 350/1251] eta: 0:14:17 lr: 0.000360 loss: 2.921246 (2.849872) time: 0.908941 data: 0.000165 max mem: 18817 Epoch: [218/300] [ 400/1251] eta: 0:13:30 lr: 0.000359 loss: 3.064932 (2.844110) time: 0.976958 data: 0.000188 max mem: 18817 Epoch: [218/300] [ 450/1251] eta: 0:12:44 lr: 0.000359 loss: 3.014288 (2.845404) time: 1.020416 data: 0.000170 max mem: 18817 Epoch: [218/300] [ 500/1251] eta: 0:11:55 lr: 0.000359 loss: 2.795771 (2.832709) time: 0.975278 data: 0.000180 max mem: 18817 Epoch: [218/300] [ 550/1251] eta: 0:11:07 lr: 0.000358 loss: 2.814549 (2.834494) time: 0.931626 data: 0.000181 max mem: 18817 Epoch: [218/300] [ 600/1251] eta: 0:10:20 lr: 0.000358 loss: 2.897184 (2.831299) time: 0.913870 data: 0.000189 max mem: 18817 Epoch: [218/300] [ 650/1251] eta: 0:09:33 lr: 0.000358 loss: 2.921801 (2.834744) time: 0.956015 data: 0.000178 max mem: 18817 Epoch: [218/300] [ 700/1251] eta: 0:08:45 lr: 0.000357 loss: 2.870371 (2.832909) time: 0.981589 data: 0.000170 max mem: 18817 Epoch: [218/300] [ 750/1251] eta: 0:07:57 lr: 0.000357 loss: 2.912021 (2.833489) time: 0.923265 data: 0.000196 max mem: 18817 Epoch: [218/300] [ 800/1251] eta: 0:07:09 lr: 0.000357 loss: 3.000119 (2.833411) time: 0.919615 data: 0.000175 max mem: 18817 Epoch: [218/300] [ 850/1251] eta: 0:06:22 lr: 0.000356 loss: 2.725254 (2.829715) time: 0.907681 data: 0.000178 max mem: 18817 Epoch: [218/300] [ 900/1251] eta: 0:05:34 lr: 0.000356 loss: 2.847836 (2.822882) time: 0.965966 data: 0.000184 max mem: 18817 Epoch: [218/300] [ 950/1251] eta: 0:04:46 lr: 0.000356 loss: 2.670296 (2.818636) time: 0.965019 data: 0.000188 max mem: 18817 Epoch: [218/300] [1000/1251] eta: 0:03:59 lr: 0.000356 loss: 3.063831 (2.819389) time: 0.913481 data: 0.000169 max mem: 18817 Epoch: [218/300] [1050/1251] eta: 0:03:11 lr: 0.000355 loss: 2.895792 (2.820810) time: 0.938426 data: 0.000175 max mem: 18817 Epoch: [218/300] [1100/1251] eta: 0:02:23 lr: 0.000355 loss: 2.913976 (2.820335) time: 0.908247 data: 0.000160 max mem: 18817 Epoch: [218/300] [1150/1251] eta: 0:01:36 lr: 0.000355 loss: 2.826225 (2.815786) time: 0.958142 data: 0.000169 max mem: 18817 Epoch: [218/300] [1200/1251] eta: 0:00:48 lr: 0.000354 loss: 2.788485 (2.813211) time: 0.954363 data: 0.000180 max mem: 18817 Epoch: [218/300] [1250/1251] eta: 0:00:00 lr: 0.000354 loss: 2.950695 (2.815511) time: 0.951053 data: 0.000770 max mem: 18817 Epoch: [218/300] Total time: 0:19:51 (0.952387 s / it) Averaged stats: lr: 0.000354 loss: 2.950695 (2.809394) Test: [ 0/49] eta: 0:01:30 loss: 0.524062 (0.524062) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.838980 data: 1.441483 max mem: 18817 Test: [10/49] eta: 0:00:25 loss: 0.566437 (0.684739) acc1: 82.812500 (83.664773) acc5: 96.875000 (96.306818) time: 0.666389 data: 0.131188 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.728405 (0.702403) acc1: 81.250000 (83.035714) acc5: 96.875000 (96.502976) time: 0.450493 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.686945 (0.693333) acc1: 81.250000 (82.913306) acc5: 96.875000 (96.572581) time: 0.352184 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.689893 (0.704931) acc1: 82.812500 (83.041159) acc5: 95.312500 (96.455793) time: 0.350207 data: 0.000147 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.690187 (0.705241) acc1: 82.812500 (83.008000) acc5: 95.312500 (96.448000) time: 0.344802 data: 0.000124 max mem: 18817 Test: Total time: 0:00:20 (0.423161 s / it) * Acc@1 83.070 Acc@5 96.566 loss 0.717 Max accuracy: 83.07% Epoch: [219/300] [ 0/1251] eta: 0:42:37 lr: 0.000354 loss: 3.055252 (3.055252) time: 2.044037 data: 1.181344 max mem: 18817 Epoch: [219/300] [ 50/1251] eta: 0:19:15 lr: 0.000354 loss: 2.789497 (2.859584) time: 0.916300 data: 0.000164 max mem: 18817 Epoch: [219/300] [ 100/1251] eta: 0:18:29 lr: 0.000353 loss: 2.757586 (2.862593) time: 0.955365 data: 0.000168 max mem: 18817 Epoch: [219/300] [ 150/1251] eta: 0:17:40 lr: 0.000353 loss: 2.864695 (2.860783) time: 0.960768 data: 0.000172 max mem: 18817 Epoch: [219/300] [ 200/1251] eta: 0:16:43 lr: 0.000353 loss: 2.989766 (2.868536) time: 0.954827 data: 0.000170 max mem: 18817 Epoch: [219/300] [ 250/1251] eta: 0:15:52 lr: 0.000352 loss: 2.650342 (2.874060) time: 0.923453 data: 0.000177 max mem: 18817 Epoch: [219/300] [ 300/1251] eta: 0:15:08 lr: 0.000352 loss: 2.580584 (2.858196) time: 0.925299 data: 0.000175 max mem: 18817 Epoch: [219/300] [ 350/1251] eta: 0:14:23 lr: 0.000352 loss: 2.905709 (2.856128) time: 0.993350 data: 0.000166 max mem: 18817 Epoch: [219/300] [ 400/1251] eta: 0:13:35 lr: 0.000351 loss: 2.929156 (2.857767) time: 0.949297 data: 0.000180 max mem: 18817 Epoch: [219/300] [ 450/1251] eta: 0:12:44 lr: 0.000351 loss: 2.849047 (2.855706) time: 0.955724 data: 0.000208 max mem: 18817 Epoch: [219/300] [ 500/1251] eta: 0:11:56 lr: 0.000351 loss: 2.943444 (2.840992) time: 0.925011 data: 0.000174 max mem: 18817 Epoch: [219/300] [ 550/1251] eta: 0:11:09 lr: 0.000351 loss: 2.878773 (2.848131) time: 0.908550 data: 0.000215 max mem: 18817 Epoch: [219/300] [ 600/1251] eta: 0:10:21 lr: 0.000350 loss: 2.983701 (2.848029) time: 0.962857 data: 0.000175 max mem: 18817 Epoch: [219/300] [ 650/1251] eta: 0:09:33 lr: 0.000350 loss: 2.913036 (2.845858) time: 0.975010 data: 0.000177 max mem: 18817 Epoch: [219/300] [ 700/1251] eta: 0:08:45 lr: 0.000350 loss: 2.829490 (2.844284) time: 0.969253 data: 0.000171 max mem: 18817 Epoch: [219/300] [ 750/1251] eta: 0:07:56 lr: 0.000349 loss: 2.610678 (2.841102) time: 0.911400 data: 0.000197 max mem: 18817 Epoch: [219/300] [ 800/1251] eta: 0:07:09 lr: 0.000349 loss: 2.771347 (2.837735) time: 0.918691 data: 0.000176 max mem: 18817 Epoch: [219/300] [ 850/1251] eta: 0:06:21 lr: 0.000349 loss: 3.005868 (2.836920) time: 0.973177 data: 0.000175 max mem: 18817 Epoch: [219/300] [ 900/1251] eta: 0:05:33 lr: 0.000348 loss: 2.241440 (2.831837) time: 0.969542 data: 0.000186 max mem: 18817 Epoch: [219/300] [ 950/1251] eta: 0:04:46 lr: 0.000348 loss: 2.801324 (2.835054) time: 0.974099 data: 0.000167 max mem: 18817 Epoch: [219/300] [1000/1251] eta: 0:03:58 lr: 0.000348 loss: 2.905819 (2.835255) time: 0.928768 data: 0.000177 max mem: 18817 Epoch: [219/300] [1050/1251] eta: 0:03:11 lr: 0.000347 loss: 2.965491 (2.829363) time: 0.907988 data: 0.000211 max mem: 18817 Epoch: [219/300] [1100/1251] eta: 0:02:23 lr: 0.000347 loss: 2.668573 (2.824703) time: 0.974332 data: 0.000206 max mem: 18817 Epoch: [219/300] [1150/1251] eta: 0:01:36 lr: 0.000347 loss: 2.502360 (2.819277) time: 0.975076 data: 0.000229 max mem: 18817 Epoch: [219/300] [1200/1251] eta: 0:00:48 lr: 0.000347 loss: 2.875405 (2.820273) time: 0.967600 data: 0.000171 max mem: 18817 Epoch: [219/300] [1250/1251] eta: 0:00:00 lr: 0.000346 loss: 2.903811 (2.822624) time: 0.909773 data: 0.000768 max mem: 18817 Epoch: [219/300] Total time: 0:19:50 (0.951457 s / it) Averaged stats: lr: 0.000346 loss: 2.903811 (2.818837) Test: [ 0/49] eta: 0:01:25 loss: 0.476886 (0.476886) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.747277 data: 1.334645 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.605184 (0.687044) acc1: 82.812500 (84.232955) acc5: 96.875000 (96.590909) time: 0.483224 data: 0.121461 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.750461 (0.707039) acc1: 82.812500 (83.705357) acc5: 96.875000 (96.651786) time: 0.368569 data: 0.000129 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.713068 (0.707628) acc1: 82.812500 (83.417339) acc5: 96.875000 (96.572581) time: 0.365559 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.713068 (0.714945) acc1: 82.812500 (83.536585) acc5: 96.875000 (96.532012) time: 0.349430 data: 0.000155 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.715980 (0.715241) acc1: 82.812500 (83.328000) acc5: 96.875000 (96.608000) time: 0.344668 data: 0.000123 max mem: 18817 Test: Total time: 0:00:18 (0.386570 s / it) * Acc@1 82.956 Acc@5 96.542 loss 0.729 Max accuracy: 83.07% Epoch: [220/300] [ 0/1251] eta: 0:42:42 lr: 0.000346 loss: 3.157230 (3.157230) time: 2.048203 data: 1.180244 max mem: 18817 Epoch: [220/300] [ 50/1251] eta: 0:19:13 lr: 0.000346 loss: 2.649007 (2.794204) time: 0.960747 data: 0.000163 max mem: 18817 Epoch: [220/300] [ 100/1251] eta: 0:18:12 lr: 0.000346 loss: 2.723000 (2.803951) time: 0.912753 data: 0.000173 max mem: 18817 Epoch: [220/300] [ 150/1251] eta: 0:17:24 lr: 0.000345 loss: 2.768486 (2.809019) time: 0.906183 data: 0.000154 max mem: 18817 Epoch: [220/300] [ 200/1251] eta: 0:16:39 lr: 0.000345 loss: 2.797467 (2.778270) time: 0.971703 data: 0.000180 max mem: 18817 Epoch: [220/300] [ 250/1251] eta: 0:15:50 lr: 0.000345 loss: 2.996269 (2.799026) time: 0.966908 data: 0.000166 max mem: 18817 Epoch: [220/300] [ 300/1251] eta: 0:15:00 lr: 0.000344 loss: 2.817119 (2.803857) time: 0.975330 data: 0.000174 max mem: 18817 Epoch: [220/300] [ 350/1251] eta: 0:14:13 lr: 0.000344 loss: 2.748153 (2.819765) time: 0.958245 data: 0.000177 max mem: 18817 Epoch: [220/300] [ 400/1251] eta: 0:13:24 lr: 0.000344 loss: 2.942820 (2.824806) time: 0.919229 data: 0.000183 max mem: 18817 Epoch: [220/300] [ 450/1251] eta: 0:12:38 lr: 0.000343 loss: 2.705251 (2.820284) time: 0.956732 data: 0.000177 max mem: 18817 Epoch: [220/300] [ 500/1251] eta: 0:11:51 lr: 0.000343 loss: 2.780087 (2.818958) time: 0.965330 data: 0.000175 max mem: 18817 Epoch: [220/300] [ 550/1251] eta: 0:11:03 lr: 0.000343 loss: 2.724507 (2.812611) time: 0.956999 data: 0.000172 max mem: 18817 Epoch: [220/300] [ 600/1251] eta: 0:10:15 lr: 0.000343 loss: 2.877481 (2.815307) time: 0.905802 data: 0.000175 max mem: 18817 Epoch: [220/300] [ 650/1251] eta: 0:09:28 lr: 0.000342 loss: 3.021356 (2.823973) time: 0.916799 data: 0.000169 max mem: 18817 Epoch: [220/300] [ 700/1251] eta: 0:08:41 lr: 0.000342 loss: 2.961991 (2.827018) time: 0.988914 data: 0.000189 max mem: 18817 Epoch: [220/300] [ 750/1251] eta: 0:07:54 lr: 0.000342 loss: 2.787026 (2.833929) time: 0.937126 data: 0.000178 max mem: 18817 Epoch: [220/300] [ 800/1251] eta: 0:07:07 lr: 0.000341 loss: 2.892192 (2.838234) time: 0.987199 data: 0.000163 max mem: 18817 Epoch: [220/300] [ 850/1251] eta: 0:06:19 lr: 0.000341 loss: 3.042198 (2.839928) time: 0.971373 data: 0.000175 max mem: 18817 Epoch: [220/300] [ 900/1251] eta: 0:05:32 lr: 0.000341 loss: 2.850126 (2.839358) time: 0.913311 data: 0.000178 max mem: 18817 Epoch: [220/300] [ 950/1251] eta: 0:04:44 lr: 0.000340 loss: 2.768878 (2.828680) time: 0.934363 data: 0.000170 max mem: 18817 Epoch: [220/300] [1000/1251] eta: 0:03:57 lr: 0.000340 loss: 2.864287 (2.830690) time: 0.972534 data: 0.000178 max mem: 18817 Epoch: [220/300] [1050/1251] eta: 0:03:10 lr: 0.000340 loss: 2.906617 (2.828040) time: 1.012547 data: 0.000186 max mem: 18817 Epoch: [220/300] [1100/1251] eta: 0:02:23 lr: 0.000339 loss: 2.963063 (2.827984) time: 0.991811 data: 0.000171 max mem: 18817 Epoch: [220/300] [1150/1251] eta: 0:01:35 lr: 0.000339 loss: 2.949174 (2.829963) time: 0.924160 data: 0.000174 max mem: 18817 Epoch: [220/300] [1200/1251] eta: 0:00:48 lr: 0.000339 loss: 2.827510 (2.828558) time: 0.941761 data: 0.000172 max mem: 18817 Epoch: [220/300] [1250/1251] eta: 0:00:00 lr: 0.000339 loss: 3.017164 (2.830811) time: 0.969860 data: 0.000771 max mem: 18817 Epoch: [220/300] Total time: 0:19:47 (0.949599 s / it) Averaged stats: lr: 0.000339 loss: 3.017164 (2.823993) Test: [ 0/49] eta: 0:01:16 loss: 0.487025 (0.487025) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.560588 data: 1.130428 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.590618 (0.669960) acc1: 85.937500 (84.375000) acc5: 95.312500 (96.164773) time: 0.470081 data: 0.102929 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.725544 (0.701018) acc1: 82.812500 (83.407738) acc5: 96.875000 (96.279762) time: 0.361086 data: 0.000158 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.721440 (0.702264) acc1: 81.250000 (83.366935) acc5: 96.875000 (96.370968) time: 0.371509 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.719651 (0.710694) acc1: 81.250000 (83.193598) acc5: 96.875000 (96.493902) time: 0.364471 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.746189 (0.711508) acc1: 81.250000 (83.040000) acc5: 96.875000 (96.640000) time: 0.347610 data: 0.000106 max mem: 18817 Test: Total time: 0:00:18 (0.387315 s / it) * Acc@1 83.068 Acc@5 96.574 loss 0.723 Max accuracy: 83.07% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0220.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0220.pth Epoch: [221/300] [ 0/1251] eta: 0:41:16 lr: 0.000339 loss: 3.538661 (3.538661) time: 1.979801 data: 1.123275 max mem: 18817 Epoch: [221/300] [ 50/1251] eta: 0:19:22 lr: 0.000338 loss: 2.768705 (2.777304) time: 0.955061 data: 0.000182 max mem: 18817 Epoch: [221/300] [ 100/1251] eta: 0:18:30 lr: 0.000338 loss: 2.884216 (2.775939) time: 0.966497 data: 0.000174 max mem: 18817 Epoch: [221/300] [ 150/1251] eta: 0:17:32 lr: 0.000338 loss: 2.957785 (2.795839) time: 0.962640 data: 0.000193 max mem: 18817 Epoch: [221/300] [ 200/1251] eta: 0:16:38 lr: 0.000337 loss: 2.916440 (2.807805) time: 0.916032 data: 0.000168 max mem: 18817 Epoch: [221/300] [ 250/1251] eta: 0:15:51 lr: 0.000337 loss: 3.123667 (2.812988) time: 0.910280 data: 0.000187 max mem: 18817 Epoch: [221/300] [ 300/1251] eta: 0:15:04 lr: 0.000337 loss: 2.822260 (2.824178) time: 0.966649 data: 0.000176 max mem: 18817 Epoch: [221/300] [ 350/1251] eta: 0:14:15 lr: 0.000336 loss: 2.929165 (2.810030) time: 0.985932 data: 0.000154 max mem: 18817 Epoch: [221/300] [ 400/1251] eta: 0:13:28 lr: 0.000336 loss: 2.849697 (2.810898) time: 0.970262 data: 0.000177 max mem: 18817 Epoch: [221/300] [ 450/1251] eta: 0:12:39 lr: 0.000336 loss: 2.797525 (2.802471) time: 0.914544 data: 0.000177 max mem: 18817 Epoch: [221/300] [ 500/1251] eta: 0:11:52 lr: 0.000336 loss: 2.917113 (2.803286) time: 0.915614 data: 0.000180 max mem: 18817 Epoch: [221/300] [ 550/1251] eta: 0:11:05 lr: 0.000335 loss: 3.081705 (2.810887) time: 0.961226 data: 0.000176 max mem: 18817 Epoch: [221/300] [ 600/1251] eta: 0:10:17 lr: 0.000335 loss: 2.903023 (2.810917) time: 0.958514 data: 0.000170 max mem: 18817 Epoch: [221/300] [ 650/1251] eta: 0:09:30 lr: 0.000335 loss: 2.929528 (2.813732) time: 0.958210 data: 0.000185 max mem: 18817 Epoch: [221/300] [ 700/1251] eta: 0:08:42 lr: 0.000334 loss: 2.786324 (2.812596) time: 0.924788 data: 0.000170 max mem: 18817 Epoch: [221/300] [ 750/1251] eta: 0:07:55 lr: 0.000334 loss: 3.055351 (2.814562) time: 0.903860 data: 0.000170 max mem: 18817 Epoch: [221/300] [ 800/1251] eta: 0:07:08 lr: 0.000334 loss: 2.828828 (2.810138) time: 0.968319 data: 0.000185 max mem: 18817 Epoch: [221/300] [ 850/1251] eta: 0:06:20 lr: 0.000333 loss: 3.026631 (2.816754) time: 0.964688 data: 0.000187 max mem: 18817 Epoch: [221/300] [ 900/1251] eta: 0:05:33 lr: 0.000333 loss: 2.978771 (2.820117) time: 0.924039 data: 0.000163 max mem: 18817 Epoch: [221/300] [ 950/1251] eta: 0:04:45 lr: 0.000333 loss: 2.771559 (2.813805) time: 0.914322 data: 0.000178 max mem: 18817 Epoch: [221/300] [1000/1251] eta: 0:03:58 lr: 0.000332 loss: 3.025325 (2.811922) time: 0.980952 data: 0.000162 max mem: 18817 Epoch: [221/300] [1050/1251] eta: 0:03:11 lr: 0.000332 loss: 2.678071 (2.809243) time: 0.969392 data: 0.000182 max mem: 18817 Epoch: [221/300] [1100/1251] eta: 0:02:23 lr: 0.000332 loss: 2.886944 (2.809325) time: 0.975878 data: 0.000163 max mem: 18817 Epoch: [221/300] [1150/1251] eta: 0:01:35 lr: 0.000332 loss: 3.018629 (2.813252) time: 0.923785 data: 0.000179 max mem: 18817 Epoch: [221/300] [1200/1251] eta: 0:00:48 lr: 0.000331 loss: 2.940682 (2.813437) time: 0.916775 data: 0.000179 max mem: 18817 Epoch: [221/300] [1250/1251] eta: 0:00:00 lr: 0.000331 loss: 2.911225 (2.812892) time: 0.978434 data: 0.000782 max mem: 18817 Epoch: [221/300] Total time: 0:19:50 (0.951425 s / it) Averaged stats: lr: 0.000331 loss: 2.911225 (2.805882) Test: [ 0/49] eta: 0:01:27 loss: 0.454937 (0.454937) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.787613 data: 1.394009 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.580333 (0.671353) acc1: 82.812500 (83.522727) acc5: 96.875000 (96.590909) time: 0.496193 data: 0.126865 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.739247 (0.695328) acc1: 81.250000 (83.035714) acc5: 96.875000 (96.205357) time: 0.374349 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.713655 (0.696936) acc1: 81.250000 (82.862903) acc5: 96.875000 (96.270161) time: 0.387756 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.709547 (0.705729) acc1: 82.812500 (82.964939) acc5: 96.875000 (96.455793) time: 0.370428 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.719484 (0.708949) acc1: 82.812500 (83.008000) acc5: 96.875000 (96.608000) time: 0.346558 data: 0.000104 max mem: 18817 Test: Total time: 0:00:19 (0.398222 s / it) * Acc@1 83.076 Acc@5 96.578 loss 0.720 Max accuracy: 83.08% Epoch: [222/300] [ 0/1251] eta: 0:54:26 lr: 0.000331 loss: 1.905297 (1.905297) time: 2.611064 data: 1.344665 max mem: 18817 Epoch: [222/300] [ 50/1251] eta: 0:19:23 lr: 0.000331 loss: 2.817487 (2.687615) time: 0.971808 data: 0.000188 max mem: 18817 Epoch: [222/300] [ 100/1251] eta: 0:18:15 lr: 0.000330 loss: 2.697022 (2.724437) time: 0.912378 data: 0.000187 max mem: 18817 Epoch: [222/300] [ 150/1251] eta: 0:17:30 lr: 0.000330 loss: 2.861038 (2.745855) time: 0.908117 data: 0.000179 max mem: 18817 Epoch: [222/300] [ 200/1251] eta: 0:16:42 lr: 0.000330 loss: 2.623728 (2.753594) time: 0.965922 data: 0.000192 max mem: 18817 Epoch: [222/300] [ 250/1251] eta: 0:15:53 lr: 0.000329 loss: 3.055286 (2.773435) time: 0.979714 data: 0.000183 max mem: 18817 Epoch: [222/300] [ 300/1251] eta: 0:15:02 lr: 0.000329 loss: 2.820263 (2.756938) time: 0.908265 data: 0.000182 max mem: 18817 Epoch: [222/300] [ 350/1251] eta: 0:14:16 lr: 0.000329 loss: 2.784118 (2.766216) time: 0.915717 data: 0.000187 max mem: 18817 Epoch: [222/300] [ 400/1251] eta: 0:13:29 lr: 0.000329 loss: 2.970961 (2.768977) time: 0.997440 data: 0.000199 max mem: 18817 Epoch: [222/300] [ 450/1251] eta: 0:12:40 lr: 0.000328 loss: 3.089992 (2.782658) time: 0.964827 data: 0.000172 max mem: 18817 Epoch: [222/300] [ 500/1251] eta: 0:11:52 lr: 0.000328 loss: 3.024920 (2.786167) time: 0.919970 data: 0.000173 max mem: 18817 Epoch: [222/300] [ 550/1251] eta: 0:11:05 lr: 0.000328 loss: 2.831148 (2.780714) time: 0.939249 data: 0.000180 max mem: 18817 Epoch: [222/300] [ 600/1251] eta: 0:10:19 lr: 0.000327 loss: 2.849782 (2.785205) time: 0.947521 data: 0.000201 max mem: 18817 Epoch: [222/300] [ 650/1251] eta: 0:09:32 lr: 0.000327 loss: 2.715162 (2.785154) time: 1.025190 data: 0.000179 max mem: 18817 Epoch: [222/300] [ 700/1251] eta: 0:08:45 lr: 0.000327 loss: 2.971951 (2.788337) time: 0.995700 data: 0.000194 max mem: 18817 Epoch: [222/300] [ 750/1251] eta: 0:07:56 lr: 0.000326 loss: 2.824506 (2.784491) time: 0.917122 data: 0.000191 max mem: 18817 Epoch: [222/300] [ 800/1251] eta: 0:07:09 lr: 0.000326 loss: 2.809919 (2.779481) time: 0.929854 data: 0.000179 max mem: 18817 Epoch: [222/300] [ 850/1251] eta: 0:06:22 lr: 0.000326 loss: 2.958727 (2.782731) time: 1.001119 data: 0.000181 max mem: 18817 Epoch: [222/300] [ 900/1251] eta: 0:05:35 lr: 0.000326 loss: 2.930193 (2.785602) time: 1.033712 data: 0.000177 max mem: 18817 Epoch: [222/300] [ 950/1251] eta: 0:04:46 lr: 0.000325 loss: 2.871470 (2.783900) time: 0.958886 data: 0.000175 max mem: 18817 Epoch: [222/300] [1000/1251] eta: 0:03:59 lr: 0.000325 loss: 2.958528 (2.786473) time: 0.929362 data: 0.000166 max mem: 18817 Epoch: [222/300] [1050/1251] eta: 0:03:11 lr: 0.000325 loss: 2.864727 (2.787778) time: 0.914982 data: 0.000193 max mem: 18817 Epoch: [222/300] [1100/1251] eta: 0:02:23 lr: 0.000324 loss: 2.843006 (2.786667) time: 0.989099 data: 0.000189 max mem: 18817 Epoch: [222/300] [1150/1251] eta: 0:01:36 lr: 0.000324 loss: 2.895257 (2.785983) time: 1.017623 data: 0.000171 max mem: 18817 Epoch: [222/300] [1200/1251] eta: 0:00:48 lr: 0.000324 loss: 2.596876 (2.784580) time: 0.947818 data: 0.000176 max mem: 18817 Epoch: [222/300] [1250/1251] eta: 0:00:00 lr: 0.000323 loss: 2.674264 (2.785206) time: 0.916111 data: 0.000758 max mem: 18817 Epoch: [222/300] Total time: 0:19:51 (0.952414 s / it) Averaged stats: lr: 0.000323 loss: 2.674264 (2.785541) Test: [ 0/49] eta: 0:01:25 loss: 0.435650 (0.435650) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.745003 data: 1.361062 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.572775 (0.665551) acc1: 84.375000 (84.375000) acc5: 98.437500 (96.875000) time: 0.485226 data: 0.123856 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.719690 (0.692183) acc1: 84.375000 (83.556548) acc5: 96.875000 (96.502976) time: 0.396536 data: 0.000132 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.704519 (0.695342) acc1: 82.812500 (82.963710) acc5: 96.875000 (96.673387) time: 0.454493 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.726893 (0.702668) acc1: 82.812500 (83.193598) acc5: 96.875000 (96.646341) time: 0.411220 data: 0.000150 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.726893 (0.701296) acc1: 82.812500 (83.296000) acc5: 96.875000 (96.704000) time: 0.344656 data: 0.000124 max mem: 18817 Test: Total time: 0:00:20 (0.422904 s / it) * Acc@1 83.128 Acc@5 96.614 loss 0.716 Max accuracy: 83.13% Epoch: [223/300] [ 0/1251] eta: 0:42:53 lr: 0.000323 loss: 3.157604 (3.157604) time: 2.057140 data: 1.181672 max mem: 18817 Epoch: [223/300] [ 50/1251] eta: 0:19:49 lr: 0.000323 loss: 2.733656 (2.764125) time: 0.977157 data: 0.000180 max mem: 18817 Epoch: [223/300] [ 100/1251] eta: 0:18:35 lr: 0.000323 loss: 2.893344 (2.813742) time: 0.986843 data: 0.000179 max mem: 18817 Epoch: [223/300] [ 150/1251] eta: 0:17:35 lr: 0.000323 loss: 2.803607 (2.809408) time: 0.914370 data: 0.000188 max mem: 18817 Epoch: [223/300] [ 200/1251] eta: 0:16:46 lr: 0.000322 loss: 3.007632 (2.786092) time: 0.940063 data: 0.000175 max mem: 18817 Epoch: [223/300] [ 250/1251] eta: 0:15:54 lr: 0.000322 loss: 3.008512 (2.796958) time: 0.917780 data: 0.000182 max mem: 18817 Epoch: [223/300] [ 300/1251] eta: 0:15:07 lr: 0.000322 loss: 2.948955 (2.795155) time: 0.973629 data: 0.000162 max mem: 18817 Epoch: [223/300] [ 350/1251] eta: 0:14:20 lr: 0.000321 loss: 2.678730 (2.787862) time: 0.971908 data: 0.000172 max mem: 18817 Epoch: [223/300] [ 400/1251] eta: 0:13:31 lr: 0.000321 loss: 2.979377 (2.795843) time: 0.957022 data: 0.000207 max mem: 18817 Epoch: [223/300] [ 450/1251] eta: 0:12:42 lr: 0.000321 loss: 2.626148 (2.789711) time: 0.915060 data: 0.000168 max mem: 18817 Epoch: [223/300] [ 500/1251] eta: 0:11:55 lr: 0.000320 loss: 2.925484 (2.784829) time: 0.915904 data: 0.000168 max mem: 18817 Epoch: [223/300] [ 550/1251] eta: 0:11:08 lr: 0.000320 loss: 2.678455 (2.777460) time: 1.038541 data: 0.000176 max mem: 18817 Epoch: [223/300] [ 600/1251] eta: 0:10:18 lr: 0.000320 loss: 3.039314 (2.793149) time: 0.907497 data: 0.000172 max mem: 18817 Epoch: [223/300] [ 650/1251] eta: 0:09:32 lr: 0.000320 loss: 2.543202 (2.786441) time: 0.927629 data: 0.000176 max mem: 18817 Epoch: [223/300] [ 700/1251] eta: 0:08:44 lr: 0.000319 loss: 2.867105 (2.793712) time: 0.971507 data: 0.000169 max mem: 18817 Epoch: [223/300] [ 750/1251] eta: 0:07:56 lr: 0.000319 loss: 2.869670 (2.803313) time: 0.969808 data: 0.000163 max mem: 18817 Epoch: [223/300] [ 800/1251] eta: 0:07:09 lr: 0.000319 loss: 2.828199 (2.808032) time: 0.959447 data: 0.000172 max mem: 18817 Epoch: [223/300] [ 850/1251] eta: 0:06:21 lr: 0.000318 loss: 2.626047 (2.805526) time: 0.912683 data: 0.000181 max mem: 18817 Epoch: [223/300] [ 900/1251] eta: 0:05:33 lr: 0.000318 loss: 3.002872 (2.811082) time: 0.966412 data: 0.000198 max mem: 18817 Epoch: [223/300] [ 950/1251] eta: 0:04:46 lr: 0.000318 loss: 2.580253 (2.804623) time: 0.954170 data: 0.000180 max mem: 18817 Epoch: [223/300] [1000/1251] eta: 0:03:58 lr: 0.000317 loss: 3.009558 (2.810417) time: 0.967109 data: 0.000163 max mem: 18817 Epoch: [223/300] [1050/1251] eta: 0:03:11 lr: 0.000317 loss: 2.833788 (2.806776) time: 0.917971 data: 0.000178 max mem: 18817 Epoch: [223/300] [1100/1251] eta: 0:02:23 lr: 0.000317 loss: 2.867080 (2.805438) time: 0.920632 data: 0.000181 max mem: 18817 Epoch: [223/300] [1150/1251] eta: 0:01:36 lr: 0.000317 loss: 2.811962 (2.800730) time: 0.962806 data: 0.000167 max mem: 18817 Epoch: [223/300] [1200/1251] eta: 0:00:48 lr: 0.000316 loss: 2.716374 (2.798519) time: 0.975645 data: 0.000173 max mem: 18817 Epoch: [223/300] [1250/1251] eta: 0:00:00 lr: 0.000316 loss: 2.757298 (2.796518) time: 0.966344 data: 0.000772 max mem: 18817 Epoch: [223/300] Total time: 0:19:50 (0.951711 s / it) Averaged stats: lr: 0.000316 loss: 2.757298 (2.791871) Test: [ 0/49] eta: 0:01:26 loss: 0.486956 (0.486956) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.771551 data: 1.374718 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.554934 (0.671337) acc1: 84.375000 (84.801136) acc5: 98.437500 (96.590909) time: 0.489942 data: 0.125107 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.762428 (0.701715) acc1: 82.812500 (84.002976) acc5: 96.875000 (96.577381) time: 0.457095 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.759923 (0.700004) acc1: 82.812500 (83.215726) acc5: 96.875000 (96.875000) time: 0.452149 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.704658 (0.706523) acc1: 82.812500 (83.003049) acc5: 96.875000 (96.913110) time: 0.349142 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.708477 (0.703401) acc1: 82.812500 (83.168000) acc5: 96.875000 (96.928000) time: 0.344035 data: 0.000110 max mem: 18817 Test: Total time: 0:00:20 (0.422809 s / it) * Acc@1 83.172 Acc@5 96.554 loss 0.713 Max accuracy: 83.17% Epoch: [224/300] [ 0/1251] eta: 0:40:46 lr: 0.000316 loss: 3.105394 (3.105394) time: 1.955862 data: 1.097271 max mem: 18817 Epoch: [224/300] [ 50/1251] eta: 0:19:19 lr: 0.000316 loss: 2.859162 (2.889617) time: 0.952240 data: 0.000176 max mem: 18817 Epoch: [224/300] [ 100/1251] eta: 0:18:17 lr: 0.000315 loss: 2.638479 (2.791273) time: 0.979342 data: 0.000190 max mem: 18817 Epoch: [224/300] [ 150/1251] eta: 0:17:25 lr: 0.000315 loss: 2.795772 (2.791467) time: 0.912971 data: 0.000189 max mem: 18817 Epoch: [224/300] [ 200/1251] eta: 0:16:38 lr: 0.000315 loss: 2.803035 (2.770675) time: 0.909561 data: 0.000181 max mem: 18817 Epoch: [224/300] [ 250/1251] eta: 0:15:56 lr: 0.000315 loss: 2.943794 (2.775826) time: 0.934777 data: 0.000176 max mem: 18817 Epoch: [224/300] [ 300/1251] eta: 0:15:08 lr: 0.000314 loss: 2.953839 (2.775834) time: 0.966783 data: 0.000169 max mem: 18817 Epoch: [224/300] [ 350/1251] eta: 0:14:20 lr: 0.000314 loss: 2.980329 (2.790420) time: 0.962039 data: 0.000175 max mem: 18817 Epoch: [224/300] [ 400/1251] eta: 0:13:30 lr: 0.000314 loss: 2.810116 (2.778622) time: 0.985836 data: 0.000181 max mem: 18817 Epoch: [224/300] [ 450/1251] eta: 0:12:44 lr: 0.000313 loss: 2.992782 (2.792508) time: 0.983429 data: 0.000175 max mem: 18817 Epoch: [224/300] [ 500/1251] eta: 0:11:55 lr: 0.000313 loss: 2.916149 (2.792649) time: 0.919264 data: 0.000188 max mem: 18817 Epoch: [224/300] [ 550/1251] eta: 0:11:07 lr: 0.000313 loss: 2.677098 (2.783850) time: 0.937619 data: 0.000177 max mem: 18817 Epoch: [224/300] [ 600/1251] eta: 0:10:20 lr: 0.000312 loss: 2.807092 (2.782862) time: 0.984810 data: 0.000181 max mem: 18817 Epoch: [224/300] [ 650/1251] eta: 0:09:32 lr: 0.000312 loss: 3.061016 (2.778771) time: 0.965424 data: 0.000174 max mem: 18817 Epoch: [224/300] [ 700/1251] eta: 0:08:44 lr: 0.000312 loss: 2.987808 (2.786929) time: 0.924686 data: 0.000174 max mem: 18817 Epoch: [224/300] [ 750/1251] eta: 0:07:57 lr: 0.000312 loss: 2.816375 (2.790330) time: 0.922333 data: 0.000180 max mem: 18817 Epoch: [224/300] [ 800/1251] eta: 0:07:09 lr: 0.000311 loss: 2.751867 (2.792299) time: 0.978684 data: 0.000181 max mem: 18817 Epoch: [224/300] [ 850/1251] eta: 0:06:22 lr: 0.000311 loss: 2.954869 (2.794912) time: 0.975467 data: 0.000178 max mem: 18817 Epoch: [224/300] [ 900/1251] eta: 0:05:34 lr: 0.000311 loss: 2.387040 (2.790209) time: 0.949691 data: 0.000172 max mem: 18817 Epoch: [224/300] [ 950/1251] eta: 0:04:46 lr: 0.000310 loss: 3.020397 (2.790417) time: 0.912701 data: 0.000183 max mem: 18817 Epoch: [224/300] [1000/1251] eta: 0:03:59 lr: 0.000310 loss: 3.063933 (2.795967) time: 0.916412 data: 0.000167 max mem: 18817 Epoch: [224/300] [1050/1251] eta: 0:03:11 lr: 0.000310 loss: 2.787943 (2.788522) time: 0.971354 data: 0.000175 max mem: 18817 Epoch: [224/300] [1100/1251] eta: 0:02:23 lr: 0.000310 loss: 3.004606 (2.788493) time: 0.913419 data: 0.000177 max mem: 18817 Epoch: [224/300] [1150/1251] eta: 0:01:36 lr: 0.000309 loss: 2.783011 (2.790717) time: 0.960358 data: 0.000188 max mem: 18817 Epoch: [224/300] [1200/1251] eta: 0:00:48 lr: 0.000309 loss: 2.877588 (2.791767) time: 0.974880 data: 0.000166 max mem: 18817 Epoch: [224/300] [1250/1251] eta: 0:00:00 lr: 0.000309 loss: 2.807293 (2.790244) time: 0.926977 data: 0.000774 max mem: 18817 Epoch: [224/300] Total time: 0:19:49 (0.950916 s / it) Averaged stats: lr: 0.000309 loss: 2.807293 (2.790926) Test: [ 0/49] eta: 0:01:13 loss: 0.507525 (0.507525) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.504899 data: 1.088547 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.581187 (0.668221) acc1: 84.375000 (83.664773) acc5: 96.875000 (96.306818) time: 0.468786 data: 0.099104 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.748449 (0.705713) acc1: 82.812500 (82.886905) acc5: 96.875000 (96.577381) time: 0.451370 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.700942 (0.703697) acc1: 81.250000 (82.913306) acc5: 96.875000 (96.875000) time: 0.445627 data: 0.000141 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.700942 (0.711560) acc1: 81.250000 (82.850610) acc5: 96.875000 (96.951220) time: 0.350278 data: 0.000137 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.708291 (0.709874) acc1: 81.250000 (83.040000) acc5: 96.875000 (96.928000) time: 0.344346 data: 0.000102 max mem: 18817 Test: Total time: 0:00:20 (0.415444 s / it) * Acc@1 83.168 Acc@5 96.592 loss 0.716 Max accuracy: 83.17% Epoch: [225/300] [ 0/1251] eta: 0:41:34 lr: 0.000309 loss: 2.867511 (2.867511) time: 1.993957 data: 1.113988 max mem: 18817 Epoch: [225/300] [ 50/1251] eta: 0:19:23 lr: 0.000308 loss: 2.935636 (2.889292) time: 1.012808 data: 0.000146 max mem: 18817 Epoch: [225/300] [ 100/1251] eta: 0:18:19 lr: 0.000308 loss: 2.613131 (2.844473) time: 0.960758 data: 0.000170 max mem: 18817 Epoch: [225/300] [ 150/1251] eta: 0:17:25 lr: 0.000308 loss: 2.876706 (2.810805) time: 0.913335 data: 0.000173 max mem: 18817 Epoch: [225/300] [ 200/1251] eta: 0:16:42 lr: 0.000307 loss: 2.956613 (2.784644) time: 0.919754 data: 0.000188 max mem: 18817 Epoch: [225/300] [ 250/1251] eta: 0:15:56 lr: 0.000307 loss: 2.651341 (2.781308) time: 0.960291 data: 0.000173 max mem: 18817 Epoch: [225/300] [ 300/1251] eta: 0:15:03 lr: 0.000307 loss: 2.670513 (2.783238) time: 0.954018 data: 0.000164 max mem: 18817 Epoch: [225/300] [ 350/1251] eta: 0:14:17 lr: 0.000307 loss: 2.731572 (2.779770) time: 0.975994 data: 0.000168 max mem: 18817 Epoch: [225/300] [ 400/1251] eta: 0:13:27 lr: 0.000306 loss: 2.766785 (2.782680) time: 0.922444 data: 0.000186 max mem: 18817 Epoch: [225/300] [ 450/1251] eta: 0:12:40 lr: 0.000306 loss: 2.875855 (2.785372) time: 0.928425 data: 0.000179 max mem: 18817 Epoch: [225/300] [ 500/1251] eta: 0:11:54 lr: 0.000306 loss: 2.853776 (2.786334) time: 0.988839 data: 0.000175 max mem: 18817 Epoch: [225/300] [ 550/1251] eta: 0:11:06 lr: 0.000305 loss: 2.804943 (2.787034) time: 0.975602 data: 0.000172 max mem: 18817 Epoch: [225/300] [ 600/1251] eta: 0:10:19 lr: 0.000305 loss: 3.014024 (2.796356) time: 0.959914 data: 0.000179 max mem: 18817 Epoch: [225/300] [ 650/1251] eta: 0:09:31 lr: 0.000305 loss: 2.775599 (2.793764) time: 0.913632 data: 0.000178 max mem: 18817 Epoch: [225/300] [ 700/1251] eta: 0:08:44 lr: 0.000305 loss: 2.865790 (2.796845) time: 0.979670 data: 0.000181 max mem: 18817 Epoch: [225/300] [ 750/1251] eta: 0:07:57 lr: 0.000304 loss: 2.885260 (2.796580) time: 0.965875 data: 0.000183 max mem: 18817 Epoch: [225/300] [ 800/1251] eta: 0:07:09 lr: 0.000304 loss: 2.998990 (2.796626) time: 0.976964 data: 0.000168 max mem: 18817 Epoch: [225/300] [ 850/1251] eta: 0:06:21 lr: 0.000304 loss: 2.836600 (2.794324) time: 0.926075 data: 0.000168 max mem: 18817 Epoch: [225/300] [ 900/1251] eta: 0:05:34 lr: 0.000303 loss: 2.800817 (2.794276) time: 0.909518 data: 0.000179 max mem: 18817 Epoch: [225/300] [ 950/1251] eta: 0:04:46 lr: 0.000303 loss: 3.039687 (2.790741) time: 0.987527 data: 0.000174 max mem: 18817 Epoch: [225/300] [1000/1251] eta: 0:03:59 lr: 0.000303 loss: 2.898664 (2.789491) time: 0.973801 data: 0.000176 max mem: 18817 Epoch: [225/300] [1050/1251] eta: 0:03:11 lr: 0.000303 loss: 2.808028 (2.788799) time: 0.948750 data: 0.000171 max mem: 18817 Epoch: [225/300] [1100/1251] eta: 0:02:23 lr: 0.000302 loss: 2.774823 (2.782405) time: 0.923964 data: 0.000170 max mem: 18817 Epoch: [225/300] [1150/1251] eta: 0:01:36 lr: 0.000302 loss: 2.897864 (2.780632) time: 0.904735 data: 0.000172 max mem: 18817 Epoch: [225/300] [1200/1251] eta: 0:00:48 lr: 0.000302 loss: 2.976115 (2.784115) time: 0.971400 data: 0.000178 max mem: 18817 Epoch: [225/300] [1250/1251] eta: 0:00:00 lr: 0.000301 loss: 2.785014 (2.785413) time: 1.026404 data: 0.000774 max mem: 18817 Epoch: [225/300] Total time: 0:19:51 (0.952346 s / it) Averaged stats: lr: 0.000301 loss: 2.785014 (2.785556) Test: [ 0/49] eta: 0:01:26 loss: 0.516149 (0.516149) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.762272 data: 1.356924 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.576598 (0.676119) acc1: 84.375000 (84.232955) acc5: 98.437500 (96.448864) time: 0.485635 data: 0.123507 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.751479 (0.710707) acc1: 82.812500 (82.886905) acc5: 96.875000 (96.502976) time: 0.354722 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.751479 (0.710134) acc1: 79.687500 (82.661290) acc5: 96.875000 (96.622984) time: 0.352046 data: 0.000115 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.732944 (0.719552) acc1: 81.250000 (82.583841) acc5: 96.875000 (96.760671) time: 0.349720 data: 0.000116 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.732944 (0.716558) acc1: 82.812500 (82.688000) acc5: 96.875000 (96.800000) time: 0.344100 data: 0.000097 max mem: 18817 Test: Total time: 0:00:18 (0.381022 s / it) * Acc@1 83.212 Acc@5 96.682 loss 0.717 Max accuracy: 83.21% Epoch: [226/300] [ 0/1251] eta: 0:42:23 lr: 0.000301 loss: 2.579736 (2.579736) time: 2.033345 data: 1.153725 max mem: 18817 Epoch: [226/300] [ 50/1251] eta: 0:19:39 lr: 0.000301 loss: 2.087576 (2.607578) time: 0.928975 data: 0.000180 max mem: 18817 Epoch: [226/300] [ 100/1251] eta: 0:18:31 lr: 0.000301 loss: 2.988997 (2.748598) time: 0.963626 data: 0.000190 max mem: 18817 Epoch: [226/300] [ 150/1251] eta: 0:17:33 lr: 0.000301 loss: 2.647881 (2.722439) time: 0.979166 data: 0.000173 max mem: 18817 Epoch: [226/300] [ 200/1251] eta: 0:16:41 lr: 0.000300 loss: 2.859175 (2.725482) time: 0.932226 data: 0.000184 max mem: 18817 Epoch: [226/300] [ 250/1251] eta: 0:15:56 lr: 0.000300 loss: 2.679913 (2.720650) time: 0.932369 data: 0.000179 max mem: 18817 Epoch: [226/300] [ 300/1251] eta: 0:15:08 lr: 0.000300 loss: 2.733707 (2.723754) time: 0.948455 data: 0.000171 max mem: 18817 Epoch: [226/300] [ 350/1251] eta: 0:14:20 lr: 0.000299 loss: 2.543395 (2.714852) time: 1.024608 data: 0.000174 max mem: 18817 Epoch: [226/300] [ 400/1251] eta: 0:13:31 lr: 0.000299 loss: 2.693004 (2.725749) time: 0.976907 data: 0.000206 max mem: 18817 Epoch: [226/300] [ 450/1251] eta: 0:12:42 lr: 0.000299 loss: 2.890350 (2.736770) time: 0.917802 data: 0.000184 max mem: 18817 Epoch: [226/300] [ 500/1251] eta: 0:11:56 lr: 0.000298 loss: 2.923104 (2.741750) time: 0.932717 data: 0.000181 max mem: 18817 Epoch: [226/300] [ 550/1251] eta: 0:11:06 lr: 0.000298 loss: 3.030100 (2.749891) time: 0.959010 data: 0.000175 max mem: 18817 Epoch: [226/300] [ 600/1251] eta: 0:10:20 lr: 0.000298 loss: 2.892325 (2.752492) time: 0.979575 data: 0.000165 max mem: 18817 Epoch: [226/300] [ 650/1251] eta: 0:09:32 lr: 0.000298 loss: 2.870583 (2.759854) time: 0.911641 data: 0.000165 max mem: 18817 Epoch: [226/300] [ 700/1251] eta: 0:08:44 lr: 0.000297 loss: 2.662618 (2.758476) time: 0.912711 data: 0.000183 max mem: 18817 Epoch: [226/300] [ 750/1251] eta: 0:07:57 lr: 0.000297 loss: 2.867591 (2.765586) time: 0.950710 data: 0.000165 max mem: 18817 Epoch: [226/300] [ 800/1251] eta: 0:07:09 lr: 0.000297 loss: 2.918985 (2.767144) time: 0.962594 data: 0.000174 max mem: 18817 Epoch: [226/300] [ 850/1251] eta: 0:06:21 lr: 0.000296 loss: 2.774369 (2.763710) time: 0.910727 data: 0.000171 max mem: 18817 Epoch: [226/300] [ 900/1251] eta: 0:05:33 lr: 0.000296 loss: 2.780754 (2.762484) time: 0.915621 data: 0.000167 max mem: 18817 Epoch: [226/300] [ 950/1251] eta: 0:04:46 lr: 0.000296 loss: 2.890308 (2.763152) time: 0.999965 data: 0.000456 max mem: 18817 Epoch: [226/300] [1000/1251] eta: 0:03:59 lr: 0.000296 loss: 3.016662 (2.769761) time: 0.949646 data: 0.000170 max mem: 18817 Epoch: [226/300] [1050/1251] eta: 0:03:11 lr: 0.000295 loss: 2.957693 (2.769313) time: 0.970006 data: 0.000160 max mem: 18817 Epoch: [226/300] [1100/1251] eta: 0:02:23 lr: 0.000295 loss: 2.926525 (2.776838) time: 0.925388 data: 0.000189 max mem: 18817 Epoch: [226/300] [1150/1251] eta: 0:01:36 lr: 0.000295 loss: 2.952312 (2.780475) time: 0.916633 data: 0.000187 max mem: 18817 Epoch: [226/300] [1200/1251] eta: 0:00:48 lr: 0.000294 loss: 2.782181 (2.783470) time: 0.973060 data: 0.000186 max mem: 18817 Epoch: [226/300] [1250/1251] eta: 0:00:00 lr: 0.000294 loss: 3.063168 (2.786721) time: 0.962940 data: 0.000800 max mem: 18817 Epoch: [226/300] Total time: 0:19:52 (0.953020 s / it) Averaged stats: lr: 0.000294 loss: 3.063168 (2.784865) Test: [ 0/49] eta: 0:01:26 loss: 0.487130 (0.487130) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.774594 data: 1.380623 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.578900 (0.683421) acc1: 85.937500 (83.664773) acc5: 98.437500 (96.448864) time: 0.487347 data: 0.125648 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.740189 (0.713548) acc1: 82.812500 (83.630952) acc5: 96.875000 (96.577381) time: 0.355548 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.717241 (0.708608) acc1: 82.812500 (83.518145) acc5: 96.875000 (96.673387) time: 0.352349 data: 0.000157 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.690087 (0.713480) acc1: 82.812500 (83.422256) acc5: 96.875000 (96.798780) time: 0.350345 data: 0.000160 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.699318 (0.713583) acc1: 82.812500 (83.552000) acc5: 96.875000 (96.800000) time: 0.344562 data: 0.000112 max mem: 18817 Test: Total time: 0:00:18 (0.382156 s / it) * Acc@1 83.318 Acc@5 96.644 loss 0.722 Max accuracy: 83.32% Epoch: [227/300] [ 0/1251] eta: 0:43:28 lr: 0.000294 loss: 2.476121 (2.476121) time: 2.084821 data: 1.181830 max mem: 18817 Epoch: [227/300] [ 50/1251] eta: 0:19:36 lr: 0.000294 loss: 2.711005 (2.670189) time: 0.924524 data: 0.000186 max mem: 18817 Epoch: [227/300] [ 100/1251] eta: 0:18:34 lr: 0.000294 loss: 2.770880 (2.701071) time: 0.944143 data: 0.000453 max mem: 18817 Epoch: [227/300] [ 150/1251] eta: 0:17:32 lr: 0.000293 loss: 2.917789 (2.776712) time: 0.961802 data: 0.000189 max mem: 18817 Epoch: [227/300] [ 200/1251] eta: 0:16:36 lr: 0.000293 loss: 2.689051 (2.729546) time: 0.905935 data: 0.000180 max mem: 18817 Epoch: [227/300] [ 250/1251] eta: 0:15:50 lr: 0.000293 loss: 2.862236 (2.738090) time: 0.918340 data: 0.000175 max mem: 18817 Epoch: [227/300] [ 300/1251] eta: 0:15:04 lr: 0.000292 loss: 3.002151 (2.750014) time: 0.944310 data: 0.000454 max mem: 18817 Epoch: [227/300] [ 350/1251] eta: 0:14:18 lr: 0.000292 loss: 2.807504 (2.745504) time: 1.041743 data: 0.000179 max mem: 18817 Epoch: [227/300] [ 400/1251] eta: 0:13:29 lr: 0.000292 loss: 2.877650 (2.741783) time: 0.976807 data: 0.000171 max mem: 18817 Epoch: [227/300] [ 450/1251] eta: 0:12:40 lr: 0.000292 loss: 2.881000 (2.744596) time: 0.907142 data: 0.000183 max mem: 18817 Epoch: [227/300] [ 500/1251] eta: 0:11:54 lr: 0.000291 loss: 3.051630 (2.749782) time: 0.935329 data: 0.000177 max mem: 18817 Epoch: [227/300] [ 550/1251] eta: 0:11:07 lr: 0.000291 loss: 2.823300 (2.738849) time: 0.981670 data: 0.000195 max mem: 18817 Epoch: [227/300] [ 600/1251] eta: 0:10:20 lr: 0.000291 loss: 2.759505 (2.738314) time: 1.025921 data: 0.000179 max mem: 18817 Epoch: [227/300] [ 650/1251] eta: 0:09:31 lr: 0.000290 loss: 2.752586 (2.739729) time: 0.971402 data: 0.000183 max mem: 18817 Epoch: [227/300] [ 700/1251] eta: 0:08:44 lr: 0.000290 loss: 2.786519 (2.743180) time: 0.924584 data: 0.000177 max mem: 18817 Epoch: [227/300] [ 750/1251] eta: 0:07:57 lr: 0.000290 loss: 2.962683 (2.748628) time: 0.929914 data: 0.000178 max mem: 18817 Epoch: [227/300] [ 800/1251] eta: 0:07:09 lr: 0.000290 loss: 3.007360 (2.751310) time: 0.964318 data: 0.000184 max mem: 18817 Epoch: [227/300] [ 850/1251] eta: 0:06:22 lr: 0.000289 loss: 2.669883 (2.751866) time: 1.021995 data: 0.000171 max mem: 18817 Epoch: [227/300] [ 900/1251] eta: 0:05:34 lr: 0.000289 loss: 2.623000 (2.743794) time: 0.973750 data: 0.000189 max mem: 18817 Epoch: [227/300] [ 950/1251] eta: 0:04:46 lr: 0.000289 loss: 3.046928 (2.748228) time: 0.921089 data: 0.000174 max mem: 18817 Epoch: [227/300] [1000/1251] eta: 0:03:58 lr: 0.000288 loss: 2.837124 (2.748771) time: 0.925995 data: 0.000164 max mem: 18817 Epoch: [227/300] [1050/1251] eta: 0:03:11 lr: 0.000288 loss: 2.835084 (2.750034) time: 0.986750 data: 0.000178 max mem: 18817 Epoch: [227/300] [1100/1251] eta: 0:02:23 lr: 0.000288 loss: 2.974365 (2.756330) time: 1.036839 data: 0.000189 max mem: 18817 Epoch: [227/300] [1150/1251] eta: 0:01:36 lr: 0.000288 loss: 2.991784 (2.755725) time: 0.973124 data: 0.000181 max mem: 18817 Epoch: [227/300] [1200/1251] eta: 0:00:48 lr: 0.000287 loss: 2.912552 (2.757634) time: 0.913089 data: 0.000182 max mem: 18817 Epoch: [227/300] [1250/1251] eta: 0:00:00 lr: 0.000287 loss: 2.820471 (2.756584) time: 0.912139 data: 0.000768 max mem: 18817 Epoch: [227/300] Total time: 0:19:51 (0.952381 s / it) Averaged stats: lr: 0.000287 loss: 2.820471 (2.761931) Test: [ 0/49] eta: 0:01:25 loss: 0.475603 (0.475603) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.748128 data: 1.345703 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.629096 (0.666418) acc1: 82.812500 (83.948864) acc5: 98.437500 (96.875000) time: 0.486311 data: 0.122470 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.747027 (0.700612) acc1: 81.250000 (83.258929) acc5: 96.875000 (96.800595) time: 0.356183 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.727059 (0.697507) acc1: 82.812500 (83.114919) acc5: 96.875000 (96.875000) time: 0.353013 data: 0.000142 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.723547 (0.709737) acc1: 82.812500 (82.926829) acc5: 96.875000 (96.951220) time: 0.350559 data: 0.000135 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.726165 (0.707517) acc1: 82.812500 (82.976000) acc5: 96.875000 (96.928000) time: 0.344282 data: 0.000105 max mem: 18817 Test: Total time: 0:00:18 (0.381607 s / it) * Acc@1 83.224 Acc@5 96.638 loss 0.721 Max accuracy: 83.32% Epoch: [228/300] [ 0/1251] eta: 0:41:45 lr: 0.000287 loss: 2.888344 (2.888344) time: 2.002606 data: 1.135333 max mem: 18817 Epoch: [228/300] [ 50/1251] eta: 0:19:22 lr: 0.000287 loss: 2.827000 (2.604069) time: 0.973985 data: 0.000176 max mem: 18817 Epoch: [228/300] [ 100/1251] eta: 0:18:17 lr: 0.000286 loss: 2.974319 (2.718328) time: 0.909812 data: 0.000170 max mem: 18817 Epoch: [228/300] [ 150/1251] eta: 0:17:33 lr: 0.000286 loss: 2.882713 (2.734552) time: 0.919785 data: 0.000160 max mem: 18817 Epoch: [228/300] [ 200/1251] eta: 0:16:44 lr: 0.000286 loss: 2.917787 (2.760688) time: 0.962155 data: 0.000198 max mem: 18817 Epoch: [228/300] [ 250/1251] eta: 0:15:59 lr: 0.000286 loss: 2.974679 (2.766799) time: 0.969384 data: 0.000174 max mem: 18817 Epoch: [228/300] [ 300/1251] eta: 0:15:09 lr: 0.000285 loss: 2.858147 (2.763691) time: 0.976728 data: 0.000173 max mem: 18817 Epoch: [228/300] [ 350/1251] eta: 0:14:20 lr: 0.000285 loss: 2.935266 (2.770368) time: 0.934401 data: 0.000172 max mem: 18817 Epoch: [228/300] [ 400/1251] eta: 0:13:32 lr: 0.000285 loss: 3.061784 (2.783209) time: 0.917027 data: 0.000179 max mem: 18817 Epoch: [228/300] [ 450/1251] eta: 0:12:44 lr: 0.000284 loss: 2.847506 (2.780592) time: 0.965755 data: 0.000181 max mem: 18817 Epoch: [228/300] [ 500/1251] eta: 0:11:57 lr: 0.000284 loss: 2.916775 (2.778455) time: 1.021306 data: 0.000166 max mem: 18817 Epoch: [228/300] [ 550/1251] eta: 0:11:08 lr: 0.000284 loss: 2.885556 (2.779808) time: 0.966475 data: 0.000192 max mem: 18817 Epoch: [228/300] [ 600/1251] eta: 0:10:20 lr: 0.000284 loss: 2.633145 (2.775359) time: 0.938656 data: 0.000170 max mem: 18817 Epoch: [228/300] [ 650/1251] eta: 0:09:32 lr: 0.000283 loss: 2.925941 (2.776229) time: 0.917220 data: 0.000158 max mem: 18817 Epoch: [228/300] [ 700/1251] eta: 0:08:45 lr: 0.000283 loss: 2.807480 (2.767264) time: 0.968735 data: 0.000168 max mem: 18817 Epoch: [228/300] [ 750/1251] eta: 0:07:57 lr: 0.000283 loss: 2.860338 (2.772073) time: 1.003709 data: 0.000190 max mem: 18817 Epoch: [228/300] [ 800/1251] eta: 0:07:09 lr: 0.000283 loss: 2.680354 (2.774138) time: 0.964049 data: 0.000173 max mem: 18817 Epoch: [228/300] [ 850/1251] eta: 0:06:21 lr: 0.000282 loss: 2.664477 (2.772894) time: 0.932176 data: 0.000188 max mem: 18817 Epoch: [228/300] [ 900/1251] eta: 0:05:34 lr: 0.000282 loss: 2.797379 (2.777211) time: 0.923469 data: 0.000184 max mem: 18817 Epoch: [228/300] [ 950/1251] eta: 0:04:46 lr: 0.000282 loss: 2.848546 (2.777512) time: 0.957121 data: 0.000167 max mem: 18817 Epoch: [228/300] [1000/1251] eta: 0:03:58 lr: 0.000281 loss: 2.780946 (2.784070) time: 0.953787 data: 0.000168 max mem: 18817 Epoch: [228/300] [1050/1251] eta: 0:03:11 lr: 0.000281 loss: 2.876405 (2.780484) time: 0.908783 data: 0.000176 max mem: 18817 Epoch: [228/300] [1100/1251] eta: 0:02:23 lr: 0.000281 loss: 3.040765 (2.783025) time: 0.918426 data: 0.000170 max mem: 18817 Epoch: [228/300] [1150/1251] eta: 0:01:36 lr: 0.000281 loss: 2.826571 (2.781582) time: 0.915071 data: 0.000169 max mem: 18817 Epoch: [228/300] [1200/1251] eta: 0:00:48 lr: 0.000280 loss: 2.921901 (2.783669) time: 0.963366 data: 0.000165 max mem: 18817 Epoch: [228/300] [1250/1251] eta: 0:00:00 lr: 0.000280 loss: 2.924729 (2.785051) time: 0.973223 data: 0.000787 max mem: 18817 Epoch: [228/300] Total time: 0:19:50 (0.952016 s / it) Averaged stats: lr: 0.000280 loss: 2.924729 (2.782428) Test: [ 0/49] eta: 0:01:29 loss: 0.486249 (0.486249) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.830203 data: 1.451832 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.588697 (0.672125) acc1: 82.812500 (82.954545) acc5: 98.437500 (96.875000) time: 0.490779 data: 0.132127 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.711674 (0.696571) acc1: 82.812500 (82.961310) acc5: 96.875000 (96.651786) time: 0.353665 data: 0.000139 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.660535 (0.695643) acc1: 81.250000 (82.963710) acc5: 96.875000 (96.925403) time: 0.350858 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.668814 (0.698850) acc1: 82.812500 (82.812500) acc5: 96.875000 (97.065549) time: 0.361301 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.694726 (0.699541) acc1: 81.250000 (82.784000) acc5: 96.875000 (97.024000) time: 0.357960 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.388304 s / it) * Acc@1 83.222 Acc@5 96.670 loss 0.708 Max accuracy: 83.32% Epoch: [229/300] [ 0/1251] eta: 0:42:16 lr: 0.000280 loss: 2.651615 (2.651615) time: 2.027449 data: 1.150403 max mem: 18817 Epoch: [229/300] [ 50/1251] eta: 0:19:09 lr: 0.000280 loss: 2.451437 (2.668431) time: 0.916915 data: 0.000158 max mem: 18817 Epoch: [229/300] [ 100/1251] eta: 0:18:23 lr: 0.000279 loss: 2.414170 (2.663809) time: 0.914305 data: 0.000165 max mem: 18817 Epoch: [229/300] [ 150/1251] eta: 0:17:33 lr: 0.000279 loss: 2.949928 (2.714942) time: 0.957191 data: 0.000168 max mem: 18817 Epoch: [229/300] [ 200/1251] eta: 0:16:44 lr: 0.000279 loss: 3.174071 (2.752169) time: 0.956797 data: 0.000181 max mem: 18817 Epoch: [229/300] [ 250/1251] eta: 0:15:52 lr: 0.000279 loss: 2.762186 (2.762478) time: 0.964970 data: 0.000182 max mem: 18817 Epoch: [229/300] [ 300/1251] eta: 0:15:03 lr: 0.000278 loss: 2.747843 (2.754635) time: 0.915566 data: 0.000169 max mem: 18817 Epoch: [229/300] [ 350/1251] eta: 0:14:16 lr: 0.000278 loss: 2.925732 (2.759046) time: 0.923376 data: 0.000174 max mem: 18817 Epoch: [229/300] [ 400/1251] eta: 0:13:30 lr: 0.000278 loss: 2.626336 (2.744730) time: 0.976134 data: 0.000179 max mem: 18817 Epoch: [229/300] [ 450/1251] eta: 0:12:42 lr: 0.000277 loss: 2.399075 (2.743900) time: 0.978977 data: 0.000181 max mem: 18817 Epoch: [229/300] [ 500/1251] eta: 0:11:54 lr: 0.000277 loss: 2.741612 (2.737659) time: 1.025688 data: 0.000170 max mem: 18817 Epoch: [229/300] [ 550/1251] eta: 0:11:07 lr: 0.000277 loss: 2.732855 (2.727129) time: 0.984904 data: 0.000180 max mem: 18817 Epoch: [229/300] [ 600/1251] eta: 0:10:19 lr: 0.000277 loss: 2.952488 (2.734953) time: 0.911187 data: 0.000187 max mem: 18817 Epoch: [229/300] [ 650/1251] eta: 0:09:31 lr: 0.000276 loss: 2.983249 (2.734273) time: 0.927595 data: 0.000182 max mem: 18817 Epoch: [229/300] [ 700/1251] eta: 0:08:43 lr: 0.000276 loss: 2.928421 (2.736761) time: 0.954959 data: 0.000170 max mem: 18817 Epoch: [229/300] [ 750/1251] eta: 0:07:56 lr: 0.000276 loss: 2.839736 (2.737082) time: 1.030936 data: 0.000179 max mem: 18817 Epoch: [229/300] [ 800/1251] eta: 0:07:08 lr: 0.000276 loss: 2.731686 (2.735654) time: 0.959736 data: 0.000164 max mem: 18817 Epoch: [229/300] [ 850/1251] eta: 0:06:20 lr: 0.000275 loss: 2.952177 (2.743515) time: 0.917658 data: 0.000223 max mem: 18817 Epoch: [229/300] [ 900/1251] eta: 0:05:33 lr: 0.000275 loss: 2.845988 (2.736031) time: 0.911925 data: 0.000171 max mem: 18817 Epoch: [229/300] [ 950/1251] eta: 0:04:45 lr: 0.000275 loss: 2.832672 (2.735284) time: 0.976897 data: 0.000173 max mem: 18817 Epoch: [229/300] [1000/1251] eta: 0:03:58 lr: 0.000274 loss: 2.758932 (2.731540) time: 0.966267 data: 0.000170 max mem: 18817 Epoch: [229/300] [1050/1251] eta: 0:03:10 lr: 0.000274 loss: 2.643939 (2.728413) time: 0.912277 data: 0.000175 max mem: 18817 Epoch: [229/300] [1100/1251] eta: 0:02:23 lr: 0.000274 loss: 2.750976 (2.726776) time: 0.932222 data: 0.000173 max mem: 18817 Epoch: [229/300] [1150/1251] eta: 0:01:35 lr: 0.000274 loss: 2.900225 (2.727091) time: 0.921887 data: 0.000167 max mem: 18817 Epoch: [229/300] [1200/1251] eta: 0:00:48 lr: 0.000273 loss: 2.773747 (2.730337) time: 0.976358 data: 0.000185 max mem: 18817 Epoch: [229/300] [1250/1251] eta: 0:00:00 lr: 0.000273 loss: 2.906238 (2.733227) time: 0.959389 data: 0.000758 max mem: 18817 Epoch: [229/300] Total time: 0:19:49 (0.950883 s / it) Averaged stats: lr: 0.000273 loss: 2.906238 (2.741622) Test: [ 0/49] eta: 0:01:15 loss: 0.529758 (0.529758) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.538295 data: 1.142853 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.651840 (0.664355) acc1: 84.375000 (83.664773) acc5: 98.437500 (96.732955) time: 0.467739 data: 0.104045 max mem: 18817 Test: [20/49] eta: 0:00:11 loss: 0.721325 (0.692678) acc1: 82.812500 (82.886905) acc5: 96.875000 (96.800595) time: 0.356107 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.715210 (0.694285) acc1: 81.250000 (82.560484) acc5: 96.875000 (96.925403) time: 0.351797 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.688177 (0.707830) acc1: 82.812500 (82.545732) acc5: 96.875000 (96.951220) time: 0.369792 data: 0.000129 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.715210 (0.705437) acc1: 82.812500 (82.752000) acc5: 96.875000 (96.960000) time: 0.376699 data: 0.000108 max mem: 18817 Test: Total time: 0:00:19 (0.390214 s / it) * Acc@1 83.318 Acc@5 96.696 loss 0.709 Max accuracy: 83.32% Epoch: [230/300] [ 0/1251] eta: 0:45:08 lr: 0.000273 loss: 2.398548 (2.398548) time: 2.165094 data: 1.284152 max mem: 18817 Epoch: [230/300] [ 50/1251] eta: 0:19:41 lr: 0.000273 loss: 2.778044 (2.819817) time: 0.921385 data: 0.000164 max mem: 18817 Epoch: [230/300] [ 100/1251] eta: 0:18:41 lr: 0.000272 loss: 2.979976 (2.809795) time: 0.987637 data: 0.000184 max mem: 18817 Epoch: [230/300] [ 150/1251] eta: 0:17:53 lr: 0.000272 loss: 3.025141 (2.802635) time: 1.047182 data: 0.000172 max mem: 18817 Epoch: [230/300] [ 200/1251] eta: 0:16:55 lr: 0.000272 loss: 2.808577 (2.789702) time: 0.957136 data: 0.000159 max mem: 18817 Epoch: [230/300] [ 250/1251] eta: 0:16:03 lr: 0.000272 loss: 2.818858 (2.771130) time: 0.922330 data: 0.000156 max mem: 18817 Epoch: [230/300] [ 300/1251] eta: 0:15:13 lr: 0.000271 loss: 2.946769 (2.787106) time: 0.929544 data: 0.000165 max mem: 18817 Epoch: [230/300] [ 350/1251] eta: 0:14:25 lr: 0.000271 loss: 2.626150 (2.773924) time: 0.966554 data: 0.000173 max mem: 18817 Epoch: [230/300] [ 400/1251] eta: 0:13:36 lr: 0.000271 loss: 3.071985 (2.773334) time: 1.020502 data: 0.000184 max mem: 18817 Epoch: [230/300] [ 450/1251] eta: 0:12:46 lr: 0.000271 loss: 2.787676 (2.780233) time: 0.978725 data: 0.000172 max mem: 18817 Epoch: [230/300] [ 500/1251] eta: 0:11:57 lr: 0.000270 loss: 2.734300 (2.776213) time: 0.920328 data: 0.000172 max mem: 18817 Epoch: [230/300] [ 550/1251] eta: 0:11:08 lr: 0.000270 loss: 2.875070 (2.779969) time: 0.913969 data: 0.000173 max mem: 18817 Epoch: [230/300] [ 600/1251] eta: 0:10:21 lr: 0.000270 loss: 2.929192 (2.776532) time: 0.910056 data: 0.000168 max mem: 18817 Epoch: [230/300] [ 650/1251] eta: 0:09:33 lr: 0.000269 loss: 2.850504 (2.777991) time: 0.990256 data: 0.000170 max mem: 18817 Epoch: [230/300] [ 700/1251] eta: 0:08:45 lr: 0.000269 loss: 2.813532 (2.780044) time: 1.010040 data: 0.000168 max mem: 18817 Epoch: [230/300] [ 750/1251] eta: 0:07:57 lr: 0.000269 loss: 2.713944 (2.775820) time: 0.971794 data: 0.000176 max mem: 18817 Epoch: [230/300] [ 800/1251] eta: 0:07:09 lr: 0.000269 loss: 2.636124 (2.773625) time: 0.909829 data: 0.000188 max mem: 18817 Epoch: [230/300] [ 850/1251] eta: 0:06:21 lr: 0.000268 loss: 2.950001 (2.781739) time: 0.918435 data: 0.000184 max mem: 18817 Epoch: [230/300] [ 900/1251] eta: 0:05:34 lr: 0.000268 loss: 2.833223 (2.784635) time: 0.973343 data: 0.000182 max mem: 18817 Epoch: [230/300] [ 950/1251] eta: 0:04:46 lr: 0.000268 loss: 2.540361 (2.782206) time: 0.970322 data: 0.000174 max mem: 18817 Epoch: [230/300] [1000/1251] eta: 0:03:59 lr: 0.000268 loss: 2.924586 (2.780573) time: 0.989392 data: 0.000165 max mem: 18817 Epoch: [230/300] [1050/1251] eta: 0:03:11 lr: 0.000267 loss: 2.890030 (2.777333) time: 0.959100 data: 0.000170 max mem: 18817 Epoch: [230/300] [1100/1251] eta: 0:02:23 lr: 0.000267 loss: 2.863953 (2.774066) time: 0.903284 data: 0.000187 max mem: 18817 Epoch: [230/300] [1150/1251] eta: 0:01:35 lr: 0.000267 loss: 2.796153 (2.773818) time: 0.913327 data: 0.000173 max mem: 18817 Epoch: [230/300] [1200/1251] eta: 0:00:48 lr: 0.000266 loss: 2.672304 (2.769927) time: 0.957767 data: 0.000187 max mem: 18817 Epoch: [230/300] [1250/1251] eta: 0:00:00 lr: 0.000266 loss: 2.654335 (2.769415) time: 0.951616 data: 0.000765 max mem: 18817 Epoch: [230/300] Total time: 0:19:49 (0.950698 s / it) Averaged stats: lr: 0.000266 loss: 2.654335 (2.772188) Test: [ 0/49] eta: 0:01:15 loss: 0.527278 (0.527278) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.539009 data: 1.121087 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.559113 (0.683049) acc1: 82.812500 (83.806818) acc5: 96.875000 (96.732955) time: 0.468836 data: 0.102069 max mem: 18817 Test: [20/49] eta: 0:00:11 loss: 0.777552 (0.714101) acc1: 81.250000 (83.184524) acc5: 96.875000 (96.651786) time: 0.357412 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.705029 (0.706032) acc1: 82.812500 (83.366935) acc5: 96.875000 (96.723790) time: 0.356776 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.701258 (0.706967) acc1: 82.812500 (83.269817) acc5: 96.875000 (96.913110) time: 0.364713 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.701258 (0.704756) acc1: 82.812500 (83.264000) acc5: 96.875000 (96.960000) time: 0.370774 data: 0.000115 max mem: 18817 Test: Total time: 0:00:19 (0.388942 s / it) * Acc@1 83.320 Acc@5 96.696 loss 0.717 Max accuracy: 83.32% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0230.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0230.pth Epoch: [231/300] [ 0/1251] eta: 0:40:41 lr: 0.000266 loss: 3.128747 (3.128747) time: 1.951681 data: 1.093504 max mem: 18817 Epoch: [231/300] [ 50/1251] eta: 0:19:04 lr: 0.000266 loss: 2.731474 (2.838004) time: 0.953173 data: 0.000174 max mem: 18817 Epoch: [231/300] [ 100/1251] eta: 0:18:06 lr: 0.000266 loss: 2.543900 (2.771556) time: 0.916992 data: 0.000179 max mem: 18817 Epoch: [231/300] [ 150/1251] eta: 0:17:27 lr: 0.000265 loss: 2.864089 (2.780322) time: 0.930524 data: 0.000183 max mem: 18817 Epoch: [231/300] [ 200/1251] eta: 0:16:39 lr: 0.000265 loss: 2.805239 (2.775589) time: 0.956021 data: 0.000181 max mem: 18817 Epoch: [231/300] [ 250/1251] eta: 0:15:55 lr: 0.000265 loss: 2.788400 (2.757012) time: 1.053564 data: 0.000169 max mem: 18817 Epoch: [231/300] [ 300/1251] eta: 0:15:05 lr: 0.000264 loss: 2.807690 (2.759537) time: 0.972792 data: 0.000168 max mem: 18817 Epoch: [231/300] [ 350/1251] eta: 0:14:15 lr: 0.000264 loss: 2.972937 (2.773769) time: 0.921407 data: 0.000170 max mem: 18817 Epoch: [231/300] [ 400/1251] eta: 0:13:28 lr: 0.000264 loss: 2.709269 (2.764178) time: 0.910817 data: 0.000183 max mem: 18817 Epoch: [231/300] [ 450/1251] eta: 0:12:42 lr: 0.000264 loss: 2.888093 (2.761378) time: 0.991758 data: 0.000165 max mem: 18817 Epoch: [231/300] [ 500/1251] eta: 0:11:56 lr: 0.000263 loss: 2.824258 (2.754992) time: 1.024153 data: 0.000173 max mem: 18817 Epoch: [231/300] [ 550/1251] eta: 0:11:07 lr: 0.000263 loss: 2.745947 (2.754742) time: 0.943032 data: 0.000167 max mem: 18817 Epoch: [231/300] [ 600/1251] eta: 0:10:18 lr: 0.000263 loss: 2.840258 (2.757923) time: 0.920598 data: 0.000166 max mem: 18817 Epoch: [231/300] [ 650/1251] eta: 0:09:31 lr: 0.000263 loss: 2.789321 (2.757360) time: 0.923584 data: 0.000167 max mem: 18817 Epoch: [231/300] [ 700/1251] eta: 0:08:44 lr: 0.000262 loss: 2.568244 (2.755289) time: 0.952660 data: 0.000178 max mem: 18817 Epoch: [231/300] [ 750/1251] eta: 0:07:55 lr: 0.000262 loss: 2.822916 (2.757340) time: 0.955641 data: 0.000173 max mem: 18817 Epoch: [231/300] [ 800/1251] eta: 0:07:08 lr: 0.000262 loss: 2.692117 (2.755251) time: 0.917452 data: 0.000172 max mem: 18817 Epoch: [231/300] [ 850/1251] eta: 0:06:20 lr: 0.000261 loss: 2.917608 (2.755509) time: 0.916246 data: 0.000167 max mem: 18817 Epoch: [231/300] [ 900/1251] eta: 0:05:33 lr: 0.000261 loss: 2.687251 (2.753538) time: 0.916410 data: 0.000200 max mem: 18817 Epoch: [231/300] [ 950/1251] eta: 0:04:46 lr: 0.000261 loss: 2.933823 (2.751801) time: 1.021368 data: 0.000182 max mem: 18817 Epoch: [231/300] [1000/1251] eta: 0:03:58 lr: 0.000261 loss: 2.785859 (2.750723) time: 0.981979 data: 0.000173 max mem: 18817 Epoch: [231/300] [1050/1251] eta: 0:03:10 lr: 0.000260 loss: 2.965899 (2.751755) time: 0.929463 data: 0.000185 max mem: 18817 Epoch: [231/300] [1100/1251] eta: 0:02:23 lr: 0.000260 loss: 3.099025 (2.758969) time: 0.914504 data: 0.000180 max mem: 18817 Epoch: [231/300] [1150/1251] eta: 0:01:35 lr: 0.000260 loss: 2.739426 (2.758776) time: 0.908392 data: 0.000173 max mem: 18817 Epoch: [231/300] [1200/1251] eta: 0:00:48 lr: 0.000260 loss: 2.988100 (2.761441) time: 0.943209 data: 0.000172 max mem: 18817 Epoch: [231/300] [1250/1251] eta: 0:00:00 lr: 0.000259 loss: 2.690522 (2.759811) time: 0.959240 data: 0.000768 max mem: 18817 Epoch: [231/300] Total time: 0:19:48 (0.950214 s / it) Averaged stats: lr: 0.000259 loss: 2.690522 (2.753716) Test: [ 0/49] eta: 0:01:26 loss: 0.517118 (0.517118) acc1: 82.812500 (82.812500) acc5: 100.000000 (100.000000) time: 1.761594 data: 1.358165 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.559530 (0.648431) acc1: 84.375000 (84.232955) acc5: 96.875000 (96.875000) time: 0.484393 data: 0.123618 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.734146 (0.691981) acc1: 82.812500 (83.035714) acc5: 96.875000 (96.800595) time: 0.353741 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.688915 (0.689416) acc1: 82.812500 (83.266129) acc5: 96.875000 (96.925403) time: 0.351680 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.695712 (0.696692) acc1: 82.812500 (83.079268) acc5: 98.437500 (97.065549) time: 0.349527 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.702922 (0.698111) acc1: 81.250000 (83.200000) acc5: 96.875000 (97.024000) time: 0.354847 data: 0.000103 max mem: 18817 Test: Total time: 0:00:18 (0.386313 s / it) * Acc@1 83.376 Acc@5 96.666 loss 0.706 Max accuracy: 83.38% Epoch: [232/300] [ 0/1251] eta: 0:39:26 lr: 0.000259 loss: 1.923492 (1.923492) time: 1.892056 data: 1.026668 max mem: 18817 Epoch: [232/300] [ 50/1251] eta: 0:19:11 lr: 0.000259 loss: 2.779152 (2.694097) time: 0.924662 data: 0.000176 max mem: 18817 Epoch: [232/300] [ 100/1251] eta: 0:18:25 lr: 0.000259 loss: 2.978687 (2.771860) time: 0.910060 data: 0.000182 max mem: 18817 Epoch: [232/300] [ 150/1251] eta: 0:17:32 lr: 0.000259 loss: 2.743826 (2.740944) time: 0.965035 data: 0.000183 max mem: 18817 Epoch: [232/300] [ 200/1251] eta: 0:16:41 lr: 0.000258 loss: 2.968177 (2.739150) time: 0.990067 data: 0.000181 max mem: 18817 Epoch: [232/300] [ 250/1251] eta: 0:15:55 lr: 0.000258 loss: 2.698205 (2.730539) time: 0.964783 data: 0.000189 max mem: 18817 Epoch: [232/300] [ 300/1251] eta: 0:15:04 lr: 0.000258 loss: 2.601252 (2.736738) time: 0.913525 data: 0.000175 max mem: 18817 Epoch: [232/300] [ 350/1251] eta: 0:14:15 lr: 0.000257 loss: 2.687970 (2.729932) time: 0.901530 data: 0.000163 max mem: 18817 Epoch: [232/300] [ 400/1251] eta: 0:13:28 lr: 0.000257 loss: 2.742393 (2.730252) time: 0.912418 data: 0.000173 max mem: 18817 Epoch: [232/300] [ 450/1251] eta: 0:12:42 lr: 0.000257 loss: 2.823380 (2.722878) time: 0.945877 data: 0.000172 max mem: 18817 Epoch: [232/300] [ 500/1251] eta: 0:11:52 lr: 0.000257 loss: 2.650067 (2.723276) time: 0.945472 data: 0.000184 max mem: 18817 Epoch: [232/300] [ 550/1251] eta: 0:11:04 lr: 0.000256 loss: 2.764808 (2.735502) time: 0.916932 data: 0.000183 max mem: 18817 Epoch: [232/300] [ 600/1251] eta: 0:10:17 lr: 0.000256 loss: 2.661286 (2.743654) time: 0.930320 data: 0.000175 max mem: 18817 Epoch: [232/300] [ 650/1251] eta: 0:09:30 lr: 0.000256 loss: 2.678550 (2.743283) time: 0.916836 data: 0.000165 max mem: 18817 Epoch: [232/300] [ 700/1251] eta: 0:08:43 lr: 0.000256 loss: 2.881670 (2.740252) time: 0.956136 data: 0.000164 max mem: 18817 Epoch: [232/300] [ 750/1251] eta: 0:07:55 lr: 0.000255 loss: 2.653749 (2.742997) time: 0.973100 data: 0.000186 max mem: 18817 Epoch: [232/300] [ 800/1251] eta: 0:07:07 lr: 0.000255 loss: 2.841062 (2.740300) time: 0.913989 data: 0.000173 max mem: 18817 Epoch: [232/300] [ 850/1251] eta: 0:06:20 lr: 0.000255 loss: 2.655735 (2.734053) time: 0.915193 data: 0.000176 max mem: 18817 Epoch: [232/300] [ 900/1251] eta: 0:05:33 lr: 0.000254 loss: 2.801422 (2.738024) time: 0.948633 data: 0.000166 max mem: 18817 Epoch: [232/300] [ 950/1251] eta: 0:04:45 lr: 0.000254 loss: 2.850897 (2.742667) time: 0.996865 data: 0.000169 max mem: 18817 Epoch: [232/300] [1000/1251] eta: 0:03:58 lr: 0.000254 loss: 2.706341 (2.740024) time: 0.957392 data: 0.000156 max mem: 18817 Epoch: [232/300] [1050/1251] eta: 0:03:10 lr: 0.000254 loss: 3.036700 (2.744052) time: 0.919249 data: 0.000172 max mem: 18817 Epoch: [232/300] [1100/1251] eta: 0:02:23 lr: 0.000253 loss: 2.851766 (2.744616) time: 0.936316 data: 0.000176 max mem: 18817 Epoch: [232/300] [1150/1251] eta: 0:01:35 lr: 0.000253 loss: 2.690179 (2.742729) time: 0.964410 data: 0.000176 max mem: 18817 Epoch: [232/300] [1200/1251] eta: 0:00:48 lr: 0.000253 loss: 2.996090 (2.746837) time: 1.058297 data: 0.000180 max mem: 18817 Epoch: [232/300] [1250/1251] eta: 0:00:00 lr: 0.000253 loss: 2.777166 (2.747237) time: 0.995302 data: 0.000743 max mem: 18817 Epoch: [232/300] Total time: 0:19:49 (0.951004 s / it) Averaged stats: lr: 0.000253 loss: 2.777166 (2.747978) Test: [ 0/49] eta: 0:01:16 loss: 0.514994 (0.514994) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.563045 data: 1.149915 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.556578 (0.687059) acc1: 84.375000 (84.375000) acc5: 98.437500 (96.306818) time: 0.470779 data: 0.104689 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.739900 (0.711953) acc1: 82.812500 (83.407738) acc5: 96.875000 (96.651786) time: 0.356914 data: 0.000152 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.718980 (0.702287) acc1: 81.250000 (83.266129) acc5: 96.875000 (96.774194) time: 0.352819 data: 0.000143 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.694583 (0.707356) acc1: 84.375000 (83.346037) acc5: 96.875000 (96.836890) time: 0.350322 data: 0.000143 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.695746 (0.707378) acc1: 84.375000 (83.264000) acc5: 96.875000 (96.896000) time: 0.344117 data: 0.000113 max mem: 18817 Test: Total time: 0:00:18 (0.378408 s / it) * Acc@1 83.302 Acc@5 96.674 loss 0.718 Max accuracy: 83.38% Epoch: [233/300] [ 0/1251] eta: 0:41:23 lr: 0.000253 loss: 3.065175 (3.065175) time: 1.984853 data: 1.119594 max mem: 18817 Epoch: [233/300] [ 50/1251] eta: 0:19:13 lr: 0.000252 loss: 2.831363 (2.748830) time: 0.915158 data: 0.000162 max mem: 18817 Epoch: [233/300] [ 100/1251] eta: 0:18:15 lr: 0.000252 loss: 2.834575 (2.753402) time: 0.903179 data: 0.000164 max mem: 18817 Epoch: [233/300] [ 150/1251] eta: 0:17:30 lr: 0.000252 loss: 2.820300 (2.787331) time: 0.972276 data: 0.000174 max mem: 18817 Epoch: [233/300] [ 200/1251] eta: 0:16:36 lr: 0.000252 loss: 2.507003 (2.771741) time: 0.954804 data: 0.000175 max mem: 18817 Epoch: [233/300] [ 250/1251] eta: 0:15:52 lr: 0.000251 loss: 2.832833 (2.775078) time: 0.971143 data: 0.000175 max mem: 18817 Epoch: [233/300] [ 300/1251] eta: 0:15:01 lr: 0.000251 loss: 2.876934 (2.775994) time: 0.909876 data: 0.000171 max mem: 18817 Epoch: [233/300] [ 350/1251] eta: 0:14:13 lr: 0.000251 loss: 2.773844 (2.772426) time: 0.929520 data: 0.000176 max mem: 18817 Epoch: [233/300] [ 400/1251] eta: 0:13:24 lr: 0.000250 loss: 2.746470 (2.764869) time: 0.902432 data: 0.000169 max mem: 18817 Epoch: [233/300] [ 450/1251] eta: 0:12:38 lr: 0.000250 loss: 2.804493 (2.762020) time: 0.970878 data: 0.000169 max mem: 18817 Epoch: [233/300] [ 500/1251] eta: 0:11:50 lr: 0.000250 loss: 2.444355 (2.753061) time: 0.983627 data: 0.000159 max mem: 18817 Epoch: [233/300] [ 550/1251] eta: 0:11:03 lr: 0.000250 loss: 2.568526 (2.741374) time: 0.911731 data: 0.000174 max mem: 18817 Epoch: [233/300] [ 600/1251] eta: 0:10:17 lr: 0.000249 loss: 2.888099 (2.747828) time: 0.937143 data: 0.000178 max mem: 18817 Epoch: [233/300] [ 650/1251] eta: 0:09:30 lr: 0.000249 loss: 2.940351 (2.748839) time: 0.909991 data: 0.000167 max mem: 18817 Epoch: [233/300] [ 700/1251] eta: 0:08:43 lr: 0.000249 loss: 2.915497 (2.747092) time: 0.973000 data: 0.000184 max mem: 18817 Epoch: [233/300] [ 750/1251] eta: 0:07:55 lr: 0.000249 loss: 3.037147 (2.755552) time: 0.970203 data: 0.000185 max mem: 18817 Epoch: [233/300] [ 800/1251] eta: 0:07:07 lr: 0.000248 loss: 2.957662 (2.764393) time: 0.925471 data: 0.000182 max mem: 18817 Epoch: [233/300] [ 850/1251] eta: 0:06:20 lr: 0.000248 loss: 2.890181 (2.767517) time: 0.929307 data: 0.000171 max mem: 18817 Epoch: [233/300] [ 900/1251] eta: 0:05:33 lr: 0.000248 loss: 2.876473 (2.766493) time: 0.983238 data: 0.000182 max mem: 18817 Epoch: [233/300] [ 950/1251] eta: 0:04:46 lr: 0.000248 loss: 2.747974 (2.761337) time: 1.015802 data: 0.000180 max mem: 18817 Epoch: [233/300] [1000/1251] eta: 0:03:58 lr: 0.000247 loss: 2.813890 (2.760953) time: 0.984739 data: 0.000164 max mem: 18817 Epoch: [233/300] [1050/1251] eta: 0:03:10 lr: 0.000247 loss: 2.899128 (2.765264) time: 0.911458 data: 0.000174 max mem: 18817 Epoch: [233/300] [1100/1251] eta: 0:02:23 lr: 0.000247 loss: 2.827111 (2.768188) time: 0.935613 data: 0.000194 max mem: 18817 Epoch: [233/300] [1150/1251] eta: 0:01:36 lr: 0.000246 loss: 2.794764 (2.771483) time: 0.976121 data: 0.000183 max mem: 18817 Epoch: [233/300] [1200/1251] eta: 0:00:48 lr: 0.000246 loss: 2.714634 (2.766854) time: 1.008389 data: 0.000183 max mem: 18817 Epoch: [233/300] [1250/1251] eta: 0:00:00 lr: 0.000246 loss: 2.881959 (2.767974) time: 0.973615 data: 0.000762 max mem: 18817 Epoch: [233/300] Total time: 0:19:50 (0.951418 s / it) Averaged stats: lr: 0.000246 loss: 2.881959 (2.762535) Test: [ 0/49] eta: 0:01:29 loss: 0.592121 (0.592121) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.830031 data: 1.428739 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.592121 (0.656119) acc1: 84.375000 (84.801136) acc5: 98.437500 (96.590909) time: 0.493682 data: 0.130045 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.725182 (0.689598) acc1: 84.375000 (84.077381) acc5: 96.875000 (96.502976) time: 0.356631 data: 0.000148 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.704681 (0.696798) acc1: 82.812500 (83.770161) acc5: 96.875000 (96.622984) time: 0.352101 data: 0.000121 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.712602 (0.705556) acc1: 82.812500 (83.460366) acc5: 96.875000 (96.722561) time: 0.348784 data: 0.000118 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.727520 (0.703561) acc1: 82.812500 (83.456000) acc5: 96.875000 (96.800000) time: 0.347244 data: 0.000102 max mem: 18817 Test: Total time: 0:00:18 (0.385017 s / it) * Acc@1 83.404 Acc@5 96.636 loss 0.717 Max accuracy: 83.40% Epoch: [234/300] [ 0/1251] eta: 0:41:11 lr: 0.000246 loss: 3.281273 (3.281273) time: 1.975940 data: 1.105387 max mem: 18817 Epoch: [234/300] [ 50/1251] eta: 0:19:21 lr: 0.000246 loss: 2.624411 (2.637119) time: 0.914109 data: 0.000186 max mem: 18817 Epoch: [234/300] [ 100/1251] eta: 0:18:25 lr: 0.000245 loss: 2.808682 (2.715827) time: 0.907404 data: 0.000176 max mem: 18817 Epoch: [234/300] [ 150/1251] eta: 0:17:37 lr: 0.000245 loss: 2.752941 (2.728883) time: 0.973350 data: 0.000168 max mem: 18817 Epoch: [234/300] [ 200/1251] eta: 0:16:40 lr: 0.000245 loss: 2.877532 (2.740136) time: 0.948240 data: 0.000167 max mem: 18817 Epoch: [234/300] [ 250/1251] eta: 0:15:52 lr: 0.000245 loss: 3.002727 (2.750981) time: 0.933244 data: 0.000178 max mem: 18817 Epoch: [234/300] [ 300/1251] eta: 0:15:06 lr: 0.000244 loss: 2.905485 (2.741515) time: 0.936100 data: 0.000155 max mem: 18817 Epoch: [234/300] [ 350/1251] eta: 0:14:18 lr: 0.000244 loss: 2.760056 (2.738476) time: 0.978633 data: 0.000169 max mem: 18817 Epoch: [234/300] [ 400/1251] eta: 0:13:31 lr: 0.000244 loss: 2.889439 (2.745092) time: 1.006990 data: 0.000182 max mem: 18817 Epoch: [234/300] [ 450/1251] eta: 0:12:42 lr: 0.000244 loss: 2.880422 (2.748055) time: 0.969893 data: 0.000179 max mem: 18817 Epoch: [234/300] [ 500/1251] eta: 0:11:52 lr: 0.000243 loss: 2.968471 (2.749894) time: 0.911776 data: 0.000178 max mem: 18817 Epoch: [234/300] [ 550/1251] eta: 0:11:06 lr: 0.000243 loss: 2.733978 (2.753596) time: 0.907464 data: 0.000168 max mem: 18817 Epoch: [234/300] [ 600/1251] eta: 0:10:19 lr: 0.000243 loss: 2.880344 (2.752755) time: 0.972126 data: 0.000178 max mem: 18817 Epoch: [234/300] [ 650/1251] eta: 0:09:31 lr: 0.000243 loss: 2.885644 (2.750042) time: 1.020685 data: 0.000166 max mem: 18817 Epoch: [234/300] [ 700/1251] eta: 0:08:43 lr: 0.000242 loss: 2.755426 (2.749186) time: 0.972921 data: 0.000173 max mem: 18817 Epoch: [234/300] [ 750/1251] eta: 0:07:55 lr: 0.000242 loss: 2.870503 (2.745326) time: 0.916052 data: 0.000181 max mem: 18817 Epoch: [234/300] [ 800/1251] eta: 0:07:09 lr: 0.000242 loss: 2.803560 (2.744681) time: 0.926928 data: 0.000169 max mem: 18817 Epoch: [234/300] [ 850/1251] eta: 0:06:21 lr: 0.000242 loss: 2.849791 (2.749871) time: 0.964633 data: 0.000162 max mem: 18817 Epoch: [234/300] [ 900/1251] eta: 0:05:34 lr: 0.000241 loss: 2.865885 (2.753652) time: 1.006114 data: 0.000172 max mem: 18817 Epoch: [234/300] [ 950/1251] eta: 0:04:46 lr: 0.000241 loss: 2.885476 (2.757061) time: 0.926747 data: 0.000187 max mem: 18817 Epoch: [234/300] [1000/1251] eta: 0:03:58 lr: 0.000241 loss: 2.719318 (2.755613) time: 0.937150 data: 0.000160 max mem: 18817 Epoch: [234/300] [1050/1251] eta: 0:03:11 lr: 0.000240 loss: 2.842481 (2.759263) time: 0.913700 data: 0.000190 max mem: 18817 Epoch: [234/300] [1100/1251] eta: 0:02:23 lr: 0.000240 loss: 2.745518 (2.759798) time: 0.918722 data: 0.000410 max mem: 18817 Epoch: [234/300] [1150/1251] eta: 0:01:36 lr: 0.000240 loss: 2.809502 (2.760949) time: 0.979411 data: 0.000178 max mem: 18817 Epoch: [234/300] [1200/1251] eta: 0:00:48 lr: 0.000240 loss: 2.966603 (2.761652) time: 0.973111 data: 0.000179 max mem: 18817 Epoch: [234/300] [1250/1251] eta: 0:00:00 lr: 0.000239 loss: 2.657506 (2.758427) time: 0.976150 data: 0.000777 max mem: 18817 Epoch: [234/300] Total time: 0:19:50 (0.951865 s / it) Averaged stats: lr: 0.000239 loss: 2.657506 (2.756895) Test: [ 0/49] eta: 0:01:26 loss: 0.517311 (0.517311) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.766547 data: 1.355806 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.569389 (0.679308) acc1: 84.375000 (83.664773) acc5: 98.437500 (97.017045) time: 0.486825 data: 0.123389 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.757539 (0.709655) acc1: 82.812500 (83.258929) acc5: 96.875000 (96.428571) time: 0.355324 data: 0.000131 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.737411 (0.707738) acc1: 82.812500 (83.366935) acc5: 96.875000 (96.723790) time: 0.362800 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.723195 (0.714262) acc1: 82.812500 (83.041159) acc5: 96.875000 (96.722561) time: 0.448461 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.732728 (0.710745) acc1: 82.812500 (83.104000) acc5: 96.875000 (96.800000) time: 0.442600 data: 0.000100 max mem: 18817 Test: Total time: 0:00:20 (0.421626 s / it) * Acc@1 83.428 Acc@5 96.720 loss 0.718 Max accuracy: 83.43% Epoch: [235/300] [ 0/1251] eta: 0:45:31 lr: 0.000239 loss: 2.734295 (2.734295) time: 2.183647 data: 1.294255 max mem: 18817 Epoch: [235/300] [ 50/1251] eta: 0:19:53 lr: 0.000239 loss: 2.895398 (2.694950) time: 0.971406 data: 0.000171 max mem: 18817 Epoch: [235/300] [ 100/1251] eta: 0:18:42 lr: 0.000239 loss: 2.444218 (2.699437) time: 0.968967 data: 0.000170 max mem: 18817 Epoch: [235/300] [ 150/1251] eta: 0:17:42 lr: 0.000239 loss: 2.665158 (2.709748) time: 0.981011 data: 0.000160 max mem: 18817 Epoch: [235/300] [ 200/1251] eta: 0:16:54 lr: 0.000238 loss: 2.985818 (2.729883) time: 0.932423 data: 0.000175 max mem: 18817 Epoch: [235/300] [ 250/1251] eta: 0:15:58 lr: 0.000238 loss: 2.915086 (2.733900) time: 0.908887 data: 0.000162 max mem: 18817 Epoch: [235/300] [ 300/1251] eta: 0:15:10 lr: 0.000238 loss: 2.855463 (2.745553) time: 0.919662 data: 0.000174 max mem: 18817 Epoch: [235/300] [ 350/1251] eta: 0:14:22 lr: 0.000238 loss: 2.683372 (2.754028) time: 0.916411 data: 0.000176 max mem: 18817 Epoch: [235/300] [ 400/1251] eta: 0:13:35 lr: 0.000237 loss: 2.740393 (2.747499) time: 0.970152 data: 0.000179 max mem: 18817 Epoch: [235/300] [ 450/1251] eta: 0:12:45 lr: 0.000237 loss: 2.558941 (2.742223) time: 0.962892 data: 0.000183 max mem: 18817 Epoch: [235/300] [ 500/1251] eta: 0:11:56 lr: 0.000237 loss: 2.938431 (2.740961) time: 0.911583 data: 0.000179 max mem: 18817 Epoch: [235/300] [ 550/1251] eta: 0:11:09 lr: 0.000237 loss: 2.796242 (2.748686) time: 0.921557 data: 0.000171 max mem: 18817 Epoch: [235/300] [ 600/1251] eta: 0:10:21 lr: 0.000236 loss: 2.851679 (2.751427) time: 0.964503 data: 0.000176 max mem: 18817 Epoch: [235/300] [ 650/1251] eta: 0:09:33 lr: 0.000236 loss: 2.760331 (2.744266) time: 1.019086 data: 0.000186 max mem: 18817 Epoch: [235/300] [ 700/1251] eta: 0:08:45 lr: 0.000236 loss: 2.809439 (2.742366) time: 0.976728 data: 0.000163 max mem: 18817 Epoch: [235/300] [ 750/1251] eta: 0:07:56 lr: 0.000236 loss: 2.982991 (2.744649) time: 0.907806 data: 0.000165 max mem: 18817 Epoch: [235/300] [ 800/1251] eta: 0:07:09 lr: 0.000235 loss: 2.948828 (2.750775) time: 0.923373 data: 0.000177 max mem: 18817 Epoch: [235/300] [ 850/1251] eta: 0:06:21 lr: 0.000235 loss: 2.638309 (2.748857) time: 0.969529 data: 0.000182 max mem: 18817 Epoch: [235/300] [ 900/1251] eta: 0:05:34 lr: 0.000235 loss: 2.655785 (2.743943) time: 1.047758 data: 0.000189 max mem: 18817 Epoch: [235/300] [ 950/1251] eta: 0:04:46 lr: 0.000234 loss: 3.047220 (2.745613) time: 0.976849 data: 0.000179 max mem: 18817 Epoch: [235/300] [1000/1251] eta: 0:03:58 lr: 0.000234 loss: 2.976544 (2.749794) time: 0.917327 data: 0.000165 max mem: 18817 Epoch: [235/300] [1050/1251] eta: 0:03:11 lr: 0.000234 loss: 2.937709 (2.752540) time: 0.915856 data: 0.000166 max mem: 18817 Epoch: [235/300] [1100/1251] eta: 0:02:23 lr: 0.000234 loss: 2.856287 (2.753774) time: 0.966763 data: 0.000169 max mem: 18817 Epoch: [235/300] [1150/1251] eta: 0:01:36 lr: 0.000233 loss: 2.743786 (2.746970) time: 0.994862 data: 0.000194 max mem: 18817 Epoch: [235/300] [1200/1251] eta: 0:00:48 lr: 0.000233 loss: 2.655482 (2.741939) time: 0.949188 data: 0.000168 max mem: 18817 Epoch: [235/300] [1250/1251] eta: 0:00:00 lr: 0.000233 loss: 2.680362 (2.740473) time: 0.915571 data: 0.000766 max mem: 18817 Epoch: [235/300] Total time: 0:19:50 (0.951789 s / it) Averaged stats: lr: 0.000233 loss: 2.680362 (2.743114) Test: [ 0/49] eta: 0:01:15 loss: 0.545461 (0.545461) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.535322 data: 1.083957 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.619290 (0.676437) acc1: 84.375000 (84.943182) acc5: 98.437500 (96.875000) time: 0.468739 data: 0.098709 max mem: 18817 Test: [20/49] eta: 0:00:11 loss: 0.751610 (0.710729) acc1: 84.375000 (83.854167) acc5: 96.875000 (96.875000) time: 0.357142 data: 0.000149 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.738009 (0.709781) acc1: 82.812500 (83.467742) acc5: 96.875000 (96.925403) time: 0.454707 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.708411 (0.722760) acc1: 82.812500 (83.079268) acc5: 96.875000 (96.836890) time: 0.451513 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.723813 (0.719568) acc1: 82.812500 (83.136000) acc5: 96.875000 (96.832000) time: 0.404773 data: 0.000102 max mem: 18817 Test: Total time: 0:00:20 (0.419887 s / it) * Acc@1 83.474 Acc@5 96.772 loss 0.725 Max accuracy: 83.47% Epoch: [236/300] [ 0/1251] eta: 0:41:17 lr: 0.000233 loss: 2.957434 (2.957434) time: 1.980649 data: 1.111017 max mem: 18817 Epoch: [236/300] [ 50/1251] eta: 0:19:33 lr: 0.000233 loss: 2.725884 (2.735753) time: 0.966428 data: 0.000201 max mem: 18817 Epoch: [236/300] [ 100/1251] eta: 0:18:18 lr: 0.000232 loss: 2.914537 (2.809394) time: 0.955783 data: 0.000179 max mem: 18817 Epoch: [236/300] [ 150/1251] eta: 0:17:22 lr: 0.000232 loss: 2.670851 (2.768574) time: 0.910320 data: 0.000176 max mem: 18817 Epoch: [236/300] [ 200/1251] eta: 0:16:38 lr: 0.000232 loss: 2.846690 (2.745102) time: 0.914331 data: 0.000188 max mem: 18817 Epoch: [236/300] [ 250/1251] eta: 0:15:52 lr: 0.000232 loss: 2.834107 (2.732519) time: 0.913092 data: 0.000184 max mem: 18817 Epoch: [236/300] [ 300/1251] eta: 0:15:04 lr: 0.000231 loss: 2.861147 (2.745000) time: 0.946211 data: 0.000200 max mem: 18817 Epoch: [236/300] [ 350/1251] eta: 0:14:15 lr: 0.000231 loss: 2.781335 (2.737349) time: 0.960340 data: 0.000147 max mem: 18817 Epoch: [236/300] [ 400/1251] eta: 0:13:26 lr: 0.000231 loss: 2.443955 (2.731714) time: 0.922230 data: 0.000177 max mem: 18817 Epoch: [236/300] [ 450/1251] eta: 0:12:40 lr: 0.000231 loss: 2.426485 (2.717237) time: 0.918379 data: 0.000181 max mem: 18817 Epoch: [236/300] [ 500/1251] eta: 0:11:53 lr: 0.000230 loss: 2.770559 (2.717346) time: 0.969842 data: 0.000178 max mem: 18817 Epoch: [236/300] [ 550/1251] eta: 0:11:05 lr: 0.000230 loss: 2.961378 (2.729112) time: 1.008260 data: 0.000174 max mem: 18817 Epoch: [236/300] [ 600/1251] eta: 0:10:17 lr: 0.000230 loss: 2.963217 (2.732358) time: 0.967751 data: 0.000159 max mem: 18817 Epoch: [236/300] [ 650/1251] eta: 0:09:30 lr: 0.000230 loss: 2.547682 (2.731492) time: 0.924107 data: 0.000187 max mem: 18817 Epoch: [236/300] [ 700/1251] eta: 0:08:43 lr: 0.000229 loss: 2.704659 (2.728316) time: 0.925573 data: 0.000181 max mem: 18817 Epoch: [236/300] [ 750/1251] eta: 0:07:55 lr: 0.000229 loss: 2.679265 (2.733409) time: 0.959812 data: 0.000165 max mem: 18817 Epoch: [236/300] [ 800/1251] eta: 0:07:08 lr: 0.000229 loss: 2.663871 (2.732236) time: 1.045037 data: 0.000198 max mem: 18817 Epoch: [236/300] [ 850/1251] eta: 0:06:20 lr: 0.000229 loss: 2.694277 (2.726551) time: 0.968039 data: 0.000188 max mem: 18817 Epoch: [236/300] [ 900/1251] eta: 0:05:33 lr: 0.000228 loss: 2.788601 (2.721933) time: 0.909167 data: 0.000179 max mem: 18817 Epoch: [236/300] [ 950/1251] eta: 0:04:45 lr: 0.000228 loss: 2.824965 (2.732356) time: 0.910895 data: 0.000179 max mem: 18817 Epoch: [236/300] [1000/1251] eta: 0:03:58 lr: 0.000228 loss: 2.967760 (2.734026) time: 0.973915 data: 0.000168 max mem: 18817 Epoch: [236/300] [1050/1251] eta: 0:03:11 lr: 0.000228 loss: 2.806737 (2.731508) time: 1.057621 data: 0.000204 max mem: 18817 Epoch: [236/300] [1100/1251] eta: 0:02:23 lr: 0.000227 loss: 2.872808 (2.735984) time: 0.970453 data: 0.000184 max mem: 18817 Epoch: [236/300] [1150/1251] eta: 0:01:35 lr: 0.000227 loss: 2.711724 (2.733011) time: 0.917188 data: 0.000179 max mem: 18817 Epoch: [236/300] [1200/1251] eta: 0:00:48 lr: 0.000227 loss: 2.775668 (2.734863) time: 0.907994 data: 0.000164 max mem: 18817 Epoch: [236/300] [1250/1251] eta: 0:00:00 lr: 0.000227 loss: 2.702044 (2.731307) time: 0.965407 data: 0.000770 max mem: 18817 Epoch: [236/300] Total time: 0:19:49 (0.950808 s / it) Averaged stats: lr: 0.000227 loss: 2.702044 (2.727328) Test: [ 0/49] eta: 0:01:26 loss: 0.462211 (0.462211) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.771668 data: 1.351127 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.601979 (0.663848) acc1: 85.937500 (84.659091) acc5: 96.875000 (96.590909) time: 0.497876 data: 0.122973 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.766952 (0.698786) acc1: 82.812500 (83.779762) acc5: 96.875000 (96.428571) time: 0.361146 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.694120 (0.694633) acc1: 82.812500 (83.518145) acc5: 96.875000 (96.774194) time: 0.353167 data: 0.000142 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.694120 (0.709003) acc1: 82.812500 (83.346037) acc5: 96.875000 (96.875000) time: 0.353485 data: 0.000160 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.703724 (0.706794) acc1: 82.812500 (83.488000) acc5: 96.875000 (96.864000) time: 0.348470 data: 0.000131 max mem: 18817 Test: Total time: 0:00:18 (0.385949 s / it) * Acc@1 83.510 Acc@5 96.658 loss 0.719 Max accuracy: 83.51% Epoch: [237/300] [ 0/1251] eta: 0:43:47 lr: 0.000227 loss: 3.163844 (3.163844) time: 2.099942 data: 1.224477 max mem: 18817 Epoch: [237/300] [ 50/1251] eta: 0:18:57 lr: 0.000226 loss: 2.594428 (2.644537) time: 0.948001 data: 0.000160 max mem: 18817 Epoch: [237/300] [ 100/1251] eta: 0:18:04 lr: 0.000226 loss: 2.989694 (2.720413) time: 0.912574 data: 0.000169 max mem: 18817 Epoch: [237/300] [ 150/1251] eta: 0:17:26 lr: 0.000226 loss: 2.904355 (2.747849) time: 0.920630 data: 0.000163 max mem: 18817 Epoch: [237/300] [ 200/1251] eta: 0:16:41 lr: 0.000226 loss: 2.746785 (2.774733) time: 0.922871 data: 0.000166 max mem: 18817 Epoch: [237/300] [ 250/1251] eta: 0:15:55 lr: 0.000225 loss: 2.552675 (2.772328) time: 0.967863 data: 0.000173 max mem: 18817 Epoch: [237/300] [ 300/1251] eta: 0:15:04 lr: 0.000225 loss: 2.661386 (2.751408) time: 0.964635 data: 0.000164 max mem: 18817 Epoch: [237/300] [ 350/1251] eta: 0:14:16 lr: 0.000225 loss: 2.603832 (2.728425) time: 0.904477 data: 0.000171 max mem: 18817 Epoch: [237/300] [ 400/1251] eta: 0:13:29 lr: 0.000225 loss: 2.757714 (2.714649) time: 0.927971 data: 0.000170 max mem: 18817 Epoch: [237/300] [ 450/1251] eta: 0:12:42 lr: 0.000224 loss: 2.870936 (2.717820) time: 0.974674 data: 0.000167 max mem: 18817 Epoch: [237/300] [ 500/1251] eta: 0:11:55 lr: 0.000224 loss: 2.929365 (2.717820) time: 0.954576 data: 0.000185 max mem: 18817 Epoch: [237/300] [ 550/1251] eta: 0:11:07 lr: 0.000224 loss: 2.901226 (2.713651) time: 0.969765 data: 0.000177 max mem: 18817 Epoch: [237/300] [ 600/1251] eta: 0:10:19 lr: 0.000224 loss: 2.892795 (2.726558) time: 0.915325 data: 0.000181 max mem: 18817 Epoch: [237/300] [ 650/1251] eta: 0:09:32 lr: 0.000223 loss: 2.317504 (2.720293) time: 0.912667 data: 0.000185 max mem: 18817 Epoch: [237/300] [ 700/1251] eta: 0:08:44 lr: 0.000223 loss: 2.705674 (2.718154) time: 0.967561 data: 0.000172 max mem: 18817 Epoch: [237/300] [ 750/1251] eta: 0:07:57 lr: 0.000223 loss: 2.531492 (2.712547) time: 0.960344 data: 0.000170 max mem: 18817 Epoch: [237/300] [ 800/1251] eta: 0:07:09 lr: 0.000223 loss: 2.794141 (2.719320) time: 0.992136 data: 0.000189 max mem: 18817 Epoch: [237/300] [ 850/1251] eta: 0:06:21 lr: 0.000222 loss: 2.729855 (2.718053) time: 0.910887 data: 0.000172 max mem: 18817 Epoch: [237/300] [ 900/1251] eta: 0:05:34 lr: 0.000222 loss: 2.646620 (2.712784) time: 0.915039 data: 0.000175 max mem: 18817 Epoch: [237/300] [ 950/1251] eta: 0:04:46 lr: 0.000222 loss: 2.610631 (2.712517) time: 0.957620 data: 0.000180 max mem: 18817 Epoch: [237/300] [1000/1251] eta: 0:03:58 lr: 0.000222 loss: 2.607828 (2.708736) time: 0.907147 data: 0.000166 max mem: 18817 Epoch: [237/300] [1050/1251] eta: 0:03:11 lr: 0.000221 loss: 2.799437 (2.711040) time: 0.916877 data: 0.000170 max mem: 18817 Epoch: [237/300] [1100/1251] eta: 0:02:23 lr: 0.000221 loss: 2.644224 (2.708728) time: 0.907034 data: 0.000163 max mem: 18817 Epoch: [237/300] [1150/1251] eta: 0:01:36 lr: 0.000221 loss: 2.711828 (2.708990) time: 0.954859 data: 0.000186 max mem: 18817 Epoch: [237/300] [1200/1251] eta: 0:00:48 lr: 0.000221 loss: 2.671323 (2.709430) time: 0.981083 data: 0.000180 max mem: 18817 Epoch: [237/300] [1250/1251] eta: 0:00:00 lr: 0.000220 loss: 2.920971 (2.714887) time: 0.923240 data: 0.000758 max mem: 18817 Epoch: [237/300] Total time: 0:19:48 (0.950380 s / it) Averaged stats: lr: 0.000220 loss: 2.920971 (2.717074) Test: [ 0/49] eta: 0:01:20 loss: 0.485806 (0.485806) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.648820 data: 1.135559 max mem: 18817 Test: [10/49] eta: 0:00:25 loss: 0.551765 (0.664673) acc1: 84.375000 (85.795455) acc5: 98.437500 (96.448864) time: 0.650230 data: 0.103387 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.760466 (0.700521) acc1: 84.375000 (84.523810) acc5: 96.875000 (96.577381) time: 0.450742 data: 0.000152 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.731894 (0.713906) acc1: 81.250000 (83.618952) acc5: 96.875000 (96.673387) time: 0.351505 data: 0.000139 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.727015 (0.727402) acc1: 82.812500 (83.650915) acc5: 96.875000 (96.646341) time: 0.349104 data: 0.000143 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.738943 (0.727685) acc1: 82.812500 (83.552000) acc5: 96.875000 (96.768000) time: 0.344071 data: 0.000116 max mem: 18817 Test: Total time: 0:00:20 (0.417655 s / it) * Acc@1 83.446 Acc@5 96.634 loss 0.739 Max accuracy: 83.51% Epoch: [238/300] [ 0/1251] eta: 0:40:40 lr: 0.000220 loss: 3.053078 (3.053078) time: 1.951238 data: 1.089776 max mem: 18817 Epoch: [238/300] [ 50/1251] eta: 0:19:43 lr: 0.000220 loss: 2.605229 (2.679224) time: 0.985068 data: 0.000169 max mem: 18817 Epoch: [238/300] [ 100/1251] eta: 0:18:13 lr: 0.000220 loss: 2.728641 (2.674867) time: 0.905549 data: 0.000177 max mem: 18817 Epoch: [238/300] [ 150/1251] eta: 0:17:27 lr: 0.000220 loss: 2.699440 (2.683349) time: 0.909050 data: 0.000150 max mem: 18817 Epoch: [238/300] [ 200/1251] eta: 0:16:39 lr: 0.000219 loss: 2.704004 (2.688483) time: 0.951939 data: 0.000193 max mem: 18817 Epoch: [238/300] [ 250/1251] eta: 0:15:56 lr: 0.000219 loss: 2.672792 (2.707542) time: 0.970122 data: 0.000176 max mem: 18817 Epoch: [238/300] [ 300/1251] eta: 0:15:05 lr: 0.000219 loss: 2.710140 (2.702907) time: 0.967797 data: 0.000181 max mem: 18817 Epoch: [238/300] [ 350/1251] eta: 0:14:16 lr: 0.000219 loss: 2.691284 (2.706148) time: 0.933922 data: 0.000186 max mem: 18817 Epoch: [238/300] [ 400/1251] eta: 0:13:29 lr: 0.000218 loss: 2.615267 (2.708524) time: 0.911375 data: 0.000168 max mem: 18817 Epoch: [238/300] [ 450/1251] eta: 0:12:43 lr: 0.000218 loss: 2.758494 (2.703679) time: 0.970946 data: 0.000172 max mem: 18817 Epoch: [238/300] [ 500/1251] eta: 0:11:56 lr: 0.000218 loss: 2.701833 (2.711008) time: 1.001260 data: 0.000178 max mem: 18817 Epoch: [238/300] [ 550/1251] eta: 0:11:08 lr: 0.000218 loss: 2.811514 (2.705379) time: 0.967250 data: 0.000175 max mem: 18817 Epoch: [238/300] [ 600/1251] eta: 0:10:19 lr: 0.000217 loss: 2.811694 (2.705385) time: 0.919183 data: 0.000180 max mem: 18817 Epoch: [238/300] [ 650/1251] eta: 0:09:32 lr: 0.000217 loss: 2.739600 (2.710184) time: 0.908537 data: 0.000181 max mem: 18817 Epoch: [238/300] [ 700/1251] eta: 0:08:45 lr: 0.000217 loss: 2.736903 (2.716219) time: 0.975667 data: 0.000165 max mem: 18817 Epoch: [238/300] [ 750/1251] eta: 0:07:58 lr: 0.000217 loss: 2.582662 (2.715630) time: 1.011665 data: 0.000190 max mem: 18817 Epoch: [238/300] [ 800/1251] eta: 0:07:10 lr: 0.000216 loss: 2.783106 (2.714835) time: 0.970272 data: 0.000169 max mem: 18817 Epoch: [238/300] [ 850/1251] eta: 0:06:22 lr: 0.000216 loss: 2.837789 (2.712525) time: 0.914675 data: 0.000175 max mem: 18817 Epoch: [238/300] [ 900/1251] eta: 0:05:34 lr: 0.000216 loss: 2.730628 (2.713881) time: 0.920586 data: 0.000169 max mem: 18817 Epoch: [238/300] [ 950/1251] eta: 0:04:46 lr: 0.000216 loss: 3.027949 (2.710822) time: 0.976465 data: 0.000176 max mem: 18817 Epoch: [238/300] [1000/1251] eta: 0:03:58 lr: 0.000215 loss: 2.933398 (2.711490) time: 0.948767 data: 0.000163 max mem: 18817 Epoch: [238/300] [1050/1251] eta: 0:03:11 lr: 0.000215 loss: 2.763277 (2.716633) time: 0.978523 data: 0.000185 max mem: 18817 Epoch: [238/300] [1100/1251] eta: 0:02:23 lr: 0.000215 loss: 2.580458 (2.713725) time: 0.919903 data: 0.000174 max mem: 18817 Epoch: [238/300] [1150/1251] eta: 0:01:36 lr: 0.000215 loss: 2.906616 (2.715757) time: 0.905658 data: 0.000177 max mem: 18817 Epoch: [238/300] [1200/1251] eta: 0:00:48 lr: 0.000214 loss: 2.765762 (2.712988) time: 0.965646 data: 0.000166 max mem: 18817 Epoch: [238/300] [1250/1251] eta: 0:00:00 lr: 0.000214 loss: 2.754022 (2.711013) time: 0.981132 data: 0.000766 max mem: 18817 Epoch: [238/300] Total time: 0:19:51 (0.952121 s / it) Averaged stats: lr: 0.000214 loss: 2.754022 (2.710656) Test: [ 0/49] eta: 0:01:26 loss: 0.488568 (0.488568) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.771028 data: 1.386329 max mem: 18817 Test: [10/49] eta: 0:00:26 loss: 0.550891 (0.640401) acc1: 84.375000 (84.659091) acc5: 98.437500 (96.875000) time: 0.669682 data: 0.126164 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.732584 (0.677311) acc1: 84.375000 (83.407738) acc5: 96.875000 (96.949405) time: 0.455951 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.672049 (0.681182) acc1: 81.250000 (83.266129) acc5: 96.875000 (97.127016) time: 0.352521 data: 0.000145 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.672049 (0.690842) acc1: 81.250000 (83.193598) acc5: 96.875000 (97.103659) time: 0.349795 data: 0.000159 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.705193 (0.690118) acc1: 82.812500 (83.264000) acc5: 96.875000 (97.088000) time: 0.344420 data: 0.000134 max mem: 18817 Test: Total time: 0:00:20 (0.423524 s / it) * Acc@1 83.362 Acc@5 96.704 loss 0.704 Max accuracy: 83.51% Epoch: [239/300] [ 0/1251] eta: 0:40:44 lr: 0.000214 loss: 3.119576 (3.119576) time: 1.953848 data: 1.083773 max mem: 18817 Epoch: [239/300] [ 50/1251] eta: 0:19:11 lr: 0.000214 loss: 2.677662 (2.692576) time: 0.905363 data: 0.000188 max mem: 18817 Epoch: [239/300] [ 100/1251] eta: 0:18:27 lr: 0.000214 loss: 2.750098 (2.678616) time: 0.910505 data: 0.000162 max mem: 18817 Epoch: [239/300] [ 150/1251] eta: 0:17:32 lr: 0.000213 loss: 2.507843 (2.720541) time: 0.956397 data: 0.000180 max mem: 18817 Epoch: [239/300] [ 200/1251] eta: 0:16:47 lr: 0.000213 loss: 2.719038 (2.734892) time: 1.041421 data: 0.000173 max mem: 18817 Epoch: [239/300] [ 250/1251] eta: 0:15:56 lr: 0.000213 loss: 2.788477 (2.711783) time: 0.980512 data: 0.000179 max mem: 18817 Epoch: [239/300] [ 300/1251] eta: 0:15:05 lr: 0.000213 loss: 2.730024 (2.705929) time: 0.916944 data: 0.000161 max mem: 18817 Epoch: [239/300] [ 350/1251] eta: 0:14:19 lr: 0.000212 loss: 2.808869 (2.712601) time: 0.915996 data: 0.000150 max mem: 18817 Epoch: [239/300] [ 400/1251] eta: 0:13:32 lr: 0.000212 loss: 2.932784 (2.714178) time: 0.962159 data: 0.000164 max mem: 18817 Epoch: [239/300] [ 450/1251] eta: 0:12:45 lr: 0.000212 loss: 2.687233 (2.701066) time: 1.023599 data: 0.000165 max mem: 18817 Epoch: [239/300] [ 500/1251] eta: 0:11:56 lr: 0.000212 loss: 2.624804 (2.698338) time: 0.976695 data: 0.000196 max mem: 18817 Epoch: [239/300] [ 550/1251] eta: 0:11:07 lr: 0.000211 loss: 2.673009 (2.691639) time: 0.927361 data: 0.000168 max mem: 18817 Epoch: [239/300] [ 600/1251] eta: 0:10:20 lr: 0.000211 loss: 2.934071 (2.702680) time: 0.907491 data: 0.000180 max mem: 18817 Epoch: [239/300] [ 650/1251] eta: 0:09:32 lr: 0.000211 loss: 2.596090 (2.705545) time: 0.971200 data: 0.000183 max mem: 18817 Epoch: [239/300] [ 700/1251] eta: 0:08:44 lr: 0.000211 loss: 2.804903 (2.709524) time: 0.962552 data: 0.000175 max mem: 18817 Epoch: [239/300] [ 750/1251] eta: 0:07:57 lr: 0.000210 loss: 2.878564 (2.717110) time: 0.972948 data: 0.000169 max mem: 18817 Epoch: [239/300] [ 800/1251] eta: 0:07:09 lr: 0.000210 loss: 2.633961 (2.706880) time: 0.911736 data: 0.000182 max mem: 18817 Epoch: [239/300] [ 850/1251] eta: 0:06:21 lr: 0.000210 loss: 2.767131 (2.705236) time: 0.917717 data: 0.000192 max mem: 18817 Epoch: [239/300] [ 900/1251] eta: 0:05:34 lr: 0.000210 loss: 2.972725 (2.705960) time: 0.960381 data: 0.000190 max mem: 18817 Epoch: [239/300] [ 950/1251] eta: 0:04:46 lr: 0.000209 loss: 2.906392 (2.705332) time: 0.956665 data: 0.000183 max mem: 18817 Epoch: [239/300] [1000/1251] eta: 0:03:58 lr: 0.000209 loss: 2.943653 (2.706415) time: 0.931022 data: 0.000175 max mem: 18817 Epoch: [239/300] [1050/1251] eta: 0:03:11 lr: 0.000209 loss: 2.749881 (2.702228) time: 0.917497 data: 0.000187 max mem: 18817 Epoch: [239/300] [1100/1251] eta: 0:02:23 lr: 0.000209 loss: 2.706683 (2.698758) time: 0.958121 data: 0.000173 max mem: 18817 Epoch: [239/300] [1150/1251] eta: 0:01:36 lr: 0.000208 loss: 2.781176 (2.698578) time: 0.965244 data: 0.000177 max mem: 18817 Epoch: [239/300] [1200/1251] eta: 0:00:48 lr: 0.000208 loss: 2.771035 (2.698506) time: 0.958993 data: 0.000176 max mem: 18817 Epoch: [239/300] [1250/1251] eta: 0:00:00 lr: 0.000208 loss: 2.722816 (2.698285) time: 0.907633 data: 0.000759 max mem: 18817 Epoch: [239/300] Total time: 0:19:49 (0.950519 s / it) Averaged stats: lr: 0.000208 loss: 2.722816 (2.699682) Test: [ 0/49] eta: 0:01:26 loss: 0.545420 (0.545420) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.771017 data: 1.390572 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.545420 (0.671720) acc1: 85.937500 (84.943182) acc5: 98.437500 (96.448864) time: 0.486542 data: 0.126546 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.718408 (0.697909) acc1: 82.812500 (83.630952) acc5: 96.875000 (96.577381) time: 0.355504 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.690987 (0.698624) acc1: 82.812500 (83.316532) acc5: 96.875000 (96.875000) time: 0.352163 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.697639 (0.711773) acc1: 82.812500 (83.155488) acc5: 96.875000 (96.875000) time: 0.349244 data: 0.000136 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.715225 (0.709471) acc1: 82.812500 (83.264000) acc5: 96.875000 (96.832000) time: 0.350371 data: 0.000113 max mem: 18817 Test: Total time: 0:00:18 (0.384376 s / it) * Acc@1 83.298 Acc@5 96.662 loss 0.714 Max accuracy: 83.51% Epoch: [240/300] [ 0/1251] eta: 0:43:17 lr: 0.000208 loss: 2.040854 (2.040854) time: 2.076545 data: 1.202698 max mem: 18817 Epoch: [240/300] [ 50/1251] eta: 0:19:25 lr: 0.000208 loss: 2.310887 (2.574101) time: 1.012685 data: 0.000180 max mem: 18817 Epoch: [240/300] [ 100/1251] eta: 0:18:17 lr: 0.000207 loss: 2.909893 (2.616611) time: 0.968782 data: 0.000176 max mem: 18817 Epoch: [240/300] [ 150/1251] eta: 0:17:25 lr: 0.000207 loss: 2.692513 (2.661735) time: 0.915988 data: 0.000180 max mem: 18817 Epoch: [240/300] [ 200/1251] eta: 0:16:40 lr: 0.000207 loss: 2.626617 (2.677623) time: 0.924547 data: 0.000185 max mem: 18817 Epoch: [240/300] [ 250/1251] eta: 0:15:54 lr: 0.000207 loss: 2.751416 (2.667863) time: 0.973042 data: 0.000178 max mem: 18817 Epoch: [240/300] [ 300/1251] eta: 0:15:09 lr: 0.000206 loss: 2.891408 (2.682982) time: 1.038839 data: 0.000177 max mem: 18817 Epoch: [240/300] [ 350/1251] eta: 0:14:19 lr: 0.000206 loss: 2.565318 (2.687981) time: 0.971284 data: 0.000165 max mem: 18817 Epoch: [240/300] [ 400/1251] eta: 0:13:29 lr: 0.000206 loss: 2.660901 (2.683418) time: 0.912289 data: 0.000176 max mem: 18817 Epoch: [240/300] [ 450/1251] eta: 0:12:43 lr: 0.000206 loss: 2.833829 (2.685881) time: 0.923361 data: 0.000170 max mem: 18817 Epoch: [240/300] [ 500/1251] eta: 0:11:56 lr: 0.000205 loss: 2.644981 (2.684692) time: 0.943473 data: 0.000181 max mem: 18817 Epoch: [240/300] [ 550/1251] eta: 0:11:09 lr: 0.000205 loss: 2.584775 (2.678338) time: 1.052829 data: 0.000176 max mem: 18817 Epoch: [240/300] [ 600/1251] eta: 0:10:20 lr: 0.000205 loss: 2.696852 (2.680854) time: 0.953348 data: 0.000169 max mem: 18817 Epoch: [240/300] [ 650/1251] eta: 0:09:32 lr: 0.000205 loss: 2.801475 (2.687600) time: 0.906867 data: 0.000179 max mem: 18817 Epoch: [240/300] [ 700/1251] eta: 0:08:44 lr: 0.000205 loss: 2.823399 (2.695796) time: 0.915043 data: 0.000169 max mem: 18817 Epoch: [240/300] [ 750/1251] eta: 0:07:57 lr: 0.000204 loss: 2.942194 (2.699088) time: 1.006025 data: 0.000171 max mem: 18817 Epoch: [240/300] [ 800/1251] eta: 0:07:09 lr: 0.000204 loss: 2.918025 (2.697572) time: 0.985192 data: 0.000182 max mem: 18817 Epoch: [240/300] [ 850/1251] eta: 0:06:22 lr: 0.000204 loss: 2.923276 (2.704405) time: 0.972914 data: 0.000171 max mem: 18817 Epoch: [240/300] [ 900/1251] eta: 0:05:34 lr: 0.000204 loss: 2.904288 (2.705561) time: 0.994002 data: 0.000180 max mem: 18817 Epoch: [240/300] [ 950/1251] eta: 0:04:46 lr: 0.000203 loss: 2.765212 (2.706062) time: 0.907310 data: 0.000169 max mem: 18817 Epoch: [240/300] [1000/1251] eta: 0:03:59 lr: 0.000203 loss: 2.714428 (2.705537) time: 0.912575 data: 0.000178 max mem: 18817 Epoch: [240/300] [1050/1251] eta: 0:03:11 lr: 0.000203 loss: 2.847896 (2.705488) time: 0.975383 data: 0.000182 max mem: 18817 Epoch: [240/300] [1100/1251] eta: 0:02:23 lr: 0.000203 loss: 2.828379 (2.704593) time: 0.969811 data: 0.000185 max mem: 18817 Epoch: [240/300] [1150/1251] eta: 0:01:36 lr: 0.000202 loss: 2.732911 (2.702559) time: 0.990127 data: 0.000175 max mem: 18817 Epoch: [240/300] [1200/1251] eta: 0:00:48 lr: 0.000202 loss: 2.855342 (2.705411) time: 0.912522 data: 0.000184 max mem: 18817 Epoch: [240/300] [1250/1251] eta: 0:00:00 lr: 0.000202 loss: 2.511564 (2.706131) time: 0.980165 data: 0.000775 max mem: 18817 Epoch: [240/300] Total time: 0:19:52 (0.953510 s / it) Averaged stats: lr: 0.000202 loss: 2.511564 (2.710006) Test: [ 0/49] eta: 0:01:21 loss: 0.477348 (0.477348) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.665824 data: 1.255421 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.535782 (0.661218) acc1: 84.375000 (84.659091) acc5: 96.875000 (96.306818) time: 0.480938 data: 0.114285 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.734906 (0.690344) acc1: 82.812500 (83.482143) acc5: 96.875000 (96.354167) time: 0.372450 data: 0.000155 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.689160 (0.696824) acc1: 81.250000 (83.165323) acc5: 96.875000 (96.673387) time: 0.374556 data: 0.000137 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.673135 (0.707081) acc1: 82.812500 (83.231707) acc5: 96.875000 (96.684451) time: 0.356923 data: 0.000131 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.689160 (0.704175) acc1: 84.375000 (83.552000) acc5: 96.875000 (96.736000) time: 0.345046 data: 0.000111 max mem: 18817 Test: Total time: 0:00:19 (0.389687 s / it) * Acc@1 83.446 Acc@5 96.648 loss 0.711 Max accuracy: 83.51% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0240.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0240.pth Epoch: [241/300] [ 0/1251] eta: 0:42:20 lr: 0.000202 loss: 2.561518 (2.561518) time: 2.030626 data: 1.168068 max mem: 18817 Epoch: [241/300] [ 50/1251] eta: 0:19:14 lr: 0.000202 loss: 2.873194 (2.695739) time: 0.971892 data: 0.000174 max mem: 18817 Epoch: [241/300] [ 100/1251] eta: 0:18:09 lr: 0.000201 loss: 2.625505 (2.706190) time: 0.912130 data: 0.000188 max mem: 18817 Epoch: [241/300] [ 150/1251] eta: 0:17:24 lr: 0.000201 loss: 2.660389 (2.726288) time: 0.904652 data: 0.000183 max mem: 18817 Epoch: [241/300] [ 200/1251] eta: 0:16:39 lr: 0.000201 loss: 2.807631 (2.716380) time: 0.962155 data: 0.000171 max mem: 18817 Epoch: [241/300] [ 250/1251] eta: 0:15:53 lr: 0.000201 loss: 2.906418 (2.720196) time: 1.016748 data: 0.000174 max mem: 18817 Epoch: [241/300] [ 300/1251] eta: 0:15:03 lr: 0.000200 loss: 2.598124 (2.729350) time: 0.987025 data: 0.000177 max mem: 18817 Epoch: [241/300] [ 350/1251] eta: 0:14:13 lr: 0.000200 loss: 2.730941 (2.719462) time: 0.913410 data: 0.000173 max mem: 18817 Epoch: [241/300] [ 400/1251] eta: 0:13:26 lr: 0.000200 loss: 2.787483 (2.718527) time: 0.921635 data: 0.000175 max mem: 18817 Epoch: [241/300] [ 450/1251] eta: 0:12:40 lr: 0.000200 loss: 2.756422 (2.728576) time: 0.961674 data: 0.000162 max mem: 18817 Epoch: [241/300] [ 500/1251] eta: 0:11:52 lr: 0.000199 loss: 2.764474 (2.729938) time: 0.967618 data: 0.000179 max mem: 18817 Epoch: [241/300] [ 550/1251] eta: 0:11:05 lr: 0.000199 loss: 2.849684 (2.717373) time: 0.970650 data: 0.000168 max mem: 18817 Epoch: [241/300] [ 600/1251] eta: 0:10:18 lr: 0.000199 loss: 2.886302 (2.715989) time: 0.928983 data: 0.000192 max mem: 18817 Epoch: [241/300] [ 650/1251] eta: 0:09:30 lr: 0.000199 loss: 2.752331 (2.717101) time: 0.907648 data: 0.000172 max mem: 18817 Epoch: [241/300] [ 700/1251] eta: 0:08:43 lr: 0.000199 loss: 2.839748 (2.717547) time: 0.974216 data: 0.000285 max mem: 18817 Epoch: [241/300] [ 750/1251] eta: 0:07:55 lr: 0.000198 loss: 2.615483 (2.712644) time: 0.968917 data: 0.000171 max mem: 18817 Epoch: [241/300] [ 800/1251] eta: 0:07:08 lr: 0.000198 loss: 2.551854 (2.707011) time: 0.925767 data: 0.000159 max mem: 18817 Epoch: [241/300] [ 850/1251] eta: 0:06:20 lr: 0.000198 loss: 2.848639 (2.707061) time: 0.913284 data: 0.000181 max mem: 18817 Epoch: [241/300] [ 900/1251] eta: 0:05:33 lr: 0.000198 loss: 2.686212 (2.704064) time: 0.956278 data: 0.000168 max mem: 18817 Epoch: [241/300] [ 950/1251] eta: 0:04:46 lr: 0.000197 loss: 2.809967 (2.701109) time: 0.963527 data: 0.000180 max mem: 18817 Epoch: [241/300] [1000/1251] eta: 0:03:58 lr: 0.000197 loss: 2.952204 (2.706030) time: 0.966912 data: 0.000170 max mem: 18817 Epoch: [241/300] [1050/1251] eta: 0:03:10 lr: 0.000197 loss: 2.807145 (2.709023) time: 0.911700 data: 0.000171 max mem: 18817 Epoch: [241/300] [1100/1251] eta: 0:02:23 lr: 0.000197 loss: 2.931301 (2.712618) time: 0.936506 data: 0.000172 max mem: 18817 Epoch: [241/300] [1150/1251] eta: 0:01:35 lr: 0.000196 loss: 2.690027 (2.715389) time: 0.965708 data: 0.000165 max mem: 18817 Epoch: [241/300] [1200/1251] eta: 0:00:48 lr: 0.000196 loss: 2.538228 (2.714001) time: 0.989573 data: 0.000186 max mem: 18817 Epoch: [241/300] [1250/1251] eta: 0:00:00 lr: 0.000196 loss: 2.957010 (2.714930) time: 0.973812 data: 0.000757 max mem: 18817 Epoch: [241/300] Total time: 0:19:49 (0.950543 s / it) Averaged stats: lr: 0.000196 loss: 2.957010 (2.709050) Test: [ 0/49] eta: 0:01:30 loss: 0.509979 (0.509979) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.853363 data: 1.439069 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.561556 (0.652111) acc1: 84.375000 (84.375000) acc5: 98.437500 (96.590909) time: 0.491850 data: 0.130958 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.702336 (0.678589) acc1: 82.812500 (83.705357) acc5: 96.875000 (96.875000) time: 0.418725 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.671984 (0.679794) acc1: 81.250000 (83.417339) acc5: 96.875000 (96.975806) time: 0.447941 data: 0.000140 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.670807 (0.690335) acc1: 82.812500 (83.422256) acc5: 96.875000 (97.027439) time: 0.380500 data: 0.000147 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.670807 (0.691004) acc1: 82.812500 (83.296000) acc5: 96.875000 (97.056000) time: 0.344021 data: 0.000116 max mem: 18817 Test: Total time: 0:00:20 (0.421504 s / it) * Acc@1 83.464 Acc@5 96.662 loss 0.711 Max accuracy: 83.51% Epoch: [242/300] [ 0/1251] eta: 0:41:06 lr: 0.000196 loss: 3.086746 (3.086746) time: 1.971686 data: 1.110265 max mem: 18817 Epoch: [242/300] [ 50/1251] eta: 0:19:09 lr: 0.000196 loss: 2.621153 (2.665261) time: 0.924984 data: 0.000176 max mem: 18817 Epoch: [242/300] [ 100/1251] eta: 0:18:20 lr: 0.000195 loss: 2.793272 (2.677720) time: 0.918412 data: 0.000163 max mem: 18817 Epoch: [242/300] [ 150/1251] eta: 0:17:38 lr: 0.000195 loss: 2.627450 (2.669966) time: 1.001649 data: 0.000173 max mem: 18817 Epoch: [242/300] [ 200/1251] eta: 0:16:48 lr: 0.000195 loss: 2.698111 (2.646467) time: 1.011407 data: 0.000183 max mem: 18817 Epoch: [242/300] [ 250/1251] eta: 0:15:56 lr: 0.000195 loss: 2.692833 (2.650499) time: 0.956823 data: 0.000173 max mem: 18817 Epoch: [242/300] [ 300/1251] eta: 0:15:06 lr: 0.000195 loss: 2.857088 (2.670363) time: 0.919116 data: 0.000163 max mem: 18817 Epoch: [242/300] [ 350/1251] eta: 0:14:18 lr: 0.000194 loss: 2.806135 (2.672444) time: 0.910028 data: 0.000171 max mem: 18817 Epoch: [242/300] [ 400/1251] eta: 0:13:31 lr: 0.000194 loss: 2.820253 (2.667982) time: 0.962015 data: 0.000176 max mem: 18817 Epoch: [242/300] [ 450/1251] eta: 0:12:44 lr: 0.000194 loss: 2.756265 (2.671289) time: 1.033816 data: 0.000186 max mem: 18817 Epoch: [242/300] [ 500/1251] eta: 0:11:54 lr: 0.000194 loss: 2.677338 (2.668612) time: 0.923600 data: 0.000170 max mem: 18817 Epoch: [242/300] [ 550/1251] eta: 0:11:06 lr: 0.000193 loss: 2.507552 (2.665524) time: 0.917912 data: 0.000179 max mem: 18817 Epoch: [242/300] [ 600/1251] eta: 0:10:19 lr: 0.000193 loss: 2.785846 (2.660697) time: 0.916131 data: 0.000155 max mem: 18817 Epoch: [242/300] [ 650/1251] eta: 0:09:31 lr: 0.000193 loss: 2.577322 (2.663382) time: 0.946235 data: 0.000162 max mem: 18817 Epoch: [242/300] [ 700/1251] eta: 0:08:43 lr: 0.000193 loss: 2.909557 (2.670396) time: 0.954425 data: 0.000171 max mem: 18817 Epoch: [242/300] [ 750/1251] eta: 0:07:55 lr: 0.000192 loss: 2.777542 (2.672724) time: 0.912995 data: 0.000193 max mem: 18817 Epoch: [242/300] [ 800/1251] eta: 0:07:08 lr: 0.000192 loss: 2.606438 (2.673533) time: 0.915133 data: 0.000166 max mem: 18817 Epoch: [242/300] [ 850/1251] eta: 0:06:21 lr: 0.000192 loss: 2.677032 (2.672073) time: 0.918882 data: 0.000176 max mem: 18817 Epoch: [242/300] [ 900/1251] eta: 0:05:33 lr: 0.000192 loss: 2.983362 (2.674763) time: 0.974250 data: 0.000166 max mem: 18817 Epoch: [242/300] [ 950/1251] eta: 0:04:46 lr: 0.000191 loss: 2.751432 (2.679414) time: 0.960862 data: 0.000166 max mem: 18817 Epoch: [242/300] [1000/1251] eta: 0:03:58 lr: 0.000191 loss: 2.652667 (2.676285) time: 0.913758 data: 0.000169 max mem: 18817 Epoch: [242/300] [1050/1251] eta: 0:03:10 lr: 0.000191 loss: 2.732401 (2.680267) time: 0.910455 data: 0.000181 max mem: 18817 Epoch: [242/300] [1100/1251] eta: 0:02:23 lr: 0.000191 loss: 2.754954 (2.679604) time: 0.965042 data: 0.000181 max mem: 18817 Epoch: [242/300] [1150/1251] eta: 0:01:35 lr: 0.000191 loss: 2.722871 (2.683967) time: 1.015784 data: 0.000182 max mem: 18817 Epoch: [242/300] [1200/1251] eta: 0:00:48 lr: 0.000190 loss: 2.796257 (2.684206) time: 0.965630 data: 0.000169 max mem: 18817 Epoch: [242/300] [1250/1251] eta: 0:00:00 lr: 0.000190 loss: 2.604769 (2.685110) time: 0.913910 data: 0.000771 max mem: 18817 Epoch: [242/300] Total time: 0:19:47 (0.949587 s / it) Averaged stats: lr: 0.000190 loss: 2.604769 (2.689987) Test: [ 0/49] eta: 0:01:31 loss: 0.459657 (0.459657) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.870959 data: 1.486042 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.556629 (0.663210) acc1: 82.812500 (83.948864) acc5: 98.437500 (96.732955) time: 0.497361 data: 0.135268 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.737683 (0.699858) acc1: 81.250000 (82.812500) acc5: 95.312500 (96.577381) time: 0.431451 data: 0.000165 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.698231 (0.697383) acc1: 81.250000 (82.862903) acc5: 96.875000 (96.875000) time: 0.441343 data: 0.000153 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.677541 (0.711608) acc1: 82.812500 (82.736280) acc5: 96.875000 (96.913110) time: 0.363444 data: 0.000159 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.696070 (0.706364) acc1: 82.812500 (82.976000) acc5: 96.875000 (97.024000) time: 0.344443 data: 0.000139 max mem: 18817 Test: Total time: 0:00:20 (0.421665 s / it) * Acc@1 83.566 Acc@5 96.690 loss 0.712 Max accuracy: 83.57% Epoch: [243/300] [ 0/1251] eta: 0:41:11 lr: 0.000190 loss: 2.798192 (2.798192) time: 1.975298 data: 1.101546 max mem: 18817 Epoch: [243/300] [ 50/1251] eta: 0:19:30 lr: 0.000190 loss: 2.745008 (2.612513) time: 0.971366 data: 0.000172 max mem: 18817 Epoch: [243/300] [ 100/1251] eta: 0:18:37 lr: 0.000190 loss: 2.843278 (2.599584) time: 1.029770 data: 0.000167 max mem: 18817 Epoch: [243/300] [ 150/1251] eta: 0:17:36 lr: 0.000189 loss: 2.779964 (2.621484) time: 0.958899 data: 0.000168 max mem: 18817 Epoch: [243/300] [ 200/1251] eta: 0:16:42 lr: 0.000189 loss: 2.777972 (2.629103) time: 0.915157 data: 0.000180 max mem: 18817 Epoch: [243/300] [ 250/1251] eta: 0:15:56 lr: 0.000189 loss: 2.721251 (2.620775) time: 0.911808 data: 0.000173 max mem: 18817 Epoch: [243/300] [ 300/1251] eta: 0:15:10 lr: 0.000189 loss: 2.849251 (2.638261) time: 0.960994 data: 0.000164 max mem: 18817 Epoch: [243/300] [ 350/1251] eta: 0:14:20 lr: 0.000188 loss: 2.743780 (2.649852) time: 0.966520 data: 0.000177 max mem: 18817 Epoch: [243/300] [ 400/1251] eta: 0:13:30 lr: 0.000188 loss: 2.498450 (2.645471) time: 0.917859 data: 0.000187 max mem: 18817 Epoch: [243/300] [ 450/1251] eta: 0:12:43 lr: 0.000188 loss: 2.738458 (2.642098) time: 0.911041 data: 0.000174 max mem: 18817 Epoch: [243/300] [ 500/1251] eta: 0:11:56 lr: 0.000188 loss: 2.779420 (2.645623) time: 0.916019 data: 0.000178 max mem: 18817 Epoch: [243/300] [ 550/1251] eta: 0:11:09 lr: 0.000188 loss: 2.798487 (2.656423) time: 0.954151 data: 0.000163 max mem: 18817 Epoch: [243/300] [ 600/1251] eta: 0:10:20 lr: 0.000187 loss: 2.439365 (2.658797) time: 0.974223 data: 0.000174 max mem: 18817 Epoch: [243/300] [ 650/1251] eta: 0:09:32 lr: 0.000187 loss: 2.949185 (2.662000) time: 0.911835 data: 0.000172 max mem: 18817 Epoch: [243/300] [ 700/1251] eta: 0:08:44 lr: 0.000187 loss: 2.716021 (2.666771) time: 0.955463 data: 0.000169 max mem: 18817 Epoch: [243/300] [ 750/1251] eta: 0:07:56 lr: 0.000187 loss: 2.874568 (2.676205) time: 0.910883 data: 0.000175 max mem: 18817 Epoch: [243/300] [ 800/1251] eta: 0:07:09 lr: 0.000186 loss: 2.755429 (2.676379) time: 0.913755 data: 0.000172 max mem: 18817 Epoch: [243/300] [ 850/1251] eta: 0:06:22 lr: 0.000186 loss: 2.754054 (2.679574) time: 0.961017 data: 0.000169 max mem: 18817 Epoch: [243/300] [ 900/1251] eta: 0:05:34 lr: 0.000186 loss: 2.783115 (2.680174) time: 0.961804 data: 0.000182 max mem: 18817 Epoch: [243/300] [ 950/1251] eta: 0:04:46 lr: 0.000186 loss: 2.682279 (2.682993) time: 0.926273 data: 0.000177 max mem: 18817 Epoch: [243/300] [1000/1251] eta: 0:03:58 lr: 0.000185 loss: 2.747476 (2.680493) time: 0.912495 data: 0.000181 max mem: 18817 Epoch: [243/300] [1050/1251] eta: 0:03:11 lr: 0.000185 loss: 2.544011 (2.679665) time: 0.979699 data: 0.000170 max mem: 18817 Epoch: [243/300] [1100/1251] eta: 0:02:23 lr: 0.000185 loss: 2.817689 (2.680147) time: 0.957649 data: 0.000167 max mem: 18817 Epoch: [243/300] [1150/1251] eta: 0:01:36 lr: 0.000185 loss: 2.831070 (2.681960) time: 0.966082 data: 0.000172 max mem: 18817 Epoch: [243/300] [1200/1251] eta: 0:00:48 lr: 0.000185 loss: 2.675727 (2.683018) time: 0.972053 data: 0.000167 max mem: 18817 Epoch: [243/300] [1250/1251] eta: 0:00:00 lr: 0.000184 loss: 2.659130 (2.681754) time: 0.923779 data: 0.000759 max mem: 18817 Epoch: [243/300] Total time: 0:19:49 (0.951086 s / it) Averaged stats: lr: 0.000184 loss: 2.659130 (2.687719) Test: [ 0/49] eta: 0:01:30 loss: 0.483951 (0.483951) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.842530 data: 1.398010 max mem: 18817 Test: [10/49] eta: 0:00:25 loss: 0.518096 (0.662429) acc1: 84.375000 (84.659091) acc5: 98.437500 (96.448864) time: 0.664696 data: 0.128377 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.686172 (0.679141) acc1: 82.812500 (83.705357) acc5: 96.875000 (96.875000) time: 0.449965 data: 0.000774 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.673024 (0.685278) acc1: 82.812500 (83.266129) acc5: 96.875000 (96.925403) time: 0.352351 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.672812 (0.694181) acc1: 84.375000 (83.346037) acc5: 96.875000 (96.951220) time: 0.349399 data: 0.000123 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.684197 (0.692406) acc1: 84.375000 (83.552000) acc5: 96.875000 (96.928000) time: 0.344008 data: 0.000100 max mem: 18817 Test: Total time: 0:00:20 (0.422087 s / it) * Acc@1 83.506 Acc@5 96.658 loss 0.708 Max accuracy: 83.57% Epoch: [244/300] [ 0/1251] eta: 0:44:06 lr: 0.000184 loss: 3.265457 (3.265457) time: 2.115222 data: 1.251513 max mem: 18817 Epoch: [244/300] [ 50/1251] eta: 0:19:38 lr: 0.000184 loss: 2.740571 (2.643284) time: 0.963546 data: 0.000202 max mem: 18817 Epoch: [244/300] [ 100/1251] eta: 0:18:34 lr: 0.000184 loss: 2.596877 (2.663009) time: 1.021698 data: 0.000175 max mem: 18817 Epoch: [244/300] [ 150/1251] eta: 0:17:33 lr: 0.000184 loss: 2.757184 (2.675124) time: 0.961671 data: 0.000172 max mem: 18817 Epoch: [244/300] [ 200/1251] eta: 0:16:42 lr: 0.000183 loss: 2.775466 (2.669838) time: 0.917337 data: 0.000177 max mem: 18817 Epoch: [244/300] [ 250/1251] eta: 0:15:57 lr: 0.000183 loss: 2.817918 (2.676056) time: 0.945824 data: 0.000180 max mem: 18817 Epoch: [244/300] [ 300/1251] eta: 0:15:10 lr: 0.000183 loss: 2.764457 (2.684229) time: 0.978350 data: 0.000168 max mem: 18817 Epoch: [244/300] [ 350/1251] eta: 0:14:24 lr: 0.000183 loss: 2.794261 (2.688345) time: 1.036121 data: 0.000186 max mem: 18817 Epoch: [244/300] [ 400/1251] eta: 0:13:34 lr: 0.000182 loss: 2.842648 (2.704718) time: 0.978879 data: 0.000184 max mem: 18817 Epoch: [244/300] [ 450/1251] eta: 0:12:43 lr: 0.000182 loss: 2.775425 (2.706059) time: 0.903339 data: 0.000186 max mem: 18817 Epoch: [244/300] [ 500/1251] eta: 0:11:56 lr: 0.000182 loss: 2.767433 (2.706454) time: 0.922958 data: 0.000194 max mem: 18817 Epoch: [244/300] [ 550/1251] eta: 0:11:08 lr: 0.000182 loss: 2.569297 (2.703887) time: 0.957912 data: 0.000177 max mem: 18817 Epoch: [244/300] [ 600/1251] eta: 0:10:20 lr: 0.000182 loss: 2.844211 (2.710246) time: 0.989593 data: 0.000171 max mem: 18817 Epoch: [244/300] [ 650/1251] eta: 0:09:31 lr: 0.000181 loss: 2.711789 (2.704242) time: 0.924833 data: 0.000174 max mem: 18817 Epoch: [244/300] [ 700/1251] eta: 0:08:44 lr: 0.000181 loss: 2.635279 (2.697665) time: 0.914990 data: 0.000175 max mem: 18817 Epoch: [244/300] [ 750/1251] eta: 0:07:57 lr: 0.000181 loss: 2.477285 (2.689422) time: 0.918342 data: 0.000164 max mem: 18817 Epoch: [244/300] [ 800/1251] eta: 0:07:10 lr: 0.000181 loss: 2.765073 (2.689735) time: 0.984010 data: 0.000173 max mem: 18817 Epoch: [244/300] [ 850/1251] eta: 0:06:22 lr: 0.000180 loss: 2.779280 (2.693305) time: 0.975504 data: 0.000204 max mem: 18817 Epoch: [244/300] [ 900/1251] eta: 0:05:34 lr: 0.000180 loss: 2.558366 (2.691682) time: 0.910770 data: 0.000173 max mem: 18817 Epoch: [244/300] [ 950/1251] eta: 0:04:46 lr: 0.000180 loss: 2.803520 (2.697197) time: 0.925083 data: 0.000184 max mem: 18817 Epoch: [244/300] [1000/1251] eta: 0:03:59 lr: 0.000180 loss: 2.751383 (2.697494) time: 0.922452 data: 0.000166 max mem: 18817 Epoch: [244/300] [1050/1251] eta: 0:03:11 lr: 0.000180 loss: 2.731448 (2.696710) time: 0.946602 data: 0.000187 max mem: 18817 Epoch: [244/300] [1100/1251] eta: 0:02:23 lr: 0.000179 loss: 2.563278 (2.693675) time: 0.970187 data: 0.000175 max mem: 18817 Epoch: [244/300] [1150/1251] eta: 0:01:36 lr: 0.000179 loss: 2.715508 (2.691049) time: 0.921481 data: 0.000172 max mem: 18817 Epoch: [244/300] [1200/1251] eta: 0:00:48 lr: 0.000179 loss: 2.596828 (2.690064) time: 0.918098 data: 0.000179 max mem: 18817 Epoch: [244/300] [1250/1251] eta: 0:00:00 lr: 0.000179 loss: 2.674720 (2.692246) time: 0.932950 data: 0.000767 max mem: 18817 Epoch: [244/300] Total time: 0:19:52 (0.953070 s / it) Averaged stats: lr: 0.000179 loss: 2.674720 (2.700992) Test: [ 0/49] eta: 0:01:18 loss: 0.451685 (0.451685) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.601317 data: 1.155092 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.601779 (0.656084) acc1: 84.375000 (84.659091) acc5: 96.875000 (96.448864) time: 0.474278 data: 0.105153 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.745003 (0.684519) acc1: 84.375000 (83.630952) acc5: 96.875000 (96.800595) time: 0.365023 data: 0.000140 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.677278 (0.691149) acc1: 81.250000 (83.165323) acc5: 96.875000 (96.875000) time: 0.362039 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.706642 (0.702684) acc1: 81.250000 (83.079268) acc5: 96.875000 (96.798780) time: 0.351021 data: 0.000123 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.716058 (0.698705) acc1: 82.812500 (83.328000) acc5: 96.875000 (96.864000) time: 0.345802 data: 0.000100 max mem: 18817 Test: Total time: 0:00:18 (0.383541 s / it) * Acc@1 83.554 Acc@5 96.684 loss 0.716 Max accuracy: 83.57% Epoch: [245/300] [ 0/1251] eta: 0:42:26 lr: 0.000179 loss: 2.512293 (2.512293) time: 2.035482 data: 1.151717 max mem: 18817 Epoch: [245/300] [ 50/1251] eta: 0:19:08 lr: 0.000178 loss: 2.966015 (2.785054) time: 0.965573 data: 0.000183 max mem: 18817 Epoch: [245/300] [ 100/1251] eta: 0:18:14 lr: 0.000178 loss: 2.933508 (2.770476) time: 0.925579 data: 0.000178 max mem: 18817 Epoch: [245/300] [ 150/1251] eta: 0:17:25 lr: 0.000178 loss: 2.750876 (2.744597) time: 0.904990 data: 0.000185 max mem: 18817 Epoch: [245/300] [ 200/1251] eta: 0:16:40 lr: 0.000178 loss: 2.798435 (2.732399) time: 0.945173 data: 0.000195 max mem: 18817 Epoch: [245/300] [ 250/1251] eta: 0:15:51 lr: 0.000177 loss: 2.752489 (2.719285) time: 0.991094 data: 0.000175 max mem: 18817 Epoch: [245/300] [ 300/1251] eta: 0:15:02 lr: 0.000177 loss: 2.546437 (2.726946) time: 0.972268 data: 0.000172 max mem: 18817 Epoch: [245/300] [ 350/1251] eta: 0:14:12 lr: 0.000177 loss: 2.625280 (2.703445) time: 0.910834 data: 0.000165 max mem: 18817 Epoch: [245/300] [ 400/1251] eta: 0:13:26 lr: 0.000177 loss: 2.885001 (2.707138) time: 0.912154 data: 0.000165 max mem: 18817 Epoch: [245/300] [ 450/1251] eta: 0:12:40 lr: 0.000177 loss: 2.889747 (2.707837) time: 0.974392 data: 0.000190 max mem: 18817 Epoch: [245/300] [ 500/1251] eta: 0:11:54 lr: 0.000176 loss: 2.316617 (2.686743) time: 1.030992 data: 0.000173 max mem: 18817 Epoch: [245/300] [ 550/1251] eta: 0:11:06 lr: 0.000176 loss: 2.607744 (2.682027) time: 0.968596 data: 0.000170 max mem: 18817 Epoch: [245/300] [ 600/1251] eta: 0:10:17 lr: 0.000176 loss: 2.689626 (2.681879) time: 0.929420 data: 0.000202 max mem: 18817 Epoch: [245/300] [ 650/1251] eta: 0:09:30 lr: 0.000176 loss: 2.849114 (2.681022) time: 0.916639 data: 0.000165 max mem: 18817 Epoch: [245/300] [ 700/1251] eta: 0:08:44 lr: 0.000175 loss: 2.740247 (2.682319) time: 0.984240 data: 0.000170 max mem: 18817 Epoch: [245/300] [ 750/1251] eta: 0:07:56 lr: 0.000175 loss: 2.905747 (2.684098) time: 1.034850 data: 0.000178 max mem: 18817 Epoch: [245/300] [ 800/1251] eta: 0:07:08 lr: 0.000175 loss: 2.852355 (2.683320) time: 0.964060 data: 0.000176 max mem: 18817 Epoch: [245/300] [ 850/1251] eta: 0:06:20 lr: 0.000175 loss: 2.671575 (2.674105) time: 0.919334 data: 0.000166 max mem: 18817 Epoch: [245/300] [ 900/1251] eta: 0:05:33 lr: 0.000175 loss: 2.887602 (2.673990) time: 0.924412 data: 0.000172 max mem: 18817 Epoch: [245/300] [ 950/1251] eta: 0:04:46 lr: 0.000174 loss: 2.759025 (2.678851) time: 0.954483 data: 0.000178 max mem: 18817 Epoch: [245/300] [1000/1251] eta: 0:03:58 lr: 0.000174 loss: 2.687548 (2.681997) time: 1.005397 data: 0.000175 max mem: 18817 Epoch: [245/300] [1050/1251] eta: 0:03:11 lr: 0.000174 loss: 2.688319 (2.680238) time: 0.968671 data: 0.000170 max mem: 18817 Epoch: [245/300] [1100/1251] eta: 0:02:23 lr: 0.000174 loss: 2.684010 (2.681799) time: 0.913147 data: 0.000179 max mem: 18817 Epoch: [245/300] [1150/1251] eta: 0:01:35 lr: 0.000173 loss: 2.888307 (2.685374) time: 0.912586 data: 0.000170 max mem: 18817 Epoch: [245/300] [1200/1251] eta: 0:00:48 lr: 0.000173 loss: 2.812785 (2.687629) time: 0.972134 data: 0.000179 max mem: 18817 Epoch: [245/300] [1250/1251] eta: 0:00:00 lr: 0.000173 loss: 2.880600 (2.689564) time: 0.980693 data: 0.000776 max mem: 18817 Epoch: [245/300] Total time: 0:19:50 (0.951334 s / it) Averaged stats: lr: 0.000173 loss: 2.880600 (2.684491) Test: [ 0/49] eta: 0:01:29 loss: 0.484917 (0.484917) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.820719 data: 1.397500 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.522711 (0.640856) acc1: 85.937500 (85.795455) acc5: 98.437500 (96.590909) time: 0.490725 data: 0.127176 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.715940 (0.687115) acc1: 82.812500 (83.928571) acc5: 96.875000 (96.800595) time: 0.355660 data: 0.000139 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.702983 (0.692103) acc1: 81.250000 (83.417339) acc5: 96.875000 (96.875000) time: 0.353014 data: 0.000145 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.696661 (0.707888) acc1: 81.250000 (83.155488) acc5: 96.875000 (96.760671) time: 0.361411 data: 0.000155 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.706410 (0.700948) acc1: 82.812500 (83.264000) acc5: 96.875000 (96.896000) time: 0.369661 data: 0.000128 max mem: 18817 Test: Total time: 0:00:19 (0.393588 s / it) * Acc@1 83.450 Acc@5 96.688 loss 0.711 Max accuracy: 83.57% Epoch: [246/300] [ 0/1251] eta: 0:40:42 lr: 0.000173 loss: 2.747372 (2.747372) time: 1.952462 data: 1.076205 max mem: 18817 Epoch: [246/300] [ 50/1251] eta: 0:19:38 lr: 0.000173 loss: 2.815603 (2.723643) time: 0.989400 data: 0.000172 max mem: 18817 Epoch: [246/300] [ 100/1251] eta: 0:18:27 lr: 0.000173 loss: 2.730978 (2.706309) time: 0.994602 data: 0.000170 max mem: 18817 Epoch: [246/300] [ 150/1251] eta: 0:17:39 lr: 0.000172 loss: 2.603714 (2.709841) time: 0.982688 data: 0.000178 max mem: 18817 Epoch: [246/300] [ 200/1251] eta: 0:16:44 lr: 0.000172 loss: 2.816506 (2.700544) time: 0.914847 data: 0.000173 max mem: 18817 Epoch: [246/300] [ 250/1251] eta: 0:15:56 lr: 0.000172 loss: 2.822064 (2.692523) time: 0.906839 data: 0.000186 max mem: 18817 Epoch: [246/300] [ 300/1251] eta: 0:15:11 lr: 0.000172 loss: 2.728451 (2.692489) time: 0.974483 data: 0.000167 max mem: 18817 Epoch: [246/300] [ 350/1251] eta: 0:14:19 lr: 0.000171 loss: 2.676733 (2.687408) time: 0.962568 data: 0.000170 max mem: 18817 Epoch: [246/300] [ 400/1251] eta: 0:13:33 lr: 0.000171 loss: 2.708957 (2.682938) time: 0.964148 data: 0.000177 max mem: 18817 Epoch: [246/300] [ 450/1251] eta: 0:12:43 lr: 0.000171 loss: 2.648666 (2.673822) time: 0.916078 data: 0.000195 max mem: 18817 Epoch: [246/300] [ 500/1251] eta: 0:11:56 lr: 0.000171 loss: 2.766256 (2.676522) time: 0.970396 data: 0.000162 max mem: 18817 Epoch: [246/300] [ 550/1251] eta: 0:11:07 lr: 0.000171 loss: 2.923920 (2.685044) time: 0.951301 data: 0.000174 max mem: 18817 Epoch: [246/300] [ 600/1251] eta: 0:10:19 lr: 0.000170 loss: 2.852260 (2.683684) time: 0.918807 data: 0.000174 max mem: 18817 Epoch: [246/300] [ 650/1251] eta: 0:09:31 lr: 0.000170 loss: 2.893661 (2.681312) time: 0.920178 data: 0.000170 max mem: 18817 Epoch: [246/300] [ 700/1251] eta: 0:08:44 lr: 0.000170 loss: 2.414135 (2.682099) time: 0.970458 data: 0.000168 max mem: 18817 Epoch: [246/300] [ 750/1251] eta: 0:07:57 lr: 0.000170 loss: 2.842247 (2.689779) time: 1.053949 data: 0.000169 max mem: 18817 Epoch: [246/300] [ 800/1251] eta: 0:07:09 lr: 0.000170 loss: 2.809239 (2.688311) time: 0.976193 data: 0.000169 max mem: 18817 Epoch: [246/300] [ 850/1251] eta: 0:06:21 lr: 0.000169 loss: 2.557565 (2.683558) time: 0.932799 data: 0.000174 max mem: 18817 Epoch: [246/300] [ 900/1251] eta: 0:05:34 lr: 0.000169 loss: 2.770602 (2.686896) time: 0.938117 data: 0.000181 max mem: 18817 Epoch: [246/300] [ 950/1251] eta: 0:04:46 lr: 0.000169 loss: 2.783994 (2.686384) time: 0.956407 data: 0.000178 max mem: 18817 Epoch: [246/300] [1000/1251] eta: 0:03:59 lr: 0.000169 loss: 2.880338 (2.689350) time: 1.038503 data: 0.000165 max mem: 18817 Epoch: [246/300] [1050/1251] eta: 0:03:11 lr: 0.000168 loss: 2.730278 (2.686646) time: 0.910176 data: 0.000178 max mem: 18817 Epoch: [246/300] [1100/1251] eta: 0:02:23 lr: 0.000168 loss: 2.537289 (2.687811) time: 0.960712 data: 0.000188 max mem: 18817 Epoch: [246/300] [1150/1251] eta: 0:01:36 lr: 0.000168 loss: 2.695707 (2.692961) time: 0.981662 data: 0.000161 max mem: 18817 Epoch: [246/300] [1200/1251] eta: 0:00:48 lr: 0.000168 loss: 2.937772 (2.694024) time: 0.976726 data: 0.000174 max mem: 18817 Epoch: [246/300] [1250/1251] eta: 0:00:00 lr: 0.000168 loss: 2.837804 (2.692227) time: 0.906994 data: 0.000768 max mem: 18817 Epoch: [246/300] Total time: 0:19:49 (0.950808 s / it) Averaged stats: lr: 0.000168 loss: 2.837804 (2.689820) Test: [ 0/49] eta: 0:01:25 loss: 0.519123 (0.519123) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.744869 data: 1.329455 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.559952 (0.665534) acc1: 84.375000 (84.232955) acc5: 96.875000 (96.590909) time: 0.487278 data: 0.120999 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.717260 (0.689950) acc1: 82.812500 (83.854167) acc5: 96.875000 (96.800595) time: 0.359171 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.697346 (0.695870) acc1: 82.812500 (83.316532) acc5: 96.875000 (97.026210) time: 0.353697 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.716260 (0.708672) acc1: 82.812500 (83.155488) acc5: 96.875000 (96.913110) time: 0.348808 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.716260 (0.704084) acc1: 82.812500 (83.296000) acc5: 96.875000 (96.960000) time: 0.428571 data: 0.000101 max mem: 18817 Test: Total time: 0:00:20 (0.417929 s / it) * Acc@1 83.506 Acc@5 96.658 loss 0.715 Max accuracy: 83.57% Epoch: [247/300] [ 0/1251] eta: 0:43:42 lr: 0.000168 loss: 3.187575 (3.187575) time: 2.096098 data: 1.232630 max mem: 18817 Epoch: [247/300] [ 50/1251] eta: 0:19:17 lr: 0.000167 loss: 2.786112 (2.736682) time: 0.968146 data: 0.000163 max mem: 18817 Epoch: [247/300] [ 100/1251] eta: 0:18:26 lr: 0.000167 loss: 2.583465 (2.690941) time: 0.958928 data: 0.000160 max mem: 18817 Epoch: [247/300] [ 150/1251] eta: 0:17:27 lr: 0.000167 loss: 2.823870 (2.689152) time: 0.967991 data: 0.000177 max mem: 18817 Epoch: [247/300] [ 200/1251] eta: 0:16:39 lr: 0.000167 loss: 2.738715 (2.700017) time: 0.923655 data: 0.000174 max mem: 18817 Epoch: [247/300] [ 250/1251] eta: 0:15:54 lr: 0.000166 loss: 2.826721 (2.719492) time: 0.918166 data: 0.000179 max mem: 18817 Epoch: [247/300] [ 300/1251] eta: 0:15:09 lr: 0.000166 loss: 2.640191 (2.703792) time: 0.978640 data: 0.000164 max mem: 18817 Epoch: [247/300] [ 350/1251] eta: 0:14:20 lr: 0.000166 loss: 2.873688 (2.704289) time: 0.995422 data: 0.000167 max mem: 18817 Epoch: [247/300] [ 400/1251] eta: 0:13:31 lr: 0.000166 loss: 2.692861 (2.703158) time: 0.961672 data: 0.000176 max mem: 18817 Epoch: [247/300] [ 450/1251] eta: 0:12:42 lr: 0.000166 loss: 2.659644 (2.693283) time: 0.924196 data: 0.000174 max mem: 18817 Epoch: [247/300] [ 500/1251] eta: 0:11:55 lr: 0.000165 loss: 2.605880 (2.698728) time: 0.909280 data: 0.000162 max mem: 18817 Epoch: [247/300] [ 550/1251] eta: 0:11:08 lr: 0.000165 loss: 2.616030 (2.694357) time: 0.998340 data: 0.000177 max mem: 18817 Epoch: [247/300] [ 600/1251] eta: 0:10:21 lr: 0.000165 loss: 2.620418 (2.685158) time: 1.040064 data: 0.000192 max mem: 18817 Epoch: [247/300] [ 650/1251] eta: 0:09:33 lr: 0.000165 loss: 2.768591 (2.689116) time: 0.992017 data: 0.000161 max mem: 18817 Epoch: [247/300] [ 700/1251] eta: 0:08:45 lr: 0.000164 loss: 2.757179 (2.693303) time: 0.911384 data: 0.000169 max mem: 18817 Epoch: [247/300] [ 750/1251] eta: 0:07:57 lr: 0.000164 loss: 2.824570 (2.692970) time: 0.926090 data: 0.000166 max mem: 18817 Epoch: [247/300] [ 800/1251] eta: 0:07:10 lr: 0.000164 loss: 2.854345 (2.687174) time: 0.985090 data: 0.000172 max mem: 18817 Epoch: [247/300] [ 850/1251] eta: 0:06:22 lr: 0.000164 loss: 2.798059 (2.688821) time: 0.999634 data: 0.000176 max mem: 18817 Epoch: [247/300] [ 900/1251] eta: 0:05:34 lr: 0.000164 loss: 2.811269 (2.691129) time: 0.976954 data: 0.000174 max mem: 18817 Epoch: [247/300] [ 950/1251] eta: 0:04:46 lr: 0.000163 loss: 2.811506 (2.694150) time: 0.929747 data: 0.000170 max mem: 18817 Epoch: [247/300] [1000/1251] eta: 0:03:59 lr: 0.000163 loss: 2.542418 (2.693085) time: 0.916029 data: 0.000165 max mem: 18817 Epoch: [247/300] [1050/1251] eta: 0:03:11 lr: 0.000163 loss: 2.790033 (2.692506) time: 0.971911 data: 0.000174 max mem: 18817 Epoch: [247/300] [1100/1251] eta: 0:02:23 lr: 0.000163 loss: 2.835074 (2.694342) time: 0.973978 data: 0.000171 max mem: 18817 Epoch: [247/300] [1150/1251] eta: 0:01:36 lr: 0.000163 loss: 2.832757 (2.695368) time: 0.986517 data: 0.000175 max mem: 18817 Epoch: [247/300] [1200/1251] eta: 0:00:48 lr: 0.000162 loss: 2.742127 (2.690162) time: 0.929518 data: 0.000173 max mem: 18817 Epoch: [247/300] [1250/1251] eta: 0:00:00 lr: 0.000162 loss: 2.702172 (2.690234) time: 0.913131 data: 0.000763 max mem: 18817 Epoch: [247/300] Total time: 0:19:53 (0.954127 s / it) Averaged stats: lr: 0.000162 loss: 2.702172 (2.694366) Test: [ 0/49] eta: 0:01:27 loss: 0.471003 (0.471003) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.785970 data: 1.390863 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.600219 (0.650415) acc1: 84.375000 (84.659091) acc5: 96.875000 (96.875000) time: 0.516702 data: 0.126590 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.708593 (0.685751) acc1: 82.812500 (83.630952) acc5: 96.875000 (96.651786) time: 0.377574 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.687868 (0.684328) acc1: 82.812500 (83.215726) acc5: 96.875000 (96.875000) time: 0.358382 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.697300 (0.696308) acc1: 82.812500 (83.269817) acc5: 96.875000 (96.798780) time: 0.352969 data: 0.000144 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.708204 (0.691560) acc1: 82.812500 (83.328000) acc5: 96.875000 (96.832000) time: 0.352703 data: 0.000122 max mem: 18817 Test: Total time: 0:00:19 (0.394835 s / it) * Acc@1 83.472 Acc@5 96.658 loss 0.710 Max accuracy: 83.57% Epoch: [248/300] [ 0/1251] eta: 0:41:47 lr: 0.000162 loss: 2.664836 (2.664836) time: 2.003998 data: 1.131961 max mem: 18817 Epoch: [248/300] [ 50/1251] eta: 0:19:15 lr: 0.000162 loss: 2.722618 (2.692218) time: 0.976463 data: 0.000195 max mem: 18817 Epoch: [248/300] [ 100/1251] eta: 0:18:14 lr: 0.000162 loss: 2.895887 (2.702238) time: 0.923892 data: 0.000469 max mem: 18817 Epoch: [248/300] [ 150/1251] eta: 0:17:38 lr: 0.000161 loss: 2.657805 (2.742873) time: 0.936635 data: 0.000175 max mem: 18817 Epoch: [248/300] [ 200/1251] eta: 0:16:52 lr: 0.000161 loss: 2.940489 (2.735952) time: 1.001657 data: 0.000173 max mem: 18817 Epoch: [248/300] [ 250/1251] eta: 0:16:02 lr: 0.000161 loss: 2.638089 (2.717861) time: 0.980868 data: 0.000159 max mem: 18817 Epoch: [248/300] [ 300/1251] eta: 0:15:09 lr: 0.000161 loss: 2.612207 (2.703363) time: 0.948875 data: 0.000186 max mem: 18817 Epoch: [248/300] [ 350/1251] eta: 0:14:19 lr: 0.000161 loss: 2.585416 (2.699644) time: 0.913855 data: 0.000171 max mem: 18817 Epoch: [248/300] [ 400/1251] eta: 0:13:32 lr: 0.000160 loss: 2.848575 (2.698659) time: 0.923764 data: 0.000172 max mem: 18817 Epoch: [248/300] [ 450/1251] eta: 0:12:43 lr: 0.000160 loss: 2.616000 (2.702092) time: 0.919444 data: 0.000162 max mem: 18817 Epoch: [248/300] [ 500/1251] eta: 0:11:55 lr: 0.000160 loss: 2.635129 (2.696736) time: 0.975830 data: 0.000168 max mem: 18817 Epoch: [248/300] [ 550/1251] eta: 0:11:08 lr: 0.000160 loss: 2.888098 (2.697555) time: 1.010072 data: 0.000150 max mem: 18817 Epoch: [248/300] [ 600/1251] eta: 0:10:19 lr: 0.000160 loss: 2.805432 (2.704158) time: 0.955167 data: 0.000159 max mem: 18817 Epoch: [248/300] [ 650/1251] eta: 0:09:31 lr: 0.000159 loss: 2.692284 (2.703638) time: 0.910576 data: 0.000171 max mem: 18817 Epoch: [248/300] [ 700/1251] eta: 0:08:43 lr: 0.000159 loss: 2.424085 (2.695534) time: 0.919766 data: 0.000189 max mem: 18817 Epoch: [248/300] [ 750/1251] eta: 0:07:56 lr: 0.000159 loss: 2.536960 (2.691523) time: 0.957679 data: 0.000175 max mem: 18817 Epoch: [248/300] [ 800/1251] eta: 0:07:09 lr: 0.000159 loss: 2.773160 (2.684541) time: 1.045947 data: 0.000171 max mem: 18817 Epoch: [248/300] [ 850/1251] eta: 0:06:21 lr: 0.000159 loss: 2.703079 (2.681275) time: 0.969418 data: 0.000184 max mem: 18817 Epoch: [248/300] [ 900/1251] eta: 0:05:33 lr: 0.000158 loss: 2.690522 (2.681526) time: 0.922795 data: 0.000179 max mem: 18817 Epoch: [248/300] [ 950/1251] eta: 0:04:46 lr: 0.000158 loss: 2.750890 (2.680793) time: 0.923441 data: 0.000181 max mem: 18817 Epoch: [248/300] [1000/1251] eta: 0:03:58 lr: 0.000158 loss: 2.705610 (2.678960) time: 0.977184 data: 0.000164 max mem: 18817 Epoch: [248/300] [1050/1251] eta: 0:03:11 lr: 0.000158 loss: 2.665857 (2.679875) time: 0.961547 data: 0.000183 max mem: 18817 Epoch: [248/300] [1100/1251] eta: 0:02:23 lr: 0.000157 loss: 2.784372 (2.684348) time: 0.968401 data: 0.000175 max mem: 18817 Epoch: [248/300] [1150/1251] eta: 0:01:35 lr: 0.000157 loss: 2.847618 (2.687581) time: 0.911293 data: 0.000167 max mem: 18817 Epoch: [248/300] [1200/1251] eta: 0:00:48 lr: 0.000157 loss: 2.845808 (2.690103) time: 0.900915 data: 0.000184 max mem: 18817 Epoch: [248/300] [1250/1251] eta: 0:00:00 lr: 0.000157 loss: 2.677958 (2.684319) time: 0.984667 data: 0.000767 max mem: 18817 Epoch: [248/300] Total time: 0:19:49 (0.951107 s / it) Averaged stats: lr: 0.000157 loss: 2.677958 (2.686850) Test: [ 0/49] eta: 0:01:22 loss: 0.450037 (0.450037) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.677147 data: 1.204491 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.601346 (0.641697) acc1: 84.375000 (84.659091) acc5: 96.875000 (96.306818) time: 0.509225 data: 0.109653 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.709089 (0.677530) acc1: 82.812500 (83.928571) acc5: 96.875000 (96.726190) time: 0.372561 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.683622 (0.684106) acc1: 81.250000 (83.266129) acc5: 96.875000 (96.875000) time: 0.351772 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.677654 (0.695372) acc1: 82.812500 (83.269817) acc5: 96.875000 (96.875000) time: 0.357959 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.680557 (0.690527) acc1: 84.375000 (83.552000) acc5: 96.875000 (96.896000) time: 0.353336 data: 0.000106 max mem: 18817 Test: Total time: 0:00:19 (0.390337 s / it) * Acc@1 83.484 Acc@5 96.698 loss 0.705 Max accuracy: 83.57% Epoch: [249/300] [ 0/1251] eta: 0:39:55 lr: 0.000157 loss: 1.847401 (1.847401) time: 1.914806 data: 1.058416 max mem: 18817 Epoch: [249/300] [ 50/1251] eta: 0:19:09 lr: 0.000157 loss: 2.762768 (2.714201) time: 0.963033 data: 0.000164 max mem: 18817 Epoch: [249/300] [ 100/1251] eta: 0:18:06 lr: 0.000156 loss: 2.377870 (2.626844) time: 0.909919 data: 0.000184 max mem: 18817 Epoch: [249/300] [ 150/1251] eta: 0:17:22 lr: 0.000156 loss: 2.677527 (2.624250) time: 0.913777 data: 0.000174 max mem: 18817 Epoch: [249/300] [ 200/1251] eta: 0:16:40 lr: 0.000156 loss: 2.747707 (2.629778) time: 0.985334 data: 0.000178 max mem: 18817 Epoch: [249/300] [ 250/1251] eta: 0:15:50 lr: 0.000156 loss: 2.779994 (2.649909) time: 0.963826 data: 0.000175 max mem: 18817 Epoch: [249/300] [ 300/1251] eta: 0:15:02 lr: 0.000156 loss: 2.784840 (2.639850) time: 0.921500 data: 0.000186 max mem: 18817 Epoch: [249/300] [ 350/1251] eta: 0:14:17 lr: 0.000155 loss: 2.744529 (2.651886) time: 0.908296 data: 0.000156 max mem: 18817 Epoch: [249/300] [ 400/1251] eta: 0:13:29 lr: 0.000155 loss: 2.763571 (2.677118) time: 0.943689 data: 0.000182 max mem: 18817 Epoch: [249/300] [ 450/1251] eta: 0:12:41 lr: 0.000155 loss: 2.601146 (2.674032) time: 0.901191 data: 0.000173 max mem: 18817 Epoch: [249/300] [ 500/1251] eta: 0:11:54 lr: 0.000155 loss: 2.853818 (2.678871) time: 0.967469 data: 0.000169 max mem: 18817 Epoch: [249/300] [ 550/1251] eta: 0:11:06 lr: 0.000155 loss: 2.798922 (2.685851) time: 0.961279 data: 0.000167 max mem: 18817 Epoch: [249/300] [ 600/1251] eta: 0:10:19 lr: 0.000154 loss: 2.577155 (2.675202) time: 0.993727 data: 0.000184 max mem: 18817 Epoch: [249/300] [ 650/1251] eta: 0:09:31 lr: 0.000154 loss: 2.854513 (2.678492) time: 0.930753 data: 0.000185 max mem: 18817 Epoch: [249/300] [ 700/1251] eta: 0:08:44 lr: 0.000154 loss: 2.912278 (2.687552) time: 0.907492 data: 0.000173 max mem: 18817 Epoch: [249/300] [ 750/1251] eta: 0:07:57 lr: 0.000154 loss: 2.369011 (2.678899) time: 0.979991 data: 0.000178 max mem: 18817 Epoch: [249/300] [ 800/1251] eta: 0:07:09 lr: 0.000153 loss: 2.866059 (2.680283) time: 0.976002 data: 0.000169 max mem: 18817 Epoch: [249/300] [ 850/1251] eta: 0:06:22 lr: 0.000153 loss: 2.906649 (2.684569) time: 0.964887 data: 0.000171 max mem: 18817 Epoch: [249/300] [ 900/1251] eta: 0:05:34 lr: 0.000153 loss: 2.701441 (2.692218) time: 0.903749 data: 0.000169 max mem: 18817 Epoch: [249/300] [ 950/1251] eta: 0:04:46 lr: 0.000153 loss: 2.613564 (2.691483) time: 0.915215 data: 0.000176 max mem: 18817 Epoch: [249/300] [1000/1251] eta: 0:03:58 lr: 0.000153 loss: 2.869014 (2.690929) time: 0.902424 data: 0.000159 max mem: 18817 Epoch: [249/300] [1050/1251] eta: 0:03:11 lr: 0.000152 loss: 2.654555 (2.691472) time: 0.966020 data: 0.000172 max mem: 18817 Epoch: [249/300] [1100/1251] eta: 0:02:23 lr: 0.000152 loss: 2.610385 (2.686643) time: 0.960777 data: 0.000178 max mem: 18817 Epoch: [249/300] [1150/1251] eta: 0:01:35 lr: 0.000152 loss: 2.669498 (2.687568) time: 0.920528 data: 0.000176 max mem: 18817 Epoch: [249/300] [1200/1251] eta: 0:00:48 lr: 0.000152 loss: 2.609283 (2.682453) time: 0.920986 data: 0.000175 max mem: 18817 Epoch: [249/300] [1250/1251] eta: 0:00:00 lr: 0.000152 loss: 2.615234 (2.679381) time: 0.955916 data: 0.000770 max mem: 18817 Epoch: [249/300] Total time: 0:19:49 (0.950991 s / it) Averaged stats: lr: 0.000152 loss: 2.615234 (2.676504) Test: [ 0/49] eta: 0:01:14 loss: 0.515538 (0.515538) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.530065 data: 1.089504 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.549616 (0.659689) acc1: 82.812500 (84.232955) acc5: 98.437500 (96.448864) time: 0.479148 data: 0.099200 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.683876 (0.685352) acc1: 82.812500 (83.705357) acc5: 96.875000 (96.577381) time: 0.369997 data: 0.000148 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.683876 (0.691382) acc1: 82.812500 (83.114919) acc5: 96.875000 (96.723790) time: 0.358664 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.686787 (0.702659) acc1: 82.812500 (83.193598) acc5: 96.875000 (96.646341) time: 0.353534 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.715113 (0.696498) acc1: 82.812500 (83.360000) acc5: 96.875000 (96.832000) time: 0.348475 data: 0.000104 max mem: 18817 Test: Total time: 0:00:18 (0.384118 s / it) * Acc@1 83.672 Acc@5 96.704 loss 0.708 Max accuracy: 83.67% Epoch: [250/300] [ 0/1251] eta: 0:41:17 lr: 0.000152 loss: 2.640375 (2.640375) time: 1.980170 data: 1.124164 max mem: 18817 Epoch: [250/300] [ 50/1251] eta: 0:19:22 lr: 0.000151 loss: 2.921062 (2.614368) time: 0.978053 data: 0.000174 max mem: 18817 Epoch: [250/300] [ 100/1251] eta: 0:18:17 lr: 0.000151 loss: 2.656311 (2.647009) time: 0.916815 data: 0.000172 max mem: 18817 Epoch: [250/300] [ 150/1251] eta: 0:17:31 lr: 0.000151 loss: 2.850915 (2.690963) time: 0.909394 data: 0.000175 max mem: 18817 Epoch: [250/300] [ 200/1251] eta: 0:16:41 lr: 0.000151 loss: 2.522359 (2.664812) time: 0.966132 data: 0.000169 max mem: 18817 Epoch: [250/300] [ 250/1251] eta: 0:15:55 lr: 0.000151 loss: 2.866116 (2.687884) time: 1.025647 data: 0.000179 max mem: 18817 Epoch: [250/300] [ 300/1251] eta: 0:15:04 lr: 0.000150 loss: 2.796130 (2.681564) time: 0.952096 data: 0.000166 max mem: 18817 Epoch: [250/300] [ 350/1251] eta: 0:14:16 lr: 0.000150 loss: 2.938765 (2.699952) time: 0.923927 data: 0.000163 max mem: 18817 Epoch: [250/300] [ 400/1251] eta: 0:13:27 lr: 0.000150 loss: 2.785902 (2.698887) time: 0.930882 data: 0.000187 max mem: 18817 Epoch: [250/300] [ 450/1251] eta: 0:12:41 lr: 0.000150 loss: 2.632135 (2.682851) time: 0.916604 data: 0.000167 max mem: 18817 Epoch: [250/300] [ 500/1251] eta: 0:11:54 lr: 0.000150 loss: 2.611105 (2.674521) time: 0.991717 data: 0.000182 max mem: 18817 Epoch: [250/300] [ 550/1251] eta: 0:11:07 lr: 0.000149 loss: 2.718050 (2.670386) time: 0.960972 data: 0.000176 max mem: 18817 Epoch: [250/300] [ 600/1251] eta: 0:10:19 lr: 0.000149 loss: 2.694088 (2.667954) time: 0.970662 data: 0.000200 max mem: 18817 Epoch: [250/300] [ 650/1251] eta: 0:09:31 lr: 0.000149 loss: 2.764857 (2.673003) time: 0.920302 data: 0.000165 max mem: 18817 Epoch: [250/300] [ 700/1251] eta: 0:08:44 lr: 0.000149 loss: 2.730835 (2.674342) time: 0.925804 data: 0.000181 max mem: 18817 Epoch: [250/300] [ 750/1251] eta: 0:07:57 lr: 0.000149 loss: 2.781695 (2.676686) time: 0.970558 data: 0.000171 max mem: 18817 Epoch: [250/300] [ 800/1251] eta: 0:07:10 lr: 0.000148 loss: 2.762408 (2.671704) time: 0.991337 data: 0.000176 max mem: 18817 Epoch: [250/300] [ 850/1251] eta: 0:06:22 lr: 0.000148 loss: 2.922811 (2.680771) time: 0.973932 data: 0.000182 max mem: 18817 Epoch: [250/300] [ 900/1251] eta: 0:05:34 lr: 0.000148 loss: 2.560407 (2.682780) time: 0.919785 data: 0.000190 max mem: 18817 Epoch: [250/300] [ 950/1251] eta: 0:04:46 lr: 0.000148 loss: 2.934595 (2.684939) time: 0.923886 data: 0.000183 max mem: 18817 Epoch: [250/300] [1000/1251] eta: 0:03:59 lr: 0.000148 loss: 2.657208 (2.682207) time: 0.987058 data: 0.000172 max mem: 18817 Epoch: [250/300] [1050/1251] eta: 0:03:11 lr: 0.000147 loss: 2.649143 (2.680523) time: 1.041000 data: 0.000178 max mem: 18817 Epoch: [250/300] [1100/1251] eta: 0:02:24 lr: 0.000147 loss: 2.800293 (2.681005) time: 0.981956 data: 0.000170 max mem: 18817 Epoch: [250/300] [1150/1251] eta: 0:01:36 lr: 0.000147 loss: 2.797434 (2.681986) time: 0.933927 data: 0.000179 max mem: 18817 Epoch: [250/300] [1200/1251] eta: 0:00:48 lr: 0.000147 loss: 2.492513 (2.679886) time: 0.903772 data: 0.000179 max mem: 18817 Epoch: [250/300] [1250/1251] eta: 0:00:00 lr: 0.000146 loss: 2.772131 (2.680649) time: 0.979385 data: 0.000782 max mem: 18817 Epoch: [250/300] Total time: 0:19:53 (0.954374 s / it) Averaged stats: lr: 0.000146 loss: 2.772131 (2.682107) Test: [ 0/49] eta: 0:01:28 loss: 0.438459 (0.438459) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.799840 data: 1.407712 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.609190 (0.650403) acc1: 84.375000 (84.801136) acc5: 98.437500 (96.448864) time: 0.489956 data: 0.128134 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.725665 (0.687611) acc1: 82.812500 (83.630952) acc5: 96.875000 (96.502976) time: 0.355744 data: 0.000151 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.703444 (0.695573) acc1: 82.812500 (83.316532) acc5: 96.875000 (96.622984) time: 0.358807 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.703444 (0.703688) acc1: 82.812500 (83.307927) acc5: 96.875000 (96.608232) time: 0.361903 data: 0.000144 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.703444 (0.699613) acc1: 82.812500 (83.360000) acc5: 96.875000 (96.800000) time: 0.363051 data: 0.000127 max mem: 18817 Test: Total time: 0:00:19 (0.393198 s / it) * Acc@1 83.620 Acc@5 96.694 loss 0.714 Max accuracy: 83.67% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0250.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0250.pth Epoch: [251/300] [ 0/1251] eta: 0:42:48 lr: 0.000146 loss: 3.058863 (3.058863) time: 2.053537 data: 1.178044 max mem: 18817 Epoch: [251/300] [ 50/1251] eta: 0:19:37 lr: 0.000146 loss: 2.710245 (2.779154) time: 0.945827 data: 0.000166 max mem: 18817 Epoch: [251/300] [ 100/1251] eta: 0:18:36 lr: 0.000146 loss: 2.801961 (2.720594) time: 0.982371 data: 0.000169 max mem: 18817 Epoch: [251/300] [ 150/1251] eta: 0:17:38 lr: 0.000146 loss: 2.608263 (2.649494) time: 0.983795 data: 0.000167 max mem: 18817 Epoch: [251/300] [ 200/1251] eta: 0:16:45 lr: 0.000146 loss: 2.638270 (2.653235) time: 0.917020 data: 0.000180 max mem: 18817 Epoch: [251/300] [ 250/1251] eta: 0:15:55 lr: 0.000145 loss: 2.566941 (2.646501) time: 0.903353 data: 0.000172 max mem: 18817 Epoch: [251/300] [ 300/1251] eta: 0:15:07 lr: 0.000145 loss: 2.608363 (2.639827) time: 0.969637 data: 0.000177 max mem: 18817 Epoch: [251/300] [ 350/1251] eta: 0:14:19 lr: 0.000145 loss: 2.821083 (2.651831) time: 1.022222 data: 0.000170 max mem: 18817 Epoch: [251/300] [ 400/1251] eta: 0:13:30 lr: 0.000145 loss: 2.671298 (2.643493) time: 0.960057 data: 0.000174 max mem: 18817 Epoch: [251/300] [ 450/1251] eta: 0:12:41 lr: 0.000145 loss: 2.828508 (2.645051) time: 0.908557 data: 0.000173 max mem: 18817 Epoch: [251/300] [ 500/1251] eta: 0:11:54 lr: 0.000144 loss: 2.803873 (2.652288) time: 0.916922 data: 0.000181 max mem: 18817 Epoch: [251/300] [ 550/1251] eta: 0:11:07 lr: 0.000144 loss: 2.698988 (2.650903) time: 0.955461 data: 0.000184 max mem: 18817 Epoch: [251/300] [ 600/1251] eta: 0:10:19 lr: 0.000144 loss: 2.686185 (2.656512) time: 0.974917 data: 0.000187 max mem: 18817 Epoch: [251/300] [ 650/1251] eta: 0:09:32 lr: 0.000144 loss: 2.690873 (2.658578) time: 0.954344 data: 0.000188 max mem: 18817 Epoch: [251/300] [ 700/1251] eta: 0:08:44 lr: 0.000144 loss: 2.657163 (2.647926) time: 0.915145 data: 0.000162 max mem: 18817 Epoch: [251/300] [ 750/1251] eta: 0:07:57 lr: 0.000143 loss: 2.747168 (2.644352) time: 0.930482 data: 0.000592 max mem: 18817 Epoch: [251/300] [ 800/1251] eta: 0:07:09 lr: 0.000143 loss: 2.680473 (2.642222) time: 0.964133 data: 0.000173 max mem: 18817 Epoch: [251/300] [ 850/1251] eta: 0:06:21 lr: 0.000143 loss: 2.443272 (2.639602) time: 0.966414 data: 0.000172 max mem: 18817 Epoch: [251/300] [ 900/1251] eta: 0:05:34 lr: 0.000143 loss: 2.797427 (2.639365) time: 0.958359 data: 0.000176 max mem: 18817 Epoch: [251/300] [ 950/1251] eta: 0:04:46 lr: 0.000143 loss: 2.760363 (2.638078) time: 0.916855 data: 0.000174 max mem: 18817 Epoch: [251/300] [1000/1251] eta: 0:03:58 lr: 0.000142 loss: 2.648640 (2.640993) time: 0.962741 data: 0.000163 max mem: 18817 Epoch: [251/300] [1050/1251] eta: 0:03:11 lr: 0.000142 loss: 2.818521 (2.647451) time: 0.980073 data: 0.000184 max mem: 18817 Epoch: [251/300] [1100/1251] eta: 0:02:23 lr: 0.000142 loss: 2.508459 (2.644105) time: 0.992735 data: 0.000195 max mem: 18817 Epoch: [251/300] [1150/1251] eta: 0:01:36 lr: 0.000142 loss: 2.822056 (2.641130) time: 0.926319 data: 0.000184 max mem: 18817 Epoch: [251/300] [1200/1251] eta: 0:00:48 lr: 0.000142 loss: 2.803019 (2.644219) time: 0.907769 data: 0.000188 max mem: 18817 Epoch: [251/300] [1250/1251] eta: 0:00:00 lr: 0.000141 loss: 2.690452 (2.643145) time: 0.957103 data: 0.000776 max mem: 18817 Epoch: [251/300] Total time: 0:19:51 (0.952503 s / it) Averaged stats: lr: 0.000141 loss: 2.690452 (2.642905) Test: [ 0/49] eta: 0:01:15 loss: 0.476235 (0.476235) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.538649 data: 1.110000 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.659402 (0.661469) acc1: 84.375000 (84.801136) acc5: 96.875000 (96.306818) time: 0.464583 data: 0.101063 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.701351 (0.690522) acc1: 81.250000 (83.556548) acc5: 96.875000 (96.577381) time: 0.364216 data: 0.000147 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.702649 (0.695742) acc1: 81.250000 (83.316532) acc5: 96.875000 (96.572581) time: 0.374777 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.702920 (0.701100) acc1: 82.812500 (83.422256) acc5: 96.875000 (96.570122) time: 0.363018 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.702920 (0.698040) acc1: 84.375000 (83.456000) acc5: 96.875000 (96.640000) time: 0.344652 data: 0.000106 max mem: 18817 Test: Total time: 0:00:18 (0.386450 s / it) * Acc@1 83.592 Acc@5 96.738 loss 0.710 Max accuracy: 83.67% Epoch: [252/300] [ 0/1251] eta: 0:57:39 lr: 0.000141 loss: 3.115961 (3.115961) time: 2.765371 data: 1.216376 max mem: 18817 Epoch: [252/300] [ 50/1251] eta: 0:19:45 lr: 0.000141 loss: 2.805920 (2.740846) time: 0.973900 data: 0.000157 max mem: 18817 Epoch: [252/300] [ 100/1251] eta: 0:18:30 lr: 0.000141 loss: 2.686666 (2.705616) time: 0.923073 data: 0.000171 max mem: 18817 Epoch: [252/300] [ 150/1251] eta: 0:17:37 lr: 0.000141 loss: 2.786049 (2.701919) time: 0.907446 data: 0.000170 max mem: 18817 Epoch: [252/300] [ 200/1251] eta: 0:16:52 lr: 0.000141 loss: 2.783256 (2.681417) time: 0.983525 data: 0.000175 max mem: 18817 Epoch: [252/300] [ 250/1251] eta: 0:15:59 lr: 0.000140 loss: 2.797905 (2.674909) time: 0.979318 data: 0.000188 max mem: 18817 Epoch: [252/300] [ 300/1251] eta: 0:15:09 lr: 0.000140 loss: 2.739958 (2.689616) time: 0.914258 data: 0.000175 max mem: 18817 Epoch: [252/300] [ 350/1251] eta: 0:14:21 lr: 0.000140 loss: 2.728591 (2.666806) time: 0.931235 data: 0.000176 max mem: 18817 Epoch: [252/300] [ 400/1251] eta: 0:13:34 lr: 0.000140 loss: 2.810992 (2.661569) time: 0.963591 data: 0.000173 max mem: 18817 Epoch: [252/300] [ 450/1251] eta: 0:12:45 lr: 0.000140 loss: 2.886469 (2.670660) time: 0.970558 data: 0.000155 max mem: 18817 Epoch: [252/300] [ 500/1251] eta: 0:11:55 lr: 0.000139 loss: 2.584128 (2.667824) time: 0.923437 data: 0.000198 max mem: 18817 Epoch: [252/300] [ 550/1251] eta: 0:11:08 lr: 0.000139 loss: 2.719511 (2.662916) time: 0.916050 data: 0.000185 max mem: 18817 Epoch: [252/300] [ 600/1251] eta: 0:10:21 lr: 0.000139 loss: 2.991495 (2.659150) time: 0.975858 data: 0.000177 max mem: 18817 Epoch: [252/300] [ 650/1251] eta: 0:09:33 lr: 0.000139 loss: 2.640124 (2.652490) time: 0.982683 data: 0.000171 max mem: 18817 Epoch: [252/300] [ 700/1251] eta: 0:08:46 lr: 0.000139 loss: 2.710351 (2.653036) time: 0.988906 data: 0.000191 max mem: 18817 Epoch: [252/300] [ 750/1251] eta: 0:07:58 lr: 0.000138 loss: 2.600058 (2.648782) time: 0.974980 data: 0.000167 max mem: 18817 Epoch: [252/300] [ 800/1251] eta: 0:07:10 lr: 0.000138 loss: 2.843912 (2.657635) time: 0.967574 data: 0.000178 max mem: 18817 Epoch: [252/300] [ 850/1251] eta: 0:06:22 lr: 0.000138 loss: 2.782778 (2.659076) time: 0.912726 data: 0.000191 max mem: 18817 Epoch: [252/300] [ 900/1251] eta: 0:05:34 lr: 0.000138 loss: 2.793760 (2.655960) time: 0.911569 data: 0.000173 max mem: 18817 Epoch: [252/300] [ 950/1251] eta: 0:04:46 lr: 0.000138 loss: 2.472835 (2.647005) time: 0.909234 data: 0.000188 max mem: 18817 Epoch: [252/300] [1000/1251] eta: 0:03:59 lr: 0.000138 loss: 2.658426 (2.643324) time: 1.010525 data: 0.000166 max mem: 18817 Epoch: [252/300] [1050/1251] eta: 0:03:11 lr: 0.000137 loss: 2.714456 (2.642903) time: 0.976676 data: 0.000169 max mem: 18817 Epoch: [252/300] [1100/1251] eta: 0:02:23 lr: 0.000137 loss: 2.801307 (2.648923) time: 0.910824 data: 0.000168 max mem: 18817 Epoch: [252/300] [1150/1251] eta: 0:01:36 lr: 0.000137 loss: 2.772965 (2.651608) time: 0.932549 data: 0.000192 max mem: 18817 Epoch: [252/300] [1200/1251] eta: 0:00:48 lr: 0.000137 loss: 2.772689 (2.646610) time: 0.994214 data: 0.000165 max mem: 18817 Epoch: [252/300] [1250/1251] eta: 0:00:00 lr: 0.000137 loss: 2.752439 (2.645521) time: 1.046824 data: 0.000786 max mem: 18817 Epoch: [252/300] Total time: 0:19:53 (0.953678 s / it) Averaged stats: lr: 0.000137 loss: 2.752439 (2.648671) Test: [ 0/49] eta: 0:01:15 loss: 0.505571 (0.505571) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.544069 data: 1.130527 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.578893 (0.632588) acc1: 85.937500 (84.801136) acc5: 98.437500 (96.590909) time: 0.476707 data: 0.102936 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.681531 (0.667540) acc1: 84.375000 (84.151786) acc5: 96.875000 (96.651786) time: 0.379108 data: 0.000158 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.682918 (0.673288) acc1: 82.812500 (83.770161) acc5: 96.875000 (96.925403) time: 0.370101 data: 0.000146 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.671273 (0.687431) acc1: 82.812500 (83.650915) acc5: 96.875000 (96.836890) time: 0.348845 data: 0.000156 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.671273 (0.684721) acc1: 82.812500 (83.808000) acc5: 96.875000 (96.896000) time: 0.344270 data: 0.000126 max mem: 18817 Test: Total time: 0:00:18 (0.387361 s / it) * Acc@1 83.664 Acc@5 96.714 loss 0.708 Max accuracy: 83.67% Epoch: [253/300] [ 0/1251] eta: 0:40:19 lr: 0.000137 loss: 2.499423 (2.499423) time: 1.934109 data: 1.071622 max mem: 18817 Epoch: [253/300] [ 50/1251] eta: 0:19:12 lr: 0.000136 loss: 2.476386 (2.691219) time: 0.927735 data: 0.000167 max mem: 18817 Epoch: [253/300] [ 100/1251] eta: 0:18:29 lr: 0.000136 loss: 2.663362 (2.677925) time: 0.937273 data: 0.000179 max mem: 18817 Epoch: [253/300] [ 150/1251] eta: 0:17:42 lr: 0.000136 loss: 2.846484 (2.678338) time: 0.982425 data: 0.000175 max mem: 18817 Epoch: [253/300] [ 200/1251] eta: 0:16:52 lr: 0.000136 loss: 2.472667 (2.668216) time: 1.026930 data: 0.000163 max mem: 18817 Epoch: [253/300] [ 250/1251] eta: 0:15:58 lr: 0.000136 loss: 2.546823 (2.658378) time: 0.962067 data: 0.000168 max mem: 18817 Epoch: [253/300] [ 300/1251] eta: 0:15:09 lr: 0.000135 loss: 2.747617 (2.666787) time: 0.927056 data: 0.000165 max mem: 18817 Epoch: [253/300] [ 350/1251] eta: 0:14:24 lr: 0.000135 loss: 2.520790 (2.670209) time: 0.926406 data: 0.000170 max mem: 18817 Epoch: [253/300] [ 400/1251] eta: 0:13:37 lr: 0.000135 loss: 2.904902 (2.682341) time: 0.991499 data: 0.000186 max mem: 18817 Epoch: [253/300] [ 450/1251] eta: 0:12:48 lr: 0.000135 loss: 2.822500 (2.682659) time: 1.032130 data: 0.000195 max mem: 18817 Epoch: [253/300] [ 500/1251] eta: 0:11:59 lr: 0.000135 loss: 2.574063 (2.677239) time: 0.966441 data: 0.000196 max mem: 18817 Epoch: [253/300] [ 550/1251] eta: 0:11:10 lr: 0.000134 loss: 2.766423 (2.679663) time: 0.924373 data: 0.000162 max mem: 18817 Epoch: [253/300] [ 600/1251] eta: 0:10:22 lr: 0.000134 loss: 2.686756 (2.681430) time: 0.918285 data: 0.000179 max mem: 18817 Epoch: [253/300] [ 650/1251] eta: 0:09:34 lr: 0.000134 loss: 2.744939 (2.673169) time: 0.973476 data: 0.000163 max mem: 18817 Epoch: [253/300] [ 700/1251] eta: 0:08:47 lr: 0.000134 loss: 2.684707 (2.667077) time: 1.038713 data: 0.000168 max mem: 18817 Epoch: [253/300] [ 750/1251] eta: 0:07:58 lr: 0.000134 loss: 2.818185 (2.667620) time: 0.960326 data: 0.000164 max mem: 18817 Epoch: [253/300] [ 800/1251] eta: 0:07:10 lr: 0.000133 loss: 2.567751 (2.660190) time: 0.920831 data: 0.000175 max mem: 18817 Epoch: [253/300] [ 850/1251] eta: 0:06:23 lr: 0.000133 loss: 2.604059 (2.659936) time: 0.927478 data: 0.000180 max mem: 18817 Epoch: [253/300] [ 900/1251] eta: 0:05:35 lr: 0.000133 loss: 2.778347 (2.660735) time: 0.973908 data: 0.000165 max mem: 18817 Epoch: [253/300] [ 950/1251] eta: 0:04:47 lr: 0.000133 loss: 2.823149 (2.665190) time: 1.009634 data: 0.000165 max mem: 18817 Epoch: [253/300] [1000/1251] eta: 0:03:59 lr: 0.000133 loss: 2.648051 (2.664272) time: 0.983516 data: 0.000182 max mem: 18817 Epoch: [253/300] [1050/1251] eta: 0:03:12 lr: 0.000132 loss: 2.519019 (2.660500) time: 0.923387 data: 0.000185 max mem: 18817 Epoch: [253/300] [1100/1251] eta: 0:02:24 lr: 0.000132 loss: 2.707614 (2.658196) time: 0.917618 data: 0.000185 max mem: 18817 Epoch: [253/300] [1150/1251] eta: 0:01:36 lr: 0.000132 loss: 2.816587 (2.657469) time: 0.970206 data: 0.000170 max mem: 18817 Epoch: [253/300] [1200/1251] eta: 0:00:48 lr: 0.000132 loss: 2.864774 (2.658409) time: 0.973458 data: 0.000175 max mem: 18817 Epoch: [253/300] [1250/1251] eta: 0:00:00 lr: 0.000132 loss: 2.692166 (2.657459) time: 0.922631 data: 0.000810 max mem: 18817 Epoch: [253/300] Total time: 0:19:54 (0.954890 s / it) Averaged stats: lr: 0.000132 loss: 2.692166 (2.652113) Test: [ 0/49] eta: 0:01:19 loss: 0.484015 (0.484015) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.630265 data: 1.190275 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.600699 (0.662983) acc1: 84.375000 (84.659091) acc5: 98.437500 (96.306818) time: 0.475429 data: 0.108347 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.703930 (0.693919) acc1: 82.812500 (83.928571) acc5: 96.875000 (96.428571) time: 0.356034 data: 0.000140 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.682364 (0.694618) acc1: 82.812500 (83.820565) acc5: 96.875000 (96.673387) time: 0.353208 data: 0.000124 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.697906 (0.706565) acc1: 82.812500 (83.765244) acc5: 96.875000 (96.570122) time: 0.350444 data: 0.000117 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.694721 (0.703794) acc1: 84.375000 (83.904000) acc5: 96.875000 (96.608000) time: 0.344099 data: 0.000099 max mem: 18817 Test: Total time: 0:00:18 (0.379748 s / it) * Acc@1 83.692 Acc@5 96.684 loss 0.710 Max accuracy: 83.69% Epoch: [254/300] [ 0/1251] eta: 0:42:29 lr: 0.000132 loss: 3.103435 (3.103435) time: 2.038204 data: 1.153169 max mem: 18817 Epoch: [254/300] [ 50/1251] eta: 0:19:42 lr: 0.000132 loss: 2.721789 (2.787816) time: 0.913024 data: 0.000180 max mem: 18817 Epoch: [254/300] [ 100/1251] eta: 0:18:36 lr: 0.000131 loss: 2.868216 (2.773285) time: 0.960406 data: 0.000194 max mem: 18817 Epoch: [254/300] [ 150/1251] eta: 0:17:39 lr: 0.000131 loss: 2.566898 (2.740390) time: 0.980389 data: 0.000196 max mem: 18817 Epoch: [254/300] [ 200/1251] eta: 0:16:46 lr: 0.000131 loss: 2.707107 (2.700849) time: 0.924612 data: 0.000182 max mem: 18817 Epoch: [254/300] [ 250/1251] eta: 0:16:01 lr: 0.000131 loss: 2.447400 (2.679863) time: 0.922041 data: 0.000171 max mem: 18817 Epoch: [254/300] [ 300/1251] eta: 0:15:13 lr: 0.000131 loss: 2.741586 (2.665558) time: 0.954990 data: 0.000196 max mem: 18817 Epoch: [254/300] [ 350/1251] eta: 0:14:26 lr: 0.000130 loss: 2.754573 (2.669968) time: 0.981237 data: 0.000162 max mem: 18817 Epoch: [254/300] [ 400/1251] eta: 0:13:34 lr: 0.000130 loss: 2.789520 (2.669483) time: 0.953466 data: 0.000168 max mem: 18817 Epoch: [254/300] [ 450/1251] eta: 0:12:45 lr: 0.000130 loss: 2.702252 (2.665743) time: 0.920246 data: 0.000184 max mem: 18817 Epoch: [254/300] [ 500/1251] eta: 0:11:57 lr: 0.000130 loss: 2.653835 (2.662090) time: 0.922378 data: 0.000169 max mem: 18817 Epoch: [254/300] [ 550/1251] eta: 0:11:09 lr: 0.000130 loss: 2.897199 (2.664160) time: 0.969674 data: 0.000181 max mem: 18817 Epoch: [254/300] [ 600/1251] eta: 0:10:22 lr: 0.000129 loss: 2.658164 (2.658674) time: 1.024671 data: 0.000172 max mem: 18817 Epoch: [254/300] [ 650/1251] eta: 0:09:33 lr: 0.000129 loss: 2.720985 (2.660048) time: 0.964068 data: 0.000173 max mem: 18817 Epoch: [254/300] [ 700/1251] eta: 0:08:45 lr: 0.000129 loss: 2.707818 (2.660217) time: 0.923742 data: 0.000163 max mem: 18817 Epoch: [254/300] [ 750/1251] eta: 0:07:57 lr: 0.000129 loss: 2.762742 (2.650331) time: 0.926985 data: 0.000190 max mem: 18817 Epoch: [254/300] [ 800/1251] eta: 0:07:10 lr: 0.000129 loss: 2.657251 (2.652629) time: 0.964631 data: 0.000179 max mem: 18817 Epoch: [254/300] [ 850/1251] eta: 0:06:22 lr: 0.000128 loss: 2.561388 (2.650176) time: 1.029534 data: 0.000169 max mem: 18817 Epoch: [254/300] [ 900/1251] eta: 0:05:34 lr: 0.000128 loss: 2.549708 (2.647145) time: 0.970258 data: 0.000162 max mem: 18817 Epoch: [254/300] [ 950/1251] eta: 0:04:46 lr: 0.000128 loss: 2.601800 (2.640474) time: 0.914311 data: 0.000178 max mem: 18817 Epoch: [254/300] [1000/1251] eta: 0:03:59 lr: 0.000128 loss: 2.731707 (2.640849) time: 0.927157 data: 0.000157 max mem: 18817 Epoch: [254/300] [1050/1251] eta: 0:03:11 lr: 0.000128 loss: 2.545311 (2.645897) time: 0.968475 data: 0.000183 max mem: 18817 Epoch: [254/300] [1100/1251] eta: 0:02:24 lr: 0.000128 loss: 2.681050 (2.641360) time: 1.062921 data: 0.000190 max mem: 18817 Epoch: [254/300] [1150/1251] eta: 0:01:36 lr: 0.000127 loss: 2.635121 (2.640589) time: 0.941534 data: 0.000182 max mem: 18817 Epoch: [254/300] [1200/1251] eta: 0:00:48 lr: 0.000127 loss: 2.673147 (2.637629) time: 0.918245 data: 0.000200 max mem: 18817 Epoch: [254/300] [1250/1251] eta: 0:00:00 lr: 0.000127 loss: 2.734723 (2.638689) time: 0.924698 data: 0.000765 max mem: 18817 Epoch: [254/300] Total time: 0:19:54 (0.955064 s / it) Averaged stats: lr: 0.000127 loss: 2.734723 (2.643011) Test: [ 0/49] eta: 0:01:27 loss: 0.462494 (0.462494) acc1: 89.062500 (89.062500) acc5: 100.000000 (100.000000) time: 1.786062 data: 1.411786 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.583336 (0.669166) acc1: 84.375000 (83.948864) acc5: 95.312500 (96.590909) time: 0.488019 data: 0.128474 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.711260 (0.690791) acc1: 81.250000 (83.258929) acc5: 96.875000 (96.726190) time: 0.354556 data: 0.000126 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.683631 (0.692411) acc1: 81.250000 (83.165323) acc5: 96.875000 (96.975806) time: 0.350798 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.692779 (0.705365) acc1: 82.812500 (83.269817) acc5: 96.875000 (96.836890) time: 0.348764 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.710138 (0.702193) acc1: 82.812500 (83.424000) acc5: 96.875000 (96.992000) time: 0.343715 data: 0.000102 max mem: 18817 Test: Total time: 0:00:18 (0.381544 s / it) * Acc@1 83.626 Acc@5 96.724 loss 0.716 Max accuracy: 83.69% Epoch: [255/300] [ 0/1251] eta: 0:42:04 lr: 0.000127 loss: 2.098825 (2.098825) time: 2.017696 data: 1.141137 max mem: 18817 Epoch: [255/300] [ 50/1251] eta: 0:19:12 lr: 0.000127 loss: 2.802625 (2.634047) time: 0.968280 data: 0.000170 max mem: 18817 Epoch: [255/300] [ 100/1251] eta: 0:18:11 lr: 0.000127 loss: 2.826458 (2.696959) time: 0.905567 data: 0.000179 max mem: 18817 Epoch: [255/300] [ 150/1251] eta: 0:17:31 lr: 0.000126 loss: 2.786624 (2.688409) time: 0.926459 data: 0.000182 max mem: 18817 Epoch: [255/300] [ 200/1251] eta: 0:16:39 lr: 0.000126 loss: 2.730807 (2.661969) time: 0.917102 data: 0.000172 max mem: 18817 Epoch: [255/300] [ 250/1251] eta: 0:15:51 lr: 0.000126 loss: 2.824573 (2.672975) time: 0.943007 data: 0.000203 max mem: 18817 Epoch: [255/300] [ 300/1251] eta: 0:15:03 lr: 0.000126 loss: 2.644580 (2.654756) time: 0.949795 data: 0.000171 max mem: 18817 Epoch: [255/300] [ 350/1251] eta: 0:14:14 lr: 0.000126 loss: 2.676162 (2.635613) time: 0.968033 data: 0.000162 max mem: 18817 Epoch: [255/300] [ 400/1251] eta: 0:13:25 lr: 0.000125 loss: 2.805309 (2.629218) time: 0.913560 data: 0.000181 max mem: 18817 Epoch: [255/300] [ 450/1251] eta: 0:12:38 lr: 0.000125 loss: 2.578851 (2.628588) time: 0.916787 data: 0.000189 max mem: 18817 Epoch: [255/300] [ 500/1251] eta: 0:11:52 lr: 0.000125 loss: 2.823810 (2.636358) time: 0.964711 data: 0.000177 max mem: 18817 Epoch: [255/300] [ 550/1251] eta: 0:11:05 lr: 0.000125 loss: 2.554698 (2.641296) time: 1.001355 data: 0.000177 max mem: 18817 Epoch: [255/300] [ 600/1251] eta: 0:10:18 lr: 0.000125 loss: 2.690349 (2.641077) time: 0.962387 data: 0.000172 max mem: 18817 Epoch: [255/300] [ 650/1251] eta: 0:09:30 lr: 0.000125 loss: 2.775765 (2.637178) time: 0.913864 data: 0.000172 max mem: 18817 Epoch: [255/300] [ 700/1251] eta: 0:08:42 lr: 0.000124 loss: 2.605159 (2.638239) time: 0.912568 data: 0.000186 max mem: 18817 Epoch: [255/300] [ 750/1251] eta: 0:07:55 lr: 0.000124 loss: 2.660126 (2.639016) time: 0.933638 data: 0.000172 max mem: 18817 Epoch: [255/300] [ 800/1251] eta: 0:07:08 lr: 0.000124 loss: 2.633504 (2.641646) time: 0.958028 data: 0.000174 max mem: 18817 Epoch: [255/300] [ 850/1251] eta: 0:06:21 lr: 0.000124 loss: 2.588408 (2.634995) time: 0.985622 data: 0.000186 max mem: 18817 Epoch: [255/300] [ 900/1251] eta: 0:05:33 lr: 0.000124 loss: 2.920843 (2.637022) time: 0.972926 data: 0.000172 max mem: 18817 Epoch: [255/300] [ 950/1251] eta: 0:04:45 lr: 0.000123 loss: 2.617238 (2.637480) time: 0.917375 data: 0.000183 max mem: 18817 Epoch: [255/300] [1000/1251] eta: 0:03:58 lr: 0.000123 loss: 2.612791 (2.634098) time: 0.919050 data: 0.000166 max mem: 18817 Epoch: [255/300] [1050/1251] eta: 0:03:10 lr: 0.000123 loss: 2.760062 (2.631367) time: 0.971786 data: 0.000178 max mem: 18817 Epoch: [255/300] [1100/1251] eta: 0:02:23 lr: 0.000123 loss: 2.482049 (2.626466) time: 1.014801 data: 0.000183 max mem: 18817 Epoch: [255/300] [1150/1251] eta: 0:01:35 lr: 0.000123 loss: 2.858820 (2.628682) time: 0.966645 data: 0.000167 max mem: 18817 Epoch: [255/300] [1200/1251] eta: 0:00:48 lr: 0.000122 loss: 2.626710 (2.627905) time: 0.913486 data: 0.000181 max mem: 18817 Epoch: [255/300] [1250/1251] eta: 0:00:00 lr: 0.000122 loss: 2.740508 (2.632431) time: 0.928114 data: 0.000755 max mem: 18817 Epoch: [255/300] Total time: 0:19:48 (0.950042 s / it) Averaged stats: lr: 0.000122 loss: 2.740508 (2.632292) Test: [ 0/49] eta: 0:01:16 loss: 0.501521 (0.501521) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.560467 data: 1.109582 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.579588 (0.672096) acc1: 84.375000 (84.659091) acc5: 98.437500 (96.732955) time: 0.468526 data: 0.101018 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.727841 (0.705121) acc1: 82.812500 (83.705357) acc5: 96.875000 (96.577381) time: 0.364584 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.711607 (0.703544) acc1: 81.250000 (83.770161) acc5: 96.875000 (96.774194) time: 0.360901 data: 0.000135 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.714402 (0.714384) acc1: 82.812500 (83.498476) acc5: 96.875000 (96.875000) time: 0.349623 data: 0.000149 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.730435 (0.710590) acc1: 81.250000 (83.552000) acc5: 96.875000 (96.960000) time: 0.344608 data: 0.000123 max mem: 18817 Test: Total time: 0:00:18 (0.381345 s / it) * Acc@1 83.646 Acc@5 96.740 loss 0.726 Max accuracy: 83.69% Epoch: [256/300] [ 0/1251] eta: 0:44:22 lr: 0.000122 loss: 3.048639 (3.048639) time: 2.128205 data: 1.177222 max mem: 18817 Epoch: [256/300] [ 50/1251] eta: 0:19:30 lr: 0.000122 loss: 2.612976 (2.570435) time: 0.980821 data: 0.000176 max mem: 18817 Epoch: [256/300] [ 100/1251] eta: 0:18:27 lr: 0.000122 loss: 2.626035 (2.602027) time: 0.930166 data: 0.000175 max mem: 18817 Epoch: [256/300] [ 150/1251] eta: 0:17:37 lr: 0.000122 loss: 2.557188 (2.589481) time: 0.926905 data: 0.000165 max mem: 18817 Epoch: [256/300] [ 200/1251] eta: 0:16:48 lr: 0.000122 loss: 2.713378 (2.621671) time: 0.909481 data: 0.000190 max mem: 18817 Epoch: [256/300] [ 250/1251] eta: 0:16:01 lr: 0.000121 loss: 2.781833 (2.624460) time: 0.959091 data: 0.000173 max mem: 18817 Epoch: [256/300] [ 300/1251] eta: 0:15:10 lr: 0.000121 loss: 2.706693 (2.627419) time: 0.948211 data: 0.000186 max mem: 18817 Epoch: [256/300] [ 350/1251] eta: 0:14:19 lr: 0.000121 loss: 2.524987 (2.628880) time: 0.923158 data: 0.000179 max mem: 18817 Epoch: [256/300] [ 400/1251] eta: 0:13:32 lr: 0.000121 loss: 2.702519 (2.628675) time: 0.938268 data: 0.000178 max mem: 18817 Epoch: [256/300] [ 450/1251] eta: 0:12:45 lr: 0.000121 loss: 2.622994 (2.636377) time: 0.965588 data: 0.000179 max mem: 18817 Epoch: [256/300] [ 500/1251] eta: 0:11:58 lr: 0.000120 loss: 2.655771 (2.628374) time: 1.025683 data: 0.000186 max mem: 18817 Epoch: [256/300] [ 550/1251] eta: 0:11:09 lr: 0.000120 loss: 2.862061 (2.633452) time: 0.958070 data: 0.000179 max mem: 18817 Epoch: [256/300] [ 600/1251] eta: 0:10:20 lr: 0.000120 loss: 2.785153 (2.640416) time: 0.928017 data: 0.000186 max mem: 18817 Epoch: [256/300] [ 650/1251] eta: 0:09:33 lr: 0.000120 loss: 2.627247 (2.641908) time: 0.939200 data: 0.000176 max mem: 18817 Epoch: [256/300] [ 700/1251] eta: 0:08:46 lr: 0.000120 loss: 2.775856 (2.635077) time: 0.974099 data: 0.000184 max mem: 18817 Epoch: [256/300] [ 750/1251] eta: 0:07:58 lr: 0.000120 loss: 2.636030 (2.638758) time: 1.025485 data: 0.000175 max mem: 18817 Epoch: [256/300] [ 800/1251] eta: 0:07:10 lr: 0.000119 loss: 2.545678 (2.642438) time: 0.956372 data: 0.000181 max mem: 18817 Epoch: [256/300] [ 850/1251] eta: 0:06:22 lr: 0.000119 loss: 2.653035 (2.641754) time: 0.924087 data: 0.000174 max mem: 18817 Epoch: [256/300] [ 900/1251] eta: 0:05:34 lr: 0.000119 loss: 2.340520 (2.632131) time: 0.913311 data: 0.000172 max mem: 18817 Epoch: [256/300] [ 950/1251] eta: 0:04:47 lr: 0.000119 loss: 2.686007 (2.630747) time: 0.983443 data: 0.000176 max mem: 18817 Epoch: [256/300] [1000/1251] eta: 0:03:59 lr: 0.000119 loss: 2.673457 (2.628577) time: 1.015747 data: 0.000173 max mem: 18817 Epoch: [256/300] [1050/1251] eta: 0:03:11 lr: 0.000118 loss: 2.513930 (2.630356) time: 0.958165 data: 0.000176 max mem: 18817 Epoch: [256/300] [1100/1251] eta: 0:02:23 lr: 0.000118 loss: 2.756665 (2.631814) time: 0.921242 data: 0.000180 max mem: 18817 Epoch: [256/300] [1150/1251] eta: 0:01:36 lr: 0.000118 loss: 2.676671 (2.630660) time: 0.924184 data: 0.000186 max mem: 18817 Epoch: [256/300] [1200/1251] eta: 0:00:48 lr: 0.000118 loss: 2.897311 (2.636346) time: 0.976050 data: 0.000186 max mem: 18817 Epoch: [256/300] [1250/1251] eta: 0:00:00 lr: 0.000118 loss: 2.822656 (2.637658) time: 1.020269 data: 0.000774 max mem: 18817 Epoch: [256/300] Total time: 0:19:53 (0.954197 s / it) Averaged stats: lr: 0.000118 loss: 2.822656 (2.634861) Test: [ 0/49] eta: 0:01:21 loss: 0.474376 (0.474376) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.657999 data: 1.223987 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.589420 (0.660486) acc1: 84.375000 (83.948864) acc5: 98.437500 (97.017045) time: 0.477394 data: 0.111431 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.728146 (0.691278) acc1: 82.812500 (83.258929) acc5: 96.875000 (97.023810) time: 0.355173 data: 0.000158 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.686524 (0.690678) acc1: 81.250000 (83.114919) acc5: 96.875000 (97.127016) time: 0.353691 data: 0.000147 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.680371 (0.702995) acc1: 82.812500 (83.231707) acc5: 96.875000 (97.141768) time: 0.356969 data: 0.000154 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.700301 (0.701240) acc1: 82.812500 (83.392000) acc5: 96.875000 (97.184000) time: 0.351555 data: 0.000125 max mem: 18817 Test: Total time: 0:00:18 (0.382682 s / it) * Acc@1 83.672 Acc@5 96.764 loss 0.716 Max accuracy: 83.69% Epoch: [257/300] [ 0/1251] eta: 0:40:09 lr: 0.000118 loss: 2.730286 (2.730286) time: 1.926416 data: 1.057927 max mem: 18817 Epoch: [257/300] [ 50/1251] eta: 0:19:52 lr: 0.000118 loss: 2.703375 (2.710659) time: 0.951430 data: 0.000181 max mem: 18817 Epoch: [257/300] [ 100/1251] eta: 0:18:38 lr: 0.000117 loss: 2.630621 (2.684233) time: 1.015062 data: 0.000195 max mem: 18817 Epoch: [257/300] [ 150/1251] eta: 0:17:37 lr: 0.000117 loss: 2.641110 (2.655618) time: 0.968457 data: 0.000195 max mem: 18817 Epoch: [257/300] [ 200/1251] eta: 0:16:42 lr: 0.000117 loss: 2.727252 (2.668521) time: 0.917139 data: 0.000183 max mem: 18817 Epoch: [257/300] [ 250/1251] eta: 0:15:56 lr: 0.000117 loss: 2.664681 (2.654389) time: 0.926367 data: 0.000172 max mem: 18817 Epoch: [257/300] [ 300/1251] eta: 0:15:10 lr: 0.000117 loss: 2.713147 (2.646756) time: 0.974154 data: 0.000169 max mem: 18817 Epoch: [257/300] [ 350/1251] eta: 0:14:20 lr: 0.000117 loss: 2.719576 (2.643881) time: 0.973926 data: 0.000176 max mem: 18817 Epoch: [257/300] [ 400/1251] eta: 0:13:32 lr: 0.000116 loss: 2.804282 (2.645954) time: 0.974244 data: 0.000169 max mem: 18817 Epoch: [257/300] [ 450/1251] eta: 0:12:42 lr: 0.000116 loss: 2.589615 (2.640570) time: 0.918678 data: 0.000181 max mem: 18817 Epoch: [257/300] [ 500/1251] eta: 0:11:56 lr: 0.000116 loss: 2.818152 (2.647301) time: 0.918559 data: 0.000179 max mem: 18817 Epoch: [257/300] [ 550/1251] eta: 0:11:09 lr: 0.000116 loss: 2.797648 (2.648514) time: 0.969030 data: 0.000164 max mem: 18817 Epoch: [257/300] [ 600/1251] eta: 0:10:20 lr: 0.000116 loss: 2.771393 (2.649951) time: 0.960738 data: 0.000151 max mem: 18817 Epoch: [257/300] [ 650/1251] eta: 0:09:33 lr: 0.000115 loss: 2.788619 (2.652031) time: 0.988508 data: 0.000160 max mem: 18817 Epoch: [257/300] [ 700/1251] eta: 0:08:44 lr: 0.000115 loss: 2.900655 (2.652758) time: 0.909026 data: 0.000172 max mem: 18817 Epoch: [257/300] [ 750/1251] eta: 0:07:57 lr: 0.000115 loss: 2.946261 (2.652640) time: 0.905456 data: 0.000171 max mem: 18817 Epoch: [257/300] [ 800/1251] eta: 0:07:10 lr: 0.000115 loss: 2.758801 (2.658685) time: 0.982251 data: 0.000173 max mem: 18817 Epoch: [257/300] [ 850/1251] eta: 0:06:22 lr: 0.000115 loss: 2.616524 (2.652414) time: 0.970865 data: 0.000178 max mem: 18817 Epoch: [257/300] [ 900/1251] eta: 0:05:34 lr: 0.000115 loss: 2.440840 (2.649997) time: 0.907942 data: 0.000179 max mem: 18817 Epoch: [257/300] [ 950/1251] eta: 0:04:46 lr: 0.000114 loss: 2.726721 (2.651709) time: 0.934722 data: 0.000181 max mem: 18817 Epoch: [257/300] [1000/1251] eta: 0:03:59 lr: 0.000114 loss: 2.734135 (2.649996) time: 0.942103 data: 0.000170 max mem: 18817 Epoch: [257/300] [1050/1251] eta: 0:03:11 lr: 0.000114 loss: 2.481180 (2.647387) time: 0.979243 data: 0.000166 max mem: 18817 Epoch: [257/300] [1100/1251] eta: 0:02:24 lr: 0.000114 loss: 2.673544 (2.648719) time: 0.976483 data: 0.000169 max mem: 18817 Epoch: [257/300] [1150/1251] eta: 0:01:36 lr: 0.000114 loss: 2.492260 (2.642766) time: 0.916068 data: 0.000171 max mem: 18817 Epoch: [257/300] [1200/1251] eta: 0:00:48 lr: 0.000113 loss: 2.712430 (2.644682) time: 0.907388 data: 0.000175 max mem: 18817 Epoch: [257/300] [1250/1251] eta: 0:00:00 lr: 0.000113 loss: 2.400465 (2.643914) time: 1.009232 data: 0.000814 max mem: 18817 Epoch: [257/300] Total time: 0:19:54 (0.954614 s / it) Averaged stats: lr: 0.000113 loss: 2.400465 (2.640062) Test: [ 0/49] eta: 0:01:27 loss: 0.524431 (0.524431) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.794267 data: 1.386707 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.595158 (0.658210) acc1: 84.375000 (84.659091) acc5: 98.437500 (97.017045) time: 0.486894 data: 0.126217 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.713230 (0.689023) acc1: 84.375000 (83.854167) acc5: 96.875000 (96.800595) time: 0.362963 data: 0.000155 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.693176 (0.682209) acc1: 82.812500 (83.770161) acc5: 96.875000 (97.076613) time: 0.374462 data: 0.000143 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.671604 (0.697793) acc1: 82.812500 (83.612805) acc5: 96.875000 (96.989329) time: 0.362796 data: 0.000133 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.673015 (0.691866) acc1: 84.375000 (83.680000) acc5: 96.875000 (97.120000) time: 0.345151 data: 0.000105 max mem: 18817 Test: Total time: 0:00:19 (0.391217 s / it) * Acc@1 83.728 Acc@5 96.720 loss 0.710 Max accuracy: 83.73% Epoch: [258/300] [ 0/1251] eta: 0:56:56 lr: 0.000113 loss: 2.689866 (2.689866) time: 2.730673 data: 1.064121 max mem: 18817 Epoch: [258/300] [ 50/1251] eta: 0:19:17 lr: 0.000113 loss: 2.643966 (2.656444) time: 0.904590 data: 0.000163 max mem: 18817 Epoch: [258/300] [ 100/1251] eta: 0:18:19 lr: 0.000113 loss: 2.533164 (2.592401) time: 0.910299 data: 0.000173 max mem: 18817 Epoch: [258/300] [ 150/1251] eta: 0:17:32 lr: 0.000113 loss: 2.555923 (2.599501) time: 0.977718 data: 0.000171 max mem: 18817 Epoch: [258/300] [ 200/1251] eta: 0:16:42 lr: 0.000113 loss: 2.601311 (2.590511) time: 0.973945 data: 0.000176 max mem: 18817 Epoch: [258/300] [ 250/1251] eta: 0:15:52 lr: 0.000112 loss: 2.690013 (2.589998) time: 0.912044 data: 0.000175 max mem: 18817 Epoch: [258/300] [ 300/1251] eta: 0:15:05 lr: 0.000112 loss: 2.626189 (2.588676) time: 0.909959 data: 0.000165 max mem: 18817 Epoch: [258/300] [ 350/1251] eta: 0:14:18 lr: 0.000112 loss: 2.447796 (2.590483) time: 0.911584 data: 0.000165 max mem: 18817 Epoch: [258/300] [ 400/1251] eta: 0:13:31 lr: 0.000112 loss: 2.811716 (2.605142) time: 1.018777 data: 0.000187 max mem: 18817 Epoch: [258/300] [ 450/1251] eta: 0:12:42 lr: 0.000112 loss: 2.739220 (2.614887) time: 0.961389 data: 0.000175 max mem: 18817 Epoch: [258/300] [ 500/1251] eta: 0:11:54 lr: 0.000112 loss: 2.582641 (2.608124) time: 0.912374 data: 0.000185 max mem: 18817 Epoch: [258/300] [ 550/1251] eta: 0:11:07 lr: 0.000111 loss: 2.649645 (2.603827) time: 0.929313 data: 0.000187 max mem: 18817 Epoch: [258/300] [ 600/1251] eta: 0:10:19 lr: 0.000111 loss: 2.727363 (2.606031) time: 0.977563 data: 0.000191 max mem: 18817 Epoch: [258/300] [ 650/1251] eta: 0:09:32 lr: 0.000111 loss: 2.858183 (2.616965) time: 1.021821 data: 0.000169 max mem: 18817 Epoch: [258/300] [ 700/1251] eta: 0:08:44 lr: 0.000111 loss: 2.815312 (2.616185) time: 0.970149 data: 0.000179 max mem: 18817 Epoch: [258/300] [ 750/1251] eta: 0:07:56 lr: 0.000111 loss: 2.831196 (2.619850) time: 0.923118 data: 0.000174 max mem: 18817 Epoch: [258/300] [ 800/1251] eta: 0:07:08 lr: 0.000111 loss: 2.594031 (2.614335) time: 0.921311 data: 0.000164 max mem: 18817 Epoch: [258/300] [ 850/1251] eta: 0:06:21 lr: 0.000110 loss: 2.687958 (2.610455) time: 0.958189 data: 0.000167 max mem: 18817 Epoch: [258/300] [ 900/1251] eta: 0:05:34 lr: 0.000110 loss: 2.626697 (2.614202) time: 1.014542 data: 0.000197 max mem: 18817 Epoch: [258/300] [ 950/1251] eta: 0:04:46 lr: 0.000110 loss: 2.593100 (2.613825) time: 0.965005 data: 0.000177 max mem: 18817 Epoch: [258/300] [1000/1251] eta: 0:03:58 lr: 0.000110 loss: 2.818908 (2.611403) time: 0.921703 data: 0.000164 max mem: 18817 Epoch: [258/300] [1050/1251] eta: 0:03:11 lr: 0.000110 loss: 2.798956 (2.613312) time: 0.932168 data: 0.000185 max mem: 18817 Epoch: [258/300] [1100/1251] eta: 0:02:23 lr: 0.000109 loss: 2.689656 (2.612588) time: 0.951679 data: 0.000169 max mem: 18817 Epoch: [258/300] [1150/1251] eta: 0:01:36 lr: 0.000109 loss: 2.761028 (2.613914) time: 0.970195 data: 0.000179 max mem: 18817 Epoch: [258/300] [1200/1251] eta: 0:00:48 lr: 0.000109 loss: 2.646314 (2.612959) time: 0.937666 data: 0.000174 max mem: 18817 Epoch: [258/300] [1250/1251] eta: 0:00:00 lr: 0.000109 loss: 2.856102 (2.614725) time: 0.910368 data: 0.000762 max mem: 18817 Epoch: [258/300] Total time: 0:19:49 (0.950445 s / it) Averaged stats: lr: 0.000109 loss: 2.856102 (2.614488) Test: [ 0/49] eta: 0:01:18 loss: 0.519143 (0.519143) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.591928 data: 1.138033 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.559754 (0.656090) acc1: 85.937500 (85.085227) acc5: 98.437500 (96.590909) time: 0.473121 data: 0.103711 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.729366 (0.684040) acc1: 82.812500 (84.077381) acc5: 96.875000 (96.875000) time: 0.356758 data: 0.000238 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.701942 (0.677438) acc1: 82.812500 (83.770161) acc5: 98.437500 (97.227823) time: 0.352976 data: 0.000212 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.698361 (0.690379) acc1: 82.812500 (83.498476) acc5: 96.875000 (97.103659) time: 0.445573 data: 0.000238 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.699102 (0.687938) acc1: 82.812500 (83.584000) acc5: 96.875000 (97.120000) time: 0.440223 data: 0.000181 max mem: 18817 Test: Total time: 0:00:20 (0.418729 s / it) * Acc@1 83.766 Acc@5 96.788 loss 0.707 Max accuracy: 83.77% Epoch: [259/300] [ 0/1251] eta: 0:41:52 lr: 0.000109 loss: 2.494881 (2.494881) time: 2.008043 data: 1.139314 max mem: 18817 Epoch: [259/300] [ 50/1251] eta: 0:19:27 lr: 0.000109 loss: 2.523661 (2.560335) time: 1.011452 data: 0.000172 max mem: 18817 Epoch: [259/300] [ 100/1251] eta: 0:18:26 lr: 0.000109 loss: 2.661125 (2.588826) time: 0.976117 data: 0.000163 max mem: 18817 Epoch: [259/300] [ 150/1251] eta: 0:17:26 lr: 0.000108 loss: 2.706744 (2.623961) time: 0.915371 data: 0.000181 max mem: 18817 Epoch: [259/300] [ 200/1251] eta: 0:16:42 lr: 0.000108 loss: 2.600726 (2.620067) time: 0.920536 data: 0.000171 max mem: 18817 Epoch: [259/300] [ 250/1251] eta: 0:15:55 lr: 0.000108 loss: 2.697424 (2.616630) time: 0.966925 data: 0.000184 max mem: 18817 Epoch: [259/300] [ 300/1251] eta: 0:15:04 lr: 0.000108 loss: 2.478282 (2.610437) time: 0.962483 data: 0.000170 max mem: 18817 Epoch: [259/300] [ 350/1251] eta: 0:14:17 lr: 0.000108 loss: 2.694633 (2.632723) time: 0.973366 data: 0.000163 max mem: 18817 Epoch: [259/300] [ 400/1251] eta: 0:13:29 lr: 0.000108 loss: 2.655322 (2.635772) time: 0.917436 data: 0.000178 max mem: 18817 Epoch: [259/300] [ 450/1251] eta: 0:12:40 lr: 0.000107 loss: 2.671792 (2.641109) time: 0.925083 data: 0.000179 max mem: 18817 Epoch: [259/300] [ 500/1251] eta: 0:11:53 lr: 0.000107 loss: 2.524612 (2.640506) time: 0.969252 data: 0.000184 max mem: 18817 Epoch: [259/300] [ 550/1251] eta: 0:11:05 lr: 0.000107 loss: 2.838096 (2.645578) time: 0.964410 data: 0.000447 max mem: 18817 Epoch: [259/300] [ 600/1251] eta: 0:10:17 lr: 0.000107 loss: 2.435392 (2.643161) time: 0.921532 data: 0.000174 max mem: 18817 Epoch: [259/300] [ 650/1251] eta: 0:09:31 lr: 0.000107 loss: 2.620845 (2.639778) time: 0.925242 data: 0.000166 max mem: 18817 Epoch: [259/300] [ 700/1251] eta: 0:08:43 lr: 0.000107 loss: 2.758207 (2.639534) time: 0.968544 data: 0.000160 max mem: 18817 Epoch: [259/300] [ 750/1251] eta: 0:07:57 lr: 0.000106 loss: 2.539922 (2.634149) time: 0.967696 data: 0.000173 max mem: 18817 Epoch: [259/300] [ 800/1251] eta: 0:07:08 lr: 0.000106 loss: 2.680353 (2.638911) time: 0.960692 data: 0.000174 max mem: 18817 Epoch: [259/300] [ 850/1251] eta: 0:06:21 lr: 0.000106 loss: 2.820600 (2.640645) time: 0.917855 data: 0.000177 max mem: 18817 Epoch: [259/300] [ 900/1251] eta: 0:05:33 lr: 0.000106 loss: 2.771353 (2.642745) time: 0.901132 data: 0.000170 max mem: 18817 Epoch: [259/300] [ 950/1251] eta: 0:04:46 lr: 0.000106 loss: 2.653351 (2.639069) time: 0.981303 data: 0.000186 max mem: 18817 Epoch: [259/300] [1000/1251] eta: 0:03:58 lr: 0.000106 loss: 2.679447 (2.637712) time: 1.026186 data: 0.000177 max mem: 18817 Epoch: [259/300] [1050/1251] eta: 0:03:11 lr: 0.000105 loss: 2.534999 (2.637119) time: 0.964987 data: 0.000174 max mem: 18817 Epoch: [259/300] [1100/1251] eta: 0:02:23 lr: 0.000105 loss: 2.692705 (2.636075) time: 0.912663 data: 0.000173 max mem: 18817 Epoch: [259/300] [1150/1251] eta: 0:01:35 lr: 0.000105 loss: 2.700257 (2.638619) time: 0.923149 data: 0.000172 max mem: 18817 Epoch: [259/300] [1200/1251] eta: 0:00:48 lr: 0.000105 loss: 2.732907 (2.638337) time: 0.982220 data: 0.000165 max mem: 18817 Epoch: [259/300] [1250/1251] eta: 0:00:00 lr: 0.000105 loss: 2.842100 (2.640217) time: 1.001032 data: 0.000768 max mem: 18817 Epoch: [259/300] Total time: 0:19:50 (0.951266 s / it) Averaged stats: lr: 0.000105 loss: 2.842100 (2.643127) Test: [ 0/49] eta: 0:01:27 loss: 0.488740 (0.488740) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.795597 data: 1.402903 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.580585 (0.652644) acc1: 85.937500 (85.227273) acc5: 98.437500 (96.590909) time: 0.487910 data: 0.127669 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.719797 (0.682049) acc1: 82.812500 (83.928571) acc5: 96.875000 (96.875000) time: 0.355711 data: 0.000156 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.712595 (0.681411) acc1: 82.812500 (83.719758) acc5: 96.875000 (97.026210) time: 0.352932 data: 0.000159 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.680453 (0.696426) acc1: 82.812500 (83.612805) acc5: 96.875000 (96.989329) time: 0.349539 data: 0.000150 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.680453 (0.692661) acc1: 82.812500 (83.712000) acc5: 96.875000 (97.088000) time: 0.344280 data: 0.000127 max mem: 18817 Test: Total time: 0:00:18 (0.383127 s / it) * Acc@1 83.824 Acc@5 96.716 loss 0.712 Max accuracy: 83.82% Epoch: [260/300] [ 0/1251] eta: 0:39:53 lr: 0.000105 loss: 1.816187 (1.816187) time: 1.912982 data: 1.047241 max mem: 18817 Epoch: [260/300] [ 50/1251] eta: 0:19:35 lr: 0.000105 loss: 2.644982 (2.620394) time: 0.916701 data: 0.000180 max mem: 18817 Epoch: [260/300] [ 100/1251] eta: 0:18:30 lr: 0.000104 loss: 2.676558 (2.664812) time: 0.955995 data: 0.000180 max mem: 18817 Epoch: [260/300] [ 150/1251] eta: 0:17:33 lr: 0.000104 loss: 2.399304 (2.643123) time: 0.966447 data: 0.000175 max mem: 18817 Epoch: [260/300] [ 200/1251] eta: 0:16:42 lr: 0.000104 loss: 2.527578 (2.642049) time: 0.929010 data: 0.000178 max mem: 18817 Epoch: [260/300] [ 250/1251] eta: 0:15:54 lr: 0.000104 loss: 2.660659 (2.640840) time: 0.912408 data: 0.000179 max mem: 18817 Epoch: [260/300] [ 300/1251] eta: 0:15:08 lr: 0.000104 loss: 2.798014 (2.647410) time: 0.923821 data: 0.000168 max mem: 18817 Epoch: [260/300] [ 350/1251] eta: 0:14:22 lr: 0.000104 loss: 2.603834 (2.628307) time: 0.965845 data: 0.000170 max mem: 18817 Epoch: [260/300] [ 400/1251] eta: 0:13:33 lr: 0.000103 loss: 2.455439 (2.618345) time: 0.978558 data: 0.000174 max mem: 18817 Epoch: [260/300] [ 450/1251] eta: 0:12:44 lr: 0.000103 loss: 2.785305 (2.624858) time: 0.918530 data: 0.000180 max mem: 18817 Epoch: [260/300] [ 500/1251] eta: 0:11:56 lr: 0.000103 loss: 2.788991 (2.625500) time: 0.930401 data: 0.000170 max mem: 18817 Epoch: [260/300] [ 550/1251] eta: 0:11:10 lr: 0.000103 loss: 2.552478 (2.628461) time: 0.915884 data: 0.000183 max mem: 18817 Epoch: [260/300] [ 600/1251] eta: 0:10:22 lr: 0.000103 loss: 2.521607 (2.627190) time: 0.971204 data: 0.000177 max mem: 18817 Epoch: [260/300] [ 650/1251] eta: 0:09:33 lr: 0.000103 loss: 2.629872 (2.623984) time: 0.968906 data: 0.000159 max mem: 18817 Epoch: [260/300] [ 700/1251] eta: 0:08:45 lr: 0.000102 loss: 2.566845 (2.623139) time: 0.920699 data: 0.000157 max mem: 18817 Epoch: [260/300] [ 750/1251] eta: 0:07:57 lr: 0.000102 loss: 2.639309 (2.619260) time: 0.915757 data: 0.000171 max mem: 18817 Epoch: [260/300] [ 800/1251] eta: 0:07:10 lr: 0.000102 loss: 2.703780 (2.624429) time: 0.976429 data: 0.000170 max mem: 18817 Epoch: [260/300] [ 850/1251] eta: 0:06:22 lr: 0.000102 loss: 2.779672 (2.627186) time: 0.999877 data: 0.000194 max mem: 18817 Epoch: [260/300] [ 900/1251] eta: 0:05:34 lr: 0.000102 loss: 2.696721 (2.622402) time: 0.975133 data: 0.000177 max mem: 18817 Epoch: [260/300] [ 950/1251] eta: 0:04:47 lr: 0.000102 loss: 2.680071 (2.623281) time: 0.926355 data: 0.000166 max mem: 18817 Epoch: [260/300] [1000/1251] eta: 0:03:59 lr: 0.000101 loss: 2.463042 (2.618918) time: 0.928905 data: 0.000176 max mem: 18817 Epoch: [260/300] [1050/1251] eta: 0:03:11 lr: 0.000101 loss: 2.568942 (2.616881) time: 0.975276 data: 0.000204 max mem: 18817 Epoch: [260/300] [1100/1251] eta: 0:02:24 lr: 0.000101 loss: 2.767169 (2.620334) time: 1.020330 data: 0.000171 max mem: 18817 Epoch: [260/300] [1150/1251] eta: 0:01:36 lr: 0.000101 loss: 2.696304 (2.619966) time: 0.961765 data: 0.000174 max mem: 18817 Epoch: [260/300] [1200/1251] eta: 0:00:48 lr: 0.000101 loss: 2.543142 (2.618812) time: 0.924433 data: 0.000165 max mem: 18817 Epoch: [260/300] [1250/1251] eta: 0:00:00 lr: 0.000101 loss: 2.727621 (2.617430) time: 0.922081 data: 0.000756 max mem: 18817 Epoch: [260/300] Total time: 0:19:52 (0.952999 s / it) Averaged stats: lr: 0.000101 loss: 2.727621 (2.614571) Test: [ 0/49] eta: 0:01:19 loss: 0.499324 (0.499324) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.619464 data: 1.214338 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.576688 (0.650633) acc1: 84.375000 (85.511364) acc5: 98.437500 (96.875000) time: 0.475702 data: 0.110562 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.681351 (0.668314) acc1: 84.375000 (84.895833) acc5: 96.875000 (96.800595) time: 0.357287 data: 0.000162 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.651869 (0.666508) acc1: 84.375000 (84.375000) acc5: 96.875000 (97.177419) time: 0.351822 data: 0.000157 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.672738 (0.680039) acc1: 82.812500 (84.336890) acc5: 96.875000 (97.065549) time: 0.348892 data: 0.000148 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.676027 (0.676503) acc1: 82.812500 (84.416000) acc5: 96.875000 (97.088000) time: 0.344172 data: 0.000111 max mem: 18817 Test: Total time: 0:00:18 (0.380214 s / it) * Acc@1 83.882 Acc@5 96.788 loss 0.700 Max accuracy: 83.88% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0260.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0260.pth Epoch: [261/300] [ 0/1251] eta: 0:44:44 lr: 0.000101 loss: 2.602170 (2.602170) time: 2.146013 data: 1.270664 max mem: 18817 Epoch: [261/300] [ 50/1251] eta: 0:19:15 lr: 0.000100 loss: 2.714335 (2.630456) time: 0.971383 data: 0.000167 max mem: 18817 Epoch: [261/300] [ 100/1251] eta: 0:18:14 lr: 0.000100 loss: 2.588872 (2.598734) time: 0.921675 data: 0.000165 max mem: 18817 Epoch: [261/300] [ 150/1251] eta: 0:17:28 lr: 0.000100 loss: 2.726568 (2.658512) time: 0.931853 data: 0.000199 max mem: 18817 Epoch: [261/300] [ 200/1251] eta: 0:16:42 lr: 0.000100 loss: 2.532614 (2.642842) time: 0.973908 data: 0.000181 max mem: 18817 Epoch: [261/300] [ 250/1251] eta: 0:15:49 lr: 0.000100 loss: 2.539573 (2.635181) time: 0.952871 data: 0.000178 max mem: 18817 Epoch: [261/300] [ 300/1251] eta: 0:15:02 lr: 0.000100 loss: 2.688011 (2.624683) time: 0.937077 data: 0.000188 max mem: 18817 Epoch: [261/300] [ 350/1251] eta: 0:14:14 lr: 0.000099 loss: 2.633539 (2.622368) time: 0.930338 data: 0.000181 max mem: 18817 Epoch: [261/300] [ 400/1251] eta: 0:13:28 lr: 0.000099 loss: 2.759216 (2.637393) time: 0.918531 data: 0.000170 max mem: 18817 Epoch: [261/300] [ 450/1251] eta: 0:12:39 lr: 0.000099 loss: 2.838426 (2.643266) time: 0.944954 data: 0.000168 max mem: 18817 Epoch: [261/300] [ 500/1251] eta: 0:11:51 lr: 0.000099 loss: 2.872031 (2.656956) time: 0.934759 data: 0.000163 max mem: 18817 Epoch: [261/300] [ 550/1251] eta: 0:11:05 lr: 0.000099 loss: 2.513087 (2.647220) time: 0.916096 data: 0.000176 max mem: 18817 Epoch: [261/300] [ 600/1251] eta: 0:10:18 lr: 0.000099 loss: 2.509580 (2.642680) time: 0.964554 data: 0.000183 max mem: 18817 Epoch: [261/300] [ 650/1251] eta: 0:09:31 lr: 0.000098 loss: 2.658612 (2.638002) time: 1.003779 data: 0.000195 max mem: 18817 Epoch: [261/300] [ 700/1251] eta: 0:08:43 lr: 0.000098 loss: 2.488189 (2.629300) time: 0.975418 data: 0.000178 max mem: 18817 Epoch: [261/300] [ 750/1251] eta: 0:07:55 lr: 0.000098 loss: 2.832831 (2.631230) time: 0.913180 data: 0.000183 max mem: 18817 Epoch: [261/300] [ 800/1251] eta: 0:07:08 lr: 0.000098 loss: 2.698537 (2.621765) time: 0.905382 data: 0.000164 max mem: 18817 Epoch: [261/300] [ 850/1251] eta: 0:06:20 lr: 0.000098 loss: 2.666651 (2.624015) time: 0.973062 data: 0.000185 max mem: 18817 Epoch: [261/300] [ 900/1251] eta: 0:05:33 lr: 0.000098 loss: 2.829239 (2.627989) time: 0.972931 data: 0.000184 max mem: 18817 Epoch: [261/300] [ 950/1251] eta: 0:04:45 lr: 0.000097 loss: 2.874460 (2.626681) time: 0.979436 data: 0.000178 max mem: 18817 Epoch: [261/300] [1000/1251] eta: 0:03:58 lr: 0.000097 loss: 2.682101 (2.626512) time: 0.909061 data: 0.000161 max mem: 18817 Epoch: [261/300] [1050/1251] eta: 0:03:10 lr: 0.000097 loss: 2.727371 (2.626939) time: 0.914893 data: 0.000173 max mem: 18817 Epoch: [261/300] [1100/1251] eta: 0:02:23 lr: 0.000097 loss: 2.598480 (2.629306) time: 0.966687 data: 0.000177 max mem: 18817 Epoch: [261/300] [1150/1251] eta: 0:01:35 lr: 0.000097 loss: 2.713570 (2.629541) time: 0.965222 data: 0.000173 max mem: 18817 Epoch: [261/300] [1200/1251] eta: 0:00:48 lr: 0.000097 loss: 2.710922 (2.629322) time: 0.981379 data: 0.000166 max mem: 18817 Epoch: [261/300] [1250/1251] eta: 0:00:00 lr: 0.000097 loss: 2.672863 (2.628967) time: 0.917435 data: 0.000776 max mem: 18817 Epoch: [261/300] Total time: 0:19:48 (0.950045 s / it) Averaged stats: lr: 0.000097 loss: 2.672863 (2.635502) Test: [ 0/49] eta: 0:01:15 loss: 0.492473 (0.492473) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.532794 data: 1.091663 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.574810 (0.649545) acc1: 84.375000 (84.232955) acc5: 96.875000 (96.448864) time: 0.480937 data: 0.100211 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.699769 (0.679091) acc1: 82.812500 (84.002976) acc5: 96.875000 (96.726190) time: 0.372023 data: 0.000592 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.681810 (0.683948) acc1: 82.812500 (83.467742) acc5: 96.875000 (96.925403) time: 0.360204 data: 0.000119 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.675560 (0.696075) acc1: 81.250000 (83.384146) acc5: 98.437500 (96.989329) time: 0.349531 data: 0.000115 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.676787 (0.692081) acc1: 84.375000 (83.584000) acc5: 98.437500 (97.056000) time: 0.344088 data: 0.000097 max mem: 18817 Test: Total time: 0:00:18 (0.383675 s / it) * Acc@1 83.778 Acc@5 96.744 loss 0.710 Max accuracy: 83.88% Epoch: [262/300] [ 0/1251] eta: 0:39:22 lr: 0.000097 loss: 2.831488 (2.831488) time: 1.888305 data: 1.027599 max mem: 18817 Epoch: [262/300] [ 50/1251] eta: 0:19:28 lr: 0.000096 loss: 2.620390 (2.541137) time: 0.965858 data: 0.000187 max mem: 18817 Epoch: [262/300] [ 100/1251] eta: 0:18:14 lr: 0.000096 loss: 2.700230 (2.589559) time: 0.950032 data: 0.000173 max mem: 18817 Epoch: [262/300] [ 150/1251] eta: 0:17:29 lr: 0.000096 loss: 2.373976 (2.579542) time: 0.969084 data: 0.000179 max mem: 18817 Epoch: [262/300] [ 200/1251] eta: 0:16:38 lr: 0.000096 loss: 2.686759 (2.586385) time: 0.916978 data: 0.000184 max mem: 18817 Epoch: [262/300] [ 250/1251] eta: 0:15:51 lr: 0.000096 loss: 2.465294 (2.591207) time: 0.940166 data: 0.000171 max mem: 18817 Epoch: [262/300] [ 300/1251] eta: 0:15:06 lr: 0.000096 loss: 2.627393 (2.602036) time: 0.988918 data: 0.000158 max mem: 18817 Epoch: [262/300] [ 350/1251] eta: 0:14:17 lr: 0.000095 loss: 2.681080 (2.595304) time: 0.972416 data: 0.000163 max mem: 18817 Epoch: [262/300] [ 400/1251] eta: 0:13:28 lr: 0.000095 loss: 2.811548 (2.605303) time: 0.910809 data: 0.000182 max mem: 18817 Epoch: [262/300] [ 450/1251] eta: 0:12:41 lr: 0.000095 loss: 2.691679 (2.613136) time: 0.911812 data: 0.000172 max mem: 18817 Epoch: [262/300] [ 500/1251] eta: 0:11:54 lr: 0.000095 loss: 2.734812 (2.617799) time: 0.970612 data: 0.000187 max mem: 18817 Epoch: [262/300] [ 550/1251] eta: 0:11:07 lr: 0.000095 loss: 2.822825 (2.627336) time: 0.960195 data: 0.000187 max mem: 18817 Epoch: [262/300] [ 600/1251] eta: 0:10:19 lr: 0.000095 loss: 2.689114 (2.621340) time: 0.980956 data: 0.000177 max mem: 18817 Epoch: [262/300] [ 650/1251] eta: 0:09:31 lr: 0.000094 loss: 2.631049 (2.619729) time: 0.922994 data: 0.000167 max mem: 18817 Epoch: [262/300] [ 700/1251] eta: 0:08:44 lr: 0.000094 loss: 2.663524 (2.618824) time: 0.928519 data: 0.000163 max mem: 18817 Epoch: [262/300] [ 750/1251] eta: 0:07:57 lr: 0.000094 loss: 2.698510 (2.614579) time: 0.974895 data: 0.000173 max mem: 18817 Epoch: [262/300] [ 800/1251] eta: 0:07:09 lr: 0.000094 loss: 2.488096 (2.611378) time: 0.966894 data: 0.000164 max mem: 18817 Epoch: [262/300] [ 850/1251] eta: 0:06:21 lr: 0.000094 loss: 2.488035 (2.607498) time: 0.974968 data: 0.000171 max mem: 18817 Epoch: [262/300] [ 900/1251] eta: 0:05:34 lr: 0.000094 loss: 2.575824 (2.609697) time: 0.927540 data: 0.000171 max mem: 18817 Epoch: [262/300] [ 950/1251] eta: 0:04:46 lr: 0.000094 loss: 2.464424 (2.605497) time: 0.918927 data: 0.000178 max mem: 18817 Epoch: [262/300] [1000/1251] eta: 0:03:59 lr: 0.000093 loss: 2.740595 (2.605022) time: 0.980722 data: 0.000176 max mem: 18817 Epoch: [262/300] [1050/1251] eta: 0:03:11 lr: 0.000093 loss: 2.726352 (2.603742) time: 0.968080 data: 0.000180 max mem: 18817 Epoch: [262/300] [1100/1251] eta: 0:02:23 lr: 0.000093 loss: 2.569937 (2.603484) time: 0.947168 data: 0.000176 max mem: 18817 Epoch: [262/300] [1150/1251] eta: 0:01:36 lr: 0.000093 loss: 2.486854 (2.602953) time: 0.912013 data: 0.000168 max mem: 18817 Epoch: [262/300] [1200/1251] eta: 0:00:48 lr: 0.000093 loss: 2.761652 (2.603774) time: 0.917208 data: 0.000176 max mem: 18817 Epoch: [262/300] [1250/1251] eta: 0:00:00 lr: 0.000093 loss: 2.561157 (2.599802) time: 0.962492 data: 0.000765 max mem: 18817 Epoch: [262/300] Total time: 0:19:51 (0.952122 s / it) Averaged stats: lr: 0.000093 loss: 2.561157 (2.600459) Test: [ 0/49] eta: 0:01:17 loss: 0.520348 (0.520348) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.578363 data: 1.126255 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.578814 (0.650317) acc1: 85.937500 (85.795455) acc5: 98.437500 (96.875000) time: 0.473852 data: 0.102535 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.689514 (0.676905) acc1: 84.375000 (85.267857) acc5: 96.875000 (96.949405) time: 0.357316 data: 0.000140 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.684647 (0.675168) acc1: 84.375000 (84.627016) acc5: 96.875000 (97.177419) time: 0.351386 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.709149 (0.694956) acc1: 82.812500 (84.184451) acc5: 96.875000 (97.027439) time: 0.360717 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.709149 (0.692520) acc1: 82.812500 (84.064000) acc5: 96.875000 (97.088000) time: 0.368642 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.389604 s / it) * Acc@1 83.808 Acc@5 96.726 loss 0.710 Max accuracy: 83.88% Epoch: [263/300] [ 0/1251] eta: 0:41:58 lr: 0.000093 loss: 1.844059 (1.844059) time: 2.013265 data: 1.146936 max mem: 18817 Epoch: [263/300] [ 50/1251] eta: 0:19:13 lr: 0.000092 loss: 2.650768 (2.658129) time: 0.925101 data: 0.000161 max mem: 18817 Epoch: [263/300] [ 100/1251] eta: 0:18:23 lr: 0.000092 loss: 2.834808 (2.642657) time: 0.915288 data: 0.000165 max mem: 18817 Epoch: [263/300] [ 150/1251] eta: 0:17:33 lr: 0.000092 loss: 2.705041 (2.668288) time: 0.957791 data: 0.000184 max mem: 18817 Epoch: [263/300] [ 200/1251] eta: 0:16:46 lr: 0.000092 loss: 2.743132 (2.647191) time: 0.999448 data: 0.000181 max mem: 18817 Epoch: [263/300] [ 250/1251] eta: 0:15:54 lr: 0.000092 loss: 2.834395 (2.639152) time: 0.957436 data: 0.000173 max mem: 18817 Epoch: [263/300] [ 300/1251] eta: 0:15:03 lr: 0.000092 loss: 2.385713 (2.635571) time: 0.915262 data: 0.000171 max mem: 18817 Epoch: [263/300] [ 350/1251] eta: 0:14:15 lr: 0.000091 loss: 2.617476 (2.630920) time: 0.917938 data: 0.000167 max mem: 18817 Epoch: [263/300] [ 400/1251] eta: 0:13:29 lr: 0.000091 loss: 2.579024 (2.621094) time: 0.977049 data: 0.000184 max mem: 18817 Epoch: [263/300] [ 450/1251] eta: 0:12:42 lr: 0.000091 loss: 2.568671 (2.613395) time: 1.025314 data: 0.000173 max mem: 18817 Epoch: [263/300] [ 500/1251] eta: 0:11:54 lr: 0.000091 loss: 2.502624 (2.606336) time: 0.969847 data: 0.000191 max mem: 18817 Epoch: [263/300] [ 550/1251] eta: 0:11:06 lr: 0.000091 loss: 2.828181 (2.612020) time: 0.912023 data: 0.000193 max mem: 18817 Epoch: [263/300] [ 600/1251] eta: 0:10:18 lr: 0.000091 loss: 2.588294 (2.606847) time: 0.910681 data: 0.000181 max mem: 18817 Epoch: [263/300] [ 650/1251] eta: 0:09:31 lr: 0.000091 loss: 2.729783 (2.610118) time: 0.952826 data: 0.000168 max mem: 18817 Epoch: [263/300] [ 700/1251] eta: 0:08:43 lr: 0.000090 loss: 2.697309 (2.614428) time: 0.965298 data: 0.000157 max mem: 18817 Epoch: [263/300] [ 750/1251] eta: 0:07:56 lr: 0.000090 loss: 2.737228 (2.614543) time: 0.959249 data: 0.000161 max mem: 18817 Epoch: [263/300] [ 800/1251] eta: 0:07:08 lr: 0.000090 loss: 2.690628 (2.617831) time: 0.920349 data: 0.000195 max mem: 18817 Epoch: [263/300] [ 850/1251] eta: 0:06:21 lr: 0.000090 loss: 2.788835 (2.624094) time: 0.941954 data: 0.000171 max mem: 18817 Epoch: [263/300] [ 900/1251] eta: 0:05:34 lr: 0.000090 loss: 2.772902 (2.628446) time: 0.972281 data: 0.000171 max mem: 18817 Epoch: [263/300] [ 950/1251] eta: 0:04:46 lr: 0.000090 loss: 2.706780 (2.626787) time: 0.963344 data: 0.000177 max mem: 18817 Epoch: [263/300] [1000/1251] eta: 0:03:58 lr: 0.000089 loss: 2.755658 (2.629673) time: 0.974874 data: 0.000184 max mem: 18817 Epoch: [263/300] [1050/1251] eta: 0:03:11 lr: 0.000089 loss: 2.672172 (2.629104) time: 0.913460 data: 0.000185 max mem: 18817 Epoch: [263/300] [1100/1251] eta: 0:02:23 lr: 0.000089 loss: 2.730805 (2.629858) time: 0.975223 data: 0.000162 max mem: 18817 Epoch: [263/300] [1150/1251] eta: 0:01:36 lr: 0.000089 loss: 2.615283 (2.626801) time: 0.963540 data: 0.000175 max mem: 18817 Epoch: [263/300] [1200/1251] eta: 0:00:48 lr: 0.000089 loss: 2.701566 (2.630496) time: 0.926347 data: 0.000179 max mem: 18817 Epoch: [263/300] [1250/1251] eta: 0:00:00 lr: 0.000089 loss: 2.673291 (2.625001) time: 0.934556 data: 0.000758 max mem: 18817 Epoch: [263/300] Total time: 0:19:50 (0.951717 s / it) Averaged stats: lr: 0.000089 loss: 2.673291 (2.625729) Test: [ 0/49] eta: 0:01:17 loss: 0.481503 (0.481503) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.581303 data: 1.150538 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.595886 (0.645076) acc1: 84.375000 (85.227273) acc5: 98.437500 (97.017045) time: 0.479923 data: 0.104762 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.682109 (0.668761) acc1: 82.812500 (84.449405) acc5: 96.875000 (97.023810) time: 0.361340 data: 0.000149 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.677333 (0.669807) acc1: 82.812500 (84.122984) acc5: 96.875000 (97.227823) time: 0.353062 data: 0.000119 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.678474 (0.686899) acc1: 82.812500 (83.803354) acc5: 96.875000 (97.217988) time: 0.349915 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.678474 (0.685209) acc1: 82.812500 (83.936000) acc5: 98.437500 (97.216000) time: 0.439061 data: 0.000106 max mem: 18817 Test: Total time: 0:00:20 (0.419892 s / it) * Acc@1 83.902 Acc@5 96.758 loss 0.703 Max accuracy: 83.90% Epoch: [264/300] [ 0/1251] eta: 0:41:50 lr: 0.000089 loss: 2.650251 (2.650251) time: 2.007105 data: 1.043382 max mem: 18817 Epoch: [264/300] [ 50/1251] eta: 0:19:34 lr: 0.000089 loss: 2.711964 (2.637709) time: 1.020705 data: 0.000167 max mem: 18817 Epoch: [264/300] [ 100/1251] eta: 0:18:25 lr: 0.000088 loss: 2.663935 (2.591463) time: 0.973606 data: 0.000191 max mem: 18817 Epoch: [264/300] [ 150/1251] eta: 0:17:28 lr: 0.000088 loss: 2.743791 (2.619644) time: 0.914966 data: 0.000186 max mem: 18817 Epoch: [264/300] [ 200/1251] eta: 0:16:44 lr: 0.000088 loss: 2.688596 (2.632187) time: 0.913125 data: 0.000175 max mem: 18817 Epoch: [264/300] [ 250/1251] eta: 0:15:57 lr: 0.000088 loss: 2.739709 (2.630189) time: 0.978450 data: 0.000170 max mem: 18817 Epoch: [264/300] [ 300/1251] eta: 0:15:07 lr: 0.000088 loss: 2.482979 (2.608820) time: 0.977146 data: 0.000179 max mem: 18817 Epoch: [264/300] [ 350/1251] eta: 0:14:18 lr: 0.000088 loss: 2.546816 (2.608601) time: 0.923231 data: 0.000176 max mem: 18817 Epoch: [264/300] [ 400/1251] eta: 0:13:31 lr: 0.000088 loss: 2.675374 (2.607299) time: 0.942769 data: 0.000173 max mem: 18817 Epoch: [264/300] [ 450/1251] eta: 0:12:45 lr: 0.000087 loss: 2.404164 (2.603995) time: 0.975775 data: 0.000174 max mem: 18817 Epoch: [264/300] [ 500/1251] eta: 0:11:55 lr: 0.000087 loss: 2.728510 (2.606487) time: 0.968072 data: 0.000189 max mem: 18817 Epoch: [264/300] [ 550/1251] eta: 0:11:07 lr: 0.000087 loss: 2.605570 (2.611245) time: 0.942757 data: 0.000183 max mem: 18817 Epoch: [264/300] [ 600/1251] eta: 0:10:20 lr: 0.000087 loss: 2.436732 (2.608214) time: 0.912269 data: 0.000171 max mem: 18817 Epoch: [264/300] [ 650/1251] eta: 0:09:32 lr: 0.000087 loss: 2.687645 (2.615035) time: 0.970049 data: 0.000165 max mem: 18817 Epoch: [264/300] [ 700/1251] eta: 0:08:45 lr: 0.000087 loss: 2.588631 (2.610841) time: 0.964844 data: 0.000169 max mem: 18817 Epoch: [264/300] [ 750/1251] eta: 0:07:56 lr: 0.000086 loss: 2.676519 (2.608543) time: 0.959267 data: 0.000185 max mem: 18817 Epoch: [264/300] [ 800/1251] eta: 0:07:08 lr: 0.000086 loss: 2.536062 (2.601937) time: 0.914251 data: 0.000169 max mem: 18817 Epoch: [264/300] [ 850/1251] eta: 0:06:21 lr: 0.000086 loss: 2.533233 (2.599499) time: 0.922208 data: 0.000174 max mem: 18817 Epoch: [264/300] [ 900/1251] eta: 0:05:34 lr: 0.000086 loss: 2.646957 (2.601072) time: 0.981968 data: 0.000181 max mem: 18817 Epoch: [264/300] [ 950/1251] eta: 0:04:46 lr: 0.000086 loss: 2.826103 (2.605853) time: 1.014368 data: 0.000181 max mem: 18817 Epoch: [264/300] [1000/1251] eta: 0:03:59 lr: 0.000086 loss: 2.737025 (2.609269) time: 0.985373 data: 0.000157 max mem: 18817 Epoch: [264/300] [1050/1251] eta: 0:03:11 lr: 0.000086 loss: 2.656727 (2.608471) time: 0.918588 data: 0.000171 max mem: 18817 Epoch: [264/300] [1100/1251] eta: 0:02:23 lr: 0.000085 loss: 2.593586 (2.604514) time: 0.915048 data: 0.000181 max mem: 18817 Epoch: [264/300] [1150/1251] eta: 0:01:36 lr: 0.000085 loss: 2.642377 (2.607541) time: 0.987598 data: 0.000174 max mem: 18817 Epoch: [264/300] [1200/1251] eta: 0:00:48 lr: 0.000085 loss: 2.373155 (2.601861) time: 1.027137 data: 0.000179 max mem: 18817 Epoch: [264/300] [1250/1251] eta: 0:00:00 lr: 0.000085 loss: 2.632950 (2.600139) time: 0.955409 data: 0.000822 max mem: 18817 Epoch: [264/300] Total time: 0:19:52 (0.952862 s / it) Averaged stats: lr: 0.000085 loss: 2.632950 (2.596481) Test: [ 0/49] eta: 0:01:16 loss: 0.495749 (0.495749) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.555312 data: 1.125674 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.596825 (0.654033) acc1: 84.375000 (84.943182) acc5: 98.437500 (96.732955) time: 0.469285 data: 0.102469 max mem: 18817 Test: [20/49] eta: 0:00:11 loss: 0.701609 (0.676559) acc1: 82.812500 (84.523810) acc5: 96.875000 (96.577381) time: 0.356143 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.689744 (0.672441) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.925403) time: 0.458373 data: 0.000134 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.671511 (0.684961) acc1: 82.812500 (84.108232) acc5: 96.875000 (96.913110) time: 0.455961 data: 0.000136 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.677153 (0.683402) acc1: 84.375000 (84.288000) acc5: 96.875000 (97.024000) time: 0.344257 data: 0.000109 max mem: 18817 Test: Total time: 0:00:20 (0.420726 s / it) * Acc@1 83.792 Acc@5 96.790 loss 0.705 Max accuracy: 83.90% Epoch: [265/300] [ 0/1251] eta: 0:41:13 lr: 0.000085 loss: 2.450415 (2.450415) time: 1.976838 data: 1.101036 max mem: 18817 Epoch: [265/300] [ 50/1251] eta: 0:19:44 lr: 0.000085 loss: 2.505031 (2.546081) time: 0.969217 data: 0.000180 max mem: 18817 Epoch: [265/300] [ 100/1251] eta: 0:18:37 lr: 0.000085 loss: 2.658592 (2.568291) time: 1.005393 data: 0.000169 max mem: 18817 Epoch: [265/300] [ 150/1251] eta: 0:17:40 lr: 0.000085 loss: 2.801310 (2.590287) time: 0.956823 data: 0.000162 max mem: 18817 Epoch: [265/300] [ 200/1251] eta: 0:16:50 lr: 0.000084 loss: 2.519470 (2.568403) time: 0.934539 data: 0.000172 max mem: 18817 Epoch: [265/300] [ 250/1251] eta: 0:16:01 lr: 0.000084 loss: 2.708512 (2.582669) time: 0.907423 data: 0.000185 max mem: 18817 Epoch: [265/300] [ 300/1251] eta: 0:15:12 lr: 0.000084 loss: 2.850743 (2.598003) time: 0.973873 data: 0.000175 max mem: 18817 Epoch: [265/300] [ 350/1251] eta: 0:14:23 lr: 0.000084 loss: 2.742576 (2.611870) time: 0.978040 data: 0.000177 max mem: 18817 Epoch: [265/300] [ 400/1251] eta: 0:13:37 lr: 0.000084 loss: 2.660745 (2.616824) time: 0.961645 data: 0.000173 max mem: 18817 Epoch: [265/300] [ 450/1251] eta: 0:12:47 lr: 0.000084 loss: 2.572165 (2.610825) time: 0.912858 data: 0.000188 max mem: 18817 Epoch: [265/300] [ 500/1251] eta: 0:11:59 lr: 0.000084 loss: 2.765672 (2.611677) time: 0.911651 data: 0.000185 max mem: 18817 Epoch: [265/300] [ 550/1251] eta: 0:11:11 lr: 0.000083 loss: 2.605077 (2.611163) time: 0.953579 data: 0.000175 max mem: 18817 Epoch: [265/300] [ 600/1251] eta: 0:10:22 lr: 0.000083 loss: 2.614155 (2.610993) time: 0.950578 data: 0.000160 max mem: 18817 Epoch: [265/300] [ 650/1251] eta: 0:09:33 lr: 0.000083 loss: 2.843604 (2.619948) time: 0.925139 data: 0.000174 max mem: 18817 Epoch: [265/300] [ 700/1251] eta: 0:08:46 lr: 0.000083 loss: 2.407042 (2.617613) time: 0.927030 data: 0.000183 max mem: 18817 Epoch: [265/300] [ 750/1251] eta: 0:07:59 lr: 0.000083 loss: 2.218127 (2.608992) time: 0.901006 data: 0.000184 max mem: 18817 Epoch: [265/300] [ 800/1251] eta: 0:07:11 lr: 0.000083 loss: 2.709957 (2.606704) time: 0.992666 data: 0.000178 max mem: 18817 Epoch: [265/300] [ 850/1251] eta: 0:06:23 lr: 0.000082 loss: 2.864897 (2.607837) time: 0.977497 data: 0.000165 max mem: 18817 Epoch: [265/300] [ 900/1251] eta: 0:05:35 lr: 0.000082 loss: 2.853416 (2.611108) time: 0.925663 data: 0.000165 max mem: 18817 Epoch: [265/300] [ 950/1251] eta: 0:04:47 lr: 0.000082 loss: 2.552744 (2.610450) time: 0.911444 data: 0.000184 max mem: 18817 Epoch: [265/300] [1000/1251] eta: 0:03:59 lr: 0.000082 loss: 2.844353 (2.616652) time: 0.980382 data: 0.000171 max mem: 18817 Epoch: [265/300] [1050/1251] eta: 0:03:12 lr: 0.000082 loss: 2.436115 (2.607951) time: 0.978380 data: 0.000183 max mem: 18817 Epoch: [265/300] [1100/1251] eta: 0:02:24 lr: 0.000082 loss: 2.655042 (2.609876) time: 0.985563 data: 0.000203 max mem: 18817 Epoch: [265/300] [1150/1251] eta: 0:01:36 lr: 0.000082 loss: 2.429174 (2.607424) time: 0.925660 data: 0.000170 max mem: 18817 Epoch: [265/300] [1200/1251] eta: 0:00:48 lr: 0.000081 loss: 2.787489 (2.609207) time: 0.913083 data: 0.000179 max mem: 18817 Epoch: [265/300] [1250/1251] eta: 0:00:00 lr: 0.000081 loss: 2.724077 (2.608457) time: 0.955429 data: 0.000759 max mem: 18817 Epoch: [265/300] Total time: 0:19:56 (0.956173 s / it) Averaged stats: lr: 0.000081 loss: 2.724077 (2.609006) Test: [ 0/49] eta: 0:01:16 loss: 0.539174 (0.539174) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.560587 data: 1.136189 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.597207 (0.658573) acc1: 85.937500 (85.227273) acc5: 98.437500 (96.875000) time: 0.468197 data: 0.103424 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.736744 (0.687359) acc1: 82.812500 (84.077381) acc5: 96.875000 (96.875000) time: 0.359541 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.702298 (0.687140) acc1: 81.250000 (83.770161) acc5: 96.875000 (97.026210) time: 0.359642 data: 0.000148 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.680040 (0.698354) acc1: 81.250000 (83.498476) acc5: 96.875000 (96.989329) time: 0.357370 data: 0.000142 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.694947 (0.693524) acc1: 81.250000 (83.680000) acc5: 96.875000 (97.024000) time: 0.348441 data: 0.000101 max mem: 18817 Test: Total time: 0:00:18 (0.382045 s / it) * Acc@1 83.714 Acc@5 96.768 loss 0.710 Max accuracy: 83.90% Epoch: [266/300] [ 0/1251] eta: 0:43:45 lr: 0.000081 loss: 1.713511 (1.713511) time: 2.098684 data: 1.224905 max mem: 18817 Epoch: [266/300] [ 50/1251] eta: 0:19:20 lr: 0.000081 loss: 2.540166 (2.447487) time: 0.983541 data: 0.000172 max mem: 18817 Epoch: [266/300] [ 100/1251] eta: 0:18:17 lr: 0.000081 loss: 2.625390 (2.497056) time: 0.931299 data: 0.000174 max mem: 18817 Epoch: [266/300] [ 150/1251] eta: 0:17:33 lr: 0.000081 loss: 2.802807 (2.560351) time: 0.935738 data: 0.000179 max mem: 18817 Epoch: [266/300] [ 200/1251] eta: 0:16:41 lr: 0.000081 loss: 2.697073 (2.574985) time: 0.951786 data: 0.000179 max mem: 18817 Epoch: [266/300] [ 250/1251] eta: 0:15:54 lr: 0.000081 loss: 2.486057 (2.557605) time: 1.019521 data: 0.000190 max mem: 18817 Epoch: [266/300] [ 300/1251] eta: 0:15:05 lr: 0.000080 loss: 2.611369 (2.559575) time: 0.977893 data: 0.000176 max mem: 18817 Epoch: [266/300] [ 350/1251] eta: 0:14:17 lr: 0.000080 loss: 2.523515 (2.561038) time: 0.915465 data: 0.000166 max mem: 18817 Epoch: [266/300] [ 400/1251] eta: 0:13:30 lr: 0.000080 loss: 2.827107 (2.578646) time: 0.927317 data: 0.000187 max mem: 18817 Epoch: [266/300] [ 450/1251] eta: 0:12:43 lr: 0.000080 loss: 2.785925 (2.581239) time: 0.960358 data: 0.000168 max mem: 18817 Epoch: [266/300] [ 500/1251] eta: 0:11:56 lr: 0.000080 loss: 2.520663 (2.577004) time: 1.031738 data: 0.000177 max mem: 18817 Epoch: [266/300] [ 550/1251] eta: 0:11:07 lr: 0.000080 loss: 2.703075 (2.581239) time: 0.947098 data: 0.000265 max mem: 18817 Epoch: [266/300] [ 600/1251] eta: 0:10:18 lr: 0.000080 loss: 2.549154 (2.575191) time: 0.913051 data: 0.000159 max mem: 18817 Epoch: [266/300] [ 650/1251] eta: 0:09:31 lr: 0.000079 loss: 2.714839 (2.572334) time: 0.924999 data: 0.000176 max mem: 18817 Epoch: [266/300] [ 700/1251] eta: 0:08:44 lr: 0.000079 loss: 2.616873 (2.581540) time: 0.971078 data: 0.000178 max mem: 18817 Epoch: [266/300] [ 750/1251] eta: 0:07:57 lr: 0.000079 loss: 2.575009 (2.583137) time: 0.993307 data: 0.000178 max mem: 18817 Epoch: [266/300] [ 800/1251] eta: 0:07:08 lr: 0.000079 loss: 2.667452 (2.588832) time: 0.902889 data: 0.000174 max mem: 18817 Epoch: [266/300] [ 850/1251] eta: 0:06:21 lr: 0.000079 loss: 2.541075 (2.588480) time: 0.917710 data: 0.000183 max mem: 18817 Epoch: [266/300] [ 900/1251] eta: 0:05:34 lr: 0.000079 loss: 2.617321 (2.586382) time: 0.923893 data: 0.000195 max mem: 18817 Epoch: [266/300] [ 950/1251] eta: 0:04:46 lr: 0.000079 loss: 2.767447 (2.589994) time: 0.966694 data: 0.000172 max mem: 18817 Epoch: [266/300] [1000/1251] eta: 0:03:58 lr: 0.000078 loss: 2.680538 (2.593487) time: 0.958829 data: 0.000157 max mem: 18817 Epoch: [266/300] [1050/1251] eta: 0:03:11 lr: 0.000078 loss: 2.458225 (2.591085) time: 0.977614 data: 0.000169 max mem: 18817 Epoch: [266/300] [1100/1251] eta: 0:02:23 lr: 0.000078 loss: 2.624621 (2.589123) time: 0.961009 data: 0.000178 max mem: 18817 Epoch: [266/300] [1150/1251] eta: 0:01:35 lr: 0.000078 loss: 2.794593 (2.586401) time: 0.924088 data: 0.000185 max mem: 18817 Epoch: [266/300] [1200/1251] eta: 0:00:48 lr: 0.000078 loss: 2.575840 (2.584978) time: 0.966900 data: 0.000171 max mem: 18817 Epoch: [266/300] [1250/1251] eta: 0:00:00 lr: 0.000078 loss: 2.516385 (2.578681) time: 0.974573 data: 0.000751 max mem: 18817 Epoch: [266/300] Total time: 0:19:49 (0.951220 s / it) Averaged stats: lr: 0.000078 loss: 2.516385 (2.584877) Test: [ 0/49] eta: 0:01:17 loss: 0.493130 (0.493130) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.579002 data: 1.155620 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.623291 (0.662415) acc1: 85.937500 (85.227273) acc5: 96.875000 (96.732955) time: 0.473071 data: 0.105244 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.706130 (0.688086) acc1: 82.812500 (84.077381) acc5: 96.875000 (96.577381) time: 0.356944 data: 0.000177 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.680130 (0.686297) acc1: 82.812500 (83.770161) acc5: 96.875000 (96.723790) time: 0.353312 data: 0.000155 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.672968 (0.698337) acc1: 82.812500 (83.612805) acc5: 96.875000 (96.684451) time: 0.351300 data: 0.000153 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.683306 (0.694590) acc1: 82.812500 (83.808000) acc5: 96.875000 (96.736000) time: 0.355509 data: 0.000116 max mem: 18817 Test: Total time: 0:00:18 (0.385328 s / it) * Acc@1 83.818 Acc@5 96.780 loss 0.711 Max accuracy: 83.90% Epoch: [267/300] [ 0/1251] eta: 0:39:18 lr: 0.000078 loss: 2.827804 (2.827804) time: 1.885582 data: 1.026064 max mem: 18817 Epoch: [267/300] [ 50/1251] eta: 0:19:11 lr: 0.000078 loss: 2.672771 (2.481040) time: 0.921167 data: 0.000183 max mem: 18817 Epoch: [267/300] [ 100/1251] eta: 0:18:32 lr: 0.000078 loss: 2.747766 (2.562834) time: 0.921814 data: 0.000188 max mem: 18817 Epoch: [267/300] [ 150/1251] eta: 0:17:37 lr: 0.000077 loss: 2.647432 (2.566429) time: 0.957739 data: 0.000180 max mem: 18817 Epoch: [267/300] [ 200/1251] eta: 0:16:45 lr: 0.000077 loss: 2.565561 (2.584402) time: 0.989398 data: 0.000186 max mem: 18817 Epoch: [267/300] [ 250/1251] eta: 0:15:59 lr: 0.000077 loss: 2.642821 (2.579976) time: 1.040098 data: 0.000173 max mem: 18817 Epoch: [267/300] [ 300/1251] eta: 0:15:08 lr: 0.000077 loss: 2.599191 (2.591201) time: 0.964068 data: 0.000179 max mem: 18817 Epoch: [267/300] [ 350/1251] eta: 0:14:18 lr: 0.000077 loss: 2.668351 (2.597162) time: 0.932001 data: 0.000185 max mem: 18817 Epoch: [267/300] [ 400/1251] eta: 0:13:30 lr: 0.000077 loss: 2.763656 (2.599157) time: 0.915484 data: 0.000183 max mem: 18817 Epoch: [267/300] [ 450/1251] eta: 0:12:43 lr: 0.000077 loss: 2.693160 (2.608950) time: 0.964244 data: 0.000186 max mem: 18817 Epoch: [267/300] [ 500/1251] eta: 0:11:56 lr: 0.000076 loss: 2.424640 (2.604875) time: 1.032292 data: 0.000184 max mem: 18817 Epoch: [267/300] [ 550/1251] eta: 0:11:07 lr: 0.000076 loss: 2.747456 (2.614353) time: 0.973747 data: 0.000164 max mem: 18817 Epoch: [267/300] [ 600/1251] eta: 0:10:19 lr: 0.000076 loss: 2.662040 (2.620427) time: 0.916931 data: 0.000176 max mem: 18817 Epoch: [267/300] [ 650/1251] eta: 0:09:32 lr: 0.000076 loss: 2.857010 (2.626323) time: 0.910991 data: 0.000164 max mem: 18817 Epoch: [267/300] [ 700/1251] eta: 0:08:45 lr: 0.000076 loss: 2.637472 (2.622644) time: 0.980389 data: 0.000155 max mem: 18817 Epoch: [267/300] [ 750/1251] eta: 0:07:57 lr: 0.000076 loss: 2.639854 (2.623821) time: 1.029838 data: 0.000164 max mem: 18817 Epoch: [267/300] [ 800/1251] eta: 0:07:09 lr: 0.000076 loss: 2.639816 (2.621048) time: 0.961611 data: 0.000175 max mem: 18817 Epoch: [267/300] [ 850/1251] eta: 0:06:21 lr: 0.000075 loss: 2.693661 (2.622627) time: 0.918060 data: 0.000178 max mem: 18817 Epoch: [267/300] [ 900/1251] eta: 0:05:34 lr: 0.000075 loss: 2.734456 (2.622380) time: 0.915127 data: 0.000177 max mem: 18817 Epoch: [267/300] [ 950/1251] eta: 0:04:46 lr: 0.000075 loss: 2.618092 (2.620386) time: 0.970663 data: 0.000177 max mem: 18817 Epoch: [267/300] [1000/1251] eta: 0:03:59 lr: 0.000075 loss: 2.409608 (2.617169) time: 0.978435 data: 0.000163 max mem: 18817 Epoch: [267/300] [1050/1251] eta: 0:03:11 lr: 0.000075 loss: 2.672496 (2.619273) time: 0.953113 data: 0.000174 max mem: 18817 Epoch: [267/300] [1100/1251] eta: 0:02:23 lr: 0.000075 loss: 2.499040 (2.619082) time: 0.914560 data: 0.000179 max mem: 18817 Epoch: [267/300] [1150/1251] eta: 0:01:36 lr: 0.000075 loss: 2.548662 (2.614618) time: 0.907965 data: 0.000174 max mem: 18817 Epoch: [267/300] [1200/1251] eta: 0:00:48 lr: 0.000074 loss: 2.737653 (2.613843) time: 0.956434 data: 0.000185 max mem: 18817 Epoch: [267/300] [1250/1251] eta: 0:00:00 lr: 0.000074 loss: 2.657803 (2.613191) time: 0.970764 data: 0.000757 max mem: 18817 Epoch: [267/300] Total time: 0:19:51 (0.952758 s / it) Averaged stats: lr: 0.000074 loss: 2.657803 (2.602759) Test: [ 0/49] eta: 0:01:19 loss: 0.536361 (0.536361) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.627429 data: 1.171894 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.605328 (0.668788) acc1: 82.812500 (84.232955) acc5: 96.875000 (96.306818) time: 0.475229 data: 0.106679 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.708965 (0.698091) acc1: 82.812500 (83.333333) acc5: 96.875000 (96.354167) time: 0.356690 data: 0.000150 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.688221 (0.690842) acc1: 82.812500 (83.518145) acc5: 96.875000 (96.673387) time: 0.352689 data: 0.000148 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.662153 (0.701737) acc1: 82.812500 (83.346037) acc5: 96.875000 (96.684451) time: 0.354702 data: 0.000149 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.663889 (0.697456) acc1: 82.812500 (83.552000) acc5: 96.875000 (96.736000) time: 0.364329 data: 0.000118 max mem: 18817 Test: Total time: 0:00:19 (0.388212 s / it) * Acc@1 83.954 Acc@5 96.790 loss 0.708 Max accuracy: 83.95% Epoch: [268/300] [ 0/1251] eta: 0:40:31 lr: 0.000074 loss: 2.532616 (2.532616) time: 1.943529 data: 1.058229 max mem: 18817 Epoch: [268/300] [ 50/1251] eta: 0:19:05 lr: 0.000074 loss: 2.616739 (2.606162) time: 0.912232 data: 0.000172 max mem: 18817 Epoch: [268/300] [ 100/1251] eta: 0:18:19 lr: 0.000074 loss: 2.669859 (2.581948) time: 0.909912 data: 0.000161 max mem: 18817 Epoch: [268/300] [ 150/1251] eta: 0:17:34 lr: 0.000074 loss: 2.590524 (2.559043) time: 0.979284 data: 0.000180 max mem: 18817 Epoch: [268/300] [ 200/1251] eta: 0:16:47 lr: 0.000074 loss: 2.789699 (2.585119) time: 1.026400 data: 0.000180 max mem: 18817 Epoch: [268/300] [ 250/1251] eta: 0:15:56 lr: 0.000074 loss: 2.595917 (2.585771) time: 0.977091 data: 0.000173 max mem: 18817 Epoch: [268/300] [ 300/1251] eta: 0:15:04 lr: 0.000074 loss: 2.781541 (2.594403) time: 0.906108 data: 0.000166 max mem: 18817 Epoch: [268/300] [ 350/1251] eta: 0:14:16 lr: 0.000073 loss: 2.597786 (2.603994) time: 0.907680 data: 0.000161 max mem: 18817 Epoch: [268/300] [ 400/1251] eta: 0:13:29 lr: 0.000073 loss: 2.640113 (2.597275) time: 0.951081 data: 0.000214 max mem: 18817 Epoch: [268/300] [ 450/1251] eta: 0:12:41 lr: 0.000073 loss: 2.628806 (2.585530) time: 0.977903 data: 0.000176 max mem: 18817 Epoch: [268/300] [ 500/1251] eta: 0:11:52 lr: 0.000073 loss: 2.795961 (2.592185) time: 0.905818 data: 0.000181 max mem: 18817 Epoch: [268/300] [ 550/1251] eta: 0:11:06 lr: 0.000073 loss: 2.685312 (2.603300) time: 0.919669 data: 0.000164 max mem: 18817 Epoch: [268/300] [ 600/1251] eta: 0:10:18 lr: 0.000073 loss: 2.539740 (2.601542) time: 0.912467 data: 0.000185 max mem: 18817 Epoch: [268/300] [ 650/1251] eta: 0:09:32 lr: 0.000073 loss: 2.657728 (2.596688) time: 0.995843 data: 0.000160 max mem: 18817 Epoch: [268/300] [ 700/1251] eta: 0:08:44 lr: 0.000072 loss: 2.550329 (2.600418) time: 0.982342 data: 0.000171 max mem: 18817 Epoch: [268/300] [ 750/1251] eta: 0:07:56 lr: 0.000072 loss: 2.416807 (2.593675) time: 0.911845 data: 0.000181 max mem: 18817 Epoch: [268/300] [ 800/1251] eta: 0:07:09 lr: 0.000072 loss: 2.705442 (2.597064) time: 0.918292 data: 0.000164 max mem: 18817 Epoch: [268/300] [ 850/1251] eta: 0:06:21 lr: 0.000072 loss: 2.740879 (2.597375) time: 0.962672 data: 0.000168 max mem: 18817 Epoch: [268/300] [ 900/1251] eta: 0:05:34 lr: 0.000072 loss: 2.487225 (2.596951) time: 0.996844 data: 0.000237 max mem: 18817 Epoch: [268/300] [ 950/1251] eta: 0:04:46 lr: 0.000072 loss: 2.749210 (2.602616) time: 0.980624 data: 0.000186 max mem: 18817 Epoch: [268/300] [1000/1251] eta: 0:03:58 lr: 0.000072 loss: 2.755354 (2.601387) time: 0.928288 data: 0.000161 max mem: 18817 Epoch: [268/300] [1050/1251] eta: 0:03:11 lr: 0.000072 loss: 2.846488 (2.601267) time: 0.932162 data: 0.000182 max mem: 18817 Epoch: [268/300] [1100/1251] eta: 0:02:23 lr: 0.000071 loss: 2.672635 (2.602902) time: 0.956980 data: 0.000179 max mem: 18817 Epoch: [268/300] [1150/1251] eta: 0:01:36 lr: 0.000071 loss: 2.655341 (2.602405) time: 1.039883 data: 0.000173 max mem: 18817 Epoch: [268/300] [1200/1251] eta: 0:00:48 lr: 0.000071 loss: 2.472457 (2.597328) time: 0.967386 data: 0.000175 max mem: 18817 Epoch: [268/300] [1250/1251] eta: 0:00:00 lr: 0.000071 loss: 2.710314 (2.597033) time: 0.910973 data: 0.000767 max mem: 18817 Epoch: [268/300] Total time: 0:19:51 (0.952628 s / it) Averaged stats: lr: 0.000071 loss: 2.710314 (2.604235) Test: [ 0/49] eta: 0:01:15 loss: 0.554021 (0.554021) acc1: 82.812500 (82.812500) acc5: 98.437500 (98.437500) time: 1.546407 data: 1.101943 max mem: 18817 Test: [10/49] eta: 0:00:21 loss: 0.595015 (0.656541) acc1: 85.937500 (85.369318) acc5: 96.875000 (96.448864) time: 0.552953 data: 0.100328 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.733865 (0.686656) acc1: 84.375000 (84.449405) acc5: 96.875000 (96.651786) time: 0.456291 data: 0.000139 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.689389 (0.683681) acc1: 82.812500 (84.223790) acc5: 96.875000 (96.824597) time: 0.405835 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.659445 (0.696573) acc1: 82.812500 (83.879573) acc5: 96.875000 (96.798780) time: 0.350231 data: 0.000155 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.673836 (0.694791) acc1: 82.812500 (83.936000) acc5: 96.875000 (96.896000) time: 0.344991 data: 0.000133 max mem: 18817 Test: Total time: 0:00:20 (0.419823 s / it) * Acc@1 83.808 Acc@5 96.754 loss 0.709 Max accuracy: 83.95% Epoch: [269/300] [ 0/1251] eta: 0:41:05 lr: 0.000071 loss: 2.477818 (2.477818) time: 1.970863 data: 1.109752 max mem: 18817 Epoch: [269/300] [ 50/1251] eta: 0:19:35 lr: 0.000071 loss: 2.589501 (2.578075) time: 0.969693 data: 0.000164 max mem: 18817 Epoch: [269/300] [ 100/1251] eta: 0:18:43 lr: 0.000071 loss: 2.718409 (2.568383) time: 1.055661 data: 0.000164 max mem: 18817 Epoch: [269/300] [ 150/1251] eta: 0:17:41 lr: 0.000071 loss: 2.619592 (2.589404) time: 0.964134 data: 0.000175 max mem: 18817 Epoch: [269/300] [ 200/1251] eta: 0:16:43 lr: 0.000070 loss: 2.589669 (2.596576) time: 0.912297 data: 0.000169 max mem: 18817 Epoch: [269/300] [ 250/1251] eta: 0:15:58 lr: 0.000070 loss: 2.740228 (2.600774) time: 0.934353 data: 0.000175 max mem: 18817 Epoch: [269/300] [ 300/1251] eta: 0:15:11 lr: 0.000070 loss: 2.477088 (2.594952) time: 0.973017 data: 0.000170 max mem: 18817 Epoch: [269/300] [ 350/1251] eta: 0:14:26 lr: 0.000070 loss: 2.655834 (2.592153) time: 1.054801 data: 0.000180 max mem: 18817 Epoch: [269/300] [ 400/1251] eta: 0:13:35 lr: 0.000070 loss: 2.745016 (2.598337) time: 0.959123 data: 0.000172 max mem: 18817 Epoch: [269/300] [ 450/1251] eta: 0:12:45 lr: 0.000070 loss: 2.703946 (2.596974) time: 0.915064 data: 0.000189 max mem: 18817 Epoch: [269/300] [ 500/1251] eta: 0:11:58 lr: 0.000070 loss: 2.571672 (2.594646) time: 0.919685 data: 0.000180 max mem: 18817 Epoch: [269/300] [ 550/1251] eta: 0:11:09 lr: 0.000070 loss: 2.825162 (2.605136) time: 0.961183 data: 0.000180 max mem: 18817 Epoch: [269/300] [ 600/1251] eta: 0:10:22 lr: 0.000069 loss: 2.635645 (2.607933) time: 1.020508 data: 0.000187 max mem: 18817 Epoch: [269/300] [ 650/1251] eta: 0:09:33 lr: 0.000069 loss: 2.682377 (2.612580) time: 0.955163 data: 0.000166 max mem: 18817 Epoch: [269/300] [ 700/1251] eta: 0:08:45 lr: 0.000069 loss: 2.652737 (2.613152) time: 0.923212 data: 0.000160 max mem: 18817 Epoch: [269/300] [ 750/1251] eta: 0:07:57 lr: 0.000069 loss: 2.626853 (2.610828) time: 0.913286 data: 0.000163 max mem: 18817 Epoch: [269/300] [ 800/1251] eta: 0:07:09 lr: 0.000069 loss: 2.711692 (2.607712) time: 0.977789 data: 0.000169 max mem: 18817 Epoch: [269/300] [ 850/1251] eta: 0:06:22 lr: 0.000069 loss: 2.536332 (2.602302) time: 0.996032 data: 0.000179 max mem: 18817 Epoch: [269/300] [ 900/1251] eta: 0:05:34 lr: 0.000069 loss: 2.652126 (2.600604) time: 0.971939 data: 0.000175 max mem: 18817 Epoch: [269/300] [ 950/1251] eta: 0:04:46 lr: 0.000069 loss: 2.514657 (2.595845) time: 0.962454 data: 0.000177 max mem: 18817 Epoch: [269/300] [1000/1251] eta: 0:03:59 lr: 0.000068 loss: 2.563338 (2.593654) time: 0.934902 data: 0.000169 max mem: 18817 Epoch: [269/300] [1050/1251] eta: 0:03:11 lr: 0.000068 loss: 2.674916 (2.598623) time: 0.910487 data: 0.000169 max mem: 18817 Epoch: [269/300] [1100/1251] eta: 0:02:24 lr: 0.000068 loss: 2.475439 (2.599155) time: 0.977420 data: 0.000175 max mem: 18817 Epoch: [269/300] [1150/1251] eta: 0:01:36 lr: 0.000068 loss: 2.618396 (2.596899) time: 0.991650 data: 0.000188 max mem: 18817 Epoch: [269/300] [1200/1251] eta: 0:00:48 lr: 0.000068 loss: 2.652752 (2.598866) time: 0.970388 data: 0.000173 max mem: 18817 Epoch: [269/300] [1250/1251] eta: 0:00:00 lr: 0.000068 loss: 2.660906 (2.598651) time: 0.921151 data: 0.000759 max mem: 18817 Epoch: [269/300] Total time: 0:19:53 (0.954245 s / it) Averaged stats: lr: 0.000068 loss: 2.660906 (2.598730) Test: [ 0/49] eta: 0:01:15 loss: 0.513601 (0.513601) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.531747 data: 1.113108 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.616451 (0.664689) acc1: 84.375000 (84.801136) acc5: 95.312500 (96.448864) time: 0.487413 data: 0.101356 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.695894 (0.686650) acc1: 82.812500 (84.375000) acc5: 95.312500 (96.502976) time: 0.367160 data: 0.000165 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.672374 (0.683019) acc1: 84.375000 (84.274194) acc5: 96.875000 (96.723790) time: 0.352295 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.711314 (0.696859) acc1: 84.375000 (84.184451) acc5: 96.875000 (96.646341) time: 0.350418 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.711314 (0.694797) acc1: 84.375000 (84.320000) acc5: 96.875000 (96.704000) time: 0.344365 data: 0.000102 max mem: 18817 Test: Total time: 0:00:18 (0.381871 s / it) * Acc@1 83.864 Acc@5 96.684 loss 0.712 Max accuracy: 83.95% Epoch: [270/300] [ 0/1251] eta: 0:44:32 lr: 0.000068 loss: 2.999193 (2.999193) time: 2.135955 data: 1.255621 max mem: 18817 Epoch: [270/300] [ 50/1251] eta: 0:20:00 lr: 0.000068 loss: 2.552919 (2.556977) time: 0.994335 data: 0.000173 max mem: 18817 Epoch: [270/300] [ 100/1251] eta: 0:18:36 lr: 0.000068 loss: 2.775669 (2.557854) time: 0.963080 data: 0.000186 max mem: 18817 Epoch: [270/300] [ 150/1251] eta: 0:17:50 lr: 0.000067 loss: 2.677458 (2.569913) time: 0.991574 data: 0.000433 max mem: 18817 Epoch: [270/300] [ 200/1251] eta: 0:16:51 lr: 0.000067 loss: 2.663894 (2.573393) time: 0.904712 data: 0.000161 max mem: 18817 Epoch: [270/300] [ 250/1251] eta: 0:16:00 lr: 0.000067 loss: 2.660760 (2.583773) time: 0.909092 data: 0.000187 max mem: 18817 Epoch: [270/300] [ 300/1251] eta: 0:15:12 lr: 0.000067 loss: 2.563750 (2.583584) time: 0.980593 data: 0.000175 max mem: 18817 Epoch: [270/300] [ 350/1251] eta: 0:14:20 lr: 0.000067 loss: 2.513976 (2.577546) time: 0.947118 data: 0.000163 max mem: 18817 Epoch: [270/300] [ 400/1251] eta: 0:13:32 lr: 0.000067 loss: 2.741320 (2.584784) time: 0.950987 data: 0.000168 max mem: 18817 Epoch: [270/300] [ 450/1251] eta: 0:12:43 lr: 0.000067 loss: 2.596107 (2.582172) time: 0.913197 data: 0.000181 max mem: 18817 Epoch: [270/300] [ 500/1251] eta: 0:11:55 lr: 0.000067 loss: 2.759577 (2.582139) time: 0.954666 data: 0.000160 max mem: 18817 Epoch: [270/300] [ 550/1251] eta: 0:11:08 lr: 0.000066 loss: 2.585943 (2.590316) time: 0.980974 data: 0.000171 max mem: 18817 Epoch: [270/300] [ 600/1251] eta: 0:10:20 lr: 0.000066 loss: 2.603714 (2.594671) time: 0.973778 data: 0.000190 max mem: 18817 Epoch: [270/300] [ 650/1251] eta: 0:09:32 lr: 0.000066 loss: 2.516376 (2.590965) time: 0.925671 data: 0.000162 max mem: 18817 Epoch: [270/300] [ 700/1251] eta: 0:08:44 lr: 0.000066 loss: 2.635068 (2.587786) time: 0.912952 data: 0.000168 max mem: 18817 Epoch: [270/300] [ 750/1251] eta: 0:07:57 lr: 0.000066 loss: 2.708907 (2.588428) time: 0.975415 data: 0.000181 max mem: 18817 Epoch: [270/300] [ 800/1251] eta: 0:07:09 lr: 0.000066 loss: 2.808623 (2.595087) time: 0.967391 data: 0.000172 max mem: 18817 Epoch: [270/300] [ 850/1251] eta: 0:06:21 lr: 0.000066 loss: 2.750226 (2.597188) time: 0.959809 data: 0.000169 max mem: 18817 Epoch: [270/300] [ 900/1251] eta: 0:05:34 lr: 0.000066 loss: 2.587754 (2.602361) time: 0.917735 data: 0.000174 max mem: 18817 Epoch: [270/300] [ 950/1251] eta: 0:04:46 lr: 0.000065 loss: 2.740275 (2.603370) time: 0.909050 data: 0.000165 max mem: 18817 Epoch: [270/300] [1000/1251] eta: 0:03:59 lr: 0.000065 loss: 2.621920 (2.600664) time: 0.968257 data: 0.000179 max mem: 18817 Epoch: [270/300] [1050/1251] eta: 0:03:11 lr: 0.000065 loss: 2.718895 (2.602802) time: 1.016325 data: 0.000169 max mem: 18817 Epoch: [270/300] [1100/1251] eta: 0:02:23 lr: 0.000065 loss: 2.663501 (2.604697) time: 0.984264 data: 0.000187 max mem: 18817 Epoch: [270/300] [1150/1251] eta: 0:01:36 lr: 0.000065 loss: 2.649096 (2.601767) time: 0.911774 data: 0.000175 max mem: 18817 Epoch: [270/300] [1200/1251] eta: 0:00:48 lr: 0.000065 loss: 2.542857 (2.600836) time: 0.932798 data: 0.000180 max mem: 18817 Epoch: [270/300] [1250/1251] eta: 0:00:00 lr: 0.000065 loss: 2.576985 (2.598168) time: 0.973565 data: 0.000812 max mem: 18817 Epoch: [270/300] Total time: 0:19:51 (0.952813 s / it) Averaged stats: lr: 0.000065 loss: 2.576985 (2.597343) Test: [ 0/49] eta: 0:01:25 loss: 0.522182 (0.522182) acc1: 87.500000 (87.500000) acc5: 98.437500 (98.437500) time: 1.743796 data: 1.354682 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.609969 (0.657699) acc1: 84.375000 (85.085227) acc5: 98.437500 (96.732955) time: 0.487232 data: 0.123277 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.667756 (0.682752) acc1: 82.812500 (84.523810) acc5: 96.875000 (96.651786) time: 0.357475 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.665522 (0.683735) acc1: 82.812500 (84.173387) acc5: 96.875000 (96.925403) time: 0.360391 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.697742 (0.696464) acc1: 82.812500 (84.070122) acc5: 96.875000 (96.913110) time: 0.363893 data: 0.000118 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.697742 (0.694922) acc1: 84.375000 (84.320000) acc5: 96.875000 (96.864000) time: 0.357399 data: 0.000099 max mem: 18817 Test: Total time: 0:00:19 (0.389452 s / it) * Acc@1 83.858 Acc@5 96.774 loss 0.714 Max accuracy: 83.95% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0270.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0270.pth Epoch: [271/300] [ 0/1251] eta: 0:39:50 lr: 0.000065 loss: 2.819255 (2.819255) time: 1.910608 data: 1.038167 max mem: 18817 Epoch: [271/300] [ 50/1251] eta: 0:19:43 lr: 0.000065 loss: 2.631400 (2.627956) time: 0.965593 data: 0.000164 max mem: 18817 Epoch: [271/300] [ 100/1251] eta: 0:18:24 lr: 0.000064 loss: 2.679306 (2.585134) time: 0.956579 data: 0.000164 max mem: 18817 Epoch: [271/300] [ 150/1251] eta: 0:17:32 lr: 0.000064 loss: 2.574510 (2.559252) time: 0.926183 data: 0.000169 max mem: 18817 Epoch: [271/300] [ 200/1251] eta: 0:16:45 lr: 0.000064 loss: 2.735154 (2.545492) time: 0.917672 data: 0.000171 max mem: 18817 Epoch: [271/300] [ 250/1251] eta: 0:15:57 lr: 0.000064 loss: 2.682781 (2.559499) time: 0.975609 data: 0.000180 max mem: 18817 Epoch: [271/300] [ 300/1251] eta: 0:15:11 lr: 0.000064 loss: 2.735286 (2.559951) time: 0.994099 data: 0.000160 max mem: 18817 Epoch: [271/300] [ 350/1251] eta: 0:14:21 lr: 0.000064 loss: 2.551291 (2.561937) time: 0.986807 data: 0.000176 max mem: 18817 Epoch: [271/300] [ 400/1251] eta: 0:13:32 lr: 0.000064 loss: 2.622828 (2.567617) time: 0.922176 data: 0.000163 max mem: 18817 Epoch: [271/300] [ 450/1251] eta: 0:12:44 lr: 0.000064 loss: 2.237024 (2.559009) time: 0.922936 data: 0.000188 max mem: 18817 Epoch: [271/300] [ 500/1251] eta: 0:11:57 lr: 0.000063 loss: 2.765815 (2.566500) time: 0.978350 data: 0.000177 max mem: 18817 Epoch: [271/300] [ 550/1251] eta: 0:11:10 lr: 0.000063 loss: 2.727774 (2.572394) time: 1.005878 data: 0.000180 max mem: 18817 Epoch: [271/300] [ 600/1251] eta: 0:10:21 lr: 0.000063 loss: 2.704795 (2.576480) time: 0.955766 data: 0.000173 max mem: 18817 Epoch: [271/300] [ 650/1251] eta: 0:09:32 lr: 0.000063 loss: 2.396200 (2.575591) time: 0.919923 data: 0.000169 max mem: 18817 Epoch: [271/300] [ 700/1251] eta: 0:08:45 lr: 0.000063 loss: 2.587528 (2.580012) time: 0.911873 data: 0.000155 max mem: 18817 Epoch: [271/300] [ 750/1251] eta: 0:07:56 lr: 0.000063 loss: 2.569848 (2.578534) time: 0.906441 data: 0.000183 max mem: 18817 Epoch: [271/300] [ 800/1251] eta: 0:07:09 lr: 0.000063 loss: 2.439490 (2.580238) time: 0.984317 data: 0.000181 max mem: 18817 Epoch: [271/300] [ 850/1251] eta: 0:06:22 lr: 0.000063 loss: 2.556557 (2.580895) time: 1.027982 data: 0.000184 max mem: 18817 Epoch: [271/300] [ 900/1251] eta: 0:05:34 lr: 0.000062 loss: 2.547083 (2.574809) time: 0.980111 data: 0.000174 max mem: 18817 Epoch: [271/300] [ 950/1251] eta: 0:04:46 lr: 0.000062 loss: 2.692019 (2.576575) time: 0.930715 data: 0.000167 max mem: 18817 Epoch: [271/300] [1000/1251] eta: 0:03:59 lr: 0.000062 loss: 2.482489 (2.574710) time: 0.930806 data: 0.000167 max mem: 18817 Epoch: [271/300] [1050/1251] eta: 0:03:11 lr: 0.000062 loss: 2.740077 (2.576592) time: 0.968689 data: 0.000166 max mem: 18817 Epoch: [271/300] [1100/1251] eta: 0:02:24 lr: 0.000062 loss: 2.640422 (2.577968) time: 1.022403 data: 0.000174 max mem: 18817 Epoch: [271/300] [1150/1251] eta: 0:01:36 lr: 0.000062 loss: 2.596119 (2.577454) time: 0.960405 data: 0.000160 max mem: 18817 Epoch: [271/300] [1200/1251] eta: 0:00:48 lr: 0.000062 loss: 2.515081 (2.577288) time: 0.918230 data: 0.000164 max mem: 18817 Epoch: [271/300] [1250/1251] eta: 0:00:00 lr: 0.000062 loss: 2.584030 (2.573210) time: 0.922845 data: 0.000771 max mem: 18817 Epoch: [271/300] Total time: 0:19:52 (0.953477 s / it) Averaged stats: lr: 0.000062 loss: 2.584030 (2.574419) Test: [ 0/49] eta: 0:01:14 loss: 0.511499 (0.511499) acc1: 84.375000 (84.375000) acc5: 98.437500 (98.437500) time: 1.517317 data: 1.116902 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.574031 (0.637176) acc1: 84.375000 (85.227273) acc5: 96.875000 (96.306818) time: 0.466856 data: 0.101697 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.700187 (0.672144) acc1: 82.812500 (84.151786) acc5: 96.875000 (96.726190) time: 0.359641 data: 0.000152 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.666772 (0.671448) acc1: 82.812500 (84.022177) acc5: 96.875000 (96.925403) time: 0.354618 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.686758 (0.685418) acc1: 82.812500 (83.879573) acc5: 98.437500 (97.065549) time: 0.349042 data: 0.000136 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.686758 (0.682927) acc1: 82.812500 (84.032000) acc5: 98.437500 (97.024000) time: 0.343588 data: 0.000110 max mem: 18817 Test: Total time: 0:00:18 (0.378426 s / it) * Acc@1 83.818 Acc@5 96.792 loss 0.705 Max accuracy: 83.95% Epoch: [272/300] [ 0/1251] eta: 0:41:32 lr: 0.000062 loss: 2.918221 (2.918221) time: 1.992619 data: 1.129409 max mem: 18817 Epoch: [272/300] [ 50/1251] eta: 0:19:01 lr: 0.000061 loss: 2.691783 (2.595314) time: 0.953178 data: 0.000173 max mem: 18817 Epoch: [272/300] [ 100/1251] eta: 0:18:03 lr: 0.000061 loss: 2.747631 (2.580782) time: 0.922440 data: 0.000176 max mem: 18817 Epoch: [272/300] [ 150/1251] eta: 0:17:23 lr: 0.000061 loss: 2.674468 (2.580546) time: 0.928444 data: 0.000176 max mem: 18817 Epoch: [272/300] [ 200/1251] eta: 0:16:38 lr: 0.000061 loss: 2.558846 (2.554096) time: 0.960261 data: 0.000174 max mem: 18817 Epoch: [272/300] [ 250/1251] eta: 0:15:52 lr: 0.000061 loss: 2.566839 (2.567063) time: 1.022282 data: 0.000166 max mem: 18817 Epoch: [272/300] [ 300/1251] eta: 0:15:01 lr: 0.000061 loss: 2.554946 (2.560470) time: 0.948405 data: 0.000186 max mem: 18817 Epoch: [272/300] [ 350/1251] eta: 0:14:12 lr: 0.000061 loss: 2.741692 (2.566868) time: 0.916265 data: 0.000170 max mem: 18817 Epoch: [272/300] [ 400/1251] eta: 0:13:26 lr: 0.000061 loss: 2.683878 (2.572487) time: 0.931745 data: 0.000161 max mem: 18817 Epoch: [272/300] [ 450/1251] eta: 0:12:40 lr: 0.000061 loss: 2.741186 (2.585505) time: 0.970685 data: 0.000148 max mem: 18817 Epoch: [272/300] [ 500/1251] eta: 0:11:51 lr: 0.000060 loss: 2.688547 (2.585083) time: 0.965181 data: 0.000187 max mem: 18817 Epoch: [272/300] [ 550/1251] eta: 0:11:02 lr: 0.000060 loss: 2.876826 (2.586463) time: 0.904434 data: 0.000172 max mem: 18817 Epoch: [272/300] [ 600/1251] eta: 0:10:15 lr: 0.000060 loss: 2.398219 (2.579664) time: 0.918618 data: 0.000174 max mem: 18817 Epoch: [272/300] [ 650/1251] eta: 0:09:29 lr: 0.000060 loss: 2.680256 (2.592132) time: 0.923185 data: 0.000176 max mem: 18817 Epoch: [272/300] [ 700/1251] eta: 0:08:42 lr: 0.000060 loss: 2.792139 (2.592767) time: 0.955669 data: 0.000179 max mem: 18817 Epoch: [272/300] [ 750/1251] eta: 0:07:54 lr: 0.000060 loss: 2.549902 (2.587514) time: 0.977599 data: 0.000180 max mem: 18817 Epoch: [272/300] [ 800/1251] eta: 0:07:06 lr: 0.000060 loss: 2.671710 (2.581628) time: 0.959672 data: 0.000164 max mem: 18817 Epoch: [272/300] [ 850/1251] eta: 0:06:19 lr: 0.000060 loss: 2.529953 (2.578702) time: 0.959397 data: 0.000176 max mem: 18817 Epoch: [272/300] [ 900/1251] eta: 0:05:32 lr: 0.000059 loss: 2.411376 (2.576204) time: 0.909398 data: 0.000175 max mem: 18817 Epoch: [272/300] [ 950/1251] eta: 0:04:45 lr: 0.000059 loss: 2.605173 (2.576667) time: 0.968527 data: 0.000170 max mem: 18817 Epoch: [272/300] [1000/1251] eta: 0:03:57 lr: 0.000059 loss: 2.913020 (2.579612) time: 0.977794 data: 0.000158 max mem: 18817 Epoch: [272/300] [1050/1251] eta: 0:03:10 lr: 0.000059 loss: 2.508137 (2.576237) time: 0.957464 data: 0.000194 max mem: 18817 Epoch: [272/300] [1100/1251] eta: 0:02:22 lr: 0.000059 loss: 2.556314 (2.575732) time: 0.920497 data: 0.000169 max mem: 18817 Epoch: [272/300] [1150/1251] eta: 0:01:35 lr: 0.000059 loss: 2.714624 (2.578181) time: 0.920982 data: 0.000224 max mem: 18817 Epoch: [272/300] [1200/1251] eta: 0:00:48 lr: 0.000059 loss: 2.626983 (2.579189) time: 0.953789 data: 0.000206 max mem: 18817 Epoch: [272/300] [1250/1251] eta: 0:00:00 lr: 0.000059 loss: 2.560505 (2.578007) time: 0.900539 data: 0.000775 max mem: 18817 Epoch: [272/300] Total time: 0:19:45 (0.947979 s / it) Averaged stats: lr: 0.000059 loss: 2.560505 (2.579767) Test: [ 0/49] eta: 0:01:16 loss: 0.492659 (0.492659) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.564162 data: 1.148778 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.548364 (0.647869) acc1: 85.937500 (85.085227) acc5: 96.875000 (96.590909) time: 0.468476 data: 0.104573 max mem: 18817 Test: [20/49] eta: 0:00:11 loss: 0.687045 (0.677753) acc1: 82.812500 (84.151786) acc5: 96.875000 (96.800595) time: 0.355486 data: 0.000140 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.676562 (0.676933) acc1: 82.812500 (84.173387) acc5: 98.437500 (97.026210) time: 0.351849 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.685963 (0.691071) acc1: 84.375000 (84.108232) acc5: 96.875000 (96.951220) time: 0.348987 data: 0.000136 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.685963 (0.687079) acc1: 84.375000 (84.384000) acc5: 96.875000 (97.024000) time: 0.343678 data: 0.000118 max mem: 18817 Test: Total time: 0:00:18 (0.376883 s / it) * Acc@1 84.000 Acc@5 96.774 loss 0.708 Max accuracy: 84.00% Epoch: [273/300] [ 0/1251] eta: 0:42:58 lr: 0.000059 loss: 2.776528 (2.776528) time: 2.061117 data: 1.178115 max mem: 18817 Epoch: [273/300] [ 50/1251] eta: 0:19:12 lr: 0.000059 loss: 2.444547 (2.566870) time: 0.949649 data: 0.000170 max mem: 18817 Epoch: [273/300] [ 100/1251] eta: 0:18:08 lr: 0.000058 loss: 2.600487 (2.578344) time: 0.912871 data: 0.000176 max mem: 18817 Epoch: [273/300] [ 150/1251] eta: 0:17:20 lr: 0.000058 loss: 2.682054 (2.553250) time: 0.906626 data: 0.000187 max mem: 18817 Epoch: [273/300] [ 200/1251] eta: 0:16:37 lr: 0.000058 loss: 2.581528 (2.578505) time: 0.963841 data: 0.000179 max mem: 18817 Epoch: [273/300] [ 250/1251] eta: 0:15:51 lr: 0.000058 loss: 2.783539 (2.585557) time: 1.005667 data: 0.000187 max mem: 18817 Epoch: [273/300] [ 300/1251] eta: 0:15:03 lr: 0.000058 loss: 2.406946 (2.579622) time: 0.958991 data: 0.000174 max mem: 18817 Epoch: [273/300] [ 350/1251] eta: 0:14:15 lr: 0.000058 loss: 2.603293 (2.579512) time: 0.921539 data: 0.000170 max mem: 18817 Epoch: [273/300] [ 400/1251] eta: 0:13:29 lr: 0.000058 loss: 2.589353 (2.567363) time: 0.905260 data: 0.000170 max mem: 18817 Epoch: [273/300] [ 450/1251] eta: 0:12:42 lr: 0.000058 loss: 2.778002 (2.563962) time: 0.956733 data: 0.000167 max mem: 18817 Epoch: [273/300] [ 500/1251] eta: 0:11:52 lr: 0.000058 loss: 2.712960 (2.566347) time: 0.948526 data: 0.000176 max mem: 18817 Epoch: [273/300] [ 550/1251] eta: 0:11:05 lr: 0.000057 loss: 2.725753 (2.570726) time: 0.923192 data: 0.000170 max mem: 18817 Epoch: [273/300] [ 600/1251] eta: 0:10:18 lr: 0.000057 loss: 2.582245 (2.570770) time: 0.913580 data: 0.000179 max mem: 18817 Epoch: [273/300] [ 650/1251] eta: 0:09:31 lr: 0.000057 loss: 2.312101 (2.563992) time: 0.920345 data: 0.000166 max mem: 18817 Epoch: [273/300] [ 700/1251] eta: 0:08:44 lr: 0.000057 loss: 2.636002 (2.564915) time: 0.970551 data: 0.000161 max mem: 18817 Epoch: [273/300] [ 750/1251] eta: 0:07:56 lr: 0.000057 loss: 2.729170 (2.566442) time: 0.988190 data: 0.000172 max mem: 18817 Epoch: [273/300] [ 800/1251] eta: 0:07:09 lr: 0.000057 loss: 2.777070 (2.570811) time: 0.925130 data: 0.000186 max mem: 18817 Epoch: [273/300] [ 850/1251] eta: 0:06:21 lr: 0.000057 loss: 2.681090 (2.574938) time: 0.934343 data: 0.000164 max mem: 18817 Epoch: [273/300] [ 900/1251] eta: 0:05:34 lr: 0.000057 loss: 2.739262 (2.580885) time: 0.934839 data: 0.000169 max mem: 18817 Epoch: [273/300] [ 950/1251] eta: 0:04:46 lr: 0.000057 loss: 2.723442 (2.588236) time: 0.971740 data: 0.000172 max mem: 18817 Epoch: [273/300] [1000/1251] eta: 0:03:58 lr: 0.000056 loss: 2.481954 (2.584980) time: 0.967320 data: 0.000170 max mem: 18817 Epoch: [273/300] [1050/1251] eta: 0:03:11 lr: 0.000056 loss: 2.761210 (2.584442) time: 0.912363 data: 0.000169 max mem: 18817 Epoch: [273/300] [1100/1251] eta: 0:02:23 lr: 0.000056 loss: 2.740906 (2.585887) time: 0.915903 data: 0.000182 max mem: 18817 Epoch: [273/300] [1150/1251] eta: 0:01:36 lr: 0.000056 loss: 2.472906 (2.584526) time: 0.968130 data: 0.000168 max mem: 18817 Epoch: [273/300] [1200/1251] eta: 0:00:48 lr: 0.000056 loss: 2.697532 (2.583261) time: 1.032558 data: 0.000179 max mem: 18817 Epoch: [273/300] [1250/1251] eta: 0:00:00 lr: 0.000056 loss: 2.398499 (2.581287) time: 0.971880 data: 0.000756 max mem: 18817 Epoch: [273/300] Total time: 0:19:50 (0.951984 s / it) Averaged stats: lr: 0.000056 loss: 2.398499 (2.579210) Test: [ 0/49] eta: 0:01:17 loss: 0.459505 (0.459505) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.585021 data: 1.152267 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.541321 (0.642929) acc1: 85.937500 (85.653409) acc5: 96.875000 (97.017045) time: 0.474611 data: 0.104903 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.705482 (0.673096) acc1: 82.812500 (84.226190) acc5: 96.875000 (97.023810) time: 0.358280 data: 0.000145 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.685247 (0.674664) acc1: 82.812500 (83.971774) acc5: 98.437500 (97.177419) time: 0.353774 data: 0.000135 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.658063 (0.687019) acc1: 84.375000 (83.917683) acc5: 98.437500 (97.141768) time: 0.351409 data: 0.000148 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.669600 (0.685597) acc1: 84.375000 (84.064000) acc5: 98.437500 (97.152000) time: 0.346049 data: 0.000124 max mem: 18817 Test: Total time: 0:00:18 (0.379862 s / it) * Acc@1 83.878 Acc@5 96.770 loss 0.708 Max accuracy: 84.00% Epoch: [274/300] [ 0/1251] eta: 0:41:30 lr: 0.000056 loss: 2.282331 (2.282331) time: 1.990843 data: 1.129620 max mem: 18817 Epoch: [274/300] [ 50/1251] eta: 0:19:06 lr: 0.000056 loss: 2.627042 (2.539298) time: 0.914813 data: 0.000191 max mem: 18817 Epoch: [274/300] [ 100/1251] eta: 0:18:28 lr: 0.000056 loss: 2.478950 (2.493271) time: 0.921108 data: 0.000189 max mem: 18817 Epoch: [274/300] [ 150/1251] eta: 0:17:44 lr: 0.000056 loss: 2.781843 (2.543732) time: 0.987278 data: 0.000185 max mem: 18817 Epoch: [274/300] [ 200/1251] eta: 0:16:51 lr: 0.000055 loss: 2.527981 (2.532747) time: 1.012044 data: 0.000178 max mem: 18817 Epoch: [274/300] [ 250/1251] eta: 0:15:59 lr: 0.000055 loss: 2.629505 (2.536078) time: 0.979339 data: 0.000193 max mem: 18817 Epoch: [274/300] [ 300/1251] eta: 0:15:08 lr: 0.000055 loss: 2.683339 (2.539977) time: 0.919054 data: 0.000172 max mem: 18817 Epoch: [274/300] [ 350/1251] eta: 0:14:20 lr: 0.000055 loss: 2.651266 (2.552115) time: 0.905466 data: 0.000159 max mem: 18817 Epoch: [274/300] [ 400/1251] eta: 0:13:34 lr: 0.000055 loss: 2.581968 (2.560787) time: 0.976119 data: 0.000176 max mem: 18817 Epoch: [274/300] [ 450/1251] eta: 0:12:47 lr: 0.000055 loss: 2.552092 (2.564639) time: 1.030895 data: 0.000175 max mem: 18817 Epoch: [274/300] [ 500/1251] eta: 0:11:58 lr: 0.000055 loss: 2.695843 (2.572358) time: 0.963162 data: 0.000172 max mem: 18817 Epoch: [274/300] [ 550/1251] eta: 0:11:09 lr: 0.000055 loss: 2.712161 (2.578638) time: 0.909974 data: 0.000179 max mem: 18817 Epoch: [274/300] [ 600/1251] eta: 0:10:22 lr: 0.000055 loss: 2.768644 (2.578337) time: 0.913510 data: 0.000168 max mem: 18817 Epoch: [274/300] [ 650/1251] eta: 0:09:34 lr: 0.000054 loss: 2.620128 (2.577870) time: 0.965454 data: 0.000179 max mem: 18817 Epoch: [274/300] [ 700/1251] eta: 0:08:46 lr: 0.000054 loss: 2.636386 (2.583056) time: 0.987374 data: 0.000170 max mem: 18817 Epoch: [274/300] [ 750/1251] eta: 0:07:59 lr: 0.000054 loss: 2.769892 (2.590760) time: 0.987089 data: 0.000165 max mem: 18817 Epoch: [274/300] [ 800/1251] eta: 0:07:10 lr: 0.000054 loss: 2.645650 (2.595096) time: 0.916800 data: 0.000186 max mem: 18817 Epoch: [274/300] [ 850/1251] eta: 0:06:23 lr: 0.000054 loss: 2.345066 (2.592351) time: 0.918221 data: 0.000181 max mem: 18817 Epoch: [274/300] [ 900/1251] eta: 0:05:35 lr: 0.000054 loss: 2.563818 (2.591259) time: 0.965740 data: 0.000192 max mem: 18817 Epoch: [274/300] [ 950/1251] eta: 0:04:47 lr: 0.000054 loss: 2.569171 (2.588469) time: 0.998415 data: 0.000184 max mem: 18817 Epoch: [274/300] [1000/1251] eta: 0:04:00 lr: 0.000054 loss: 2.709179 (2.586791) time: 0.985327 data: 0.000167 max mem: 18817 Epoch: [274/300] [1050/1251] eta: 0:03:12 lr: 0.000054 loss: 2.679732 (2.585697) time: 0.913636 data: 0.000184 max mem: 18817 Epoch: [274/300] [1100/1251] eta: 0:02:24 lr: 0.000053 loss: 2.729363 (2.585121) time: 0.908000 data: 0.000166 max mem: 18817 Epoch: [274/300] [1150/1251] eta: 0:01:36 lr: 0.000053 loss: 2.749120 (2.586264) time: 0.960078 data: 0.000190 max mem: 18817 Epoch: [274/300] [1200/1251] eta: 0:00:48 lr: 0.000053 loss: 2.844939 (2.588345) time: 0.958511 data: 0.000179 max mem: 18817 Epoch: [274/300] [1250/1251] eta: 0:00:00 lr: 0.000053 loss: 2.715559 (2.589605) time: 0.927784 data: 0.000761 max mem: 18817 Epoch: [274/300] Total time: 0:19:54 (0.954708 s / it) Averaged stats: lr: 0.000053 loss: 2.715559 (2.585513) Test: [ 0/49] eta: 0:01:30 loss: 0.477780 (0.477780) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.845396 data: 1.460342 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.556103 (0.642524) acc1: 85.937500 (84.943182) acc5: 98.437500 (97.017045) time: 0.499093 data: 0.132875 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.715397 (0.676498) acc1: 84.375000 (84.523810) acc5: 96.875000 (97.098214) time: 0.358666 data: 0.000128 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.683128 (0.676180) acc1: 82.812500 (84.223790) acc5: 96.875000 (97.227823) time: 0.351872 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.688426 (0.692821) acc1: 82.812500 (83.993902) acc5: 98.437500 (97.179878) time: 0.350494 data: 0.000134 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.688426 (0.689497) acc1: 82.812500 (84.224000) acc5: 98.437500 (97.152000) time: 0.448332 data: 0.000113 max mem: 18817 Test: Total time: 0:00:20 (0.427433 s / it) * Acc@1 83.900 Acc@5 96.740 loss 0.710 Max accuracy: 84.00% Epoch: [275/300] [ 0/1251] eta: 0:45:50 lr: 0.000053 loss: 2.123623 (2.123623) time: 2.198819 data: 1.152848 max mem: 18817 Epoch: [275/300] [ 50/1251] eta: 0:19:39 lr: 0.000053 loss: 2.759545 (2.662097) time: 0.982341 data: 0.000189 max mem: 18817 Epoch: [275/300] [ 100/1251] eta: 0:18:39 lr: 0.000053 loss: 2.510045 (2.576910) time: 0.964414 data: 0.000174 max mem: 18817 Epoch: [275/300] [ 150/1251] eta: 0:17:37 lr: 0.000053 loss: 2.719450 (2.595905) time: 0.959871 data: 0.000178 max mem: 18817 Epoch: [275/300] [ 200/1251] eta: 0:16:43 lr: 0.000053 loss: 2.772951 (2.598763) time: 0.923915 data: 0.000168 max mem: 18817 Epoch: [275/300] [ 250/1251] eta: 0:15:51 lr: 0.000053 loss: 2.666660 (2.585555) time: 0.916299 data: 0.000170 max mem: 18817 Epoch: [275/300] [ 300/1251] eta: 0:15:07 lr: 0.000053 loss: 2.739369 (2.593251) time: 0.919237 data: 0.000165 max mem: 18817 Epoch: [275/300] [ 350/1251] eta: 0:14:19 lr: 0.000052 loss: 2.458384 (2.587366) time: 0.967114 data: 0.000183 max mem: 18817 Epoch: [275/300] [ 400/1251] eta: 0:13:34 lr: 0.000052 loss: 2.514225 (2.591022) time: 1.006394 data: 0.000175 max mem: 18817 Epoch: [275/300] [ 450/1251] eta: 0:12:45 lr: 0.000052 loss: 2.637324 (2.585635) time: 0.970571 data: 0.000173 max mem: 18817 Epoch: [275/300] [ 500/1251] eta: 0:11:55 lr: 0.000052 loss: 2.567640 (2.572597) time: 0.906988 data: 0.000176 max mem: 18817 Epoch: [275/300] [ 550/1251] eta: 0:11:08 lr: 0.000052 loss: 2.640515 (2.580225) time: 0.922126 data: 0.000162 max mem: 18817 Epoch: [275/300] [ 600/1251] eta: 0:10:20 lr: 0.000052 loss: 2.581620 (2.583267) time: 0.963226 data: 0.000171 max mem: 18817 Epoch: [275/300] [ 650/1251] eta: 0:09:32 lr: 0.000052 loss: 2.799144 (2.588544) time: 0.965518 data: 0.000173 max mem: 18817 Epoch: [275/300] [ 700/1251] eta: 0:08:44 lr: 0.000052 loss: 2.787169 (2.588038) time: 0.969615 data: 0.000175 max mem: 18817 Epoch: [275/300] [ 750/1251] eta: 0:07:56 lr: 0.000052 loss: 2.694920 (2.590201) time: 0.957810 data: 0.000180 max mem: 18817 Epoch: [275/300] [ 800/1251] eta: 0:07:09 lr: 0.000051 loss: 2.501943 (2.584798) time: 0.940115 data: 0.000186 max mem: 18817 Epoch: [275/300] [ 850/1251] eta: 0:06:22 lr: 0.000051 loss: 2.599259 (2.585791) time: 0.922146 data: 0.000182 max mem: 18817 Epoch: [275/300] [ 900/1251] eta: 0:05:34 lr: 0.000051 loss: 2.870341 (2.585888) time: 0.975951 data: 0.000182 max mem: 18817 Epoch: [275/300] [ 950/1251] eta: 0:04:46 lr: 0.000051 loss: 2.566388 (2.586373) time: 0.947141 data: 0.000164 max mem: 18817 Epoch: [275/300] [1000/1251] eta: 0:03:59 lr: 0.000051 loss: 2.623325 (2.587083) time: 0.956368 data: 0.000164 max mem: 18817 Epoch: [275/300] [1050/1251] eta: 0:03:11 lr: 0.000051 loss: 2.760605 (2.589951) time: 0.911449 data: 0.000175 max mem: 18817 Epoch: [275/300] [1100/1251] eta: 0:02:23 lr: 0.000051 loss: 2.751121 (2.590248) time: 0.911722 data: 0.000181 max mem: 18817 Epoch: [275/300] [1150/1251] eta: 0:01:36 lr: 0.000051 loss: 2.468490 (2.592956) time: 0.903634 data: 0.000168 max mem: 18817 Epoch: [275/300] [1200/1251] eta: 0:00:48 lr: 0.000051 loss: 2.790573 (2.592728) time: 0.963285 data: 0.000171 max mem: 18817 Epoch: [275/300] [1250/1251] eta: 0:00:00 lr: 0.000051 loss: 2.774858 (2.595190) time: 1.021081 data: 0.000754 max mem: 18817 Epoch: [275/300] Total time: 0:19:52 (0.953207 s / it) Averaged stats: lr: 0.000051 loss: 2.774858 (2.588466) Test: [ 0/49] eta: 0:01:26 loss: 0.518852 (0.518852) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.760558 data: 1.331576 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.551115 (0.647263) acc1: 85.937500 (85.653409) acc5: 96.875000 (96.732955) time: 0.520397 data: 0.121204 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.734729 (0.679793) acc1: 82.812500 (84.672619) acc5: 96.875000 (96.726190) time: 0.399000 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.657873 (0.680572) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.975806) time: 0.376958 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.680606 (0.693237) acc1: 82.812500 (84.222561) acc5: 96.875000 (96.951220) time: 0.349391 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.680606 (0.688668) acc1: 84.375000 (84.416000) acc5: 96.875000 (96.992000) time: 0.344278 data: 0.000100 max mem: 18817 Test: Total time: 0:00:19 (0.399165 s / it) * Acc@1 84.048 Acc@5 96.782 loss 0.710 Max accuracy: 84.05% Epoch: [276/300] [ 0/1251] eta: 0:42:46 lr: 0.000051 loss: 2.769105 (2.769105) time: 2.051514 data: 1.191356 max mem: 18817 Epoch: [276/300] [ 50/1251] eta: 0:19:07 lr: 0.000050 loss: 2.482169 (2.533489) time: 0.937039 data: 0.000184 max mem: 18817 Epoch: [276/300] [ 100/1251] eta: 0:18:16 lr: 0.000050 loss: 2.368911 (2.525521) time: 0.919588 data: 0.000199 max mem: 18817 Epoch: [276/300] [ 150/1251] eta: 0:17:32 lr: 0.000050 loss: 2.527339 (2.528152) time: 0.912984 data: 0.000178 max mem: 18817 Epoch: [276/300] [ 200/1251] eta: 0:16:36 lr: 0.000050 loss: 2.729623 (2.549942) time: 0.901789 data: 0.000174 max mem: 18817 Epoch: [276/300] [ 250/1251] eta: 0:15:51 lr: 0.000050 loss: 2.471672 (2.550505) time: 0.963947 data: 0.000175 max mem: 18817 Epoch: [276/300] [ 300/1251] eta: 0:15:00 lr: 0.000050 loss: 2.636049 (2.561526) time: 0.955382 data: 0.000188 max mem: 18817 Epoch: [276/300] [ 350/1251] eta: 0:14:16 lr: 0.000050 loss: 2.623208 (2.569412) time: 0.988922 data: 0.000170 max mem: 18817 Epoch: [276/300] [ 400/1251] eta: 0:13:27 lr: 0.000050 loss: 2.511986 (2.564276) time: 0.930943 data: 0.000162 max mem: 18817 Epoch: [276/300] [ 450/1251] eta: 0:12:40 lr: 0.000050 loss: 2.685917 (2.569098) time: 0.934670 data: 0.000183 max mem: 18817 Epoch: [276/300] [ 500/1251] eta: 0:11:53 lr: 0.000050 loss: 2.584895 (2.572509) time: 0.970647 data: 0.000178 max mem: 18817 Epoch: [276/300] [ 550/1251] eta: 0:11:05 lr: 0.000049 loss: 2.688714 (2.579485) time: 0.966469 data: 0.000192 max mem: 18817 Epoch: [276/300] [ 600/1251] eta: 0:10:17 lr: 0.000049 loss: 2.741732 (2.579561) time: 0.914248 data: 0.000160 max mem: 18817 Epoch: [276/300] [ 650/1251] eta: 0:09:30 lr: 0.000049 loss: 2.614849 (2.576322) time: 0.911502 data: 0.000165 max mem: 18817 Epoch: [276/300] [ 700/1251] eta: 0:08:42 lr: 0.000049 loss: 2.523220 (2.576154) time: 0.932379 data: 0.000190 max mem: 18817 Epoch: [276/300] [ 750/1251] eta: 0:07:54 lr: 0.000049 loss: 2.614919 (2.573828) time: 0.923719 data: 0.000174 max mem: 18817 Epoch: [276/300] [ 800/1251] eta: 0:07:07 lr: 0.000049 loss: 2.685321 (2.577098) time: 0.947220 data: 0.000177 max mem: 18817 Epoch: [276/300] [ 850/1251] eta: 0:06:19 lr: 0.000049 loss: 2.648093 (2.580783) time: 0.919537 data: 0.000166 max mem: 18817 Epoch: [276/300] [ 900/1251] eta: 0:05:32 lr: 0.000049 loss: 2.572070 (2.587052) time: 0.923795 data: 0.000187 max mem: 18817 Epoch: [276/300] [ 950/1251] eta: 0:04:45 lr: 0.000049 loss: 2.554646 (2.584633) time: 0.966948 data: 0.000182 max mem: 18817 Epoch: [276/300] [1000/1251] eta: 0:03:58 lr: 0.000049 loss: 2.795633 (2.584576) time: 0.983511 data: 0.000169 max mem: 18817 Epoch: [276/300] [1050/1251] eta: 0:03:10 lr: 0.000048 loss: 2.515234 (2.587108) time: 0.973882 data: 0.000170 max mem: 18817 Epoch: [276/300] [1100/1251] eta: 0:02:23 lr: 0.000048 loss: 2.656931 (2.587424) time: 0.925892 data: 0.000174 max mem: 18817 Epoch: [276/300] [1150/1251] eta: 0:01:35 lr: 0.000048 loss: 2.817752 (2.590928) time: 0.910524 data: 0.000179 max mem: 18817 Epoch: [276/300] [1200/1251] eta: 0:00:48 lr: 0.000048 loss: 2.762090 (2.591006) time: 0.968671 data: 0.000165 max mem: 18817 Epoch: [276/300] [1250/1251] eta: 0:00:00 lr: 0.000048 loss: 2.476460 (2.587291) time: 0.983624 data: 0.000772 max mem: 18817 Epoch: [276/300] Total time: 0:19:47 (0.949548 s / it) Averaged stats: lr: 0.000048 loss: 2.476460 (2.585139) Test: [ 0/49] eta: 0:01:29 loss: 0.514596 (0.514596) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.821075 data: 1.440600 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.543620 (0.649410) acc1: 84.375000 (85.511364) acc5: 96.875000 (96.875000) time: 0.490917 data: 0.131100 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.719586 (0.675987) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.949405) time: 0.354700 data: 0.000138 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.656403 (0.675286) acc1: 82.812500 (84.324597) acc5: 96.875000 (97.076613) time: 0.351770 data: 0.000132 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.676056 (0.689033) acc1: 84.375000 (84.146341) acc5: 96.875000 (97.065549) time: 0.349517 data: 0.000140 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.678898 (0.686558) acc1: 84.375000 (84.160000) acc5: 96.875000 (97.120000) time: 0.344148 data: 0.000116 max mem: 18817 Test: Total time: 0:00:18 (0.382457 s / it) * Acc@1 84.026 Acc@5 96.802 loss 0.706 Max accuracy: 84.05% Epoch: [277/300] [ 0/1251] eta: 0:43:00 lr: 0.000048 loss: 2.962480 (2.962480) time: 2.062372 data: 1.186363 max mem: 18817 Epoch: [277/300] [ 50/1251] eta: 0:19:55 lr: 0.000048 loss: 2.686615 (2.606418) time: 0.918691 data: 0.000177 max mem: 18817 Epoch: [277/300] [ 100/1251] eta: 0:18:48 lr: 0.000048 loss: 2.687907 (2.607796) time: 0.967155 data: 0.000184 max mem: 18817 Epoch: [277/300] [ 150/1251] eta: 0:17:50 lr: 0.000048 loss: 2.694436 (2.584528) time: 0.972183 data: 0.000173 max mem: 18817 Epoch: [277/300] [ 200/1251] eta: 0:16:53 lr: 0.000048 loss: 2.447423 (2.572696) time: 0.985802 data: 0.000178 max mem: 18817 Epoch: [277/300] [ 250/1251] eta: 0:15:58 lr: 0.000048 loss: 2.366513 (2.561692) time: 0.909593 data: 0.000188 max mem: 18817 Epoch: [277/300] [ 300/1251] eta: 0:15:12 lr: 0.000047 loss: 2.760815 (2.575045) time: 0.920957 data: 0.000177 max mem: 18817 Epoch: [277/300] [ 350/1251] eta: 0:14:24 lr: 0.000047 loss: 2.626060 (2.567555) time: 0.980546 data: 0.000178 max mem: 18817 Epoch: [277/300] [ 400/1251] eta: 0:13:36 lr: 0.000047 loss: 2.625754 (2.575119) time: 1.020857 data: 0.000188 max mem: 18817 Epoch: [277/300] [ 450/1251] eta: 0:12:46 lr: 0.000047 loss: 2.427367 (2.565049) time: 0.977131 data: 0.000184 max mem: 18817 Epoch: [277/300] [ 500/1251] eta: 0:11:57 lr: 0.000047 loss: 2.743592 (2.574165) time: 0.910010 data: 0.000174 max mem: 18817 Epoch: [277/300] [ 550/1251] eta: 0:11:08 lr: 0.000047 loss: 2.662763 (2.577299) time: 0.913394 data: 0.000160 max mem: 18817 Epoch: [277/300] [ 600/1251] eta: 0:10:21 lr: 0.000047 loss: 2.663231 (2.576643) time: 0.926156 data: 0.000184 max mem: 18817 Epoch: [277/300] [ 650/1251] eta: 0:09:33 lr: 0.000047 loss: 2.759180 (2.580441) time: 0.960977 data: 0.000169 max mem: 18817 Epoch: [277/300] [ 700/1251] eta: 0:08:46 lr: 0.000047 loss: 2.751138 (2.584464) time: 1.041786 data: 0.000171 max mem: 18817 Epoch: [277/300] [ 750/1251] eta: 0:07:58 lr: 0.000047 loss: 2.581567 (2.580124) time: 0.986773 data: 0.000168 max mem: 18817 Epoch: [277/300] [ 800/1251] eta: 0:07:10 lr: 0.000046 loss: 2.751976 (2.583334) time: 0.919208 data: 0.000172 max mem: 18817 Epoch: [277/300] [ 850/1251] eta: 0:06:22 lr: 0.000046 loss: 2.596706 (2.578763) time: 0.922438 data: 0.000185 max mem: 18817 Epoch: [277/300] [ 900/1251] eta: 0:05:35 lr: 0.000046 loss: 2.498893 (2.576800) time: 0.966495 data: 0.000174 max mem: 18817 Epoch: [277/300] [ 950/1251] eta: 0:04:47 lr: 0.000046 loss: 2.707075 (2.579269) time: 1.007105 data: 0.000178 max mem: 18817 Epoch: [277/300] [1000/1251] eta: 0:03:59 lr: 0.000046 loss: 2.669655 (2.572771) time: 0.977674 data: 0.000158 max mem: 18817 Epoch: [277/300] [1050/1251] eta: 0:03:11 lr: 0.000046 loss: 2.579553 (2.572264) time: 0.919103 data: 0.000186 max mem: 18817 Epoch: [277/300] [1100/1251] eta: 0:02:24 lr: 0.000046 loss: 2.453584 (2.567346) time: 0.911219 data: 0.000204 max mem: 18817 Epoch: [277/300] [1150/1251] eta: 0:01:36 lr: 0.000046 loss: 2.779896 (2.567476) time: 0.968829 data: 0.000178 max mem: 18817 Epoch: [277/300] [1200/1251] eta: 0:00:48 lr: 0.000046 loss: 2.645143 (2.565340) time: 0.953447 data: 0.000165 max mem: 18817 Epoch: [277/300] [1250/1251] eta: 0:00:00 lr: 0.000046 loss: 2.775993 (2.565688) time: 0.905969 data: 0.000748 max mem: 18817 Epoch: [277/300] Total time: 0:19:51 (0.952274 s / it) Averaged stats: lr: 0.000046 loss: 2.775993 (2.564461) Test: [ 0/49] eta: 0:02:03 loss: 0.517091 (0.517091) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 2.521204 data: 1.499643 max mem: 18817 Test: [10/49] eta: 0:00:23 loss: 0.584946 (0.657102) acc1: 84.375000 (85.369318) acc5: 98.437500 (97.017045) time: 0.614984 data: 0.136460 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.722503 (0.688812) acc1: 84.375000 (84.449405) acc5: 96.875000 (97.023810) time: 0.389064 data: 0.000150 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.683571 (0.687800) acc1: 82.812500 (84.324597) acc5: 96.875000 (97.177419) time: 0.352384 data: 0.000160 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.683571 (0.700364) acc1: 82.812500 (84.108232) acc5: 98.437500 (97.179878) time: 0.349055 data: 0.000151 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.686457 (0.697891) acc1: 82.812500 (84.192000) acc5: 98.437500 (97.216000) time: 0.343960 data: 0.000113 max mem: 18817 Test: Total time: 0:00:20 (0.411171 s / it) * Acc@1 83.964 Acc@5 96.772 loss 0.717 Max accuracy: 84.05% Epoch: [278/300] [ 0/1251] eta: 0:41:20 lr: 0.000046 loss: 2.588370 (2.588370) time: 1.982992 data: 1.119774 max mem: 18817 Epoch: [278/300] [ 50/1251] eta: 0:19:39 lr: 0.000046 loss: 2.704292 (2.613656) time: 0.926522 data: 0.000172 max mem: 18817 Epoch: [278/300] [ 100/1251] eta: 0:18:29 lr: 0.000045 loss: 2.602111 (2.594522) time: 0.941462 data: 0.000183 max mem: 18817 Epoch: [278/300] [ 150/1251] eta: 0:17:43 lr: 0.000045 loss: 2.541895 (2.596449) time: 1.045641 data: 0.000186 max mem: 18817 Epoch: [278/300] [ 200/1251] eta: 0:16:47 lr: 0.000045 loss: 2.688078 (2.557384) time: 0.955957 data: 0.000173 max mem: 18817 Epoch: [278/300] [ 250/1251] eta: 0:15:55 lr: 0.000045 loss: 2.445596 (2.549475) time: 0.920792 data: 0.000180 max mem: 18817 Epoch: [278/300] [ 300/1251] eta: 0:15:08 lr: 0.000045 loss: 2.642457 (2.554551) time: 0.912363 data: 0.000170 max mem: 18817 Epoch: [278/300] [ 350/1251] eta: 0:14:22 lr: 0.000045 loss: 2.580149 (2.552643) time: 0.963205 data: 0.000160 max mem: 18817 Epoch: [278/300] [ 400/1251] eta: 0:13:35 lr: 0.000045 loss: 2.740813 (2.548294) time: 1.025769 data: 0.000163 max mem: 18817 Epoch: [278/300] [ 450/1251] eta: 0:12:44 lr: 0.000045 loss: 2.524449 (2.545493) time: 0.929667 data: 0.000164 max mem: 18817 Epoch: [278/300] [ 500/1251] eta: 0:11:57 lr: 0.000045 loss: 2.575604 (2.541153) time: 0.917202 data: 0.000169 max mem: 18817 Epoch: [278/300] [ 550/1251] eta: 0:11:08 lr: 0.000045 loss: 2.567970 (2.545721) time: 0.930729 data: 0.000174 max mem: 18817 Epoch: [278/300] [ 600/1251] eta: 0:10:22 lr: 0.000045 loss: 2.703653 (2.556126) time: 0.918613 data: 0.000166 max mem: 18817 Epoch: [278/300] [ 650/1251] eta: 0:09:34 lr: 0.000044 loss: 2.343899 (2.552829) time: 0.966326 data: 0.000171 max mem: 18817 Epoch: [278/300] [ 700/1251] eta: 0:08:46 lr: 0.000044 loss: 2.327587 (2.552812) time: 0.982013 data: 0.000175 max mem: 18817 Epoch: [278/300] [ 750/1251] eta: 0:07:58 lr: 0.000044 loss: 2.581013 (2.553552) time: 0.966931 data: 0.000179 max mem: 18817 Epoch: [278/300] [ 800/1251] eta: 0:07:10 lr: 0.000044 loss: 2.796672 (2.554022) time: 0.911451 data: 0.000168 max mem: 18817 Epoch: [278/300] [ 850/1251] eta: 0:06:22 lr: 0.000044 loss: 2.330415 (2.548472) time: 0.917179 data: 0.000165 max mem: 18817 Epoch: [278/300] [ 900/1251] eta: 0:05:35 lr: 0.000044 loss: 2.418911 (2.552529) time: 0.999945 data: 0.000175 max mem: 18817 Epoch: [278/300] [ 950/1251] eta: 0:04:47 lr: 0.000044 loss: 2.773116 (2.559939) time: 0.957990 data: 0.000170 max mem: 18817 Epoch: [278/300] [1000/1251] eta: 0:03:59 lr: 0.000044 loss: 2.656516 (2.559761) time: 0.972370 data: 0.000159 max mem: 18817 Epoch: [278/300] [1050/1251] eta: 0:03:12 lr: 0.000044 loss: 2.712220 (2.564351) time: 0.981264 data: 0.000173 max mem: 18817 Epoch: [278/300] [1100/1251] eta: 0:02:24 lr: 0.000044 loss: 2.580647 (2.563178) time: 0.929942 data: 0.000178 max mem: 18817 Epoch: [278/300] [1150/1251] eta: 0:01:36 lr: 0.000044 loss: 2.522713 (2.562999) time: 0.919978 data: 0.000184 max mem: 18817 Epoch: [278/300] [1200/1251] eta: 0:00:48 lr: 0.000043 loss: 2.148106 (2.561265) time: 0.974290 data: 0.000171 max mem: 18817 Epoch: [278/300] [1250/1251] eta: 0:00:00 lr: 0.000043 loss: 2.696171 (2.563843) time: 1.034999 data: 0.000757 max mem: 18817 Epoch: [278/300] Total time: 0:19:56 (0.956688 s / it) Averaged stats: lr: 0.000043 loss: 2.696171 (2.561479) Test: [ 0/49] eta: 0:01:27 loss: 0.499214 (0.499214) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.792592 data: 1.383004 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.579135 (0.654000) acc1: 84.375000 (84.517045) acc5: 96.875000 (96.590909) time: 0.486361 data: 0.125855 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.742824 (0.687829) acc1: 82.812500 (83.482143) acc5: 96.875000 (96.800595) time: 0.364731 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.685697 (0.684106) acc1: 82.812500 (83.669355) acc5: 98.437500 (97.127016) time: 0.380466 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.680821 (0.698189) acc1: 84.375000 (83.574695) acc5: 96.875000 (96.951220) time: 0.367142 data: 0.000120 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.680821 (0.693000) acc1: 84.375000 (83.808000) acc5: 96.875000 (97.024000) time: 0.351598 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.393974 s / it) * Acc@1 83.992 Acc@5 96.774 loss 0.710 Max accuracy: 84.05% Epoch: [279/300] [ 0/1251] eta: 0:42:36 lr: 0.000043 loss: 2.683682 (2.683682) time: 2.043260 data: 1.188067 max mem: 18817 Epoch: [279/300] [ 50/1251] eta: 0:19:01 lr: 0.000043 loss: 2.666008 (2.529634) time: 0.914220 data: 0.000178 max mem: 18817 Epoch: [279/300] [ 100/1251] eta: 0:18:14 lr: 0.000043 loss: 2.577423 (2.554004) time: 0.918101 data: 0.000176 max mem: 18817 Epoch: [279/300] [ 150/1251] eta: 0:17:31 lr: 0.000043 loss: 2.644373 (2.547953) time: 0.966091 data: 0.000182 max mem: 18817 Epoch: [279/300] [ 200/1251] eta: 0:16:41 lr: 0.000043 loss: 2.644401 (2.553721) time: 0.992795 data: 0.000169 max mem: 18817 Epoch: [279/300] [ 250/1251] eta: 0:15:49 lr: 0.000043 loss: 2.552141 (2.553464) time: 0.931879 data: 0.000181 max mem: 18817 Epoch: [279/300] [ 300/1251] eta: 0:15:00 lr: 0.000043 loss: 2.633133 (2.548162) time: 0.914339 data: 0.000174 max mem: 18817 Epoch: [279/300] [ 350/1251] eta: 0:14:15 lr: 0.000043 loss: 2.429058 (2.546718) time: 0.916796 data: 0.000170 max mem: 18817 Epoch: [279/300] [ 400/1251] eta: 0:13:28 lr: 0.000043 loss: 2.727714 (2.554472) time: 0.952923 data: 0.000186 max mem: 18817 Epoch: [279/300] [ 450/1251] eta: 0:12:40 lr: 0.000043 loss: 2.449831 (2.558739) time: 0.965686 data: 0.000190 max mem: 18817 Epoch: [279/300] [ 500/1251] eta: 0:11:51 lr: 0.000042 loss: 2.608377 (2.558409) time: 0.909718 data: 0.000171 max mem: 18817 Epoch: [279/300] [ 550/1251] eta: 0:11:04 lr: 0.000042 loss: 2.632380 (2.551848) time: 0.908221 data: 0.000175 max mem: 18817 Epoch: [279/300] [ 600/1251] eta: 0:10:17 lr: 0.000042 loss: 2.594505 (2.550443) time: 0.946649 data: 0.000184 max mem: 18817 Epoch: [279/300] [ 650/1251] eta: 0:09:30 lr: 0.000042 loss: 2.596906 (2.552093) time: 1.027859 data: 0.000167 max mem: 18817 Epoch: [279/300] [ 700/1251] eta: 0:08:42 lr: 0.000042 loss: 2.724854 (2.548225) time: 0.955479 data: 0.000162 max mem: 18817 Epoch: [279/300] [ 750/1251] eta: 0:07:54 lr: 0.000042 loss: 2.615795 (2.549085) time: 0.915596 data: 0.000181 max mem: 18817 Epoch: [279/300] [ 800/1251] eta: 0:07:07 lr: 0.000042 loss: 2.664227 (2.550590) time: 0.923021 data: 0.000164 max mem: 18817 Epoch: [279/300] [ 850/1251] eta: 0:06:20 lr: 0.000042 loss: 2.636714 (2.554224) time: 0.957238 data: 0.000189 max mem: 18817 Epoch: [279/300] [ 900/1251] eta: 0:05:33 lr: 0.000042 loss: 2.650377 (2.556070) time: 1.027025 data: 0.000187 max mem: 18817 Epoch: [279/300] [ 950/1251] eta: 0:04:45 lr: 0.000042 loss: 2.587162 (2.556236) time: 0.970106 data: 0.000185 max mem: 18817 Epoch: [279/300] [1000/1251] eta: 0:03:57 lr: 0.000042 loss: 2.525931 (2.554546) time: 0.906388 data: 0.000162 max mem: 18817 Epoch: [279/300] [1050/1251] eta: 0:03:10 lr: 0.000041 loss: 2.669513 (2.560028) time: 0.915649 data: 0.000172 max mem: 18817 Epoch: [279/300] [1100/1251] eta: 0:02:23 lr: 0.000041 loss: 2.618109 (2.563412) time: 0.966400 data: 0.000178 max mem: 18817 Epoch: [279/300] [1150/1251] eta: 0:01:35 lr: 0.000041 loss: 2.663989 (2.564770) time: 1.028626 data: 0.000182 max mem: 18817 Epoch: [279/300] [1200/1251] eta: 0:00:48 lr: 0.000041 loss: 2.503566 (2.562584) time: 0.960185 data: 0.000168 max mem: 18817 Epoch: [279/300] [1250/1251] eta: 0:00:00 lr: 0.000041 loss: 2.569933 (2.561355) time: 0.918365 data: 0.000756 max mem: 18817 Epoch: [279/300] Total time: 0:19:47 (0.949452 s / it) Averaged stats: lr: 0.000041 loss: 2.569933 (2.565812) Test: [ 0/49] eta: 0:01:16 loss: 0.506663 (0.506663) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.562373 data: 1.176173 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.601429 (0.647535) acc1: 84.375000 (85.511364) acc5: 96.875000 (97.017045) time: 0.467268 data: 0.107063 max mem: 18817 Test: [20/49] eta: 0:00:11 loss: 0.721326 (0.676810) acc1: 84.375000 (84.449405) acc5: 96.875000 (97.023810) time: 0.355275 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.675573 (0.674980) acc1: 82.812500 (84.375000) acc5: 96.875000 (97.177419) time: 0.446320 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.675573 (0.691388) acc1: 82.812500 (84.070122) acc5: 96.875000 (97.217988) time: 0.443291 data: 0.000132 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.675573 (0.690333) acc1: 82.812500 (84.192000) acc5: 96.875000 (97.216000) time: 0.344411 data: 0.000108 max mem: 18817 Test: Total time: 0:00:20 (0.415413 s / it) * Acc@1 83.978 Acc@5 96.792 loss 0.711 Max accuracy: 84.05% Epoch: [280/300] [ 0/1251] eta: 0:40:18 lr: 0.000041 loss: 2.259890 (2.259890) time: 1.933486 data: 1.068596 max mem: 18817 Epoch: [280/300] [ 50/1251] eta: 0:19:00 lr: 0.000041 loss: 2.652825 (2.579485) time: 0.965701 data: 0.000169 max mem: 18817 Epoch: [280/300] [ 100/1251] eta: 0:18:19 lr: 0.000041 loss: 2.577003 (2.564970) time: 0.946995 data: 0.000176 max mem: 18817 Epoch: [280/300] [ 150/1251] eta: 0:17:26 lr: 0.000041 loss: 2.661468 (2.557112) time: 0.913514 data: 0.000183 max mem: 18817 Epoch: [280/300] [ 200/1251] eta: 0:16:41 lr: 0.000041 loss: 2.443582 (2.538044) time: 0.942657 data: 0.000632 max mem: 18817 Epoch: [280/300] [ 250/1251] eta: 0:15:55 lr: 0.000041 loss: 2.531641 (2.528127) time: 0.962066 data: 0.000169 max mem: 18817 Epoch: [280/300] [ 300/1251] eta: 0:15:06 lr: 0.000041 loss: 2.676548 (2.527745) time: 0.984498 data: 0.000177 max mem: 18817 Epoch: [280/300] [ 350/1251] eta: 0:14:17 lr: 0.000041 loss: 2.457546 (2.525564) time: 0.919986 data: 0.000167 max mem: 18817 Epoch: [280/300] [ 400/1251] eta: 0:13:30 lr: 0.000040 loss: 2.394168 (2.522133) time: 0.911165 data: 0.000176 max mem: 18817 Epoch: [280/300] [ 450/1251] eta: 0:12:42 lr: 0.000040 loss: 2.661770 (2.515321) time: 0.961981 data: 0.000195 max mem: 18817 Epoch: [280/300] [ 500/1251] eta: 0:11:56 lr: 0.000040 loss: 2.606375 (2.526077) time: 0.977406 data: 0.000178 max mem: 18817 Epoch: [280/300] [ 550/1251] eta: 0:11:07 lr: 0.000040 loss: 2.541954 (2.528351) time: 0.968184 data: 0.000171 max mem: 18817 Epoch: [280/300] [ 600/1251] eta: 0:10:19 lr: 0.000040 loss: 2.610651 (2.526131) time: 0.919799 data: 0.000177 max mem: 18817 Epoch: [280/300] [ 650/1251] eta: 0:09:31 lr: 0.000040 loss: 2.601879 (2.534911) time: 0.905347 data: 0.000176 max mem: 18817 Epoch: [280/300] [ 700/1251] eta: 0:08:44 lr: 0.000040 loss: 2.697353 (2.540701) time: 0.976900 data: 0.000168 max mem: 18817 Epoch: [280/300] [ 750/1251] eta: 0:07:56 lr: 0.000040 loss: 2.544240 (2.537043) time: 0.995130 data: 0.000176 max mem: 18817 Epoch: [280/300] [ 800/1251] eta: 0:07:08 lr: 0.000040 loss: 2.754752 (2.547038) time: 0.970438 data: 0.000170 max mem: 18817 Epoch: [280/300] [ 850/1251] eta: 0:06:21 lr: 0.000040 loss: 2.537393 (2.548144) time: 0.920853 data: 0.000170 max mem: 18817 Epoch: [280/300] [ 900/1251] eta: 0:05:33 lr: 0.000040 loss: 2.585796 (2.545262) time: 0.929352 data: 0.000177 max mem: 18817 Epoch: [280/300] [ 950/1251] eta: 0:04:46 lr: 0.000040 loss: 2.699272 (2.547425) time: 0.972741 data: 0.000188 max mem: 18817 Epoch: [280/300] [1000/1251] eta: 0:03:58 lr: 0.000039 loss: 2.655098 (2.550940) time: 0.967176 data: 0.000162 max mem: 18817 Epoch: [280/300] [1050/1251] eta: 0:03:11 lr: 0.000039 loss: 2.390816 (2.550126) time: 0.969993 data: 0.000184 max mem: 18817 Epoch: [280/300] [1100/1251] eta: 0:02:23 lr: 0.000039 loss: 2.704460 (2.550176) time: 0.917632 data: 0.000170 max mem: 18817 Epoch: [280/300] [1150/1251] eta: 0:01:36 lr: 0.000039 loss: 2.673616 (2.551325) time: 0.924375 data: 0.000176 max mem: 18817 Epoch: [280/300] [1200/1251] eta: 0:00:48 lr: 0.000039 loss: 2.622617 (2.547532) time: 0.977482 data: 0.000174 max mem: 18817 Epoch: [280/300] [1250/1251] eta: 0:00:00 lr: 0.000039 loss: 2.479057 (2.545563) time: 0.952274 data: 0.000816 max mem: 18817 Epoch: [280/300] Total time: 0:19:50 (0.951767 s / it) Averaged stats: lr: 0.000039 loss: 2.479057 (2.550360) Test: [ 0/49] eta: 0:01:14 loss: 0.468561 (0.468561) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.524208 data: 1.094517 max mem: 18817 Test: [10/49] eta: 0:00:24 loss: 0.573402 (0.639698) acc1: 84.375000 (85.653409) acc5: 96.875000 (96.590909) time: 0.621563 data: 0.099641 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.724879 (0.674533) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.651786) time: 0.442405 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.658982 (0.671404) acc1: 84.375000 (84.324597) acc5: 98.437500 (97.076613) time: 0.351937 data: 0.000135 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.675034 (0.688030) acc1: 84.375000 (84.108232) acc5: 96.875000 (97.027439) time: 0.348016 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.675034 (0.685838) acc1: 84.375000 (84.320000) acc5: 96.875000 (97.024000) time: 0.343382 data: 0.000102 max mem: 18817 Test: Total time: 0:00:20 (0.411293 s / it) * Acc@1 84.034 Acc@5 96.770 loss 0.703 Max accuracy: 84.05% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0280.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0280.pth Epoch: [281/300] [ 0/1251] eta: 0:43:40 lr: 0.000039 loss: 2.481091 (2.481091) time: 2.094612 data: 1.221899 max mem: 18817 Epoch: [281/300] [ 50/1251] eta: 0:19:29 lr: 0.000039 loss: 2.653161 (2.597027) time: 0.902322 data: 0.000175 max mem: 18817 Epoch: [281/300] [ 100/1251] eta: 0:18:30 lr: 0.000039 loss: 2.480471 (2.543392) time: 0.971810 data: 0.000174 max mem: 18817 Epoch: [281/300] [ 150/1251] eta: 0:17:31 lr: 0.000039 loss: 2.448177 (2.541719) time: 0.976685 data: 0.000170 max mem: 18817 Epoch: [281/300] [ 200/1251] eta: 0:16:43 lr: 0.000039 loss: 2.642490 (2.553361) time: 0.981547 data: 0.000180 max mem: 18817 Epoch: [281/300] [ 250/1251] eta: 0:15:53 lr: 0.000039 loss: 2.645639 (2.544736) time: 0.919109 data: 0.000175 max mem: 18817 Epoch: [281/300] [ 300/1251] eta: 0:15:08 lr: 0.000039 loss: 2.608827 (2.539200) time: 0.920934 data: 0.000171 max mem: 18817 Epoch: [281/300] [ 350/1251] eta: 0:14:25 lr: 0.000039 loss: 2.746839 (2.549350) time: 0.987934 data: 0.000164 max mem: 18817 Epoch: [281/300] [ 400/1251] eta: 0:13:32 lr: 0.000038 loss: 2.558412 (2.552000) time: 0.920317 data: 0.000177 max mem: 18817 Epoch: [281/300] [ 450/1251] eta: 0:12:46 lr: 0.000038 loss: 2.442117 (2.547145) time: 0.918647 data: 0.000176 max mem: 18817 Epoch: [281/300] [ 500/1251] eta: 0:11:59 lr: 0.000038 loss: 2.255292 (2.559596) time: 0.936351 data: 0.000170 max mem: 18817 Epoch: [281/300] [ 550/1251] eta: 0:11:11 lr: 0.000038 loss: 2.484095 (2.553086) time: 0.963380 data: 0.000187 max mem: 18817 Epoch: [281/300] [ 600/1251] eta: 0:10:22 lr: 0.000038 loss: 2.458404 (2.547828) time: 0.960617 data: 0.000184 max mem: 18817 Epoch: [281/300] [ 650/1251] eta: 0:09:34 lr: 0.000038 loss: 2.819384 (2.559693) time: 0.911596 data: 0.000168 max mem: 18817 Epoch: [281/300] [ 700/1251] eta: 0:08:45 lr: 0.000038 loss: 2.625427 (2.556367) time: 0.905970 data: 0.000174 max mem: 18817 Epoch: [281/300] [ 750/1251] eta: 0:07:58 lr: 0.000038 loss: 2.618186 (2.559273) time: 0.919099 data: 0.000188 max mem: 18817 Epoch: [281/300] [ 800/1251] eta: 0:07:10 lr: 0.000038 loss: 2.635115 (2.562048) time: 1.029384 data: 0.000174 max mem: 18817 Epoch: [281/300] [ 850/1251] eta: 0:06:21 lr: 0.000038 loss: 2.654677 (2.563182) time: 0.911590 data: 0.000167 max mem: 18817 Epoch: [281/300] [ 900/1251] eta: 0:05:34 lr: 0.000038 loss: 2.690205 (2.565106) time: 0.926921 data: 0.000182 max mem: 18817 Epoch: [281/300] [ 950/1251] eta: 0:04:46 lr: 0.000038 loss: 2.448566 (2.559611) time: 0.989735 data: 0.000194 max mem: 18817 Epoch: [281/300] [1000/1251] eta: 0:03:59 lr: 0.000037 loss: 2.714170 (2.561883) time: 0.976622 data: 0.000163 max mem: 18817 Epoch: [281/300] [1050/1251] eta: 0:03:11 lr: 0.000037 loss: 2.484715 (2.559592) time: 0.971804 data: 0.000164 max mem: 18817 Epoch: [281/300] [1100/1251] eta: 0:02:23 lr: 0.000037 loss: 2.810559 (2.567731) time: 0.917974 data: 0.000179 max mem: 18817 Epoch: [281/300] [1150/1251] eta: 0:01:36 lr: 0.000037 loss: 2.296273 (2.561885) time: 0.913647 data: 0.000177 max mem: 18817 Epoch: [281/300] [1200/1251] eta: 0:00:48 lr: 0.000037 loss: 2.788147 (2.562791) time: 0.963798 data: 0.000174 max mem: 18817 Epoch: [281/300] [1250/1251] eta: 0:00:00 lr: 0.000037 loss: 2.754279 (2.566185) time: 0.973442 data: 0.000779 max mem: 18817 Epoch: [281/300] Total time: 0:19:51 (0.952647 s / it) Averaged stats: lr: 0.000037 loss: 2.754279 (2.557267) Test: [ 0/49] eta: 0:01:56 loss: 0.502210 (0.502210) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 2.387075 data: 1.363760 max mem: 18817 Test: [10/49] eta: 0:00:25 loss: 0.580789 (0.645252) acc1: 84.375000 (85.511364) acc5: 96.875000 (96.732955) time: 0.648439 data: 0.124108 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.718901 (0.675082) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.949405) time: 0.413295 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.677489 (0.676391) acc1: 82.812500 (84.576613) acc5: 96.875000 (97.076613) time: 0.351638 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.680698 (0.690759) acc1: 84.375000 (84.375000) acc5: 96.875000 (97.027439) time: 0.348807 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.680698 (0.688786) acc1: 84.375000 (84.352000) acc5: 96.875000 (97.024000) time: 0.343388 data: 0.000104 max mem: 18817 Test: Total time: 0:00:20 (0.417553 s / it) * Acc@1 84.016 Acc@5 96.766 loss 0.711 Max accuracy: 84.05% Epoch: [282/300] [ 0/1251] eta: 0:40:32 lr: 0.000037 loss: 2.778963 (2.778963) time: 1.944260 data: 1.080027 max mem: 18817 Epoch: [282/300] [ 50/1251] eta: 0:19:10 lr: 0.000037 loss: 2.563177 (2.404829) time: 0.911445 data: 0.000183 max mem: 18817 Epoch: [282/300] [ 100/1251] eta: 0:18:27 lr: 0.000037 loss: 2.470949 (2.458728) time: 0.998474 data: 0.000179 max mem: 18817 Epoch: [282/300] [ 150/1251] eta: 0:17:41 lr: 0.000037 loss: 2.702025 (2.494522) time: 0.961870 data: 0.000192 max mem: 18817 Epoch: [282/300] [ 200/1251] eta: 0:16:47 lr: 0.000037 loss: 2.592728 (2.488281) time: 0.979658 data: 0.000186 max mem: 18817 Epoch: [282/300] [ 250/1251] eta: 0:15:57 lr: 0.000037 loss: 2.705775 (2.517399) time: 0.920387 data: 0.000190 max mem: 18817 Epoch: [282/300] [ 300/1251] eta: 0:15:08 lr: 0.000037 loss: 2.408258 (2.514965) time: 0.918336 data: 0.000197 max mem: 18817 Epoch: [282/300] [ 350/1251] eta: 0:14:22 lr: 0.000037 loss: 2.579884 (2.517292) time: 0.965105 data: 0.000165 max mem: 18817 Epoch: [282/300] [ 400/1251] eta: 0:13:34 lr: 0.000036 loss: 2.575734 (2.520798) time: 0.953910 data: 0.000169 max mem: 18817 Epoch: [282/300] [ 450/1251] eta: 0:12:45 lr: 0.000036 loss: 2.474312 (2.513399) time: 0.975954 data: 0.000178 max mem: 18817 Epoch: [282/300] [ 500/1251] eta: 0:11:56 lr: 0.000036 loss: 2.637511 (2.511086) time: 0.920189 data: 0.000178 max mem: 18817 Epoch: [282/300] [ 550/1251] eta: 0:11:09 lr: 0.000036 loss: 2.474395 (2.503216) time: 0.916590 data: 0.000164 max mem: 18817 Epoch: [282/300] [ 600/1251] eta: 0:10:21 lr: 0.000036 loss: 2.362636 (2.503857) time: 0.972486 data: 0.000168 max mem: 18817 Epoch: [282/300] [ 650/1251] eta: 0:09:34 lr: 0.000036 loss: 2.592014 (2.501823) time: 1.003016 data: 0.000180 max mem: 18817 Epoch: [282/300] [ 700/1251] eta: 0:08:45 lr: 0.000036 loss: 2.704210 (2.508725) time: 0.959918 data: 0.000169 max mem: 18817 Epoch: [282/300] [ 750/1251] eta: 0:07:57 lr: 0.000036 loss: 2.570206 (2.506624) time: 0.918487 data: 0.000179 max mem: 18817 Epoch: [282/300] [ 800/1251] eta: 0:07:10 lr: 0.000036 loss: 2.534792 (2.508390) time: 0.905979 data: 0.000185 max mem: 18817 Epoch: [282/300] [ 850/1251] eta: 0:06:22 lr: 0.000036 loss: 2.784232 (2.513393) time: 0.975174 data: 0.000188 max mem: 18817 Epoch: [282/300] [ 900/1251] eta: 0:05:34 lr: 0.000036 loss: 2.604128 (2.517440) time: 1.005321 data: 0.000183 max mem: 18817 Epoch: [282/300] [ 950/1251] eta: 0:04:47 lr: 0.000036 loss: 2.632104 (2.521464) time: 0.973947 data: 0.000180 max mem: 18817 Epoch: [282/300] [1000/1251] eta: 0:03:59 lr: 0.000036 loss: 2.710850 (2.520384) time: 0.915433 data: 0.000165 max mem: 18817 Epoch: [282/300] [1050/1251] eta: 0:03:11 lr: 0.000036 loss: 2.487058 (2.518931) time: 0.913958 data: 0.000178 max mem: 18817 Epoch: [282/300] [1100/1251] eta: 0:02:24 lr: 0.000035 loss: 2.382387 (2.517196) time: 0.997519 data: 0.000284 max mem: 18817 Epoch: [282/300] [1150/1251] eta: 0:01:36 lr: 0.000035 loss: 2.708936 (2.518437) time: 0.962681 data: 0.000191 max mem: 18817 Epoch: [282/300] [1200/1251] eta: 0:00:48 lr: 0.000035 loss: 2.596868 (2.515682) time: 0.982322 data: 0.000167 max mem: 18817 Epoch: [282/300] [1250/1251] eta: 0:00:00 lr: 0.000035 loss: 2.457732 (2.516246) time: 0.938144 data: 0.000760 max mem: 18817 Epoch: [282/300] Total time: 0:19:54 (0.955024 s / it) Averaged stats: lr: 0.000035 loss: 2.457732 (2.528598) Test: [ 0/49] eta: 0:01:26 loss: 0.558570 (0.558570) acc1: 82.812500 (82.812500) acc5: 100.000000 (100.000000) time: 1.765754 data: 1.377126 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.567190 (0.658384) acc1: 82.812500 (85.085227) acc5: 96.875000 (96.732955) time: 0.484841 data: 0.125328 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.733080 (0.690068) acc1: 82.812500 (84.002976) acc5: 96.875000 (96.875000) time: 0.358176 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.685696 (0.684323) acc1: 82.812500 (84.122984) acc5: 96.875000 (97.026210) time: 0.360505 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.681570 (0.699681) acc1: 82.812500 (83.917683) acc5: 96.875000 (97.065549) time: 0.354406 data: 0.000123 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.683135 (0.696488) acc1: 82.812500 (84.096000) acc5: 96.875000 (97.088000) time: 0.344060 data: 0.000103 max mem: 18817 Test: Total time: 0:00:18 (0.384228 s / it) * Acc@1 84.028 Acc@5 96.734 loss 0.714 Max accuracy: 84.05% Epoch: [283/300] [ 0/1251] eta: 0:40:42 lr: 0.000035 loss: 2.293211 (2.293211) time: 1.952635 data: 1.096681 max mem: 18817 Epoch: [283/300] [ 50/1251] eta: 0:19:05 lr: 0.000035 loss: 2.550163 (2.481121) time: 0.954135 data: 0.000172 max mem: 18817 Epoch: [283/300] [ 100/1251] eta: 0:18:12 lr: 0.000035 loss: 2.710470 (2.557833) time: 0.925101 data: 0.000189 max mem: 18817 Epoch: [283/300] [ 150/1251] eta: 0:17:29 lr: 0.000035 loss: 2.701640 (2.570068) time: 0.921496 data: 0.000179 max mem: 18817 Epoch: [283/300] [ 200/1251] eta: 0:16:44 lr: 0.000035 loss: 2.645807 (2.577087) time: 0.977934 data: 0.000158 max mem: 18817 Epoch: [283/300] [ 250/1251] eta: 0:15:56 lr: 0.000035 loss: 2.413618 (2.543560) time: 1.017377 data: 0.000183 max mem: 18817 Epoch: [283/300] [ 300/1251] eta: 0:15:04 lr: 0.000035 loss: 2.569165 (2.540316) time: 0.943983 data: 0.000167 max mem: 18817 Epoch: [283/300] [ 350/1251] eta: 0:14:18 lr: 0.000035 loss: 2.559669 (2.543019) time: 0.930989 data: 0.000162 max mem: 18817 Epoch: [283/300] [ 400/1251] eta: 0:13:29 lr: 0.000035 loss: 2.720511 (2.560175) time: 0.903364 data: 0.000167 max mem: 18817 Epoch: [283/300] [ 450/1251] eta: 0:12:43 lr: 0.000035 loss: 2.819297 (2.567513) time: 0.964875 data: 0.000164 max mem: 18817 Epoch: [283/300] [ 500/1251] eta: 0:11:54 lr: 0.000035 loss: 2.349249 (2.556215) time: 0.976948 data: 0.000177 max mem: 18817 Epoch: [283/300] [ 550/1251] eta: 0:11:06 lr: 0.000034 loss: 2.739222 (2.563822) time: 0.922714 data: 0.000185 max mem: 18817 Epoch: [283/300] [ 600/1251] eta: 0:10:19 lr: 0.000034 loss: 2.496134 (2.561567) time: 0.922798 data: 0.000170 max mem: 18817 Epoch: [283/300] [ 650/1251] eta: 0:09:31 lr: 0.000034 loss: 2.353145 (2.551894) time: 0.907663 data: 0.000168 max mem: 18817 Epoch: [283/300] [ 700/1251] eta: 0:08:44 lr: 0.000034 loss: 2.749621 (2.550114) time: 0.975821 data: 0.000169 max mem: 18817 Epoch: [283/300] [ 750/1251] eta: 0:07:56 lr: 0.000034 loss: 2.560242 (2.546416) time: 0.960280 data: 0.000195 max mem: 18817 Epoch: [283/300] [ 800/1251] eta: 0:07:08 lr: 0.000034 loss: 2.660779 (2.548649) time: 0.913052 data: 0.000187 max mem: 18817 Epoch: [283/300] [ 850/1251] eta: 0:06:21 lr: 0.000034 loss: 2.492276 (2.544080) time: 0.934187 data: 0.000161 max mem: 18817 Epoch: [283/300] [ 900/1251] eta: 0:05:33 lr: 0.000034 loss: 2.634121 (2.550492) time: 0.959855 data: 0.000168 max mem: 18817 Epoch: [283/300] [ 950/1251] eta: 0:04:46 lr: 0.000034 loss: 2.567995 (2.552837) time: 1.013653 data: 0.000189 max mem: 18817 Epoch: [283/300] [1000/1251] eta: 0:03:58 lr: 0.000034 loss: 2.516120 (2.552762) time: 0.966737 data: 0.000173 max mem: 18817 Epoch: [283/300] [1050/1251] eta: 0:03:10 lr: 0.000034 loss: 2.579884 (2.551668) time: 0.920751 data: 0.000176 max mem: 18817 Epoch: [283/300] [1100/1251] eta: 0:02:23 lr: 0.000034 loss: 2.692539 (2.556779) time: 0.920661 data: 0.000167 max mem: 18817 Epoch: [283/300] [1150/1251] eta: 0:01:35 lr: 0.000034 loss: 2.626931 (2.557881) time: 0.955410 data: 0.000178 max mem: 18817 Epoch: [283/300] [1200/1251] eta: 0:00:48 lr: 0.000034 loss: 2.401929 (2.553177) time: 1.014282 data: 0.000180 max mem: 18817 Epoch: [283/300] [1250/1251] eta: 0:00:00 lr: 0.000033 loss: 2.744346 (2.559363) time: 0.941982 data: 0.000759 max mem: 18817 Epoch: [283/300] Total time: 0:19:48 (0.949640 s / it) Averaged stats: lr: 0.000033 loss: 2.744346 (2.553111) Test: [ 0/49] eta: 0:01:27 loss: 0.528257 (0.528257) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.794642 data: 1.383366 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.549370 (0.652513) acc1: 84.375000 (85.085227) acc5: 96.875000 (96.732955) time: 0.488357 data: 0.125913 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.734722 (0.683428) acc1: 82.812500 (84.226190) acc5: 96.875000 (96.875000) time: 0.354946 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.676805 (0.680843) acc1: 82.812500 (84.173387) acc5: 96.875000 (96.875000) time: 0.351900 data: 0.000121 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.700258 (0.698171) acc1: 84.375000 (83.917683) acc5: 96.875000 (96.875000) time: 0.348806 data: 0.000122 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.700258 (0.694990) acc1: 84.375000 (84.128000) acc5: 96.875000 (96.960000) time: 0.344675 data: 0.000099 max mem: 18817 Test: Total time: 0:00:18 (0.382592 s / it) * Acc@1 84.034 Acc@5 96.750 loss 0.716 Max accuracy: 84.05% Epoch: [284/300] [ 0/1251] eta: 0:55:24 lr: 0.000033 loss: 2.975925 (2.975925) time: 2.657218 data: 1.102273 max mem: 18817 Epoch: [284/300] [ 50/1251] eta: 0:19:40 lr: 0.000033 loss: 2.759911 (2.611338) time: 0.916164 data: 0.000162 max mem: 18817 Epoch: [284/300] [ 100/1251] eta: 0:18:36 lr: 0.000033 loss: 2.658312 (2.541791) time: 0.909598 data: 0.000168 max mem: 18817 Epoch: [284/300] [ 150/1251] eta: 0:17:45 lr: 0.000033 loss: 2.690735 (2.565873) time: 0.965699 data: 0.000180 max mem: 18817 Epoch: [284/300] [ 200/1251] eta: 0:16:51 lr: 0.000033 loss: 2.693248 (2.573718) time: 0.983789 data: 0.000180 max mem: 18817 Epoch: [284/300] [ 250/1251] eta: 0:15:59 lr: 0.000033 loss: 2.571689 (2.567042) time: 0.910248 data: 0.000178 max mem: 18817 Epoch: [284/300] [ 300/1251] eta: 0:15:12 lr: 0.000033 loss: 2.783914 (2.577763) time: 0.987916 data: 0.000161 max mem: 18817 Epoch: [284/300] [ 350/1251] eta: 0:14:21 lr: 0.000033 loss: 2.671945 (2.573264) time: 0.911813 data: 0.000159 max mem: 18817 Epoch: [284/300] [ 400/1251] eta: 0:13:35 lr: 0.000033 loss: 2.689554 (2.573589) time: 0.920117 data: 0.000172 max mem: 18817 Epoch: [284/300] [ 450/1251] eta: 0:12:47 lr: 0.000033 loss: 2.589835 (2.568485) time: 0.970370 data: 0.000160 max mem: 18817 Epoch: [284/300] [ 500/1251] eta: 0:11:57 lr: 0.000033 loss: 2.596618 (2.563551) time: 0.956694 data: 0.000166 max mem: 18817 Epoch: [284/300] [ 550/1251] eta: 0:11:08 lr: 0.000033 loss: 2.591417 (2.562419) time: 0.921323 data: 0.000175 max mem: 18817 Epoch: [284/300] [ 600/1251] eta: 0:10:20 lr: 0.000033 loss: 2.649397 (2.569225) time: 0.906830 data: 0.000171 max mem: 18817 Epoch: [284/300] [ 650/1251] eta: 0:09:32 lr: 0.000033 loss: 2.633985 (2.574073) time: 0.941623 data: 0.000177 max mem: 18817 Epoch: [284/300] [ 700/1251] eta: 0:08:45 lr: 0.000033 loss: 2.667485 (2.575148) time: 0.979048 data: 0.000161 max mem: 18817 Epoch: [284/300] [ 750/1251] eta: 0:07:57 lr: 0.000032 loss: 2.418942 (2.566979) time: 0.960992 data: 0.000167 max mem: 18817 Epoch: [284/300] [ 800/1251] eta: 0:07:08 lr: 0.000032 loss: 2.730623 (2.571952) time: 0.979642 data: 0.000175 max mem: 18817 Epoch: [284/300] [ 850/1251] eta: 0:06:21 lr: 0.000032 loss: 2.619730 (2.570524) time: 0.915463 data: 0.000187 max mem: 18817 Epoch: [284/300] [ 900/1251] eta: 0:05:33 lr: 0.000032 loss: 2.599829 (2.570489) time: 0.911949 data: 0.000176 max mem: 18817 Epoch: [284/300] [ 950/1251] eta: 0:04:46 lr: 0.000032 loss: 2.590719 (2.569257) time: 0.935214 data: 0.000170 max mem: 18817 Epoch: [284/300] [1000/1251] eta: 0:03:59 lr: 0.000032 loss: 2.749357 (2.570656) time: 0.996007 data: 0.000171 max mem: 18817 Epoch: [284/300] [1050/1251] eta: 0:03:11 lr: 0.000032 loss: 2.605237 (2.574892) time: 0.946259 data: 0.000208 max mem: 18817 Epoch: [284/300] [1100/1251] eta: 0:02:23 lr: 0.000032 loss: 2.620826 (2.570806) time: 0.918480 data: 0.000175 max mem: 18817 Epoch: [284/300] [1150/1251] eta: 0:01:36 lr: 0.000032 loss: 2.491513 (2.569326) time: 0.929264 data: 0.000177 max mem: 18817 Epoch: [284/300] [1200/1251] eta: 0:00:48 lr: 0.000032 loss: 2.578205 (2.568968) time: 0.982062 data: 0.000172 max mem: 18817 Epoch: [284/300] [1250/1251] eta: 0:00:00 lr: 0.000032 loss: 2.629150 (2.565174) time: 1.020391 data: 0.000752 max mem: 18817 Epoch: [284/300] Total time: 0:19:50 (0.952011 s / it) Averaged stats: lr: 0.000032 loss: 2.629150 (2.567243) Test: [ 0/49] eta: 0:01:27 loss: 0.516437 (0.516437) acc1: 82.812500 (82.812500) acc5: 100.000000 (100.000000) time: 1.783666 data: 1.385644 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.572056 (0.656361) acc1: 84.375000 (84.801136) acc5: 96.875000 (97.017045) time: 0.499275 data: 0.126091 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.735722 (0.687199) acc1: 82.812500 (84.002976) acc5: 96.875000 (97.023810) time: 0.365177 data: 0.000136 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.690527 (0.685222) acc1: 82.812500 (83.971774) acc5: 96.875000 (97.127016) time: 0.364151 data: 0.000129 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.690527 (0.698237) acc1: 84.375000 (83.727134) acc5: 96.875000 (97.141768) time: 0.357951 data: 0.000124 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.690527 (0.694748) acc1: 84.375000 (83.904000) acc5: 96.875000 (97.152000) time: 0.348876 data: 0.000107 max mem: 18817 Test: Total time: 0:00:19 (0.389310 s / it) * Acc@1 84.010 Acc@5 96.782 loss 0.713 Max accuracy: 84.05% Epoch: [285/300] [ 0/1251] eta: 0:40:26 lr: 0.000032 loss: 2.947770 (2.947770) time: 1.939888 data: 1.063031 max mem: 18817 Epoch: [285/300] [ 50/1251] eta: 0:19:08 lr: 0.000032 loss: 2.648079 (2.627279) time: 0.907195 data: 0.000180 max mem: 18817 Epoch: [285/300] [ 100/1251] eta: 0:18:21 lr: 0.000032 loss: 2.811435 (2.626494) time: 0.919604 data: 0.000182 max mem: 18817 Epoch: [285/300] [ 150/1251] eta: 0:17:34 lr: 0.000032 loss: 2.445194 (2.611187) time: 0.961952 data: 0.000173 max mem: 18817 Epoch: [285/300] [ 200/1251] eta: 0:16:42 lr: 0.000032 loss: 2.736270 (2.595068) time: 0.965032 data: 0.000186 max mem: 18817 Epoch: [285/300] [ 250/1251] eta: 0:15:52 lr: 0.000032 loss: 2.567250 (2.569443) time: 0.925408 data: 0.000177 max mem: 18817 Epoch: [285/300] [ 300/1251] eta: 0:15:06 lr: 0.000031 loss: 2.618844 (2.561712) time: 0.923784 data: 0.000163 max mem: 18817 Epoch: [285/300] [ 350/1251] eta: 0:14:19 lr: 0.000031 loss: 2.507534 (2.565745) time: 0.906015 data: 0.000181 max mem: 18817 Epoch: [285/300] [ 400/1251] eta: 0:13:31 lr: 0.000031 loss: 2.500291 (2.551747) time: 0.973112 data: 0.000168 max mem: 18817 Epoch: [285/300] [ 450/1251] eta: 0:12:42 lr: 0.000031 loss: 2.480222 (2.549453) time: 0.967829 data: 0.000166 max mem: 18817 Epoch: [285/300] [ 500/1251] eta: 0:11:53 lr: 0.000031 loss: 2.517432 (2.543323) time: 0.917853 data: 0.000171 max mem: 18817 Epoch: [285/300] [ 550/1251] eta: 0:11:06 lr: 0.000031 loss: 2.786985 (2.545141) time: 0.925994 data: 0.000183 max mem: 18817 Epoch: [285/300] [ 600/1251] eta: 0:10:18 lr: 0.000031 loss: 2.577010 (2.540917) time: 0.941988 data: 0.000157 max mem: 18817 Epoch: [285/300] [ 650/1251] eta: 0:09:31 lr: 0.000031 loss: 2.619723 (2.547066) time: 1.022976 data: 0.000178 max mem: 18817 Epoch: [285/300] [ 700/1251] eta: 0:08:43 lr: 0.000031 loss: 2.656779 (2.553759) time: 0.962297 data: 0.000180 max mem: 18817 Epoch: [285/300] [ 750/1251] eta: 0:07:55 lr: 0.000031 loss: 2.561453 (2.557146) time: 0.914648 data: 0.000173 max mem: 18817 Epoch: [285/300] [ 800/1251] eta: 0:07:07 lr: 0.000031 loss: 2.591820 (2.556144) time: 0.917795 data: 0.000172 max mem: 18817 Epoch: [285/300] [ 850/1251] eta: 0:06:21 lr: 0.000031 loss: 2.674673 (2.560417) time: 0.990113 data: 0.000181 max mem: 18817 Epoch: [285/300] [ 900/1251] eta: 0:05:33 lr: 0.000031 loss: 2.679090 (2.559194) time: 1.042769 data: 0.000179 max mem: 18817 Epoch: [285/300] [ 950/1251] eta: 0:04:45 lr: 0.000031 loss: 2.549396 (2.552366) time: 0.947717 data: 0.000179 max mem: 18817 Epoch: [285/300] [1000/1251] eta: 0:03:58 lr: 0.000031 loss: 2.472461 (2.553741) time: 0.918907 data: 0.000165 max mem: 18817 Epoch: [285/300] [1050/1251] eta: 0:03:10 lr: 0.000031 loss: 2.741923 (2.555171) time: 0.928058 data: 0.000192 max mem: 18817 Epoch: [285/300] [1100/1251] eta: 0:02:23 lr: 0.000030 loss: 2.539452 (2.556261) time: 0.991817 data: 0.000168 max mem: 18817 Epoch: [285/300] [1150/1251] eta: 0:01:36 lr: 0.000030 loss: 2.474758 (2.551488) time: 0.994558 data: 0.000187 max mem: 18817 Epoch: [285/300] [1200/1251] eta: 0:00:48 lr: 0.000030 loss: 2.571485 (2.550933) time: 0.935356 data: 0.000190 max mem: 18817 Epoch: [285/300] [1250/1251] eta: 0:00:00 lr: 0.000030 loss: 2.483904 (2.548256) time: 0.924401 data: 0.000764 max mem: 18817 Epoch: [285/300] Total time: 0:19:49 (0.950627 s / it) Averaged stats: lr: 0.000030 loss: 2.483904 (2.545239) Test: [ 0/49] eta: 0:01:25 loss: 0.496655 (0.496655) acc1: 85.937500 (85.937500) acc5: 98.437500 (98.437500) time: 1.744011 data: 1.351034 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.557255 (0.648768) acc1: 84.375000 (84.801136) acc5: 96.875000 (96.732955) time: 0.481500 data: 0.122960 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.715764 (0.679994) acc1: 82.812500 (84.151786) acc5: 96.875000 (96.726190) time: 0.353826 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.681796 (0.678806) acc1: 82.812500 (84.122984) acc5: 96.875000 (96.925403) time: 0.451879 data: 0.000138 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.665277 (0.691465) acc1: 82.812500 (84.032012) acc5: 98.437500 (97.027439) time: 0.448830 data: 0.000150 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.665277 (0.689965) acc1: 82.812500 (84.096000) acc5: 98.437500 (97.056000) time: 0.343424 data: 0.000125 max mem: 18817 Test: Total time: 0:00:20 (0.421321 s / it) * Acc@1 83.938 Acc@5 96.754 loss 0.715 Max accuracy: 84.05% Epoch: [286/300] [ 0/1251] eta: 0:43:40 lr: 0.000030 loss: 1.831191 (1.831191) time: 2.094361 data: 1.209274 max mem: 18817 Epoch: [286/300] [ 50/1251] eta: 0:19:01 lr: 0.000030 loss: 2.663235 (2.524271) time: 0.956293 data: 0.000169 max mem: 18817 Epoch: [286/300] [ 100/1251] eta: 0:18:07 lr: 0.000030 loss: 2.764493 (2.582719) time: 0.923730 data: 0.000164 max mem: 18817 Epoch: [286/300] [ 150/1251] eta: 0:17:23 lr: 0.000030 loss: 2.558941 (2.579430) time: 0.915260 data: 0.000173 max mem: 18817 Epoch: [286/300] [ 200/1251] eta: 0:16:40 lr: 0.000030 loss: 2.686594 (2.578788) time: 0.972007 data: 0.000188 max mem: 18817 Epoch: [286/300] [ 250/1251] eta: 0:15:48 lr: 0.000030 loss: 2.556621 (2.588559) time: 0.963750 data: 0.000179 max mem: 18817 Epoch: [286/300] [ 300/1251] eta: 0:15:03 lr: 0.000030 loss: 2.529948 (2.591303) time: 0.981049 data: 0.000179 max mem: 18817 Epoch: [286/300] [ 350/1251] eta: 0:14:14 lr: 0.000030 loss: 2.759303 (2.591871) time: 0.907403 data: 0.000159 max mem: 18817 Epoch: [286/300] [ 400/1251] eta: 0:13:27 lr: 0.000030 loss: 2.568853 (2.600024) time: 0.903708 data: 0.000172 max mem: 18817 Epoch: [286/300] [ 450/1251] eta: 0:12:41 lr: 0.000030 loss: 2.646047 (2.600074) time: 0.974290 data: 0.000190 max mem: 18817 Epoch: [286/300] [ 500/1251] eta: 0:11:52 lr: 0.000030 loss: 2.772420 (2.597965) time: 0.956108 data: 0.000167 max mem: 18817 Epoch: [286/300] [ 550/1251] eta: 0:11:05 lr: 0.000030 loss: 2.666301 (2.590377) time: 0.955235 data: 0.000173 max mem: 18817 Epoch: [286/300] [ 600/1251] eta: 0:10:18 lr: 0.000030 loss: 2.645717 (2.589742) time: 0.927856 data: 0.000178 max mem: 18817 Epoch: [286/300] [ 650/1251] eta: 0:09:31 lr: 0.000030 loss: 2.497742 (2.585007) time: 0.961412 data: 0.000171 max mem: 18817 Epoch: [286/300] [ 700/1251] eta: 0:08:44 lr: 0.000029 loss: 2.821129 (2.587504) time: 0.979030 data: 0.000164 max mem: 18817 Epoch: [286/300] [ 750/1251] eta: 0:07:56 lr: 0.000029 loss: 2.622152 (2.585729) time: 0.969866 data: 0.000177 max mem: 18817 Epoch: [286/300] [ 800/1251] eta: 0:07:08 lr: 0.000029 loss: 2.653884 (2.586445) time: 0.947211 data: 0.000169 max mem: 18817 Epoch: [286/300] [ 850/1251] eta: 0:06:21 lr: 0.000029 loss: 2.537746 (2.584653) time: 0.917612 data: 0.000165 max mem: 18817 Epoch: [286/300] [ 900/1251] eta: 0:05:34 lr: 0.000029 loss: 2.621832 (2.582840) time: 0.978081 data: 0.000164 max mem: 18817 Epoch: [286/300] [ 950/1251] eta: 0:04:46 lr: 0.000029 loss: 2.582335 (2.579227) time: 0.984900 data: 0.000171 max mem: 18817 Epoch: [286/300] [1000/1251] eta: 0:03:59 lr: 0.000029 loss: 2.414976 (2.576730) time: 0.968365 data: 0.000183 max mem: 18817 Epoch: [286/300] [1050/1251] eta: 0:03:11 lr: 0.000029 loss: 2.580747 (2.579723) time: 0.927594 data: 0.000175 max mem: 18817 Epoch: [286/300] [1100/1251] eta: 0:02:23 lr: 0.000029 loss: 2.514367 (2.576328) time: 0.920087 data: 0.000164 max mem: 18817 Epoch: [286/300] [1150/1251] eta: 0:01:36 lr: 0.000029 loss: 2.414873 (2.572819) time: 0.958900 data: 0.000168 max mem: 18817 Epoch: [286/300] [1200/1251] eta: 0:00:48 lr: 0.000029 loss: 2.459351 (2.565620) time: 0.982718 data: 0.000178 max mem: 18817 Epoch: [286/300] [1250/1251] eta: 0:00:00 lr: 0.000029 loss: 2.668446 (2.567077) time: 0.983501 data: 0.000835 max mem: 18817 Epoch: [286/300] Total time: 0:19:52 (0.953570 s / it) Averaged stats: lr: 0.000029 loss: 2.668446 (2.567603) Test: [ 0/49] eta: 0:01:21 loss: 0.518883 (0.518883) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.665191 data: 1.224260 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.574498 (0.653736) acc1: 84.375000 (84.943182) acc5: 96.875000 (96.732955) time: 0.477970 data: 0.111441 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.702417 (0.678256) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.726190) time: 0.462805 data: 0.000139 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.673161 (0.677106) acc1: 84.375000 (84.274194) acc5: 96.875000 (96.925403) time: 0.458886 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.673161 (0.692993) acc1: 84.375000 (84.032012) acc5: 96.875000 (97.027439) time: 0.349494 data: 0.000135 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.673161 (0.689847) acc1: 84.375000 (84.192000) acc5: 96.875000 (97.088000) time: 0.344479 data: 0.000113 max mem: 18817 Test: Total time: 0:00:20 (0.423388 s / it) * Acc@1 84.036 Acc@5 96.780 loss 0.713 Max accuracy: 84.05% Epoch: [287/300] [ 0/1251] eta: 0:43:17 lr: 0.000029 loss: 2.982194 (2.982194) time: 2.076625 data: 1.204875 max mem: 18817 Epoch: [287/300] [ 50/1251] eta: 0:19:09 lr: 0.000029 loss: 2.752724 (2.661291) time: 0.983727 data: 0.000173 max mem: 18817 Epoch: [287/300] [ 100/1251] eta: 0:18:12 lr: 0.000029 loss: 2.742723 (2.597091) time: 0.953232 data: 0.000176 max mem: 18817 Epoch: [287/300] [ 150/1251] eta: 0:17:25 lr: 0.000029 loss: 2.570072 (2.570932) time: 0.919736 data: 0.000168 max mem: 18817 Epoch: [287/300] [ 200/1251] eta: 0:16:41 lr: 0.000029 loss: 2.183303 (2.537494) time: 0.972413 data: 0.000176 max mem: 18817 Epoch: [287/300] [ 250/1251] eta: 0:15:51 lr: 0.000029 loss: 2.479915 (2.524387) time: 0.971390 data: 0.000186 max mem: 18817 Epoch: [287/300] [ 300/1251] eta: 0:15:01 lr: 0.000029 loss: 2.720307 (2.527586) time: 0.904960 data: 0.000175 max mem: 18817 Epoch: [287/300] [ 350/1251] eta: 0:14:15 lr: 0.000028 loss: 2.615050 (2.517736) time: 0.928888 data: 0.000179 max mem: 18817 Epoch: [287/300] [ 400/1251] eta: 0:13:30 lr: 0.000028 loss: 2.580681 (2.529916) time: 0.920957 data: 0.000185 max mem: 18817 Epoch: [287/300] [ 450/1251] eta: 0:12:42 lr: 0.000028 loss: 2.613144 (2.535817) time: 0.957247 data: 0.000201 max mem: 18817 Epoch: [287/300] [ 500/1251] eta: 0:11:53 lr: 0.000028 loss: 2.590269 (2.537359) time: 0.952010 data: 0.000179 max mem: 18817 Epoch: [287/300] [ 550/1251] eta: 0:11:06 lr: 0.000028 loss: 2.468722 (2.531651) time: 0.925393 data: 0.000180 max mem: 18817 Epoch: [287/300] [ 600/1251] eta: 0:10:18 lr: 0.000028 loss: 2.648339 (2.526120) time: 0.921071 data: 0.000170 max mem: 18817 Epoch: [287/300] [ 650/1251] eta: 0:09:31 lr: 0.000028 loss: 2.718302 (2.524503) time: 1.015101 data: 0.000186 max mem: 18817 Epoch: [287/300] [ 700/1251] eta: 0:08:43 lr: 0.000028 loss: 2.687831 (2.524002) time: 0.960655 data: 0.000182 max mem: 18817 Epoch: [287/300] [ 750/1251] eta: 0:07:56 lr: 0.000028 loss: 2.643229 (2.525635) time: 0.939561 data: 0.000180 max mem: 18817 Epoch: [287/300] [ 800/1251] eta: 0:07:09 lr: 0.000028 loss: 2.726776 (2.531211) time: 0.924947 data: 0.000174 max mem: 18817 Epoch: [287/300] [ 850/1251] eta: 0:06:21 lr: 0.000028 loss: 2.434833 (2.528816) time: 0.968578 data: 0.000173 max mem: 18817 Epoch: [287/300] [ 900/1251] eta: 0:05:33 lr: 0.000028 loss: 2.683485 (2.534445) time: 0.958759 data: 0.000173 max mem: 18817 Epoch: [287/300] [ 950/1251] eta: 0:04:46 lr: 0.000028 loss: 2.635171 (2.538128) time: 0.974952 data: 0.000168 max mem: 18817 Epoch: [287/300] [1000/1251] eta: 0:03:58 lr: 0.000028 loss: 2.742171 (2.543382) time: 0.917404 data: 0.000164 max mem: 18817 Epoch: [287/300] [1050/1251] eta: 0:03:11 lr: 0.000028 loss: 2.700558 (2.545532) time: 0.908216 data: 0.000176 max mem: 18817 Epoch: [287/300] [1100/1251] eta: 0:02:23 lr: 0.000028 loss: 2.432539 (2.544410) time: 0.966394 data: 0.000175 max mem: 18817 Epoch: [287/300] [1150/1251] eta: 0:01:36 lr: 0.000028 loss: 2.419131 (2.545138) time: 0.971172 data: 0.000188 max mem: 18817 Epoch: [287/300] [1200/1251] eta: 0:00:48 lr: 0.000028 loss: 2.627517 (2.546358) time: 0.979275 data: 0.000187 max mem: 18817 Epoch: [287/300] [1250/1251] eta: 0:00:00 lr: 0.000028 loss: 2.552247 (2.545838) time: 0.913822 data: 0.000758 max mem: 18817 Epoch: [287/300] Total time: 0:19:50 (0.951558 s / it) Averaged stats: lr: 0.000028 loss: 2.552247 (2.551064) Test: [ 0/49] eta: 0:01:16 loss: 0.481446 (0.481446) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.561446 data: 1.155886 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.564433 (0.652322) acc1: 84.375000 (85.511364) acc5: 96.875000 (96.875000) time: 0.468516 data: 0.105223 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.719935 (0.680778) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.875000) time: 0.358392 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.678227 (0.680377) acc1: 82.812500 (84.324597) acc5: 96.875000 (96.925403) time: 0.354627 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.678227 (0.693727) acc1: 82.812500 (84.108232) acc5: 96.875000 (96.989329) time: 0.356932 data: 0.000127 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.678227 (0.690066) acc1: 82.812500 (84.384000) acc5: 96.875000 (97.024000) time: 0.351555 data: 0.000100 max mem: 18817 Test: Total time: 0:00:18 (0.382324 s / it) * Acc@1 83.994 Acc@5 96.754 loss 0.712 Max accuracy: 84.05% Epoch: [288/300] [ 0/1251] eta: 0:39:40 lr: 0.000028 loss: 2.315171 (2.315171) time: 1.902853 data: 1.030143 max mem: 18817 Epoch: [288/300] [ 50/1251] eta: 0:19:50 lr: 0.000027 loss: 2.687982 (2.554001) time: 0.973114 data: 0.000162 max mem: 18817 Epoch: [288/300] [ 100/1251] eta: 0:18:29 lr: 0.000027 loss: 2.569251 (2.506262) time: 0.975426 data: 0.000181 max mem: 18817 Epoch: [288/300] [ 150/1251] eta: 0:17:35 lr: 0.000027 loss: 2.638081 (2.532768) time: 0.927746 data: 0.000177 max mem: 18817 Epoch: [288/300] [ 200/1251] eta: 0:16:49 lr: 0.000027 loss: 2.593826 (2.541273) time: 0.928705 data: 0.000164 max mem: 18817 Epoch: [288/300] [ 250/1251] eta: 0:16:02 lr: 0.000027 loss: 2.497815 (2.530721) time: 0.986574 data: 0.000178 max mem: 18817 Epoch: [288/300] [ 300/1251] eta: 0:15:14 lr: 0.000027 loss: 2.605937 (2.534387) time: 0.994775 data: 0.000160 max mem: 18817 Epoch: [288/300] [ 350/1251] eta: 0:14:20 lr: 0.000027 loss: 2.479656 (2.533575) time: 0.909480 data: 0.000164 max mem: 18817 Epoch: [288/300] [ 400/1251] eta: 0:13:32 lr: 0.000027 loss: 2.407190 (2.527972) time: 0.915109 data: 0.000162 max mem: 18817 Epoch: [288/300] [ 450/1251] eta: 0:12:45 lr: 0.000027 loss: 2.754743 (2.543925) time: 0.957365 data: 0.000173 max mem: 18817 Epoch: [288/300] [ 500/1251] eta: 0:11:55 lr: 0.000027 loss: 2.457477 (2.547078) time: 0.958623 data: 0.000173 max mem: 18817 Epoch: [288/300] [ 550/1251] eta: 0:11:07 lr: 0.000027 loss: 2.768798 (2.547853) time: 0.910899 data: 0.000192 max mem: 18817 Epoch: [288/300] [ 600/1251] eta: 0:10:20 lr: 0.000027 loss: 2.796140 (2.558364) time: 0.919977 data: 0.000192 max mem: 18817 Epoch: [288/300] [ 650/1251] eta: 0:09:32 lr: 0.000027 loss: 2.412556 (2.563653) time: 0.903599 data: 0.000163 max mem: 18817 Epoch: [288/300] [ 700/1251] eta: 0:08:45 lr: 0.000027 loss: 2.475914 (2.564876) time: 0.985064 data: 0.000171 max mem: 18817 Epoch: [288/300] [ 750/1251] eta: 0:07:57 lr: 0.000027 loss: 2.622581 (2.562394) time: 0.968363 data: 0.000174 max mem: 18817 Epoch: [288/300] [ 800/1251] eta: 0:07:09 lr: 0.000027 loss: 2.466744 (2.557472) time: 0.913060 data: 0.000173 max mem: 18817 Epoch: [288/300] [ 850/1251] eta: 0:06:22 lr: 0.000027 loss: 2.708116 (2.561935) time: 0.925561 data: 0.000170 max mem: 18817 Epoch: [288/300] [ 900/1251] eta: 0:05:34 lr: 0.000027 loss: 2.597125 (2.557485) time: 0.958012 data: 0.000184 max mem: 18817 Epoch: [288/300] [ 950/1251] eta: 0:04:46 lr: 0.000027 loss: 2.599258 (2.553913) time: 1.001114 data: 0.000188 max mem: 18817 Epoch: [288/300] [1000/1251] eta: 0:03:58 lr: 0.000027 loss: 2.685317 (2.554238) time: 0.947515 data: 0.000159 max mem: 18817 Epoch: [288/300] [1050/1251] eta: 0:03:11 lr: 0.000026 loss: 2.618366 (2.555340) time: 0.926159 data: 0.000183 max mem: 18817 Epoch: [288/300] [1100/1251] eta: 0:02:23 lr: 0.000026 loss: 2.587280 (2.556903) time: 0.931922 data: 0.000177 max mem: 18817 Epoch: [288/300] [1150/1251] eta: 0:01:36 lr: 0.000026 loss: 2.770977 (2.558299) time: 0.966046 data: 0.000192 max mem: 18817 Epoch: [288/300] [1200/1251] eta: 0:00:48 lr: 0.000026 loss: 2.431859 (2.556644) time: 1.024536 data: 0.000169 max mem: 18817 Epoch: [288/300] [1250/1251] eta: 0:00:00 lr: 0.000026 loss: 2.737923 (2.555068) time: 0.997888 data: 0.000774 max mem: 18817 Epoch: [288/300] Total time: 0:19:51 (0.952807 s / it) Averaged stats: lr: 0.000026 loss: 2.737923 (2.552027) Test: [ 0/49] eta: 0:01:19 loss: 0.507807 (0.507807) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.620378 data: 1.197683 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.571498 (0.647211) acc1: 84.375000 (85.227273) acc5: 96.875000 (96.732955) time: 0.472470 data: 0.109010 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.703991 (0.675781) acc1: 84.375000 (84.747024) acc5: 96.875000 (96.726190) time: 0.354881 data: 0.000131 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.678758 (0.673810) acc1: 84.375000 (84.677419) acc5: 96.875000 (96.975806) time: 0.352091 data: 0.000128 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.678758 (0.688254) acc1: 84.375000 (84.336890) acc5: 98.437500 (97.065549) time: 0.349652 data: 0.000126 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.678758 (0.684646) acc1: 84.375000 (84.480000) acc5: 96.875000 (97.088000) time: 0.344134 data: 0.000101 max mem: 18817 Test: Total time: 0:00:18 (0.378188 s / it) * Acc@1 83.980 Acc@5 96.758 loss 0.707 Max accuracy: 84.05% Epoch: [289/300] [ 0/1251] eta: 0:42:37 lr: 0.000026 loss: 1.905300 (1.905300) time: 2.044215 data: 1.181471 max mem: 18817 Epoch: [289/300] [ 50/1251] eta: 0:19:04 lr: 0.000026 loss: 2.458199 (2.445572) time: 0.906372 data: 0.000148 max mem: 18817 Epoch: [289/300] [ 100/1251] eta: 0:18:26 lr: 0.000026 loss: 2.769440 (2.539664) time: 0.915062 data: 0.000166 max mem: 18817 Epoch: [289/300] [ 150/1251] eta: 0:17:43 lr: 0.000026 loss: 2.462921 (2.543264) time: 0.983912 data: 0.000171 max mem: 18817 Epoch: [289/300] [ 200/1251] eta: 0:16:40 lr: 0.000026 loss: 2.624102 (2.557423) time: 0.903409 data: 0.000184 max mem: 18817 Epoch: [289/300] [ 250/1251] eta: 0:15:55 lr: 0.000026 loss: 2.782293 (2.565299) time: 0.915888 data: 0.000169 max mem: 18817 Epoch: [289/300] [ 300/1251] eta: 0:15:06 lr: 0.000026 loss: 2.484441 (2.560610) time: 0.906800 data: 0.000168 max mem: 18817 Epoch: [289/300] [ 350/1251] eta: 0:14:18 lr: 0.000026 loss: 2.468331 (2.556963) time: 0.943035 data: 0.000152 max mem: 18817 Epoch: [289/300] [ 400/1251] eta: 0:13:28 lr: 0.000026 loss: 2.338712 (2.548122) time: 0.950317 data: 0.000196 max mem: 18817 Epoch: [289/300] [ 450/1251] eta: 0:12:41 lr: 0.000026 loss: 2.654407 (2.547262) time: 0.927122 data: 0.000184 max mem: 18817 Epoch: [289/300] [ 500/1251] eta: 0:11:53 lr: 0.000026 loss: 2.703135 (2.544621) time: 0.922692 data: 0.000186 max mem: 18817 Epoch: [289/300] [ 550/1251] eta: 0:11:06 lr: 0.000026 loss: 2.369603 (2.538259) time: 0.976070 data: 0.000197 max mem: 18817 Epoch: [289/300] [ 600/1251] eta: 0:10:19 lr: 0.000026 loss: 2.526995 (2.537458) time: 1.000748 data: 0.000182 max mem: 18817 Epoch: [289/300] [ 650/1251] eta: 0:09:32 lr: 0.000026 loss: 2.608130 (2.538664) time: 0.982039 data: 0.000175 max mem: 18817 Epoch: [289/300] [ 700/1251] eta: 0:08:44 lr: 0.000026 loss: 2.618513 (2.538039) time: 0.925958 data: 0.000164 max mem: 18817 Epoch: [289/300] [ 750/1251] eta: 0:07:56 lr: 0.000026 loss: 2.634253 (2.539968) time: 0.913837 data: 0.000171 max mem: 18817 Epoch: [289/300] [ 800/1251] eta: 0:07:09 lr: 0.000026 loss: 2.629038 (2.544082) time: 0.981015 data: 0.000179 max mem: 18817 Epoch: [289/300] [ 850/1251] eta: 0:06:22 lr: 0.000026 loss: 2.517110 (2.537248) time: 1.036663 data: 0.000182 max mem: 18817 Epoch: [289/300] [ 900/1251] eta: 0:05:34 lr: 0.000025 loss: 2.639283 (2.534583) time: 0.957944 data: 0.000199 max mem: 18817 Epoch: [289/300] [ 950/1251] eta: 0:04:46 lr: 0.000025 loss: 2.519383 (2.536985) time: 0.918600 data: 0.000170 max mem: 18817 Epoch: [289/300] [1000/1251] eta: 0:03:58 lr: 0.000025 loss: 2.621442 (2.539255) time: 0.907763 data: 0.000158 max mem: 18817 Epoch: [289/300] [1050/1251] eta: 0:03:11 lr: 0.000025 loss: 2.541501 (2.535878) time: 0.974318 data: 0.000171 max mem: 18817 Epoch: [289/300] [1100/1251] eta: 0:02:23 lr: 0.000025 loss: 2.629297 (2.539061) time: 1.028831 data: 0.000171 max mem: 18817 Epoch: [289/300] [1150/1251] eta: 0:01:36 lr: 0.000025 loss: 2.578573 (2.533483) time: 0.958735 data: 0.000168 max mem: 18817 Epoch: [289/300] [1200/1251] eta: 0:00:48 lr: 0.000025 loss: 2.576120 (2.536296) time: 0.924073 data: 0.000175 max mem: 18817 Epoch: [289/300] [1250/1251] eta: 0:00:00 lr: 0.000025 loss: 2.908802 (2.542869) time: 0.913350 data: 0.000776 max mem: 18817 Epoch: [289/300] Total time: 0:19:50 (0.951927 s / it) Averaged stats: lr: 0.000025 loss: 2.908802 (2.541565) Test: [ 0/49] eta: 0:01:29 loss: 0.502986 (0.502986) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.833022 data: 1.456453 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.599733 (0.653719) acc1: 84.375000 (84.801136) acc5: 96.875000 (96.875000) time: 0.492650 data: 0.132535 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.705308 (0.679859) acc1: 84.375000 (84.449405) acc5: 96.875000 (97.098214) time: 0.355693 data: 0.000137 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.684877 (0.681556) acc1: 84.375000 (84.375000) acc5: 96.875000 (97.076613) time: 0.352282 data: 0.000127 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.688111 (0.696935) acc1: 84.375000 (84.184451) acc5: 96.875000 (96.989329) time: 0.349579 data: 0.000118 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.688111 (0.693225) acc1: 84.375000 (84.480000) acc5: 96.875000 (97.024000) time: 0.344436 data: 0.000099 max mem: 18817 Test: Total time: 0:00:18 (0.383072 s / it) * Acc@1 84.030 Acc@5 96.736 loss 0.718 Max accuracy: 84.05% Epoch: [290/300] [ 0/1251] eta: 0:41:38 lr: 0.000025 loss: 2.120734 (2.120734) time: 1.997341 data: 1.129091 max mem: 18817 Epoch: [290/300] [ 50/1251] eta: 0:19:10 lr: 0.000025 loss: 2.563554 (2.480803) time: 0.965210 data: 0.000189 max mem: 18817 Epoch: [290/300] [ 100/1251] eta: 0:18:09 lr: 0.000025 loss: 2.597099 (2.536570) time: 0.967462 data: 0.000180 max mem: 18817 Epoch: [290/300] [ 150/1251] eta: 0:17:31 lr: 0.000025 loss: 2.752008 (2.530802) time: 0.958279 data: 0.000170 max mem: 18817 Epoch: [290/300] [ 200/1251] eta: 0:16:39 lr: 0.000025 loss: 2.565452 (2.534173) time: 0.918593 data: 0.000183 max mem: 18817 Epoch: [290/300] [ 250/1251] eta: 0:15:53 lr: 0.000025 loss: 2.458489 (2.529910) time: 0.917986 data: 0.000166 max mem: 18817 Epoch: [290/300] [ 300/1251] eta: 0:15:07 lr: 0.000025 loss: 2.619976 (2.526869) time: 0.976169 data: 0.000175 max mem: 18817 Epoch: [290/300] [ 350/1251] eta: 0:14:18 lr: 0.000025 loss: 2.565542 (2.520306) time: 0.978958 data: 0.000163 max mem: 18817 Epoch: [290/300] [ 400/1251] eta: 0:13:33 lr: 0.000025 loss: 2.858535 (2.536049) time: 1.000865 data: 0.000176 max mem: 18817 Epoch: [290/300] [ 450/1251] eta: 0:12:44 lr: 0.000025 loss: 2.669331 (2.534545) time: 0.924656 data: 0.000177 max mem: 18817 Epoch: [290/300] [ 500/1251] eta: 0:11:58 lr: 0.000025 loss: 2.690413 (2.543131) time: 0.914995 data: 0.000166 max mem: 18817 Epoch: [290/300] [ 550/1251] eta: 0:11:09 lr: 0.000025 loss: 2.764042 (2.540736) time: 0.908573 data: 0.000175 max mem: 18817 Epoch: [290/300] [ 600/1251] eta: 0:10:21 lr: 0.000025 loss: 2.611542 (2.550698) time: 0.953566 data: 0.000188 max mem: 18817 Epoch: [290/300] [ 650/1251] eta: 0:09:33 lr: 0.000025 loss: 2.597805 (2.558563) time: 1.011844 data: 0.000182 max mem: 18817 Epoch: [290/300] [ 700/1251] eta: 0:08:45 lr: 0.000025 loss: 2.535917 (2.558374) time: 0.986662 data: 0.000178 max mem: 18817 Epoch: [290/300] [ 750/1251] eta: 0:07:57 lr: 0.000025 loss: 2.439780 (2.552251) time: 0.916123 data: 0.000180 max mem: 18817 Epoch: [290/300] [ 800/1251] eta: 0:07:09 lr: 0.000025 loss: 2.619318 (2.546839) time: 0.908213 data: 0.000181 max mem: 18817 Epoch: [290/300] [ 850/1251] eta: 0:06:22 lr: 0.000024 loss: 2.733512 (2.553772) time: 0.958152 data: 0.000189 max mem: 18817 Epoch: [290/300] [ 900/1251] eta: 0:05:34 lr: 0.000024 loss: 2.680746 (2.551968) time: 0.972519 data: 0.000198 max mem: 18817 Epoch: [290/300] [ 950/1251] eta: 0:04:46 lr: 0.000024 loss: 2.667930 (2.548785) time: 0.916078 data: 0.000185 max mem: 18817 Epoch: [290/300] [1000/1251] eta: 0:03:58 lr: 0.000024 loss: 2.536421 (2.548274) time: 0.911629 data: 0.000172 max mem: 18817 Epoch: [290/300] [1050/1251] eta: 0:03:11 lr: 0.000024 loss: 2.588494 (2.548078) time: 0.902531 data: 0.000229 max mem: 18817 Epoch: [290/300] [1100/1251] eta: 0:02:23 lr: 0.000024 loss: 2.449333 (2.544187) time: 0.975249 data: 0.000177 max mem: 18817 Epoch: [290/300] [1150/1251] eta: 0:01:36 lr: 0.000024 loss: 2.556126 (2.541657) time: 0.964628 data: 0.000169 max mem: 18817 Epoch: [290/300] [1200/1251] eta: 0:00:48 lr: 0.000024 loss: 2.523475 (2.540617) time: 0.922437 data: 0.000185 max mem: 18817 Epoch: [290/300] [1250/1251] eta: 0:00:00 lr: 0.000024 loss: 2.628885 (2.542706) time: 0.919843 data: 0.000754 max mem: 18817 Epoch: [290/300] Total time: 0:19:50 (0.951512 s / it) Averaged stats: lr: 0.000024 loss: 2.628885 (2.542789) Test: [ 0/49] eta: 0:01:18 loss: 0.515335 (0.515335) acc1: 87.500000 (87.500000) acc5: 100.000000 (100.000000) time: 1.611356 data: 1.191903 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.553338 (0.653046) acc1: 84.375000 (84.801136) acc5: 96.875000 (97.017045) time: 0.483327 data: 0.108504 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.704227 (0.683189) acc1: 84.375000 (84.449405) acc5: 96.875000 (96.949405) time: 0.361013 data: 0.000143 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.691423 (0.683226) acc1: 84.375000 (84.375000) acc5: 96.875000 (97.026210) time: 0.352192 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.687870 (0.697689) acc1: 84.375000 (84.298780) acc5: 96.875000 (96.951220) time: 0.434650 data: 0.000132 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.687870 (0.694268) acc1: 84.375000 (84.480000) acc5: 96.875000 (96.992000) time: 0.443950 data: 0.000109 max mem: 18817 Test: Total time: 0:00:20 (0.421539 s / it) * Acc@1 84.054 Acc@5 96.724 loss 0.717 Max accuracy: 84.05% uploading checkpoint experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0290.pth to hdfs://haruna/home/byte_arnold_hl_vc/user/guoyuanfan/HCSC/experiments/classification/imagenet1k/eurnet_base_300eps_reproduce_2nodes_fixed_re5/checkpoint_0290.pth Epoch: [291/300] [ 0/1251] eta: 0:40:02 lr: 0.000024 loss: 1.708308 (1.708308) time: 1.920446 data: 1.050038 max mem: 18817 Epoch: [291/300] [ 50/1251] eta: 0:19:05 lr: 0.000024 loss: 2.747865 (2.598007) time: 0.914940 data: 0.000170 max mem: 18817 Epoch: [291/300] [ 100/1251] eta: 0:18:28 lr: 0.000024 loss: 2.578940 (2.539870) time: 0.935137 data: 0.000184 max mem: 18817 Epoch: [291/300] [ 150/1251] eta: 0:17:39 lr: 0.000024 loss: 2.630492 (2.553100) time: 0.970400 data: 0.000183 max mem: 18817 Epoch: [291/300] [ 200/1251] eta: 0:16:51 lr: 0.000024 loss: 2.608272 (2.550039) time: 1.035159 data: 0.000175 max mem: 18817 Epoch: [291/300] [ 250/1251] eta: 0:15:58 lr: 0.000024 loss: 2.592994 (2.549450) time: 0.969504 data: 0.000182 max mem: 18817 Epoch: [291/300] [ 300/1251] eta: 0:15:06 lr: 0.000024 loss: 2.649000 (2.557397) time: 0.916396 data: 0.000172 max mem: 18817 Epoch: [291/300] [ 350/1251] eta: 0:14:21 lr: 0.000024 loss: 2.625198 (2.556380) time: 0.928545 data: 0.000165 max mem: 18817 Epoch: [291/300] [ 400/1251] eta: 0:13:33 lr: 0.000024 loss: 2.640184 (2.557558) time: 0.964584 data: 0.000177 max mem: 18817 Epoch: [291/300] [ 450/1251] eta: 0:12:45 lr: 0.000024 loss: 2.673920 (2.558089) time: 1.024138 data: 0.000168 max mem: 18817 Epoch: [291/300] [ 500/1251] eta: 0:11:55 lr: 0.000024 loss: 2.387088 (2.547623) time: 0.960003 data: 0.000177 max mem: 18817 Epoch: [291/300] [ 550/1251] eta: 0:11:07 lr: 0.000024 loss: 2.729262 (2.550596) time: 0.923612 data: 0.000175 max mem: 18817 Epoch: [291/300] [ 600/1251] eta: 0:10:20 lr: 0.000024 loss: 2.510304 (2.550233) time: 0.919618 data: 0.000170 max mem: 18817 Epoch: [291/300] [ 650/1251] eta: 0:09:32 lr: 0.000024 loss: 2.538368 (2.555073) time: 0.950635 data: 0.000173 max mem: 18817 Epoch: [291/300] [ 700/1251] eta: 0:08:44 lr: 0.000024 loss: 2.610569 (2.555664) time: 0.983387 data: 0.000172 max mem: 18817 Epoch: [291/300] [ 750/1251] eta: 0:07:57 lr: 0.000024 loss: 2.686816 (2.553624) time: 0.961466 data: 0.000168 max mem: 18817 Epoch: [291/300] [ 800/1251] eta: 0:07:09 lr: 0.000024 loss: 2.762731 (2.553184) time: 0.917562 data: 0.000168 max mem: 18817 Epoch: [291/300] [ 850/1251] eta: 0:06:21 lr: 0.000024 loss: 2.714670 (2.558574) time: 0.916560 data: 0.000171 max mem: 18817 Epoch: [291/300] [ 900/1251] eta: 0:05:34 lr: 0.000024 loss: 2.528327 (2.551452) time: 0.968772 data: 0.000168 max mem: 18817 Epoch: [291/300] [ 950/1251] eta: 0:04:46 lr: 0.000023 loss: 2.643339 (2.552761) time: 0.966337 data: 0.000182 max mem: 18817 Epoch: [291/300] [1000/1251] eta: 0:03:58 lr: 0.000023 loss: 2.615057 (2.554462) time: 0.923257 data: 0.000166 max mem: 18817 Epoch: [291/300] [1050/1251] eta: 0:03:11 lr: 0.000023 loss: 2.479721 (2.549918) time: 0.923517 data: 0.000167 max mem: 18817 Epoch: [291/300] [1100/1251] eta: 0:02:23 lr: 0.000023 loss: 2.645116 (2.551587) time: 0.918672 data: 0.000179 max mem: 18817 Epoch: [291/300] [1150/1251] eta: 0:01:36 lr: 0.000023 loss: 2.492274 (2.549157) time: 0.955113 data: 0.000183 max mem: 18817 Epoch: [291/300] [1200/1251] eta: 0:00:48 lr: 0.000023 loss: 2.498898 (2.545029) time: 0.971632 data: 0.000179 max mem: 18817 Epoch: [291/300] [1250/1251] eta: 0:00:00 lr: 0.000023 loss: 2.655797 (2.544848) time: 0.921016 data: 0.000760 max mem: 18817 Epoch: [291/300] Total time: 0:19:51 (0.952360 s / it) Averaged stats: lr: 0.000023 loss: 2.655797 (2.545840) Test: [ 0/49] eta: 0:01:14 loss: 0.513151 (0.513151) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.511564 data: 1.106168 max mem: 18817 Test: [10/49] eta: 0:00:25 loss: 0.579103 (0.649158) acc1: 84.375000 (84.801136) acc5: 96.875000 (97.017045) time: 0.656090 data: 0.100704 max mem: 18817 Test: [20/49] eta: 0:00:14 loss: 0.715268 (0.680740) acc1: 84.375000 (84.300595) acc5: 96.875000 (96.949405) time: 0.461066 data: 0.000140 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.680107 (0.680331) acc1: 84.375000 (84.223790) acc5: 96.875000 (97.026210) time: 0.352663 data: 0.000136 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.685322 (0.695049) acc1: 84.375000 (84.032012) acc5: 96.875000 (97.065549) time: 0.350680 data: 0.000132 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.685322 (0.691339) acc1: 84.375000 (84.192000) acc5: 96.875000 (97.120000) time: 0.344992 data: 0.000103 max mem: 18817 Test: Total time: 0:00:20 (0.420381 s / it) * Acc@1 84.012 Acc@5 96.760 loss 0.713 Max accuracy: 84.05% Epoch: [292/300] [ 0/1251] eta: 0:45:36 lr: 0.000023 loss: 2.333431 (2.333431) time: 2.187301 data: 1.140071 max mem: 18817 Epoch: [292/300] [ 50/1251] eta: 0:19:22 lr: 0.000023 loss: 2.636673 (2.523826) time: 0.912408 data: 0.000179 max mem: 18817 Epoch: [292/300] [ 100/1251] eta: 0:18:34 lr: 0.000023 loss: 2.726517 (2.596133) time: 1.001036 data: 0.000167 max mem: 18817 Epoch: [292/300] [ 150/1251] eta: 0:17:40 lr: 0.000023 loss: 2.607733 (2.592037) time: 1.015841 data: 0.000181 max mem: 18817 Epoch: [292/300] [ 200/1251] eta: 0:16:43 lr: 0.000023 loss: 2.555315 (2.571153) time: 0.955577 data: 0.000164 max mem: 18817 Epoch: [292/300] [ 250/1251] eta: 0:15:51 lr: 0.000023 loss: 2.512241 (2.569012) time: 0.910624 data: 0.000154 max mem: 18817 Epoch: [292/300] [ 300/1251] eta: 0:15:05 lr: 0.000023 loss: 2.673368 (2.560223) time: 0.912308 data: 0.000181 max mem: 18817 Epoch: [292/300] [ 350/1251] eta: 0:14:17 lr: 0.000023 loss: 2.639493 (2.551529) time: 0.962541 data: 0.000165 max mem: 18817 Epoch: [292/300] [ 400/1251] eta: 0:13:27 lr: 0.000023 loss: 2.499501 (2.546192) time: 0.934001 data: 0.000183 max mem: 18817 Epoch: [292/300] [ 450/1251] eta: 0:12:41 lr: 0.000023 loss: 2.552265 (2.541890) time: 0.987577 data: 0.000181 max mem: 18817 Epoch: [292/300] [ 500/1251] eta: 0:11:52 lr: 0.000023 loss: 2.698184 (2.539859) time: 0.914988 data: 0.000169 max mem: 18817 Epoch: [292/300] [ 550/1251] eta: 0:11:06 lr: 0.000023 loss: 2.653897 (2.539815) time: 0.915380 data: 0.000179 max mem: 18817 Epoch: [292/300] [ 600/1251] eta: 0:10:19 lr: 0.000023 loss: 2.506216 (2.534992) time: 0.981487 data: 0.000179 max mem: 18817 Epoch: [292/300] [ 650/1251] eta: 0:09:31 lr: 0.000023 loss: 2.644556 (2.539244) time: 0.971018 data: 0.000160 max mem: 18817 Epoch: [292/300] [ 700/1251] eta: 0:08:43 lr: 0.000023 loss: 2.652503 (2.537096) time: 0.957850 data: 0.000173 max mem: 18817 Epoch: [292/300] [ 750/1251] eta: 0:07:55 lr: 0.000023 loss: 2.565374 (2.537839) time: 0.911393 data: 0.000168 max mem: 18817 Epoch: [292/300] [ 800/1251] eta: 0:07:08 lr: 0.000023 loss: 2.554370 (2.534528) time: 0.937233 data: 0.000179 max mem: 18817 Epoch: [292/300] [ 850/1251] eta: 0:06:21 lr: 0.000023 loss: 2.547172 (2.531298) time: 0.975865 data: 0.000176 max mem: 18817 Epoch: [292/300] [ 900/1251] eta: 0:05:33 lr: 0.000023 loss: 2.583740 (2.535052) time: 0.968130 data: 0.000167 max mem: 18817 Epoch: [292/300] [ 950/1251] eta: 0:04:45 lr: 0.000023 loss: 2.654636 (2.537570) time: 0.928797 data: 0.000175 max mem: 18817 Epoch: [292/300] [1000/1251] eta: 0:03:58 lr: 0.000023 loss: 2.479279 (2.536986) time: 0.909290 data: 0.000161 max mem: 18817 Epoch: [292/300] [1050/1251] eta: 0:03:10 lr: 0.000023 loss: 2.504482 (2.528412) time: 0.970507 data: 0.000182 max mem: 18817 Epoch: [292/300] [1100/1251] eta: 0:02:23 lr: 0.000023 loss: 2.587729 (2.530934) time: 0.964118 data: 0.000172 max mem: 18817 Epoch: [292/300] [1150/1251] eta: 0:01:35 lr: 0.000023 loss: 2.689169 (2.531159) time: 0.957612 data: 0.000184 max mem: 18817 Epoch: [292/300] [1200/1251] eta: 0:00:48 lr: 0.000023 loss: 2.439528 (2.531846) time: 0.932626 data: 0.000179 max mem: 18817 Epoch: [292/300] [1250/1251] eta: 0:00:00 lr: 0.000022 loss: 2.452801 (2.531802) time: 0.957880 data: 0.000727 max mem: 18817 Epoch: [292/300] Total time: 0:19:47 (0.949236 s / it) Averaged stats: lr: 0.000022 loss: 2.452801 (2.532588) Test: [ 0/49] eta: 0:01:31 loss: 0.489525 (0.489525) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.874517 data: 1.428655 max mem: 18817 Test: [10/49] eta: 0:00:20 loss: 0.547270 (0.651139) acc1: 85.937500 (85.937500) acc5: 96.875000 (96.732955) time: 0.523698 data: 0.130038 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.726977 (0.682723) acc1: 82.812500 (84.523810) acc5: 96.875000 (96.800595) time: 0.371281 data: 0.000150 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.683724 (0.680910) acc1: 82.812500 (84.022177) acc5: 96.875000 (96.875000) time: 0.353479 data: 0.000126 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.683724 (0.696334) acc1: 82.812500 (83.803354) acc5: 96.875000 (96.913110) time: 0.360063 data: 0.000128 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.683724 (0.692175) acc1: 82.812500 (84.000000) acc5: 96.875000 (96.960000) time: 0.354183 data: 0.000106 max mem: 18817 Test: Total time: 0:00:19 (0.395741 s / it) * Acc@1 84.024 Acc@5 96.756 loss 0.713 Max accuracy: 84.05% Epoch: [293/300] [ 0/1251] eta: 0:42:43 lr: 0.000022 loss: 2.193038 (2.193038) time: 2.048786 data: 1.169200 max mem: 18817 Epoch: [293/300] [ 50/1251] eta: 0:19:20 lr: 0.000022 loss: 2.621168 (2.529658) time: 0.985982 data: 0.000182 max mem: 18817 Epoch: [293/300] [ 100/1251] eta: 0:18:23 lr: 0.000022 loss: 2.552977 (2.548795) time: 0.962450 data: 0.000182 max mem: 18817 Epoch: [293/300] [ 150/1251] eta: 0:17:32 lr: 0.000022 loss: 2.536007 (2.554473) time: 0.923361 data: 0.000177 max mem: 18817 Epoch: [293/300] [ 200/1251] eta: 0:16:46 lr: 0.000022 loss: 2.638252 (2.537656) time: 0.943524 data: 0.000178 max mem: 18817 Epoch: [293/300] [ 250/1251] eta: 0:15:58 lr: 0.000022 loss: 2.496332 (2.514332) time: 0.965589 data: 0.000175 max mem: 18817 Epoch: [293/300] [ 300/1251] eta: 0:15:08 lr: 0.000022 loss: 2.501614 (2.513919) time: 0.979615 data: 0.000170 max mem: 18817 Epoch: [293/300] [ 350/1251] eta: 0:14:18 lr: 0.000022 loss: 2.662329 (2.526919) time: 0.916600 data: 0.000173 max mem: 18817 Epoch: [293/300] [ 400/1251] eta: 0:13:28 lr: 0.000022 loss: 2.702675 (2.532457) time: 0.912139 data: 0.000169 max mem: 18817 Epoch: [293/300] [ 450/1251] eta: 0:12:41 lr: 0.000022 loss: 2.582416 (2.532768) time: 0.918849 data: 0.000172 max mem: 18817 Epoch: [293/300] [ 500/1251] eta: 0:11:54 lr: 0.000022 loss: 2.548905 (2.542432) time: 0.911105 data: 0.000177 max mem: 18817 Epoch: [293/300] [ 550/1251] eta: 0:11:07 lr: 0.000022 loss: 2.480946 (2.539866) time: 0.955455 data: 0.000181 max mem: 18817 Epoch: [293/300] [ 600/1251] eta: 0:10:18 lr: 0.000022 loss: 2.651980 (2.538665) time: 0.960610 data: 0.000183 max mem: 18817 Epoch: [293/300] [ 650/1251] eta: 0:09:30 lr: 0.000022 loss: 2.582800 (2.539610) time: 0.932325 data: 0.000158 max mem: 18817 Epoch: [293/300] [ 700/1251] eta: 0:08:43 lr: 0.000022 loss: 2.652705 (2.546876) time: 0.934327 data: 0.000163 max mem: 18817 Epoch: [293/300] [ 750/1251] eta: 0:07:56 lr: 0.000022 loss: 2.663323 (2.548078) time: 0.959581 data: 0.000178 max mem: 18817 Epoch: [293/300] [ 800/1251] eta: 0:07:08 lr: 0.000022 loss: 2.611148 (2.546941) time: 1.018145 data: 0.000176 max mem: 18817 Epoch: [293/300] [ 850/1251] eta: 0:06:20 lr: 0.000022 loss: 2.685964 (2.550182) time: 0.958960 data: 0.000180 max mem: 18817 Epoch: [293/300] [ 900/1251] eta: 0:05:32 lr: 0.000022 loss: 2.521786 (2.549355) time: 0.911579 data: 0.000184 max mem: 18817 Epoch: [293/300] [ 950/1251] eta: 0:04:45 lr: 0.000022 loss: 2.676685 (2.550169) time: 0.924197 data: 0.000171 max mem: 18817 Epoch: [293/300] [1000/1251] eta: 0:03:58 lr: 0.000022 loss: 2.715085 (2.550258) time: 0.973050 data: 0.000186 max mem: 18817 Epoch: [293/300] [1050/1251] eta: 0:03:10 lr: 0.000022 loss: 2.527791 (2.549122) time: 1.000000 data: 0.000200 max mem: 18817 Epoch: [293/300] [1100/1251] eta: 0:02:23 lr: 0.000022 loss: 2.612844 (2.547731) time: 0.957496 data: 0.000164 max mem: 18817 Epoch: [293/300] [1150/1251] eta: 0:01:35 lr: 0.000022 loss: 2.587850 (2.547032) time: 0.924538 data: 0.000168 max mem: 18817 Epoch: [293/300] [1200/1251] eta: 0:00:48 lr: 0.000022 loss: 2.191883 (2.547854) time: 0.913998 data: 0.000177 max mem: 18817 Epoch: [293/300] [1250/1251] eta: 0:00:00 lr: 0.000022 loss: 2.707250 (2.545836) time: 0.953920 data: 0.000782 max mem: 18817 Epoch: [293/300] Total time: 0:19:48 (0.950077 s / it) Averaged stats: lr: 0.000022 loss: 2.707250 (2.544617) Test: [ 0/49] eta: 0:01:28 loss: 0.502875 (0.502875) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.804846 data: 1.420806 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.555047 (0.652216) acc1: 84.375000 (85.369318) acc5: 96.875000 (96.875000) time: 0.499784 data: 0.129299 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.725890 (0.681164) acc1: 84.375000 (84.151786) acc5: 96.875000 (96.949405) time: 0.360172 data: 0.000135 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.672237 (0.680771) acc1: 82.812500 (84.122984) acc5: 96.875000 (96.975806) time: 0.356262 data: 0.000133 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.669762 (0.694771) acc1: 82.812500 (83.993902) acc5: 96.875000 (96.875000) time: 0.354835 data: 0.000148 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.669762 (0.690808) acc1: 84.375000 (84.224000) acc5: 96.875000 (96.992000) time: 0.348015 data: 0.000119 max mem: 18817 Test: Total time: 0:00:19 (0.388783 s / it) * Acc@1 84.024 Acc@5 96.734 loss 0.714 Max accuracy: 84.05% Epoch: [294/300] [ 0/1251] eta: 0:42:40 lr: 0.000022 loss: 1.485988 (1.485988) time: 2.047058 data: 1.162310 max mem: 18817 Epoch: [294/300] [ 50/1251] eta: 0:19:09 lr: 0.000022 loss: 2.617322 (2.569682) time: 0.962855 data: 0.000167 max mem: 18817 Epoch: [294/300] [ 100/1251] eta: 0:18:06 lr: 0.000022 loss: 2.539345 (2.556686) time: 0.909775 data: 0.000172 max mem: 18817 Epoch: [294/300] [ 150/1251] eta: 0:17:25 lr: 0.000022 loss: 2.304492 (2.536627) time: 0.926797 data: 0.000180 max mem: 18817 Epoch: [294/300] [ 200/1251] eta: 0:16:39 lr: 0.000022 loss: 2.597909 (2.535524) time: 0.973463 data: 0.000176 max mem: 18817 Epoch: [294/300] [ 250/1251] eta: 0:15:53 lr: 0.000022 loss: 2.611331 (2.520782) time: 1.003686 data: 0.000178 max mem: 18817 Epoch: [294/300] [ 300/1251] eta: 0:15:05 lr: 0.000022 loss: 2.690086 (2.539277) time: 0.973927 data: 0.000176 max mem: 18817 Epoch: [294/300] [ 350/1251] eta: 0:14:14 lr: 0.000022 loss: 2.611895 (2.540754) time: 0.913098 data: 0.000171 max mem: 18817 Epoch: [294/300] [ 400/1251] eta: 0:13:27 lr: 0.000022 loss: 2.531133 (2.537789) time: 0.912606 data: 0.000178 max mem: 18817 Epoch: [294/300] [ 450/1251] eta: 0:12:39 lr: 0.000022 loss: 2.581188 (2.539500) time: 0.958408 data: 0.000190 max mem: 18817 Epoch: [294/300] [ 500/1251] eta: 0:11:51 lr: 0.000022 loss: 2.647336 (2.535769) time: 0.953624 data: 0.000183 max mem: 18817 Epoch: [294/300] [ 550/1251] eta: 0:11:03 lr: 0.000022 loss: 2.626484 (2.530192) time: 0.908376 data: 0.000185 max mem: 18817 Epoch: [294/300] [ 600/1251] eta: 0:10:17 lr: 0.000022 loss: 2.350403 (2.533545) time: 0.937700 data: 0.000179 max mem: 18817 Epoch: [294/300] [ 650/1251] eta: 0:09:31 lr: 0.000021 loss: 2.647156 (2.537861) time: 0.930737 data: 0.000170 max mem: 18817 Epoch: [294/300] [ 700/1251] eta: 0:08:43 lr: 0.000021 loss: 2.624816 (2.537057) time: 0.957610 data: 0.000172 max mem: 18817 Epoch: [294/300] [ 750/1251] eta: 0:07:56 lr: 0.000021 loss: 2.711512 (2.536249) time: 0.988369 data: 0.000179 max mem: 18817 Epoch: [294/300] [ 800/1251] eta: 0:07:08 lr: 0.000021 loss: 2.637777 (2.533520) time: 0.906216 data: 0.000184 max mem: 18817 Epoch: [294/300] [ 850/1251] eta: 0:06:21 lr: 0.000021 loss: 2.438841 (2.527805) time: 0.917756 data: 0.000166 max mem: 18817 Epoch: [294/300] [ 900/1251] eta: 0:05:33 lr: 0.000021 loss: 2.661798 (2.526143) time: 0.910310 data: 0.000170 max mem: 18817 Epoch: [294/300] [ 950/1251] eta: 0:04:46 lr: 0.000021 loss: 2.443241 (2.528194) time: 1.015122 data: 0.000178 max mem: 18817 Epoch: [294/300] [1000/1251] eta: 0:03:58 lr: 0.000021 loss: 2.568375 (2.528253) time: 0.958839 data: 0.000168 max mem: 18817 Epoch: [294/300] [1050/1251] eta: 0:03:10 lr: 0.000021 loss: 2.716040 (2.528871) time: 0.926013 data: 0.000190 max mem: 18817 Epoch: [294/300] [1100/1251] eta: 0:02:23 lr: 0.000021 loss: 2.528882 (2.526511) time: 0.937283 data: 0.000171 max mem: 18817 Epoch: [294/300] [1150/1251] eta: 0:01:36 lr: 0.000021 loss: 2.764258 (2.531783) time: 0.971387 data: 0.000181 max mem: 18817 Epoch: [294/300] [1200/1251] eta: 0:00:48 lr: 0.000021 loss: 2.617405 (2.528980) time: 1.034526 data: 0.000177 max mem: 18817 Epoch: [294/300] [1250/1251] eta: 0:00:00 lr: 0.000021 loss: 2.646620 (2.530333) time: 0.949225 data: 0.000764 max mem: 18817 Epoch: [294/300] Total time: 0:19:49 (0.950937 s / it) Averaged stats: lr: 0.000021 loss: 2.646620 (2.533774) Test: [ 0/49] eta: 0:01:30 loss: 0.510868 (0.510868) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.849478 data: 1.451419 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.551627 (0.649481) acc1: 84.375000 (85.227273) acc5: 98.437500 (96.732955) time: 0.493192 data: 0.132087 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.711731 (0.677554) acc1: 82.812500 (84.300595) acc5: 96.875000 (96.875000) time: 0.354290 data: 0.000146 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.671832 (0.675384) acc1: 82.812500 (84.122984) acc5: 96.875000 (96.975806) time: 0.351490 data: 0.000143 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.671832 (0.690545) acc1: 82.812500 (83.917683) acc5: 96.875000 (97.027439) time: 0.349428 data: 0.000150 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.671832 (0.686387) acc1: 82.812500 (84.128000) acc5: 96.875000 (97.088000) time: 0.350354 data: 0.000127 max mem: 18817 Test: Total time: 0:00:18 (0.385634 s / it) * Acc@1 84.018 Acc@5 96.784 loss 0.709 Max accuracy: 84.05% Epoch: [295/300] [ 0/1251] eta: 0:43:43 lr: 0.000021 loss: 2.817868 (2.817868) time: 2.097166 data: 1.227504 max mem: 18817 Epoch: [295/300] [ 50/1251] eta: 0:20:01 lr: 0.000021 loss: 2.574768 (2.591412) time: 0.933204 data: 0.000179 max mem: 18817 Epoch: [295/300] [ 100/1251] eta: 0:18:56 lr: 0.000021 loss: 2.666567 (2.588863) time: 0.982050 data: 0.000168 max mem: 18817 Epoch: [295/300] [ 150/1251] eta: 0:17:54 lr: 0.000021 loss: 2.525714 (2.568580) time: 1.012034 data: 0.000166 max mem: 18817 Epoch: [295/300] [ 200/1251] eta: 0:16:55 lr: 0.000021 loss: 2.554875 (2.579495) time: 0.972189 data: 0.000165 max mem: 18817 Epoch: [295/300] [ 250/1251] eta: 0:16:01 lr: 0.000021 loss: 2.769140 (2.598302) time: 0.914688 data: 0.000181 max mem: 18817 Epoch: [295/300] [ 300/1251] eta: 0:15:13 lr: 0.000021 loss: 2.667515 (2.607370) time: 0.915360 data: 0.000161 max mem: 18817 Epoch: [295/300] [ 350/1251] eta: 0:14:25 lr: 0.000021 loss: 2.639464 (2.597545) time: 0.974715 data: 0.000183 max mem: 18817 Epoch: [295/300] [ 400/1251] eta: 0:13:37 lr: 0.000021 loss: 2.685831 (2.593573) time: 1.024094 data: 0.000174 max mem: 18817 Epoch: [295/300] [ 450/1251] eta: 0:12:46 lr: 0.000021 loss: 2.573724 (2.585286) time: 0.957413 data: 0.000181 max mem: 18817 Epoch: [295/300] [ 500/1251] eta: 0:11:57 lr: 0.000021 loss: 2.509308 (2.575663) time: 0.924403 data: 0.000176 max mem: 18817 Epoch: [295/300] [ 550/1251] eta: 0:11:09 lr: 0.000021 loss: 2.616879 (2.571284) time: 0.914527 data: 0.000174 max mem: 18817 Epoch: [295/300] [ 600/1251] eta: 0:10:21 lr: 0.000021 loss: 2.387235 (2.566513) time: 0.957305 data: 0.000184 max mem: 18817 Epoch: [295/300] [ 650/1251] eta: 0:09:33 lr: 0.000021 loss: 2.775561 (2.575720) time: 0.952791 data: 0.000194 max mem: 18817 Epoch: [295/300] [ 700/1251] eta: 0:08:44 lr: 0.000021 loss: 2.465027 (2.569619) time: 0.907475 data: 0.000173 max mem: 18817 Epoch: [295/300] [ 750/1251] eta: 0:07:58 lr: 0.000021 loss: 2.540901 (2.568182) time: 0.947999 data: 0.000171 max mem: 18817 Epoch: [295/300] [ 800/1251] eta: 0:07:10 lr: 0.000021 loss: 2.831017 (2.567433) time: 0.910832 data: 0.000184 max mem: 18817 Epoch: [295/300] [ 850/1251] eta: 0:06:22 lr: 0.000021 loss: 2.652504 (2.568473) time: 0.957401 data: 0.000187 max mem: 18817 Epoch: [295/300] [ 900/1251] eta: 0:05:34 lr: 0.000021 loss: 2.311704 (2.564015) time: 0.954856 data: 0.000169 max mem: 18817 Epoch: [295/300] [ 950/1251] eta: 0:04:46 lr: 0.000021 loss: 2.671603 (2.563725) time: 0.924449 data: 0.000166 max mem: 18817 Epoch: [295/300] [1000/1251] eta: 0:03:59 lr: 0.000021 loss: 2.120353 (2.556364) time: 0.918004 data: 0.000170 max mem: 18817 Epoch: [295/300] [1050/1251] eta: 0:03:11 lr: 0.000021 loss: 2.554431 (2.552795) time: 0.967714 data: 0.000212 max mem: 18817 Epoch: [295/300] [1100/1251] eta: 0:02:23 lr: 0.000021 loss: 2.626597 (2.555368) time: 0.949911 data: 0.000173 max mem: 18817 Epoch: [295/300] [1150/1251] eta: 0:01:36 lr: 0.000021 loss: 2.736054 (2.558490) time: 0.976519 data: 0.000199 max mem: 18817 Epoch: [295/300] [1200/1251] eta: 0:00:48 lr: 0.000021 loss: 2.532521 (2.552028) time: 0.991546 data: 0.000178 max mem: 18817 Epoch: [295/300] [1250/1251] eta: 0:00:00 lr: 0.000021 loss: 2.629218 (2.553626) time: 0.917807 data: 0.000748 max mem: 18817 Epoch: [295/300] Total time: 0:19:51 (0.952529 s / it) Averaged stats: lr: 0.000021 loss: 2.629218 (2.547787) Test: [ 0/49] eta: 0:01:28 loss: 0.508994 (0.508994) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.806106 data: 1.420615 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.575334 (0.658034) acc1: 84.375000 (85.795455) acc5: 96.875000 (96.875000) time: 0.495212 data: 0.129301 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.710597 (0.680652) acc1: 84.375000 (84.598214) acc5: 96.875000 (97.023810) time: 0.358652 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.693172 (0.682428) acc1: 84.375000 (84.375000) acc5: 96.875000 (97.127016) time: 0.353350 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.684731 (0.695816) acc1: 84.375000 (84.298780) acc5: 96.875000 (97.179878) time: 0.350293 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.684731 (0.692699) acc1: 84.375000 (84.448000) acc5: 96.875000 (97.216000) time: 0.441015 data: 0.000101 max mem: 18817 Test: Total time: 0:00:20 (0.424451 s / it) * Acc@1 84.048 Acc@5 96.752 loss 0.717 Max accuracy: 84.05% Epoch: [296/300] [ 0/1251] eta: 0:41:10 lr: 0.000021 loss: 2.597857 (2.597857) time: 1.974976 data: 1.105124 max mem: 18817 Epoch: [296/300] [ 50/1251] eta: 0:19:20 lr: 0.000021 loss: 2.525061 (2.509581) time: 0.927123 data: 0.000175 max mem: 18817 Epoch: [296/300] [ 100/1251] eta: 0:18:26 lr: 0.000021 loss: 2.600804 (2.536366) time: 0.911624 data: 0.000198 max mem: 18817 Epoch: [296/300] [ 150/1251] eta: 0:17:33 lr: 0.000021 loss: 2.355967 (2.492201) time: 0.948186 data: 0.000186 max mem: 18817 Epoch: [296/300] [ 200/1251] eta: 0:16:38 lr: 0.000021 loss: 2.398622 (2.482607) time: 0.959570 data: 0.000168 max mem: 18817 Epoch: [296/300] [ 250/1251] eta: 0:15:56 lr: 0.000021 loss: 2.631626 (2.508300) time: 0.971860 data: 0.000190 max mem: 18817 Epoch: [296/300] [ 300/1251] eta: 0:15:05 lr: 0.000021 loss: 2.697594 (2.523748) time: 0.974353 data: 0.000184 max mem: 18817 Epoch: [296/300] [ 350/1251] eta: 0:14:17 lr: 0.000021 loss: 2.515250 (2.511171) time: 0.919053 data: 0.000175 max mem: 18817 Epoch: [296/300] [ 400/1251] eta: 0:13:31 lr: 0.000021 loss: 2.489033 (2.518108) time: 0.928511 data: 0.000168 max mem: 18817 Epoch: [296/300] [ 450/1251] eta: 0:12:45 lr: 0.000021 loss: 2.573640 (2.508458) time: 0.972171 data: 0.000189 max mem: 18817 Epoch: [296/300] [ 500/1251] eta: 0:11:57 lr: 0.000021 loss: 2.697572 (2.520338) time: 0.967413 data: 0.000175 max mem: 18817 Epoch: [296/300] [ 550/1251] eta: 0:11:08 lr: 0.000021 loss: 2.524424 (2.519608) time: 0.983362 data: 0.000161 max mem: 18817 Epoch: [296/300] [ 600/1251] eta: 0:10:20 lr: 0.000021 loss: 2.548017 (2.528033) time: 0.925433 data: 0.000171 max mem: 18817 Epoch: [296/300] [ 650/1251] eta: 0:09:33 lr: 0.000021 loss: 2.695731 (2.531651) time: 0.932588 data: 0.000171 max mem: 18817 Epoch: [296/300] [ 700/1251] eta: 0:08:46 lr: 0.000021 loss: 2.642000 (2.526445) time: 0.998272 data: 0.000175 max mem: 18817 Epoch: [296/300] [ 750/1251] eta: 0:07:59 lr: 0.000021 loss: 2.488480 (2.529637) time: 1.022039 data: 0.000174 max mem: 18817 Epoch: [296/300] [ 800/1251] eta: 0:07:10 lr: 0.000021 loss: 2.492194 (2.530279) time: 0.948669 data: 0.000168 max mem: 18817 Epoch: [296/300] [ 850/1251] eta: 0:06:22 lr: 0.000021 loss: 2.592481 (2.523781) time: 0.919622 data: 0.000162 max mem: 18817 Epoch: [296/300] [ 900/1251] eta: 0:05:34 lr: 0.000021 loss: 2.557550 (2.524543) time: 0.928898 data: 0.000172 max mem: 18817 Epoch: [296/300] [ 950/1251] eta: 0:04:47 lr: 0.000020 loss: 2.794927 (2.527798) time: 0.968833 data: 0.000179 max mem: 18817 Epoch: [296/300] [1000/1251] eta: 0:03:59 lr: 0.000020 loss: 2.548788 (2.526755) time: 1.025472 data: 0.000176 max mem: 18817 Epoch: [296/300] [1050/1251] eta: 0:03:11 lr: 0.000020 loss: 2.668304 (2.529524) time: 0.991315 data: 0.000169 max mem: 18817 Epoch: [296/300] [1100/1251] eta: 0:02:23 lr: 0.000020 loss: 2.700951 (2.532338) time: 0.914114 data: 0.000181 max mem: 18817 Epoch: [296/300] [1150/1251] eta: 0:01:36 lr: 0.000020 loss: 2.480110 (2.532526) time: 0.921783 data: 0.000173 max mem: 18817 Epoch: [296/300] [1200/1251] eta: 0:00:48 lr: 0.000020 loss: 2.624914 (2.531718) time: 0.971100 data: 0.000165 max mem: 18817 Epoch: [296/300] [1250/1251] eta: 0:00:00 lr: 0.000020 loss: 2.699359 (2.531503) time: 0.947372 data: 0.000779 max mem: 18817 Epoch: [296/300] Total time: 0:19:51 (0.952289 s / it) Averaged stats: lr: 0.000020 loss: 2.699359 (2.529860) Test: [ 0/49] eta: 0:01:16 loss: 0.477117 (0.477117) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.557295 data: 1.125978 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.570391 (0.651064) acc1: 84.375000 (85.085227) acc5: 96.875000 (96.875000) time: 0.469558 data: 0.102520 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.703948 (0.677638) acc1: 82.812500 (84.077381) acc5: 96.875000 (96.875000) time: 0.356843 data: 0.000147 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.679499 (0.677294) acc1: 82.812500 (84.223790) acc5: 96.875000 (96.975806) time: 0.356041 data: 0.000120 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.675414 (0.692037) acc1: 84.375000 (84.070122) acc5: 96.875000 (97.065549) time: 0.367972 data: 0.000119 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.675414 (0.688682) acc1: 84.375000 (84.256000) acc5: 96.875000 (97.184000) time: 0.368416 data: 0.000102 max mem: 18817 Test: Total time: 0:00:19 (0.389071 s / it) * Acc@1 84.028 Acc@5 96.768 loss 0.714 Max accuracy: 84.05% Epoch: [297/300] [ 0/1251] eta: 0:40:55 lr: 0.000020 loss: 3.014343 (3.014343) time: 1.962773 data: 1.097189 max mem: 18817 Epoch: [297/300] [ 50/1251] eta: 0:19:14 lr: 0.000020 loss: 2.842628 (2.551966) time: 0.921253 data: 0.000180 max mem: 18817 Epoch: [297/300] [ 100/1251] eta: 0:18:30 lr: 0.000020 loss: 2.628029 (2.519965) time: 0.924961 data: 0.000184 max mem: 18817 Epoch: [297/300] [ 150/1251] eta: 0:17:42 lr: 0.000020 loss: 2.719895 (2.539342) time: 0.980588 data: 0.000198 max mem: 18817 Epoch: [297/300] [ 200/1251] eta: 0:16:52 lr: 0.000020 loss: 2.757025 (2.530077) time: 1.017001 data: 0.000182 max mem: 18817 Epoch: [297/300] [ 250/1251] eta: 0:15:58 lr: 0.000020 loss: 2.724627 (2.538870) time: 0.972847 data: 0.000181 max mem: 18817 Epoch: [297/300] [ 300/1251] eta: 0:15:06 lr: 0.000020 loss: 2.358592 (2.533448) time: 0.921785 data: 0.000179 max mem: 18817 Epoch: [297/300] [ 350/1251] eta: 0:14:18 lr: 0.000020 loss: 2.596035 (2.517860) time: 0.914092 data: 0.000175 max mem: 18817 Epoch: [297/300] [ 400/1251] eta: 0:13:31 lr: 0.000020 loss: 2.463633 (2.516751) time: 0.953291 data: 0.000172 max mem: 18817 Epoch: [297/300] [ 450/1251] eta: 0:12:43 lr: 0.000020 loss: 2.766949 (2.528695) time: 0.996991 data: 0.000178 max mem: 18817 Epoch: [297/300] [ 500/1251] eta: 0:11:54 lr: 0.000020 loss: 2.488136 (2.526676) time: 0.914096 data: 0.000190 max mem: 18817 Epoch: [297/300] [ 550/1251] eta: 0:11:06 lr: 0.000020 loss: 2.649134 (2.520529) time: 0.909368 data: 0.000167 max mem: 18817 Epoch: [297/300] [ 600/1251] eta: 0:10:18 lr: 0.000020 loss: 2.677994 (2.523491) time: 0.909415 data: 0.000169 max mem: 18817 Epoch: [297/300] [ 650/1251] eta: 0:09:31 lr: 0.000020 loss: 2.430743 (2.517969) time: 0.968135 data: 0.000173 max mem: 18817 Epoch: [297/300] [ 700/1251] eta: 0:08:43 lr: 0.000020 loss: 2.812157 (2.524160) time: 0.966191 data: 0.000168 max mem: 18817 Epoch: [297/300] [ 750/1251] eta: 0:07:55 lr: 0.000020 loss: 2.430551 (2.517531) time: 0.916371 data: 0.000174 max mem: 18817 Epoch: [297/300] [ 800/1251] eta: 0:07:08 lr: 0.000020 loss: 2.607390 (2.523984) time: 0.919746 data: 0.000181 max mem: 18817 Epoch: [297/300] [ 850/1251] eta: 0:06:21 lr: 0.000020 loss: 2.630203 (2.525568) time: 0.944630 data: 0.000171 max mem: 18817 Epoch: [297/300] [ 900/1251] eta: 0:05:33 lr: 0.000020 loss: 2.593291 (2.529474) time: 1.006847 data: 0.000176 max mem: 18817 Epoch: [297/300] [ 950/1251] eta: 0:04:46 lr: 0.000020 loss: 2.679974 (2.533346) time: 0.964109 data: 0.000173 max mem: 18817 Epoch: [297/300] [1000/1251] eta: 0:03:58 lr: 0.000020 loss: 2.680206 (2.537239) time: 0.921981 data: 0.000167 max mem: 18817 Epoch: [297/300] [1050/1251] eta: 0:03:11 lr: 0.000020 loss: 2.572407 (2.539004) time: 0.938521 data: 0.000172 max mem: 18817 Epoch: [297/300] [1100/1251] eta: 0:02:23 lr: 0.000020 loss: 2.501003 (2.541730) time: 0.983614 data: 0.000170 max mem: 18817 Epoch: [297/300] [1150/1251] eta: 0:01:36 lr: 0.000020 loss: 2.734002 (2.541325) time: 1.023370 data: 0.000158 max mem: 18817 Epoch: [297/300] [1200/1251] eta: 0:00:48 lr: 0.000020 loss: 2.524188 (2.540073) time: 0.953263 data: 0.000162 max mem: 18817 Epoch: [297/300] [1250/1251] eta: 0:00:00 lr: 0.000020 loss: 2.562386 (2.537061) time: 0.918915 data: 0.000749 max mem: 18817 Epoch: [297/300] Total time: 0:19:50 (0.951629 s / it) Averaged stats: lr: 0.000020 loss: 2.562386 (2.538423) Test: [ 0/49] eta: 0:01:19 loss: 0.502539 (0.502539) acc1: 85.937500 (85.937500) acc5: 100.000000 (100.000000) time: 1.613754 data: 1.193955 max mem: 18817 Test: [10/49] eta: 0:00:19 loss: 0.551611 (0.654788) acc1: 84.375000 (85.085227) acc5: 96.875000 (96.875000) time: 0.492095 data: 0.108685 max mem: 18817 Test: [20/49] eta: 0:00:15 loss: 0.692736 (0.678859) acc1: 84.375000 (84.002976) acc5: 96.875000 (96.949405) time: 0.480711 data: 0.000144 max mem: 18817 Test: [30/49] eta: 0:00:09 loss: 0.677481 (0.680026) acc1: 84.375000 (84.122984) acc5: 96.875000 (97.076613) time: 0.466264 data: 0.000131 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.673351 (0.691902) acc1: 84.375000 (83.955793) acc5: 96.875000 (97.141768) time: 0.348561 data: 0.000132 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.673351 (0.688184) acc1: 84.375000 (84.128000) acc5: 98.437500 (97.248000) time: 0.343504 data: 0.000111 max mem: 18817 Test: Total time: 0:00:21 (0.429529 s / it) * Acc@1 83.956 Acc@5 96.752 loss 0.711 Max accuracy: 84.05% Epoch: [298/300] [ 0/1251] eta: 0:42:18 lr: 0.000020 loss: 2.907438 (2.907438) time: 2.028899 data: 1.155877 max mem: 18817 Epoch: [298/300] [ 50/1251] eta: 0:19:18 lr: 0.000020 loss: 2.726252 (2.506757) time: 0.993646 data: 0.000175 max mem: 18817 Epoch: [298/300] [ 100/1251] eta: 0:18:16 lr: 0.000020 loss: 2.524704 (2.458230) time: 0.958443 data: 0.000166 max mem: 18817 Epoch: [298/300] [ 150/1251] eta: 0:17:26 lr: 0.000020 loss: 2.571759 (2.472650) time: 0.918327 data: 0.000179 max mem: 18817 Epoch: [298/300] [ 200/1251] eta: 0:16:40 lr: 0.000020 loss: 2.575493 (2.496686) time: 0.912369 data: 0.000153 max mem: 18817 Epoch: [298/300] [ 250/1251] eta: 0:15:53 lr: 0.000020 loss: 2.723122 (2.517576) time: 0.971618 data: 0.000209 max mem: 18817 Epoch: [298/300] [ 300/1251] eta: 0:15:04 lr: 0.000020 loss: 2.672806 (2.526013) time: 0.979150 data: 0.000172 max mem: 18817 Epoch: [298/300] [ 350/1251] eta: 0:14:18 lr: 0.000020 loss: 2.547956 (2.529860) time: 0.970798 data: 0.000156 max mem: 18817 Epoch: [298/300] [ 400/1251] eta: 0:13:28 lr: 0.000020 loss: 2.381080 (2.520494) time: 0.919090 data: 0.000170 max mem: 18817 Epoch: [298/300] [ 450/1251] eta: 0:12:41 lr: 0.000020 loss: 2.381401 (2.516204) time: 1.021580 data: 0.000164 max mem: 18817 Epoch: [298/300] [ 500/1251] eta: 0:11:52 lr: 0.000020 loss: 2.711711 (2.518009) time: 0.958907 data: 0.000172 max mem: 18817 Epoch: [298/300] [ 550/1251] eta: 0:11:04 lr: 0.000020 loss: 2.569244 (2.513439) time: 0.915743 data: 0.000173 max mem: 18817 Epoch: [298/300] [ 600/1251] eta: 0:10:17 lr: 0.000020 loss: 2.685632 (2.518610) time: 0.928197 data: 0.000173 max mem: 18817 Epoch: [298/300] [ 650/1251] eta: 0:09:31 lr: 0.000020 loss: 2.522240 (2.515137) time: 0.966545 data: 0.000167 max mem: 18817 Epoch: [298/300] [ 700/1251] eta: 0:08:43 lr: 0.000020 loss: 2.655056 (2.524685) time: 1.027382 data: 0.000169 max mem: 18817 Epoch: [298/300] [ 750/1251] eta: 0:07:55 lr: 0.000020 loss: 2.250645 (2.521373) time: 0.972687 data: 0.000170 max mem: 18817 Epoch: [298/300] [ 800/1251] eta: 0:07:07 lr: 0.000020 loss: 2.501643 (2.525396) time: 0.916341 data: 0.000175 max mem: 18817 Epoch: [298/300] [ 850/1251] eta: 0:06:20 lr: 0.000020 loss: 2.561688 (2.527344) time: 0.919475 data: 0.000167 max mem: 18817 Epoch: [298/300] [ 900/1251] eta: 0:05:33 lr: 0.000020 loss: 2.628008 (2.530071) time: 0.988036 data: 0.000182 max mem: 18817 Epoch: [298/300] [ 950/1251] eta: 0:04:46 lr: 0.000020 loss: 2.811110 (2.535971) time: 0.975734 data: 0.000174 max mem: 18817 Epoch: [298/300] [1000/1251] eta: 0:03:58 lr: 0.000020 loss: 2.500634 (2.531835) time: 1.031524 data: 0.000184 max mem: 18817 Epoch: [298/300] [1050/1251] eta: 0:03:10 lr: 0.000020 loss: 2.690139 (2.532713) time: 0.957943 data: 0.000171 max mem: 18817 Epoch: [298/300] [1100/1251] eta: 0:02:23 lr: 0.000020 loss: 2.716259 (2.535283) time: 0.922243 data: 0.000163 max mem: 18817 Epoch: [298/300] [1150/1251] eta: 0:01:35 lr: 0.000020 loss: 2.296158 (2.532273) time: 0.913445 data: 0.000177 max mem: 18817 Epoch: [298/300] [1200/1251] eta: 0:00:48 lr: 0.000020 loss: 2.333878 (2.529029) time: 0.964466 data: 0.000174 max mem: 18817 Epoch: [298/300] [1250/1251] eta: 0:00:00 lr: 0.000020 loss: 2.596204 (2.527413) time: 0.965576 data: 0.000774 max mem: 18817 Epoch: [298/300] Total time: 0:19:48 (0.949810 s / it) Averaged stats: lr: 0.000020 loss: 2.596204 (2.530203) Test: [ 0/49] eta: 0:01:51 loss: 0.488030 (0.488030) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 2.285413 data: 1.226561 max mem: 18817 Test: [10/49] eta: 0:00:21 loss: 0.592779 (0.649937) acc1: 84.375000 (85.085227) acc5: 96.875000 (96.875000) time: 0.563151 data: 0.111637 max mem: 18817 Test: [20/49] eta: 0:00:13 loss: 0.705432 (0.675753) acc1: 84.375000 (84.523810) acc5: 96.875000 (96.800595) time: 0.372681 data: 0.000141 max mem: 18817 Test: [30/49] eta: 0:00:08 loss: 0.672344 (0.674753) acc1: 84.375000 (84.324597) acc5: 96.875000 (96.925403) time: 0.352724 data: 0.000130 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.672344 (0.687487) acc1: 82.812500 (84.146341) acc5: 96.875000 (96.989329) time: 0.349625 data: 0.000125 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.672344 (0.685403) acc1: 84.375000 (84.352000) acc5: 96.875000 (97.056000) time: 0.344623 data: 0.000108 max mem: 18817 Test: Total time: 0:00:19 (0.399640 s / it) * Acc@1 84.040 Acc@5 96.752 loss 0.708 Max accuracy: 84.05% Epoch: [299/300] [ 0/1251] eta: 0:43:27 lr: 0.000020 loss: 2.738857 (2.738857) time: 2.084216 data: 1.225948 max mem: 18817 Epoch: [299/300] [ 50/1251] eta: 0:19:02 lr: 0.000020 loss: 2.569541 (2.611187) time: 0.906583 data: 0.000165 max mem: 18817 Epoch: [299/300] [ 100/1251] eta: 0:18:22 lr: 0.000020 loss: 2.700476 (2.584339) time: 0.942237 data: 0.000191 max mem: 18817 Epoch: [299/300] [ 150/1251] eta: 0:17:33 lr: 0.000020 loss: 2.323021 (2.532385) time: 0.977685 data: 0.000173 max mem: 18817 Epoch: [299/300] [ 200/1251] eta: 0:16:38 lr: 0.000020 loss: 2.473592 (2.515831) time: 0.954799 data: 0.000173 max mem: 18817 Epoch: [299/300] [ 250/1251] eta: 0:15:53 lr: 0.000020 loss: 2.579552 (2.524439) time: 0.966859 data: 0.000180 max mem: 18817 Epoch: [299/300] [ 300/1251] eta: 0:15:02 lr: 0.000020 loss: 2.772277 (2.530536) time: 0.913807 data: 0.000170 max mem: 18817 Epoch: [299/300] [ 350/1251] eta: 0:14:14 lr: 0.000020 loss: 2.627277 (2.535319) time: 0.959669 data: 0.000189 max mem: 18817 Epoch: [299/300] [ 400/1251] eta: 0:13:29 lr: 0.000020 loss: 2.710734 (2.539862) time: 0.988299 data: 0.000204 max mem: 18817 Epoch: [299/300] [ 450/1251] eta: 0:12:40 lr: 0.000020 loss: 2.662604 (2.550067) time: 0.975591 data: 0.000179 max mem: 18817 Epoch: [299/300] [ 500/1251] eta: 0:11:52 lr: 0.000020 loss: 2.389590 (2.542673) time: 0.921367 data: 0.000167 max mem: 18817 Epoch: [299/300] [ 550/1251] eta: 0:11:05 lr: 0.000020 loss: 2.461165 (2.537621) time: 0.929655 data: 0.000165 max mem: 18817 Epoch: [299/300] [ 600/1251] eta: 0:10:17 lr: 0.000020 loss: 2.763205 (2.539808) time: 0.962310 data: 0.000176 max mem: 18817 Epoch: [299/300] [ 650/1251] eta: 0:09:30 lr: 0.000020 loss: 2.608177 (2.543126) time: 0.970164 data: 0.000163 max mem: 18817 Epoch: [299/300] [ 700/1251] eta: 0:08:42 lr: 0.000020 loss: 2.589488 (2.543091) time: 0.966347 data: 0.000169 max mem: 18817 Epoch: [299/300] [ 750/1251] eta: 0:07:54 lr: 0.000020 loss: 2.616502 (2.547462) time: 0.909345 data: 0.000168 max mem: 18817 Epoch: [299/300] [ 800/1251] eta: 0:07:07 lr: 0.000020 loss: 2.520483 (2.541855) time: 0.918060 data: 0.000166 max mem: 18817 Epoch: [299/300] [ 850/1251] eta: 0:06:20 lr: 0.000020 loss: 2.517772 (2.542741) time: 0.956536 data: 0.000166 max mem: 18817 Epoch: [299/300] [ 900/1251] eta: 0:05:33 lr: 0.000020 loss: 2.627553 (2.541328) time: 1.031144 data: 0.000173 max mem: 18817 Epoch: [299/300] [ 950/1251] eta: 0:04:45 lr: 0.000020 loss: 2.554633 (2.541406) time: 0.972390 data: 0.000177 max mem: 18817 Epoch: [299/300] [1000/1251] eta: 0:03:57 lr: 0.000020 loss: 2.539256 (2.536718) time: 0.914657 data: 0.000170 max mem: 18817 Epoch: [299/300] [1050/1251] eta: 0:03:10 lr: 0.000020 loss: 2.646870 (2.534770) time: 0.911588 data: 0.000173 max mem: 18817 Epoch: [299/300] [1100/1251] eta: 0:02:23 lr: 0.000020 loss: 2.467576 (2.534454) time: 0.970392 data: 0.000170 max mem: 18817 Epoch: [299/300] [1150/1251] eta: 0:01:35 lr: 0.000020 loss: 2.600613 (2.529975) time: 0.979343 data: 0.000168 max mem: 18817 Epoch: [299/300] [1200/1251] eta: 0:00:48 lr: 0.000020 loss: 2.654320 (2.528174) time: 0.957389 data: 0.000168 max mem: 18817 Epoch: [299/300] [1250/1251] eta: 0:00:00 lr: 0.000020 loss: 2.748573 (2.529704) time: 0.920752 data: 0.000754 max mem: 18817 Epoch: [299/300] Total time: 0:19:47 (0.948910 s / it) Averaged stats: lr: 0.000020 loss: 2.748573 (2.527472) Test: [ 0/49] eta: 0:01:19 loss: 0.504885 (0.504885) acc1: 84.375000 (84.375000) acc5: 100.000000 (100.000000) time: 1.621912 data: 1.211075 max mem: 18817 Test: [10/49] eta: 0:00:18 loss: 0.563772 (0.651941) acc1: 84.375000 (85.227273) acc5: 96.875000 (96.875000) time: 0.473138 data: 0.110250 max mem: 18817 Test: [20/49] eta: 0:00:12 loss: 0.702658 (0.675987) acc1: 84.375000 (84.672619) acc5: 96.875000 (96.800595) time: 0.359210 data: 0.000142 max mem: 18817 Test: [30/49] eta: 0:00:07 loss: 0.673885 (0.677499) acc1: 82.812500 (84.375000) acc5: 96.875000 (96.875000) time: 0.356521 data: 0.000123 max mem: 18817 Test: [40/49] eta: 0:00:03 loss: 0.673885 (0.690329) acc1: 84.375000 (84.298780) acc5: 96.875000 (96.913110) time: 0.355259 data: 0.000121 max mem: 18817 Test: [48/49] eta: 0:00:00 loss: 0.673885 (0.686689) acc1: 84.375000 (84.384000) acc5: 96.875000 (96.992000) time: 0.350268 data: 0.000100 max mem: 18817 Test: Total time: 0:00:18 (0.383339 s / it) * Acc@1 84.040 Acc@5 96.730 loss 0.710 Max accuracy: 84.05%