forked from lessw2020/transformer_framework
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathrun_300.txt
2197 lines (1923 loc) · 120 KB
/
run_300.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
******* loading model args.model='vit'
******* loading model args.model='vit'
******* loading model args.model='vit'
******* loading model args.model='vit'
--> World Size = 4
--> Device_count = 4
--> running with these defaults train_config(seed=2022, verbose=True, total_steps_to_run=None, print_memory_summary=False, num_epochs=300, model_weights_bf16=False, use_mixed_precision=False, use_low_precision_gradient_policy=False, use_tf32=False, optimizer='AnyPrecision', ap_use_kahan_summation=False, sharding_strategy=<ShardingStrategy.FULL_SHARD: 1>, print_sharding_plan=False, run_profiler=False, profile_folder='fsdp/profile_tracing', log_every=1, num_workers_dataloader=2, batch_size_training=68, fsdp_activation_checkpointing=False, run_validation=True, memory_report=True, nccl_debug_handler=True, distributed_debug=True, use_non_recursive_wrapping=False, use_tp=False, image_size=224, use_synthetic_data=False, use_pokemon_dataset=False, use_beans_dataset=True, save_model_checkpoint=False, load_model_checkpoint=False, checkpoint_max_save_count=2, save_optimizer=False, load_optimizer=False, optimizer_checkpoint_file='Adam-vit--1.pt', checkpoint_model_filename='vit--1.pt')
clearing gpu cache for all ranks
--> running with torch dist debug set to detail
--> total memory per gpu (GB) = 22.0626
policy is None
--> Prepping vit_relpos_base_patch16_rpn_224 model ...
stats is ready....? _stats=defaultdict(<class 'list'>, {}), local_rank=0, rank=0
--> vit_relpos_base_patch16_rpn_224 built.
built model with 151.942851M params
--> Warning - bf16 support not available. Using fp32
backward prefetch set to None
sharding set to ShardingStrategy.FULL_SHARD
--> Batch Size = 68
local rank 0 init time = 0.7194142759999522
memory stats reset, ready to track
step: 1: time taken for the last 1 steps is 2.7425989050000226, loss is 1.3618253469467163
step: 2: time taken for the last 1 steps is 0.5492016389998753, loss is 1.2400237321853638
Finish warm up
Running with AnyPrecision Optimizer, momo=torch.float32, var = torch.float32, kahan summation = False
Epoch: 1 starting...
step: 1: time taken for the last 1 steps is 1.2059477050001988, loss is 1.3502328395843506
step: 2: time taken for the last 1 steps is 0.558105623000074, loss is 2.772268056869507
step: 3: time taken for the last 1 steps is 0.5638014480000493, loss is 2.188901901245117
step: 4: time taken for the last 1 steps is 0.531010574999982, loss is 1.8474090099334717
val_loss : 1.4679 : val_acc: 0.4521
updating stats...
Epoch: 2 starting...
step: 1: time taken for the last 1 steps is 0.8264739339999778, loss is 1.8955564498901367
step: 2: time taken for the last 1 steps is 0.5498019899998781, loss is 1.1917791366577148
step: 3: time taken for the last 1 steps is 0.5327653079998527, loss is 1.2654627561569214
step: 4: time taken for the last 1 steps is 0.5191987880000397, loss is 0.8961225152015686
val_loss : 1.0444 : val_acc: 0.5250
updating stats...
Epoch: 3 starting...
step: 1: time taken for the last 1 steps is 0.8135913480000454, loss is 1.1972880363464355
step: 2: time taken for the last 1 steps is 0.5436835079999582, loss is 1.137427568435669
step: 3: time taken for the last 1 steps is 0.5465052800000194, loss is 1.0648016929626465
step: 4: time taken for the last 1 steps is 0.49816815200006204, loss is 0.9127635359764099
val_loss : 0.9245 : val_acc: 0.5302
updating stats...
Epoch: 4 starting...
step: 1: time taken for the last 1 steps is 0.8449840939999831, loss is 1.0200624465942383
step: 2: time taken for the last 1 steps is 0.548742152000159, loss is 0.9055872559547424
step: 3: time taken for the last 1 steps is 0.5473216350001167, loss is 0.973391592502594
step: 4: time taken for the last 1 steps is 0.520292869000059, loss is 0.9933704137802124
val_loss : 0.9433 : val_acc: 0.5177
updating stats...
Epoch: 5 starting...
step: 1: time taken for the last 1 steps is 0.8542066929999237, loss is 1.0468885898590088
step: 2: time taken for the last 1 steps is 0.5338339780000751, loss is 1.0379694700241089
step: 3: time taken for the last 1 steps is 0.5462448159998985, loss is 0.9869194030761719
step: 4: time taken for the last 1 steps is 0.5177978330000315, loss is 0.9640048146247864
val_loss : 0.9358 : val_acc: 0.5156
updating stats...
Epoch: 6 starting...
step: 1: time taken for the last 1 steps is 0.8200303359999452, loss is 0.9707807898521423
step: 2: time taken for the last 1 steps is 0.5328905009998834, loss is 1.0078496932983398
step: 3: time taken for the last 1 steps is 0.5508367090001229, loss is 0.9222326278686523
step: 4: time taken for the last 1 steps is 0.5185197949999747, loss is 0.9319679141044617
val_loss : 0.8963 : val_acc: 0.5656
updating stats...
Epoch: 7 starting...
step: 1: time taken for the last 1 steps is 0.8411699939999835, loss is 0.9370549321174622
step: 2: time taken for the last 1 steps is 0.5481458610001937, loss is 0.9615454077720642
step: 3: time taken for the last 1 steps is 0.5439990239999588, loss is 0.9261724948883057
step: 4: time taken for the last 1 steps is 0.5197624390000328, loss is 0.8765459656715393
val_loss : 0.8534 : val_acc: 0.6000
updating stats...
Epoch: 8 starting...
step: 1: time taken for the last 1 steps is 0.8368948150000506, loss is 0.9940316677093506
step: 2: time taken for the last 1 steps is 0.5447389679998196, loss is 0.995409369468689
step: 3: time taken for the last 1 steps is 0.5487850519998574, loss is 0.923244833946228
step: 4: time taken for the last 1 steps is 0.5231416009999066, loss is 0.9233924150466919
val_loss : 0.8564 : val_acc: 0.5813
updating stats...
Epoch: 9 starting...
step: 1: time taken for the last 1 steps is 0.8603359060000457, loss is 0.9624865055084229
step: 2: time taken for the last 1 steps is 0.5478670140000759, loss is 0.9661927223205566
step: 3: time taken for the last 1 steps is 0.5472375640001701, loss is 0.9105219841003418
step: 4: time taken for the last 1 steps is 0.5170502290000059, loss is 0.851849377155304
val_loss : 0.8624 : val_acc: 0.6198
updating stats...
Epoch: 10 starting...
step: 1: time taken for the last 1 steps is 0.8296574720000081, loss is 0.9686362147331238
step: 2: time taken for the last 1 steps is 0.5427856020000945, loss is 0.9338812828063965
step: 3: time taken for the last 1 steps is 0.5417603930000041, loss is 0.9548109769821167
step: 4: time taken for the last 1 steps is 0.516019690000121, loss is 0.8184503316879272
val_loss : 0.8334 : val_acc: 0.6250
updating stats...
Epoch: 11 starting...
step: 1: time taken for the last 1 steps is 0.8410934920000273, loss is 0.9091380834579468
step: 2: time taken for the last 1 steps is 0.5402776660000654, loss is 0.940418004989624
step: 3: time taken for the last 1 steps is 0.5349400569998579, loss is 0.9102849960327148
step: 4: time taken for the last 1 steps is 0.5205788169998868, loss is 0.8478797078132629
val_loss : 0.8206 : val_acc: 0.5844
updating stats...
Epoch: 12 starting...
step: 1: time taken for the last 1 steps is 0.8505435850001959, loss is 0.9116680026054382
step: 2: time taken for the last 1 steps is 0.5399741429998812, loss is 0.8749575614929199
step: 3: time taken for the last 1 steps is 0.5493994070000099, loss is 0.9562798738479614
step: 4: time taken for the last 1 steps is 0.5150371259999247, loss is 0.8270223736763
val_loss : 0.8233 : val_acc: 0.6281
updating stats...
Epoch: 13 starting...
step: 1: time taken for the last 1 steps is 0.8452627579999898, loss is 0.8943233489990234
step: 2: time taken for the last 1 steps is 0.5343376299999818, loss is 0.8784380555152893
step: 3: time taken for the last 1 steps is 0.5497094720001314, loss is 0.8737931251525879
step: 4: time taken for the last 1 steps is 0.5093470009999237, loss is 0.813238263130188
val_loss : 0.7962 : val_acc: 0.6302
updating stats...
Epoch: 14 starting...
step: 1: time taken for the last 1 steps is 0.8305475590000242, loss is 0.8451847434043884
step: 2: time taken for the last 1 steps is 0.5445413079999071, loss is 0.807092547416687
step: 3: time taken for the last 1 steps is 0.5390552670000943, loss is 0.8758931159973145
step: 4: time taken for the last 1 steps is 0.5196595600000364, loss is 0.7266584038734436
val_loss : 0.7612 : val_acc: 0.6823
updating stats...
Epoch: 15 starting...
step: 1: time taken for the last 1 steps is 0.8316008379999857, loss is 0.8146539926528931
step: 2: time taken for the last 1 steps is 0.5445479479999449, loss is 0.820121705532074
step: 3: time taken for the last 1 steps is 0.5425406500000918, loss is 0.904537558555603
step: 4: time taken for the last 1 steps is 0.5228884800001197, loss is 0.6548585295677185
val_loss : 0.7176 : val_acc: 0.6719
updating stats...
Epoch: 16 starting...
step: 1: time taken for the last 1 steps is 0.823359986000014, loss is 0.8075004816055298
step: 2: time taken for the last 1 steps is 0.5409990619998553, loss is 0.812919020652771
step: 3: time taken for the last 1 steps is 0.5505797979999443, loss is 0.828467607498169
step: 4: time taken for the last 1 steps is 0.5209054039999046, loss is 0.6972617506980896
val_loss : 0.6934 : val_acc: 0.6948
updating stats...
Epoch: 17 starting...
step: 1: time taken for the last 1 steps is 0.8451690969998253, loss is 0.8070997595787048
step: 2: time taken for the last 1 steps is 0.5381818199998634, loss is 0.7828678488731384
step: 3: time taken for the last 1 steps is 0.5363925880001261, loss is 0.805567741394043
step: 4: time taken for the last 1 steps is 0.5145254470000964, loss is 0.6182730197906494
val_loss : 0.6740 : val_acc: 0.7156
updating stats...
Epoch: 18 starting...
step: 1: time taken for the last 1 steps is 0.8435487469998861, loss is 0.7872871160507202
step: 2: time taken for the last 1 steps is 0.5391830889998346, loss is 0.8073642253875732
step: 3: time taken for the last 1 steps is 0.5447258409999449, loss is 0.755902886390686
step: 4: time taken for the last 1 steps is 0.5167048569999224, loss is 0.5741916298866272
val_loss : 0.6374 : val_acc: 0.7458
updating stats...
Epoch: 19 starting...
step: 1: time taken for the last 1 steps is 0.8201395880000746, loss is 0.7035937309265137
step: 2: time taken for the last 1 steps is 0.5415559329999269, loss is 0.7020411491394043
step: 3: time taken for the last 1 steps is 0.5529583820000425, loss is 0.6524328589439392
step: 4: time taken for the last 1 steps is 0.516060934999814, loss is 0.5203772187232971
val_loss : 0.6201 : val_acc: 0.7198
updating stats...
Epoch: 20 starting...
step: 1: time taken for the last 1 steps is 0.8128173129998686, loss is 0.741275429725647
step: 2: time taken for the last 1 steps is 0.540281240000013, loss is 0.7578439116477966
step: 3: time taken for the last 1 steps is 0.5390044559999296, loss is 0.7470228672027588
step: 4: time taken for the last 1 steps is 0.5127534240000386, loss is 0.502064049243927
val_loss : 0.6014 : val_acc: 0.7635
updating stats...
Epoch: 21 starting...
step: 1: time taken for the last 1 steps is 0.8213273700000627, loss is 0.6423299908638
step: 2: time taken for the last 1 steps is 0.5373384249999162, loss is 0.6209176778793335
step: 3: time taken for the last 1 steps is 0.5479359000000841, loss is 0.6622822880744934
step: 4: time taken for the last 1 steps is 0.5202187509999021, loss is 0.6154690980911255
val_loss : 0.5907 : val_acc: 0.7865
updating stats...
Epoch: 22 starting...
step: 1: time taken for the last 1 steps is 0.8286023829998612, loss is 0.6644025444984436
step: 2: time taken for the last 1 steps is 0.539339062000181, loss is 0.63299560546875
step: 3: time taken for the last 1 steps is 0.549679592000075, loss is 0.7135334014892578
step: 4: time taken for the last 1 steps is 0.5189134369998101, loss is 0.4888369143009186
val_loss : 0.5481 : val_acc: 0.7958
updating stats...
Epoch: 23 starting...
step: 1: time taken for the last 1 steps is 0.817078550999895, loss is 0.5884806513786316
step: 2: time taken for the last 1 steps is 0.5524680639998678, loss is 0.6131036281585693
step: 3: time taken for the last 1 steps is 0.5349797119999948, loss is 0.7055802941322327
step: 4: time taken for the last 1 steps is 0.5141654599999583, loss is 0.48324206471443176
val_loss : 0.5360 : val_acc: 0.7812
updating stats...
Epoch: 24 starting...
step: 1: time taken for the last 1 steps is 0.845998181999903, loss is 0.6357324719429016
step: 2: time taken for the last 1 steps is 0.551383153000188, loss is 0.6854644417762756
step: 3: time taken for the last 1 steps is 0.5418210679999902, loss is 0.653465986251831
step: 4: time taken for the last 1 steps is 0.5210236359998817, loss is 0.42653608322143555
val_loss : 0.5140 : val_acc: 0.7917
updating stats...
Epoch: 25 starting...
step: 1: time taken for the last 1 steps is 0.8431856199999856, loss is 0.6675330400466919
step: 2: time taken for the last 1 steps is 0.5453269219999584, loss is 0.6548640727996826
step: 3: time taken for the last 1 steps is 0.5423217469999599, loss is 0.5874381065368652
step: 4: time taken for the last 1 steps is 0.5117928060001304, loss is 0.4780173897743225
val_loss : 0.5461 : val_acc: 0.7479
updating stats...
Epoch: 26 starting...
step: 1: time taken for the last 1 steps is 0.8556660589999865, loss is 0.5913741588592529
step: 2: time taken for the last 1 steps is 0.5490591610000592, loss is 0.7334398627281189
step: 3: time taken for the last 1 steps is 0.5472029669999756, loss is 0.6245533227920532
step: 4: time taken for the last 1 steps is 0.5180356809999012, loss is 0.5323359966278076
val_loss : 0.4688 : val_acc: 0.7885
updating stats...
Epoch: 27 starting...
step: 1: time taken for the last 1 steps is 0.8381884880000143, loss is 0.5139099359512329
step: 2: time taken for the last 1 steps is 0.5411214450000443, loss is 0.693558931350708
step: 3: time taken for the last 1 steps is 0.5449236340000425, loss is 0.7740514278411865
step: 4: time taken for the last 1 steps is 0.510619153999869, loss is 0.47446271777153015
val_loss : 0.5068 : val_acc: 0.8010
updating stats...
Epoch: 28 starting...
step: 1: time taken for the last 1 steps is 0.8444867850000719, loss is 0.7158756256103516
step: 2: time taken for the last 1 steps is 0.5393332710000323, loss is 0.5535522699356079
step: 3: time taken for the last 1 steps is 0.5361991239999497, loss is 0.4926387071609497
step: 4: time taken for the last 1 steps is 0.5207769010000902, loss is 0.4277729392051697
val_loss : 0.5148 : val_acc: 0.8167
updating stats...
Epoch: 29 starting...
step: 1: time taken for the last 1 steps is 0.8343857690001641, loss is 0.7410185933113098
step: 2: time taken for the last 1 steps is 0.5437762440001279, loss is 0.4869172275066376
step: 3: time taken for the last 1 steps is 0.5517492899998615, loss is 0.6866953372955322
step: 4: time taken for the last 1 steps is 0.5145939980000094, loss is 0.3805162310600281
val_loss : 0.4761 : val_acc: 0.8188
updating stats...
Epoch: 30 starting...
step: 1: time taken for the last 1 steps is 0.8440308360000017, loss is 0.6032447814941406
step: 2: time taken for the last 1 steps is 0.543419096999969, loss is 0.600347638130188
step: 3: time taken for the last 1 steps is 0.5466232549999859, loss is 0.5694103240966797
step: 4: time taken for the last 1 steps is 0.5138032330000897, loss is 0.4858241379261017
val_loss : 0.5104 : val_acc: 0.7865
updating stats...
Epoch: 31 starting...
step: 1: time taken for the last 1 steps is 0.8438744630000201, loss is 0.6173492074012756
step: 2: time taken for the last 1 steps is 0.5304690500001925, loss is 0.5149618983268738
step: 3: time taken for the last 1 steps is 0.5430723810000018, loss is 0.5191231966018677
step: 4: time taken for the last 1 steps is 0.5110833629998979, loss is 0.35234522819519043
val_loss : 0.4901 : val_acc: 0.7812
updating stats...
Epoch: 32 starting...
step: 1: time taken for the last 1 steps is 0.8310574679999263, loss is 0.5714020133018494
step: 2: time taken for the last 1 steps is 0.544334913000057, loss is 0.5203770399093628
step: 3: time taken for the last 1 steps is 0.5487915250000697, loss is 0.5801374316215515
step: 4: time taken for the last 1 steps is 0.5201246789999914, loss is 0.424825519323349
val_loss : 0.4423 : val_acc: 0.8385
updating stats...
Epoch: 33 starting...
step: 1: time taken for the last 1 steps is 0.8399538709998069, loss is 0.6046094298362732
step: 2: time taken for the last 1 steps is 0.5468507399998543, loss is 0.5661307573318481
step: 3: time taken for the last 1 steps is 0.5371328119999816, loss is 0.47572943568229675
step: 4: time taken for the last 1 steps is 0.5206928190000326, loss is 0.3309897780418396
val_loss : 0.4648 : val_acc: 0.8031
updating stats...
Epoch: 34 starting...
step: 1: time taken for the last 1 steps is 0.8525505220000014, loss is 0.6189627051353455
step: 2: time taken for the last 1 steps is 0.5422852259998763, loss is 0.4379640817642212
step: 3: time taken for the last 1 steps is 0.5370040200000403, loss is 0.5288975834846497
step: 4: time taken for the last 1 steps is 0.5144844150001973, loss is 0.40403687953948975
val_loss : 0.4466 : val_acc: 0.8188
updating stats...
Epoch: 35 starting...
step: 1: time taken for the last 1 steps is 0.824652770000057, loss is 0.5330159068107605
step: 2: time taken for the last 1 steps is 0.5421480729999075, loss is 0.6689003705978394
step: 3: time taken for the last 1 steps is 0.5423894380001002, loss is 0.5194405913352966
step: 4: time taken for the last 1 steps is 0.5097219580000001, loss is 0.5041148066520691
val_loss : 0.3817 : val_acc: 0.8573
updating stats...
Epoch: 36 starting...
step: 1: time taken for the last 1 steps is 0.8299120480000965, loss is 0.5545355677604675
step: 2: time taken for the last 1 steps is 0.538976195000032, loss is 0.6755070686340332
step: 3: time taken for the last 1 steps is 0.5444121150001138, loss is 0.5365249514579773
step: 4: time taken for the last 1 steps is 0.5116072440000607, loss is 0.3271138370037079
val_loss : 0.4391 : val_acc: 0.8344
updating stats...
Epoch: 37 starting...
step: 1: time taken for the last 1 steps is 0.8488872850000462, loss is 0.5250250697135925
step: 2: time taken for the last 1 steps is 0.5463359200000468, loss is 0.4831342101097107
step: 3: time taken for the last 1 steps is 0.5439744970001357, loss is 0.4384065270423889
step: 4: time taken for the last 1 steps is 0.5170706730000347, loss is 0.31635811924934387
val_loss : 0.3799 : val_acc: 0.8271
updating stats...
Epoch: 38 starting...
step: 1: time taken for the last 1 steps is 0.827283578999868, loss is 0.4261717200279236
step: 2: time taken for the last 1 steps is 0.5493589659999998, loss is 0.6602377891540527
step: 3: time taken for the last 1 steps is 0.5416404340001009, loss is 0.3891771733760834
step: 4: time taken for the last 1 steps is 0.5168963799999347, loss is 0.27832353115081787
val_loss : 0.4075 : val_acc: 0.8323
updating stats...
Epoch: 39 starting...
step: 1: time taken for the last 1 steps is 0.8207659590000276, loss is 0.5376316905021667
step: 2: time taken for the last 1 steps is 0.5521770079999442, loss is 0.5012232661247253
step: 3: time taken for the last 1 steps is 0.5509890160001305, loss is 0.4218485653400421
step: 4: time taken for the last 1 steps is 0.5131002099999478, loss is 0.27662724256515503
val_loss : 0.4427 : val_acc: 0.8094
updating stats...
Epoch: 40 starting...
step: 1: time taken for the last 1 steps is 0.8376527479999822, loss is 0.4440532624721527
step: 2: time taken for the last 1 steps is 0.5510345570000936, loss is 0.6418067812919617
step: 3: time taken for the last 1 steps is 0.5421794839999166, loss is 0.46846866607666016
step: 4: time taken for the last 1 steps is 0.5162232870000025, loss is 0.2992960810661316
val_loss : 0.3319 : val_acc: 0.8750
updating stats...
Epoch: 41 starting...
step: 1: time taken for the last 1 steps is 0.844955083000059, loss is 0.4810296297073364
step: 2: time taken for the last 1 steps is 0.5364089280001281, loss is 0.6463090181350708
step: 3: time taken for the last 1 steps is 0.5384968659998322, loss is 0.37559396028518677
step: 4: time taken for the last 1 steps is 0.5182124150001073, loss is 0.29842817783355713
val_loss : 0.3802 : val_acc: 0.8344
updating stats...
Epoch: 42 starting...
step: 1: time taken for the last 1 steps is 0.8427844630000436, loss is 0.7462881207466125
step: 2: time taken for the last 1 steps is 0.5459101120000014, loss is 0.4627962112426758
step: 3: time taken for the last 1 steps is 0.5483047469999747, loss is 0.39968928694725037
step: 4: time taken for the last 1 steps is 0.5133590350001214, loss is 0.30692461133003235
val_loss : 0.3518 : val_acc: 0.8396
updating stats...
Epoch: 43 starting...
step: 1: time taken for the last 1 steps is 0.8322284299999865, loss is 0.453580379486084
step: 2: time taken for the last 1 steps is 0.5522283590000825, loss is 0.652605414390564
step: 3: time taken for the last 1 steps is 0.5406667170000219, loss is 0.5937989950180054
step: 4: time taken for the last 1 steps is 0.5092612500000087, loss is 0.3244580030441284
val_loss : 0.3701 : val_acc: 0.8771
updating stats...
Epoch: 44 starting...
step: 1: time taken for the last 1 steps is 0.8511122150000574, loss is 0.37136971950531006
step: 2: time taken for the last 1 steps is 0.5458711920000496, loss is 0.5553340315818787
step: 3: time taken for the last 1 steps is 0.5386855099998229, loss is 0.440286248922348
step: 4: time taken for the last 1 steps is 0.5118726779999179, loss is 0.38807573914527893
val_loss : 0.3996 : val_acc: 0.8396
updating stats...
Epoch: 45 starting...
step: 1: time taken for the last 1 steps is 0.8380329860001439, loss is 0.36037102341651917
step: 2: time taken for the last 1 steps is 0.5441072589999294, loss is 0.5116207599639893
step: 3: time taken for the last 1 steps is 0.5444170150001355, loss is 0.39522743225097656
step: 4: time taken for the last 1 steps is 0.512953798000126, loss is 0.28663620352745056
val_loss : 0.3575 : val_acc: 0.8677
updating stats...
Epoch: 46 starting...
step: 1: time taken for the last 1 steps is 0.8480264199999965, loss is 0.43381190299987793
step: 2: time taken for the last 1 steps is 0.5492876749999596, loss is 0.49097365140914917
step: 3: time taken for the last 1 steps is 0.5440264880000996, loss is 0.37951770424842834
step: 4: time taken for the last 1 steps is 0.5095594559998062, loss is 0.4139736294746399
val_loss : 0.3501 : val_acc: 0.8896
updating stats...
Epoch: 47 starting...
step: 1: time taken for the last 1 steps is 0.836202123000021, loss is 0.6317886710166931
step: 2: time taken for the last 1 steps is 0.5462563290000162, loss is 0.5238603949546814
step: 3: time taken for the last 1 steps is 0.5461061660000723, loss is 0.3395364582538605
step: 4: time taken for the last 1 steps is 0.5167777680001109, loss is 0.2931707501411438
val_loss : 0.3809 : val_acc: 0.8240
updating stats...
Epoch: 48 starting...
step: 1: time taken for the last 1 steps is 0.8398710999999821, loss is 0.5362442135810852
step: 2: time taken for the last 1 steps is 0.5510759789999611, loss is 0.5752094388008118
step: 3: time taken for the last 1 steps is 0.5389262559999679, loss is 0.3688044548034668
step: 4: time taken for the last 1 steps is 0.5081076100000246, loss is 0.4317777156829834
val_loss : 0.4348 : val_acc: 0.8010
updating stats...
Epoch: 49 starting...
step: 1: time taken for the last 1 steps is 0.8448712700001124, loss is 0.5630983114242554
step: 2: time taken for the last 1 steps is 0.5412297880000096, loss is 0.46068090200424194
step: 3: time taken for the last 1 steps is 0.5505470200000673, loss is 0.37136125564575195
step: 4: time taken for the last 1 steps is 0.516530904000092, loss is 0.3317767381668091
val_loss : 0.4261 : val_acc: 0.8010
updating stats...
Epoch: 50 starting...
step: 1: time taken for the last 1 steps is 0.8202758079999057, loss is 0.5593106150627136
step: 2: time taken for the last 1 steps is 0.5408029400000487, loss is 0.5105289816856384
step: 3: time taken for the last 1 steps is 0.5410066849999566, loss is 0.335543155670166
step: 4: time taken for the last 1 steps is 0.5107147069998064, loss is 0.2307049036026001
val_loss : 0.3462 : val_acc: 0.8698
updating stats...
Epoch: 51 starting...
step: 1: time taken for the last 1 steps is 0.8397289550000551, loss is 0.35260993242263794
step: 2: time taken for the last 1 steps is 0.5474934940000367, loss is 0.5112385749816895
step: 3: time taken for the last 1 steps is 0.550537919999897, loss is 0.46086472272872925
step: 4: time taken for the last 1 steps is 0.5098960029999944, loss is 0.21640625596046448
val_loss : 0.4139 : val_acc: 0.8344
updating stats...
Epoch: 52 starting...
step: 1: time taken for the last 1 steps is 0.8392931869998392, loss is 0.38207030296325684
step: 2: time taken for the last 1 steps is 0.5498887179999201, loss is 0.42851948738098145
step: 3: time taken for the last 1 steps is 0.5461494379999294, loss is 0.39450252056121826
step: 4: time taken for the last 1 steps is 0.5205169079999905, loss is 0.2882072925567627
val_loss : 0.3639 : val_acc: 0.8396
updating stats...
Epoch: 53 starting...
step: 1: time taken for the last 1 steps is 0.8299657860000025, loss is 0.46444156765937805
step: 2: time taken for the last 1 steps is 0.5380599709999387, loss is 0.45376309752464294
step: 3: time taken for the last 1 steps is 0.5410077950000414, loss is 0.3763642907142639
step: 4: time taken for the last 1 steps is 0.5171320459999151, loss is 0.22324706614017487
val_loss : 0.3446 : val_acc: 0.8573
updating stats...
Epoch: 54 starting...
step: 1: time taken for the last 1 steps is 0.8293039040001986, loss is 0.4434482455253601
step: 2: time taken for the last 1 steps is 0.5375601210000696, loss is 0.42038384079933167
step: 3: time taken for the last 1 steps is 0.5522127099998215, loss is 0.29192525148391724
step: 4: time taken for the last 1 steps is 0.5097539899998083, loss is 0.23594322800636292
val_loss : 0.3457 : val_acc: 0.8802
updating stats...
Epoch: 55 starting...
step: 1: time taken for the last 1 steps is 0.8469693990000451, loss is 0.5281389355659485
step: 2: time taken for the last 1 steps is 0.5464324149997992, loss is 0.29966476559638977
step: 3: time taken for the last 1 steps is 0.5468831629998476, loss is 0.3275134861469269
step: 4: time taken for the last 1 steps is 0.5148749950001275, loss is 0.3303925693035126
val_loss : 0.3158 : val_acc: 0.8729
updating stats...
Epoch: 56 starting...
step: 1: time taken for the last 1 steps is 0.8449668020000445, loss is 0.2864932417869568
step: 2: time taken for the last 1 steps is 0.5465323860000808, loss is 0.23786938190460205
step: 3: time taken for the last 1 steps is 0.5375252200001341, loss is 0.3883000314235687
step: 4: time taken for the last 1 steps is 0.5169753229999969, loss is 0.20424135029315948
val_loss : 0.4455 : val_acc: 0.8146
updating stats...
Epoch: 57 starting...
step: 1: time taken for the last 1 steps is 0.849637378000125, loss is 0.4567107856273651
step: 2: time taken for the last 1 steps is 0.5514313959999981, loss is 0.4020608067512512
step: 3: time taken for the last 1 steps is 0.5413075999999819, loss is 0.3310708999633789
step: 4: time taken for the last 1 steps is 0.5128373370000645, loss is 0.34602242708206177
val_loss : 0.3613 : val_acc: 0.8927
updating stats...
Epoch: 58 starting...
step: 1: time taken for the last 1 steps is 0.8438951729999644, loss is 0.5850158929824829
step: 2: time taken for the last 1 steps is 0.5430887830000302, loss is 0.2720194160938263
step: 3: time taken for the last 1 steps is 0.5457129410001471, loss is 0.32866501808166504
step: 4: time taken for the last 1 steps is 0.5113696589999108, loss is 0.2173653393983841
val_loss : 0.3391 : val_acc: 0.8573
updating stats...
Epoch: 59 starting...
step: 1: time taken for the last 1 steps is 0.8378961410001011, loss is 0.4399292767047882
step: 2: time taken for the last 1 steps is 0.54691432300001, loss is 0.4121926724910736
step: 3: time taken for the last 1 steps is 0.5394432160001088, loss is 0.4596976041793823
step: 4: time taken for the last 1 steps is 0.502688439999929, loss is 0.2787383198738098
val_loss : 0.4702 : val_acc: 0.8271
updating stats...
Epoch: 60 starting...
step: 1: time taken for the last 1 steps is 0.8405262799999491, loss is 0.4451938569545746
step: 2: time taken for the last 1 steps is 0.5427675870000712, loss is 0.5084255933761597
step: 3: time taken for the last 1 steps is 0.5422000660000776, loss is 0.3848440647125244
step: 4: time taken for the last 1 steps is 0.5146825799999988, loss is 0.28282225131988525
val_loss : 0.3265 : val_acc: 0.8802
updating stats...
Epoch: 61 starting...
step: 1: time taken for the last 1 steps is 0.8466105620000235, loss is 0.568873405456543
step: 2: time taken for the last 1 steps is 0.5408888520000801, loss is 0.5093417167663574
step: 3: time taken for the last 1 steps is 0.5481075150000834, loss is 0.44990864396095276
step: 4: time taken for the last 1 steps is 0.5142949930000213, loss is 0.37456196546554565
val_loss : 0.3508 : val_acc: 0.8479
updating stats...
Epoch: 62 starting...
step: 1: time taken for the last 1 steps is 0.8397469760000149, loss is 0.3454976975917816
step: 2: time taken for the last 1 steps is 0.5421872859999439, loss is 0.36569735407829285
step: 3: time taken for the last 1 steps is 0.5454590470001222, loss is 0.349128395318985
step: 4: time taken for the last 1 steps is 0.521013357000129, loss is 0.3244234025478363
val_loss : 0.3229 : val_acc: 0.8677
updating stats...
Epoch: 63 starting...
step: 1: time taken for the last 1 steps is 0.8384043819999079, loss is 0.2720220685005188
step: 2: time taken for the last 1 steps is 0.5417602179998084, loss is 0.5650380849838257
step: 3: time taken for the last 1 steps is 0.5458686639999542, loss is 0.40326714515686035
step: 4: time taken for the last 1 steps is 0.5084576670001297, loss is 0.2615705132484436
val_loss : 0.3903 : val_acc: 0.8448
updating stats...
Epoch: 64 starting...
step: 1: time taken for the last 1 steps is 0.8334515599999577, loss is 0.3811153769493103
step: 2: time taken for the last 1 steps is 0.5352437489998465, loss is 0.5216858983039856
step: 3: time taken for the last 1 steps is 0.5414920640000673, loss is 0.3633744716644287
step: 4: time taken for the last 1 steps is 0.5168853909999598, loss is 0.25555747747421265
val_loss : 0.3102 : val_acc: 0.8646
updating stats...
Epoch: 65 starting...
step: 1: time taken for the last 1 steps is 0.8375014040000224, loss is 0.33768174052238464
step: 2: time taken for the last 1 steps is 0.5470058949999839, loss is 0.4009026288986206
step: 3: time taken for the last 1 steps is 0.5428169679998973, loss is 0.31035277247428894
step: 4: time taken for the last 1 steps is 0.5117809579999175, loss is 0.32748943567276
val_loss : 0.3153 : val_acc: 0.8771
updating stats...
Epoch: 66 starting...
step: 1: time taken for the last 1 steps is 0.8408254260000376, loss is 0.24922168254852295
step: 2: time taken for the last 1 steps is 0.5391657110001233, loss is 0.38323304057121277
step: 3: time taken for the last 1 steps is 0.5439622890000919, loss is 0.32413971424102783
step: 4: time taken for the last 1 steps is 0.5183009869999751, loss is 0.20844615995883942
val_loss : 0.3758 : val_acc: 0.8552
updating stats...
Epoch: 67 starting...
step: 1: time taken for the last 1 steps is 0.852057713000022, loss is 0.41490158438682556
step: 2: time taken for the last 1 steps is 0.5440358390001165, loss is 0.43497130274772644
step: 3: time taken for the last 1 steps is 0.5487611369999286, loss is 0.39321666955947876
step: 4: time taken for the last 1 steps is 0.5184408790000816, loss is 0.21731966733932495
val_loss : 0.3267 : val_acc: 0.8677
updating stats...
Epoch: 68 starting...
step: 1: time taken for the last 1 steps is 0.8554738349998843, loss is 0.3155420422554016
step: 2: time taken for the last 1 steps is 0.5323744859999806, loss is 0.27047860622406006
step: 3: time taken for the last 1 steps is 0.5466998900001272, loss is 0.4239215552806854
step: 4: time taken for the last 1 steps is 0.5187169150001409, loss is 0.2991621494293213
val_loss : 0.4051 : val_acc: 0.8646
updating stats...
Epoch: 69 starting...
step: 1: time taken for the last 1 steps is 0.8485200370000712, loss is 0.5106467604637146
step: 2: time taken for the last 1 steps is 0.5404764950001208, loss is 0.4554342031478882
step: 3: time taken for the last 1 steps is 0.5405606459999035, loss is 0.3424634337425232
step: 4: time taken for the last 1 steps is 0.5136511620000874, loss is 0.5016503930091858
val_loss : 0.4224 : val_acc: 0.8500
updating stats...
Epoch: 70 starting...
step: 1: time taken for the last 1 steps is 0.8528716670000449, loss is 0.38263043761253357
step: 2: time taken for the last 1 steps is 0.5334491760002038, loss is 0.5316821932792664
step: 3: time taken for the last 1 steps is 0.5462976019998678, loss is 0.3921135663986206
step: 4: time taken for the last 1 steps is 0.5201237600001605, loss is 0.19029353559017181
val_loss : 0.3654 : val_acc: 0.8573
updating stats...
Epoch: 71 starting...
step: 1: time taken for the last 1 steps is 0.8307401800000207, loss is 0.33706891536712646
step: 2: time taken for the last 1 steps is 0.5415321939999558, loss is 0.34573251008987427
step: 3: time taken for the last 1 steps is 0.5472453889999542, loss is 0.3926294147968292
step: 4: time taken for the last 1 steps is 0.5240712319998693, loss is 0.3372357189655304
val_loss : 0.3655 : val_acc: 0.8750
updating stats...
Epoch: 72 starting...
step: 1: time taken for the last 1 steps is 0.8419285659999787, loss is 0.3913383483886719
step: 2: time taken for the last 1 steps is 0.547681257000022, loss is 0.3989218771457672
step: 3: time taken for the last 1 steps is 0.5463574130001234, loss is 0.29651570320129395
step: 4: time taken for the last 1 steps is 0.5173357090000081, loss is 0.1850368082523346
val_loss : 0.3600 : val_acc: 0.8573
updating stats...
Epoch: 73 starting...
step: 1: time taken for the last 1 steps is 0.8459032690000186, loss is 0.23566776514053345
step: 2: time taken for the last 1 steps is 0.5466515789998994, loss is 0.34271299839019775
step: 3: time taken for the last 1 steps is 0.5523532130000604, loss is 0.3732047379016876
step: 4: time taken for the last 1 steps is 0.5127695359999507, loss is 0.25547879934310913
val_loss : 0.3481 : val_acc: 0.8625
updating stats...
Epoch: 74 starting...
step: 1: time taken for the last 1 steps is 0.8197748190000311, loss is 0.33922043442726135
step: 2: time taken for the last 1 steps is 0.5446478920000573, loss is 0.42848658561706543
step: 3: time taken for the last 1 steps is 0.5361137450001934, loss is 0.27290964126586914
step: 4: time taken for the last 1 steps is 0.5089796459999434, loss is 0.27954408526420593
val_loss : 0.3030 : val_acc: 0.8750
updating stats...
Epoch: 75 starting...
step: 1: time taken for the last 1 steps is 0.8467514740000297, loss is 0.331319659948349
step: 2: time taken for the last 1 steps is 0.5418602899999314, loss is 0.4288283884525299
step: 3: time taken for the last 1 steps is 0.5481950270000198, loss is 0.22228719294071198
step: 4: time taken for the last 1 steps is 0.5060845130001326, loss is 0.29571616649627686
val_loss : 0.3303 : val_acc: 0.8427
updating stats...
Epoch: 76 starting...
step: 1: time taken for the last 1 steps is 0.8305346869999539, loss is 0.35639694333076477
step: 2: time taken for the last 1 steps is 0.5408402820000902, loss is 0.35386237502098083
step: 3: time taken for the last 1 steps is 0.5403758930001459, loss is 0.26564282178878784
step: 4: time taken for the last 1 steps is 0.52477868699998, loss is 0.12686072289943695
val_loss : 0.3505 : val_acc: 0.8729
updating stats...
Epoch: 77 starting...
step: 1: time taken for the last 1 steps is 0.8568825810000362, loss is 0.27486446499824524
step: 2: time taken for the last 1 steps is 0.5416424559998632, loss is 0.41230034828186035
step: 3: time taken for the last 1 steps is 0.5394882769999185, loss is 0.2750483751296997
step: 4: time taken for the last 1 steps is 0.5128158669999721, loss is 0.15927013754844666
val_loss : 0.3085 : val_acc: 0.8802
updating stats...
Epoch: 78 starting...
step: 1: time taken for the last 1 steps is 0.8594516079999721, loss is 0.5484926700592041
step: 2: time taken for the last 1 steps is 0.5450754480000342, loss is 0.22854410111904144
step: 3: time taken for the last 1 steps is 0.5403769429999556, loss is 0.33515334129333496
step: 4: time taken for the last 1 steps is 0.5095610470000338, loss is 0.13389679789543152
val_loss : 0.3982 : val_acc: 0.8750
updating stats...
Epoch: 79 starting...
step: 1: time taken for the last 1 steps is 0.8169919470001332, loss is 0.5357200503349304
step: 2: time taken for the last 1 steps is 0.541370790999963, loss is 0.28352922201156616
step: 3: time taken for the last 1 steps is 0.5333003729999746, loss is 0.3210703432559967
step: 4: time taken for the last 1 steps is 0.5238907899999958, loss is 0.17085234820842743
val_loss : 0.2579 : val_acc: 0.9115
updating stats...
Epoch: 80 starting...
step: 1: time taken for the last 1 steps is 0.8423897749999014, loss is 0.2799394428730011
step: 2: time taken for the last 1 steps is 0.5454400060000353, loss is 0.2804766893386841
step: 3: time taken for the last 1 steps is 0.5470311550000133, loss is 0.23797374963760376
step: 4: time taken for the last 1 steps is 0.5037543699997968, loss is 0.09351199865341187
val_loss : 0.3462 : val_acc: 0.8646
updating stats...
Epoch: 81 starting...
step: 1: time taken for the last 1 steps is 0.8285302890001276, loss is 0.3739168643951416
step: 2: time taken for the last 1 steps is 0.5337869119998686, loss is 0.28762388229370117
step: 3: time taken for the last 1 steps is 0.5460798369999793, loss is 0.2056814283132553
step: 4: time taken for the last 1 steps is 0.5178612699999121, loss is 0.2639504373073578
val_loss : 0.2875 : val_acc: 0.9062
updating stats...
Epoch: 82 starting...
step: 1: time taken for the last 1 steps is 0.8433162610001546, loss is 0.3289852440357208
step: 2: time taken for the last 1 steps is 0.5380572910000865, loss is 0.31694847345352173
step: 3: time taken for the last 1 steps is 0.540467045000014, loss is 0.19613215327262878
step: 4: time taken for the last 1 steps is 0.5058267980000437, loss is 0.19082005321979523
val_loss : 0.4089 : val_acc: 0.8073
updating stats...
Epoch: 83 starting...
step: 1: time taken for the last 1 steps is 0.8528771170001619, loss is 0.2890875041484833
step: 2: time taken for the last 1 steps is 0.5317863540001326, loss is 0.25270548462867737
step: 3: time taken for the last 1 steps is 0.5389012760001606, loss is 0.31929105520248413
step: 4: time taken for the last 1 steps is 0.5169785929999762, loss is 0.19591136276721954
val_loss : 0.3671 : val_acc: 0.8583
updating stats...
Epoch: 84 starting...
step: 1: time taken for the last 1 steps is 0.8625459649999812, loss is 0.3308655023574829
step: 2: time taken for the last 1 steps is 0.5426433449999877, loss is 0.2794710099697113
step: 3: time taken for the last 1 steps is 0.5479242720000457, loss is 0.2557043433189392
step: 4: time taken for the last 1 steps is 0.5099157330000708, loss is 0.22920313477516174
val_loss : 0.2672 : val_acc: 0.9052
updating stats...
Epoch: 85 starting...
step: 1: time taken for the last 1 steps is 0.8443998509999346, loss is 0.31487056612968445
step: 2: time taken for the last 1 steps is 0.5379034229999888, loss is 0.2556721270084381
step: 3: time taken for the last 1 steps is 0.5424145960000715, loss is 0.2705182135105133
step: 4: time taken for the last 1 steps is 0.5129072239999459, loss is 0.19406041502952576
val_loss : 0.2811 : val_acc: 0.8958
updating stats...
Epoch: 86 starting...
step: 1: time taken for the last 1 steps is 0.846795345999908, loss is 0.24836136400699615
step: 2: time taken for the last 1 steps is 0.538678038000171, loss is 0.3314928710460663
step: 3: time taken for the last 1 steps is 0.5435622569998486, loss is 0.24494993686676025
step: 4: time taken for the last 1 steps is 0.5042944159999934, loss is 0.23353390395641327
val_loss : 0.3508 : val_acc: 0.8958
updating stats...
Epoch: 87 starting...
step: 1: time taken for the last 1 steps is 0.8483938850001778, loss is 0.21342074871063232
step: 2: time taken for the last 1 steps is 0.5361883020000278, loss is 0.2538529932498932
step: 3: time taken for the last 1 steps is 0.5459579719999965, loss is 0.215516597032547
step: 4: time taken for the last 1 steps is 0.5138188609998906, loss is 0.08187326043844223
val_loss : 0.2932 : val_acc: 0.8708
updating stats...
Epoch: 88 starting...
step: 1: time taken for the last 1 steps is 0.8284909889998744, loss is 0.32094860076904297
step: 2: time taken for the last 1 steps is 0.546609202999889, loss is 0.3729039132595062
step: 3: time taken for the last 1 steps is 0.5420560200000182, loss is 0.2696378827095032
step: 4: time taken for the last 1 steps is 0.5149517509998986, loss is 0.233515202999115
val_loss : 0.2802 : val_acc: 0.8479
updating stats...
Epoch: 89 starting...
step: 1: time taken for the last 1 steps is 0.8260963650000122, loss is 0.1977672129869461
step: 2: time taken for the last 1 steps is 0.5481952229999933, loss is 0.3694140911102295
step: 3: time taken for the last 1 steps is 0.5431735600000138, loss is 0.312104731798172
step: 4: time taken for the last 1 steps is 0.5116949620000923, loss is 0.3138490617275238
val_loss : 0.4317 : val_acc: 0.8198
updating stats...
Epoch: 90 starting...
step: 1: time taken for the last 1 steps is 0.8226159700000153, loss is 0.3480829894542694
step: 2: time taken for the last 1 steps is 0.5399158699999589, loss is 0.35792312026023865
step: 3: time taken for the last 1 steps is 0.5459028509999371, loss is 0.3412383198738098
step: 4: time taken for the last 1 steps is 0.5141728070000227, loss is 0.2192990630865097
val_loss : 0.3090 : val_acc: 0.8708
updating stats...
Epoch: 91 starting...
step: 1: time taken for the last 1 steps is 0.8144271090000075, loss is 0.5013067126274109
step: 2: time taken for the last 1 steps is 0.5394525320000412, loss is 0.32031697034835815
step: 3: time taken for the last 1 steps is 0.5470062710001002, loss is 0.534544825553894
step: 4: time taken for the last 1 steps is 0.5077366279999751, loss is 0.21631096303462982
val_loss : 0.3675 : val_acc: 0.8521
updating stats...
Epoch: 92 starting...
step: 1: time taken for the last 1 steps is 0.8319976440000119, loss is 0.38171812891960144
step: 2: time taken for the last 1 steps is 0.5485370289998173, loss is 0.37269327044487
step: 3: time taken for the last 1 steps is 0.5507007799999428, loss is 0.2330285757780075
step: 4: time taken for the last 1 steps is 0.522854617999883, loss is 0.34924086928367615
val_loss : 0.4231 : val_acc: 0.8427
updating stats...
Epoch: 93 starting...
step: 1: time taken for the last 1 steps is 0.8457249660000343, loss is 0.40017637610435486
step: 2: time taken for the last 1 steps is 0.5381915989999015, loss is 0.46867069602012634
step: 3: time taken for the last 1 steps is 0.5425971100000879, loss is 0.30734196305274963
step: 4: time taken for the last 1 steps is 0.5154852010000468, loss is 0.15804480016231537
val_loss : 0.3035 : val_acc: 0.8906
updating stats...
Epoch: 94 starting...
step: 1: time taken for the last 1 steps is 0.8326172049999059, loss is 0.381089985370636
step: 2: time taken for the last 1 steps is 0.546245196999962, loss is 0.32271283864974976
step: 3: time taken for the last 1 steps is 0.545861830000149, loss is 0.34432435035705566
step: 4: time taken for the last 1 steps is 0.5125337469999067, loss is 0.20784370601177216
val_loss : 0.3086 : val_acc: 0.8885
updating stats...
Epoch: 95 starting...
step: 1: time taken for the last 1 steps is 0.8247717899998861, loss is 0.2757062613964081
step: 2: time taken for the last 1 steps is 0.5348315070000353, loss is 0.1893559992313385
step: 3: time taken for the last 1 steps is 0.5348839390001103, loss is 0.3900822103023529
step: 4: time taken for the last 1 steps is 0.5175287689999095, loss is 0.18327276408672333
val_loss : 0.3094 : val_acc: 0.8958
updating stats...
Epoch: 96 starting...
step: 1: time taken for the last 1 steps is 0.8573793699999896, loss is 0.2578771114349365
step: 2: time taken for the last 1 steps is 0.544312102000049, loss is 0.17483782768249512
step: 3: time taken for the last 1 steps is 0.542864654999903, loss is 0.21440821886062622
step: 4: time taken for the last 1 steps is 0.519148838999854, loss is 0.20658259093761444
val_loss : 0.2842 : val_acc: 0.9240
updating stats...
Epoch: 97 starting...
step: 1: time taken for the last 1 steps is 0.8408754869999484, loss is 0.48936226963996887
step: 2: time taken for the last 1 steps is 0.5353012559999115, loss is 0.21120376884937286
step: 3: time taken for the last 1 steps is 0.5482557739999265, loss is 0.2297537475824356
step: 4: time taken for the last 1 steps is 0.5141605169999366, loss is 0.1071009635925293
val_loss : 0.3520 : val_acc: 0.8604
updating stats...
Epoch: 98 starting...
step: 1: time taken for the last 1 steps is 0.832407380999939, loss is 0.19704879820346832
step: 2: time taken for the last 1 steps is 0.5466210540000702, loss is 0.2286141812801361
step: 3: time taken for the last 1 steps is 0.5488970760000029, loss is 0.21051321923732758
step: 4: time taken for the last 1 steps is 0.5142117979999057, loss is 0.08688507229089737
val_loss : 0.3973 : val_acc: 0.8479
updating stats...
Epoch: 99 starting...
step: 1: time taken for the last 1 steps is 0.8239628349999748, loss is 0.21949981153011322
step: 2: time taken for the last 1 steps is 0.5429894970000078, loss is 0.21821193397045135
step: 3: time taken for the last 1 steps is 0.5375028559999464, loss is 0.3166569471359253
step: 4: time taken for the last 1 steps is 0.5203485810000075, loss is 0.12969240546226501
val_loss : 0.3300 : val_acc: 0.8531
updating stats...
Epoch: 100 starting...
step: 1: time taken for the last 1 steps is 0.8347128929999599, loss is 0.2797228991985321
step: 2: time taken for the last 1 steps is 0.5388064209998902, loss is 0.30911651253700256
step: 3: time taken for the last 1 steps is 0.5480606800001624, loss is 0.14625027775764465
step: 4: time taken for the last 1 steps is 0.5064257250000992, loss is 0.1337253898382187
val_loss : 0.3273 : val_acc: 0.8802
updating stats...
Epoch: 101 starting...
step: 1: time taken for the last 1 steps is 0.8286636109999108, loss is 0.4220364987850189
step: 2: time taken for the last 1 steps is 0.5425721399999475, loss is 0.24942083656787872
step: 3: time taken for the last 1 steps is 0.5473255770000378, loss is 0.24028123915195465
step: 4: time taken for the last 1 steps is 0.5154505710002013, loss is 0.09368717670440674
val_loss : 0.2888 : val_acc: 0.8677
updating stats...
Epoch: 102 starting...
step: 1: time taken for the last 1 steps is 0.8495685350001168, loss is 0.2553580701351166
step: 2: time taken for the last 1 steps is 0.543228771000031, loss is 0.24219229817390442
step: 3: time taken for the last 1 steps is 0.5405446830000074, loss is 0.1835995316505432
step: 4: time taken for the last 1 steps is 0.5211338950000481, loss is 0.13723914325237274
val_loss : 0.3508 : val_acc: 0.8729
updating stats...
Epoch: 103 starting...
step: 1: time taken for the last 1 steps is 0.8557737510000152, loss is 0.32227328419685364
step: 2: time taken for the last 1 steps is 0.5414626690001114, loss is 0.23622240126132965
step: 3: time taken for the last 1 steps is 0.5423972959999901, loss is 0.10949648916721344
step: 4: time taken for the last 1 steps is 0.5153621590000057, loss is 0.0485488586127758
val_loss : 0.3259 : val_acc: 0.8729
updating stats...
Epoch: 104 starting...
step: 1: time taken for the last 1 steps is 0.8483487039998181, loss is 0.23121657967567444
step: 2: time taken for the last 1 steps is 0.5423038339999948, loss is 0.18137679994106293
step: 3: time taken for the last 1 steps is 0.5458056299999043, loss is 0.13929536938667297
step: 4: time taken for the last 1 steps is 0.5089244400001007, loss is 0.24883422255516052
val_loss : 0.4846 : val_acc: 0.8198
updating stats...
Epoch: 105 starting...
step: 1: time taken for the last 1 steps is 0.8190294949999952, loss is 0.27593880891799927
step: 2: time taken for the last 1 steps is 0.5409724809999261, loss is 0.32822275161743164
step: 3: time taken for the last 1 steps is 0.5450738449999335, loss is 0.23556791245937347
step: 4: time taken for the last 1 steps is 0.507785130000002, loss is 0.27463263273239136
val_loss : 0.3760 : val_acc: 0.8604
updating stats...
Epoch: 106 starting...
step: 1: time taken for the last 1 steps is 0.8207827669998551, loss is 0.1473020315170288
step: 2: time taken for the last 1 steps is 0.5431136489999062, loss is 0.2801692485809326
step: 3: time taken for the last 1 steps is 0.5362593540000944, loss is 0.2647358477115631
step: 4: time taken for the last 1 steps is 0.5162192150000919, loss is 0.12988831102848053
val_loss : 0.3343 : val_acc: 0.8375
updating stats...
Epoch: 107 starting...
step: 1: time taken for the last 1 steps is 0.833228045000169, loss is 0.2196303904056549
step: 2: time taken for the last 1 steps is 0.5532953770000404, loss is 0.27693474292755127
step: 3: time taken for the last 1 steps is 0.5486186910000015, loss is 0.2350563108921051
step: 4: time taken for the last 1 steps is 0.514876250000043, loss is 0.3257180154323578
val_loss : 0.3459 : val_acc: 0.8708
updating stats...
Epoch: 108 starting...
step: 1: time taken for the last 1 steps is 0.8397798670000611, loss is 0.21648883819580078
step: 2: time taken for the last 1 steps is 0.5493833850000556, loss is 0.22610464692115784
step: 3: time taken for the last 1 steps is 0.5383823419999771, loss is 0.21012021601200104
step: 4: time taken for the last 1 steps is 0.5194201639999392, loss is 0.18727920949459076
val_loss : 0.3495 : val_acc: 0.8656
updating stats...
Epoch: 109 starting...
step: 1: time taken for the last 1 steps is 0.8335427119998258, loss is 0.31226015090942383
step: 2: time taken for the last 1 steps is 0.5509009030001835, loss is 0.10334939509630203
step: 3: time taken for the last 1 steps is 0.5508179319999726, loss is 0.3445267081260681
step: 4: time taken for the last 1 steps is 0.5160869119999916, loss is 0.1566741019487381
val_loss : 0.3433 : val_acc: 0.8604
updating stats...
Epoch: 110 starting...
step: 1: time taken for the last 1 steps is 0.8238666029999422, loss is 0.20163756608963013
step: 2: time taken for the last 1 steps is 0.5432443019999482, loss is 0.22355687618255615
step: 3: time taken for the last 1 steps is 0.5457815080001183, loss is 0.19716985523700714
step: 4: time taken for the last 1 steps is 0.5160539320002044, loss is 0.08098874241113663
val_loss : 0.3423 : val_acc: 0.8802
updating stats...
Epoch: 111 starting...
step: 1: time taken for the last 1 steps is 0.833992470000112, loss is 0.2133871465921402
step: 2: time taken for the last 1 steps is 0.535491428999876, loss is 0.3107958137989044
step: 3: time taken for the last 1 steps is 0.5450893550000728, loss is 0.2364930510520935
step: 4: time taken for the last 1 steps is 0.5167933559998801, loss is 0.13913331925868988
val_loss : 0.3341 : val_acc: 0.8625
updating stats...
Epoch: 112 starting...
step: 1: time taken for the last 1 steps is 0.8597529430001032, loss is 0.22639983892440796
step: 2: time taken for the last 1 steps is 0.5488661859999411, loss is 0.1658080816268921
step: 3: time taken for the last 1 steps is 0.5490329780000138, loss is 0.23099306225776672
step: 4: time taken for the last 1 steps is 0.5184978470001624, loss is 0.16578662395477295
val_loss : 0.2739 : val_acc: 0.8604
updating stats...
Epoch: 113 starting...
step: 1: time taken for the last 1 steps is 0.8315173449998383, loss is 0.16881488263607025
step: 2: time taken for the last 1 steps is 0.5442046499999833, loss is 0.24062001705169678
step: 3: time taken for the last 1 steps is 0.5495735189999778, loss is 0.18358272314071655
step: 4: time taken for the last 1 steps is 0.5188580930000626, loss is 0.06868131458759308
val_loss : 0.3465 : val_acc: 0.8656
updating stats...
Epoch: 114 starting...
step: 1: time taken for the last 1 steps is 0.8331722440000249, loss is 0.23260445892810822
step: 2: time taken for the last 1 steps is 0.5399141509999481, loss is 0.13639242947101593
step: 3: time taken for the last 1 steps is 0.5524316199998793, loss is 0.1224542111158371
step: 4: time taken for the last 1 steps is 0.5208255100001224, loss is 0.04217711463570595
val_loss : 0.3420 : val_acc: 0.8583
updating stats...
Epoch: 115 starting...
step: 1: time taken for the last 1 steps is 0.835538058000111, loss is 0.15135188400745392
step: 2: time taken for the last 1 steps is 0.5345559130000765, loss is 0.2830204963684082
step: 3: time taken for the last 1 steps is 0.5462675979999858, loss is 0.1491066962480545
step: 4: time taken for the last 1 steps is 0.5157286559999648, loss is 0.18703991174697876
val_loss : 0.3906 : val_acc: 0.8552
updating stats...
Epoch: 116 starting...
step: 1: time taken for the last 1 steps is 0.8388663689997884, loss is 0.15969131886959076
step: 2: time taken for the last 1 steps is 0.5514538630000061, loss is 0.20289331674575806
step: 3: time taken for the last 1 steps is 0.5449429529999179, loss is 0.14836043119430542
step: 4: time taken for the last 1 steps is 0.5075112750000699, loss is 0.21727649867534637
val_loss : 0.2394 : val_acc: 0.8906
updating stats...
Epoch: 117 starting...
step: 1: time taken for the last 1 steps is 0.8362406510000255, loss is 0.18652546405792236
step: 2: time taken for the last 1 steps is 0.5434661660001439, loss is 0.29893532395362854
step: 3: time taken for the last 1 steps is 0.5366016700002092, loss is 0.13873383402824402
step: 4: time taken for the last 1 steps is 0.5155686030000197, loss is 0.3674727976322174
val_loss : 0.3146 : val_acc: 0.9083
updating stats...
Epoch: 118 starting...
step: 1: time taken for the last 1 steps is 0.8333909379998659, loss is 0.3156467378139496
step: 2: time taken for the last 1 steps is 0.5443394620001527, loss is 0.3926163911819458
step: 3: time taken for the last 1 steps is 0.5497580119999839, loss is 0.22245709598064423
step: 4: time taken for the last 1 steps is 0.5229863389999991, loss is 0.11398866772651672
val_loss : 0.3542 : val_acc: 0.8552
updating stats...
Epoch: 119 starting...
step: 1: time taken for the last 1 steps is 0.8392318069998055, loss is 0.2782702147960663
step: 2: time taken for the last 1 steps is 0.5421284110000215, loss is 0.22524024546146393
step: 3: time taken for the last 1 steps is 0.5474166179999429, loss is 0.14453952014446259
step: 4: time taken for the last 1 steps is 0.5285316919998877, loss is 0.15003451704978943
val_loss : 0.3089 : val_acc: 0.8927
updating stats...
Epoch: 120 starting...
step: 1: time taken for the last 1 steps is 0.8547551619999467, loss is 0.3001267910003662
step: 2: time taken for the last 1 steps is 0.5518926409999949, loss is 0.1999441385269165
step: 3: time taken for the last 1 steps is 0.5369474159999754, loss is 0.17225666344165802
step: 4: time taken for the last 1 steps is 0.5100306709998677, loss is 0.07319803535938263
val_loss : 0.3874 : val_acc: 0.8354
updating stats...
Epoch: 121 starting...
step: 1: time taken for the last 1 steps is 0.8653296170000431, loss is 0.22975504398345947
step: 2: time taken for the last 1 steps is 0.5409945710000557, loss is 0.1557852029800415
step: 3: time taken for the last 1 steps is 0.5443037110001114, loss is 0.15954366326332092
step: 4: time taken for the last 1 steps is 0.5150099229999796, loss is 0.12123353779315948
val_loss : 0.3285 : val_acc: 0.8833
updating stats...
Epoch: 122 starting...
step: 1: time taken for the last 1 steps is 0.8281160419999196, loss is 0.3289310932159424
step: 2: time taken for the last 1 steps is 0.5468061770000077, loss is 0.269641637802124
step: 3: time taken for the last 1 steps is 0.5506677790001504, loss is 0.1859942525625229