-
Notifications
You must be signed in to change notification settings - Fork 0
/
transpose-transpose.csv
We can make this file beautiful and searchable if this error is corrected: It looks like row 9 should actually have 1 column, instead of 3 in line 8.
924 lines (924 loc) · 201 KB
/
transpose-transpose.csv
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
2022-08-25T20:17:12-07:00
Running ./build/bin/nvfuser_bench
Run on (64 X 3700 MHz CPU s)
CPU Caches:
L1 Data 32 KiB (x32)
L1 Instruction 32 KiB (x32)
L2 Unified 512 KiB (x32)
L3 Unified 16384 KiB (x8)
Load Average: 0.41, 0.52, 0.77
***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead.
***WARNING*** Library was built as DEBUG. Timings may be affected.
name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/9/2408",5628,124.817,124.595,us,2.08727e+09,,"Tile size: (8,8)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/32)/grid(1/1/602)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/16/512",5767,124.966,124.739,us,7.88077e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/128)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/18/96",5709,125.326,125.109,us,1.65744e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/36)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/24/96",5506,124.908,124.685,us,2.21743e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/36)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/24/256",5476,124.989,124.75,us,5.91006e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/96)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/24/512",5439,125.397,125.173,us,1.17801e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/27",5592,124.977,124.757,us,8.31057e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/16)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/96",5615,124.934,124.704,us,2.95612e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/48)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/288",5539,124.982,124.765,us,8.86401e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/144)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/864",5491,125.412,125.208,us,2.64979e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/432)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/40/120",5471,124.979,124.767,us,4.6166e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/75)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/48/128",5621,124.674,124.486,us,5.92258e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/96)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/48/256",5624,125.102,124.915,us,1.18045e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/512",5540,126.353,126.103,us,2.38738e+09,,"Tile size: (8,8)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/32)/grid(1/1/448)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/1024",5487,126.158,125.924,us,4.78156e+09,,"Tile size: (8,8)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/32)/grid(1/1/896)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/2048",5636,123.87,123.702,us,9.73484e+09,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/4608",5515,126.334,126.149,us,2.14785e+10,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/64",5634,121.891,121.725,us,4.03797e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/64)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/96",5665,122.104,121.929,us,6.04677e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/96)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/128",5635,121.875,121.701,us,8.07747e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/128)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/147",5721,122.604,122.415,us,9.22241e+08,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/152)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/192",5737,122.081,121.906,us,1.20959e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/256",5584,122.157,121.952,us,1.61217e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/256)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/288",5733,122.217,122.043,us,1.81234e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/288)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/512",5697,123.189,122.996,us,3.19697e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/80/64",5733,121.943,121.76,us,5.04599e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/80)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/81/1728",5595,124.926,124.734,us,1.34656e+10,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/162)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/83/1728",5602,125.202,125.017,us,1.37669e+10,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/162)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/96/864",5681,123.562,123.384,us,8.0669e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1296)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/100/1280",5652,124.254,124.047,us,1.23824e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/160)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/100/4032",5522,128.038,127.862,us,3.78407e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/504)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/120/40",5585,125.379,125.169,us,4.60179e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/75)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/128/128",5578,125.67,125.452,us,1.5672e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/256)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/128/512",5715,122.538,122.339,us,6.42828e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1024)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/128/1152",5666,123.701,123.518,us,1.43256e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/144)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/128",5746,122.162,121.996,us,2.41739e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/384)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/256",5570,124.289,124.089,us,4.75325e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/768)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/720",5592,125.068,124.871,us,1.32848e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/138)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/768",5666,123.733,123.545,us,1.43225e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/144)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/1120",5602,124.987,124.8,us,2.0677e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/210)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/1728",5589,128.583,128.369,us,3.10146e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/324)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/256",5700,122.764,122.59,us,4.91159e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/800)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/512",5653,123.853,123.677,us,9.73684e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/112)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/1024",5597,125.192,125.012,us,1.92658e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/224)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/2304",5476,128.491,128.104,us,4.23016e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/504)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/256/256",5703,122.981,122.764,us,6.40603e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1024)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/256/1024",5753,125.727,125.539,us,2.50578e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/256/2304",5527,129.59,129.39,us,5.47021e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/576)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/284/512",5531,126.624,126.454,us,1.37987e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/144)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/320/1280",5394,130.071,129.864,us,3.78489e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/400)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/320/1728",5313,131.788,131.566,us,5.04351e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/540)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/324/2592",5136,136.446,136.191,us,7.39966e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/891)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/361/768",5438,126.74,126.56,us,2.62877e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/361/1120",5505,128.848,128.658,us,3.77112e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/420)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/2",5427,125.76,125.569,us,7.33937e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/12)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/32",5596,125.172,125.002,us,1.17963e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/128",5721,125.802,125.55,us,4.69792e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/768)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/256",5515,126.721,126.469,us,9.32759e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/96)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/512",5480,127.856,127.609,us,1.84885e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/1280",5349,131.109,130.883,us,4.50648e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/480)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/2592",5083,137.799,137.561,us,8.68266e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/972)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/4032",4768,144.737,144.517,us,1.28562e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1512)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/448/1280",5288,132.783,132.535,us,5.19205e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/560)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/480/16",5758,125.436,125.221,us,7.3598e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/120)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/480/256",5671,126.799,126.581,us,1.16491e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/120)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/2",5730,124.279,124.077,us,9.90353e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/16)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/16",5738,123.479,123.272,us,7.97459e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/128)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/128",5552,126.418,126.206,us,6.23136e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1024)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/256",5667,125.724,125.542,us,1.25286e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/1024",5310,132.042,131.818,us,4.77282e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/2048",5100,139.474,139.238,us,9.03697e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/3072",4783,147.283,147.007,us,1.28391e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/4608",4522,156.574,156.326,us,1.81106e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2304)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/40",5634,126.195,125.986,us,2.987e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/490)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/120",5461,128.201,128.001,us,8.8199e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/100)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/128",5526,127.643,127.456,us,9.44814e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/100)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/1152",5086,137.739,137.498,us,7.8823e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/900)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1001/2408",4377,158.305,158.065,us,1.82994e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2432)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/16",5734,125.437,125.222,us,1.57007e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/256)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/256",5426,128.813,128.579,us,2.44654e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/512",5389,132.058,131.847,us,4.77179e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/1024",5164,137.035,136.805,us,9.19772e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/3072",4217,166.24,166.005,us,2.27395e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1369/192",5526,130.047,129.851,us,2.42908e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/258)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1369/256",5389,131.545,131.325,us,3.20241e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/344)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1369/288",5289,132.575,132.336,us,3.5752e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/387)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/2048/512",5031,139.063,138.807,us,9.06505e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/2048/1024",4589,154.146,153.9,us,1.6352e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/2250/27",5615,126.622,126.389,us,5.76789e+09,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/1128)/256]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/3072/512",4762,146.66,146.428,us,1.28899e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/3072/1024",4184,167.179,166.929,us,2.26137e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/3136/64",5502,127.369,127.143,us,1.89428e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/196)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/5329/720",3907,179.199,178.926,us,2.57327e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3841)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/5625/64",5331,131.624,131.389,us,3.28795e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/352)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/12544/147",4659,150.731,150.494,us,1.47033e+11,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/4096]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/22201/288",3233,216.173,215.85,us,3.55463e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/6246)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/9/2408",5270,133.907,133.662,us,6.22617e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/684)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/16/512",5535,128.003,127.796,us,2.46151e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/18/96",5647,126.836,126.658,us,5.23891e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/24/96",5499,127.627,127.407,us,6.94419e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/24/256",5421,129.551,129.347,us,1.82401e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/24/512",5306,133.226,133.016,us,3.5474e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/27",5648,127.043,126.808,us,2.61637e+09,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/96",5622,125.937,125.738,us,9.38177e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/96)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/288",5477,127.845,127.672,us,2.7719e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/864",5052,139.246,139.015,us,7.63717e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/864)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/40/120",5377,130.447,130.204,us,1.41562e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/48/128",5446,130.174,129.942,us,1.81565e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/48/256",5251,133.394,133.152,us,3.54376e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/512",5092,137.82,137.581,us,7.00226e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/1024",4670,150.171,149.917,us,1.28522e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/2048",4048,172.465,172.204,us,2.23776e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/4608",3068,228.393,228.013,us,3.8026e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/64",5638,124.372,124.178,us,1.26662e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/96",5574,125.604,125.424,us,1.88105e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/128",5540,127.065,126.884,us,2.47922e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/147",5424,129.243,129.046,us,2.79951e+10,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/192",5415,129.439,129.239,us,3.65106e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/256",5346,131.189,130.98,us,4.80336e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/288",5303,132.004,131.818,us,5.36945e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/576)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/512",5097,137.865,137.658,us,9.14068e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/80/64",5549,126.448,126.252,us,1.55726e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/81/1728",3666,190.231,189.926,us,2.82993e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/83/1728",3635,193.39,193.114,us,2.85193e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/96/864",4360,160.802,160.533,us,1.98404e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2592)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/100/1280",3813,183.68,183.417,us,2.6798e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/100/4032",2240,313.048,312.573,us,4.95337e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/120/40",5566,126.104,125.931,us,1.46366e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/128/128",5354,130.578,130.403,us,4.82462e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/128/512",4604,153.68,153.473,us,1.63976e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/128/1152",3669,190.421,190.172,us,2.97747e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/128",5258,133.337,133.144,us,7.08795e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/256",4820,147.965,147.744,us,1.2775e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/720",3728,187.795,187.521,us,2.83084e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4416)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/768",3680,190.537,190.244,us,2.97635e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/1120",3159,221.566,221.232,us,3.73252e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/6720)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/1728",2535,276.404,276.024,us,4.61562e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/10368)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/256",4751,147.551,147.33,us,1.30778e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1792)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/512",4088,171.471,171.223,us,2.25058e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3584)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/1024",3211,218.12,217.826,us,3.53817e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/7168)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/2304",2084,336.166,335.629,us,5.16666e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/256/256",4596,152.686,152.46,us,1.65065e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/256/1024",2887,245.418,245.05,us,4.10787e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/256/2304",1781,393.463,392.86,us,5.76521e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/18432)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/284/512",3626,192.989,192.724,us,2.89723e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/320/1280",2254,311.275,310.783,us,5.06098e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/12800)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/320/1728",1845,377.854,377.314,us,5.62759e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/17280)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/324/2592",1289,521.321,520.62,us,6.19428e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/28512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/361/768",2760,253.75,253.4,us,4.20139e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/361/1120",2244,311.962,311.522,us,4.98388e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/2",5620,127.309,127.076,us,2.32075e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/384)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/32",5317,132.789,132.527,us,3.56047e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/128",4789,147.044,146.812,us,1.28562e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/256",4145,168.341,168.105,us,2.24554e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/512",3287,213.172,212.876,us,3.54654e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/1280",1993,351.71,351.221,us,5.37393e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/15360)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/2592",1160,582.159,581.171,us,6.57648e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/31104)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/4032",824,837.442,836.176,us,7.11025e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/48384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/448/1280",1799,389.807,389.177,us,5.65813e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/17920)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/480/16",5434,129.53,129.274,us,2.2813e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/240)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/480/256",3898,179.821,179.525,us,2.62838e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3840)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/2",5576,126.315,126.108,us,3.1181e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/16",5535,127.181,126.987,us,2.4772e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/128",4555,153.968,153.724,us,1.63708e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/256",3815,183.316,183.062,us,2.74943e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/1024",1917,366.059,365.325,us,5.51089e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/2048",1109,607.482,606.625,us,6.63759e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/3072",808,855.791,853.788,us,7.07412e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/4608",579,1213.11,1210.71,us,7.48297e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/73728)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/40",4931,142.435,142.195,us,8.46881e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1600)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/120",4115,170.374,170.085,us,2.12404e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/128",4060,172.659,172.354,us,2.23582e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/1152",1235,545.755,544.858,us,6.36527e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/28800)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1001/2408",512,1357.67,1355.65,us,6.82771e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/77824)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/16",5231,135.114,134.902,us,4.66371e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/256",2872,243.929,243.568,us,4.13286e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/512",1907,366.304,365.722,us,5.50491e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/1024",1111,607.285,606.224,us,6.64199e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/3072",440,1593.88,1591.52,us,7.58999e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1369/192",2808,249.751,249.36,us,4.04771e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/8256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1369/256",2398,292.102,291.623,us,4.6148e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/11008)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1369/288",2245,312.237,311.677,us,4.85761e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/12384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/2048/512",1101,610.82,609.936,us,6.60157e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/2048/1024",638,1098.81,1096.95,us,7.34135e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/2250/27",4527,154.837,154.593,us,1.50899e+11,,"Tile size: (32,32)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/128)/grid(1/1/1917)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/3072/512",810,850.64,849.275,us,7.11171e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/3072/1024",444,1578.08,1575.55,us,7.66691e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/3136/64",3246,215.973,215.635,us,3.57411e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/5329/720",340,2066.09,2063.07,us,7.1416e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/122912)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/5625/64",2378,294.794,294.304,us,4.69719e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/11264)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/12544/147",624,1115.8,1114.24,us,6.35489e+11,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/62720)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/22201/288",216,3238.67,3233.88,us,7.59227e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/199872)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/9/2408",5212,134.572,134.353,us,6.19418e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/684)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/16/512",5514,127.137,126.919,us,2.47853e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/18/96",5622,124.813,124.63,us,5.32417e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/24/96",5603,124.746,124.549,us,7.10354e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/24/256",5421,130.452,130.183,us,1.81229e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/24/512",5227,134.246,133.99,us,3.5216e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/27",5559,128.142,127.908,us,2.59386e+09,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/96",5442,128.338,128.099,us,9.20889e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/96)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/288",5381,132.256,132.007,us,2.68088e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/864",5075,139.982,139.756,us,7.59668e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/864)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/40/120",5366,131.056,130.814,us,1.40902e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/48/128",5409,130.912,130.655,us,1.80574e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/48/256",5236,134.457,134.181,us,3.51659e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/512",4966,141.933,141.685,us,6.79943e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/1024",4540,154.746,154.487,us,1.2472e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/2048",3999,174.646,174.314,us,2.21067e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/4608",3062,228.523,228.148,us,3.80034e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/64",5542,127.811,127.586,us,1.23278e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/96",5429,129.249,129.049,us,1.82822e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/128",5450,130.131,129.9,us,2.42165e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/147",5272,133.273,133.024,us,2.71581e+10,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/192",5263,133.11,132.875,us,3.55115e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/256",5346,134.053,133.849,us,4.70043e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/288",5279,132.223,132.044,us,5.36026e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/576)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/512",5082,138.092,137.885,us,9.12564e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/80/64",5530,126.435,126.261,us,1.55715e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/81/1728",3666,190.02,189.71,us,2.83315e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/83/1728",3639,194.386,194.054,us,2.83812e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/96/864",4273,163.644,163.367,us,1.94962e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2592)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/100/1280",3824,183.913,183.627,us,2.67673e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/100/4032",2259,310.051,309.582,us,5.00122e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/120/40",5401,129.631,129.432,us,1.42406e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/128/128",5225,133.858,133.633,us,4.70803e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/128/512",4511,154.231,153.98,us,1.63436e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/128/1152",3671,190.424,190.157,us,2.97771e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/128",5250,133.554,133.365,us,7.07619e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/256",4813,145.472,145.275,us,1.29922e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/720",3755,185.998,185.734,us,2.85808e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4416)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/768",3682,190.219,189.94,us,2.98111e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/1120",3172,220.959,220.636,us,3.74261e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/6720)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/1728",2553,274.44,274.076,us,4.64841e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/10368)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/256",4734,147.891,147.649,us,1.30496e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1792)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/512",4089,171.139,170.876,us,2.25515e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3584)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/1024",3238,216.676,216.32,us,3.56279e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/7168)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/2304",2112,331.673,331.203,us,5.23572e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/256/256",4572,152.896,152.654,us,1.64855e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/256/1024",2896,241.842,241.47,us,4.16877e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/256/2304",1802,388.972,388.368,us,5.8319e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/18432)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/284/512",3649,192.372,192.114,us,2.90643e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/320/1280",2277,307.758,307.313,us,5.11812e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/12800)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/320/1728",1875,373.923,373.406,us,5.68648e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/17280)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/324/2592",1295,512.383,511.705,us,6.30219e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/28512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/361/768",2789,251.4,251.016,us,4.24129e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/361/1120",2279,307.754,307.3,us,5.05236e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/2",5663,123.937,123.753,us,2.38308e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/384)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/32",5425,131.428,131.211,us,3.59619e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/128",4769,146.839,146.63,us,1.28721e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/256",4154,168.365,168.112,us,2.24545e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/512",3296,212.291,211.984,us,3.56147e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/1280",2025,346.044,345.488,us,5.46311e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/15360)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/2592",1162,573.566,572.787,us,6.67274e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/31104)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/4032",835,823.791,822.63,us,7.22733e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/48384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/448/1280",1830,383.149,382.626,us,5.75499e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/17920)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/480/16",5511,128.506,128.294,us,2.29873e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/240)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/480/256",3898,179.81,179.553,us,2.62796e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3840)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/2",5532,127.446,127.247,us,3.09017e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/256]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/16",5410,129.88,129.636,us,2.42659e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/128",4548,154.26,154.032,us,1.63381e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/256",3823,182.812,182.543,us,2.75725e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/1024",1937,361.896,361.345,us,5.57159e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/2048",1126,598.439,597.679,us,6.73695e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/3072",824,835.536,834.359,us,7.23885e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/4608",587,1194.5,1192.78,us,7.59545e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/73728)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/40",5067,138.229,138.035,us,8.72405e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1600)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/120",4185,168.497,168.205,us,2.14777e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/128",4104,170.807,170.526,us,2.25979e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/1152",1253,533.808,533.059,us,6.50615e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/28800)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1001/2408",558,1245.22,1243.47,us,7.44364e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/77824)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/16",5310,132.553,132.337,us,4.75412e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/256",2894,242.62,242.207,us,4.15608e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/512",1936,361.631,360.987,us,5.57711e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/1024",1128,598.48,597.531,us,6.73862e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/3072",452,1550.37,1547.77,us,7.80449e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1369/192",2863,245.033,244.565,us,4.12707e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/8256)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1369/256",2434,286.041,285.565,us,4.7127e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/11008)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1369/288",2300,304.946,304.424,us,4.97334e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/12384)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/2048/512",1123,597.984,597.017,us,6.74441e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/2048/1024",644,1075.84,1074.19,us,7.49686e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/2250/27",4520,155.136,154.904,us,1.50597e+11,,"Tile size: (32,32)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/128)/grid(1/1/1917)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/3072/512",824,835.836,834.627,us,7.23652e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/3072/1024",451,1554.4,1551.82,us,7.78414e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/3136/64",3258,215.043,214.755,us,3.58876e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/5329/720",365,1922.42,1919.33,us,7.67644e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/122912)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/5625/64",2430,288.666,288.285,us,4.79526e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/11264)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/12544/147",688,1009.78,1008.2,us,7.02327e+11,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/62720)/4096]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/22201/288",227,3092.47,3088.15,us,7.95057e+11,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/199872)/4096]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/9/2408",5368,130.919,130.71,us,6.36679e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/16/512",5447,124.594,124.424,us,2.52824e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/18/96",5549,122.199,122.021,us,5.438e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/24/96",5541,122.234,122.054,us,7.24874e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/24/256",5482,124.34,124.133,us,1.90062e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/24/512",5217,127.486,127.291,us,3.70694e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/27",5420,125.487,125.278,us,2.64832e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/96",5385,126.452,126.249,us,9.34383e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/288",5306,129.957,129.745,us,2.72762e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/864",4904,138.538,138.293,us,7.67708e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/40/120",5426,126.957,126.741,us,1.4543e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/48/128",5368,127.734,127.529,us,1.85001e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/48/256",5180,131.328,131.114,us,3.59884e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/512",5036,137.325,137.093,us,7.02718e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/1024",4578,148.964,148.673,us,1.29597e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/2048",4052,169.128,168.802,us,2.28287e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/4608",3134,223.788,223.325,us,3.88242e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/64",5447,124.338,124.125,us,1.26716e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/96",5463,125.24,125.051,us,1.88667e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/128",5351,127.228,127.04,us,2.47617e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/147",5309,128.859,128.667,us,2.80776e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/192",5198,129.947,129.731,us,3.63722e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/256",5186,132.38,132.137,us,4.76131e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/288",5110,133.127,132.91,us,5.32533e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/512",4920,137.559,137.334,us,9.16226e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/80/64",5373,126.726,126.532,us,1.55382e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/81/1728",3694,185.093,184.827,us,2.908e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/83/1728",3679,186.76,186.504,us,2.95301e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/96/864",4301,159.03,158.808,us,2.00559e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/100/1280",3812,180.184,179.921,us,2.73187e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/100/4032",2292,305.698,305.26,us,5.07203e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/120/40",5532,123.203,123.007,us,1.49845e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/128/128",5273,129.071,128.883,us,4.88152e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/128/512",4518,151.52,151.29,us,1.66341e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/128/1152",3625,188.707,188.461,us,3.0045e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/128",5033,133.959,133.745,us,7.0561e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/256",4729,144.316,144.084,us,1.30995e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/720",3717,184.385,184.078,us,2.88379e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/768",3544,193.997,193.682,us,2.92351e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/1120",3153,220.135,219.797,us,3.75689e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/1728",2555,273.974,273.56,us,4.65719e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/256",4691,145.446,145.211,us,1.32687e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/512",4072,168.305,168.06,us,2.29294e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/1024",3230,213.306,212.957,us,3.61905e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/2304",2070,338.805,338.267,us,5.12637e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/256/256",4512,151.61,151.374,us,1.6625e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/256/1024",2902,241.373,241.016,us,4.17663e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/256/2304",1703,411.502,410.91,us,5.51197e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/284/512",3648,188.258,187.974,us,2.97045e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/320/1280",2278,307.448,307.004,us,5.12327e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/320/1728",1874,374.106,373.543,us,5.68439e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/324/2592",1290,504.352,503.645,us,6.40305e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/361/768",2665,262.892,262.463,us,4.05632e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/361/1120",2297,305.271,304.818,us,5.09349e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/2",5592,122.805,122.589,us,2.40571e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/32",5355,131.05,130.819,us,3.60696e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/128",4619,148.742,148.49,us,1.27108e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/256",4052,167.34,167.084,us,2.25927e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/512",3262,211.405,211.098,us,3.57643e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/1280",2024,346.005,345.479,us,5.46325e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/2592",1146,573.95,573.111,us,6.66896e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/4032",805,829.605,828.283,us,7.17801e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/448/1280",1828,383.628,383.016,us,5.74912e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/480/16",5451,126.891,126.687,us,2.32788e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/480/256",3870,177.736,177.466,us,2.65887e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/2",5520,123.363,123.143,us,3.19317e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/16",5452,125.446,125.266,us,2.51123e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/128",4499,151.956,151.727,us,1.65863e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/256",3750,181.862,181.562,us,2.77215e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/1024",1933,361.938,361.414,us,5.57052e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/2048",1100,602.966,601.928,us,6.68939e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/3072",729,922.593,921.362,us,6.55529e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/4608",517,1319.08,1317.08,us,6.87862e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/40",5000,135.484,135.286,us,8.90134e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/120",4164,164.718,164.482,us,2.19639e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/128",4050,168.534,168.279,us,2.28996e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/1152",1208,543.198,542.448,us,6.39355e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1001/2408",556,1220.44,1218.7,us,7.59498e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/16",5160,129.977,129.753,us,4.84879e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/256",2873,241.295,240.92,us,4.17829e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/512",1930,362.814,362.308,us,5.55679e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/1024",1091,600.57,599.691,us,6.71434e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/3072",394,1748.39,1745.42,us,6.92074e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1369/192",2849,242.653,242.293,us,4.16576e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1369/256",2476,284.351,283.928,us,4.73987e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1369/288",2316,302.938,302.481,us,5.00529e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/2048/512",1096,603.013,602.088,us,6.68762e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/2048/1024",626,1081.1,1079.58,us,7.45941e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/2250/27",4536,150.493,150.269,us,1.55241e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/3072/512",796,839.32,838,us,7.2074e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/3072/1024",441,1559.34,1556.53,us,7.76059e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/3136/64",3235,213.316,212.972,us,3.61881e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/5329/720",347,1880.88,1877.89,us,7.84584e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/5625/64",2454,285.134,284.658,us,4.85635e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/12544/147",699,967.52,966.122,us,7.32913e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/22201/288",226,3073.23,3068.47,us,8.00157e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/9/2408",5663,126.354,126.127,us,1.03096e+09,,"Tile size: (8,8)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/32)/grid(1/1/602)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/16/512",5621,126.345,126.137,us,3.89671e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/128)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/18/96",5680,125.832,125.585,us,8.2558e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/36)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/24/96",5564,124.067,123.878,us,1.11594e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/36)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/24/256",5468,126.117,125.874,us,2.92865e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/96)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/24/512",5575,122.884,122.694,us,6.0091e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/27",5475,126.338,126.121,us,4.11033e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/16)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/96",5570,124.93,124.712,us,1.47797e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/48)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/288",5563,122.953,122.773,us,4.50391e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/144)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/864",5472,126.168,125.956,us,1.31703e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/432)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/40/120",5458,125.827,125.627,us,2.2925e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/75)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/48/128",5606,125.337,125.112,us,2.94647e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/96)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/48/256",5566,124.376,124.13,us,5.93958e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/512",5410,126.772,126.566,us,1.18933e+09,,"Tile size: (8,8)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/32)/grid(1/1/448)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/1024",5523,126.601,126.376,us,2.38222e+09,,"Tile size: (8,8)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/32)/grid(1/1/896)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/2048",5601,128.41,128.168,us,4.69783e+09,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/4608",5327,129.468,129.224,us,1.04837e+10,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/64",5460,126.802,126.603,us,1.94119e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/64)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/96",5612,123.089,122.898,us,2.99956e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/96)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/128",5614,124.562,124.382,us,3.9517e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/128)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/147",5659,126.872,126.642,us,4.4573e+08,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/152)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/192",5541,126.211,126.015,us,5.85075e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/256",5505,126.308,126.093,us,7.79617e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/256)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/288",5692,123.321,123.135,us,8.98136e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/288)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/512",5540,127.334,127.083,us,1.54708e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/80/64",5637,126.284,126.037,us,2.43739e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/80)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/81/1728",5553,128.084,127.867,us,6.5678e+09,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/162)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/83/1728",5587,127.689,127.461,us,6.7514e+09,,"Tile size: (32,32)/Vectorize size: (1,2)/Launch_Parameters[block(1/1/128)/grid(1/1/162)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/96/864",5498,124.212,123.99,us,4.01376e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1296)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/100/1280",5622,124.903,124.728,us,6.15742e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/160)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/100/4032",5507,127.374,127.166,us,1.9024e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/504)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/120/40",5703,122.977,122.797,us,2.34534e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/75)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/128/128",5699,123.199,123.005,us,7.99187e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/256)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/128/512",5586,124.166,123.972,us,3.17182e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1024)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/128/1152",5611,124.383,124.203,us,7.12328e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/144)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/128",5690,124.65,124.431,us,1.18504e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/384)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/256",5661,123.508,123.334,us,2.39117e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/768)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/720",5583,125.676,125.488,us,6.60973e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/138)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/768",5637,124.447,124.273,us,7.11928e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/144)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/1120",5596,125.025,124.839,us,1.03352e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/210)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/1728",5558,126.268,126.059,us,1.57915e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/324)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/256",5638,124.197,124.026,us,2.42735e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/800)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/512",5634,124.601,124.419,us,4.83941e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/112)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/1024",5610,126.376,126.187,us,9.54318e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/224)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/2304",5482,127.573,127.389,us,2.12696e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/504)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/256/256",5644,124.509,124.3,us,3.16345e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1024)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/256/1024",5633,124.342,124.15,us,1.2669e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/256/2304",5435,129.163,128.948,us,2.74446e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/576)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/284/512",5560,125.666,125.49,us,6.95232e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/144)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/320/1280",5529,126.799,126.599,us,1.94125e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/400)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/320/1728",5447,128.5,128.324,us,2.58545e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/540)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/324/2592",5292,134.378,134.184,us,3.75517e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/891)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/361/768",5545,126.389,126.182,us,1.31833e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/361/1120",5483,128.055,127.85,us,1.89747e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/420)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/2",5678,123.58,123.397,us,3.7343e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/12)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/32",5688,123.015,122.831,us,6.0024e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/192)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/128",5668,126.018,125.803,us,2.34423e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/768)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/256",5636,125.699,125.493,us,4.70007e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/96)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/512",5478,128.108,127.861,us,9.22604e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/1280",5500,127.514,127.312,us,2.31645e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/480)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/2592",5231,132.37,132.173,us,4.51828e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/972)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/4032",5120,136.91,136.701,us,6.79566e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1512)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/448/1280",5445,128.522,128.353,us,2.68062e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/560)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/480/16",5678,123.262,123.074,us,3.74408e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/120)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/480/256",5641,125.472,125.289,us,5.88464e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/120)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/2",5677,123.427,123.251,us,4.98494e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/16)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/16",5702,122.873,122.7,us,4.00585e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/128)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/128",5639,124.198,124.033,us,3.17024e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1024)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/256",5632,124.636,124.433,us,6.32014e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/1024",5479,127.958,127.78,us,2.46182e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/2048",5261,133.144,132.953,us,4.73209e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/3072",5103,137.109,136.878,us,6.89461e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/4608",4782,146.562,146.364,us,9.67163e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2304)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/40",5682,123.395,123.186,us,1.52745e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/490)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/120",5593,125.818,125.653,us,4.49237e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/100)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/128",5624,125.274,125.084,us,4.81367e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/100)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/1152",5295,132.275,132.033,us,4.10429e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/900)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1001/2408",4726,148.334,148.137,us,9.7629e+10,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2432)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/16",5615,126.25,126.044,us,7.7992e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/256)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/256",5653,124.423,124.235,us,1.26604e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/512",5481,127.722,127.524,us,2.46678e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/1024",5249,133.329,133.125,us,4.72598e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/3072",4539,154.409,154.124,us,1.22462e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1369/192",5490,127.24,127.051,us,1.2413e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/258)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1369/256",5392,129.241,129.047,us,1.62947e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/344)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1369/288",5463,128.629,128.406,us,1.8423e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/387)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/2048/512",5259,133.201,132.991,us,4.73076e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/2048/1024",4881,143.775,143.534,us,8.76652e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/2250/27",5536,127.798,127.597,us,2.85665e+09,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/1128)/128]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/3072/512",5107,138.368,138.11,us,6.83307e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/3072/1024",4527,154.635,154.426,us,1.22223e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/3136/64",5485,128.858,128.655,us,9.36011e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/196)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/5329/720",4256,164.623,164.365,us,1.40062e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3841)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/5625/64",5336,131.317,131.099,us,1.64761e+10,,"Tile size: (32,32)/Vectorize size: (1,4)/Launch_Parameters[block(1/1/128)/grid(1/1/352)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/12544/147",4866,143.961,143.765,us,7.69574e+10,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/2048]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/22201/288",3719,187.747,187.497,us,2.04607e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/6246)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/9/2408",5339,131.479,131.268,us,3.16986e+10,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/684)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/16/512",5546,126.391,126.206,us,1.24627e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/18/96",5588,125.618,125.39,us,2.64595e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/24/96",5581,125.317,125.131,us,3.53524e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/24/256",5538,126.666,126.476,us,9.32709e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/24/512",5441,128.524,128.335,us,1.83838e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/27",5561,126.122,125.937,us,1.31723e+09,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/96",5573,125.689,125.514,us,4.69927e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/96)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/288",5517,127.163,126.953,us,1.3938e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/864",5284,132.585,132.401,us,4.00933e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/864)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/40/120",5472,128.119,127.936,us,7.20359e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/48/128",5493,127.322,127.119,us,9.27986e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/48/256",5411,129.039,128.849,us,1.83106e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/512",5194,134.96,134.759,us,3.57447e+10,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/1024",4883,143.335,143.146,us,6.73007e+10,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/2048",4390,159.687,159.47,us,1.20822e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/4608",3526,197.755,197.485,us,2.19521e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/64",5591,125.255,125.054,us,6.28873e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/96",5579,125.566,125.384,us,9.40826e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/128",5586,126.237,126.058,us,1.24773e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/147",5452,128.439,128.215,us,1.40883e+10,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/192",5462,128.524,128.31,us,1.83875e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/256",5420,129.319,129.121,us,2.43626e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/288",5363,130.648,130.457,us,2.71273e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/576)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/512",5178,135.108,134.902,us,4.66372e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/80/64",5509,126.944,126.779,us,7.75399e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/81/1728",4050,172.65,172.403,us,1.55878e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/83/1728",4042,173.223,172.979,us,1.59195e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/96/864",4663,150.258,150.045,us,1.06136e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2592)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/100/1280",4139,169.564,169.32,us,1.45146e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/100/4032",2685,260.978,260.594,us,2.97069e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/120/40",5505,127.039,126.836,us,7.26607e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/128/128",5441,128.665,128.474,us,2.44854e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/128/512",4836,145.533,145.345,us,8.65727e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/128/1152",4076,171.719,171.483,us,1.65098e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/128",5321,131.841,131.655,us,3.58405e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/256",5058,138.453,138.238,us,6.82677e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/720",4090,171.13,170.867,us,1.55337e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4416)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/768",4061,172.484,172.213,us,1.64398e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/1120",3576,195.961,195.638,us,2.11042e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/6720)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/1728",2960,236.631,236.27,us,2.69611e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/10368)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/256",4925,142.276,142.043,us,6.78233e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1792)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/512",4416,158.784,158.55,us,1.21524e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3584)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/1024",3593,195.381,195.088,us,1.97527e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/7168)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/2304",2450,285.69,285.295,us,3.0391e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/256/256",4838,144.896,144.693,us,8.69631e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/256/1024",3311,211.816,211.517,us,2.37955e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/256/2304",2116,331.581,331.137,us,3.41992e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/18432)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/284/512",3988,176.41,176.163,us,1.5848e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/320/1280",2616,267.85,267.496,us,2.93998e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/12800)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/320/1728",2197,319.02,318.57,us,3.33266e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/17280)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/324/2592",1619,432.69,432.037,us,3.73216e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/28512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/361/768",3144,222.964,222.663,us,2.39068e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/361/1120",2615,267.566,267.204,us,2.90525e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/2",5618,125.249,125.053,us,1.17915e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/384)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/32",5484,128.297,128.088,us,1.84193e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/128",5048,138.8,138.588,us,6.80953e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/256",4473,156.976,156.761,us,1.20402e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/512",3655,191.503,191.248,us,1.97381e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/1280",2317,302.631,302.192,us,3.12291e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/15360)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/2592",1385,479.897,479.153,us,3.98835e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/31104)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/4032",1017,674.381,673.355,us,4.41478e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/48384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/448/1280",2117,331.068,330.608,us,3.33024e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/17920)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/480/16",5556,126.159,125.987,us,1.17041e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/240)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/480/256",4272,164.245,164.009,us,1.43852e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3840)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/2",5611,124.948,124.757,us,1.57593e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/16",5540,126.65,126.457,us,1.24379e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/128",4797,146.185,145.966,us,8.62045e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/256",4186,167.289,167.042,us,1.50655e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/1024",2258,309.941,309.503,us,3.25241e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/2048",1330,501.683,500.9,us,4.0193e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/3072",990,692.887,691.86,us,4.3649e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/4608",710,981.793,980.351,us,4.62064e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/73728)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/40",5158,136.115,135.927,us,4.42967e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1600)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/120",4420,158.832,158.61,us,1.13885e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/128",4441,158.003,157.785,us,1.22113e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/1152",1542,454.516,453.84,us,3.82091e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/28800)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1001/2408",631,1111.41,1109.91,us,4.16968e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/77824)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/16",5418,129.318,129.133,us,2.43605e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/256",3292,212.906,212.597,us,2.36747e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/512",2224,315.102,314.655,us,3.19916e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/1024",1315,511.804,511.024,us,3.93967e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/3072",544,1290.14,1288.26,us,4.68834e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1369/192",3184,220.854,220.504,us,2.2887e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/8256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1369/256",2804,249.42,249.06,us,2.70172e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/11008)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1369/288",2653,263.974,263.563,us,2.87219e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/12384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/2048/512",1297,515.977,515.126,us,3.9083e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/2048/1024",762,907.101,905.894,us,4.44482e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/2250/27",4787,146.109,145.859,us,7.99678e+10,,"Tile size: (32,32)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/128)/grid(1/1/1917)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/3072/512",968,710.998,709.901,us,4.25397e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/3072/1024",539,1301.15,1299.09,us,4.64926e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/3136/64",3638,192.912,192.62,us,2.00058e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/5329/720",414,1693.43,1690.54,us,4.35766e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/122912)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/5625/64",2702,259.309,258.957,us,2.66917e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/11264)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/12544/147",756,918.976,917.473,us,3.85888e+11,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/62720)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/22201/288",267,2621.68,2617.56,us,4.68996e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/199872)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/9/2408",5318,132.155,131.929,us,3.15399e+10,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/684)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/16/512",5555,126.478,126.248,us,1.24585e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/18/96",5552,126.195,126.001,us,2.63312e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/24/96",5553,125.825,125.639,us,3.52094e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1152)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/24/256",5510,127.217,126.981,us,9.28997e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/24/512",5423,130.376,130.133,us,1.81299e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/27",5430,129.254,129.031,us,1.28564e+09,,"Tile size: (8,8)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/96",5476,129.285,129.093,us,4.569e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/96)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/288",5352,131.084,130.849,us,1.3523e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/288)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/864",5138,136.162,135.951,us,3.90466e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/864)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/40/120",5335,131.575,131.319,us,7.01805e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/48/128",5397,131.05,130.826,us,9.01691e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/48/256",5292,132.585,132.365,us,1.78241e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/512",5058,138.192,137.979,us,3.49103e+10,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/1024",4738,147.718,147.464,us,6.53298e+10,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/2048",4292,162.414,162.129,us,1.18841e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/4608",3541,196.994,196.681,us,2.20418e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/64",5448,129.181,128.942,us,6.09909e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/128)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/96",5457,129.377,129.167,us,9.13274e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/128",5292,129.564,129.343,us,1.21604e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/147",5277,132.425,132.18,us,1.36657e+10,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/192",5312,131.908,131.659,us,1.79198e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/256",5276,133.048,132.818,us,2.36846e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/288",5203,134.449,134.178,us,2.63751e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/576)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/512",5090,138.659,138.424,us,4.54507e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/80/64",5356,130.563,130.352,us,7.54145e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/81/1728",4010,172.802,172.507,us,1.55784e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/83/1728",4056,172.531,172.273,us,1.59847e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/96/864",4661,150.683,150.467,us,1.05839e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2592)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/100/1280",4171,168.12,167.874,us,1.46395e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/100/4032",2773,252.601,252.193,us,3.06965e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/120/40",5512,130.877,130.665,us,7.05313e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/128/128",5424,132.216,132.006,us,2.38302e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/128/512",4832,147.612,147.38,us,8.53771e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/128/1152",4095,170.458,170.211,us,1.66333e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/128",5301,136.066,135.785,us,3.47505e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/256",5043,141.623,141.372,us,6.67544e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/720",4181,167.843,167.557,us,1.58406e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4416)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/768",4117,170.234,169.967,us,1.66571e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/1120",3674,190.322,190.013,us,2.17289e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/6720)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/1728",3106,225.652,225.263,us,2.82785e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/10368)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/256",4906,145.441,145.212,us,6.63431e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1792)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/512",4449,158.96,158.701,us,1.21408e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3584)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/1024",3669,190.407,190.124,us,2.02685e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/7168)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/2304",2626,266.613,266.251,us,3.25648e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16128)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/256/256",4714,148.561,148.285,us,8.48564e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/256/1024",3436,203.902,203.569,us,2.47246e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/256/2304",2311,303.283,302.822,us,3.7397e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/18432)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/284/512",3934,173.881,173.607,us,1.60813e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/320/1280",2820,248.757,248.339,us,3.16676e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/12800)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/320/1728",2391,293.266,292.77,us,3.62633e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/17280)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/324/2592",1791,390.377,389.838,us,4.13616e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/28512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/361/768",3314,211.409,211.108,us,2.52154e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/361/1120",2808,249.386,249.015,us,3.11746e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/2",5499,128.693,128.481,us,1.14768e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/384)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/32",5301,132.112,131.883,us,1.78893e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/128",4920,142.506,142.281,us,6.63276e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/256",4441,158.953,158.689,us,1.18939e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/512",3766,186.01,185.683,us,2.03296e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/1280",2555,274.591,274.187,us,3.44188e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/15360)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/2592",1639,427.675,427.018,us,4.47529e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/31104)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/4032",1124,597.045,596.055,us,4.98731e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/48384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/448/1280",2342,299.222,298.752,us,3.68535e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/17920)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/480/16",5511,126.88,126.7,us,1.16382e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/240)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/480/256",4321,162.362,162.133,us,1.45516e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3840)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/2",5574,127.487,127.306,us,1.54437e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/512)/128]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/16",5433,129.521,129.295,us,1.2165e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/128",4813,146.403,146.18,us,8.60783e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/256",4219,165.041,164.804,us,1.52701e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/1024",2466,284.117,283.706,us,3.54815e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/2048",1575,445.168,444.459,us,4.5297e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/3072",1108,604.721,603.85,us,5.00108e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/4608",812,848.387,847.119,us,5.34736e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/73728)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/40",5183,138.863,138.632,us,4.34325e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1600)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/120",4501,158.307,158.005,us,1.14321e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/128",4466,159.429,159.159,us,1.21059e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/3200)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/1152",1748,401.535,400.79,us,4.32666e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/28800)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1001/2408",773,890.049,888.624,us,5.20803e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/77824)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/16",5319,132.935,132.705,us,2.37047e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/512)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/256",3426,204.133,203.817,us,2.46945e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/512",2466,284.043,283.539,us,3.55025e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/1024",1575,446.178,445.458,us,4.51954e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/3072",630,1099.32,1097.6,us,5.50272e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1369/192",3391,206.667,206.323,us,2.44601e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/8256)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1369/256",2989,233.482,233.092,us,2.8868e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/11008)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1369/288",2837,247.147,246.775,us,3.06758e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/12384)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/2048/512",1573,445.927,445.229,us,4.52186e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/2048/1024",895,767.651,766.493,us,5.25319e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/2250/27",4781,148.23,147.947,us,7.88389e+10,,"Tile size: (32,32)/Vectorize size: (2,1)/Launch_Parameters[block(1/1/128)/grid(1/1/1917)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/3072/512",1108,608.619,607.689,us,4.96948e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/49152)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/3072/1024",633,1102.31,1100.65,us,5.48748e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/3136/64",3748,186.998,186.708,us,2.06392e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/5329/720",498,1406.94,1404.69,us,5.24442e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/122912)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/5625/64",2962,236.473,236.095,us,2.92763e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/11264)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/12544/147",932,738.154,737.07,us,4.80337e+11,,"Tile size: (32,32)/Vectorize size: (4,1)/Launch_Parameters[block(1/1/128)/grid(1/1/62720)/2048]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/22201/288",324,2166.93,2163.19,us,5.67506e+11,,"Tile size: (32,32)/Vectorize size: (1,8)/Launch_Parameters[block(1/1/128)/grid(1/1/199872)/2048]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/9/2408",5437,131.035,130.828,us,3.18054e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/16/512",5384,126.874,126.661,us,1.24179e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/18/96",5362,126.58,126.367,us,2.62549e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/24/96",5349,126.813,126.562,us,3.49526e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/24/256",5366,128.115,127.885,us,9.22427e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/24/512",5367,129.208,128.999,us,1.82892e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/27",5497,125.281,125.062,us,1.32645e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/96",5370,126.962,126.759,us,4.65312e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/288",5312,128.818,128.55,us,1.37649e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/864",5150,134.981,134.731,us,3.94e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/40/120",5333,127.012,126.741,us,7.27153e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/48/128",5316,127.659,127.431,us,9.25714e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/48/256",5385,128.835,128.656,us,1.8338e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/512",5093,134.204,134.005,us,3.59458e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/1024",4792,141.27,141.058,us,6.82969e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/2048",4453,154.651,154.413,us,1.24779e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/4608",3637,190.999,190.726,us,2.273e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/64",5494,126.031,125.817,us,6.25058e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/96",5441,126.691,126.476,us,9.32702e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/128",5387,127.203,127.003,us,1.23845e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/147",5375,128.616,128.4,us,1.4068e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/192",5277,129.553,129.348,us,1.82399e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/256",5310,130.4,130.148,us,2.41705e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/288",5207,130.986,130.765,us,2.70633e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/512",5115,136.408,136.191,us,4.6196e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/80/64",5338,127.105,126.904,us,7.74636e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/81/1728",4079,168.197,167.943,us,1.60018e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/83/1728",4107,166.966,166.684,us,1.65207e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/96/864",4604,149.425,149.193,us,1.06743e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/100/1280",4224,162.215,161.987,us,1.51715e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/100/4032",2828,247.506,247.131,us,3.13252e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/120/40",5451,127.037,126.801,us,7.26811e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/128/128",5381,129.518,129.313,us,2.43265e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/128/512",4684,145.341,145.134,us,8.66988e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/128/1152",4044,167.749,167.508,us,1.69017e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/128",5098,133.831,133.561,us,3.53291e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/256",4951,137.345,137.134,us,6.88174e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/720",4133,165.061,164.785,us,1.61071e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/768",4071,167.592,167.317,us,1.69209e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/1120",3631,189.228,188.912,us,2.18555e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/1728",3105,225.284,224.921,us,2.83215e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/256",4944,140.673,140.446,us,6.85941e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/512",4406,156.274,156.041,us,1.23478e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/1024",3698,185.628,185.309,us,2.07951e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/2304",2680,261.353,260.957,us,3.32254e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/256/256",4758,146.756,146.477,us,8.59039e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/256/1024",3383,203.721,203.407,us,2.47443e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/256/2304",2301,305.185,304.636,us,3.71742e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/284/512",4103,168.028,167.743,us,1.66435e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/320/1280",2804,248.727,248.296,us,3.16731e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/320/1728",2382,294.669,294.17,us,3.60908e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/324/2592",1829,383.27,382.598,us,4.21443e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/361/768",3283,210.405,210.047,us,2.53427e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/361/1120",2835,247.434,247.023,us,3.1426e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/2",5530,123.935,123.753,us,1.19153e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/32",5283,128.868,128.66,us,1.83375e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/128",4998,137.62,137.44,us,6.86642e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/256",4407,153.469,153.251,us,1.2316e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/512",3707,184.126,183.853,us,2.05321e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/1280",2542,275.681,275.274,us,3.42829e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/2592",1621,432.069,431.406,us,4.42977e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/4032",1095,605.074,604.208,us,4.92001e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/448/1280",2327,301.381,300.955,us,3.65837e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/480/16",5497,123.815,123.591,us,1.19309e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/480/256",4299,160.009,159.764,us,1.47674e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/2",5527,123.73,123.546,us,1.59137e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/16",5454,126.862,126.632,us,1.24207e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/128",4736,145.963,145.723,us,8.63483e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/256",4147,164.645,164.401,us,1.53076e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/1024",2438,287.195,286.767,us,3.51029e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/2048",1556,450.506,449.863,us,4.47529e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/3072",984,675.214,674.17,us,4.47943e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/4608",713,948.152,946.68,us,4.78498e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/40",5147,134.567,134.344,us,4.48188e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/120",4434,155.541,155.243,us,1.16356e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/128",4471,154.989,154.712,us,1.24538e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/1152",1718,407.74,407.091,us,4.2597e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1001/2408",761,888.127,886.604,us,5.2199e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/16",5403,128.495,128.29,us,2.45205e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/256",3374,203.585,203.291,us,2.47584e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/512",2432,288.057,287.565,us,3.50054e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/1024",1525,460.053,459.31,us,4.38324e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/3072",543,1254.45,1252.54,us,4.82206e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1369/192",3324,206.84,206.498,us,2.44393e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1369/256",2993,232.363,232.008,us,2.9003e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1369/288",2816,248.442,248.057,us,3.05172e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/2048/512",1399,465.706,464.771,us,4.33174e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/2048/1024",820,814.327,812.868,us,4.95349e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/2250/27",4816,145.483,145.226,us,8.0316e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/3072/512",1031,644.757,643.661,us,4.69176e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/3072/1024",585,1161.51,1159.77,us,5.20776e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/3136/64",3651,188.391,188.04,us,2.0493e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/5329/720",474,1444.49,1441.84,us,5.1093e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/5625/64",2864,245.227,244.735,us,2.82428e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/12544/147",869,768.253,766.867,us,4.61673e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/22201/288",292,2385.9,2382.06,us,5.15364e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/2/160",5680,123.183,122.958,us,3.12301e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/5)/256]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/8/160",5580,126.577,126.36,us,1.21557e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/20)/256]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/64/160",5698,126.042,125.828,us,9.76574e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/160)/256]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/512/160",5441,127.632,127.424,us,7.71473e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1280)/256]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/4096/160",5214,136.09,135.848,us,5.78907e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/640)/4096]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/32768/160",3513,199.252,198.913,us,3.16292e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/4096]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/65536/160",2544,275.76,275.28,us,4.57096e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/4096]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/2/320",5487,127.043,126.8,us,6.05676e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/10)/256]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/8/320",5414,126.659,126.444,us,2.42953e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/40)/256]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/64/320",5598,126.714,126.5,us,1.94277e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/320)/256]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/512/320",5594,127.973,127.795,us,1.53846e+10,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/160)/4096]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/4096/320",4798,144.302,144.098,us,1.09152e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/4096]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/32768/320",2525,276.343,275.96,us,4.55969e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/4096]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/65536/320",1643,426.556,425.891,us,5.90898e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/2/160",5636,124.784,124.612,us,9.86103e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/160)/256]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/8/160",5620,125.09,124.92,us,3.93467e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/640)/256]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/64/160",5386,133.337,133.097,us,2.95436e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/512/160",4316,162.524,162.253,us,1.93878e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2560)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/4096/160",1632,429.337,428.527,us,5.87263e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/32768/160",271,2557.43,2552.82,us,7.88643e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/163840)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/65536/160",137,5015.24,5007.67,us,8.04073e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/2/320",5562,124.874,124.696,us,1.97088e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/320)/256]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/8/320",5590,125.305,125.111,us,7.85735e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1280)/256]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/64/320",5269,133.969,133.784,us,5.87836e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/640)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/512/320",3473,201.312,200.99,us,3.13024e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/4096/320",933,736.016,735.073,us,6.84717e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/32768/320",140,4991.51,4983.95,us,8.07899e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/4096]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/65536/320",71,9883.06,9871.6,us,8.15781e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/2/160",5633,128.025,127.803,us,9.61476e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/160)/256]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/8/160",5606,125.292,125.095,us,3.92917e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/640)/256]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/64/160",5384,130.062,129.87,us,3.02777e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/512/160",4319,162.157,161.903,us,1.94297e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/2560)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/4096/160",1661,421.911,421.23,us,5.97437e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/32768/160",277,2528.3,2524.11,us,7.97613e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/163840)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/65536/160",141,4951.59,4944.33,us,8.14374e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/2/320",5564,124.876,124.677,us,1.97117e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/320)/256]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/8/320",5552,127.742,127.523,us,7.70872e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1280)/256]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/64/320",5242,133.889,133.713,us,5.88151e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/640)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/512/320",3518,198.08,197.756,us,3.18142e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/4096/320",940,719.718,718.547,us,7.00464e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/32768/320",141,4956.63,4949.2,us,8.13572e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/4096]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/65536/320",71,9839.33,9827.78,us,8.19419e+11,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/4096]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/2/160",5747,121.691,121.49,us,1.01144e+09,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/8/160",5729,122.599,122.414,us,4.01522e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/64/160",5316,127.927,127.716,us,3.07884e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/512/160",4279,159.061,158.8,us,1.98094e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/4096/160",1658,422.807,422.172,us,5.96103e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/32768/160",276,2535.25,2530.13,us,7.95716e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/65536/160",142,4945.73,4937.77,us,8.15456e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/2/320",5572,122.011,121.83,us,2.01724e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/8/320",5536,125.804,125.607,us,7.82633e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/64/320",5192,131.639,131.443,us,5.98307e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/512/320",3554,196.886,196.577,us,3.20051e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/4096/320",926,722.397,721.314,us,6.97777e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/32768/320",127,4938.41,4931.67,us,8.16464e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/65536/320",72,9737.92,9726.17,us,8.27979e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/2/160",5640,124.427,124.229,us,1.54553e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/5)/128]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/8/160",5666,123.824,123.62,us,6.21257e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/20)/128]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/64/160",5629,127.896,127.656,us,4.81292e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/160)/128]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/512/160",5502,124.795,124.584,us,3.94529e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1280)/128]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/4096/160",5341,133.379,133.136,us,2.95349e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/640)/2048]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/32768/160",4038,172.783,172.471,us,1.82392e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/2048]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/65536/160",3100,226.147,225.753,us,2.78687e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/2048]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/2/320",5530,127.145,126.932,us,3.02524e+07,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/10)/128]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/8/320",5558,124.443,124.209,us,1.23663e+08,,"Tile size: (8,8)/Vectorize size: (1,1)/Launch_Parameters[block(1/1/64)/grid(1/1/40)/128]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/64/320",5535,123.804,123.614,us,9.94065e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/320)/128]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/512/320",5591,125.264,125.053,us,7.86099e+09,,"Tile size: (32,32)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/128)/grid(1/1/160)/2048]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/4096/320",5018,136.704,136.493,us,5.76172e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/2048]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/32768/320",3044,229.909,229.545,us,2.74084e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/2048]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/65536/320",2066,339.455,338.994,us,3.71184e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/2/160",5597,125.178,124.984,us,4.91584e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/160)/128]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/8/160",5573,129.355,129.13,us,1.9032e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/640)/128]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/64/160",5464,131.623,131.376,us,1.49653e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/512/160",4625,151.611,151.365,us,1.03912e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2560)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/4096/160",1910,366.939,366.294,us,3.43519e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/32768/160",333,2079.48,2076.56,us,4.8476e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/163840)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/65536/160",174,4051.92,4045.9,us,4.97606e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/2/320",5478,125.08,124.905,us,9.83786e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/320)/128]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/8/320",5487,127.905,127.684,us,3.84951e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1280)/128]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/64/320",5319,132.243,132.046,us,2.97787e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/640)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/512/320",3933,178.057,177.801,us,1.76924e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/4096/320",1093,609.446,608.392,us,4.13645e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/32768/320",173,4043.31,4037.18,us,4.98682e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/2048]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/65536/320",88,8006.72,7995.8,us,5.03581e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/2/160",5589,125.198,124.969,us,4.91642e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/160)/128]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/8/160",5584,125.628,125.43,us,1.95934e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/640)/128]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/64/160",5471,128.085,127.862,us,1.53766e+10,,"Tile size: (32,32)/Vectorize size: (4,4)/Launch_Parameters[block(1/1/128)/grid(1/1/320)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/512/160",4643,150.71,150.47,us,1.0453e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/2560)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/4096/160",2154,325.626,325.067,us,3.87087e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/32768/160",398,1761.43,1758.75,us,5.72359e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/163840)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/65536/160",205,3415.69,3410.53,us,5.90309e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/2/320",5542,125.455,125.247,us,9.81102e+08,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/320)/128]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/8/320",5601,125.021,124.829,us,3.93756e+09,,"Tile size: (8,8)/Vectorize size: (2,2)/Launch_Parameters[block(1/1/32)/grid(1/1/1280)/128]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/64/320",5327,132.054,131.857,us,2.98215e+10,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/640)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/512/320",4043,173.123,172.852,us,1.8199e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/4096/320",1258,525.086,524.239,us,4.80045e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/32768/320",198,3539.64,3534.08,us,5.69672e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/2048]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/65536/320",100,6995.6,6984.42,us,5.76502e+11,,"Tile size: (32,32)/Vectorize size: (4,8)/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/2048]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/2/160",5704,123.073,122.846,us,5.00139e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/8/160",5710,123.078,122.897,us,1.99972e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/64/160",5383,125.801,125.587,us,1.56551e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/512/160",4567,148.732,148.467,us,1.05941e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/4096/160",2041,343.508,342.905,us,3.6695e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/32768/160",352,1963.49,1960.27,us,5.13517e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/65536/160",185,3780.06,3775.01,us,5.33313e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/2/320",5514,122.501,122.294,us,1.00479e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/8/320",5541,123.004,122.786,us,4.00307e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/64/320",5266,129.345,129.152,us,3.04461e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/512/320",3983,171.216,170.906,us,1.84062e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/4096/320",1167,564.295,563.447,us,4.4664e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/32768/320",184,3814.29,3808.55,us,5.28618e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/65536/320",88,7502.36,7491.74,us,5.37463e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/2/160/manual_time",8350,84.6012,155.852,us,4.53894e+07,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/8/160/manual_time",8432,84.3679,155.954,us,1.8206e+08,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/64/160/manual_time",8327,84.6004,156.38,us,1.45248e+09,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/512/160/manual_time",8377,84.5304,156.172,us,1.16294e+10,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/4096/160/manual_time",7967,88.2059,158.126,us,8.91586e+10,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/32768/160/manual_time",3893,177.498,250.049,us,3.54452e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/262144/160/manual_time",236,2957.6,3029.11,us,1.70177e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/1048576/160/manual_time",54,12212.8,12301.7,us,1.64848e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/2/320/manual_time",8335,84.3535,155.491,us,9.10454e+07,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/8/320/manual_time",8323,84.5423,156.116,us,3.63368e+08,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/64/320/manual_time",8291,84.733,156.452,us,2.9004e+09,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/512/320/manual_time",8238,85.1174,157.172,us,2.30984e+10,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/4096/320/manual_time",7546,92.847,162.711,us,1.69404e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/32768/320/manual_time",1873,374.468,447.093,us,3.36021e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/262144/320/manual_time",95,6757.07,6831.01,us,1.48975e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/1048576/320/manual_time",24,27934.4,28106.8,us,1.44143e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/2/160/manual_time",8348,86.5484,158.402,us,2.21841e+07,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/8/160/manual_time",8296,84.5955,156.286,us,9.0785e+07,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/64/160/manual_time",8245,84.7349,156.368,us,7.25085e+08,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/512/160/manual_time",8249,85.6262,157.477,us,5.7403e+09,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/4096/160/manual_time",7625,92.3327,163.353,us,4.25869e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/32768/160/manual_time",4887,135.786,208.025,us,2.31668e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/262144/160/manual_time",299,2324.92,2396.53,us,1.08244e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/1048576/160/manual_time",65,10009.7,10086.2,us,1.00566e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/2/320/manual_time",8094,88.5059,160.975,us,4.33869e+07,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/8/320/manual_time",8093,88.4266,161.425,us,1.73703e+08,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/64/320/manual_time",8316,87.8826,160.672,us,1.39823e+09,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/512/320/manual_time",8027,88.6484,161.679,us,1.10892e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/4096/320/manual_time",7590,92.2541,163.326,us,8.52463e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/32768/320/manual_time",2692,249.467,321.917,us,2.52195e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/262144/320/manual_time",112,5895.2,5983.95,us,8.53773e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/1048576/320/manual_time",27,24883.3,25026,us,8.09084e+10,,,,