transpose-pointwise.csv
924 lines (924 loc) · 188 KB
2022-08-24T09:58:42-07:00
Running ./build/bin/nvfuser_bench
Run on (64 X 3700 MHz CPU s)
CPU Caches:
L1 Data 32 KiB (x32)
L1 Instruction 32 KiB (x32)
L2 Unified 512 KiB (x32)
L3 Unified 16384 KiB (x8)
Load Average: 1.97, 1.76, 1.16
***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead.
***WARNING*** Library was built as DEBUG. Timings may be affected.
name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/9/2408",5861,119.657,119.449,us,2.1772e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/43)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/16/512",5856,119.772,119.541,us,8.22344e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/18/96",5651,120.005,119.807,us,1.73078e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/14)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/24/96",5642,120.178,120,us,2.30401e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/18)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/24/256",5632,120.127,119.919,us,6.14816e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/24/512",5642,120.32,120.141,us,1.22736e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/27",5631,120.397,120.21,us,8.62488e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/7)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/96",5624,120.352,120.178,us,3.06745e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/288",5639,120.251,120.072,us,9.21049e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/72)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/32/864",5630,120.885,120.717,us,2.74838e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/40/120",5640,120.357,120.18,us,4.79281e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/38)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/48/128",5631,120.394,120.228,us,6.13235e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/48/256",5616,120.566,120.377,us,1.22495e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/512",5625,120.559,120.379,us,2.5009e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/49)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/1024",5599,121.083,120.901,us,4.9802e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/2048",5562,123.747,123.553,us,9.74659e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/49/4608",5464,123.792,123.583,us,2.19246e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/441)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/64",5623,120.126,119.929,us,4.09842e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/96",5637,121.345,121.174,us,6.08447e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/128",5641,123.431,123.232,us,7.97717e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/147",5627,120.8,120.648,us,9.35747e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/74)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/192",5601,120.609,120.445,us,1.22426e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/256",5603,120.903,120.737,us,1.62839e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/288",5618,120.619,120.459,us,1.83617e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/64/512",5568,121.348,121.165,us,3.24528e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/80/64",5599,120.903,120.743,us,5.0885e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/81/1728",5511,122.804,122.614,us,1.36984e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/274)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/83/1728",5564,121.615,121.437,us,1.41727e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/281)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/96/864",5577,120.831,120.656,us,8.2493e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/162)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/100/1280",5533,124.711,124.484,us,1.23389e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/250)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/100/4032",5342,127.888,127.69,us,3.78917e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/788)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/120/40",5529,123.012,122.808,us,4.69025e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/38)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/128/128",5629,120.264,120.085,us,1.63724e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/128/512",5590,121.561,121.396,us,6.47825e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/128)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/128/1152",5552,122.302,122.137,us,1.44876e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/288)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/128",5590,120.732,120.568,us,2.44601e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/256",5591,121.214,121.058,us,4.87225e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/720",5568,122.159,121.982,us,1.35994e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/270)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/768",5564,122.481,122.293,us,1.44691e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/288)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/1120",5501,123.23,123.04,us,2.09728e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/420)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/192/1728",5460,123.896,123.701,us,3.2185e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/648)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/256",5596,121.266,121.094,us,4.97227e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/512",5553,121.809,121.661,us,9.89823e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/1024",5507,123.001,122.836,us,1.96071e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/392)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/196/2304",5385,125.826,125.643,us,4.31301e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/882)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/256/256",5610,121.156,120.977,us,6.50067e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/128)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/256/1024",5495,123.16,122.993,us,2.55764e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/256/2304",5267,128.767,128.568,us,5.50518e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/284/512",5548,121.963,121.799,us,1.4326e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/284)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/320/1280",5390,125.453,125.281,us,3.92334e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/800)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/320/1728",5325,127.58,127.404,us,5.20827e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1080)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/324/2592",5126,132.37,132.197,us,7.62325e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1641)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/361/768",5469,123.715,123.557,us,2.69267e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/542)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/361/1120",5385,126.206,126.028,us,3.84983e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/790)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/2",5568,123.231,123.059,us,7.48906e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/6)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/32",5605,122.673,122.482,us,1.2039e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/128",5585,121.227,121.061,us,4.87211e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/256",5577,121.833,121.65,us,9.69707e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/512",5491,123.505,123.288,us,1.91364e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/1280",5337,126.911,126.738,us,4.6539e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/960)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/2592",5045,134.651,134.423,us,8.88533e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1944)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/384/4032",4692,145.107,144.89,us,1.28231e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3024)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/448/1280",5251,129.485,129.292,us,5.32228e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1120)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/480/16",5614,120.9,120.73,us,7.63354e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/60)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/480/256",5527,122.672,122.517,us,1.20356e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/240)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/2",5596,121.11,120.941,us,1.01604e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/8)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/16",5600,120.724,120.555,us,8.15425e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/128",5551,122.097,121.909,us,6.451e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/128)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/256",5510,122.292,122.128,us,1.28789e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/1024",5307,127.841,127.677,us,4.92763e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/2048",4983,136.786,136.601,us,9.21142e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/3072",4621,147.203,147.021,us,1.28379e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/512/4608",4274,159.784,159.553,us,1.77443e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/40",5654,120.257,120.095,us,3.13351e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/120",5584,121.589,121.431,us,9.29711e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/184)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/128",5558,122.301,122.147,us,9.85881e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/784/1152",5019,135.122,134.956,us,8.0308e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1764)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1001/2408",4206,162.216,161.986,us,1.78564e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4708)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/16",5622,120.483,120.327,us,1.63395e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/256",5430,124.894,124.687,us,2.52289e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/512",5263,129.562,129.365,us,4.86332e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/1024",4931,138.103,137.852,us,9.12781e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1024/3072",3870,174.02,173.79,us,2.17209e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1369/192",5479,124.02,123.858,us,2.54661e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/514)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1369/256",5454,124.448,124.257,us,3.38456e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/685)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/1369/288",5451,126.329,126.155,us,3.75034e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/771)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/2048/512",4964,137.164,136.964,us,9.18701e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/2048/1024",4370,156.005,155.799,us,1.61528e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/2250/27",5810,120.556,120.368,us,6.05645e+09,,"1D/Vectorize, Factor: 2/Launch_Parameters[block(1/1/128)/grid(1/1/238)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/3072/512",4640,147.503,147.296,us,1.28139e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/3072/1024",3841,175.749,175.53,us,2.15055e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/3136/64",5579,122.274,122.112,us,1.97233e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/392)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/5329/720",3540,194.231,193.971,us,2.37368e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7494)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/5625/64",5457,126.86,126.68,us,3.41016e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/704)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/12544/147",4435,154.616,154.394,us,1.43319e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3602)/0]",,
"NF_Transpose_Random_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_2D_01_Axis/22201/288",2595,269.75,269.377,us,2.8483e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12489)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/9/2408",5330,131.308,131.122,us,6.34678e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/16/512",5437,125.602,125.408,us,2.50839e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/18/96",5479,123.65,123.458,us,5.3747e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/24/96",5491,123.342,123.178,us,7.18256e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/24/256",5367,127.952,127.763,us,1.84662e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/24/512",5257,128.672,128.515,us,3.67162e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/27",5420,125.118,124.935,us,2.65558e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/96",5468,123.899,123.7,us,9.53633e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/288",5295,127.638,127.453,us,2.77668e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/32/864",4946,138.198,138.005,us,7.6931e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/40/120",5412,124.564,124.397,us,1.48171e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/48/128",5427,124.764,124.592,us,1.89361e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/48/256",5255,129.01,128.838,us,3.66242e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/512",5006,136.43,136.236,us,7.0714e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/1024",4530,149.967,149.773,us,1.28645e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/2048",3856,173.807,173.558,us,2.2203e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/49/4608",2839,247.094,246.774,us,3.5135e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/64",5541,123.705,123.518,us,1.27339e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/96",5453,124.741,124.572,us,1.89391e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/128",5394,126.837,126.661,us,2.48358e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/147",5277,128.518,128.325,us,2.81526e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/192",5221,129.833,129.666,us,3.63903e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/256",5139,131.504,131.338,us,4.79027e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/288",5095,132.744,132.566,us,5.33916e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/64/512",4841,140.302,140.127,us,8.97968e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/80/64",5440,124.571,124.406,us,1.58037e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/81/1728",3368,203.278,203.02,us,2.64741e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/83/1728",3358,205.322,205.041,us,2.68604e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/96/864",4033,168.728,168.507,us,1.89016e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/100/1280",3350,205.135,204.819,us,2.39977e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/100/4032",804,834.183,833.094,us,1.85848e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/120/40",5508,122.988,122.836,us,1.50053e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/128/128",5251,129.5,129.309,us,4.86543e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/128/512",4368,157.076,156.873,us,1.60422e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/128/1152",3229,213.849,213.575,us,2.6512e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/128",5094,133.939,133.755,us,7.05557e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/256",4585,149.138,148.949,us,1.26717e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/720",3280,212.282,211.993,us,2.50405e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/768",3170,220.144,219.873,us,2.57527e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/1120",2575,271.731,271.346,us,3.04317e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/192/1728",1131,584.023,583.303,us,2.18415e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/256",4570,149.768,149.575,us,1.28816e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/512",3757,183.187,182.914,us,2.10673e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/1024",2677,261.717,261.351,us,2.94892e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/196/2304",766,876.676,875.433,us,1.98083e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/256/256",4241,161.277,161.074,us,1.56238e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/256/1024",1272,515.103,514.388,us,1.95695e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/256/2304",575,1182.45,1180.85,us,1.91804e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/284/512",3192,218.174,217.88,us,2.56272e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/320/1280",945,706.239,705.281,us,2.23012e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/320/1728",676,1000.1,998.872,us,2.12576e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/324/2592",407,1688.84,1686.5,us,1.91217e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/361/768",1969,356.049,355.565,us,2.99419e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/361/1120",1099,602.234,601.498,us,2.5812e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/2",5566,121.36,121.193,us,2.43341e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/32",5332,127.967,127.785,us,3.6926e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/128",4577,150.802,150.602,us,1.25326e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/256",3628,187.98,187.73,us,2.0108e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/512",2283,306.522,306.108,us,2.46637e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/1280",796,840.825,839.677,us,2.24781e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/2592",341,2025.38,2022.34,us,1.88992e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/384/4032",198,3543.72,3539.01,us,1.67997e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/448/1280",681,991.817,990.479,us,2.22318e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/480/16",5374,125.002,124.844,us,2.36224e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/480/256",3366,207.985,207.681,us,2.27204e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/2",5517,121.873,121.692,us,3.23123e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/16",5366,125.506,125.31,us,2.51036e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/128",4207,162.686,162.446,us,1.54918e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/256",3244,216.179,215.896,us,2.33129e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/1024",637,1064.09,1062.5,us,1.89484e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/2048",307,2281.75,2278.43,us,1.76724e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/3072",215,3259.4,3254.52,us,1.85582e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/512/4608",118,5456.67,5450.17,us,1.66228e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/40",4791,142.157,141.937,us,8.48423e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/120",3609,190.793,190.537,us,1.89604e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/128",3495,196.652,196.396,us,1.96212e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/784/1152",465,1469.54,1467.54,us,2.36326e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1001/2408",134,4879.74,4873.51,us,1.89924e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/16",5161,132.282,132.074,us,4.76359e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/256",1867,375.574,375.028,us,2.68415e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/512",854,783.979,782.804,us,2.57186e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/1024",330,2116.01,2112.99,us,1.90561e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1024/3072",101,6566.68,6558.1,us,1.84194e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1369/192",1385,470.586,469.819,us,2.14835e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1369/256",1156,570.499,569.636,us,2.36253e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/1369/288",995,669.694,668.659,us,2.26424e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/2048/512",262,2667.54,2663.9,us,1.51152e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/2048/1024",119,5346.1,5338.77,us,1.50841e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/2250/27",3566,192.772,192.511,us,1.21177e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/3072/512",118,5610.6,5603.56,us,1.07785e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/3072/1024",62,11276.4,11264.7,us,1.07234e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/3136/64",939,710.048,709.177,us,1.08676e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/5329/720",41,17107.1,17091.2,us,8.62057e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/5625/64",452,1524.79,1522.33,us,9.08084e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/12544/147",83,8322.23,8312.5,us,8.5183e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_02_Axis/22201/288",24,29068.2,29043.7,us,8.45365e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/9/2408",5311,130.843,130.674,us,6.36855e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/16/512",5432,124.753,124.601,us,2.52464e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/18/96",5549,122.253,122.093,us,5.43479e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/24/96",5536,122.079,121.909,us,7.25736e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/24/256",5383,125.102,124.939,us,1.88836e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/24/512",5278,128.152,127.984,us,3.68686e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/27",5501,122.462,122.312,us,2.71253e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/96",5508,122.782,122.616,us,9.62067e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/288",5358,126.346,126.175,us,2.8048e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/32/864",5042,134.974,134.799,us,7.87603e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/40/120",5486,123.727,123.57,us,1.49163e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/48/128",5445,124.321,124.168,us,1.90009e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/48/256",5303,127.664,127.499,us,3.7009e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/512",5067,134.034,133.856,us,7.19715e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/1024",4671,146.21,146.002,us,1.31968e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/2048",4037,169.067,168.802,us,2.28287e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/49/4608",3133,223.562,223.248,us,3.88376e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/64",5550,122.688,122.535,us,1.2836e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/96",5474,124.025,123.879,us,1.90451e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/128",5459,124.987,124.832,us,2.51997e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/147",5362,126.304,126.147,us,2.86385e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/192",5321,127.84,127.668,us,3.69599e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/256",5239,129.632,129.461,us,4.85972e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/288",5185,130.975,130.807,us,5.41093e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/64/512",4964,136.471,136.309,us,9.23115e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/80/64",5486,123.714,123.566,us,1.59112e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/81/1728",3688,185.578,185.325,us,2.90019e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/83/1728",3682,186.912,186.667,us,2.95043e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/96/864",4295,159.224,159.001,us,2.00317e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/100/1280",3822,180.303,180.052,us,2.72988e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/100/4032",2300,304.556,304.158,us,5.09041e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/120/40",5520,123.214,123.049,us,1.49794e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/128/128",5257,129.351,129.186,us,4.87007e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/128/512",4513,152.227,152.024,us,1.65539e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/128/1152",3615,188.85,188.619,us,3.00198e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/128",5151,132.509,132.325,us,7.13183e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/256",4740,144.402,144.2,us,1.3089e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/720",3716,184.343,184.076,us,2.88382e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/768",3636,188.874,188.605,us,3.00221e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/1120",3155,219.244,218.951,us,3.7714e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/192/1728",2554,275.849,275.396,us,4.62614e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/256",4673,146.117,145.901,us,1.32059e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/512",4062,168.95,168.663,us,2.28474e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/1024",3231,213.109,212.814,us,3.62149e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/196/2304",2145,326.63,326.113,us,5.31743e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/256/256",4495,152.503,152.298,us,1.65241e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/256/1024",2898,241.595,241.26,us,4.1724e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/256/2304",1796,389.922,389.441,us,5.81583e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/284/512",3652,188.254,187.965,us,2.97058e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/320/1280",2240,312.658,312.283,us,5.03666e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/320/1728",1862,376.407,375.922,us,5.64842e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/324/2592",1261,517.537,516.892,us,6.23895e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/361/768",2787,250.871,250.534,us,4.24945e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/361/1120",2252,310.884,310.506,us,5.00018e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/2",5598,121.575,121.42,us,2.42885e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/32",5349,127.672,127.504,us,3.70075e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/128",4724,145.27,145.085,us,1.30092e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/256",4069,166.402,166.191,us,2.2714e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/512",3270,210.548,210.278,us,3.59036e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/1280",1962,357.782,357.288,us,5.28268e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/2592",1102,598.962,598.146,us,6.38984e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/384/4032",741,902.32,901.148,us,6.59761e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/448/1280",1712,409.79,409.225,us,5.38092e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/480/16",5408,125.93,125.765,us,2.34495e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/480/256",3834,179.475,179.217,us,2.6329e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/2",5504,123.328,123.142,us,3.19319e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/16",5402,126.027,125.848,us,2.49963e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/128",4458,155.377,155.171,us,1.62181e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/256",3586,183.762,183.497,us,2.74291e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/1024",1866,377.334,376.826,us,5.3427e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/2048",1011,660.651,659.757,us,6.10305e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/3072",696,965.165,963.899,us,6.26601e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/512/4608",490,1384.27,1382.12,us,6.55492e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/40",4971,138.789,138.596,us,8.68872e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/120",4113,166.796,166.553,us,2.16908e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/128",4054,168.906,168.678,us,2.28454e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/784/1152",1136,583.979,583.234,us,5.94644e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1001/2408",474,1440.09,1438.28,us,6.43546e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/16",5208,131.831,131.627,us,4.77976e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/256",2860,245.269,244.88,us,4.11073e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/512",1847,379.525,378.976,us,5.31238e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/1024",1014,658.449,657.469,us,6.12429e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1024/3072",365,1898.61,1892.81,us,6.38183e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1369/192",2807,249.229,248.869,us,4.05569e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1369/256",2409,291.002,290.609,us,4.6309e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/1369/288",2234,313.554,313.131,us,4.83505e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/2048/512",1003,666.337,664.817,us,6.0566e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/2048/1024",542,1260.97,1256.55,us,6.40885e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/2250/27",4472,151.938,151.749,us,1.53728e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/3072/512",704,981.165,979.75,us,6.16463e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/3072/1024",360,1926.11,1923.7,us,6.27936e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/3136/64",3221,213.903,213.57,us,3.60867e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/5329/720",278,2525.81,2522.26,us,5.84144e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/5625/64",2418,289.04,288.638,us,4.78939e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/12544/147",559,1215.18,1213.59,us,5.83462e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp32_Inner_3D_12_Axis/22201/288",125,5054.88,5048.51,us,4.86332e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/9/2408",5325,132.707,132.486,us,6.28145e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/16/512",5419,126.823,126.628,us,2.48424e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/18/96",5352,126.822,126.598,us,5.2414e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/24/96",5339,126.891,126.676,us,6.98423e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/24/256",5374,128.456,128.237,us,1.8398e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/24/512",5263,131.274,131.013,us,3.60163e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/27",5505,124.968,124.712,us,2.66035e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/96",5467,124.106,123.912,us,9.52008e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/288",5322,127.111,126.94,us,2.78789e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/32/864",4984,135.887,135.708,us,7.82328e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/40/120",5461,124.424,124.178,us,1.48432e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/48/128",5418,125.287,125.106,us,1.88584e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/48/256",5258,128.662,128.478,us,3.67269e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/512",5040,134.657,134.464,us,7.16458e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/1024",4656,146.008,145.819,us,1.32134e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/2048",4041,169.283,168.674,us,2.2846e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/49/4608",3092,224.361,223.799,us,3.87419e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/64",5504,126.645,126.468,us,1.24368e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/96",5451,126.154,125.979,us,1.87276e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/128",5302,128.764,128.548,us,2.44713e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/147",5259,130.089,129.896,us,2.78121e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/192",5259,131.367,131.158,us,3.59764e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/256",5121,133.448,133.233,us,4.72215e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/288",5061,134.28,134.082,us,5.27876e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/64/512",4855,140.53,140.292,us,8.96908e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/80/64",5315,128.054,127.829,us,1.53805e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/81/1728",3668,186.099,185.803,us,2.89273e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/83/1728",3671,187.758,187.455,us,2.93803e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/96/864",4258,161.495,161.255,us,1.97517e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/100/1280",3801,180.671,180.38,us,2.72491e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/100/4032",2282,307.069,306.618,us,5.04957e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/120/40",5436,126.8,126.601,us,1.45591e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/128/128",5161,133.17,132.963,us,4.73175e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/128/512",4478,154.876,154.653,us,1.62725e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/128/1152",3595,189.756,189.49,us,2.98818e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/128",5032,136.215,136.008,us,6.93871e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/256",4616,147.797,147.575,us,1.27896e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/720",3705,185.091,184.838,us,2.87192e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/768",3542,194.511,194.216,us,2.91548e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/1120",3140,220.539,220.212,us,3.74981e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/192/1728",2549,274.656,274.282,us,4.64493e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/256",4672,148.043,147.807,us,1.30356e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/512",4063,168.207,167.952,us,2.29442e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/1024",3238,212.839,212.529,us,3.62635e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/196/2304",2073,338.105,337.627,us,5.13609e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/256/256",4464,154.306,154.086,us,1.63324e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/256/1024",2900,241.846,241.433,us,4.16941e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/256/2304",1701,412.168,411.602,us,5.5027e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/284/512",3659,188.254,187.991,us,2.97018e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/320/1280",2276,308.216,307.8,us,5.11001e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/320/1728",1875,374.045,373.489,us,5.68521e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/324/2592",1299,504.59,503.859,us,6.40033e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/361/768",2661,263.147,262.764,us,4.05167e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/361/1120",2291,306.057,305.62,us,5.08013e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/2",5446,125.166,124.969,us,2.35988e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/32",5214,130.988,130.748,us,3.60893e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/128",4651,148.174,147.958,us,1.27565e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/256",4077,166.67,166.429,us,2.26816e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/512",3266,212.051,211.746,us,3.56547e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/1280",2024,346.849,346.363,us,5.4493e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/2592",1147,574.672,573.818,us,6.66075e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/384/4032",810,830.666,829.472,us,7.16773e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/448/1280",1827,383.549,382.969,us,5.74984e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/480/16",5428,125.182,124.991,us,2.35946e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/480/256",3849,178.143,177.86,us,2.65298e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/2",5573,122.171,122.021,us,3.22254e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/16",5458,124.888,124.711,us,2.52241e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/128",4510,154.256,153.989,us,1.63426e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/256",3771,181.637,181.356,us,2.77529e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/1024",1938,361.676,361.096,us,5.57543e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/2048",1101,601.875,600.972,us,6.70004e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/3072",730,923.893,922.419,us,6.54778e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/512/4608",516,1322.56,1320.8,us,6.85927e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/40",5020,135.833,135.635,us,8.87843e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/120",4158,164.618,164.399,us,2.1975e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/128",4078,168.285,168.039,us,2.29322e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/784/1152",1206,543.782,543.003,us,6.38701e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1001/2408",557,1222.69,1220.98,us,7.5808e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/16",5242,129.819,129.657,us,4.85239e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/256",2898,242.377,242.079,us,4.15829e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/512",1932,362.612,362.141,us,5.55935e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/1024",1104,601.124,600.394,us,6.70648e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1024/3072",393,1751.31,1748.98,us,6.90667e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1369/192",2874,243.379,243.071,us,4.15244e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1369/256",2475,282.736,282.395,us,4.76561e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/1369/288",2316,302.741,302.345,us,5.00754e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/2048/512",1094,603.859,603,us,6.67749e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/2048/1024",628,1083.42,1077.95,us,7.47073e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/2250/27",4615,148.934,148.739,us,1.56838e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/3072/512",798,842.233,841.221,us,7.1798e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/3072/1024",441,1556.33,1554.34,us,7.77152e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/3136/64",3272,209.95,209.689,us,3.67547e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/5329/720",368,1875.41,1873.04,us,7.86615e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/5625/64",2469,283.789,283.417,us,4.87761e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/12544/147",699,965.491,964.155,us,7.34409e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp32_Outer_3D_01_Axis/22201/288",228,3074.11,3070.12,us,7.99726e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/9/2408",5889,118.159,118.008,us,1.10189e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/43)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/16/512",5738,122.086,121.881,us,4.03278e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/18/96",5699,122.074,121.905,us,8.50496e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/14)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/24/96",5712,121.093,120.922,us,1.14321e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/18)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/24/256",5704,120.489,120.324,us,3.06372e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/24/512",5663,122.267,122.094,us,6.03861e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/27",5699,121.421,121.254,us,4.27532e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/7)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/96",5660,120.31,120.151,us,1.53407e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/288",5699,122.1,121.935,us,4.53487e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/72)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/32/864",5703,120.808,120.626,us,1.37522e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/40/120",5741,119.923,119.766,us,2.40469e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/38)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/48/128",5684,121.403,121.239,us,3.04061e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/48/256",5567,121.337,121.187,us,6.08382e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/512",5572,121.512,121.34,us,1.24055e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/49)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/1024",5690,122.338,122.161,us,2.46443e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/2048",5668,121.989,121.803,us,4.94334e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/49/4608",5455,125.108,124.85,us,1.0851e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/441)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/64",5468,123.498,123.292,us,1.99332e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/96",5589,122.832,122.654,us,3.00552e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/128",5522,122.971,122.8,us,4.00259e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/147",5518,122.865,122.69,us,4.60085e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/74)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/192",5524,122.59,122.409,us,6.02307e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/256",5611,122.63,122.451,us,8.028e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/288",5659,122.482,122.293,us,9.04323e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/64/512",5508,122.77,122.604,us,1.6036e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/80/64",5508,122.578,122.395,us,2.50992e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/81/1728",5603,120.504,120.33,us,6.97923e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/274)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/83/1728",5600,120.499,120.338,us,7.15104e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/281)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/96/864",5632,123.243,123.04,us,4.04474e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/162)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/100/1280",5496,123.701,123.534,us,6.21693e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/250)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/100/4032",5385,125.985,125.805,us,1.92298e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/788)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/120/40",5680,119.395,119.237,us,2.41535e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/38)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/128/128",5675,120.741,120.576,us,8.15288e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/128/512",5537,123.591,123.422,us,3.18594e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/128)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/128/1152",5608,120.223,120.062,us,7.36902e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/288)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/128",5690,118.974,118.821,us,1.24099e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/256",5642,119.567,119.406,us,2.46983e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/720",5621,120.498,120.338,us,6.89258e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/270)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/768",5620,120.252,120.094,us,7.36702e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/288)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/1120",5573,121.066,120.913,us,1.06708e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/420)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/192/1728",5551,122.075,121.91,us,1.63289e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/648)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/256",5625,120.396,120.25,us,2.50358e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/512",5634,120.25,120.093,us,5.01371e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/1024",5590,121.286,121.128,us,9.94175e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/392)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/196/2304",5483,123.255,123.1,us,2.20107e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/882)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/256/256",5610,120.256,120.083,us,3.27454e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/128)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/256/1024",5625,120.806,120.649,us,1.30367e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/256/2304",5385,125.947,125.781,us,2.81359e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/284/512",5587,120.853,120.69,us,7.22883e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/284)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/320/1280",5476,123.652,123.499,us,1.98998e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/800)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/320/1728",5399,125.622,125.444,us,2.64482e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1080)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/324/2592",5201,130.915,130.752,us,3.85375e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1641)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/361/768",5537,122.302,122.14,us,1.36195e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/542)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/361/1120",5467,123.815,123.64,us,1.96209e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/790)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/2",5698,119.47,119.292,us,3.8628e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/6)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/32",5673,119.467,119.297,us,6.18019e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/128",5590,119.783,119.625,us,2.46531e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/256",5636,120.433,120.28,us,4.90376e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/512",5574,121.188,121.021,us,9.74749e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/1280",5416,125.177,124.954,us,2.36017e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/960)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/2592",5110,132.711,132.527,us,4.50623e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1944)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/384/4032",4878,139.404,139.224,us,6.67251e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3024)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/448/1280",5284,128.188,128.023,us,2.68751e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1120)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/480/16",5664,119.276,119.13,us,3.86805e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/60)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/480/256",5593,120.951,120.746,us,6.10604e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/240)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/2",5682,119.363,119.193,us,5.15465e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/8)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/16",5662,119.334,119.14,us,4.12558e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/128",5574,121.097,120.932,us,3.25156e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/128)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/256",5570,121.37,121.228,us,6.48722e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/1024",5327,127.239,127.032,us,2.47632e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/2048",4980,136.423,136.241,us,4.6179e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/3072",4719,144.311,144.05,us,6.55133e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/512/4608",4283,158.916,158.695,us,8.92009e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4608)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/40",5653,119.457,119.292,us,1.5773e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/120",5584,120.791,120.636,us,4.67919e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/184)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/128",5586,123.194,122.976,us,4.89616e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/784/1152",4842,137.613,137.373,us,3.94474e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1764)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1001/2408",4172,158.876,158.638,us,9.11665e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4708)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/16",5701,119.061,118.901,us,8.26773e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/256",5512,122.719,122.537,us,1.28359e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/512",5305,127.791,127.627,us,2.46478e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/1024",4921,137.693,137.465,us,4.57676e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1024/3072",3985,168.999,168.746,us,1.11851e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1369/192",5708,122.497,122.293,us,1.2896e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/514)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1369/256",5677,123.392,123.229,us,1.7064e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/685)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/1369/288",5577,127.449,127.286,us,1.85852e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/771)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/2048/512",5097,137.512,137.321,us,4.58156e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/2048/1024",4486,155.136,154.945,us,8.12088e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/2250/27",5838,119.896,119.718,us,3.04465e+09,,"1D/Vectorize, Factor: 2/Launch_Parameters[block(1/1/128)/grid(1/1/238)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/3072/512",4855,144.251,144.065,us,6.55063e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/3072/1024",4068,165.315,165.041,us,1.14362e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/3136/64",5776,121.315,121.144,us,9.94042e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/392)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/5329/720",4027,174.607,174.317,us,1.32066e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7494)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/5625/64",5694,123.51,123.334,us,1.75134e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/704)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/12544/147",4858,143.824,143.615,us,7.70379e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3602)/0]",,
"NF_Transpose_Random_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_2D_01_Axis/22201/288",3162,221.293,220.98,us,1.73605e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12489)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/9/2408",5491,127.711,127.478,us,3.26411e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/16/512",5729,123.004,122.825,us,1.28057e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/18/96",5755,121.614,121.466,us,2.73144e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/24/96",5762,121.646,121.458,us,3.64216e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/24/256",5680,123.367,123.205,us,9.57465e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/24/512",5557,126.332,126.122,us,1.87065e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/27",5803,120.765,120.603,us,1.37549e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/96",5732,122.145,121.984,us,4.83524e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/288",5582,125.873,125.713,us,1.40754e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/32/864",5143,136.607,136.443,us,3.89057e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/40/120",5683,123.336,123.18,us,7.48171e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/48/128",5662,123.941,123.756,us,9.53206e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/48/256",5495,127.562,127.391,us,1.85201e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/512",5164,135.737,135.562,us,3.55327e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/1024",4697,148.82,148.641,us,6.48125e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/2048",4016,169.234,168.998,us,1.14011e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/49/4608",3106,225.817,225.532,us,1.92221e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/64",5734,126.091,125.868,us,6.24805e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/96",5482,133.384,133.118,us,8.86166e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/128",5612,127.411,127.158,us,1.23693e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/147",5492,130.616,130.374,us,1.3855e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/192",5299,132.392,132.157,us,1.78522e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/256",5162,130.019,129.826,us,2.42304e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/288",5314,131.82,131.652,us,2.68811e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/64/512",5026,139.451,139.247,us,4.5182e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/80/64",5685,123.478,123.3,us,7.97274e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/81/1728",3580,186.108,185.799,us,1.44639e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/83/1728",3737,187.632,187.357,us,1.46978e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/96/864",4398,159.265,159.058,us,1.00122e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/100/1280",3765,186.155,185.876,us,1.32217e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/100/4032",2256,311.332,310.879,us,2.49018e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/120/40",5721,122.272,122.097,us,7.54807e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/128/128",5352,131.483,131.305,us,2.39575e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/128/512",4515,156.876,156.678,us,8.03107e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/128/1152",3517,189.362,189.115,us,1.49705e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/128",5265,135.039,134.825,us,3.4998e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/256",4804,142.996,142.787,us,6.60929e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/720",3601,185.937,185.695,us,1.42933e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/768",3665,191.237,190.978,us,1.48245e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/1120",3130,223.498,223.22,us,1.84964e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/192/1728",2478,282.81,282.445,us,2.25534e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/256",4831,144.892,144.691,us,6.6582e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/512",4188,167.707,167.488,us,1.15039e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/1024",3234,214.894,214.608,us,1.79561e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/196/2304",2011,348.919,348.367,us,2.48887e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/256/256",4597,152.386,152.159,us,8.26959e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/256/1024",2842,246.79,246.369,us,2.04294e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/256/2304",922,760.059,758.992,us,1.49206e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/284/512",3683,190.519,190.198,us,1.46786e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/320/1280",1954,362.621,362.078,us,2.172e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/320/1728",1193,587.92,586.984,us,1.80871e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/324/2592",572,1224.96,1223.08,us,1.31834e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/361/768",2603,269.128,268.714,us,1.98098e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/361/1120",2074,344.638,344.128,us,2.25583e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/2",5459,123.614,123.439,us,1.19456e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/32",5073,132.158,131.931,us,1.78828e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/128",4701,153.273,153.026,us,6.16706e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/256",3944,172.992,172.739,us,1.09265e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/512",3137,223.346,222.996,us,1.6928e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/1280",1664,422.509,421.917,us,2.23674e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/2592",469,1494.25,1492.35,us,1.28055e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/384/4032",232,3024.61,3020,us,9.84342e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/448/1280",1354,518.265,517.442,us,2.12778e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/480/16",5548,126.572,126.371,us,1.16685e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/480/256",3785,185.387,185.094,us,1.27465e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/2",5651,124.07,123.9,us,1.58683e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/16",5535,126.609,126.437,us,1.24399e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/128",4458,161.446,161.223,us,7.80469e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/256",3506,192.346,192.019,us,1.31059e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/1024",1025,683.6,682.472,us,1.47498e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/2048",352,1990.06,1986.99,us,1.01322e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/3072",267,2622.02,2617.78,us,1.15361e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/512/4608",152,4612.34,4605.51,us,9.83571e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/40",4993,141.539,141.323,us,4.26055e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/120",4089,171.573,171.34,us,1.05424e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/128",3975,176.03,175.797,us,1.09601e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/784/1152",768,916.537,915.269,us,1.89462e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1001/2408",185,3782.15,3777.1,us,1.22527e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/16",5343,131.038,130.83,us,2.40444e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/256",2511,279.075,278.686,us,1.80603e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/512",1491,470.77,470.049,us,2.14155e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/1024",494,1417.74,1415.64,us,1.42216e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1024/3072",128,5458.6,5452.65,us,1.10768e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1369/192",1910,367.08,366.516,us,1.37693e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1369/256",1529,458.62,457.932,us,1.46941e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/1369/288",1357,517.02,516.233,us,1.4664e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/2048/512",301,2327.05,2323.83,us,8.66356e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/2048/1024",149,4690.04,4684.35,us,8.59571e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/2250/27",4486,156.493,156.241,us,7.46541e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/3072/512",134,5222.01,5215.77,us,5.78994e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/3072/1024",66,10682,10672,us,5.6595e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/3136/64",1303,537.768,536.995,us,7.17607e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/5329/720",42,16481.3,16464.1,us,4.47447e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/5625/64",535,1310.76,1308.94,us,5.28062e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/12544/147",88,7939.25,7928.79,us,4.46527e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_02_Axis/22201/288",25,28346.9,28321,us,4.33468e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/9/2408",5404,129.786,129.616,us,3.21028e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/16/512",5632,124.601,124.428,us,1.26407e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/18/96",5641,124.467,124.289,us,2.66939e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/24/96",5619,124.393,124.215,us,3.56131e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/24/256",5579,125.054,124.885,us,9.44588e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/24/512",5514,126.594,126.422,us,1.8662e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/27",5687,122.933,122.758,us,1.35134e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/96",5651,124.098,123.931,us,4.75928e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/288",5562,125.47,125.302,us,1.41217e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/32/864",5320,131.821,131.62,us,4.03315e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/40/120",5621,124.214,124.05,us,7.42926e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/48/128",5612,124.692,124.523,us,9.47331e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/48/256",5533,126.506,126.322,us,1.86768e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/512",5347,131.556,131.389,us,3.66614e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/1024",4982,139.859,139.694,us,6.89634e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/2048",4562,159.409,159.083,us,1.21117e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/49/4608",3699,189.357,189.046,us,2.2932e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/64",5784,121.265,121.059,us,6.49629e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/96",5766,121.674,121.516,us,9.70777e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/128",5778,121.798,121.644,us,1.29301e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/147",5718,122.579,122.432,us,1.47538e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/192",5666,123.685,123.521,us,1.91004e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/256",5643,123.873,123.668,us,2.54369e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/288",5594,127.151,126.948,us,2.78771e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/64/512",5234,134.24,134.043,us,4.6936e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/80/64",5594,126.094,125.924,us,7.80663e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/81/1728",4199,166.5,166.261,us,1.61637e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/83/1728",4189,167.285,167.057,us,1.64838e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/96/864",4686,149.784,149.559,us,1.06481e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/100/1280",4310,162.8,162.605,us,1.5114e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/100/4032",2852,245.873,245.548,us,3.15272e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/120/40",5665,124.07,123.877,us,7.43966e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/128/128",5497,127.765,127.59,us,2.46551e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/128/512",4866,144.147,143.934,us,8.74212e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/128/1152",4084,171.621,171.353,us,1.65223e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/128",5319,132.653,132.434,us,3.56297e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/256",4161,141.233,140.979,us,6.69403e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/720",4107,167.872,167.61,us,1.58356e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/768",4131,171.406,171.074,us,1.65493e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/1120",3683,190.377,190.01,us,2.17292e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/192/1728",3111,225.14,224.79,us,2.83379e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/256",4993,140.528,140.302,us,6.86647e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/512",4529,154.539,154.329,us,1.24848e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/1024",3761,186.037,185.736,us,2.07473e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/196/2304",2691,260.577,260.192,us,3.33231e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/256/256",4806,145.587,145.375,us,8.65546e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/256/1024",3324,210.971,210.657,us,2.38927e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/256/2304",2094,336.063,335.581,us,3.37463e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/284/512",3987,175.146,174.911,us,1.59615e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/320/1280",2439,285.466,285.029,us,2.75913e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/320/1728",2091,337.384,336.89,us,3.15143e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/324/2592",1523,460.043,459.375,us,3.51005e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/361/768",3003,233.056,232.721,us,2.28736e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/361/1120",2477,283.771,283.337,us,2.73983e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/2",5645,124.482,124.282,us,1.18646e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/32",5492,127.824,127.615,us,1.84876e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/128",4993,142.481,142.296,us,6.63209e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/256",4299,163.253,163.007,us,1.15789e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/512",3348,199.967,199.641,us,1.89083e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/1280",2103,333.781,333.314,us,2.83132e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/2592",1290,548.612,547.765,us,3.48878e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/384/4032",883,797.016,795.896,us,3.73505e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/448/1280",1813,385.566,384.945,us,2.86016e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/480/16",5531,126.68,126.493,us,1.16572e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/480/256",4076,172.173,171.899,us,1.37249e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/2",5629,124.474,124.288,us,1.58187e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/16",5557,126.206,126.017,us,1.24814e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/128",4509,155.343,155.076,us,8.11401e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/256",3774,178.865,178.582,us,1.4092e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/1024",1909,369.531,368.96,us,2.7283e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/2048",1124,629.147,628.23,us,3.20466e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/3072",767,904.01,902.547,us,3.34598e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/512/4608",526,1329.8,1327.64,us,3.41196e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/40",5137,139.366,139.134,us,4.32756e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/120",4247,163.552,163.322,us,1.106e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/128",4212,167.056,166.806,us,1.15509e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/784/1152",1271,554.384,553.517,us,3.13284e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1001/2408",516,1330,1328.07,us,3.48473e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/16",5330,131.143,130.967,us,2.40192e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/256",2980,235.386,235.069,us,2.14115e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/512",1902,368.751,368.231,us,2.7337e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/1024",1113,633.4,632.609,us,3.18248e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1024/3072",406,1721.67,1719.41,us,3.51272e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1369/192",2949,237.708,237.367,us,2.12611e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1369/256",2574,272.643,272.23,us,2.47177e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/1369/288",2335,300.171,299.723,us,2.52568e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/2048/512",1123,628.513,627.618,us,3.20779e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/2048/1024",593,1174.76,1172.97,us,3.43277e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/2250/27",4691,149.385,149.157,us,7.81994e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/3072/512",800,884.492,883.347,us,3.4187e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/3072/1024",407,1693.85,1691.64,us,3.57039e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/3136/64",3424,204.132,203.82,us,1.89065e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/5329/720",344,2039.89,2037.01,us,3.61648e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/5625/64",2608,267.218,266.765,us,2.59104e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/12544/147",716,983.96,982.678,us,3.60283e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_Random_fp16_Inner_3D_12_Axis/22201/288",196,3571.22,3566.26,us,3.44234e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/9/2408",5369,130.925,130.723,us,3.18309e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1355)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/16/512",5585,125.914,125.733,us,1.25096e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/18/96",5604,125.083,124.899,us,2.65635e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/108)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/24/96",5601,125.202,125.037,us,3.53789e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/144)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/24/256",5555,126.212,126.045,us,9.35893e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/24/512",5466,128.123,127.957,us,1.84382e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/27",5626,124.973,124.779,us,1.32945e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/54)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/96",5587,125.128,124.954,us,4.72031e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/192)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/288",5521,126.807,126.642,us,1.39723e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/576)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/32/864",5242,133.769,133.588,us,3.97373e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1728)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/40/120",5581,125.342,125.171,us,7.36274e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/48/128",5570,126.059,125.847,us,9.37365e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/48/256",5479,128.02,127.854,us,1.84531e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/512",5297,132.48,132.291,us,3.64112e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1568)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/1024",4976,140.552,140.368,us,6.86326e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/2048",4523,154.845,154.63,us,1.24605e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/49/4608",3629,192.673,192.405,us,2.25317e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/14112)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/64",5638,124.347,124.174,us,6.33332e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/256)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/96",5603,125.087,124.915,us,9.44358e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/128",5585,125.205,125.037,us,1.25792e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/147",5525,126.72,126.532,us,1.42757e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/588)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/192",5478,127.727,127.541,us,1.84984e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/256",5455,128.441,128.282,us,2.4522e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/288",5411,129.638,129.444,us,2.73397e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1152)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/64/512",5190,135.053,134.882,us,4.66443e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2048)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/80/64",5598,124.93,124.759,us,7.87952e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/81/1728",4204,166.656,166.453,us,1.6145e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8748)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/83/1728",4171,168.527,168.308,us,1.63613e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8964)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/96/864",4661,150.164,149.959,us,1.06197e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5184)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/100/1280",4284,163.832,163.614,us,1.50207e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8000)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/100/4032",2825,248.021,247.661,us,3.12582e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25200)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/120/40",5621,124.805,124.632,us,7.39459e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/300)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/128/128",5482,127.758,127.579,us,2.46572e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/128/512",4844,145.354,145.157,us,8.66848e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/128/1152",4137,169.198,168.975,us,1.67549e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/128",5310,131.962,131.781,us,3.58064e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/256",5068,138.039,137.846,us,6.84617e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/720",4213,166.447,166.21,us,1.5969e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8640)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/768",4132,169.381,169.17,us,1.67356e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9216)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/1120",3670,190.742,190.466,us,2.16772e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/13440)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/192/1728",3093,226.545,226.249,us,2.81552e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20736)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/256",5018,139.786,139.572,us,6.9024e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3136)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/512",4538,154.733,154.483,us,1.24723e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/1024",3737,187.264,186.976,us,2.06097e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/196/2304",2664,262.811,262.467,us,3.30343e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/28224)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/256/256",4860,144.324,144.1,us,8.73204e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/256/1024",3412,205.655,205.376,us,2.45071e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/256/2304",2285,306.841,306.393,us,3.6961e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/36864)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/284/512",4149,168.987,168.758,us,1.65434e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/9088)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/320/1280",2798,250.374,250.017,us,3.14551e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25600)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/320/1728",2369,295.932,295.486,us,3.59301e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/34560)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/324/2592",1821,385.133,384.552,us,4.19302e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/52488)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/361/768",3321,211.128,210.843,us,2.5247e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/17328)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/361/1120",2814,248.955,248.596,us,3.12271e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/25270)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/2",5648,124.268,124.102,us,1.18818e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/48)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/32",5469,127.86,127.687,us,1.84772e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/128",5056,138.505,138.32,us,6.82273e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3072)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/256",4519,155.048,154.802,us,1.21926e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6144)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/512",3769,185.391,185.148,us,2.03885e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12288)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/1280",2523,277.732,277.347,us,3.40266e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/30720)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/2592",1618,433.096,432.523,us,4.41833e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/62208)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/384/4032",1154,606.835,606.004,us,4.90543e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/96768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/448/1280",2313,303.074,302.68,us,3.63752e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/35840)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/480/16",5566,126.022,125.845,us,1.17173e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/480)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/480/256",4333,161.236,161.026,us,1.46517e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/7680)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/2",5653,124.981,124.81,us,1.57525e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/64)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/16",5573,125.933,125.735,us,1.25093e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/512)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/128",4825,144.942,144.744,us,8.69319e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/4096)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/256",4270,164.178,163.924,us,1.53521e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/8192)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/1024",2426,288.874,288.466,us,3.48961e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/2048",1555,450.614,450.008,us,4.47385e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/3072",1035,677.059,676.171,us,4.46618e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/512/4608",738,950.015,948.704,us,4.77478e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/147456)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/40",5232,134.068,133.855,us,4.49823e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1960)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/120",4577,153.134,152.933,us,1.18113e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5880)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/128",4537,157.057,156.802,us,1.22878e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/6272)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/784/1152",1709,410.184,409.615,us,4.23345e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/56448)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1001/2408",789,888.374,887.025,us,5.21742e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/150651)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/16",5457,130.965,130.777,us,2.40542e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1024)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/256",3411,205.313,205.031,us,2.45483e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16384)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/512",2394,304.682,304.165,us,3.3095e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/32768)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/1024",1480,467.214,466.476,us,4.3159e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1024/3072",550,1303.97,1301.93,us,4.63911e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1369/192",3229,216.48,216.078,us,2.33558e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/16428)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1369/256",2895,239.582,239.121,us,2.81402e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/21904)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/1369/288",2799,256.732,256.16,us,2.95519e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/24642)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/2048/512",1479,482.304,481.349,us,4.18255e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/65536)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/2048/1024",804,841.907,840.296,us,4.7918e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/131072)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/2250/27",4677,146.441,146.178,us,7.9793e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/3797)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/3072/512",1054,650.631,649.646,us,4.64853e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/98304)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/3072/1024",591,1213.74,1211.53,us,4.98525e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/196608)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/3136/64",3614,198.01,197.686,us,1.94931e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/12544)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/5329/720",470,1490.12,1487.78,us,4.95153e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/239805)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/5625/64",2774,252.636,252.199,us,2.74069e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/22500)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/12544/147",888,788.318,787.16,us,4.49771e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/115248)/0]",,
"NF_Transpose_Random_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_Random_fp16_Outer_3D_01_Axis/22201/288",285,2464.49,2460.7,us,4.98894e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/399618)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/2/160",5541,126.312,126.091,us,3.04541e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/3)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/8/160",5537,125.937,125.743,us,1.22154e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/10)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/64/160",5498,127.441,127.223,us,9.65861e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/512/160",5509,127.902,127.668,us,7.69997e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/4096/160",5039,139.221,139.007,us,5.65751e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/32768/160",2611,269.585,269.113,us,2.33785e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/65536/160",805,867.862,866.563,us,1.45205e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/2/320",5601,125.557,125.321,us,6.12828e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/5)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/8/320",5577,125.598,125.396,us,2.44985e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/20)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/64/320",5583,126.303,126.118,us,1.94865e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/512/320",5444,129.164,128.966,us,1.5245e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/4096/320",4589,152.981,152.754,us,1.02967e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2560)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/32768/320",1362,515.123,514.109,us,2.44752e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/0]",,
"NF_Transpose_fp32_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp32_Inner_2D_01_Axis/65536/320",356,1972.28,1969.44,us,1.27782e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/2/160",5390,130.336,130.104,us,9.44472e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/8/160",5522,128.17,127.954,us,3.84139e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/64/160",5205,135.156,134.958,us,2.91362e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/512/160",3717,185.582,185.322,us,1.69744e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/4096/160",234,2992.15,2987.77,us,8.42295e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/32768/160",27,26188.9,26162.3,us,7.69529e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/65536/160",13,52192,52137.2,us,7.72295e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/2/320",5521,127.305,127.115,us,1.93337e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/8/320",5472,128.427,128.229,us,7.66628e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/64/320",5009,140.828,140.599,us,5.59346e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/512/320",2691,259.873,259.46,us,2.42482e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/4096/320",119,5843,5835.39,us,8.62524e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/32768/320",13,52543,52474.9,us,7.67325e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_02_Axis/65536/320",7,103953,103838,us,7.75539e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/2/160",5520,127.126,126.9,us,9.68324e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/8/160",5505,129.449,129.22,us,3.80374e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/64/160",5235,135.294,135.023,us,2.91221e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/512/160",4052,170.517,170.25,us,1.84771e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/4096/160",1413,487.856,487.073,us,5.16675e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/32768/160",131,5369.33,5361.59,us,3.75498e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/65536/160",29,24346.6,24316.8,us,1.65586e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/2/320",5476,128.883,128.668,us,1.91003e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/8/320",5444,128.89,128.675,us,7.63969e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/64/320",5093,139.408,139.179,us,5.6505e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/512/320",3176,211.384,211.086,us,2.98051e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/4096/320",772,892.631,891.282,us,5.6471e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/32768/320",55,12611,12596.5,us,3.19655e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp32_Inner_3D_12_Axis/65536/320",12,56715.5,56649.6,us,1.42156e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/2/160",5479,127.888,127.691,us,9.62321e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/8/160",5490,128.263,128.054,us,3.83837e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/64/160",5048,137.188,136.956,us,2.8711e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/512/160",4220,166.355,166.116,us,1.89369e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/4096/160",1607,435.162,434.554,us,5.79119e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/32768/160",263,2661.85,2658,us,7.57437e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/65536/160",136,5198.13,5191.02,us,7.75672e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/2/320",5357,130.974,130.718,us,1.88007e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/8/320",5338,130.992,130.757,us,7.51807e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/64/320",5008,141.261,140.989,us,5.57798e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/512/320",3407,204.075,203.783,us,3.08733e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/4096/320",944,741.222,740.147,us,6.80022e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/32768/320",136,5218.37,5210.75,us,7.72736e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp32_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp32_Outer_3D_01_Axis/65536/320",68,10235.9,10224.3,us,7.87641e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/2/160",5485,129.866,129.637,us,1.48106e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/3)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/8/160",5390,130.2,129.993,us,5.90802e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/10)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/64/160",5492,131.039,130.8,us,4.69725e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/512/160",5406,129.781,129.587,us,3.79297e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/4096/160",5034,139.927,139.724,us,2.81424e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/32768/160",3148,215.325,215.001,us,1.46312e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/65536/160",1113,630.245,629.167,us,9.99965e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/2/320",5535,126.622,126.43,us,3.03725e+07,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/5)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/8/320",5497,126.894,126.709,us,1.21222e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/20)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/64/320",5524,127.787,127.582,us,9.63144e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/512/320",5400,130.319,130.109,us,7.55552e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/320)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/4096/320",4579,154.486,154.232,us,5.09901e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/2560)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/32768/320",2154,325.164,324.622,us,1.93809e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/20480)/0]",,
"NF_Transpose_fp16_Inner_2D_01_Axis___GRAPH/NF_Transpose_fp16_Inner_2D_01_Axis/65536/320",480,1458.43,1456.16,us,8.64115e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/2/160",5448,129.721,129.491,us,4.74473e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/8/160",5597,129.507,129.287,us,1.90088e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/64/160",5199,137.416,137.099,us,1.43406e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/512/160",3887,177.949,177.479,us,8.86223e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/4096/160",275,2565.16,2558.47,us,4.91815e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/32768/160",29,25207.3,25159.8,us,4.00096e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/65536/160",15,47779,47671.3,us,4.22322e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/2/320",5399,130.674,129.194,us,9.51129e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/8/320",5275,130.185,129.73,us,3.78879e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/64/320",4952,142.723,142.339,us,2.76253e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/512/320",3169,217.283,216.692,us,1.4517e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/4096/320",125,5618.71,5607.61,us,4.4878e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/32768/320",14,49414.6,49338.5,us,4.08052e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Inner_3D_02_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_02_Axis/65536/320",7,97642.8,97507.7,us,4.12945e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/2/160",5509,130.278,130,us,4.72614e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/8/160",5645,129.542,129.257,us,1.90132e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/64/160",5270,128.982,128.736,us,1.52722e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/512/160",4222,166.734,166.408,us,9.45184e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/4096/160",1604,424.235,423.511,us,2.9711e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/32768/160",221,3102.41,3096.59,us,3.25077e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/65536/160",41,15957.8,15937.2,us,1.26325e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/2/320",5495,126.635,126.419,us,9.72005e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/8/320",5319,130.109,129.834,us,3.78575e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/64/320",5189,133.618,133.422,us,2.94716e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/512/320",3348,196.617,196.347,us,1.60212e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/4096/320",892,794.633,793.272,us,3.17241e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/32768/320",97,7211.28,7203.04,us,2.79502e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Inner_3D_12_Axis___GRAPH/NF_Transpose_fp16_Inner_3D_12_Axis/65536/320",17,41730.5,41642,us,9.6694e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/2/160",5490,127.435,127.147,us,4.8322e+08,,"1D//Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/8/160",5619,128.166,127.931,us,1.92103e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/80)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/64/160",5358,130.909,130.688,us,1.5044e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/640)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/512/160",4645,150.971,150.779,us,1.04316e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/5120)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/4096/160",2028,347.281,346.7,us,3.62934e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40960)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/32768/160",355,1952.77,1950.07,us,5.16203e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/327680)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/65536/160",185,3821.88,3817.09,us,5.27435e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/2/320",5625,124.665,124.486,us,9.87101e+08,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/40)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/8/320",5596,124.992,124.822,us,3.93778e+09,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/160)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/64/320",5234,134.519,134.318,us,2.92749e+10,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1280)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/512/320",4046,173.229,172.972,us,1.81864e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/10240)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/4096/320",1235,568.145,567.195,us,4.43689e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/81920)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/32768/320",183,3833.19,3827.9,us,5.25946e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/655360)/0]",,
"NF_Transpose_fp16_Outer_3D_01_Axis___GRAPH/NF_Transpose_fp16_Outer_3D_01_Axis/65536/320",93,7496.25,7486.47,us,5.37841e+11,,"1D/Vectorize, Factor: 4/Launch_Parameters[block(1/1/128)/grid(1/1/1310720)/0]",,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/2/160/manual_time",8233,85.2651,157.893,us,4.5036e+07,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/8/160/manual_time",7945,88.0381,162.125,us,1.7447e+08,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/64/160/manual_time",7932,86.7007,161.057,us,1.41729e+09,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/512/160/manual_time",8180,85.6251,157.758,us,1.14808e+10,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/4096/160/manual_time",7796,89.5574,159.79,us,8.78132e+10,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/32768/160/manual_time",3898,177.467,250.464,us,3.54515e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/262144/160/manual_time",237,2955.34,3027.62,us,1.70308e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/1048576/160/manual_time",57,12530.6,12645.2,us,1.60668e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/2/320/manual_time",8238,85.7135,158.52,us,8.96009e+07,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/8/320/manual_time",7936,88.0606,162.107,us,3.48851e+08,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/64/320/manual_time",7888,88.0239,161.66,us,2.79197e+09,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/512/320/manual_time",8133,86.1128,158.718,us,2.28314e+10,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/4096/320/manual_time",7443,94.0744,164.499,us,1.67194e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/32768/320/manual_time",1860,375.529,448.715,us,3.35072e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/262144/320/manual_time",103,6775.58,6851.11,us,1.48568e+11,,,,
"Baseline_Transpose_fp32_Inner_2D_01_Axis/1048576/320/manual_time",25,27990.7,28176.1,us,1.43852e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/2/160/manual_time",8218,85.0094,157.471,us,2.25857e+07,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/8/160/manual_time",8241,84.9334,157.895,us,9.04238e+07,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/64/160/manual_time",8229,85.2006,158.52,us,7.21122e+08,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/512/160/manual_time",8141,85.9564,158.127,us,5.71825e+09,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/4096/160/manual_time",7759,90.1074,160.334,us,4.36386e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/32768/160/manual_time",4923,135.781,208.284,us,2.31676e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/262144/160/manual_time",302,2321.91,2394.47,us,1.08384e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/1048576/160/manual_time",70,10006.5,10087.6,us,1.00598e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/2/320/manual_time",8214,85.0387,157.662,us,4.51559e+07,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/8/320/manual_time",8232,84.9636,157.931,us,1.80783e+08,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/64/320/manual_time",8241,84.9441,157.863,us,1.4466e+09,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/512/320/manual_time",8133,87.5795,160.537,us,1.12245e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/4096/320/manual_time",7494,92.6018,164.427,us,8.49262e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/32768/320/manual_time",2779,243.295,316.418,us,2.58594e+11,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/262144/320/manual_time",121,5790.41,5863.9,us,8.69224e+10,,,,
"Baseline_Transpose_fp16_Inner_2D_01_Axis/1048576/320/manual_time",28,24789.2,24922,us,8.12156e+10,,,,