$ ./vectoradd_hip.exe
:3:rocdevice.cpp :442 : 2002282411931 us: [pid:3259166 tid:0x7f6102a5fa80] Initializing HSA stack.
:3:comgrctx.cpp :33 : 2002282417426 us: [pid:3259166 tid:0x7f6102a5fa80] Loading COMGR library.
:3:rocdevice.cpp :208 : 2002282417457 us: [pid:3259166 tid:0x7f6102a5fa80] Numa selects cpu agent[0]=0x1f9adc0(fine=0x1e92fe0,coarse=0x1f9b270) for gpu agent=0x1f97fa0 CPU<->GPU XGMI=0
:3:rocdevice.cpp :1680: 2002282417609 us: [pid:3259166 tid:0x7f6102a5fa80] Gfx Major/Minor/Stepping: 10/3/0
:3:rocdevice.cpp :1682: 2002282417611 us: [pid:3259166 tid:0x7f6102a5fa80] HMM support: 1, XNACK: 0, Direct host access: 0
:3:rocdevice.cpp :1684: 2002282417612 us: [pid:3259166 tid:0x7f6102a5fa80] Max SDMA Read Mask: 0x0, Max SDMA Write Mask: 0x0
:4:rocdevice.cpp :2063: 2002282417633 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa host memory 0x7f6102bda000, size 0x38
:4:rocdevice.cpp :2063: 2002282417809 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa host memory 0x7f5ff6600000, size 0x101000
:4:rocdevice.cpp :2063: 2002282417965 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa host memory 0x7f5ff6400000, size 0x101000
:4:runtime.cpp :83 : 2002282417973 us: [pid:3259166 tid:0x7f6102a5fa80] init
:3:hip_context.cpp :48 : 2002282417975 us: [pid:3259166 tid:0x7f6102a5fa80] Direct Dispatch: 1
:3:hip_device.cpp :381 : 2002282417991 us: [pid:3259166 tid:0x7f6102a5fa80] hipGetDeviceProperties ( 0x7ffc30b48ff0, 0 )
:3:hip_device.cpp :383 : 2002282417993 us: [pid:3259166 tid:0x7f6102a5fa80] hipGetDeviceProperties: Returned hipSuccess :
System minor 3
System major 10
agent prop name AMD Radeon PRO W6800
hip Device prop succeeded
:3:hip_memory.cpp :566 : 2002282419744 us: [pid:3259166 tid:0x7f6102a5fa80] hipMalloc ( 0x7ffc30b48f68, 4194304 )
:4:rocdevice.cpp :2191: 2002282419767 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa device memory 0x7f5ff5200000, size 0x400000
:3:rocdevice.cpp :2230: 2002282419769 us: [pid:3259166 tid:0x7f6102a5fa80] device=0x1faaee0, freeMem_ = 0x77ec00000
:3:hip_memory.cpp :568 : 2002282419771 us: [pid:3259166 tid:0x7f6102a5fa80] hipMalloc: Returned hipSuccess : 0x7f5ff5200000: duration: 27 us
:3:hip_memory.cpp :566 : 2002282419773 us: [pid:3259166 tid:0x7f6102a5fa80] hipMalloc ( 0x7ffc30b48f60, 4194304 )
:4:rocdevice.cpp :2191: 2002282419782 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa device memory 0x7f5ff4c00000, size 0x400000
:3:rocdevice.cpp :2230: 2002282419783 us: [pid:3259166 tid:0x7f6102a5fa80] device=0x1faaee0, freeMem_ = 0x77e800000
:3:hip_memory.cpp :568 : 2002282419784 us: [pid:3259166 tid:0x7f6102a5fa80] hipMalloc: Returned hipSuccess : 0x7f5ff4c00000: duration: 11 us
:3:hip_memory.cpp :566 : 2002282419786 us: [pid:3259166 tid:0x7f6102a5fa80] hipMalloc ( 0x7ffc30b48f58, 4194304 )
:4:rocdevice.cpp :2191: 2002282419797 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa device memory 0x7f5ff4600000, size 0x400000
:3:rocdevice.cpp :2230: 2002282419798 us: [pid:3259166 tid:0x7f6102a5fa80] device=0x1faaee0, freeMem_ = 0x77e400000
:3:hip_memory.cpp :568 : 2002282419799 us: [pid:3259166 tid:0x7f6102a5fa80] hipMalloc: Returned hipSuccess : 0x7f5ff4600000: duration: 13 us
:3:hip_memory.cpp :641 : 2002282419807 us: [pid:3259166 tid:0x7f6102a5fa80] hipMemcpy ( 0x7f5ff4c00000, 0x7f5ff5bfe010, 4194304, hipMemcpyHostToDevice )
:3:rocdevice.cpp :2732: 2002282419813 us: [pid:3259166 tid:0x7f6102a5fa80] number of allocated hardware queues with low priority: 0, with normal priority: 0, with high priority: 0, maximum per priority is: 4
:3:rocdevice.cpp :2810: 2002282422110 us: [pid:3259166 tid:0x7f6102a5fa80] created hardware queue 0x7f6100f08000 with size 16384 with priority 1, cooperative: 0
:3:rocdevice.cpp :2902: 2002282422116 us: [pid:3259166 tid:0x7f6102a5fa80] acquireQueue refCount: 0x7f6100f08000 (1)
:4:rocdevice.cpp :2063: 2002282422319 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa host memory 0x7f5ff4200000, size 0x100000
:3:devprogram.cpp :2684: 2002282532008 us: [pid:3259166 tid:0x7f6102a5fa80] Using Code Object V5.
:4:command.cpp :349 : 2002282532586 us: [pid:3259166 tid:0x7f6102a5fa80] Command (CopyHostToDevice) enqueued: 0x1fcb760
:4:rocmemory.cpp :966 : 2002282532660 us: [pid:3259166 tid:0x7f6102a5fa80] Locking to pool 0x1f9b270, size 0x401000, HostPtr = 0x7f5ff5bfe000, DevPtr = 0x7f5ff5bfe000
:4:rocblit.cpp :727 : 2002282532665 us: [pid:3259166 tid:0x7f6102a5fa80] HSA Async Copy dst=0x7f5ff4c00000, src=0x7f5ff5bfe010, size=4194304, wait_event=0x0, completion_signal=0x7f6102be2800
:4:rocvirtual.cpp :553 : 2002282532951 us: [pid:3259166 tid:0x7f6102a5fa80] Host wait on completion_signal=0x7f6102be2800
:3:rocvirtual.hpp :66 : 2002282532953 us: [pid:3259166 tid:0x7f6102a5fa80] Host active wait for Signal = (0x7f6102be2800) for -1 ns
:4:command.cpp :289 : 2002282533125 us: [pid:3259166 tid:0x7f6102a5fa80] Queue marker to command queue: 0x1fb8cd0
:4:command.cpp :349 : 2002282533126 us: [pid:3259166 tid:0x7f6102a5fa80] Command (InternalMarker) enqueued: 0x25ab380
:4:command.cpp :179 : 2002282533128 us: [pid:3259166 tid:0x7f6102a5fa80] Command 0x1fcb760 complete
:4:command.cpp :173 : 2002282533129 us: [pid:3259166 tid:0x7f6102a5fa80] Command 0x25ab380 complete (Wall: 2002282533129, CPU: 0, GPU: 0 us)
:4:command.cpp :253 : 2002282533130 us: [pid:3259166 tid:0x7f6102a5fa80] Waiting for event 0x1fcb760 to complete, current status 0
:4:command.cpp :268 : 2002282533131 us: [pid:3259166 tid:0x7f6102a5fa80] Event 0x1fcb760 wait completed
:3:hip_memory.cpp :642 : 2002282533134 us: [pid:3259166 tid:0x7f6102a5fa80] hipMemcpy: Returned hipSuccess : : duration: 113327 us
:3:hip_memory.cpp :641 : 2002282533138 us: [pid:3259166 tid:0x7f6102a5fa80] hipMemcpy ( 0x7f5ff4600000, 0x7f5ff57fd010, 4194304, hipMemcpyHostToDevice )
:4:command.cpp :349 : 2002282533141 us: [pid:3259166 tid:0x7f6102a5fa80] Command (CopyHostToDevice) enqueued: 0x1fcb760
:4:rocmemory.cpp :966 : 2002282533200 us: [pid:3259166 tid:0x7f6102a5fa80] Locking to pool 0x1f9b270, size 0x401000, HostPtr = 0x7f5ff57fd000, DevPtr = 0x7f5ff57fd000
:4:rocblit.cpp :727 : 2002282533202 us: [pid:3259166 tid:0x7f6102a5fa80] HSA Async Copy dst=0x7f5ff4600000, src=0x7f5ff57fd010, size=4194304, wait_event=0x0, completion_signal=0x7f6102be2780
:4:rocvirtual.cpp :553 : 2002282533204 us: [pid:3259166 tid:0x7f6102a5fa80] Host wait on completion_signal=0x7f6102be2780
:3:rocvirtual.hpp :66 : 2002282533206 us: [pid:3259166 tid:0x7f6102a5fa80] Host active wait for Signal = (0x7f6102be2780) for -1 ns
:4:command.cpp :289 : 2002282533367 us: [pid:3259166 tid:0x7f6102a5fa80] Queue marker to command queue: 0x1fb8cd0
:4:command.cpp :349 : 2002282533368 us: [pid:3259166 tid:0x7f6102a5fa80] Command (InternalMarker) enqueued: 0x25ab380
:4:command.cpp :179 : 2002282533369 us: [pid:3259166 tid:0x7f6102a5fa80] Command 0x1fcb760 complete
:4:command.cpp :173 : 2002282533370 us: [pid:3259166 tid:0x7f6102a5fa80] Command 0x25ab380 complete (Wall: 2002282533369, CPU: 0, GPU: 0 us)
:4:command.cpp :253 : 2002282533371 us: [pid:3259166 tid:0x7f6102a5fa80] Waiting for event 0x1fcb760 to complete, current status 0
:4:command.cpp :268 : 2002282533372 us: [pid:3259166 tid:0x7f6102a5fa80] Event 0x1fcb760 wait completed
:3:hip_memory.cpp :642 : 2002282533373 us: [pid:3259166 tid:0x7f6102a5fa80] hipMemcpy: Returned hipSuccess : : duration: 235 us
:3:hip_platform.cpp :193 : 2002282533377 us: [pid:3259166 tid:0x7f6102a5fa80] __hipPushCallConfiguration ( {64,64,1}, {16,16,1}, 0, stream: )
:3:hip_platform.cpp :197 : 2002282533378 us: [pid:3259166 tid:0x7f6102a5fa80] __hipPushCallConfiguration: Returned hipSuccess :
:3:hip_platform.cpp :202 : 2002282533382 us: [pid:3259166 tid:0x7f6102a5fa80] __hipPopCallConfiguration ( {2172488,0,46276672}, {10588600,32609,2166096}, 0x7ffc30b48f80, 0x7ffc30b48f78 )
:3:hip_platform.cpp :211 : 2002282533383 us: [pid:3259166 tid:0x7f6102a5fa80] __hipPopCallConfiguration: Returned hipSuccess :
:3:hip_module.cpp :678 : 2002282533390 us: [pid:3259166 tid:0x7f6102a5fa80] hipLaunchKernel ( 0x200d80, {64,64,1}, {16,16,1}, 0x7ffc30b48fc0, 0, stream: )
:3:devprogram.cpp :2684: 2002282534298 us: [pid:3259166 tid:0x7f6102a5fa80] Using Code Object V5.
:4:command.cpp :349 : 2002282534485 us: [pid:3259166 tid:0x7f6102a5fa80] Command (KernelExecution) enqueued: 0x2711b60
:3:rocvirtual.cpp :706 : 2002282534489 us: [pid:3259166 tid:0x7f6102a5fa80] Arg0: = ptr:0x7f5ff5200000 obj:[0x7f5ff5200000-0x7f5ff5600000]
:3:rocvirtual.cpp :706 : 2002282534491 us: [pid:3259166 tid:0x7f6102a5fa80] Arg1: = ptr:0x7f5ff4c00000 obj:[0x7f5ff4c00000-0x7f5ff5000000]
:3:rocvirtual.cpp :706 : 2002282534492 us: [pid:3259166 tid:0x7f6102a5fa80] Arg2: = ptr:0x7f5ff4600000 obj:[0x7f5ff4600000-0x7f5ff4a00000]
:3:rocvirtual.cpp :781 : 2002282534493 us: [pid:3259166 tid:0x7f6102a5fa80] Arg3: = val:4398046512128
:3:rocvirtual.cpp :781 : 2002282534493 us: [pid:3259166 tid:0x7f6102a5fa80] Arg4: = val:1024
:3:rocvirtual.cpp :2897: 2002282534495 us: [pid:3259166 tid:0x7f6102a5fa80] ShaderName : _Z15vectoradd_floatPfPKfS1_ii
:4:rocdevice.cpp :2063: 2002282535540 us: [pid:3259166 tid:0x7f6102a5fa80] Allocate hsa host memory 0x7f5feca00000, size 0x78b438
:3:rocdevice.cpp :2960: 2002282535558 us: [pid:3259166 tid:0x7f6102a5fa80] Created hostcall buffer 0x7f5feca00000 for hardware queue 0x7f6100f08000
:4:rocsignal.cpp :38 : 2002282535649 us: [pid:3259166 tid:0x7f6102a5fa80] Initialize Hostcall signal=0x7f6100edd780
:3:devhostcall.cpp :404 : 2002282535651 us: [pid:3259166 tid:0x7f6102a5fa80] Launched hostcall listener at 0x244e8b0
:3:devhostcall.cpp :417 : 2002282535653 us: [pid:3259166 tid:0x7f6102a5fa80] Registered hostcall buffer 0x7f5feca00000 with listener 0x244e8b0
:4:rocvirtual.cpp :865 : 2002282535659 us: [pid:3259166 tid:0x7f6102a5fa80] HWq=0x7f5ff4400000, Dispatch Header = 0x1502 (type=2, barrier=1, acquire=2, release=2), setup=3, grid=[1024, 1024, 1], workgroup=[16, 16, 1], private_seg_size=64, group_seg_size=0, kernel_obj=0x7f6100ed0900, kernarg_address=0x7f5ff4200000, completion_signal=0x0
:3:hip_module.cpp :679 : 2002282535663 us: [pid:3259166 tid:0x7f6102a5fa80] hipLaunchKernel: Returned hipSuccess :
:3:hip_memory.cpp :641 : 2002282535667 us: [pid:3259166 tid:0x7f6102a5fa80] hipMemcpy ( 0x7f5ff5fff010, 0x7f5ff5200000, 4194304, hipMemcpyDeviceToHost )
:4:command.cpp :349 : 2002282535671 us: [pid:3259166 tid:0x7f6102a5fa80] Command (CopyDeviceToHost) enqueued: 0x1fcb760
:4:rocmemory.cpp :966 : 2002282536320 us: [pid:3259166 tid:0x7f6102a5fa80] Locking to pool 0x1f9b270, size 0x401000, HostPtr = 0x7f5ff5fff000, DevPtr = 0x7f5ff5fff000
:4:rocvirtual.cpp :1011: 2002282536323 us: [pid:3259166 tid:0x7f6102a5fa80] HWq=0x7f5ff4400000, BarrierAND Header = 0x1503 (type=3, barrier=1, acquire=2, release=2), dep_signal=[0x0, 0x0, 0x0, 0x0, 0x0], completion_signal=0x7f6102be2700
:3:rocvirtual.hpp :66 : 2002282536324 us: [pid:3259166 tid:0x7f6102a5fa80] Host active wait for Signal = (0x7f6102be2700) for 10000 ns
:4:rocblit.cpp :727 : 2002282536336 us: [pid:3259166 tid:0x7f6102a5fa80] HSA Async Copy dst=0x7f5ff5fff010, src=0x7f5ff5200000, size=4194304, wait_event=0x7f6102be2700, completion_signal=0x7f6102be2680
:4:rocvirtual.cpp :553 : 2002282536660 us: [pid:3259166 tid:0x7f6102a5fa80] Host wait on completion_signal=0x7f6102be2680
:3:rocvirtual.hpp :66 : 2002282536662 us: [pid:3259166 tid:0x7f6102a5fa80] Host active wait for Signal = (0x7f6102be2680) for -1 ns
Bus error