Make Sapporo2 work with more device types #5

rieder · 2020-02-04T16:34:45Z

Tracker issue - this currently doesn't always seem to work (at least on macOS - for which PR #4 is an initial fix), it would be nice if it did.
I will report on progress / problems here.

rieder · 2020-02-04T17:38:53Z

One bit of trouble on macOS is that OpenCL is deprecated there, in favour of Metal. OpenCL 1.2 still works, but newer versions are not supported. Not sure if there is a workaround for this.
For Linux this should not be a problem.

ymeiron · 2021-02-17T03:59:27Z

I'm on Linux and having trouble with the Sapporo/OpenCL, the output is:

sapporo2::open - no config file is found 
Integration order used: 1 (0=GRAPE5, 1=4th, 2=6th, 3=8th)
Integration precision used: 1 (0=FLOAT, 1 = DOUBLESINGLE, 2=DOUBLE)
Getting list of OpenCL devices ...
0: AMD Accelerated Parallel Processing
Using platform 0 
Found 1 suitable devices: 
0: gfx906      Vendor: Advanced Micro Devices, Inc.
Number of cpus available: 96
Number of gpus available: 1
integrationOrder : 1
Getting list of OpenCL devices ...
0: AMD Accelerated Parallel Processing
Using platform 0 
Found 1 suitable devices: 
0: gfx906      Vendor: Advanced Micro Devices, Inc.
Using device: 0
Device has: 60   multiprocessors 
Using  2 blocks per multi-processor for a total of : 120
Loading file:  OpenCL/kernels4th.cl 
Opening kernel file: OpenCL/kernels4th.cl
Found compiled in version of file: OpenCL/kernels4th.cl
Loading file:  OpenCL/kernels4th.cl 
Opening kernel file: OpenCL/kernels4th.cl
Found compiled in version of file: OpenCL/kernels4th.cl
Loading file:  OpenCL/kernels4th.cl 
Opening kernel file: OpenCL/kernels4th.cl
Found compiled in version of file: OpenCL/kernels4th.cl
Loading file:  OpenCL/kernels4th.cl 
Opening kernel file: OpenCL/kernels4th.cl
Found compiled in version of file: OpenCL/kernels4th.cl
Kernel files found .. building compute kernels! 
Creating kernel dev_copy_particles 
Maximum work group size: 256 Optimal work group multiple: 64 
Creating kernel dev_predictor 
Maximum work group size: 256 Optimal work group multiple: 64 
Creating kernel dev_evaluate_gravity_fourth_DS 
Maximum work group size: 256 Optimal work group multiple: 64 
Creating kernel dev_reset_buffers 
Maximum work group size: 256 Optimal work group multiple: 64 
oclSafeCall() Runtime API error in file <./include/ocldev.h>, line 885 : Invalid work group size
. Kernel name: dev_evaluate_gravity_fourth_DS

I checked at the offending line, and the kernel launch that fails has global_work_size of 30720 and local_work_size of 256. I don't know much about OpenCL and how to find the maximum work group size and tell Sapporo not to pass it. Any help is appreciated. The device is AMD Radeon MI50 and I believe it supports OpenCL 2.0.

jbedorf · 2021-02-17T21:45:32Z

I'm not familiar with that device, nor what the optimum settings are. But it looks like too many blocks are launched. What you can try is to change this line, to look as follows:
sapdevice->evalgravKernelTemplate.setWork_threadblock2D(p, q, 60, 1); //Default

Or change the NTHREAD values here.

It might require some trial and error to get that right and work with your device.

rieder · 2021-11-12T13:21:03Z

This issue seems increasingly relevant, with other GPUs than Nvidia-build ones becoming more prominent (e.g. Apple's M1 series processors).
Should we write a proposal to work on this? Would you have any interest in this, @spzwart, @stevemcmillan?

rieder · 2022-05-02T19:47:52Z

Renamed the issue - I think adding Vulkan support would be a great goal, since this is the most supported GPU language (also supported on macOS via MoltenVK which translates it to Metal).
I still don't know who could do this, but it would be a real nice thing to have!

rieder · 2023-04-17T13:42:17Z

Maybe Sycl is the way to go these days? https://sycl.tech

ymeiron · 2023-04-17T13:53:47Z

It is for sure if Sapporo is to take advantage of upcoming Intel HPC GPUs. There's also SYCLomatic that's supposed to be helpful converting CUDA to SYCL, but I bet it won't be too easy for codes like Sapporo that (if I remember correctly) use the CUDA driver (as opposed to runtime) API.

rieder · 2023-04-26T03:46:49Z

Probably not easy no. But I think it's essential if we want to use Sapporo in the future.

rieder · 2023-05-01T01:41:41Z

I was discussing migrating Sapporo to using SYCL with Kentaro Nomura (now at Intel, formerly at RIKEN), perhaps he can help us with this.

LourensVeen · 2023-10-24T10:03:34Z

AMD now has HIP, which is essentially a clone of the CUDA API backed by either CUDA (if you have nVidia hardware) or ROCm (if you have AMD). Easy to port supposedly, but support for other platforms is an open question.

Kokkos also looks interesting. It takes a pure C++ approach, and has a variety of backends, although I can't find one for Metal. It does apparently give you less low-level control than SYCL, but with the resources we have, that's probably fine. It also involves writing modern C++, which is a good idea but may require some learning.

Still doesn't look like there's a clear winner...

rieder changed the title ~~Make Sapporo2 work with OpenCL on macOS~~ Make Sapporo2 work with OpenCL Feb 4, 2020

rieder changed the title ~~Make Sapporo2 work with OpenCL~~ Make Sapporo2 work with Vulkan May 2, 2022

rieder mentioned this issue May 2, 2022

deprecate sapporo_light? amusecode/amuse#845

Open

rieder changed the title ~~Make Sapporo2 work with Vulkan~~ Make Sapporo2 work with more GPU devices Apr 17, 2023

rieder changed the title ~~Make Sapporo2 work with more GPU devices~~ Make Sapporo2 work with more device types Apr 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make Sapporo2 work with more device types #5

Make Sapporo2 work with more device types #5

rieder commented Feb 4, 2020 •

edited

Loading

rieder commented Feb 4, 2020

ymeiron commented Feb 17, 2021

jbedorf commented Feb 17, 2021

rieder commented Nov 12, 2021

rieder commented May 2, 2022

rieder commented Apr 17, 2023

ymeiron commented Apr 17, 2023

rieder commented Apr 26, 2023

rieder commented May 1, 2023

LourensVeen commented Oct 24, 2023

Make Sapporo2 work with more device types #5

Make Sapporo2 work with more device types #5

Comments

rieder commented Feb 4, 2020 • edited Loading

rieder commented Feb 4, 2020

ymeiron commented Feb 17, 2021

jbedorf commented Feb 17, 2021

rieder commented Nov 12, 2021

rieder commented May 2, 2022

rieder commented Apr 17, 2023

ymeiron commented Apr 17, 2023

rieder commented Apr 26, 2023

rieder commented May 1, 2023

LourensVeen commented Oct 24, 2023

rieder commented Feb 4, 2020 •

edited

Loading