Alignment requirements of cuda::std::complex #151

gonzalobg · 2021-04-06T14:39:06Z

The following two static_asserts compile without issues:

#include <cuda/std/complex>
static_assert(alignof(cuda::std::complex<double>) == 8);
static_assert(alignof(cuda::std::complex<float>) == 4);

I'd expected them to be 16 and 8 to match the double2 and float2 types.

The text was updated successfully, but these errors were encountered:

dkolsen-pgi · 2021-04-06T14:54:29Z

I'd expect them to be 8 and 4 to match the std::complex<double> and std::complex<float> types.

There are competing requirements here. It may very well be that the higher alignment is the better choice, but it is not unambiguously the right choice.

gonzalobg · 2021-04-06T15:24:32Z

What are the requirements for cuda::std::complex?

jrhemstad · 2021-04-06T16:08:59Z

What are the requirements for cuda::std::complex?

The requirements for anything in cuda::std are exactly equivalent (when possible) to its counterpart in std::.

Any extensions or deviations from std:: go in cuda:: as opposed to cuda::std::.

cliffburdick · 2021-04-11T17:12:11Z

To add from a separate discussion, this becomes an issue when using cuda::std::complex to access global memory within a warp. The unaligned accesses cause a 50% performance drop due to the uncoalesced access.

brycelelbach · 2021-04-12T19:54:17Z

I think we should break from what the standard requires here. The only case where it hits us is if someone is casting unaligned T[2] to [cuda::]std::complex<T[2]>, or casting std::complex<T> to cuda::std::complex<T> (this case can be handled by implicit conversion instead.

There's a big perf hit for doing the wrong thing here.

My inclination is that we just make cuda::std::complex aligned to sizeof(T)*2 and add implicit conversions from std::complex when being used with NVCC.

This will require us to introduce a new ABI version, V4.

We can add a way to opt into the standard-conforming behavior.

gonzalobg · 2021-04-12T19:57:05Z

@brycelelbach

I think we should break from what the standard requires here. The only case where it hits us is if someone is casting unaligned T[2] to [cuda::]std::complex<T[2]>, or casting std::complex to cuda::std::complex

Can you point to the paragraph of the standard that guarantees that this works?

griwes · 2021-04-12T20:15:09Z

The standard only requires reinterpreting complex as an array, not the other way around.

leofang · 2021-05-07T18:07:01Z

My inclination is that we just make cuda::std::complex aligned to sizeof(T)*2 and add implicit conversions from std::complex when being used with NVCC.

This is what Thrust currently does too, and it helps improve the performance.

wmaxey · 2021-09-10T06:44:12Z

This has been fixed in 1.6.0 with #172

brycelelbach added this to the 2.0.0 milestone Apr 12, 2021

wmaxey self-assigned this Apr 13, 2021

wmaxey added the bug: performance Does not perform as intended. label Apr 13, 2021

maddyscientist added the helps: quda Helps or needed by QUDA. label Apr 20, 2021

wmaxey closed this as completed Sep 10, 2021

WeiqunZhang mentioned this issue Dec 18, 2023

amrex::GpuComplex not ideal for GPU AMReX-Codes/amrex#3677

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Alignment requirements of cuda::std::complex #151

Alignment requirements of cuda::std::complex #151

gonzalobg commented Apr 6, 2021

dkolsen-pgi commented Apr 6, 2021

Uh oh!

gonzalobg commented Apr 6, 2021

Uh oh!

jrhemstad commented Apr 6, 2021 •

edited

Loading

Uh oh!

cliffburdick commented Apr 11, 2021 •

edited

Loading

Uh oh!

brycelelbach commented Apr 12, 2021 •

edited

Loading

Uh oh!

gonzalobg commented Apr 12, 2021

Uh oh!

griwes commented Apr 12, 2021

Uh oh!

leofang commented May 7, 2021

Uh oh!

wmaxey commented Sep 10, 2021

Uh oh!

Alignment requirements of cuda::std::complex #151

Alignment requirements of cuda::std::complex #151

Comments

gonzalobg commented Apr 6, 2021

dkolsen-pgi commented Apr 6, 2021

Uh oh!

gonzalobg commented Apr 6, 2021

Uh oh!

jrhemstad commented Apr 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cliffburdick commented Apr 11, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brycelelbach commented Apr 12, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gonzalobg commented Apr 12, 2021

Uh oh!

griwes commented Apr 12, 2021

Uh oh!

leofang commented May 7, 2021

Uh oh!

wmaxey commented Sep 10, 2021

Uh oh!

jrhemstad commented Apr 6, 2021 •

edited

Loading

cliffburdick commented Apr 11, 2021 •

edited

Loading

brycelelbach commented Apr 12, 2021 •

edited

Loading