WIP: try CUDA on PPC again #859

h-vetinari · 2022-09-13T19:47:27Z

Based on #848, will be rebased once that's in.

conda-forge-linter · 2022-09-13T19:47:34Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe) and found it was in an excellent condition.

h-vetinari · 2022-09-13T20:00:45Z

@conda-forge-admin, please rerender

This reverts commit e71cd67.

This reverts commit c95284e.

…nda-forge-pinning 2022.09.13.19.09.01

…nda-forge-pinning 2022.09.13.20.27.40

h-vetinari · 2022-09-15T20:36:47Z

As noted in #659:

I'd like to still spend some time on trying to figure out cross-compilation, because otherwise this feedstock becomes basically infeasible to build.

This is because, if we run aarch/ppc in emulation, we have 16 builds of which half time out on any given run. This means it'll take 5-6 restarts¹ on average to get any one CI run passing. This is pretty much infeasible IMO, and it also blocks us from building arrow-cpp without python (which would collapse the CI jobs into one per arch that then also builds all the pyarrow's).

i.e. ~30h for the best case scenario, with 5x optimally timed manual intervention ↩

h-vetinari · 2022-09-15T20:42:27Z

Though I admit it's very possible that cross-compilation will elude us for a while, I'd still like to find out which pieces are missing. Copying a comment from #793:

@jakirkham: Unfortunately cross-compiling and CUDA builds don't work together today.

@kkraus14: Doesn't that only apply to device code requiring nvcc? The arrow package just uses the CUDA driver API without any actual device code. We should be able to cross compile against a libcuda stub?

@jakirkham: Which means we need to use the CUDA Docker images. This is part of the issue.

@h-vetinari: I think I figured out the images part (or at least the rendering part) in #859. Meaning we have x86 build compilers, but I don't yet know how we get the libcuda stub for host (aarch/ppc) into there. We might be able to download it in the build scripts...?

@kkraus14: You're not allowed to redistribute a libcuda stub in a container unless the container was based on the nvidia/cuda container.

But downloading it in the host env here is not the same as redistributing it. The way I imagined it (naïvely perhaps), is that we can build against the stub here, but rely on libcuda being available on the user's machine.

kkraus14 · 2022-09-15T21:06:49Z

But downloading it in the host env here is not the same as redistributing it. The way I imagined it (naïvely perhaps), is that we can build against the stub here, but rely on libcuda being available on the user's machine.

We have the __cuda virtual package that guarantees a CUDA driver being available and working on the user's machine at runtime.

How are you downloading a libcuda stub in the host env? There's no conda package for it in conda-forge because it can not be redistributed per the EULA.

h-vetinari · 2022-09-16T00:03:45Z

How are you downloading a libcuda stub in the host env? There's no conda package for it in conda-forge because it can not be redistributed per the EULA.

I'm not yet doing that. It was how I thought the process might work given the constraints and our infrastructure.

AFAIU as long as we don't distribute libcuda, we can still use it in the build process. if there's a EULA-compatible way to do that (anything from curl during the build scripts, to dedicated cross compilation images by Nvidia), then I'd like to try.

kkraus14 · 2022-09-16T03:23:42Z

Sorry, I may have given a bit of misinformation here. We have ppc64el cuda images: https://github.com/conda-forge/docker-images/tree/main/linux-anvil-ppc64le-cuda.

These are based off of the nvidia/cuda images and have the libcuda stub library that ships as part of the toolkit. This allows them to be used for emulated builds, but we can't extract things to use for cross compilation.

I believe there was a separate issue of cuda builds timing out in emulation which is why they weren't enabled.

h-vetinari · 2022-09-16T07:56:09Z

I believe there was a separate issue of cuda builds timing out in emulation which is why they weren't enabled.

That was on travis apparently? In any case, the CI for 207b451 is green.

These are based off of the nvidia/cuda images and have the libcuda stub library that ships as part of the toolkit. This allows them to be used for emulated builds, but we can't extract things to use for cross compilation.

That's a pity. How did you imagine doing cross-compilation then? We need both a build_platform compiler and a target_platform library stub.

FWIW, I believe you that we can't extract things, but it's not apparent to me how cross-compiling artefacts would somehow violate the EULA when regular builds do not (with the only visible exception being that there's no ready-made image for cross-compilation; but then quay.io/condaforge/linux-anvil-cuda:11.2 is also not a vanilla image).

h-vetinari force-pushed the ppc_cuda branch 2 times, most recently from 9afbf6f to baef5be Compare September 13, 2022 20:00

conda-forge deleted a comment from conda-forge-linter Sep 13, 2022

h-vetinari and others added 9 commits September 14, 2022 00:11

Revert "ensure we can keep cross-compiling PPC"

3bbcb50

This reverts commit e71cd67.

Revert "back to cross-compiling ppc"

2944a27

This reverts commit c95284e.

unskip cuda on ppc

8cf27c9

raise parallelism for sane CI runtimes

ef65c5a

debug: skip everything but PPC+CUDA

248b255

MNT: Re-rendered with conda-build 3.22.0, conda-smithy 3.21.1, and co…

207b451

…nda-forge-pinning 2022.09.13.19.09.01

try cross-compiling CUDA on aarch/ppc

61be739

also switch on aarch CUDA builds

4155e33

MNT: Re-rendered with conda-build 3.22.0, conda-smithy 3.21.1, and co…

e959f3e

…nda-forge-pinning 2022.09.13.20.27.40

h-vetinari force-pushed the ppc_cuda branch from ea9dbb3 to 207b451 Compare September 14, 2022 11:55

This was referenced Sep 15, 2022

Package Flight SQL #793

Merged

Added cuda compiler for ppc64le arch #659

Closed

Add CUDA arch migrator (aarch64 only) #723

Merged

h-vetinari mentioned this pull request Dec 4, 2022

Reinstate aarch64+CUDA; add ppc64le+CUDA #899

Merged

h-vetinari mentioned this pull request Apr 18, 2023

[9.0.x] Cross-compile aarch64+CUDA; add ppc64le+CUDA #1018

Merged

h-vetinari closed this in #899 Apr 22, 2023

h-vetinari deleted the ppc_cuda branch April 22, 2023 08:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: try CUDA on PPC again #859

WIP: try CUDA on PPC again #859

h-vetinari commented Sep 13, 2022

conda-forge-linter commented Sep 13, 2022

h-vetinari commented Sep 13, 2022

h-vetinari commented Sep 15, 2022 •

edited

Loading

h-vetinari commented Sep 15, 2022

kkraus14 commented Sep 15, 2022 •

edited

Loading

h-vetinari commented Sep 16, 2022

kkraus14 commented Sep 16, 2022

h-vetinari commented Sep 16, 2022 •

edited

Loading

WIP: try CUDA on PPC again #859

WIP: try CUDA on PPC again #859

Conversation

h-vetinari commented Sep 13, 2022

conda-forge-linter commented Sep 13, 2022

h-vetinari commented Sep 13, 2022

h-vetinari commented Sep 15, 2022 • edited Loading

Footnotes

h-vetinari commented Sep 15, 2022

kkraus14 commented Sep 15, 2022 • edited Loading

h-vetinari commented Sep 16, 2022

kkraus14 commented Sep 16, 2022

h-vetinari commented Sep 16, 2022 • edited Loading

h-vetinari commented Sep 15, 2022 •

edited

Loading

kkraus14 commented Sep 15, 2022 •

edited

Loading

h-vetinari commented Sep 16, 2022 •

edited

Loading